Abstract: This paper presents a methodology for evaluating city logistics measures that accounts for the behaviour of the several stakeholders involved in urban freight transport, using a multi-agent model. The model consists of a learning model and a model for the vehicle routing and scheduling problem with time window-forecasted (VRP-TW-F). The learning model is built with Q-learning, a reinforcement learning technique. We implemented the model on a test road network representing an urban area. The results indicate that combining a truck ban applied directly to environmentally damaged areas with discounted motorway tolls across the entire urban motorway network has a large environmental effect and leads to an environment acceptable to all stakeholders.
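For readers unfamiliar with Q-learning, the following is a minimal sketch of the standard tabular update rule, not the formulation used in this paper. The state names, action names, reward value, learning rate `alpha`, and discount factor `gamma` are all hypothetical placeholders; the abstract does not specify how the agents' states, actions, or rewards are defined.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """Apply one standard tabular Q-learning update and return the new Q-value.

    Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
    """
    # Value of the best action available in the next state.
    best_next = max(q[(next_state, a)] for a in actions)
    # Temporal-difference target: immediate reward plus discounted future value.
    td_target = reward + gamma * best_next
    # Move the current estimate a fraction alpha toward the target.
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Hypothetical example: a carrier agent choosing between two route types.
q = defaultdict(float)  # unseen (state, action) pairs start at 0.0
actions = ["use_motorway", "use_ordinary_road"]  # assumed action set
first = q_update(q, "depot", "use_motorway", 1.0, "customer", actions)
second = q_update(q, "depot", "use_motorway", 1.0, "customer", actions)
```

Repeating the same rewarded transition moves the estimate steadily toward the reward, which is how an agent in such a scheme would come to prefer actions that have paid off in earlier episodes.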