东北大学学报(自然科学版) ›› 2012, Vol. 33 ›› Issue (10): 1513-1516.DOI: -

• 论著 • 上一篇    下一篇

基于强化学习算法的公交信号优先策略

舒波;李大铭;赵新良;   

  1. 东北大学工商管理学院;
  • 收稿日期:2013-06-19 修回日期:2013-06-19 出版日期:2012-10-15 发布日期:2013-04-04
  • 通讯作者: -
  • 作者简介:-
  • 基金资助:
    辽宁省教育厅人文社会科学基金资助项目(2009JD31)

Transit signal priority strategy based on reinforcement learning algorithm

Shu, Bo (1); Li, Da-Ming (1); Zhao, Xin-Liang (1)   

  1. (1) School of Business and Administration, Northeastern University, Shenyang 110819, China
  • Received:2013-06-19 Revised:2013-06-19 Online:2012-10-15 Published:2013-04-04
  • Contact: Li, D.-M.
  • About author:-
  • Supported by:
    -

摘要: 综合分析了影响城市公共交通系统运行的多种因素,提出了一种新型的基于强化学习算法的城市公交信号优先控制策略.该策略利用强化学习算法的试错-改进机制,根据不同交通环境下信号控制策略实施后反馈的结果,迭代优化路口的公交信号优先控制策略,从而使其具备了自学习的能力.基于Paramics的仿真实验表明,该算法能够在保障路口正常交通秩序的同时,显著提高公交车运行效率.

关键词: 公交系统, 交通信号控制, 公交信号优先, 强化学习, 回报函数

Abstract: Factors affecting public transit system were synthetically analyzed. An innovative transit signal priority (TSP) strategy based on reinforcement learning algorithm was proposed. The trial and error mechanism of reinforcement learning were utilized, so the signal plans could be optimized iteratively by implementing them and estimating the rewards. The proposed idea made the TSP strategy have a capability of self-learning. Based on the software of Paramics, simulations were carried out. And the results demonstrated that the proposed TSP strategy could not only improve the efficiency of transit operation, but also reduce the impacts on general traffic at signalized intersections.

中图分类号: