Abstract
Unsignalized intersections are among the most complex and challenging scenarios in autonomous driving. As deep reinforcement learning (DRL) continues to advance, much research has been devoted to its application at unsignalized intersections. DRL algorithms require a carefully shaped reward function to achieve optimal performance, which demands extensive experimentation. In reality, humans not only receive reward-like feedback but also learn early strategies from demonstrations. Inspired by this learning paradigm, this paper proposes a method termed leveraging diminishing demonstrations in twin-delayed deep deterministic policy gradient (LDD-TD3) to mitigate DRL's reliance on reward shaping. In LDD-TD3, a risk assessment module evaluates the current environmental risk, and a prediction-based driving strategy guides the agent's actions away from high-risk scenarios during training, so that the agent accumulates successful experiences and quickly acquires a rudimentary strategy. Demonstrations are gradually phased out as training progresses, allowing the agent's exploration to identify superior strategies. Simulation results showed that LDD-TD3 effectively overcame DRL's dependence on reward shaping and increased the average success rate by 4.9% compared with TD3 under a shaped reward setting. The proposed LDD-TD3 is expected to improve the decision-making abilities of autonomous vehicles at unsignalized intersections.
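The diminishing-demonstration mechanism described above can be sketched as follows. The abstract does not specify the decay schedule or interfaces, so the linear schedule, the function names, and the initial probability `p0` below are all illustrative assumptions, not the paper's implementation.

```python
import random


def demo_probability(step, total_steps, p0=1.0):
    """Probability of injecting a demonstration action at a given
    training step. A linear decay from p0 to 0 is assumed here;
    the paper may use a different schedule."""
    return max(0.0, p0 * (1.0 - step / total_steps))


def select_action(step, total_steps, policy_action, demo_action):
    """With diminishing probability, override the TD3 policy's action
    with the demonstration (risk-avoiding) action; otherwise let the
    agent explore with its own action."""
    if random.random() < demo_probability(step, total_steps):
        return demo_action
    return policy_action
```

Early in training the demonstration strategy dominates, letting the agent accumulate successful experiences; as the probability decays to zero, exploration takes over and the agent can surpass the demonstrator.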
