Obstacle avoidance parking based on improved deep reinforcement learning SAC algorithm

Ting Liu, Ping Liu, Mingjie Liu, Changhao Piao, Kyunghi Chang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recent research uses neural-network-based approaches to generate time-optimized parking trajectories in linear time. However, the generalization of these neural networks in different parking lot scenarios is not fully considered and relies on high-quality data. To address these issues, this paper proposes an improved soft actor critic (SAC)based parking trajectory planning method to achieve fast convergence and high success rate. A trajectory data-based reward function and an experience replay sampling strategy are designed simultaneously improve the learning efficiency. Meanwhile, the Flatten-T Swish Plus (FTSPlus) activation function is firstly introduced into the SAC's neural network structure to enhance the convergence capabilities. The simulation results show that compared with traditional SAC methods, the success rate has been greatly improved and convergence rate has also been improved by more than 60%. Furthermore, it has the good model generalization ability while maintaining collision-free trajectories.

Original languageEnglish
Title of host publicationProceedings - 2024 China Automation Congress, CAC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages7277-7282
Number of pages6
ISBN (Electronic)9798350368604
DOIs
StatePublished - 2024
Event2024 China Automation Congress, CAC 2024 - Qingdao, China
Duration: 1 Nov 20243 Nov 2024

Publication series

NameProceedings - 2024 China Automation Congress, CAC 2024

Conference

Conference2024 China Automation Congress, CAC 2024
Country/TerritoryChina
CityQingdao
Period1/11/243/11/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • Automatic parking
  • Experience playback
  • Reward function
  • SAC

Fingerprint

Dive into the research topics of 'Obstacle avoidance parking based on improved deep reinforcement learning SAC algorithm'. Together they form a unique fingerprint.

Cite this