Recurrent Proximal Policy Optimization Based Tractor-Trailer Wheeled Robot Automatic Parking Algorithm

Authors

  • Hao Yan Department of Electrical and Computer Engineering, Oakland University, USA https://orcid.org/0009-0007-6773-5246
  • Mohamed A Zohdy Department of Electrical and Computer Engineering, Oakland University, USA
  • Abdelrahman Shaout Department of Electrical and Computer Engineering, Oakland University, USA
  • Amr Mahmoud Department of Electrical and Computer Engineering, Oakland University, USA

DOI:

https://doi.org/10.14738/tecs.114.15355

Keywords:

Tractor–trailer Wheeled Robot (TTWR), Proximal Policy Optimization, Recurrent Neural Network, Trajectory Planning, Obstacle Avoidance

Abstract

Truck-trailer reverse parking poses significant challenges due to the system's inherent instability, complex road geometry, and collision avoidance requirements. Traditional approaches to trailer control rely on manually designed control policies, which may have limited applicability and scalability. This paper presents a novel approach for autonomous reverse parking of a tractor-trailer wheeled robot (TTWR) system in tight and complex environments. The controller is trained with the Proximal Policy Optimization (PPO) algorithm, and a long short-term memory (LSTM) network improves its handling of sequential observation data. To improve safety and reliability, ultrasonic sensors are installed on the trailer to detect nearby obstacles. Furthermore, a novel reward function is introduced that encourages the TTWR to maintain a safe distance from surrounding obstacles while minimizing the length of the parking trajectory and the steering effort. The proposed LSTM-PPO algorithm is compared with the original PPO and other popular reinforcement learning algorithms. The results demonstrate the improved convergence stability and speed of the proposed approach, as well as its successful execution of end-to-end reverse parking maneuvers in unknown environments.
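
As a rough illustration of the kind of reward the abstract describes, the following Python sketch combines an obstacle-proximity penalty, a path-length penalty, and a steering penalty into one per-step value. It is not the paper's reward function: the names `RewardWeights` and `step_reward`, the weight values, the 0.5 m safe distance, and the linear form of each term are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class RewardWeights:
    """Illustrative values only; none of these come from the paper."""
    safe_distance: float = 0.5  # metres; closer readings are penalized
    w_obstacle: float = 1.0     # weight on the proximity penalty
    w_path: float = 0.1         # weight on distance travelled this step
    w_steer: float = 0.05       # weight on the change in steering angle


def step_reward(
    sensor_ranges: Sequence[float],
    step_length: float,
    steering_delta: float,
    weights: RewardWeights = RewardWeights(),
) -> float:
    """Hypothetical per-step reward (always <= 0).

    sensor_ranges  -- ultrasonic range readings in metres
    step_length    -- distance travelled during this step in metres
    steering_delta -- change in steering angle during this step in radians
    """
    nearest = min(sensor_ranges)
    # Penalize only the shortfall below the assumed safe distance.
    shortfall = max(0.0, weights.safe_distance - nearest)
    return -(
        weights.w_obstacle * shortfall
        + weights.w_path * abs(step_length)
        + weights.w_steer * abs(steering_delta)
    )
```

Under these assumptions, a step that brings the trailer closer to an obstacle, covers more distance, or changes the steering angle more receives a lower reward, so a policy maximizing the cumulative reward is pushed toward short, smooth, collision-free trajectories.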

Published

2023-08-26

How to Cite

Yan, H., Zohdy, M. A., Shaout, A., & Mahmoud, A. (2023). Recurrent Proximal Policy Optimization Based Tractor-Trailer Wheeled Robot Automatic Parking Algorithm. Transactions on Engineering and Computing Sciences, 11(4), 124–142. https://doi.org/10.14738/tecs.114.15355