Cong-Son DUONG, Mohammed Adel Djelloul ABBOU, Francky RASATAHARISOA, Nikethan NIMALAKUMARAN, Lucas UCKERMANN
Supervisor: Nadir FARHI, HDR, Researcher
Report: Final Report
Source Code: Source Code
Demo:
Fixed-Control
D-DOC
Ramp metering is an effective strategy to reduce traffic congestion by controlling the rate of vehicles entering the highway. However, practical ramp-metering systems often face challenges such as delayed sensor data and the need for continuous control signals. To address these challenges, we propose D-DOC, a DDPG-based Reinforcement Learning framework specifically designed for ramp metering control in both delayed state information and continuous action spaces. Leveraging Deep Deterministic Policy Gradient, our approach adaptively learns optimal metering rates that minimize congestion on the ramp and maximize traffic flow on the highway.
We have implemented the proposed method in the SUMO simulation environment and evaluated performance across two contrasting scenarios: (1) a constant scenario with static flows, and (2) a dynamic scenario exhibiting peak hours from 8:15 to 8:45. Comparisons are made against a Fixed-Control approach (fixed or pre-timed signals) as well as RL (Reinforcement Learning) methods with decision-making strategies (e.g., E-Greedy, Double-DQN). Our results show that D-DOC effectively manages ramp inflows despite observation delays, maintaining higher flow rates, reducing congestion, and improving overall travel conditions. This study thus demonstrates the practicality and benefits of a DDPG-driven policy for real-time ramp metering under uncertain and continuous-control requirements.
This project uses the SUMO (Simulation of Urban MObility) environment to simulate traffic scenarios and evaluate the performance of the D-DOC framework. The simulation includes two main scenarios:
The reinforcement learning models (E-greedy, DQN, and DDPG) are trained and tested in these scenarios to evaluate the impact of the proposed D-DOC approach under different traffic conditions.
The project is structured into the following files:
delay_probability
variable to increase the number of vehicles experiencing delayed information.Note:
In all testing scripts, you can set theuse_model
variable toFalse
to experiment with the Fixed-Control scenario. In this case, traffic lights are switched at fixed intervals of 30 seconds.
To run the simulation and train the models, follow these steps:
git clone https://github.com/CongSon01/UGE-RL_Ramp.git
cd UGE-RL_Ramp
conda create --name RL_ramp python=3.10 -y
conda activate RL_ramp
Install Dependencies:
pip install -r requirements.txt
Run the Simulation:
python test_delayed_observation.py