UGE-RL_Ramp

D-DOC: DDPG-Based Reinforcement Learning for Highway Ramp-Metering Control under Delayed Observations and Continuous Action Spaces

Cong-Son DUONG, Mohammed Adel Djelloul ABBOU, Francky RASATAHARISOA, Nikethan NIMALAKUMARAN, Lucas UCKERMANN

Supervisor: Nadir FARHI, HDR, Researcher

Report: Final Report

Source Code: https://github.com/CongSon01/UGE-RL_Ramp

Demo: animated GIFs in the repository compare the Fixed-Control baseline and D-DOC.

Introduction

Ramp metering is an effective strategy for reducing traffic congestion by controlling the rate at which vehicles enter the highway. However, practical ramp-metering systems often face challenges such as delayed sensor data and the need for continuous control signals. To address these challenges, we propose D-DOC, a DDPG-based reinforcement learning framework designed for ramp-metering control under delayed state information and with a continuous action space. Leveraging the Deep Deterministic Policy Gradient algorithm, our approach adaptively learns metering rates that minimize congestion on the ramp and maximize traffic flow on the highway.
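
To make the control setup concrete, here is a minimal sketch of how a DDPG actor could map a delayed traffic observation to a continuous metering rate. This is an illustration under our own assumptions, not the project's implementation: PyTorch, the layer sizes, and the names DelayBuffer and Actor are not taken from the repository.

    # Illustrative sketch (not the repository's implementation).
    from collections import deque

    import torch
    import torch.nn as nn

    class DelayBuffer:
        """Holds past observations so the agent only ever sees stale data."""
        def __init__(self, delay: int):
            self.buffer = deque(maxlen=delay + 1)

        def push(self, obs):
            self.buffer.append(obs)
            # Oldest stored observation: `delay` steps old once the buffer is full.
            return self.buffer[0]

    class Actor(nn.Module):
        """Maps a traffic-state vector to a metering rate in [0, 1]."""
        def __init__(self, obs_dim: int, hidden: int = 64):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(obs_dim, hidden), nn.ReLU(),
                nn.Linear(hidden, hidden), nn.ReLU(),
                nn.Linear(hidden, 1), nn.Tanh(),  # output in [-1, 1]
            )

        def forward(self, obs: torch.Tensor) -> torch.Tensor:
            # Rescale the tanh output to a metering rate in [0, 1].
            return 0.5 * (self.net(obs) + 1.0)

In DDPG, such an actor is trained jointly with a critic network; the delayed observation returned by DelayBuffer.push is what the actor acts on.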

We implemented the proposed method in the SUMO simulation environment and evaluated its performance across two contrasting scenarios: (1) a constant scenario with static flows, and (2) a dynamic scenario with peak-hour demand from 8:15 to 8:45. We compare against a Fixed-Control baseline (pre-timed signals switched at fixed intervals) as well as RL (Reinforcement Learning) methods with discrete decision-making strategies (e.g., E-Greedy, Double-DQN). Our results show that D-DOC effectively manages ramp inflows despite observation delays, maintaining higher flow rates, reducing congestion, and improving overall travel conditions. This study thus demonstrates the practicality and benefits of a DDPG-driven policy for real-time ramp metering under delayed observations and continuous-control requirements.

Simulation in SUMO

This project uses the SUMO (Simulation of Urban MObility) environment to simulate traffic scenarios and evaluate the performance of the D-DOC framework. The simulation includes two main scenarios:

  1. Constant Flow Scenario: Traffic flows are static, representing typical off-peak conditions.
  2. Peak Hours Scenario: A dynamic scenario in which traffic flows fluctuate, with a demand peak from 8:15 to 8:45 simulating peak-hour congestion (a minimal demand-profile sketch follows this list).
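
For illustration, the two demand patterns could be expressed as a simple time-dependent profile. The flow values and the function name demand_profile below are hypothetical; only the 8:15-8:45 peak window comes from the scenario description.

    # Hypothetical demand profile for the two scenarios (flow values illustrative).
    def demand_profile(t_seconds: float, scenario: str) -> float:
        """Return on-ramp demand in vehicles/hour at simulation time t."""
        if scenario == "constant":
            return 600.0  # static flow for the whole episode
        # Dynamic scenario: elevated demand during the 8:15-8:45 peak window.
        peak_start = 8 * 3600 + 15 * 60  # 8:15, in seconds from midnight
        peak_end = 8 * 3600 + 45 * 60    # 8:45
        return 1200.0 if peak_start <= t_seconds < peak_end else 600.0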

The reinforcement learning models (E-greedy, DQN, and DDPG) are trained and tested in these scenarios to evaluate the proposed D-DOC approach against its baselines under different traffic conditions.
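
The sketch below shows how such an agent can interact with SUMO through the TraCI Python API. The detector and traffic-light IDs, the configuration file name, and the reward weights are placeholders, and the reward only illustrates the "minimize ramp congestion, maximize highway flow" objective stated in the introduction.

    # Illustrative SUMO/TraCI control loop (all IDs and weights are placeholders).
    import traci

    def run_episode(policy, steps: int = 3600) -> float:
        traci.start(["sumo", "-c", "ramp.sumocfg"])  # hypothetical config file
        total_reward = 0.0
        try:
            for _ in range(steps):
                # Observe: queue length on the ramp, mean speed on the highway.
                ramp_queue = traci.lanearea.getLastStepVehicleNumber("ramp_detector")
                highway_speed = traci.edge.getLastStepMeanSpeed("highway_edge")

                # Act: threshold the continuous metering rate into a signal state
                # (assumes a single-link ramp signal).
                rate = policy([ramp_queue, highway_speed])  # rate in [0, 1]
                state = "G" if rate > 0.5 else "r"
                traci.trafficlight.setRedYellowGreenState("ramp_tls", state)

                traci.simulationStep()

                # Reward: penalize ramp queues, reward highway speed (illustrative).
                total_reward += highway_speed - 0.1 * ramp_queue
        finally:
            traci.close()
        return total_reward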

Code Structure

The project is structured into the following files:

Training Files

Testing Files

Note:
In all testing scripts, you can set the use_model variable to False to run the Fixed-Control baseline, in which the traffic lights switch at fixed 30-second intervals.
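
For reference, a minimal version of that fixed 30-second switching logic could look like the following. The traffic-light ID, phase indices, and configuration file name are assumptions, as is the 1-second simulation step.

    # Minimal fixed-control sketch: switch the ramp signal every 30 seconds.
    import traci

    SWITCH_PERIOD = 30  # seconds, per the note above

    def fixed_control(steps: int = 3600):
        traci.start(["sumo", "-c", "ramp.sumocfg"])  # hypothetical config file
        try:
            for t in range(steps):  # assumes a 1 s simulation step
                if t % SWITCH_PERIOD == 0:
                    # Alternate between two phases; the indices depend on the
                    # network's traffic-light program.
                    traci.trafficlight.setPhase("ramp_tls", (t // SWITCH_PERIOD) % 2)
                traci.simulationStep()
        finally:
            traci.close()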

Setup and Installation

To run the simulation and train the models, follow these steps:

Requirements: SUMO, Conda with Python 3.10, and the Python packages listed in requirements.txt (installed in the steps below).

Installation:

  1. Install SUMO: Follow the SUMO installation guide for your platform.
  2. Clone the repository:

     git clone https://github.com/CongSon01/UGE-RL_Ramp.git
     cd UGE-RL_Ramp

  3. Create a Conda Environment:

     conda create --name RL_ramp python=3.10 -y
     conda activate RL_ramp

  4. Install Dependencies:

     pip install -r requirements.txt

  5. Run the Simulation:

     python test_delayed_observation.py