Skip to content

Commit 74602a8

Browse files
authored
Merge pull request #22 from mh0797/main
version2.1.1
2 parents d34764b + c00db27 commit 74602a8

File tree

8 files changed

+442
-508
lines changed

8 files changed

+442
-508
lines changed

README.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -52,13 +52,14 @@
5252
<p align="right">(<a href="#top">back to top</a>)</p>
5353

5454
## Changelog <a name="changelog"></a>
55+
- **`[2025/04/13]`** NAVSIM v2.1.1 release (official devkit version for 2025 warm-up phase)
56+
- Updated dataset for the [Hugging Face Warmup leaderboard](https://huggingface.co/spaces/AGC2025/e2e-driving-warmup) with minor fixes
5557

56-
- **`[2025/04/09]`** NAVSIM v2.1 release (official devkit version for 2025 warm-up phase)
57-
- Added new dataset for the [Hugging Face Warmup leaderboard](https://huggingface.co/spaces/AGC2025/e2e-driving-warmup) (see [submission](docs/submission.md))
58+
- ⚠️ **IMPORTANT**: To submit to the updated leaderboard, you need to re-download the synthetic dataset v2.1.1 (see [download](docs/install.md))
5859

59-
- ⚠️ **IMPORTANT**: To submit to the updated leaderboard, you need to re-download the synthetic dataset (see [download](docs/install.md))
60+
- **`[2025/04/08]`** NAVSIM v2.1 release
61+
- Added new dataset for the [Hugging Face Warmup leaderboard](https://huggingface.co/spaces/AGC2025/e2e-driving-warmup) (see [submission](docs/submission.md))
6062
- Introduced support for two-stage reactive traffic agents (see [traffic simulation](docs/metrics.md))
61-
- Code refactoring and bug fixes
6263
- **`[2025/02/28]`** NAVSIM v2.0 release
6364
- Extends the PDM Score with more metrics and penalties (see [metrics](docs/metrics.md))
6465
- Adds a new two-stage pseudo closed-loop simulation (see [metrics](docs/metrics.md))
@@ -67,7 +68,6 @@
6768
- Leaderboard for `navtest` on [Hugging Face](https://huggingface.co/spaces/AGC2024-P/e2e-driving-navsim)
6869
- Release of baseline checkpoints on [Hugging Face](https://huggingface.co/autonomousvision/navsim_baselines)
6970
- Updated docs for [submission](docs/submission.md) and [paper](https://arxiv.org/abs/2406.15349)
70-
- Code refactoring, formatting, minor fixes
7171
- **`[2024/04/21]`** NAVSIM v1.0 release (official devkit version for [AGC 2024](https://opendrivelab.com/challenge2024/#end_to_end_driving_at_scale))
7272
- Parallelization of metric caching / evaluation
7373
- Adds [Transfuser](https://arxiv.org/abs/2205.15997) baseline (see [agents](docs/agents.md#Baselines))

docs/agents.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,7 @@ You can configure the set of sensor modalities to use and how much history you n
7676

7777
**Why LiDAR?** Recent literature on open-loop planning has opted away from LiDAR in favor of using surround-view high-resolution cameras. This has significantly strained the compute requirements for training and testing SoTA planners. We hope that the availability of the LiDAR modality enables more computationally efficient submissions that use fewer (or low-resolution) camera inputs.
7878

79-
**Ego Status.** Besides the sensor data, an agent also receives the ego pose, velocity and acceleration information in local coordinates. Finally, to disambiguate driver intention, we provide a discrete driving command, indicating whether the intended route is towards the left, straight or right direction. Importantly, the driving command in NAVSIM is based solely on the desired route, and does not entangle information regarding obstacles and traffic signs (as was prevalent on prior benchmarks such as nuScenes). Note that the left and right driving commands cover turns, lane changes and sharp curves.
79+
**Ego Status.** Besides the sensor data, an agent also receives the ego pose, velocity and acceleration information in local coordinates. Finally, to disambiguate driver intention, we provide a discrete driving command, indicating whether the intended route is towards the left, straight or right direction. There is also a fourth command, representing 'unknown', which can be used to filter out data during training. Importantly, the driving command in NAVSIM is based solely on the desired route, and does not entangle information regarding obstacles and traffic signs (as was prevalent on prior benchmarks such as nuScenes). Note that the left and right driving commands cover turns, lane changes and sharp curves.
8080

8181
## Output
8282

docs/metrics.md

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -62,15 +62,14 @@ The new NAVSIM v2 evaluation uses a two-stage aggregation process to approximate
6262

6363
2. **Second Stage Scoring:**
6464
- In addition to the initial scene, multiple potential follow-up scenes to this initial scene are included in the test set to be evaluated.
65-
- The follow-up scenes were pre-computed by rolling out several simulations starting from the initial scene, each with a different 3-second plan.
65+
- The follow-up scenes were pre-computed by rolling out several simulations starting from the initial scene, each with a different 4-second plan.
6666
- Therefore, each of these follow-up scenes starts from a state different to the endpoint of the initial scene, e.g. with a lateral offset or different speed.
6767
- We evaluate the submitted planner on each follow-up scene over a fixed horizon (4 seconds) using the EPDMS metric.
68-
- For these scenes, we compute the EPDMS with [reactive background traffic](traffic_agents.md) instead of log-replay agents.
6968

7069
3. **Weighting and Aggregation:**
7170
- To emulate the effects of closed-loop simulation, the relevance of each follow-up scene to the overall score depends on how close its starting position is to where the submitted planner actually ended in the first stage.
7271
- We assign higher weights to follow-up scenes that start closer to the submitted planner's end position.
73-
- Finally, we compute a weighted average of all scores second-stage to produce the aggregated metric.
72+
- We first compute a weighted aggregation if all second-stage scores. Finally, we aggregate the scores of the first and second stage to produce the aggregated metric.
7473

7574
# Run an evaluation
7675
To evaluate the PDM score for an agent you can run:

docs/splits.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -80,8 +80,8 @@ In Navsim-v1.1, the training/test split can bet set with a single config paramet
8080
<td rowspan="2">Competition</td>
8181
<td>warmup_two_stage</td>
8282
<td>Warmup test split to validate submission on hugging face. Available as a filter for test split.</td>
83-
<td>-</td>
84-
<td>-</td>
83+
<td>27M</td>
84+
<td>1.2G</td>
8585
<td>
8686
train_test_split=warmup_two_stage
8787
</td>

docs/submission.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,12 +1,12 @@
11
# Submitting to the leaderboard
22

3-
NAVSIM comes with an official leaderboard ([Leaderboard 2024](https://huggingface.co/spaces/AGC2024-P/e2e-driving-navsim), [Leaderboard 2025 Warmup](https://huggingface.co/spaces/AGC2025/e2e-driving-warmup)) on HuggingFace. The leaderboard prevents ambiguity in metric definitions between different projects, as all evaluation is performed on the server with the official evaluation script.
3+
NAVSIM comes with official leaderboards ([Leaderboard 2024](https://huggingface.co/spaces/AGC2024-P/e2e-driving-navsim), [Leaderboard 2025 Warmup](https://huggingface.co/spaces/AGC2025/e2e-driving-warmup)) on HuggingFace. The leaderboards prevent ambiguity in metric definitions between different projects, as all evaluation is performed on the server with the official evaluation script.
44

5-
For the [NAVSIM challenge 2025 warmup](https://huggingface.co/spaces/AGC2025/e2e-driving-warmup), we now open the leaderboard with the `warmup_navsafe_two_stage_extended` split. In this guide, we describe how to create a valid submission and what rules apply for the new leaderboard.
5+
For the [NAVSIM challenge 2025 warmup](https://huggingface.co/spaces/AGC2025/e2e-driving-warmup), we now open the leaderboard with the `warmup_two_stage` split. In this guide, we describe how to create a valid submission.
66

77
### Rules
8-
9-
- **Open-source code and models**:
8+
- [**General rules (for Leaderboard 2025)**](https://opendrivelab.com/challenge2025/#rule)
9+
- **Open-source code and models (for Leaderboard 2024)**:
1010

1111
- We will periodically (~every 6 months) be removing all entries on the leaderboard which **do not provide associated open-source training and inference code with the corresponding pre-trained checkpoints**. Even if removed for not having this information, an entry can be resubmitted once the code needed for reproducibility is made publicly available.
1212
- Code must be provided by setting the `TEAM_NAME` variable of the submission file as `"<a href=Link/to/repository>Method name</a>"`. Note that this can also be edited on the leaderboard for an existing submission, if the repo is created (or updated) after the initial submission.
Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,3 @@
1-
wget https://huggingface.co/datasets/OpenDriveLab/OpenScene/resolve/main/navsim-v2/navsim_v2.1_warmup_two_stage.tar.gz
2-
tar -xzvf navsim_v2.1_warmup_two_stage.tar.gz
3-
rm navsim_v2.1_warmup_two_stage.tar.gz
1+
wget https://huggingface.co/datasets/OpenDriveLab/OpenScene/resolve/main/navsim-v2/navsim_v2.1.1_warmup_two_stage.tar.gz
2+
tar -xzvf navsim_v2.1.1_warmup_two_stage.tar.gz
3+
rm navsim_v2.1.1_warmup_two_stage.tar.gz

0 commit comments

Comments
 (0)