feat(ptv3): add a lidar segmentation model with onnx support #45

knzo25 · 2025-05-19T07:56:26Z

Summary

This PR ports Pointcept's PTv3 with the following features:

T4dataset support
ONNX deployment support
Most of the original codebase removed, since we only want PTv3

Change point

Same as the summary

Note

Since the ONNX-compatible spconv had to be modified, BEVFusion and other spconv-dependent modules should from now on be trained with spconv instead of mmcv's implementation.

Test performed

Before NaN fix

Logs [TIER IV INTERNAL LINK]

[2025-04-25 02:10:16,386 INFO test.py line 339 2191] Val result: mIoU/mAcc/allAcc 0.7411/0.8754/0.9103
[2025-04-25 02:10:16,386 INFO test.py line 345 2191] Class_0 - vehicle Result: iou/accuracy 0.9688/0.9838
[2025-04-25 02:10:16,386 INFO test.py line 345 2191] Class_1 - bicycle Result: iou/accuracy 0.3464/0.8544
[2025-04-25 02:10:16,386 INFO test.py line 345 2191] Class_2 - pedestrian Result: iou/accuracy 0.6848/0.7068
[2025-04-25 02:10:16,386 INFO test.py line 345 2191] Class_3 - road Result: iou/accuracy 0.9278/0.9616
[2025-04-25 02:10:16,386 INFO test.py line 345 2191] Class_4 - vegetation Result: iou/accuracy 0.7076/0.8744
[2025-04-25 02:10:16,386 INFO test.py line 345 2191] Class_5 - obstacle Result: iou/accuracy 0.8111/0.8714

After NaN fix

Logs [TIER IV INTERNAL LINK]

[2025-10-06 07:42:24,038 INFO test.py line 226 4469] Test: 1346372774a4ace253a23e3a2e66fe5f [696/696]-178114 Batch 4.134 (4.030) Accuracy 0.9575 (0.8678) mIoU 0.7423 (0.8008)
[2025-10-06 07:42:24,103 INFO test.py line 243 4469] Syncing ...
[2025-10-06 07:42:24,104 INFO test.py line 269 4469] Val result: mIoU/mAcc/allAcc 0.8008/0.8678/0.9285
[2025-10-06 07:42:24,105 INFO test.py line 271 4469] Class_0 - vehicle Result: iou/accuracy 0.9717/0.9880
[2025-10-06 07:42:24,105 INFO test.py line 271 4469] Class_1 - bicycle Result: iou/accuracy 0.4604/0.5701
[2025-10-06 07:42:24,105 INFO test.py line 271 4469] Class_2 - pedestrian Result: iou/accuracy 0.8322/0.8944
[2025-10-06 07:42:24,105 INFO test.py line 271 4469] Class_3 - road Result: iou/accuracy 0.9336/0.9540
[2025-10-06 07:42:24,105 INFO test.py line 271 4469] Class_4 - vegetation Result: iou/accuracy 0.7542/0.8863
[2025-10-06 07:42:24,105 INFO test.py line 271 4469] Class_5 - obstacle Result: iou/accuracy 0.8523/0.9138
[2025-10-06 07:42:24,105 INFO test.py line 279 4469] <<<<<<<<<<<<<<<<< End Evaluation <<<<<<<<<<<<<<<<<
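
To compare the two evaluation logs class by class, a small script along these lines may help. It is only a sketch: the regular expression and the helper `parse_class_iou` are illustrative names that are not part of the PR, and the log strings contain just the message part of the per-class lines copied from above.

```python
import re

# Matches the per-class result lines printed by test.py in the logs above.
CLASS_RE = re.compile(r"Class_\d+ - (\w+) Result: iou/accuracy ([\d.]+)/([\d.]+)")


def parse_class_iou(log_text):
    """Return {class_name: IoU} for every per-class result line in the log."""
    return {name: float(iou) for name, iou, _acc in CLASS_RE.findall(log_text)}


# Message parts of the per-class lines, copied from the two logs above.
before_log = """
Class_0 - vehicle Result: iou/accuracy 0.9688/0.9838
Class_1 - bicycle Result: iou/accuracy 0.3464/0.8544
Class_2 - pedestrian Result: iou/accuracy 0.6848/0.7068
Class_3 - road Result: iou/accuracy 0.9278/0.9616
Class_4 - vegetation Result: iou/accuracy 0.7076/0.8744
Class_5 - obstacle Result: iou/accuracy 0.8111/0.8714
"""
after_log = """
Class_0 - vehicle Result: iou/accuracy 0.9717/0.9880
Class_1 - bicycle Result: iou/accuracy 0.4604/0.5701
Class_2 - pedestrian Result: iou/accuracy 0.8322/0.8944
Class_3 - road Result: iou/accuracy 0.9336/0.9540
Class_4 - vegetation Result: iou/accuracy 0.7542/0.8863
Class_5 - obstacle Result: iou/accuracy 0.8523/0.9138
"""

before = parse_class_iou(before_log)
after = parse_class_iou(after_log)

# mIoU is the unweighted mean of the per-class IoU values.
miou_before = sum(before.values()) / len(before)
miou_after = sum(after.values()) / len(after)

# Per-class IoU change, largest gain first.
deltas = {name: after[name] - before[name] for name in before}
for name, delta in sorted(deltas.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<12}{delta:+.4f}")
print(f"mIoU {miou_before:.4f} -> {miou_after:.4f}")
```

Because the per-class values in the logs are already rounded to four decimals, the recomputed mean can differ from the logged mIoU in the last digit.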

amadeuszsz · 2025-05-31T06:09:07Z

@knzo25
Sorry for the late response!
Are you still able to run the environment and deploy ONNX for the latest model (link)? I followed the instructions in the README, but the deployment script fails because ConcatDataset is missing (I suspect the real issue lies elsewhere).

scepter914 · 2025-06-09T00:57:14Z

Memo
From the standpoint of AWML's overall design, the changes in this PR look great to me.
I asked @amadeuszsz to do the code-level review 🙏

knzo25 · 2025-06-15T06:03:54Z

@amadeuszsz
Can you look for a model compatible with the one I submitted in autowarefoundation/autoware_universe#10600?

The one you provided is 5cm per voxel, but for "real time" I recommend the 10cm one
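
The grid-size trade-off mentioned here can be illustrated with a toy voxel count. This is a rough sketch on synthetic data, not on T4dataset: the point cloud is a made-up flat 20 m x 20 m patch, `count_occupied_voxels` is a hypothetical helper, and the assumption is that a coarser grid helps "real time" use mainly by leaving fewer occupied voxels for the network to process.

```python
import random


def count_occupied_voxels(points_mm, grid_mm):
    """Count distinct voxels hit by at least one point.

    Coordinates and grid size are integer millimetres so the binning is exact.
    """
    return len({(x // grid_mm, y // grid_mm, z // grid_mm) for x, y, z in points_mm})


# Synthetic stand-in for a lidar sweep: points scattered on a flat ground patch.
rng = random.Random(0)
points_mm = [(rng.randrange(20_000), rng.randrange(20_000), 0) for _ in range(50_000)]

voxels_5cm = count_occupied_voxels(points_mm, 50)
voxels_10cm = count_occupied_voxels(points_mm, 100)
print(f"5 cm grid: {voxels_5cm} occupied voxels")
print(f"10 cm grid: {voxels_10cm} occupied voxels")
```

The actual reduction on real scans depends on point density and scene geometry, so the numbers printed here should not be read as a benchmark.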

amadeuszsz · 2025-06-16T00:03:07Z

> @amadeuszsz Can you look for a model compatible with the one I submitted in autowarefoundation/autoware_universe#10600?
>
> The one you provided is 5cm per voxel, but for "real time" I recommend the 10cm one

@knzo25
I confirm that the two available models use a grid size of 5 cm. Apart from these models, I can't find anything else in the provided documentation.

amadeuszsz

Great PR overall, but I couldn't test it because some files are missing.
Unfortunately, @knzo25 is not available to look into it, so we may have to dig deeper into this issue ourselves (if time can be allocated).

projects/PTv3/README.md

projects/PTv3/datasets/dataloader.py

projects/PTv3/models/point_transformer_v3/point_transformer_v3m1_base.py

projects/SparseConvolution/sparse_conv.py

KSeangTan · 2025-09-10T07:00:22Z

Hi @amadeuszsz @knzo25
Is the PR still ongoing? Otherwise, we can assign someone else to take over if you don't mind.

amadeuszsz · 2025-09-10T07:05:53Z

@KSeangTan

The code changes attached in the review solve most of the issues, and I can push them. However, the NaN loss issue still exists. Unfortunately, I have no spare time right now to investigate it in depth, so if someone else can take a look at this dataset issue, please let me know 🙏🏻

EDIT:
Fixes already pushed; the NaN issue still has to be solved.

KSeangTan · 2025-09-11T04:09:42Z

Thanks @amadeuszsz
Do you think we can close the PR first, leave a TODO, and take another look once we have more buffer?

amadeuszsz · 2025-09-12T01:20:59Z

@KSeangTan
OK, then let me look at it once more. If I can't find the source of the issue with our dataset, we'll merge it with a TODO comment.

amadeuszsz · 2025-09-24T08:42:39Z

For now, we also have another issue: we can export to ONNX, but when our ROS node builds the engine, the TRT backend somehow assigns a static shape to the input tensors, even though I can see correctly defined dynamic axes in the ONNX file.

I see that this static shape matches one of the GEMM block constants (~160k). I believe the ONNX backend uses the concrete value from the sample input data and bakes it into the graph as a Constant node. Then in TRT:

[V] [TRT] Parsing node: /model/backbone/enc/enc0/block0/cpe/cpe.1/Constant [Constant]
[V] [TRT] /model/backbone/enc/enc0/block0/cpe/cpe.1/Constant [Constant] outputs: [/model/backbone/enc/enc0/block0/cpe/cpe.1/Constant_output_0 -> (161089, 32)[FLOAT]]

which in turn results in Nx161089 input tensor shapes.

Now we can't deploy ONNX properly, so the ROS node cannot be merged either... I will try to find the root cause.

Edit: Fixed. I had accidentally used the wrong spconv implementation. Now I just need to make this project able to train and export without code modification. I am also currently testing a fix for the crash during training.
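
When chasing this kind of baked-in shape, it can help to scan the verbose TensorRT log for unusually large Constant outputs. The following is a minimal sketch that assumes the log lines follow the format of the excerpt above; `find_large_constants` and the `min_elements` threshold are illustrative choices, not part of the PR or of TensorRT.

```python
import re

# Matches "outputs:" lines for Constant nodes in the format of the excerpt above.
OUTPUT_RE = re.compile(
    r"\[V\] \[TRT\] (?P<node>\S+) \[Constant\] outputs: "
    r"\[\S+ -> \((?P<shape>[\d, ]*)\)\[(?P<dtype>\w+)\]"
)


def find_large_constants(log_text, min_elements=100_000):
    """Return (node_name, shape) for Constant outputs with at least min_elements values."""
    hits = []
    for line in log_text.splitlines():
        match = OUTPUT_RE.search(line)
        if not match:
            continue
        dims = [int(d) for d in match.group("shape").split(",") if d.strip()]
        n_elements = 1
        for dim in dims:
            n_elements *= dim
        if n_elements >= min_elements:
            hits.append((match.group("node"), tuple(dims)))
    return hits


# The two log lines quoted in the comment above.
sample_log = """
[V] [TRT] Parsing node: /model/backbone/enc/enc0/block0/cpe/cpe.1/Constant [Constant]
[V] [TRT] /model/backbone/enc/enc0/block0/cpe/cpe.1/Constant [Constant] outputs: [/model/backbone/enc/enc0/block0/cpe/cpe.1/Constant_output_0 -> (161089, 32)[FLOAT]]
"""

hits = find_large_constants(sample_log)
for node, shape in hits:
    print(f"{node}: {shape}")
```

A Constant whose leading dimension equals the number of points in the sample input is a hint that a data-dependent value was frozen at export time rather than traced as a dynamic shape.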

amadeuszsz

LGTM!

Note:

We can deploy the model.
We still have an issue with NaNs during training, which later causes the training loop to crash. This issue is under investigation and I hope we can address it soon.
Code cleanup will follow after the NaN issue is fixed.
A dataset description still needs to be added.

amadeuszsz · 2025-10-06T10:24:48Z

@knzo25
NaN loss is solved with 95f859f. I updated the logs in the PR description.

KSeangTan

LGTM overall, let's approve and merge first

knzo25 and others added 12 commits April 16, 2025 00:26

feat: copied essential files from ptv3(pointcept). still need to remove more and awml-fy it (can train/test)

54676f6

Signed-off-by: Kenzo Lobos-Tsunekawa <[email protected]>

feat: added dockerfile and added segmented pointcloud output

3aaba6b

Signed-off-by: Kenzo Lobos-Tsunekawa <[email protected]>

feat: implemented export logic. onnx can be generated but will not generalize yet. no idea how many errors will appear in tensorrt yet

c2ee526

Signed-off-by: Kenzo Lobos-Tsunekawa <[email protected]>

feat:

f883f1f

- limited range on eval
- used max spatial shape throughout the network for tensorrt generalization. inference may have changed somewhat so may need to retrain

Signed-off-by: Kenzo Lobos-Tsunekawa <[email protected]>

feat: all changes for deployment implemented

5429304

Signed-off-by: Kenzo Lobos-Tsunekawa <[email protected]>

chore: removed unused evaluators and unused pointops

1a78fe6

Signed-off-by: Kenzo Lobos-Tsunekawa <[email protected]>

chore: removed most of the unused code

541afb8

Signed-off-by: Kenzo Lobos-Tsunekawa <[email protected]>

chore: added compatibility for torch>=2.6 loading and cleaned export code

aef8807

Signed-off-by: Kenzo Lobos-Tsunekawa <[email protected]>

chore: updated ptv3's docker

e35414d

Signed-off-by: Kenzo Lobos-Tsunekawa <[email protected]>

chore: applied pre-commit

b08b34c

Signed-off-by: Kenzo Lobos-Tsunekawa <[email protected]>

chore: added license and readme

ca6d486

Signed-off-by: Kenzo Lobos-Tsunekawa <[email protected]>

chore: removed unused imports

8fa52ca

Signed-off-by: Kenzo Lobos-Tsunekawa <[email protected]>

knzo25 self-assigned this May 19, 2025

knzo25 requested review from amadeuszsz and scepter914 May 19, 2025 07:56

knzo25 mentioned this pull request May 19, 2025

feat(autoware_ptv3): implemented an inference node for ptv3 using tensorrt autowarefoundation/autoware_universe#10600

Merged

Merge branch 'main' into feat/ptv3

faa9490

amadeuszsz removed the request for review from scepter914 July 3, 2025 06:26

amadeuszsz requested changes Jul 3, 2025

projects/PTv3/README.md (outdated)

projects/PTv3/datasets/dataloader.py (outdated)

projects/PTv3/models/point_transformer_v3/point_transformer_v3m1_base.py

amadeuszsz reviewed Jul 9, 2025

projects/SparseConvolution/sparse_conv.py

amadeuszsz added 4 commits September 10, 2025 16:10

fix(PTv3): update PYTHONPATH

fc953a4

Signed-off-by: Amadeusz Szymko <[email protected]>

fix(PTv3): circular import

ac27913

Signed-off-by: Amadeusz Szymko <[email protected]>

feat(PTv3): add scatter

12a509a

Signed-off-by: Amadeusz Szymko <[email protected]>

fix(PTv3): correct imports

0bc20cb

Signed-off-by: Amadeusz Szymko <[email protected]>

fix(ptv3): final adjustments

66b6ad1

Signed-off-by: Amadeusz Szymko <[email protected]>

amadeuszsz added 3 commits September 26, 2025 21:05

fix(PTv3): correct imports for training & deployment

12860c3

Signed-off-by: Amadeusz Szymko <[email protected]>

docs(PTv3): update paths

2830827

Signed-off-by: Amadeusz Szymko <[email protected]>

chore(github): update codeowners

c066e3c

Signed-off-by: Amadeusz Szymko <[email protected]>

amadeuszsz approved these changes Sep 27, 2025

amadeuszsz and others added 3 commits October 6, 2025 19:21

fix(PTv3): prevent NaN loss

95f859f

Signed-off-by: Amadeusz Szymko <[email protected]>

docs(PTv3): remove limitations

280e49b

Signed-off-by: Amadeusz Szymko <[email protected]>

ci(pre-commit): autofix

0bc5329

amadeuszsz and others added 11 commits October 6, 2025 22:58

chore(PTv3): clean-up dependencies

d1d75dc

Signed-off-by: Amadeusz Szymko <[email protected]>

fix(PTv3): torch warnings

e61f931

Signed-off-by: Amadeusz Szymko <[email protected]>

fix(PTv3): assertion for batch size > 1

d1df42a

Signed-off-by: Amadeusz Szymko <[email protected]>

fix(PTv3): deprecated function

6e425aa

Signed-off-by: Amadeusz Szymko <[email protected]>

fix(PTv3): wrong arg

08ea27c

Signed-off-by: Amadeusz Szymko <[email protected]>

feat(PTv3): config adjustments

6d5e819

Signed-off-by: Amadeusz Szymko <[email protected]>

ci(pre-commit): autofix

55509a8

feat(PTv3): Dockerfile adjustments

28dc434

Signed-off-by: Amadeusz Szymko <[email protected]>

feat(PTv3): path adjustments

d0f1019

Signed-off-by: Amadeusz Szymko <[email protected]>

chore(PTv3): remove unused symlink

0dd73ca

Signed-off-by: Amadeusz Szymko <[email protected]>

chore(PTv3): remove unused scripts

3133969

Signed-off-by: Amadeusz Szymko <[email protected]>

amadeuszsz requested a review from SamratThapa120 as a code owner October 8, 2025 04:54

amadeuszsz force-pushed the feat/ptv3 branch from 6aba74c to 3133969 Compare October 8, 2025 04:56

amadeuszsz requested review from KSeangTan and removed request for SamratThapa120 October 8, 2025 04:57

KSeangTan approved these changes Oct 8, 2025

Merge branch 'main' into feat/ptv3

834e8b9

amadeuszsz merged commit f25b474 into tier4:main Oct 9, 2025
2 checks passed
