Commit d08a403
[1/N] Refactored AutoQuantizeSearcher to _AutoQuantizeBaseSearcher & AutoQuantizeGradientSearcher; seperated quant modules and score modules
Signed-off-by: realAsma <[email protected]>
minor
Signed-off-by: realAsma <[email protected]>
minor
Signed-off-by: realAsma <[email protected]>
chery-picked some relevant changes
Signed-off-by: realAsma <[email protected]>
updated docs; code clean up
Signed-off-by: realAsma <[email protected]>
clean ups
Signed-off-by: realAsma <[email protected]>
clean ups
Signed-off-by: realAsma <[email protected]>
Remove torch.compile decorator to fix ONNX unittests
Signed-off-by: realAsma <[email protected]>
minor updates
Signed-off-by: realAsma <[email protected]>
updates
Signed-off-by: realAsma <[email protected]>
refactored auto_quantize dist_sync for score and cost
Signed-off-by: realAsma <[email protected]>
minor
minor
Signed-off-by: realAsma <[email protected]>
minor
Signed-off-by: realAsma <[email protected]>
minor
Signed-off-by: realAsma <[email protected]>
minor
Signed-off-by: realAsma <[email protected]>1 parent 01e24fd commit d08a403
File tree
6 files changed
+582
-300
lines changed- modelopt/torch
- opt
- quantization
- plugins
- tests
- gpu/torch/export
- unit/torch/quantization
6 files changed
+582
-300
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
48 | 48 | | |
49 | 49 | | |
50 | 50 | | |
51 | | - | |
| 51 | + | |
52 | 52 | | |
53 | 53 | | |
54 | 54 | | |
| |||
249 | 249 | | |
250 | 250 | | |
251 | 251 | | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
| 255 | + | |
| 256 | + | |
252 | 257 | | |
253 | 258 | | |
254 | | - | |
255 | | - | |
| 259 | + | |
256 | 260 | | |
257 | 261 | | |
258 | 262 | | |
| |||
0 commit comments