Visual blocks are not quantized in code #2

Open
Yiman-GO opened this issue Jan 8, 2025 · 5 comments
Comments


Yiman-GO commented Jan 8, 2025

The function "get_blocks" only return the llm blocks of VLM model. Will the code for quantizing visual blocks be released?

@Albert-huyc

Thank you for your interest in the MBQ work!
The MBQ algorithm focuses on quantizing the LLM blocks in VLMs. In our experiments, we directly quantized the ViT encoder using SmoothQuant, and the implementation of that part is still rough. We are currently refining this part of the code.
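For readers unfamiliar with that step, below is a minimal sketch of the SmoothQuant smoothing operation applied to a single linear layer. The function name and the calibration statistic `act_absmax` are illustrative assumptions, not the repository's actual API.

```python
import torch

def smooth_linear(act_absmax: torch.Tensor, linear: torch.nn.Linear, alpha: float = 0.5):
    """SmoothQuant-style smoothing for one linear layer (illustrative sketch).

    act_absmax: per-input-channel max |activation|, collected on calibration data.
    The scale migrates activation outliers from the activations into the weights,
    so that both tensors become easier to quantize afterwards (e.g. to INT8).
    """
    # Per-input-channel max |weight|; weight shape is [out_features, in_features].
    w_absmax = linear.weight.abs().max(dim=0).values
    # s_j = max|X_j|^alpha / max|W_j|^(1 - alpha)
    scale = (act_absmax.pow(alpha) / w_absmax.pow(1 - alpha)).clamp(min=1e-5)
    # W' = W * diag(s); the activations (or the preceding LayerNorm) are divided by s.
    linear.weight.data *= scale
    return scale
```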

@junghye0klee

@Albert-huyc If so, do the VLM's visual blocks remain in FP16 without quantization?

@Albert-huyc

@junghye0klee You're right. In all our experiments except those detailed in Sec. 5.3.3, the visual blocks of the VLM remained unquantized in FP16.

@junghye0klee

@Albert-huyc Thank you for your kind response. May I ask one more question: does additionally quantizing the visual blocks result in a significant decrease in accuracy?

@Albert-huyc

@junghye0klee That's a good question! We've actually conducted several experiments on this, and you can find the detailed results in Table 4 (Section 5.3.3) of our paper. The experimental results show that quantizing the visual blocks to W4A8 maintains the model's performance without any noticeable degradation.
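For reference, W4A8 means 4-bit weights and 8-bit activations. Below is a minimal fake-quantization sketch of what that could look like for one visual-block linear layer; it is an assumption-laden illustration, not the repository's implementation, and the per-output-channel weight / per-tensor activation granularity is assumed.

```python
import torch
import torch.nn.functional as F

def fake_quant(x: torch.Tensor, n_bits: int, per_channel_dim=None) -> torch.Tensor:
    """Symmetric quantize-then-dequantize, as commonly used to simulate low-bit inference."""
    if per_channel_dim is None:
        absmax = x.abs().max()                                   # per-tensor scale
    else:
        reduce_dims = tuple(d for d in range(x.dim()) if d != per_channel_dim)
        absmax = x.abs().amax(dim=reduce_dims, keepdim=True)     # per-channel scale
    qmax = 2 ** (n_bits - 1) - 1
    scale = absmax.clamp(min=1e-8) / qmax
    return (x / scale).round().clamp(-qmax - 1, qmax) * scale

def w4a8_linear(x: torch.Tensor, linear: torch.nn.Linear) -> torch.Tensor:
    w_q = fake_quant(linear.weight, n_bits=4, per_channel_dim=0)  # 4-bit weights
    x_q = fake_quant(x, n_bits=8)                                 # 8-bit activations
    return F.linear(x_q, w_q, linear.bias)
```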
