rmes-ai commented Jul 25, 2025

  • Fix image processor initialization to use vit_img_size and splitted_image_size
  • Simplify image processing pipeline to work with MPS backend
  • Add dummy lm_eos_token_id and splitted_image_size config parameters
  • Bypass complex image splitting logic that was causing MPS issues

This is a temporary fix to enable nanoVLM inference on Apple Silicon Macs. The changes maintain functionality while working around MPS limitations.
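To make the "bypass image splitting" idea concrete, here is a minimal sketch, not the code from this PR. Only the field names `vit_img_size`, `splitted_image_size`, and `lm_eos_token_id` come from the description above; the class `ConfigSketch`, the function `plan_crops`, the `bypass_split` flag, and all default values are hypothetical and chosen purely for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class ConfigSketch:
    # Hypothetical stand-in for the project's config object.
    # Field names are from the PR description; the values are assumed.
    vit_img_size: int = 512         # side length the vision encoder expects (assumed)
    splitted_image_size: int = 512  # side length of one tile when splitting (assumed)
    lm_eos_token_id: int = 0        # dummy placeholder value (assumed)


def plan_crops(width, height, cfg, bypass_split=True):
    """Return crop boxes (left, top, right, bottom) for an image.

    With bypass_split=True the whole image is a single crop, which the
    caller would then resize to cfg.vit_img_size. Otherwise the image is
    covered by a grid of tiles of side cfg.splitted_image_size.
    """
    if bypass_split:
        # Simplified path: no tiling, one crop covering the full image.
        return [(0, 0, width, height)]

    tile = cfg.splitted_image_size
    cols = math.ceil(width / tile)
    rows = math.ceil(height / tile)
    boxes = []
    for r in range(rows):
        for c in range(cols):
            # Clamp edge tiles so they never extend past the image.
            boxes.append((c * tile, r * tile,
                          min((c + 1) * tile, width),
                          min((r + 1) * tile, height)))
    return boxes
```

Under these assumptions, the bypassed path always yields exactly one crop per image, so downstream code handles a fixed number of image tensors regardless of input resolution.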
