Skip to content

Add Mimi codebook embedding visualization script#1

Open
pshishodiaa wants to merge 1 commit into
mainfrom
claude/visualize-mimi-embeddings-SJYp5
Open

Add Mimi codebook embedding visualization script#1
pshishodiaa wants to merge 1 commit into
mainfrom
claude/visualize-mimi-embeddings-SJYp5

Conversation

@pshishodiaa

@pshishodiaa pshishodiaa commented Feb 14, 2026

Copy link
Copy Markdown
Collaborator

Download kyutai/mimi from HuggingFace and visualize the acoustic
codebook embeddings (2048 vocab, 256 dim) reduced to 2D via t-SNE
for quantizer layers q=0,1,2,3,4,8,16,30.

https://claude.ai/code/session_01E6s5UqmJZazkWtPAjQFwf1


Note

Low Risk
Adds an offline analysis script only; no changes to library/runtime behavior, though it executes trust_remote_code=True when downloading the model.

Overview
Adds a standalone visualize_mimi_codebooks.py utility that downloads kyutai/mimi from HuggingFace, extracts selected acoustic quantizer codebook embeddings, runs t-SNE to reduce them to 2D, and saves a multi-panel scatterplot image (mimi_codebook_tsne.png).

Written by Cursor Bugbot for commit 5637a71. This will update automatically on new commits. Configure here.

Download kyutai/mimi from HuggingFace and visualize the acoustic
codebook embeddings (2048 vocab, 256 dim) reduced to 2D via t-SNE
for quantizer layers q=0,1,2,3,4,8,16,30.

https://claude.ai/code/session_01E6s5UqmJZazkWtPAjQFwf1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants