Add Tunix demo in miniGPT #269
base: revamp-2025
Conversation
Unfortunately, Qwix has issues with both the new sharding annotation and the KV caching added earlier, so I had to revert both. I also had to re-run the notebook (so that we can get Tunix output), which introduced a lot of metadata and output changes.
@emilyfertig do you know why the test keeps failing? The notebook runs fine on Colab and these libs are properly installed, so the errors don't make sense to me. `NameError: name 'peft_trainer' is not defined`
id: LS9sQEY3n0mB
outputId: 9ffcf3a6-20ef-4f80-b006-f5d3c5644a15
tags: [nbval-ignore-output]
outputId: 6b9ee4b0-eed0-4bae-dd99-ffed14289ad7
It looks like the `nbval-ignore-output` and `nbval-skip` tags got deleted; that might be why the tests are failing.
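For reference, this is roughly where the tag needs to live; a sketch of the cell header using the cell id from the diff above (nothing else about the cell changes):

```{code-cell}
:id: LS9sQEY3n0mB
:tags: [nbval-ignore-output]
# cell body unchanged; nbval runs the cell but ignores its output
```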
:id: z0p-IHurrB9i
# Define a triangular mask for causal attention with `jax.numpy.tril` and `jax.numpy.ones`.
Why remove the comments in this function?
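For context, a minimal standalone sketch of the causal mask that comment described (`seq_len` here is illustrative):

```python
import jax.numpy as jnp

seq_len = 8
# Lower-triangular ones: position i can only attend to positions j <= i.
mask = jnp.tril(jnp.ones((seq_len, seq_len), dtype=bool))
# Add batch/head axes before applying it to the attention logits.
mask = mask[None, None, :, :]
```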
```{code-cell}
:cellView: form
:tags: [hide-cell]
Reinstate this tag?
generated_text = model.generate_text(maxlen, start_tokens)
step += 1
# break
Remove?
## Tunix: Fine-tuning

[REVAMP 6] Introduce Tunix.

[Tunix](https://github.com/google/tunix) is a JAX-native LLM post-training library open sourced by Google. It supports a range of post-training techniques including supervised finetuning, preference tuning, reinforcement learning and model distillation. So in this section, we are going to use Tunix to finetune the miniGPT model we just pretrained using LoRA ([Low-Rank Adaptation](https://arxiv.org/abs/2106.09685)) so that the finetuned model generates output of a different style.
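To make the LoRA reference concrete, here is a minimal sketch of the low-rank update from the paper; this is not the Tunix/Qwix API, and the names `lora_linear`, `lora_a`, `lora_b`, and `alpha` are illustrative:

```python
import jax
import jax.numpy as jnp

def lora_linear(x, w_frozen, lora_a, lora_b, alpha=16.0):
    """y = x @ (W + (alpha / r) * A @ B); W stays frozen, only A and B train."""
    rank = lora_a.shape[1]
    delta = (alpha / rank) * (lora_a @ lora_b)  # low-rank update, same shape as W
    return x @ (w_frozen + delta)

key = jax.random.PRNGKey(0)
d_in, d_out, rank = 64, 64, 8
w = jax.random.normal(key, (d_in, d_out))
a = jax.random.normal(key, (d_in, rank)) * 0.01
b = jnp.zeros((rank, d_out))  # B starts at zero, so training begins from W exactly
x = jnp.ones((2, d_in))
y = lora_linear(x, w, a, b)
```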
In the second sentence, could you remove "So" at the beginning, so it's just "In this section, ..."?
| +++ {"id": "mfnj-S02_yM5"} | ||
|
|
||
| Previously we used Grain to load the Tiny Stories dataset. JAX is actually flexible to work with other dataloaders as well. So for LoRA fintuning we use TensorFlow Datasets to load the [Tiny Shakespeare](https://www.tensorflow.org/datasets/catalog/tiny_shakespeare) dataset. |
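For reference, the TFDS load is a one-liner; a sketch based on the catalog page linked above (the `tiny_shakespeare` train split is a single example with a `text` feature):

```python
import tensorflow_datasets as tfds

# Load the train split and pull out the raw text as a Python string.
ds = tfds.load('tiny_shakespeare', split='train')
text = next(iter(tfds.as_numpy(ds)))['text'].decode('utf-8')
print(text[:250])
```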
We're thinking of removing TFDS from the stack and only including Grain, now that Grain is becoming more mature and TFDS is losing market share. I'd kind of like to stick with just Grain for the tutorial, wdyt?
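If we do switch, a rough sketch of the Grain equivalent, assuming the raw text has already been read into a Python list of strings (`records` and the seed/batch values are placeholders, and the exact Grain API may differ across versions):

```python
import grain

records = ["First Citizen: ...", "All: ..."]  # placeholder lines of text
dataset = (
    grain.MapDataset.source(records)  # wrap the in-memory records
    .shuffle(seed=42)
    .batch(batch_size=8)
)
for batch in dataset:
    ...  # feed batches to the trainer
```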
| +++ {"id": "qvxSxbbcBim7"} | ||
|
|
||
| We define a few helper functions to create the LoRA model, loss and etc. |
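For what it's worth, a minimal sketch of what such a loss helper typically looks like for next-token prediction, using `optax`; the function and argument names here are illustrative, not the notebook's actual helpers:

```python
import optax

def loss_fn(logits, labels, mask):
    # Per-token cross-entropy; `mask` zeroes out padding positions.
    losses = optax.softmax_cross_entropy_with_integer_labels(logits, labels)
    return (losses * mask).sum() / mask.sum()
```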
Remove "and," so it reads "LoRA model, loss, etc."?
| +++ {"id": "UWVdZGK9COj0"} | ||
|
|
||
| After the finetuning, you can easily see that now the model produces text of a different style, kind of like Shakespeare's work, which means our finetuning works. |
Should this show output?
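If so, it could be produced with the helper already shown earlier in the diff, e.g.:

```python
# Uses the notebook's existing generate_text helper and variables.
generated_text = model.generate_text(maxlen, start_tokens)
print(generated_text)
```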