Add Gemma by yisi-wang-slalom · Pull Request #87 · aws-neuron/transformers-neuronx

yisi-wang-slalom · 2024-05-23T01:22:45Z

Description of changes:

We are excited to help adding support for Gemma model architectured by Google.

A few highlights, mainly referencing: huggingface/transformers#29402

Using gelu_new activation function, we've validated it against the PyTorch implementation of gelu_pytorch_tanh to ensure same result, and recommend implementing gelu_pytorch_tanh when possible.
Add Layernorm (w+1) of RMS Layernorm for gemma
Include additional normalization in embedding (multiplies the embeddings by sqrt(hidden_dim)) for gemma

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

aws-guangyhu · 2025-10-02T21:51:10Z

As of release 2.26.0, support has ended for the Transformers NeuronX library. If you are still using Transformers NeuronX, Neuron recommends that you migrate to the NxD Inference library, which provides a PyTorch-based inference library. Refer to the migration guide to learn how to migrate your Transformers NeuronX workloads to NxD Inference. PyTorch inference Deep Learning Container (DLC) will no longer include the transformers-neuronx package as well and Neuron no longer provides the transformers_neuronx virtual environment in both single and multi-framework DLAMIs. For more details, see Announcing end of support for Transformers NeuronX library starting in Neuron 2.26 release.

Add gemma folder and initial files

3b7d7f6

yisi-wang-slalom requested review from aws-maens and musunita as code owners May 23, 2024 01:22

yisi-wang-slalom added 4 commits May 26, 2024 18:41

add gemma_rms_norm, padding_idx, embedding normalization

80b2dd9

add gemma_rms_lm_head

cdee3a2

move changes into gemma/hlo.py

8185329

add space back to hlo.py

5930180

aws-maens requested a review from mmcclean-aws July 9, 2024 20:29

sssrijan-amazon closed this Oct 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Gemma#87

Add Gemma#87
yisi-wang-slalom wants to merge 5 commits intoaws-neuron:mainfrom
yisi-wang-slalom:add_gemma

yisi-wang-slalom commented May 23, 2024 •

edited

Loading

Uh oh!

aws-guangyhu commented Oct 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

yisi-wang-slalom commented May 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aws-guangyhu commented Oct 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yisi-wang-slalom commented May 23, 2024 •

edited

Loading