Commit 9b25fec

update readme
1 parent ea40253 commit 9b25fec

1 file changed: +5 -0 lines changed

README.md

@@ -6,6 +6,7 @@
 🌐 [WebDemo](https://rhymes.ai/) |
 🟣 [Discord](https://discord.com/invite/u8HxU23myj)

+
 ## Introduction
 Aria is a multimodal native MoE model. It features:
 - State-of-the-art performance on various multimodal and language tasks, superior in video and document understanding;
@@ -93,6 +94,10 @@ We offer additional inference methods, such as utilizing [vLLM](https://github.c
 Checkout these [inference examples](https://github.com/rhymes-ai/Aria/tree/main/inference/notebooks) that demonstrate how to use Aria on various applications such as chart understanding, PDF reading, video understanding, etc, available with both Hugging Face Transformers and [vLLM](https://github.com/vllm-project/vllm) backends.

 ## Fine-tuning
+> ⚠️ **Important Note on Fine-tuning**: Due to changes in the weight mapping after Aria's integration into transformers, the training code requires specific versions to work properly:
+> - Use transformers version 4.45.0
+> - Use model revision "4844f0b5ff678e768236889df5accbe4967ec845"
+

 > **Note:** For optimal fine-tuning performance, install the optional `grouped_gemm` dependency:
 > ```bash

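The fine-tuning note added in this commit pins two things: the transformers version (4.45.0) and the model revision. A minimal Python sketch of how a training script might enforce both pins before loading weights is given below. It rests on assumptions that are not stated in the diff: the Hub id `rhymes-ai/Aria` is inferred from the GitHub repository name, `trust_remote_code=True` is assumed to be needed at this transformers version, and the helper names (`version_matches`, `pinned_load_kwargs`, `load_pinned_model`) are hypothetical.

```python
from importlib import metadata
from typing import Optional

# Values copied from the fine-tuning note in the README diff.
REQUIRED_TRANSFORMERS = "4.45.0"
ARIA_REVISION = "4844f0b5ff678e768236889df5accbe4967ec845"

# Assumption: the Hugging Face Hub id mirrors the GitHub repository name.
MODEL_ID = "rhymes-ai/Aria"


def installed_transformers_version() -> Optional[str]:
    """Return the installed transformers version, or None if it is not installed."""
    try:
        return metadata.version("transformers")
    except metadata.PackageNotFoundError:
        return None


def version_matches(installed: Optional[str]) -> bool:
    """True only when the installed version is exactly the pinned one."""
    return installed == REQUIRED_TRANSFORMERS


def pinned_load_kwargs() -> dict:
    """Keyword arguments that pin the checkpoint to the revision from the note."""
    # Assumption: remote code is required because this transformers version
    # predates Aria's integration into the library.
    return {"revision": ARIA_REVISION, "trust_remote_code": True}


def load_pinned_model():
    """Hypothetical helper: refuse to load unless the version pin is satisfied."""
    installed = installed_transformers_version()
    if not version_matches(installed):
        raise RuntimeError(
            f"transformers=={REQUIRED_TRANSFORMERS} is required, found {installed}"
        )
    # Imported lazily so the version check above can run without transformers.
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(MODEL_ID, **pinned_load_kwargs())
```

Calling `load_pinned_model()` at the start of a fine-tuning run fails fast on a mismatched environment instead of surfacing a weight-mapping error later; the exact-match check is deliberately strict because the note names a single version rather than a range.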