Commit f8bf2cf

Release base models for fine-tuning
1 parent 17cb159 commit f8bf2cf

1 file changed

+16 -2 lines changed

README.md

+16-2
@@ -14,7 +14,9 @@ Aria is a multimodal native MoE model. It features:
 
 
 ## News
-- 2024.10.10: We release Aria!
+- [Dec 1, 2024] We release the base models for Aria ([Aria-Base-8K](https://huggingface.co/rhymes-ai/Aria-Base-8K) and [Aria-Base-64K](https://huggingface.co/rhymes-ai/Aria-Base-64K))! They are fully compatible with this inference \& fine-tuning codebase.
+
+- [Oct 10, 2024] We release Aria!
 
 ## Quick Start
 

@@ -99,7 +101,19 @@ We offer both LoRA fine-tuning and full parameter tuning, using various dataset
 - Video datasets
 - Code datasets
 
-For a quick try, visit the [examples](./examples) folder and choose one of the fine-tuning examples.
+For a quick try, visit the [examples](./examples) folder and choose one of the fine-tuning examples. If you would like to fine-tune from base models (recommended when you have a large database), please change the following model paths in the configs ([full](recipes/config_full.yaml) or [lora](recipes/config_lora.yaml))
+
+```yaml
+model_name_or_path: rhymes-ai/Aria
+tokenizer_path: rhymes-ai/Aria
+```
+
+to the ones corresponding to one of the base models:
+
+```yaml
+model_name_or_path: rhymes-ai/Aria-Base-64K # rhymes-ai/Aria-Base-8K
+tokenizer_path: rhymes-ai/Aria-Base-64K # rhymes-ai/Aria-Base-8K
+```
 
 ### Prepare dataset
 Please refer to [custom_dataset.md](docs/custom_dataset.md) for how to prepare your dataset.
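
The README change in this commit asks users to edit two keys, `model_name_or_path` and `tokenizer_path`, by hand. As a minimal sketch, the same swap could be scripted. The helper name `use_base_model` is hypothetical and not part of the Aria codebase, and the sketch assumes both keys sit on their own lines as in the snippets in the diff. It edits the YAML as plain text so that comments and every other setting stay untouched.

```python
import re

# The two keys named in the README change.
PATH_KEYS = ("model_name_or_path", "tokenizer_path")

# Capture "key: " (with its indentation) and match the value that follows it.
_PATH_LINE = re.compile(
    r"^([ \t]*(?:%s)[ \t]*:[ \t]*)\S+" % "|".join(PATH_KEYS),
    re.MULTILINE,
)


def use_base_model(config_text: str, base_model: str = "rhymes-ai/Aria-Base-64K") -> str:
    """Return config_text with both path keys pointing at base_model.

    Hypothetical helper for illustration only. Anything after the value on
    the same line (for example a trailing comment) is preserved.
    """
    return _PATH_LINE.sub(lambda m: m.group(1) + base_model, config_text)


# Example input mirroring the "before" snippet from the README.
example = "model_name_or_path: rhymes-ai/Aria\ntokenizer_path: rhymes-ai/Aria\n"
print(use_base_model(example))
```

To target the 8K base model instead, pass `base_model="rhymes-ai/Aria-Base-8K"`. In practice the function would be applied to the contents of `recipes/config_full.yaml` or `recipes/config_lora.yaml`, with the result written back to disk.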
