852866031/NLP_final_project

Team: Jiaxuan Chen, Xinyi Zhu, Junyu Li

A Comparative Analysis Between mBERT and XLM-RoBERTa (XLM-R)

Original dataset citation

Phillip Keung, Yichao Lu, György Szarvas and Noah A. Smith. “The Multilingual Amazon Reviews Corpus.” In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020.

https://github.com/huggingface/datasets/tree/master/datasets/amazon_reviews_multi#dataset-description

Trim training set

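macOS

brew install coreutils
gshuf -n N input > output

Here N is the number of lines to keep, input is the full training file, and output is the trimmed file.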
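For environments where GNU coreutils is not available, the same trimming step can be approximated in Python. This is a minimal sketch, not the project's own code: the function name `sample_lines`, its parameters, and the optional `seed` are hypothetical, and it assumes the training file stores one record per line and fits in memory.

```python
import random


def sample_lines(input_path, output_path, n, seed=None):
    """Write up to n randomly chosen lines from input_path to output_path.

    Hypothetical helper that mirrors `gshuf -n N input > output`.
    Returns the number of lines actually written.
    """
    rng = random.Random(seed)  # a fixed seed makes the sample reproducible

    with open(input_path, encoding="utf-8") as src:
        # Make sure every line ends with a newline so that a final line
        # without one cannot be glued to the line written after it.
        lines = [line if line.endswith("\n") else line + "\n" for line in src]

    # Like shuf -n, never ask for more lines than the file contains.
    chosen = rng.sample(lines, min(n, len(lines)))

    with open(output_path, "w", encoding="utf-8") as dst:
        dst.writelines(chosen)

    return len(chosen)
```

Calling `sample_lines("input", "output", 1000, seed=0)` would write 1000 randomly selected lines in random order. Passing a seed is a design choice that `gshuf -n` does not make by default; it keeps the trimmed training set identical across runs.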
