Skip to content
@swiss-ai

swiss-ai

Popular repositories Loading

  1. mmore mmore Public

    Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets …

    Python 98 22

  2. apertus-tech-report apertus-tech-report Public

    Tech Report of the Apertus LLM Suite

    75 2

  3. pretrain-data pretrain-data Public

    Pretraining data reconstruction scripts for Apertus

    Python 63 3

  4. Megatron-LM Megatron-LM Public

    Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python 25 12

  5. MoE MoE Public

    some mixture of experts architecture implementations

    Python 18 3

  6. parity-aware-bpe parity-aware-bpe Public

    Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization [arXiv 2025]

    Python 15 3

Repositories

Showing 10 of 44 repositories
  • mmore Public

    Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets and feed them to an LLM as a knowledge base? Well, MMORE is here to help you!

    swiss-ai/mmore’s past year of commit activity
    Python 98 Apache-2.0 21 24 5 Updated Sep 5, 2025
  • posttraining Public
    swiss-ai/posttraining’s past year of commit activity
    Shell 8 MIT 2 0 0 Updated Sep 4, 2025
  • swiss-ai/model-spinning’s past year of commit activity
    Python 6 2 0 0 Updated Sep 4, 2025
  • sglang Public Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    swiss-ai/sglang’s past year of commit activity
    Python 0 Apache-2.0 2,850 0 0 Updated Sep 3, 2025
  • pretrain-code Public

    Pretraining codebase for Apertus models, based on Megatron-LM

    swiss-ai/pretrain-code’s past year of commit activity
    Shell 9 Apache-2.0 1 0 1 Updated Sep 3, 2025
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    swiss-ai/Megatron-LM’s past year of commit activity
    Python 25 3,141 6 16 Updated Sep 3, 2025
  • apertus_memorization Public

    Reproduce the memorization analysis for Apertus

    swiss-ai/apertus_memorization’s past year of commit activity
    Python 5 2 0 0 Updated Sep 2, 2025
  • apertus-tech-report Public

    Tech Report of the Apertus LLM Suite

    swiss-ai/apertus-tech-report’s past year of commit activity
    75 2 0 0 Updated Sep 2, 2025
  • hfconverter Public
    swiss-ai/hfconverter’s past year of commit activity
    Python 2 2 0 0 Updated Sep 2, 2025
  • lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    swiss-ai/lm-evaluation-harness’s past year of commit activity
    Python 0 MIT 2,712 0 2 Updated Sep 2, 2025

Most used topics

Loading…