    Repositories list

    • A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
      Python
      MIT License
      1221.6k160 Updated Jan 28, 2025
    • B-STaR

      Public
      B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
      Python
      116801 Updated Jan 3, 2025
    • mstar

      Public
      M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
      MIT License
      15100 Updated Dec 25, 2024
    • dart-math

      Public
      [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
      Jupyter Notebook
      MIT License
      39120 Updated Dec 10, 2024
    • deita

      Public
      Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
      Python
      Apache License 2.0
      2953150 Updated Dec 9, 2024
    • On the Universal Truthfulness Hyperplane Inside LLMs (EMNLP 2024)
      Python
      0500 Updated Oct 3, 2024
    • Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
      Python
      MIT License
      613000 Updated Sep 20, 2024
    • An Analytical Evaluation Board of Multi-turn LLM Agents
      SAS
      2827285 Updated May 20, 2024
    • In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
      Python
      55031 Updated Mar 30, 2024
    • JavaScript
      0000 Updated Jan 25, 2024
    • felm

      Public
      Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
      Python
      15730 Updated Dec 25, 2023
    • [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"
      Python
      Apache License 2.0
      95931 Updated Nov 26, 2023
    • ceval

      Public
      Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
      Python
      MIT License
      791.7k80 Updated Oct 26, 2023
    • Python
      1700 Updated Oct 3, 2023
    • SynCSE

      Public
      This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"
      Python
      MIT License
      53810 Updated Jun 9, 2023