
    Repositories list

    • lab-site

      Public
      Website of GAIR Lab
      JavaScript
      0100 Updated Apr 14, 2026
    • SII-CLI

      Public
      03410 Updated Apr 13, 2026
    • Python
      1821.9k212 Updated Apr 11, 2026
    • AlphaEval

      Public
      Python
      MIT License
      01600 Updated Apr 8, 2026
    • Python
      Apache License 2.0
      5626720 Updated Apr 1, 2026
    • Apache License 2.0
      1012900 Updated Mar 31, 2026
    • OpenSWE

      Public
      Python
      Other
      1317400 Updated Mar 16, 2026
    • DataEvolve

      Public
      Python
      22900 Updated Mar 15, 2026
    • Med

      Public
      What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-Zoom
      Python
      Other
      01700 Updated Mar 10, 2026
    • [ACL 2026] This is the repo of Data Darwinism.
      12200 Updated Feb 17, 2026
    • daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently
      Python
      MIT License
      33710 Updated Feb 4, 2026
    • [ICLR 2026] InnovatorBench: Evaluating Agents' Ability to Conduct Innovative LLM Research
      Jupyter Notebook
      Apache License 2.0
      11700 Updated Feb 3, 2026
    • MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
      Python
      Apache License 2.0
      711800 Updated Feb 2, 2026
    • Go
      Apache License 2.0
      65311 Updated Jan 31, 2026
    • [ICLR 2026] SR-Scientist: Scientific Equation Discovery With Agentic AI
      Python
      03700 Updated Jan 27, 2026
    • [ACL 2026 Main] AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts
      Python
      MIT License
      37350 Updated Jan 23, 2026
    • Python
      0100 Updated Jan 20, 2026
    • ResearcherBench: Evaluating Deep AI Research Systems on the Frontiers of Scientific Inquiry
      Python
      44821 Updated Jan 5, 2026
    • LiveTalk

      Public
      Python
      Other
      2328090 Updated Jan 2, 2026
    • ASI-Arch

      Public
      AlphaGo Moment for Model Architecture Discovery.
      Python
      Apache License 2.0
      2111.1k90 Updated Dec 3, 2025
    • 2328310 Updated Nov 6, 2025
    • Scaling Deep Research via Reinforcement Learning in Real-world Environments.
      Python
      Apache License 2.0
      4972790 Updated Oct 15, 2025
    • LIMI

      Public
      LIMI: Less is More for Agency
      Python
      716160 Updated Oct 14, 2025
    • [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
      Python
      47820 Updated Oct 9, 2025
    • DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery
      Python
      Apache License 2.0
      02000 Updated Sep 24, 2025
    • Python
      MIT License
      1300 Updated Sep 9, 2025
    • [ICLR 2026] Efficient Agent Training for Computer Use
      Python
      MIT License
      814000 Updated Sep 5, 2025
    • LIMO

      Public
      [COLM 2025] LIMO: Less is More for Reasoning
      Python
      551.1k60 Updated Jul 30, 2025
    • ASI4AI

      Public
      JavaScript
      1700 Updated Jul 23, 2025
    • Reproducible and flexible LLM evaluations for scientific reasoning.
      Python
      Apache License 2.0
      22810 Updated Jul 23, 2025