Skip to content
@thu-rllab

thu-rllab

Popular repositories Loading

  1. MPTS MPTS Public

    Model Predictive Task Sampling

    Python 55 16

  2. MoPPS MoPPS Public

    [KDD 2026] Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models?

    Python 44 15

  3. PDTS PDTS Public

    ICML2025 accepted paper: Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments

    Python 42 12

  4. CFCQL CFCQL Public

    Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.

    Python 40 8

  5. LESR LESR Public

    LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)

    Python 36 4

  6. LaRe LaRe Public

    Code for AAAI-25 accepted paper: Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning.

    Python 23 4

Repositories

Showing 10 of 21 repositories

Top languages

Loading…

Most used topics

Loading…