thu-rllab
Popular repositories Loading
Repositories
Showing 10 of 21 repositories
- MoPPS Public
[KDD 2026] Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models?
thu-rllab/MoPPS’s past year of commit activity - ANQ Public
thu-rllab/ANQ’s past year of commit activity - PDTS_project_page Public
thu-rllab/PDTS_project_page’s past year of commit activity - PDTS Public
ICML2025 accepted paper: Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
thu-rllab/PDTS’s past year of commit activity - LaRe Public
Code for AAAI-25 accepted paper: Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning.
thu-rllab/LaRe’s past year of commit activity - CFCQL Public
Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.
thu-rllab/CFCQL’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…