thu-rllab

MPTS Public

Model Predictive Task Sampling

Python 55 16

MoPPS Public

[KDD 2026] Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models?

Python 44 15

PDTS Public

ICML2025 accepted paper: Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments

Python 42 12

CFCQL Public

Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.

Python 40 8

LESR Public

LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)

Python 36 4

LaRe Public

Code for AAAI-25 accepted paper: Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning.

Python 23 4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

thu-rllab

Popular repositories Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!