This collection contains curriculum-RLed Olmo models.
SeanWang0027 PRO
SeanWang0027
AI & ML interests
LLM Post-Training
Recent Activity
published a dataset about 8 hours ago
CL-From-Nothing/rlve_offline_20K_POPE_prefix updated a dataset about 8 hours ago
CL-From-Nothing/rlve_offline_20K_POPE_prefix updated a model 1 day ago
SeanWang0027/token_reward_direct