arxiv:2605.15726
Chanuk Lee
tally0818
AI & ML interests
LLM post-training
Recent Activity
upvoted a paper about 4 hours ago
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients upvoted a paper 6 days ago
On the Geometry of On-Policy Distillation upvoted a paper 6 days ago
SpatialClaw: Rethinking Action Interface for Agentic Spatial ReasoningOrganizations
None yet