Qi Yu

QiLeoYu

13 1

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

Thinking with Visual Grounding

upvoted a paper 11 days ago

Context-Aware RL for Agentic and Multimodal LLMs

updated a model 17 days ago

QiLeoYu/Qwen3-1.7B-Base_W_Linear_GRPO_Math12K

View all activity

Organizations

upvoted 2 papers 11 days ago

Thinking with Visual Grounding

Paper • 2606.16122 • Published 17 days ago • 11

Context-Aware RL for Agentic and Multimodal LLMs

Paper • 2606.17053 • Published 17 days ago • 16

updated a model 17 days ago

QiLeoYu/Qwen3-1.7B-Base_W_Linear_GRPO_Math12K

2B • Updated 17 days ago • 229

published a model 17 days ago

QiLeoYu/Qwen3-1.7B-Base_W_Linear_GRPO_Math12K

2B • Updated 17 days ago • 229

updated a model 19 days ago

QiLeoYu/Qwen3-1.7B-Base_GRPO_Math12K

2B • Updated 19 days ago • 181

published a model 19 days ago

QiLeoYu/Qwen3-1.7B-Base_GRPO_Math12K

2B • Updated 19 days ago • 181

upvoted a paper about 1 month ago

Code as Agent Harness

Paper • 2605.18747 • Published May 18 • 223

upvoted a paper about 2 months ago

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Paper • 2605.10899 • Published May 11 • 79

upvoted a paper 2 months ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 287

liked a dataset 3 months ago

YennNing/MC-Search

Viewer • Updated Feb 22 • 3.33k • 336 • 28

upvoted 3 papers 4 months ago

upvoted 2 papers 5 months ago

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 290

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 207

upvoted a paper 6 months ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 158

upvoted a paper 7 months ago

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published Nov 25, 2025 • 129

updated a model 8 months ago

QiLeoYu/anlp-hw2-outputs

Updated Nov 12, 2025

published a model 8 months ago

QiLeoYu/anlp-hw2-outputs

Updated Nov 12, 2025

updated a model 9 months ago

QiLeoYu/adv-nlp-hw1-qiyu6

Text Classification • 22.7M • Updated Oct 8, 2025 • 4

Qi Yu

AI & ML interests

Recent Activity

Organizations

QiLeoYu's activity