AlloSpatial: Agentic Harness Framework for Spatial Reasoning in Foundation Models Paper • 2606.08952 • Published 8 days ago • 1
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment Paper • 2606.10747 • Published 7 days ago • 7
Hy-Embodied-0.5-VLA: From Vision-Language-Action Models to a Real-World Robot Learning Stack Paper • 2606.14409 • Published 4 days ago • 10
Skip a Layer or Loop It? Learning Program-of-Layers in LLMs Paper • 2606.06574 • Published 12 days ago • 17
HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry Paper • 2606.14249 • Published 4 days ago • 35
ArogyaSutra: A Multi-Agent Framework for Multimodal Medical Reasoning in Indic Languages Paper • 2606.13572 • Published 5 days ago • 2
HarnessBridge: Learnable Bidirectional Controller for LLM Agent Harness Paper • 2606.12882 • Published 5 days ago • 11
MoVerse: Real-Time Video World Modeling with Panoramic Gaussian Scaffold Paper • 2606.13376 • Published 5 days ago • 11
N-GRPO: Embedding-Level Neighbor Mixing for Enhanced Policy Optimization Paper • 2606.10768 • Published 7 days ago • 24
HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers Paper • 2606.13289 • Published 5 days ago • 28
LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories Paper • 2606.13578 • Published 5 days ago • 53
Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding? Paper • 2606.08063 • Published 10 days ago • 75
SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning Paper • 2606.13673 • Published 5 days ago • 95
EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments Paper • 2606.13681 • Published 5 days ago • 132
DRIFT: A Residual Flow Adapter for Decoding Continuous Outputs in Vision-Language Models Paper • 2606.05758 • Published 12 days ago • 5
Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models Paper • 2606.11324 • Published 7 days ago • 11