Adaptive Cold-Start Stage: construct multimodal and textual reasoning examples with trace lengths scaled to task difficulty, so the model learns a notion of difficulty awareness. Empirical results ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results