New MIT research shows that 7B-parameter models fine-tuned with reinforcement learning can match frontier-class reasoning on structured tasks.

The finding has significant implications for on-device AI and enterprise cost structures.