
Complete Accuracy Collapse
A new paper from Apple proves what I've discussed before – that so-called "reasoning" models aren't doing much reasoning at all – but now that is clear even by the bizarre industry definitions of the term. The researchers created a series of puzzles with similar problem-solving