Apple did this by showing that leading models such as ChatGPT, Claude and Deepseek may "look smart – but when complexity rises, they collapse". In short, these models are very good at a kind of pattern recognition, but often fail when they encounter novelty that forces them beyond the limits of their training, despite being, as the paper notes, "explicitly designed for reasoning tasks"...
https://www.theguardian.com/commentisfree/2025/jun/10/billion-dollar-ai-puzzle-break-down

No comments:
Post a Comment