I burned down a forest to confirm
Don’t ask it to name an NFL team that doesn’t end with ‘s’
DeepSeek eventually gets it, but its DeepThink takes a good ten minutes of racing ‘thoughts’ and loops to figure it out.
It sucks at other things too. Counting errors are just really easy to objectively verify.
People like Altman claim they can use LLMs to create formal proofs, advance our knowledge of physics and shit. Fat chance when they can’t even compete with a toddler at counting.