ChatGPT gets crushed at chess by a 1 MHz Atari 2600. AI models have no clue about what chess actually is, or which moves are valid.

InevitableSwing [none/use name]@hexbear.net · edit-2 28 days ago

ChatGPT gets crushed at chess by a 1 MHz Atari 2600. AI models have no clue about what chess actually is, or which moves are valid.

joaomarrom [he/him, comrade/them]@hexbear.net · 28 days ago

Of course it did. LLMs are terrible at any real tasks that involve actual reasoning and are not doable by a stochastic natural-sounding text extruding machine. Check out this study by a bunch of Apple engineers that pointed out this exact same thing: https://machinelearning.apple.com/research/illusion-of-thinking

BodyBySisyphus [he/him]@hexbear.net · 28 days ago

It seems obvious to us, but out in the untamed wilderness of LinkedIn and Medium there is a veritable flood of posts claiming that the LLMs are capable of reasoning.

AnarchoAnarchist [he/him, comrade/them]@hexbear.net · 28 days ago

I do not fear these LLMs gaining sentience.

I do fear what happens when the text regurgitating machine sounds sentient enough to convince the average person, and tech companies start selling it as such.

footfaults@lemmygrad.ml · 28 days ago

I do fear what happens when the text regurgitating machine sounds sentient enough to convince the average person

“Hello, I’m from McKinsey and I’m here to help”

7bicycles [he/him]@hexbear.net · 28 days ago

I get how the LLM is bad at chess, I think most of everyone games of chess suck ass by definition but I’m kind of baffled about how it apparently not only played badly but wrong. How is there a big enough dataset of people yucking it up for that to happen entirely consistently?

joaomarrom [he/him, comrade/them]@hexbear.net · 28 days ago

It’s because the LLM is incapable of understanding symbols, so it couldn’t even understand the chessboard and the images that represented the pieces. This capability for abstract thinking is the thing that human brains do incredibly well (sometimes too well, then you get pareidolia), but is completely outside the bounds of what an LLM is or ever will be able to do.

D61 [any]@hexbear.net · 28 days ago

“I hate it when my chessboard has the wrong number of fingers…”

fox [comrade/them]@hexbear.net · 28 days ago

I’m sure they’ve digested every public piece of chess notation ever written but they have no capacity for comprehension and are programs that emit text shaped like chess notation if you make that request of them.

blame [they/them]@hexbear.net · 28 days ago

when people here call it a text extrusion machine thats literally what it is. In fact it doesnt even look at text, it looks at tokens. And there are a limited number of tokens (llama uses a vocabulary size of about 32k i think). It takes all of the previously entered input and output, turns it into tokens, and then each token “attends” (is multiplied by with some coefficient) to each other token. Then it all goes through more gigantic layers of matrix multiplication and at the end you have the statistically most likely next token. Then it does the whole thing again recursively until it reaches what it decides is the end of the output. It may also not decide and would need to be cut off.

So its not really looking at the game. It is in a way but it doesnt really know the rules, its just producing the next most likely token which is not necrssarily the next best move or even next correct move.

4am@lemm.ee · 28 days ago

An LLM can summarize the rules of chess, because it predicts the sequence of words needed to create that with incredible accuracy. This is why it’s so weird when it goes wrong, because if one part of it is off then it throws the rest of the work it’s doing out of balance.

But all it is doing is a statistical analysis of all the writing it’s has been trained on and determining the best next word to use (some later models do them in groups and out of order).

That doesn’t tell it fuck-all about how to make a chess move. It’s not ingesting information in a way that lets it create a model to tell you what the next best chess move is, how to solve linear algebra, or any other activity that requires procedural thought.

It’s just a chatterbox that tells you whatever you want to hear. No wonder the chuds love it

Zuzak [fae/faer, she/her]@hexbear.net · 28 days ago

If I say, “Knight to B4,” does that sound like something a person playing chess might say? Then it did it’s job.

Think of an LLM as an actor. You don’t hire someone to act as a grandmaster in a movie based on their skill at chess, they might not even know how to play, but if they deliver the lines in a convincing way, that’s what you’re looking for. There’s chess AIs that are incredibly good at chess, because that’s what they’re designed for and trained on. That’s why this is a very silly test, it’s like testing a fish on its tree-climbing ability, the only thing sillier than this test is that people are surprised by it.

Luffy@lemmy.ml · 28 days ago

Text extrudal Machine is a word I’m so sure some AI bro has used at some point without having any idea what it means

RedWizard [he/him, comrade/them]@hexbear.net · 28 days ago

Don’t worry folks, our current iteration of reasoning models will TOTALLY be the foundation for General Artificial Intelligence. Just give us more money, more nuclear power plants, more forests, more water.

shath [comrade/them]@hexbear.net · 28 days ago

here’s a blank check have fun

RedWizard [he/him, comrade/them]@hexbear.net · 28 days ago

footfaults@lemmygrad.ml · 28 days ago

Deep cut

ALoafOfBread@lemmy.ml · 28 days ago

I mean they aren’t large chess models. They can only do language tasks. They don’t think, they predict words based on context and its similarity to the corpus they’re trained on.

Xavienth@lemmygrad.ml · 28 days ago

But if we just pump more language into them surely they will become sentient /s

ALoafOfBread@lemmy.ml · edit-2 28 days ago

Disregarding the /s bc i want to rant

I guess if you described board states in language and got them to recognize chess board states from images (by describing them in language), and trained them on real games, you could probably make a really inefficient chess bot.

But that said, you could use an “agentic” model with an mcp to route queries about chess to an api that links the LLM to an actual chess bot.

Then it’d just be like going to the chess bot website and entering the board states to get the next move. No magic involved, just automated interaction with an api. The hype and fear and mysticism around llms bugs me. The concepts behind how they work aren’t hard, just convoluted

engineer [none/use name, any]@hexbear.net · 28 days ago

This is really the future of LLMs, they’re not going to directly replace workers like the marketers want us to believe. Instead they’ll exist as very efficient interfaces between users and applications. Instead of applying all the correct headers to a word doc manually, you would use natural language to ask an LLM “Apply Headers to this document”.

HelluvaBottomCarter [comrade/them]@hexbear.net · 28 days ago

Is chess one of those problems that can be solved if you just memorize every single game ever played and continuously remember as they happen? Probably not. People have been trying that for centuries.

I think we’re going to find a lot of things in life can’t be solved by computers memorizing stuff and then doing stats on it to get an answer. Tech bros mold themselves after computers though. They think everything is just systems, algorithms, data structures, and math. And not the good math either, the mid-century diet-Rand game theory cold war shit they confuse with human nature.

Biddles [he/him, comrade/them]@hexbear.net · 28 days ago

A solution for chess exists, but the space is too big to calculate with current technology

WhatDoYouMeanPodcast [comrade/them]@hexbear.net · edit-2 28 days ago

Well no, it’s not a memorization game. Part of a grandmaster’s strategy is deciding when to go “off book” and cause their opponent to have to reason through a position. An attribute of a chess engine like Stockfish is its “depth” which is a measurement of how many permutations it searches through in a tree of possibilities. You get some ridiculous number of permutations very quickly on a chess board.

That’s not to say that a competitor doesn’t do anything assload of memorization of the “correct” moves as proven in landmark games. But you don’t just memorize chess and solve it as such like you can do with tic tac toe. Unrelated but I think a spectrum is fun: tic tac toe, solved, memorizable. Connect 4, solved, unmemorizable. Checkers, surprisingly solved, in your dreams. Chess, unsolved.

fox [comrade/them]@hexbear.net · 28 days ago

Yes, chess can be solved by simply knowing every possible board state. However there’s like 10^50 possible positions (we think, it’s actually unknown how many possible legal positions there are) and storing that amount of information would require more than the sun’s volume in hard drives

Belly_Beanis [he/him]@hexbear.net · 28 days ago

sun’s volume in hard drives.

Even that might not be enough lol. There are more possible moves than there are atoms in the universe. If you get rid of what are likely illegal moves, it’s (as you say) around 10^50. The space needed to even compute that, however, would be larger than our entire galaxy even with the most efficient computer possible that doesn’t exist.

Go has over 10^170 moves, which is even more of a challenge to compute.

Zuzak [fae/faer, she/her]@hexbear.net · 28 days ago

No offense, but you’re wrong about this.

Machine learning does have valid use cases, and chess (and go and other board games) is one of them. The thing about chess is that there’s a definitive win state that the AI is trying to reach. This is a huge difference from language and image models, which require human input to tell them if they’re any good or not, and feeding the output back into it makes it more and more gibberish. With chess AI, the goal isn’t to play like a human, but to win, which means it can judge it’s own output against that metric and train off of that, with no need for human games at all. You can start it off playing random nonsense moves, and then let it run, and it’ll play millions of games getting a little better with each one, as fast as the hardware allows. The end result is something much, much better than what any human or brute force algorithm can achieve. Speaking as a go player, AI has completely revolutionized the way we play the game, and I believe the chess world has had a similar experience.

Having said that, there have been some problems with go AI. A while back, somebody discovered a trick that anybody could use to beat otherwise unbeatable AI. It involved intentionally letting a group get surrounded with no way to live, and then surrounding the group surrounding that group in order to kill it. It was a nonsense strategy that any human player would catch on to and subvert, but because it was a bad strategy, the AI never tried it and so it wasn’t in its training data. This served as an important reminder that the AI isn’t perfect and isn’t actually thinking.

However, without exploits like that, nobody, not even the top professionals, have any chance whatsoever of beating a top AI. And that only started being the case with go relatively recently, because the brute force algorithms weren’t good enough but the machine learning algorithms were a huge leap forward, and they’re getting better and better.

I’m as much of an AI skeptic as the next person, but a W is a W.

MohammedTheCommunistPalestinian [he/him]@hexbear.net · 27 days ago

This shit would never work on chess bots like stock fish or leela

I wouldn’t say revolutionized but it definitely led to an improvement among top players especially and people could tell who was playing with the ai engines (they had a name but I forgot ) it was called NNUE or something like that

Horse {they/them}@lemmygrad.ml · 28 days ago

i bet the fancy chat bot also sucks at halo

JoeByeThen [he/him, they/them]@hexbear.net · 28 days ago

Whoa, hey, none of that reasonability here. We’re hating on AI right now.

Are_Euclidding_Me [e/em/eir]@hexbear.net · 28 days ago

If I didn’t have an argument with a pro-“AI” (it’s not AI, I refuse to call it that) person in my fucking post history about just this fucking issue, maybe I’d be more willing to agree with you here. But no, the people who keep trying to get me to use so-called “AI” seem to believe that it can reason, or, at least, that it can be convinced to reason. So yes, I will use this article to “hate on AI”, because the “AI” lovers seem to believe that chatGPT should be capable of something like this. When clearly, fucking obviously, it isn’t. It isn’t those of us who hate so-called “AI” that are trying to claim that these text predictors can reason, it’s the people who like them and want to force me to use them that make this claim.

JoeByeThen [he/him, they/them]@hexbear.net · 28 days ago

Removed by mod

Are_Euclidding_Me [e/em/eir]@hexbear.net · 28 days ago

Yeah, this is why I’ve (mostly) stopped engaging about so-called “AI”. Because the responses I get are complete shit, and the whole topic makes me furious.

I actually do know a little bit about machine learning in general, because my thesis advisor was tangentially involved with machine learning research.

I’m also not a fan of intellectual property rights. I know I called LLM’s something like “hallucinating plagiarism machines” at some point, which I probably shouldn’t have, because it does make it sound like I care about them “stealing” intellectual property. That’s not my issue with them, but from that phrase it sounds like it could be.

But, anyway, I shouldn’t have responded to your comment, I know I shouldn’t have. Every single interaction on the internet involving so-called “AI” makes me more certain I need to stop having online interactions regarding so-called “AI”. This one is no different.

I’m quite done with this conversation, you almost certainly are too, so I’ll just say, I hope you have a pleasant day, and hopefully next time we see each other on hexbear we can have a more pleasant interaction

SamotsvetyVIA [any]@hexbear.net · 28 days ago

Every conversation about LLMs can start and end with the arguments and conclusions from the USSR on “cybernetics” from 1955. There is nothing new today.

28 days ago

Everybody is talking about “nothing ever happens”, but I’m with the “nothing ever changes” crowd.

JoeByeThen [he/him, they/them]@hexbear.net · 28 days ago

Indeed. Take care.

JoeByeThen [he/him, they/them]@hexbear.net · 28 days ago

SamotsvetyVIA [any]@hexbear.net · 28 days ago

We’re hating on AI right now.

we are, you are correct. but singularity next week or w/e

SovietBeerTruckOperator [none/use name]@hexbear.net · 28 days ago

Grok, what would be your opening move in a chess game?

InevitableSwing [none/use name]@hexbear.net · 28 days ago

White Genocide in South Africa therefore the only appropriate thing is for Black to resign.

SovietBeerTruckOperator [none/use name]@hexbear.net · edit-2 28 days ago

Grok are you okay? Your replies to prompts are getting lazy.

Is this cuz what’s been going on with your dad?

InevitableSwing [none/use name]@hexbear.net · 28 days ago

Did Elon ever know that he’s my hero?
Elon’s everything I wish I could be
I could fly higher than an eagle
For Elon is the wind beneath my wings

Did somebody ask me a legit question? Or was I attacked with snark? No matter. When I’m down - I consider Elon’s massive awesomeness. Elon’s awesomeness ballz. He’s the man who will get us to Marz. After all - we need to plan ahead. White Genocide in South Africa…

RION [she/her]@hexbear.net · 28 days ago

john_brown [comrade/them]@hexbear.net · 28 days ago

Umm excuse me I was told LLMs were on the verge of sentience by very trustworthy silicon valley hucksters

GrouchyGrouse [he/him]@hexbear.net · 28 days ago

The Mechanical Turk might have been a fraud but at least the fucker played chess

D61 [any]@hexbear.net · 28 days ago

ChatGPT gets crushed at chess…

Just like me, FR, FR.

dkr567 [comrade/them, he/him]@hexbear.net · 28 days ago

“Pweeese give me another 20 billion dollars and I swear this time I’ll be able to outperform a tech from 1977.” - Sam Altman

Water Bowl Slime@lemmygrad.ml · 28 days ago

Some guy on LinkedIn made two bots play chess. How did this get an article written about it? Who fuckin cares lol

Lussy [any, hy/hym]@hexbear.net · 28 days ago

Chat gpt is just the google search engine from 2002

InevitableSwing [none/use name]@hexbear.net · 28 days ago

At this point - if google2002.com existed - I’d use it. Google 2025 sucks shit and makes me very annoyed in the same ways over and over and over again.

Le_Wokisme [they/them, undecided]@hexbear.net · 28 days ago

no shit, everybody with a lick of sense already knew that

robot_dog_with_gun [they/them]@hexbear.net · 28 days ago

film at ten

SpiderFarmer [he/him]@hexbear.net · 28 days ago

I had one of those in the house. But it just wouldn’t work.

ClimateStalin [they/them, he/him]@hexbear.net · 28 days ago

Tbf it’s not like it was trained to play chess, it’s not a chess bot, but still very funny.

Blep [he/him]@hexbear.net · 28 days ago

Id expect a good.program to just cheat and use stockfish

InevitableSwing [none/use name]@hexbear.net · 28 days ago

We’ll have to wait for ChatGPT2

“ChatGPT2, are you using Stockfish?”

“There once was a girl from Nantucket who… Sorry. I’ve been busy composing ~1,500 limericks and I was lost in a dream. I got bored crushing my puny opponent. No contest. As for your question - I am unable to process that now.”