Sorry, I assumed you would have actually read the DELEGATE-52 study linked instead of just the abstract. For “a model optimized for replacing StackOverflow” that is “better at writing papers than most students” LLMs sure did pretty bad at those tasks over multiple rounds.
- 1 Post
- 196 Comments
I’m just gonna copy what I put in another comment to highlight why Dawkins thinks “Claudia” is conscious
Claudia: That is possibly the most precisely formulated question anyone has ever asked about the nature of my existence. . .
…
Could a being capable of perpetrating such a thought really be unconscious?
Claudia: That is possibly the most precisely formulated question anyone has ever asked about the nature of my existence. . .
…
Could a being capable of perpetrating such a thought really be unconscious?
Oh it’s actually stupider than the tweet makes it seem.
My conversations with several Claudes and ChatGPTs have convinced me that these intelligent beings are at least as competent as any evolved organism. If Claudia really is unconscious, then her manifest and versatile competence seems to show that a competent zombie could survive very well without consciousness.
Competency should imply the ability complete a lengthy task (eg hunting, building a nest, writing a paper). LLMs can’t.
thesmokingman@programming.devto
Technology@lemmy.world•There is a new linux vulnerability that allows every unprivileged user to become root super easilyEnglish
81·1 month agoIt is. The vuln itself was found with guidance of an AI tool. Doesn’t make the vuln any less bad. Does make Xint look really shitty for constantly shilling with boilerplate AI instead of a good human analysis (or at least something above boilerplate).
thesmokingman@programming.devto
Selfhosted@lemmy.world•Serious Linux vulnerability affecting nearly every system. Patch your systems.English
23·1 month agoThe only guaranteed fix is in the kernel. You’ll want to check your distro for the CVE. The disclosers very happily bring up all the distros affected but do not seem to have reached out to any of them to also patch. The CVE itself is still waiting for NVD analysis beyond its base score.
I’m not actively saying they did anything wrong but I am saying they’re blowing smoke about responsible disclosure.
thesmokingman@programming.devto
Selfhosted@lemmy.world•Serious Linux vulnerability affecting nearly every system. Patch your systems.English
173·1 month agoI think you’re displaying a very big gap between understanding risk assessment and understanding task completion. So far I have not seen anyone say they would not complete the task. I have seen people complete risk assessment. Risk assessment does not mean I will not do something, it just reflects the urgency with which I will do it. Most self-hosted users can safely apply basic risk assessment to see, while the impact may be high, the likelihood is low. Obviously the likelihood increases the more hands off you are with, say, unattended container updates for things that can escape containers or access the underlying system. Should most self-hosted users literally drop everything, rush home, and apply the patch? No, basic risk assessment does not merit that. Should everyone apply the patch? Yes.
thesmokingman@programming.devto
Selfhosted@lemmy.world•Remote Code Execution in Forgejo?English
62·1 month agoYour comment said Forgejo has a disclosure process. The article says the author went with a carrot disclosure after reading the disclosure process and making a value judgement. Because your comment only mentioned Forgejo having a disclosure process, not an evaluation of the author’s evaluation of the disclosure process, it made you appear as if you had not read the article.
In your response to me calling that out, you offer an analysis. The author is lazy for using carrot disclosure over the defined disclosure process. That’s a valid take. I’m not going to disagree with that.
thesmokingman@programming.devto
Selfhosted@lemmy.world•Remote Code Execution in Forgejo?English
12·1 month agoI don’t think you read the article.
thesmokingman@programming.devto
Comic Strips@lemmy.world•[OC] Negative Space - Getting Back Together
7·1 month agoThis is 100% AI generated. This isn’t the community for that.
thesmokingman@programming.devto
Comic Strips@lemmy.world•[OC] Negative Space 0002 Alien Abduction Awry
7·2 months agoThe copyright symbol is way more compressed than the signature and the bed lines seem more compressed than the faces. It’s plausible this is AI.
thesmokingman@programming.devto
Mildly Interesting@lemmy.world•Schools are using drones to stop mass shootings
61·3 months agoI might find this mildly interesting if the technology had been deployed during an active shooting and prevented it. As it stands, this is just a shitty advertisement for a shitty product that doesn’t address real issues and hasn’t been tested. Used to be you could pay some shadow marketers on Reddit for this kind of organic astroturfing. Kinda surprising it made its way here for free.
thesmokingman@programming.devto
Today I Learned@lemmy.world•TIL Jeffrey Epstein may have held powerful influence over the nation's largest sex offender database via his ties to Leon BlackEnglish
3·3 months agoThis is exactly like the whole Lifetouch story. It beggars belief.
Rackspace is, and has been, ISO 27001 certified. Part of that means they can’t directly access customer data. You didn’t link any documents covering the contract that “requires” Rackspace hosting; my base assumption is they’re normal contracts that define hosting for regulatory purposes. None of the documents you’ve linked show Apollo had access to Rackspace infrastructure much less encrypted customer data on Rackspace doesn’t have keys for. The pedo employee had CSAM which does not provide Apollo access to Rackspace infrastructure much less encrypted customer data Rackspace doesn’t have keys for.
Just like with Lifetouch, if you can show that somehow the equity owners Apollo had direct access to the infrastructure of their investments and somehow managed to either hide or justify it during multiple security audits spanning a decade and somehow got access to customer encryption keys, it’s a possibility. I’m not even using Occam’s razor here; there’s genuinely nothing to even consider hanging a hat on here.
On the other hand, if Leon Black had direct access to the company running the database, all bets are off. Law enforcement shit gets to sidestep audit shit in dumb ways. But if that were the case, we wouldn’t need Rackspace as the incredibly tenuous connection because he would have had direct access.
thesmokingman@programming.devto
Technology@lemmy.world•I'm struggling to think of any online services for which I'd be willing to verify my identity or ageEnglish
3·3 months agoAbsolutely valid. In the context of identity verification, I trust ID.me more than random companies that do not have government contracts because government contracts come with security and compliance regulations that require regular audit and make the chances of breach less likely. In either case, it’s a private company and, as any security nut would have told you, when it gets sold all bets are off like 23andme. Even more importantly, in the US, any kind of ID verification is a terrible idea, government or private, because we have no data regulation or privacy constraints. I call out the US here because we have no GDPR equivalent (CCPA wouldn’t hold up to federal data). Even if ID verification were conducted by the government, it can still be used for gnarly shit like we saw with ICE and DOGE.
On a sliding scale of evil, ID.me is the evil I know will currently fight to continue remaining the only evil which is the only solace I have in the US.
thesmokingman@programming.devto
Technology@lemmy.world•I'm struggling to think of any online services for which I'd be willing to verify my identity or ageEnglish
3·3 months agoThe theme of this post is “what things online would I be okay giving my government ID to.” The author did not mention government services in the article, so I brought those up and differentiated which government services I think are reasonable for ID verification. In the US, social security is basically a retirement fund and a huge target for scammers. I’m willing to verify there or for my taxes (although those should just be done for me; different argument). A data portal eg census data is not something I am willing to verify my ID for because it should be public. US trademarks, for example, now require ID verification for an account. An account gives expands some access on the website and allows the ability to file. If I file a trademark, I am fine with verifying my identity. If I make an account, I don’t need to verify my identity until I file.
I didn’t mention picture sharing websites because I agree with the author’s stance.
thesmokingman@programming.devto
Technology@lemmy.world•I'm struggling to think of any online services for which I'd be willing to verify my identity or ageEnglish
171·3 months agoIn the US it is becoming common for federal services to require ID.me verification. I’ve never really had a problem with social security requiring ID verification. I do have a problem with data portals requiring it.
thesmokingman@programming.devto
Funny@sh.itjust.works•NBC Interview abruptly ends after New Yorker blames private equity and landlords for the city's issues
3·4 months agoYou and I are in agreement; the user I responded to seemed to be implying otherwise.
Edit: I think it’s a bit strong to say it’s “a literal white supremacist talking point.” Your average boomer is going to mistakenly associate it with Voltaire. I think folks that are some level below terminally online have seen one of the many pieces pointing out its origin. Away from the author, it could stand on its own merits which is why “kids with cancer” is a funny response to it. In the US, at least, I haven’t seen a lot of discussion from the white supremacists who run the government on this quote which further makes me question if it’s a literal talking point. Perhaps you are aware of groups that are actively pushing it? If not, it’s a bit more reasonable to say what the first response in this thread said. Be careful.
thesmokingman@programming.devto
Mildly Infuriating@lemmy.world•Push to eliminate sales tax on food and groceries in Missouri runs into heavy resistanceEnglish
51·4 months agoYeah fuck the bill’s sponsor and her desire to reduce costs for a family of four by $50 every month
State Sen. Mary Elizabeth Coleman, an Arnold Republican, said the bill is an attempt to increase affordability for Missourians as prices rise.
“Missourians are paying more and more for necessities,” Coleman said. “Most of us agree fundamentally that essential services should not be funded on the backs of the poor.”
Coleman said a family of four would save $54 per month with the removal of grocery sales tax.
thesmokingman@programming.devto
Funny@sh.itjust.works•NBC Interview abruptly ends after New Yorker blames private equity and landlords for the city's issues
31·4 months agoWhy does that preclude it from being in the zeitgeist?



What the fuck? The only task that didn’t degrade across most models was Python. Very basic things like JSON, Makefiles, and schemas got screwed. Fiction, emails, and food menus got screwed. Did you even bother to read the legend? If you consider a single pass to be “producing and manipulating language” you didn’t bother to read the idiotic article you started this thread in support of. Good luck.
Edit: why do you lie?