• Phen@lemmy.eco.br
    link
    fedilink
    arrow-up
    100
    ·
    5 个月前

    Another problem I’ve realized today, is the proliferation of data that was originally hallucinated by AI.

    I was discussing an issue on a software with a coworker and he asked an AI for help configure around it. He then sent me “apparently we can try changing this setting to this value”. I told him to first validate if that setting really existed because AI tends to make up things like that when it’s what you would want to hear and running a test would take us 20~30 minutes.

    He found some discussions about that setting not working as people expected. “ok at least it exists then” and we tried it. It didn’t work. I later cloned the source of that software and checked, the setting didn’t exist - ever.

    • alaphic@lemmy.world
      link
      fedilink
      arrow-up
      92
      ·
      5 个月前

      I love that you even specifically said, “Yea, let’s check to make sure that setting exists to begin with.” To which instead of actually fucking checking, they proceed to google more about the setting and use someone else’s ‘discussion’ online of it not working as proof that it does exist, even though they were likely having that discussion because the setting didn’t exist.

      This is also how I can tell this story is 100% true.

      I don’t miss working support at all and am reminded of it like this daily

      • Clent@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        1
        ·
        5 个月前

        The benefit of working with open source code bases or being able to check the source for existing features.

        It’s very common for there to be hidden settings, with open source one can look at the code based but with closed source a search may be one’s only hope.

      • jagged_circle@feddit.nl
        link
        fedilink
        English
        arrow-up
        9
        ·
        5 个月前

        Depends what materials you’re recycling. Glass and plastic both require virgin material, else you’d get garbage out.

        • dutchkimble@lemy.lol
          link
          fedilink
          arrow-up
          16
          ·
          edit-2
          5 个月前

          Paper works up to 6 times at best, if someone is able to track the same batch. But recycling paper uses more energy than using virgin, and if virgin paper sourced from a sustainable place like Canada, then recycled paper is actually worse for the environment because of the energy thing and de-inking water waste. Also the timber is cut for housing actually and only the edges of the logs are used to make chips for making paper. So trees aren’t being cut solely for paper (from sustainable countries). Until we meet again (insert Skeletor running away)

        • The_Decryptor@aussie.zone
          link
          fedilink
          English
          arrow-up
          5
          ·
          5 个月前

          Glass and plastic both require virgin material, else you’d get garbage out.

          Everything I’m reading suggests the problem with glass recycling is contamination, and that once that’s accounted for what’s left over can be infinitely recycled without quality loss.

    • dalekcaan@lemm.ee
      link
      fedilink
      arrow-up
      3
      ·
      5 个月前

      This has always been true, but LLMs have expedited the process by taking the garbage out and sticking it right back into the input.

  • Churbleyimyam@lemm.ee
    link
    fedilink
    arrow-up
    46
    ·
    5 个月前

    It’s very tempting to have schadenfreude about this failure but also disgusting that so much has been invested in it that should have been put to better use.

    It’s just another example of a system whose narrow definition of success is taking human and environmental value and using it to extract more. It’s not aimed at solving worthwhile problems or making things better, which is why people are becoming more miserable and the planet is getting wrecked.

    You could say that it’s the system we live in which is the AI, feeding on itself and becoming more sick.

    • chonglibloodsport@lemmy.world
      link
      fedilink
      arrow-up
      10
      ·
      edit-2
      5 个月前

      The schadenfreude is what we’re here for! We can’t do anything about the waste of investors’ money. They could’ve spent it all on fireworks instead. That probably would’ve been more fun!

      As for the system? I prefer not to think about it. Too much systemic thinking is bad for mental health. Much better to enjoy some schadenfreude and save your serious thinking energy for things you have the power to change, especially where they can make life better for you and those around you.

      • Churbleyimyam@lemm.ee
        link
        fedilink
        arrow-up
        8
        ·
        5 个月前

        I agree with all your points! What I will add though is that what we think of as ‘investors money’ is actually value that has been extracted from the environment and from workers.

    • leadore@lemmy.world
      link
      fedilink
      arrow-up
      6
      ·
      5 个月前

      In my case it’s not so much schadenfreude as just wanting this nightmare era to end as quickly as possible. The sooner this LLM shit dies the sooner we can start to recover and move on, in terms of stopping the senseless waste of water and energy and maybe starting to rebuild some kind of useful internet.

  • Blaster M@lemmy.world
    link
    fedilink
    English
    arrow-up
    40
    ·
    edit-2
    5 个月前

    So, reading this article, it’s not about model collapse, but about RAG - letting the AI model google the question essentially. The problem is, the first 10 pages of google search results are all low effort adfarming slop sites, because of course it is, which is making the answers from the AI worse, as these slop sites often have incorrect or otherwise unproofed articles, which biases the AI to fork out the wrong answer.

    I’m sure the major AI services will try and fix this with some slop site detection routines.

    • frunch@lemmy.world
      link
      fedilink
      arrow-up
      20
      ·
      5 个月前

      I’m sure the major AI services will try and fix this with some slop site detection routines.

      Which will be run by AI 🙃

      • melechric@lemmy.world
        link
        fedilink
        arrow-up
        14
        ·
        5 个月前

        Don’t forget! A lot of the slop on those first few pages of results is AI-generated.

        Ouroboros is a very apt moniker for this phenomena.

        • avattar@lemmy.sdf.org
          link
          fedilink
          arrow-up
          2
          arrow-down
          1
          ·
          5 个月前

          We need a new, stronger name for this. Like shit ouroboros, or shouroboros. Yes, AI eating it’s own shit and then regurgitating it is shouroboros.

    • ℍ𝕂-𝟞𝟝@sopuli.xyz
      link
      fedilink
      English
      arrow-up
      5
      ·
      5 个月前

      some slop site detection routines.

      Why would they? I mean how are their incentives different from that of the search engine operators themselves?

      I can see a future when the internet is degraded to a point where if you try to find out how to peel an apple, you will get back word salad and 25 different porn ads.

    • MrSilkworm@lemmy.ml
      link
      fedilink
      arrow-up
      3
      arrow-down
      1
      ·
      5 个月前

      I’m sure the major AI services will try and fix this with some slop site detection routines.

      No they will not, because this will harm their short term bottom line, which is always, “add short term value for the shareholder”

    • JeremyHuntQW12@lemmy.world
      link
      fedilink
      arrow-up
      2
      ·
      edit-2
      5 个月前

      Yesterday, there was the usual slew of artificial computer-generated news stories on YouTube about GM closing down all factories in north america (happens about once a month).

      Well I typed in “is GM closing down in the US” in Google and the Gemini generated answer said “Yes, GM has announced the closure of all plants in the US” and put up those fake YT videos as reference…

      I’m sure the major AI services will try and fix this with some slop site detection routines.

      They already do this through data determination routines in LLMs, unfortunately they suffer from the same type of infection as the data itself.

      • Echo Dot@feddit.uk
        link
        fedilink
        arrow-up
        1
        ·
        5 个月前

        You probably would get better results from literally any other AI Gemini is routinely the worst. I don’t know what Google are playing at surely they could actually put some real effort into this but they just seem to be doing it in the most naive way possible.

        It comes to something when the Chinese are been the most innovative.

  • _druid@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    33
    ·
    5 个月前

    Aww boo hoo, did someone generate a degenerative feedback loop? Yeah? Did someone make a big ol’ oppsy whoopsy that’s gunna accelerate in hallucinations and slop as it collapses in on itself? How’s the coded version of a microphone whine going to go, you silly buttholes?

    • Wilco@lemm.ee
      link
      fedilink
      English
      arrow-up
      5
      ·
      5 个月前

      People are putting AI generated pitfalls to guard their content.

      They reference nonsense links that usually cannot even be seen by normal users, the AI reads the pages and finds more garbage links even as more are generated by the site.

      • _druid@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        5
        ·
        5 个月前

        It’s just so unfortunate that, in causing AI to delve down these winding paths, to propagate these slopfest feedback loops, the computers that are running the AI are burning real resources, polluting our atmosphere.

        Unfortunate is not the right word to describe the deep lament I feel, to cause such destruction for so little, if any, gain at all. My heart is heavy with regret for us all. Not just you and I, but for beast, bird, plant as well. Such a shame.

  • Rooty@lemmy.world
    link
    fedilink
    arrow-up
    26
    ·
    5 个月前

    Ffs, neural networks and LLMs have their place and can be useful, but setting up datacentres that snort up the entire internet indiscriminately to create a glorified chatbot that spews data that may or may not be correct is insane.

  • BigMacHole@lemm.ee
    link
    fedilink
    arrow-up
    27
    arrow-down
    2
    ·
    5 个月前

    Oh no! I HOPE us Taxpayers can Bail Out these AI Companies when they go Under! AFTER ALL we CUT my Child’s LIFESAVING MEDICATION so I KNOW we have the Funds to Help these Poor Billionaire CEOS!

    • Etterra@discuss.online
      link
      fedilink
      English
      arrow-up
      7
      ·
      5 个月前

      I can’t afford groceries now! I’m sure all those billionaires will help us out now that they’ve got a little but more though.

    • utopiah@lemmy.world
      link
      fedilink
      arrow-up
      3
      ·
      5 个月前

      Help these Poor Billionaire CEOS!

      Right, self-made billionaires for whom the way to success was already paved by subsidies. Yes, those surely need help to “build” absolutely pointless non-working projects that are supposed to “save humanity”. That’s great. /$

  • avattar@lemmy.sdf.org
    link
    fedilink
    arrow-up
    15
    ·
    5 个月前

    There is a solution to this. Make a **perfect ** AI detecting tool. The only way I can think of is through adding a tag to every bit of AI-generated data,

    Though it could easily be removed from text, I guess.And no, training AI to recognize AI will never work. Also every model would have to join this, or it won’t work.

    Related XKCD

    • Etterra@discuss.online
      link
      fedilink
      English
      arrow-up
      10
      ·
      5 个月前

      LOL you’re suggesting people already doing something unbelievably stupid should do something smart to compensate.

    • bthest@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      5 个月前

      Also people won’t be able to pass AI work off as their own if it is labeled as such. Cheating and selling slop is the chief use for AI so any tag or watermark will be removed on the vast majority of stuff.

      There’s also liability. If your AI generates code that’s used to program something important and a lot of people are injured or die, do you really want a tag that can be traceable to back the company to be on the evidence? Or slapped all over the child sex abuse images that their wonderful invention is churning out?

  • GoldenQuetzal@lemmy.world
    link
    fedilink
    arrow-up
    16
    arrow-down
    1
    ·
    5 个月前

    I’ve been predicting this for a while now and people kept telling me I was wrong. Prepare for dot com burst two, electric boogaloo.

    • bthest@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      edit-2
      5 个月前

      I hope it crashes but what if the market completely embraces feels-based economics and just says that incomprehensible AI slop noise is what customers crave? Maybe CEOs will interpret AI gibberish output in much the same way as ancient high priests made calls by sifting through the entrails of sacrificed animals. Tesla meme stock is evidence that you can defy all known laws of economic theory and still just coast by.