• peoplebeproblems@midwest.social
    7 months ago

    The core language model isn’t a neural network? I agree that the full application is more Markov chain-y, but I had no idea the LLM itself wasn’t.

    Now I’m wondering if there are any models that are actual neural networks.

    • kn0wmad1c@programming.dev
      7 months ago

      I’m not an expert. I’d just expect a neural network to follow the core principle of self-improvement. GPT is fundamentally unable to do this. The way it “learns” is closer to the tech behind the predictive text on your phone.

      It’s the reason why it can’t understand why telling you to put glue on pizza is a bad idea.
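
      For anyone unfamiliar with the predictive-text comparison, here is a minimal sketch of phone-style word suggestion as a word-pair (bigram) Markov chain. The corpus and the function names `build_bigram_table` and `suggest` are made up for illustration. It only shows the kind of system being compared to and is not a description of how GPT works internally.

```python
from collections import Counter, defaultdict


def build_bigram_table(text):
    """Count which word follows which (hypothetical helper for illustration)."""
    table = defaultdict(Counter)
    words = text.lower().split()
    for current, nxt in zip(words, words[1:]):
        table[current][nxt] += 1
    return table


def suggest(table, word):
    """Return the most frequent follower of `word`, or None if it was never seen."""
    followers = table.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]


# Toy corpus, invented for this example.
corpus = "put cheese on pizza . put sauce on pizza . put glue on paper"
table = build_bigram_table(corpus)

print(suggest(table, "on"))    # the word seen most often after "on"
print(suggest(table, "glue"))  # only ever followed by "on" in this corpus
```

      The table only records which word tends to follow which. It has no notion of whether the resulting sentence is a good idea.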

      • lime!@feddit.nu
        7 months ago

        the main thing is that the system end-users interact with is static. it’s a snapshot of all the weights of the “neurons” at a particular point in the training process. you can keep training from that snapshot for every conversation, but nobody does that live because the result wouldn’t be useful. it needs to be cleaned up first. so it learns nothing from you, but it could.
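
        a rough sketch of the snapshot idea, assuming a toy two-weight model in place of a real network. `Snapshot`, `predict` and `train_step` are hypothetical names, not anyone's actual API. the point is only that serving reads the weights, while training from the snapshot produces a separate, new set of weights.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Snapshot:
    """Hypothetical frozen set of weights, standing in for a released model."""
    w: float
    b: float


def predict(model: Snapshot, x: float) -> float:
    # Inference only reads the weights; nothing is written back.
    return model.w * x + model.b


def train_step(model: Snapshot, x: float, target: float, lr: float = 0.1) -> Snapshot:
    # One gradient-descent step on squared error, returned as a NEW snapshot.
    error = predict(model, x) - target
    return Snapshot(w=model.w - lr * 2 * error * x,
                    b=model.b - lr * 2 * error)


released = Snapshot(w=0.5, b=0.0)

before = predict(released, 2.0)
for _ in range(3):
    predict(released, 2.0)  # "conversations" with the served model
after = predict(released, 2.0)

# Training can continue from the snapshot, but the result is a different model.
fine_tuned = train_step(released, x=2.0, target=3.0)

print(before == after)         # serving did not change the released weights
print(fine_tuned != released)  # training produced different weights
```

        in this sketch the released snapshot answers the same way no matter how often it is queried, and only the explicit `train_step` call yields changed weights.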

      • frezik@midwest.social
        7 months ago

        “Improvement” is an open-ended term. Would having longer or shorter toes be beneficial? Depends on the evolutionary environment.

        ChatGPT does have a feedback loop. Every prompt you give it affects its internal state. That’s why it won’t give you the same response next time you give the same prompt. Will it be better or worse? Depends on what you want.