also dead internet is probably true, oh well

IO 😇@lemmy.blahaj.zone · 2 months ago

also dead internet is probably true, oh well

skisnow@lemmy.ca · 2 months ago

Of course. What the paper is suggesting is that during training and evaluation you should reward correct answers, punish wrong answers, and treat abstentions as somewhere in between. Current benchmarks punish abstentions and wrong answers equally, therefore models that guess instead of abstaining score higher on average.