Oh yeah this whole thing is very very much convenient to Nvidia and data center companies lmao.
I still think they’re incentivized to make the models more efficient as they could then squeeze out even more profit, it’s just that it’s a property of the technology itself that it doesn’t really work well until you have bajillions of parameters.
Oh yeah this whole thing is very very much convenient to Nvidia and data center companies lmao.
I still think they’re incentivized to make the models more efficient as they could then squeeze out even more profit, it’s just that it’s a property of the technology itself that it doesn’t really work well until you have bajillions of parameters.