and none of this accounts for training energy, pretraining/data synthesis/augmentation energy, energy consumed by human labor curating the data, and then multiplied by a number > 1 because all of those efforts failed in multiple ways at various times and had to be repeated
and none of this accounts for training energy, pretraining/data synthesis/augmentation energy, energy consumed by human labor curating the data, and then multiplied by a number > 1 because all of those efforts failed in multiple ways at various times and had to be repeated