Just how cheap are we talking about? The R1 paper claims the model was trained on the equivalent of just $5.6 million rented GPU hours, which is a small fraction of the hundreds of millions ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results