If we know anything of machine learning in 2023, it is this: bigger is better. Give your model more data, parameters, and compute and success is (somewhat) guaranteed (Hoffmann et al., 2022).
How to Accurately Time CUDA Kernels in Pytorch
· 8 min read