August 02, 2024
Optimizing Mixture Ratios for Continual Pre-training of Commercial Large Language Mod...
Eda Linwood, Tristan Fairchild, James Everly, et al.