151
Apple AI Released a 7B Open-Source Language Model Trained on 2.5T Tokens on Open Datasets.
(www.marktechpost.com)
This is a most excellent place for technology news and articles.
They managed a substantial incremental improvement over previous models by first creating a better set of data as their starting point.
https://huggingface.co/apple/DCLM-7B