labml.ai
@labmlai
Annotated paper implementations nn.labml.ai
ID: 1338393738511589377
http://labml.ai 14-12-2020 08:02:04
648 Tweets
12.12K Followers
9 Following
We are excited to announce Notbad v1.0 Mistral 24B, a new reasoning model trained in math and Python coding. This model is built upon the Mistral AI Small 24B 2501 model and has been further trained with reinforcement learning on math and coding.
We just released a Python coding reasoning dataset with 200k samples on Hugging Face. It was generated by our RL-based self-improved Mistral 24B 2501 model and was used to train Notbad v1.0 Mistral 24B. Links in replies.