Jia Li (@jiali52524397)'s Twitter Profile
Jia Li

@jiali52524397

Co-founder at Numina; former AI scientist at Mistral AI; Co-founder & ex-CSO at Cardiologs

ID: 1513456178835173376

Joined: 11-04-2022 09:57:47

58 Tweets

1.1K Followers

170 Following

Guillaume Lample @ ICLR 2024 (@guillaumelample)

Today, we are releasing Mistral Large, our latest model. Mistral Large is vastly superior to Mistral Medium, handles 32k tokens of context, and is natively fluent in English, French, Spanish, German, and Italian.

We have also updated Mistral Small on our API to a model that is
Leandro von Werra (@lvwerra)

Super proud of what the BigCode community achieved. Building the best in class code LLMs in an open and collaborative way is no easy feat and is the result of the hard work of many community members!

Ross Wightman (@wightmanr)

timm 1.0.3 is pushed out the door and almost at 30k🌟 It's been stable for ages, why 1.0 now? I wanted one long promised feature in -- unified feature map extraction (features_only=True) for almost all models, incl ViT 🎉 Accomplished via new API. github.com/huggingface/py…

Mistral AI Labs (@mistralailabs)

Announcing Codestral: our first-ever code model.
- Open-weights under the new Mistral AI Non-Production License
- New endpoint via La Plateforme: codestral.mistral.ai
- Try it now on Le Chat: chat.mistral.ai

mistral.ai/news/codestral/

DeepSeek (@deepseek_ai)

📢 After 3 months, the AI Mathematical Olympiad (AIMO) on Kaggle has announced the winners! 🎉

We're thrilled to see the Top 4 teams all chose DeepSeekMath-7B as their base model, with Numina Jia Li achieving 29/50 correct answers! 👏 Even Terence Tao was amazed. 🤯
Lewis Tunstall (@_lewtun)

Introducing NuminaMath-7B-TIR, the small but mighty model that won the first progress prize of the AI Math Olympiad 🥇!

> Fine-tuned with iterative SFT on DeepSeekMath-7B from DeepSeek
> Stage 1: learn math with chain of thought samples
> Stage 2: learn code with
Jia Li (@jiali52524397)

Proud to officially introduce Numina to the world! projectnumina.ai
We also released a blog post about our approach with some details of our model: huggingface.co/blog/winning-a…
This makes NuminaMath 7B one of the best 7B models on math reasoning!
Guillaume Lample @ ICLR 2024 (@guillaumelample)

Today we are releasing two small models: Mathstral 7B and Codestral Mamba 7B.

On the MATH benchmark, Mathstral 7B obtains 56.6% pass@1, outperforming Minerva 540B by more than 20%. Mathstral scores 68.4% on MATH with majority voting@64, and 74.6% using a reward model.

Codestral
Mistral AI (@mistralai)

magnet:?xt=urn:btih:7278e625de2b1da598b23954c13933047126238a&dn=pixtral-12b-240910&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce&tr=udp%3A%2F%2Fopen.demonii.com%3A1337%2Fannounce&tr=http%3A%2F%2Ftracker.ipv6tracker.org%3A80%2Fannounce