Jia Li (@jiali52524397) Twitter Tweets • TwiCopy

Jia Li

@jiali52524397

+ Follow

Co-founder at Numina; former AI scienctist at Mistral AI; Co-founder & ex CSO at Cardiologs

ID: 1513456178835173376

calendar_today11-04-2022 09:57:47

58 Tweet

1,1K Takipçi

170 Takip Edilen

Guillaume Lample @ ICLR 2024

@guillaumelample

8 months ago

Today, we are releasing Mistral Large, our latest model. Mistral Large is vastly superior to Mistral Medium, handles 32k tokens of context, and is natively fluent in English, French, Spanish, German, and Italian. We have also updated Mistral Small on our API to a model that is

thumb_up_off_alt5,5K

chat_bubble_outline175

repeat813

shareShare

Leandro von Werra

@lvwerra

8 months ago

Super proud of what the BigCode community achieved. Building the best in class code LLMs in an open and collaborative way is no easy feat and is the result of the hard work of many community members!

thumb_up_off_alt74

chat_bubble_outline2

repeat13

shareShare

Ross Wightman

@wightmanr

6 months ago

timm 1.0.3 is pushed out the door and almost at 30k🌟 It's been stable for ages, why 1.0 now? I wanted one long promised feature in -- unified feature map extraction (features_only=True) for almost all models, incl ViT 🎉 Accomplished via new API. github.com/huggingface/py…

thumb_up_off_alt233

chat_bubble_outline12

repeat44

shareShare

Mistral AI Labs

@mistralailabs

5 months ago

Announcing Codestral: our first-ever code model. - Open-weights under the new Mistral AI Non-Production License - New endpoint via La Plateforme: codestral.mistral.ai - Try it now on Le Chat: chat.mistral.ai mistral.ai/news/codestral/

thumb_up_off_alt769

chat_bubble_outline18

repeat161

shareShare

Mistral AI Labs

@mistralailabs

4 months ago

Try Mistral AI Codestral in JupyterLit (JupyterAI in the browser) -> jupyterlite.github.io/jupyterlab-cod…

thumb_up_off_alt58

chat_bubble_outline0

repeat8

shareShare

DeepSeek

@deepseek_ai

4 months ago

📢 After 3 months, the AI Mathematical Olympiad (AIMO) on Kaggle has announced the winners! 🎉 We're thrilled to see the Top 4 teams all chose DeepSeekMath-7B as their base model, with Numina Jia Li achieving 29/50 correct answers! 👏 Even Terence Tao was amazed. 🤯

thumb_up_off_alt600

chat_bubble_outline17

repeat92

shareShare

Lewis Tunstall

@_lewtun

4 months ago

Introducing NuminaMath-7B-TIR, the small but mighty model that won the first progress prize of the AI Math Olympiad 🥇! > Fine-tuned with iterative SFT on DeepSeekMath-7B from DeepSeek > Stage 1: learn math with chain of thought samples > Stage 2: learn code with

thumb_up_off_alt432

chat_bubble_outline7

repeat112

shareShare

Jia Li

@jiali52524397

4 months ago

Proud to officially introduce Numina to the world ! projectnumina.ai We also released a blog post about our approach with some details of our model: huggingface.co/blog/winning-a… This makes NuminaMath 7B one of the best 7B models on math reasoning !

thumb_up_off_alt139

chat_bubble_outline7

repeat26

shareShare

Guillaume Lample @ ICLR 2024

@guillaumelample

4 months ago

Today we are releasing two small models: Mathstral 7B and Codestral Mamba 7B. On the MATH benchmark, Mathstral 7B obtains 56.6% pass@1, outperforming Minerva 540B by more than 20%. Mathstral scores 68.4% on MATH with majority voting@64, and 74.6% using a reward model. Codestral

thumb_up_off_alt709

chat_bubble_outline13

repeat110

shareShare

Edward Beeching

@edwardbeeching

4 months ago

Our prize winning Math recipe is now released with datasets, training code and a new 72B math model. See thread for more details:

thumb_up_off_alt166

chat_bubble_outline4

repeat28

shareShare

Mistral AI

@mistralai

3 months ago

mistral.ai/news/mistral-l…

thumb_up_off_alt2,2K

chat_bubble_outline126

repeat362

shareShare

Mistral AI

@mistralai

2 months ago

magnet:?xt=urn:btih:7278e625de2b1da598b23954c13933047126238a&dn=pixtral-12b-240910&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce&tr=udp%3A%2F%2Fopen.demonii.com%3A1337%2Fannounce&tr=http%3A%2F%2Ftracker.ipv6tracker.org%3A80%2Fannounce

thumb_up_off_alt4,4K

chat_bubble_outline285

repeat498

shareShare