Maithra Raghu (@maithra_raghu) 's Twitter Profile
Maithra Raghu

@maithra_raghu

Cofounder and CEO @Samaya_AI. Formerly Research Scientist Google Brain (@GoogleAI), PhD in ML @Cornell.

ID: 888216099757490176

linkhttp://maithraraghu.com calendar_today21-07-2017 01:56:35

445 Tweet

18,18K Followers

498 Following

Nathan Benaich (@nathanbenaich) 's Twitter Profile Photo

News! Modal is supporting @RAAIS fellowships with $10,000 credits on their platform, alongside the $5,000 GPU credits from Lambda. Working on AI-first, open source research outside big institutions? Apply at airstreet(dot)com/fellowships

News! <a href="/modal_labs/">Modal</a> is supporting @RAAIS fellowships with $10,000 credits on their platform, alongside the $5,000 GPU credits from <a href="/LambdaAPI/">Lambda</a>.

Working on AI-first, open source research outside big institutions?

Apply at airstreet(dot)com/fellowships
Maithra Raghu (@maithra_raghu) 's Twitter Profile Photo

๐ŸŽ†๐ŸŽ‰ Happy New Year!! ๐ŸŽ†๐ŸŽ‰ Grateful to have wrapped up an incredible year at Samaya AI seeing huge growth in usage and customers! It is a privilege to work with a world-class AI team that makes this momentum possible! ๐Ÿš€ Over the break, I took some time to dive into o1's

Maithra Raghu (@maithra_raghu) 's Twitter Profile Photo

Key takeaways from reading the DeepSeek R1 paper: AI training matters, a lot! A lot of the focus in AI has been on the data โ€” how to collect it, generate it, ensure high quality. The DeepSeek paper also shows the critical role in the right multi-stage training process in

Key takeaways from reading the DeepSeek R1 paper:

AI training matters, a lot! A lot of the focus in AI has been on the data โ€” how to collect it, generate it, ensure high quality. The DeepSeek paper also shows the critical role in the right multi-stage training process in
Maithra Raghu (@maithra_raghu) 's Twitter Profile Photo

Discussion on AI progress today conflates "beauty" and "truth". "Beauty" is the amazing technical progress we see on challenging but carefully constructed, idealized benchmarks. "Truth" is the tangible impact AI has on the real world --- tackling ambiguity, noisiness, and

Samaya AI (@samaya_ai) 's Twitter Profile Photo

๐Ÿš€ Samaya AI is hiring 1 to 2 PhD summer research interns in Mountain View, CA! Work on cutting-edge topics: knowledge-grounded LLMs/agents, evaluations, data curation, RL, search/IR, and more! Why Samaya? ๐Ÿ”น 12-16 week paid internship with hands-on mentorship ๐Ÿ”น

Maithra Raghu (@maithra_raghu) 's Twitter Profile Photo

True Expert Level AI is hard to achieve when staying general purpose. Take this recent AI generated Tesla stock analysis:ย  pljclduq.manus.space At first, it looks impressive. But digging deeper, we see that: (i) The financials are out of date (no details from 2024) (ii)

True Expert Level AI is hard to achieve when staying general purpose.

Take this recent AI generated Tesla stock analysis:ย  pljclduq.manus.space At first, it looks impressive. But digging deeper, we see that:

(i) The financials are out of date (no details from 2024)

(ii)
Maithra Raghu (@maithra_raghu) 's Twitter Profile Photo

๐Ÿš€ Thrilled to share that @SamayaAI has raised $43.5M in funding led by NEA to build Expert AI Agents for financial services and transform knowledge work at scale. We started Samaya in 2022 โ€” before ChatGPT โ€” with a belief: ๐Ÿ’ก AI could revolutionize sophisticated financial

๐Ÿš€ Thrilled to share that @SamayaAI has raised $43.5M in funding led by <a href="/NEA/">NEA</a> to build Expert AI Agents for financial services and transform knowledge work at scale.

We started Samaya in 2022 โ€” before ChatGPT โ€” with a belief:
 ๐Ÿ’ก AI could revolutionize sophisticated financial
Maithra Raghu (@maithra_raghu) 's Twitter Profile Photo

Standard AI benchmarks are hitting a wall. As model capabilities grow, so do our expectations. It's no longer impressive for an AI to summarize a single document or regurgitate facts. In real-world use casesโ€”like expert financial analysisโ€”AI must deliver much more, e.g. 1)

Standard AI benchmarks are hitting a wall.
As model capabilities grow, so do our expectations. 

It's no longer impressive for an AI to summarize a single document or regurgitate facts. In real-world use casesโ€”like expert financial analysisโ€”AI must deliver much more, e.g.

1)
Maithra Raghu (@maithra_raghu) 's Twitter Profile Photo

๐Ÿ“‰ ๐—ช๐—ต๐—ฎ๐˜ ๐—ต๐—ฎ๐—ฝ๐—ฝ๐—ฒ๐—ป๐—ฒ๐—ฑ ๐—ฎ๐—ป๐—ฑ ๐˜„๐—ต๐—ฎ๐˜ ๐—ฐ๐—ต๐—ฎ๐—ป๐—ด๐—ฒ๐—ฑ ๐—ถ๐—ป ๐˜†๐—ฒ๐˜€๐˜๐—ฒ๐—ฟ๐—ฑ๐—ฎ๐˜†'๐˜€ ๐—™๐—ฒ๐—ฑ ๐—บ๐—ฒ๐—ฒ๐˜๐—ถ๐—ป๐—ด: ๐Ÿ“ ๐—ข๐˜‚๐˜๐—ฐ๐—ผ๐—บ๐—ฒ๐˜€: โ€ข Rate stayed the same at ๐Ÿฐ.๐Ÿฎ๐Ÿฑโ€“๐Ÿฐ.๐Ÿฑ% โ€ข Projecting ๐˜๐˜„๐—ผ ๐—ฟ๐—ฎ๐˜๐—ฒ ๐—ฐ๐˜‚๐˜๐˜€ later this year โ€ข No change to ๐—ฏ๐—ฎ๐—น๐—ฎ๐—ป๐—ฐ๐—ฒ ๐˜€๐—ต๐—ฒ๐—ฒ๐˜ policies

๐Ÿ“‰ ๐—ช๐—ต๐—ฎ๐˜ ๐—ต๐—ฎ๐—ฝ๐—ฝ๐—ฒ๐—ป๐—ฒ๐—ฑ ๐—ฎ๐—ป๐—ฑ ๐˜„๐—ต๐—ฎ๐˜ ๐—ฐ๐—ต๐—ฎ๐—ป๐—ด๐—ฒ๐—ฑ ๐—ถ๐—ป ๐˜†๐—ฒ๐˜€๐˜๐—ฒ๐—ฟ๐—ฑ๐—ฎ๐˜†'๐˜€ ๐—™๐—ฒ๐—ฑ ๐—บ๐—ฒ๐—ฒ๐˜๐—ถ๐—ป๐—ด:

๐Ÿ“ ๐—ข๐˜‚๐˜๐—ฐ๐—ผ๐—บ๐—ฒ๐˜€:
โ€ข Rate stayed the same at ๐Ÿฐ.๐Ÿฎ๐Ÿฑโ€“๐Ÿฐ.๐Ÿฑ%
โ€ข Projecting ๐˜๐˜„๐—ผ ๐—ฟ๐—ฎ๐˜๐—ฒ ๐—ฐ๐˜‚๐˜๐˜€ later this year
โ€ข No change to ๐—ฏ๐—ฎ๐—น๐—ฎ๐—ป๐—ฐ๐—ฒ ๐˜€๐—ต๐—ฒ๐—ฒ๐˜ policies
Maithra Raghu (@maithra_raghu) 's Twitter Profile Photo

Precision at scale is one of the hardest asks for LLMs today. Great work by the team in building real time self-correction to push the boundaries of accuracy and comprehensiveness.