Berivan Isik (@berivanisik) 's Twitter Profile
Berivan Isik

@berivanisik

Research scientist @GoogleAI. Efficient & trustworthy AI, LLMs, Gemini data & eval | prev: PhD @Stanford @StanfordAILab

ID: 2701532126

linkhttps://sites.google.com/view/berivanisik calendar_today02-08-2014 19:11:20

299 Tweet

10,10K Takipçi

1,1K Takip Edilen

Dan Alistarh (@dalistarh) 's Twitter Profile Photo

Our QuEST paper was selected for Oral Presentation at ICLR Sparsity in LLMs Workshop at ICLR 2025 workshop! QuEST is the first algorithm with Pareto-optimal LLM training for 4bit weights/activations, and can even train accurate 1-bit LLMs. Paper: arxiv.org/abs/2502.05003 Code: github.com/IST-DASLab/QuE…

Mehmet Hamza Erol (@mhamzaerol) 's Twitter Profile Photo

How much does a correct answer from an LM cost? How much has AI lowered the cost of solving problems? Meet Cost‑of‑Pass: An Economic Framework for Evaluating LMs! Cost‑of‑Pass = expected $ for one correct answer. Frontier Cost‑of‑Pass = cheapest route: an LM or a human expert.

How much does a correct answer from an LM cost?
How much has AI lowered the cost of solving problems?

Meet Cost‑of‑Pass: An Economic Framework for Evaluating LMs!

Cost‑of‑Pass = expected $ for one correct answer.
Frontier Cost‑of‑Pass = cheapest route: an LM or a human expert.
Sparsity in LLMs Workshop at ICLR 2025 (@sparsellms) 's Twitter Profile Photo

Sparse LLM workshop will run on Sunday with two poster sessions, a mentoring session, 4 spotlight talks, 4 invited talks and a panel session. We'll host an amazing lineup of researchers: Dan Alistarh Vithu Thangarasa Yuandong Tian Amir Yazdan Gintare Karolina Dziugaite Olivia Hsu Pavlo Molchanov Yang Yu

Sparse LLM workshop will run on Sunday with two poster sessions, a mentoring session, 4 spotlight talks, 4 invited talks and a panel session. 

We'll host an amazing lineup of researchers: <a href="/DAlistarh/">Dan Alistarh</a> <a href="/vithursant19/">Vithu Thangarasa</a> <a href="/tydsh/">Yuandong Tian</a> <a href="/ayazdanb/">Amir Yazdan</a> <a href="/gkdziugaite/">Gintare Karolina Dziugaite</a> Olivia Hsu <a href="/PavloMolchanov/">Pavlo Molchanov</a> Yang Yu
Ahmad Beirami @ ICLR 2025 (@abeirami) 's Twitter Profile Photo

A PhD is not about getting over the noisy conference bar and publishing a bunch of papers. If that's what your PhD is about, you need to take a step back and reconsider your path. And a PhD advisor's job is certainly not just to help make papers happen.

kyle morris (@kylejohnmorris) 's Twitter Profile Photo

Join 500+ ambitious builders forging an open democratic future for AI with DEMi, backed by @cyberfund_ 🌟Announcing the Democratize Intelligence Summit🌟 Free. Invite-only. 📅 June 27, 2025📍SF Registration in replies 👇 DEMi3 welcomes 4 tracks of builders: ⚙️

Join 500+ ambitious builders forging an open democratic future for AI with <a href="/demi_network/">DEMi</a>, backed by @cyberfund_

🌟Announcing the Democratize Intelligence Summit🌟

Free. Invite-only. 📅 June 27, 2025📍SF

Registration in replies 👇

DEMi3 welcomes 4 tracks of builders: 
⚙️
lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

🚨Breaking: New Gemini-2.5-Pro (06-05) takes the #1 spot across all Arenas again! 🥇 #1 in Text, Vision, WebDev 🥇 #1 in Hard, Coding, Math, Creative, Multi-turn, Instruction Following, and Long Queries categories Huge congrats Google DeepMind!

🚨Breaking: New Gemini-2.5-Pro (06-05) takes the #1 spot across all Arenas again!

🥇 #1 in Text, Vision, WebDev
🥇 #1 in Hard, Coding, Math, Creative, Multi-turn, Instruction Following, and Long Queries categories

Huge congrats <a href="/GoogleDeepMind/">Google DeepMind</a>!
Kate Lin (@k8_lin_) 's Twitter Profile Photo

Tabular data is a really exciting and rich domain to explore reasoning for LLMs 🧠 Check out our latest work on understanding the critical gaps that emerge when reasoning over messy, real-world data: arxiv.org/abs/2506.08249 and big kudos to Ken for the beautiful figures 👏

Demis Hassabis (@demishassabis) 's Twitter Profile Photo

It’s been an amazing few months of relentless building, shipping, and optimising our models incorporating your feedback. Excited for more users and developers to try out the incredible Gemini 2.5 series!

Berivan Isik (@berivanisik) 's Twitter Profile Photo

Proud to have contributed to this truly transformative milestone! Check out the tech report for details 🧐 storage.googleapis.com/deepmind-media…