Abhay Gupta (@gupta__abhay)'s Twitter Profile
Abhay Gupta

@gupta__abhay

Scaling post-training @DbrxMosaicAI | previously @CerebrasSystems @CMU_Robotics | Making GPUs go brrrr !!

ID: 2904700735

19-11-2014 16:48:57

972 Tweets

304 Followers

1.1K Following

Abhay Gupta (@gupta__abhay)

I’ll be at #NeurIPS2025 from 2nd-6th Dec. DM if you want to chat about MoEs, scaling training and inference, making GPUs go brrr and other fun stuff !!

Ashutosh Baheti (@abaheti95)

Will be at #NeurIPS2025 from 2nd to 6th Dec. Excited to chat about async RL, Environment Exploration, Agents/Tool use, User Simulator, Synthetic Data Generation or any other topic!! You can find me at the Databricks booth @ Tue 12 - 4pm

Abhay Gupta (@gupta__abhay)

Riding this wave for a second Chris Barber! Thank you for putting together an amazing list!!! Databricks Mosaic Research is also hiring for researchers and engineers for many projects / at all levels. DM me or come hang out with the team at our social: luma.com/regcpdhi

Databricks (@databricks)

Reliable enterprise agents require system-level reasoning when retrieving across heterogeneous knowledge sources. Traditional RAG often fails to consistently follow instructions, schemas, and constraints end to end. That’s why we’re presenting Instructed Retriever, a new

Andrew Drozdov (@mrdrozdov)

Instructed Retriever is a multi-tiered declarative approach for building high quality search agents. It's an example of an "instructed system", which goes beyond prompt tuning and tool calling by passing data among modules which work together to fulfill an information need.

Andrew Feldman (@andrewdfeldman)

OpenAI and @Cerebras have signed a multi-year agreement to deploy 750 megawatts of Cerebras wafer-scale systems to serve OpenAI customers. This has been a decade in the making. Deployment begins in early 2026, and when fully rolled out, it will be the largest high-speed AI

Matei Zaharia (@matei_zaharia)

Agent memory is a simple and powerful way to do continual learning! With the new MemAlign method from Databricks Research, we can build better LLM judges from examples of human ratings, and they scale with more data. Now in Databricks and MLflow. databricks.com/blog/memalign-…

Ali Ghodsi (@alighodsi)

I now constantly get questions about the SAAS meltdown, role of AI, system of records etc. I don't have an answer to all these. But I do know that we saw an acceleration in our business in Q2, Q3, and now finished the year with accelerating Q4. The question is, why? Short