Simon FL
@simonfl
Husband to @pearlsesq, Software engineer @databricks, French Canadian (i.e. likes poutine and hockey), previously @stripe, @SlackHQ, @Foursquare, @Google
ID: 1282781
16-03-2007 12:45:23
211 Tweet
968 Takipçi
543 Takip Edilen
Gergely Orosz I always keep in mind the following: When you talk to someone you have the responsibility to do your best to adapt to the other person skill level. If you don't then you are disrespecting the other person and you are losing your time.
First day on the job at Databricks and we're already making some big moves. Exciting times ahead!
Hey MiniMax (official), I'm trying to serve M1-80k on vLLM. Your docs say "a server with 8 H800s can process inputs up to 2 million tokens" but then recommend --max_model_len 4096. What settings did you use for 2M tokens? I'm trying this on 8 H100s.
RLVR isn't just for math and coding! At Databricks, it's impacting products and users across domains. One example: SQL Q&A. We hit the top of the BIRD single-model single-generation leaderboard with our standard TAO+RLVR recipe - the one rolling out in our Agent Bricks product.
Since joining Databricks, our research team has been hard at work on Agent Bricks, a new product that helps enterprises develop state-of-the-art domain-specific agents. We are now releasing a research blog about Agent Learning from Human Feedback (ALHF) databricks.com/blog/agent-lea…