steve-z (@steve_z_seattle) 's Twitter Profile
steve-z

@steve_z_seattle

#AGI #NLP #DL #RL Deep Learning and Reinforcement Learning practitioner. Interested in NLP, Vision, and RL. Ph.D. in AI.

ID: 1513039652155904001

calendar_today10-04-2022 06:22:28

425 Tweet

175 Followers

296 Following

François Chollet (@fchollet) 's Twitter Profile Photo

We'll have AGI long before we figure out how the brain works. The brain is just very, very complicated. In fact, developing AGI is likely a prerequisite in order to make progress on understanding the brain.

François Chollet (@fchollet) 's Twitter Profile Photo

The main benefit of having extensive programming experience isn't the ability to write software (which is a bit of a commodity -- you could hire someone to do it), it's how it changes the way you think.

Junxian He (@junxian_he) 's Twitter Profile Photo

We replicated the DeepSeek-R1-Zero and DeepSeek-R1 training on 7B model with only 8K examples, the results are surprisingly strong. 🚀 Starting from Qwen2.5-Math-7B (base model), we perform RL on it directly. No SFT, no reward model, just 8K MATH examples for verification, the

We replicated the DeepSeek-R1-Zero and DeepSeek-R1 training on 7B model with only 8K examples, the results are surprisingly strong. 

🚀 Starting from Qwen2.5-Math-7B (base model), we perform RL on it directly. No SFT, no reward model, just 8K MATH examples for verification, the
steve-z (@steve_z_seattle) 's Twitter Profile Photo

Jeremy, you know, 99% of the people who are talking about AI or LLMs on the internet don't know what they are talking about.

SuperAI (@superai_conf) 's Twitter Profile Photo

In 120 days, the global AI ecosystem will converge in Singapore. 7,000+ attendees, 150+ speakers, 2 unparalleled days to connect and co-create. Step into the future at Asia's Largest AI Event, 18-19 June 2025. superai.com

Richard Sutton (@richardssutton) 's Twitter Profile Photo

Rich's slogans for AI research (revised 2006): 1. Approximate the solution, not the problem (no special cases) 2. Drive from the problem 3. Take the agent’s point of view 4. Don’t ask the agent to achieve what it can’t measure 5. Don't ask the agent to know what it can't verify