Shivam Negi (@shivam_negi_ai) 's Twitter Profile
Shivam Negi

@shivam_negi_ai

Staff Data Scientist @ Walmart

AI / Nature Photography / Research

ID: 1653227793612779521

calendar_today02-05-2023 02:40:14

27 Tweet

8 Takipçi

144 Takip Edilen

Santiago (@svpino) 's Twitter Profile Photo

This should be impossible! With only 3 lines, this open-source library finds every bad image in your dataset: • Blurry • Under/over-exposed • Oddly sized • Duplicates If you are into AI/ML, star this repo: github.com/cleanlab/clean…. They are building incredible things!

This should be impossible!

With only 3 lines, this open-source library finds every bad image in your dataset:

• Blurry
• Under/over-exposed
• Oddly sized
• Duplicates

If you are into AI/ML, star this repo: github.com/cleanlab/clean…. They are building incredible things!
Mark Riedl (@mark_riedl) 's Twitter Profile Photo

At today’s White House meeting on AI: - OpenAI - people who left OpenAI because it wasn’t focused enough on existential risk - people who bought the exclusive rights to everything OpenAI makes - Google

maharshi (@mrsiipa) 's Twitter Profile Photo

i have noticed that LLMs like claude and gpt-4o work really well with this prompt, it instructs them to 'contemplate' for a bit before giving the final answer.

i have noticed that LLMs like claude and gpt-4o work really well with this prompt, it instructs them to 'contemplate' for a bit before giving the final answer.
Shivam Negi (@shivam_negi_ai) 's Twitter Profile Photo

Just visited the men’s washroom at Indore Airport and was impressed by how spotless it was. A big shoutout to Shreeram ji, the cleaner, for his hard work and dedication. Such individuals truly make a difference. Kudos to him! Airports Authority of India Swachh Survekshan

Niklas Muennighoff (@muennighoff) 's Twitter Profile Photo

DeepSeek r1 is exciting but misses OpenAI’s test-time scaling plot and needs lots of data. We introduce s1 reproducing o1-preview scaling & performance with just 1K samples & a simple test-time intervention. 📜arxiv.org/abs/2501.19393

DeepSeek r1 is exciting but misses OpenAI’s test-time scaling plot and needs lots of data.

We introduce s1 reproducing o1-preview scaling & performance with just 1K samples & a simple test-time intervention.

📜arxiv.org/abs/2501.19393
Sam Bowman (@sleepinyourhat) 's Twitter Profile Photo

My team is hiring researchers! I’m primarily interested in candidates who have (i) several years of experience doing excellent work as a SWE or RE, (ii) who have substantial research experience of some form, and (iii) who are familiar with modern ML and the AGI alignment

Andrew Ng (@andrewyng) 's Twitter Profile Photo

Announcing AI Dev 25: A conference for AI developers, this Pi day (3/14/2025)! There're great AI academic conferences for researchers (NeurIPS, ICLR, ICML, etc.) and some companies hold great meetings around their products (Google I/O, OpenAI DevDay, etc.). But we need more

elvis (@omarsar0) 's Twitter Profile Photo

Chain-of-Associated-Thoughts (CoAT) is a new framework that enhances LLMs' reasoning abilities by combining Monte Carlo Tree Search with dynamic knowledge integration. The framework addresses the limitations of existing "fast thinking" approaches by introducing an "associative

Chain-of-Associated-Thoughts (CoAT) is a new framework that enhances LLMs' reasoning abilities by combining Monte Carlo Tree Search with dynamic knowledge integration.

The framework addresses the limitations of existing "fast thinking" approaches by introducing an "associative
TuringPost (@theturingpost) 's Twitter Profile Photo

A new feature from Anthropic: "Think" tool for Claude Once Claude starts generating a response, "think" tool creates dedicated space to pause and reflect on whether Claude has gathered all necessary information. It's not the same feature as “extended thinking,” which happens

A new feature from <a href="/AnthropicAI/">Anthropic</a>: "Think" tool for Claude

Once Claude starts generating a response, "think" tool creates dedicated space to pause and reflect on whether Claude has gathered all necessary information.

It's not the same feature as “extended thinking,” which happens
elvis (@omarsar0) 's Twitter Profile Photo

Neat report from Microsoft providing a taxonomy of failure modes in agentic AI systems. If you are building agentic systems today, you will run into many issues. Some of the common ones are summarized in this report. Great resource for AI devs.

Neat report from Microsoft providing a taxonomy of failure modes in agentic AI systems.

If you are building agentic systems today, you will run into many issues.

Some of the common ones are summarized in this report.

Great resource for AI devs.
Shivam Negi (@shivam_negi_ai) 's Twitter Profile Photo

Just finished reading Chip Huyen's "ML System Design". Highly recommend it to anyone looking to deepen their understanding of system design in the ML space! 📚🤖 #MLSystemDesign #ChipHuyen #MachineLearning open.substack.com/pub/shivamnegi…

Boaz Barak (@boazbaraktcs) 's Twitter Profile Photo

I didn't want to post on Grok safety since I work at a competitor, but it's not about competition. I appreciate the scientists and engineers at xAI but the way safety was handled is completely irresponsible. Thread below.

Brett Adcock (@adcock_brett) 's Twitter Profile Photo

Another massive week of AI and robotics news. I summarized everything from OpenAI, xAI, Google, Cognition, Figure, AWS, Meta, UBTECH, Pollen Robotics, Agility Robotics, and more. Here's everything you need to know and how to make sense out of it: