Anand Sukumaran (@anandrmedia) 's Twitter Profile
Anand Sukumaran

@anandrmedia

Co-founder & CTO @Engagespot (Backed by Techstars NYC) - The modern way to add emails and notifications to your app πŸš€ | Logidots $200K ARR

ID: 60270429

linkhttps://anands.me calendar_today26-07-2009 09:10:11

3,3K Tweet

949 Followers

215 Following

Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

Yesterday, a new type of LLM (diffusion-based LLM) was released. I ran a few tests to understand how it performs. This could be the next breakthrough πŸš€ πŸ”Έ What is a diffusion-based LLM, and how is it different from other LLMs like ChatGPT? The LLMs we use today work by

Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

The simplest way to understand how an LLMs like ChatGPT works (non-tech explanation) with an example πŸ’‘ An LLM has only one job: predict the next word (or "token," to be precise). That's it. Not a sentence, not a story, just the next word! Example: (Look at the diagram) πŸ‘‰ If

The simplest way to understand how an LLMs like ChatGPT works (non-tech explanation) with an example πŸ’‘ 

An LLM has only one job: predict the next word (or "token," to be precise). That's it. Not a sentence, not a story, just the next word!

Example: (Look at the diagram)

πŸ‘‰ If
Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

I built an agentic software development team to see how agents collaborate to complete a task πŸš€ using a task manager. I added three agents to the team - - Project Manager - Developer - QA. Also, all of them have access to a common tool - a Task Manager! πŸ‘‰ I then told the

I built an agentic software development team to see how agents collaborate to complete a task πŸš€ using a task manager.

I added three agents to the team - 
- Project Manager
- Developer
- QA. 

Also, all of them have access to a common tool - a Task Manager!

πŸ‘‰ I then told the
Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

Why do LLMs fail to count letters in a word? Here's an easy way to understand. But let's NOT do the famous "strawberry" test. πŸ“ Because, most LLMs now answer that correctly (memorised it probably?). Instead, use a slightly misspelled word - "strawberi" or "strayberi," etc. πŸ‘‰

Why do LLMs fail to count letters in a word? Here's an easy way to understand. But let's NOT do the famous "strawberry" test. πŸ“ 

Because, most LLMs now answer that correctly (memorised it probably?). Instead, use a slightly misspelled word - "strawberi" or "strayberi," etc.

πŸ‘‰
Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

Do LLMs "understand" what they are saying? Yes, they do. They are not just simple next-token predictors! If you want a true next-token predictor, look at how a Markov chain works. Its behavior is predictable. But LLMs are not. Why? πŸ‘‰ Because LLMs do not process words, tokens,

Do LLMs "understand" what they are saying? Yes, they do. They are not just simple next-token predictors!

If you want a true next-token predictor, look at how a Markov chain works. Its behavior is predictable. But LLMs are not. Why?

πŸ‘‰ Because LLMs do not process words, tokens,
Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

I built a simple MCP server to turn my Claude Desktop into an analytics tool! πŸš€ I created a MySQL MCP server and added it to my Claude Desktop. Now, it's not just an LLM that generates answers - it can also read my database, fetch the right data by writing accurate queries,

I built a simple MCP server to turn my Claude Desktop into an analytics tool! πŸš€ 

I created a MySQL MCP server and added it to my Claude Desktop. Now, it's not just an LLM that generates answers - it can also read my database, fetch the right data by writing accurate queries,
Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

Why can't LLMs do math? Why does it work when they are allowed to 'think'? A simple explanation. For simple math problems, like 2 + 2 = ?, the model answers correctly because it has seen that exact question countless times during training. In fact, it might have memorized "2 + 2

Why can't LLMs do math? Why does it work when they are allowed to 'think'? A simple explanation.

For simple math problems, like 2 + 2 = ?, the model answers correctly because it has seen that exact question countless times during training. In fact, it might have memorized "2 + 2
Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

This comic is completely generated by AI. Yes, the new image generator update in GPT-4o is insane! πŸš€ The most impressive thing about this update is that the text looks much better than in previous image models. I even tried generating posters, infographics, and even food

This comic is completely generated by AI. Yes, the new image generator update in GPT-4o is insane! πŸš€

The most impressive thing about this update is that the text looks much better than in previous image models. I even tried generating posters, infographics, and even food
Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

My AI refuses to believe it's an AI πŸ˜…. I fine-tuned a base language model just enough so it could answer questions - but I gave it zero instructions about its identity. (which they usually do - "You are ChatGPT") Instead, I just taught to answer questions. And I started asking

My AI refuses to believe it's an AI πŸ˜…. I fine-tuned a base language model just enough so it could answer questions - but I gave it zero instructions about its identity. (which they usually do - "You are ChatGPT")

Instead, I just taught to answer questions.

And I started asking
Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

I built an MCP server that finds up-to-date documentation for any npm package you add to your project. This makes sure your AI IDE (like Cursor) doesn't generate outdated implementations of the npm packages you use. github.com/meanands/npm-p…

Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

I built an MCP server that turns my SaaS product (Engagespot) into an Agentic experience! If you want to learn how to build AI Agents and MCP servers, checkout -> youtu.be/H4Odxtncxrk

Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

I built an MCP Client in JavaScript from scratch! Yes - without using the MCP library. Why? Because I wanted to truly understand how MCP client-server communication works over the JSON-RPC protocol behind the scenes. Here's what I learned πŸ‘‡ πŸ“ MCP client–server communication

I built an MCP Client in JavaScript from scratch! Yes - without using the MCP library.

Why? Because I wanted to truly understand how MCP client-server communication works over the JSON-RPC protocol behind the scenes.

Here's what I learned πŸ‘‡

πŸ“ MCP client–server communication
Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

Google just released Geminiβ€―2.5 Flash Image model. One thing that impressed me is it's character consistency! Photoshop is now officially cooked! πŸ˜… I tried the model with my own photos, and the results are impressive compared to any existing image models out there. Look at the

Google just released Geminiβ€―2.5 Flash Image model. One thing that impressed me is it's character consistency! Photoshop is now officially cooked! πŸ˜…

I tried the model with my own photos, and the results are impressive compared to any existing image models out there. Look at the
Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

I built (vibe coded) a Chrome extension that lets you replace Myntra's models with your own photo πŸ˜‰ With this virtual try-on tool, we can upload our photo and see yourself wearing any outfit directly on the Myntra product page. The extension uses Google's newly released Gemini

I built (vibe coded) a Chrome extension that lets you replace Myntra's models with your own photo πŸ˜‰

With this virtual try-on tool, we can upload our photo and see yourself wearing any outfit directly on the Myntra product page.

The extension uses Google's newly released Gemini
Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

OpenAI might have just killed many startups with the AgentKit launched yesterday. They made it super easy to build AI agents without using complex frameworks. All using a simple drag-and-drop workflow editor to design logic, add agents, run evals, do vector search, and even

OpenAI might have just killed many startups with the AgentKit launched yesterday.

They made it super easy to build AI agents without using complex frameworks. All using a simple drag-and-drop workflow editor to design logic, add agents, run evals, do vector search, and even
Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

Fine-tuning isn't the final answer for improving agentic performance - at least that's what a new research paper from Stanford says. This new paper from Stanford, SambaNova, UC Berkeley introduces a new strategy - Agentic Context Engineering (ACE). It talks about a better way

Anand Sukumaran (@anandrmedia) 's Twitter Profile Photo

People calling AI a bubble- If you want to know what's hype vs what's real, ask: Can you live without it? Now try that with ChatGPT. Can we go back to the pre-ChatGPT era?