Sean Ma (@_seantma)'s Twitter Profile
Sean Ma

@_seantma

Leveraging #AI #ML to bring business value for #healthcare.

ID: 2408916181

https://www.linkedin.com/in/seantma 24-03-2014 12:02:46

729 Tweets

224 Followers

560 Following

Lior⚡ (@lioronai)

This might be the biggest moment for Open-Source AI. Meta just released Llama 3.1 and a 405 billion parameter model, the most sophisticated open model ever released. It already outperforms GPT-4o on several benchmarks.

Rowan Cheung (@rowancheung)

NEWS: DeepSeek just dropped ANOTHER open-source AI model, Janus-Pro-7B. It's multimodal (can generate images) and beats OpenAI's DALL-E 3 and Stable Diffusion across GenEval and DPG-Bench benchmarks. This comes on top of all the R1 hype. The 🐋 is cookin'

Qwen (@alibaba_qwen)

🎉 Wishing you prosperity 🧧🐍 As we welcome the Chinese New Year, we're thrilled to announce the launch of Qwen2.5-VL, our latest flagship vision-language model! 🚀 💗 Qwen Chat: chat.qwenlm.ai 📖 Blog: qwenlm.github.io/blog/qwen2.5-v… 🤗 Hugging Face: huggingface.co/collections/Qw… 🤖 ModelScope:

Harrison Chase (@hwchase17)

yeah deep research is great... but have you ever wanted it open source, with swappable models, and able to research over your own data? GPT-Researcher is exactly that - the leading OSS AI Researcher project github.com/assafelovic/gp…

Lior⚡ (@lioronai)

Wow, someone just released a notebook to train a reasoning LLM with the new RL algorithm from DeepSeek, GRPO. In <2 hours, you can transform a very small model, Qwen 0.5 (500 million parameters) into a tiny math reasoning machine.

Hugo Bowne-Anderson (@hugobowne)

📚 A hand-picked list of free resources for building reliable LLM applications—covering Python, deep learning, evaluation, MLOps, and prompt engineering. Everything here is open and accessible—no paywalls, no subscriptions, just great content to dive into (even the books!).

Sean Ma (@_seantma)

Hey Google AI, why can't Gemini 2.0 Flash see my attached file when trying to include it in my prompt? Am I prompting it wrong? I've also tried Gemini 2.5 and it behaves similarly - as if no file was attached.

Toby Kim (@_doyeob_)

Two undergrads. One still in the military. Zero funding. One ridiculous goal: build a TTS model that rivals NotebookLM Podcast, ElevenLabs Studio, and Sesame CSM. Somehow… we pulled it off. Here’s how 👇

Abhishek (@heyabhishekk)

Grok-3 can now help you create Mind Maps. No more wasting hours creating visuals for studying or breaking down complex topics. Here’s how to create a mind map in just a few minutes:

Min Choi (@minchoi)

It’s been just 9 days since OpenAI dropped o3. People are already bending it like AGI magic. 10 wild examples + pro tips:

Sean Ma (@_seantma)

Keeping the “not-for-profit” status is great for the public, given the risk of having a for-profit board control the advancement of AI!

elvis (@omarsar0)

LLMs Get Lost in Multi-turn Conversation. The cat is out of the bag. Pay attention, devs. This is one of the most common issues when building with LLMs today. Glad there is now a paper to share insights. Here are my notes:

Gary Marcus (@garymarcus)

BREAKING: Explosive new paper from MIT/Harvard/UChicago. Things just got worse — a lot worse — for LLMs and the myth that they can understand and reason. The paper documents a pattern they called Potemkins, a kind of reasoning inconsistency (see figure below). They show that

Sebastian Raschka (@rasbt)

Btw if you're learning how to build LLMs from the ground up, there's now a 17h companion video course for my LLMs From Scratch book on Manning: manning.com/livevideo/mast… It follows the book chapter by chapter, so it works great either as a standalone or code-along resource. It's

LangChain (@langchainai)

Open Deep Research is here 🔍 We've open sourced one of the most powerful agent use cases. Built on LangGraph, Open Deep Research:

• Uses a supervisor architecture to coordinate research sub-agents
• Supports your own LLMs, tools, and MCP servers
• Produces high-quality

Andrej Karpathy (@karpathy)

I think congrats again to OpenAI for cooking with GPT-5 Pro. This is the third time I've struggled on something complex/gnarly for an hour on and off with CC, then 5 Pro goes off for 10 minutes and comes back with code that works out of the box. I had CC read the 5 Pro version

LangChain (@langchainai)

🌐⚡ BLAST: AI Web Browser Engine A high-performance serving engine that adds web browsing to AI applications. BLAST provides an OpenAI-compatible interface with automatic parallelization, intelligent caching, and real-time streaming support. Explore this open-source project 👉

LangChain (@langchainai)

🐍💬 Chatsky: Pure Python Dialog Framework A framework for building conversational services in pure Python, featuring a dialog graph system that integrates with LangGraph. Includes backend support for building sophisticated AI applications. Explore the framework
