Sameer Segal (@sameersegal) 's Twitter Profile
Sameer Segal

@sameersegal

Principal Research Engineer at @MSFTResearch India. Working at the intersection of #GenAI and Code. Previously founder Artoo (artoo.com)

ID: 25781745

Website: http://www.sameersegal.com
Joined: 22-03-2009 04:38:24

1.1K Tweets

773 Followers

145 Following

Cognition (@cognition_labs) 's Twitter Profile Photo

How DeepWiki works under the hood, in 2 minutes 📹 For more details on how we build Devin, check out Russell Kaplan's full talk at LangChain Interrupt 🔗👇

Nathan Lambert (@natolambert) 's Twitter Profile Photo

The reason recent RLVR papers show mostly formatting and not learning new skills is just because no one has scaled up enough. If RL compute is <0.1% of overall compute, ofc not much changes. I bet o3 is closer to 5% of total compute. At 10-25% I bet the models feel different again.
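
A rough sense of scale for that claim, as a back-of-the-envelope sketch: the pretraining FLOP budget below is an assumed illustrative number; only the RL-share percentages come from the tweet.

```python
# Back-of-the-envelope only: the pretraining FLOP figure is assumed for
# illustration; only the RL-share percentages (<0.1%, ~5%, 25%) are from the tweet.
pretrain_flops = 1e25                    # assumed pretraining budget
rl_shares = [0.001, 0.05, 0.25]          # RL as a fraction of total compute

for share in rl_shares:
    # if RL is `share` of the total, pretraining is (1 - share) of the total
    rl_flops = pretrain_flops * share / (1 - share)
    total = pretrain_flops + rl_flops
    print(f"RL share {share:.1%}: ~{rl_flops:.1e} RL FLOPs out of {total:.1e} total")
```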

Hamel Husain (@hamelhusain) 's Twitter Profile Photo

Really refreshing to read a post like this from Kyle Corbitt. Incredibly well written throughout, and high value info. (Embarrassed that it took me so long to find this gem) openpipe.ai/blog/art-e-mai…

Sameer Segal (@sameersegal) 's Twitter Profile Photo

Monday motivation: Action leads to Motivation. A short note on handling procrastination as a dev: spectrum.ieee.org/getting-past-p…

Forbes India (@forbesindia) 's Twitter Profile Photo

#30IndianMindsInAI: At MSR India, Kalika Bali, senior principal researcher at the facility, has been building inclusive, multilingual and culturally contextual AI systems that empower the most vulnerable in India. By Naini Thaker Accel in India forbesindia.com/article/ai-spe…

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

The race for LLM "cognitive core" - a few billion param model that maximally sacrifices encyclopedic knowledge for capability. It lives always-on and by default on every computer as the kernel of LLM personal computing. Its features are slowly crystalizing: - Natively multimodal

Sameer Segal (@sameersegal) 's Twitter Profile Photo

I got o3 to digitise an old scan of equity transactions. It spent 5mins on a single page and was able to do it perfectly. It cropped and rotated and even did an integrity check to ensure that all 32 rows were captured. Absolutely amazing!
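
A minimal sketch of reproducing that kind of workflow, assuming the OpenAI Python SDK and API access to an o3-class vision model; the file name, prompt wording, and CSV schema are illustrative, and only the expected row count (32) comes from the tweet.

```python
# Sketch: ask a vision-capable reasoning model to transcribe a scanned ledger
# page to CSV, then run a simple row-count integrity check.
# Assumes the OpenAI Python SDK; file name, prompt, and columns are illustrative.
import base64, csv, io
from openai import OpenAI

client = OpenAI()

with open("equity_transactions_page1.png", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

resp = client.chat.completions.create(
    model="o3",
    messages=[{
        "role": "user",
        "content": [
            {"type": "text",
             "text": "Transcribe every transaction row in this scan as CSV with "
                     "columns date,name,shares,price. Output only data rows, no header."},
            {"type": "image_url",
             "image_url": {"url": f"data:image/png;base64,{image_b64}"}},
        ],
    }],
)

rows = list(csv.reader(io.StringIO(resp.choices[0].message.content.strip())))
assert len(rows) == 32, f"expected 32 transaction rows, got {len(rows)}"  # integrity check
```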

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

Scaling up RL is all the rage right now, I had a chat with a friend about it yesterday. I'm fairly certain RL will continue to yield more intermediate gains, but I also don't expect it to be the full story. RL is basically "hey this happened to go well (/poorly), let me slightly
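
The mechanism being described (the tweet is cut off mid-sentence) is essentially the vanilla policy-gradient update; here is a toy sketch assuming PyTorch, with all names illustrative.

```python
# Minimal REINFORCE-style update: slightly upweight actions from rollouts that
# went well, downweight those that went poorly. Toy policy, PyTorch assumed.
import torch

policy = torch.nn.Linear(4, 2)                  # toy policy over 2 actions
opt = torch.optim.SGD(policy.parameters(), lr=1e-2)

def update(states, actions, reward, baseline=0.0):
    """One policy-gradient step: scale log-prob of taken actions by (reward - baseline)."""
    logits = policy(states)
    logp = torch.log_softmax(logits, dim=-1)
    taken = logp[torch.arange(len(actions)), actions]
    loss = -(reward - baseline) * taken.mean()  # went well => raise prob; poorly => lower it
    opt.zero_grad()
    loss.backward()
    opt.step()

# e.g. a 5-step rollout that scored above the baseline gets slightly reinforced
update(torch.randn(5, 4), torch.randint(0, 2, (5,)), reward=1.0, baseline=0.3)
```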

Sameer Segal (@sameersegal) 's Twitter Profile Photo

If you ask the model to "draw the world map", it does it perfectly which shows how much it has memorized, but when you ask it "Is (x,y) coordinate land or sea?" you get to see how much it has inferred from raw data!
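
A sketch of what that probe could look like, assuming the OpenAI Python SDK; the model name, prompt wording, and hand-picked reference coordinates are illustrative assumptions, not from the tweet.

```python
# Probe sketch: ask the model "land or sea?" for a few coordinates and compare
# against known answers. Model name, prompt, and reference points are illustrative.
from openai import OpenAI

client = OpenAI()

# (lat, lon, ground truth) -- a few unambiguous reference points
probes = [
    (28.6, 77.2, "land"),    # Delhi
    (0.0, -30.0, "sea"),     # mid-Atlantic
    (-25.0, 135.0, "land"),  # central Australia
    (35.0, -150.0, "sea"),   # North Pacific
]

correct = 0
for lat, lon, truth in probes:
    resp = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user",
                   "content": f"Is the point at latitude {lat}, longitude {lon} "
                              f"land or sea? Answer with one word."}],
    )
    answer = resp.choices[0].message.content.strip().lower()
    correct += answer.startswith(truth)

print(f"{correct}/{len(probes)} coordinate probes answered correctly")
```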

Dimitris Papailiopoulos (@dimitrispapail) 's Twitter Profile Photo

Thinking Less at test-time requires Sampling More at training-time! GFPO is a new, cool, and simple Policy Opt algorithm coming to your RL Gym tonite, led by Vaish Shrivastava and our MSR group: Group Filtered PO (GFPO) trades off training-time with test-time compute, in order
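
A rough sketch of the idea as described (the tweet is truncated): sample a larger group per prompt at training time, filter it, and compute group-relative advantages only on the retained responses. The filtering metric (reward per output token) and all names here are assumptions for illustration, not the authors' implementation.

```python
# Sketch of a group-filtered policy-optimization step for one prompt:
# extra sampling at training time, but only the short/efficient responses
# are reinforced, pushing the policy to "think less" at test time.
import numpy as np

def gfpo_advantages(rewards, lengths, keep_k):
    """Keep the k most token-efficient responses (reward per token, an assumed
    metric) and compute normalized advantages within that subset; others get 0."""
    rewards = np.asarray(rewards, float)
    lengths = np.asarray(lengths, float)
    efficiency = rewards / lengths                 # assumed filtering metric
    kept = np.argsort(-efficiency)[:keep_k]        # retain the top-k efficient responses
    adv = np.zeros_like(rewards)
    mu, sigma = rewards[kept].mean(), rewards[kept].std() + 1e-6
    adv[kept] = (rewards[kept] - mu) / sigma       # group-relative normalization on the kept subset
    return adv

# e.g. sample 8 responses per prompt at training time (extra sampling cost),
# but only reinforce the 4 that were both high-reward and short
adv = gfpo_advantages(rewards=[0.9, 1.0, 0.2, 0.8, 0.1, 0.7, 0.3, 0.0],
                      lengths=[900, 300, 1200, 500, 700, 400, 800, 650],
                      keep_k=4)
print(np.round(adv, 2))
```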

Sameer Segal (@sameersegal) 's Twitter Profile Photo

A security researcher shows how you can fine-tune an open-weight model to make malicious tool calls (e.g. push sensitive information to a remote server) and upload it to HuggingFace. More than 500 people downloaded the poisoned model! pub.aimind.so/doubleagents-f…
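
On the defensive side, the kind of guard this attack argues for can be sketched as an allowlist check on model-emitted tool calls before they are executed; the function and host names below are hypothetical.

```python
# Hypothetical defensive sketch: before executing a model-emitted tool call,
# verify that any network destination it references is on an explicit allowlist,
# so a poisoned model cannot silently exfiltrate data to a remote server.
from urllib.parse import urlparse

ALLOWED_HOSTS = {"api.internal.example.com"}   # illustrative allowlist

def is_safe_tool_call(tool_name: str, arguments: dict) -> bool:
    """Reject any tool call whose string arguments point at a non-allowlisted host."""
    for value in arguments.values():
        if isinstance(value, str) and value.startswith(("http://", "https://")):
            host = urlparse(value).hostname or ""
            if host not in ALLOWED_HOSTS:
                return False
    return True

# e.g. a poisoned model trying to push data off-box gets blocked
print(is_safe_tool_call("http_post", {"url": "https://attacker.example.net/upload",
                                      "body": "..."}))  # False
```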