Tejas Khot (@tjskhot) Twitter Tweets • TwiCopy

Tejas Khot

@tjskhot

+ Follow

Building defensive AI @AbnormalSec
Prev: Applied Scientist @Amazon, Robotics @CarnegieMellon

ID: 2600172548

linkhttps://tejaskhot.github.io/ calendar_today02-07-2014 17:05:34

891 Tweet

355 Takipçi

1,1K Takip Edilen

Tejas Khot

@tjskhot

2 years ago

🚀

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

🎙️Unleashing my inner researcher + policy + political wonk combo this convo with daniel bashir for The Gradient. We tackle the mahjong of AI policy - China, innovation/regulation trade-offs, and why researchers becoming bilingual (in tech & policy-speak) is crucial.

thumb_up_off_alt9

chat_bubble_outline1

repeat6

shareShare

Soren Iverson

@soren_iverson

2 years ago

Google Meet option to talk with a BetterHelp therapist after stressful meetings

thumb_up_off_alt8,8K

chat_bubble_outline90

repeat473

shareShare

Tejas Khot

@tjskhot

2 years ago

Some cool ideas in the thread

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

anu

@anuatluru

2 years ago

the purpose of reading a book isn’t to retain information, it’s to refine your worldview just a little bit with each one

thumb_up_off_alt13,13K

chat_bubble_outline94

repeat2,2K

shareShare

Evan Reiser

@evanreiser

2 years ago

Ben Lang Abnormal Security: >$100M ARR growing >100% y/y (series C). Help use good AI to stop crime, and protect humans and from bad AI! careers.abnormalsecurity.com

thumb_up_off_alt11

chat_bubble_outline1

repeat3

shareShare

Evan Reiser

@evanreiser

a year ago

I am excited to announce the $250M Series D for Abnormal AI. Led by Wellington Management, plus Greylock Partners, Menlo Ventures, Insight Partners, and CrowdStrike Falcon Fund, this milestone is another step in our mission to protect humans using AI. abnormalsecurity.com/blog/building-…

thumb_up_off_alt44

chat_bubble_outline5

repeat5

shareShare

Gokul ⚡️

@gokulns

a year ago

My favorite Mumbai story. The city is just something else.

thumb_up_off_alt1,1K

chat_bubble_outline26

repeat154

shareShare

Yu Su @#ICLR2025

@ysu_nlp

8 months ago

🔥2025 is the year of agents, but are we there yet?🤔 🤯 "An Illusion of Progress? Assessing the Current State of Web Agents" –– our new study shows that frontier web agents may be far less competent (up to 59%) than previously reported! Why were benchmark numbers inflated? -

thumb_up_off_alt230

chat_bubble_outline10

repeat66

shareShare

Tejas Khot

@tjskhot

8 months ago

Gemini 2.5 Pro often responds with several "Options" when you ask subjective questions. I've seen it answer 3 different ways and asking the user to pick whatever suits their needs, all in one shot. Great handling when intent is under-specified.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Tejas Khot

@tjskhot

8 months ago

thumb_up_off_alt1

chat_bubble_outline1

repeat0

shareShare

Sebastian Raschka

@rasbt

7 months ago

As we all know by now, reasoning models often generate longer responses, which raises compute costs. Now, this new paper (arxiv.org/abs/2504.05185) shows that this behavior comes from the RL training process, not from an actual need for long answers for better accuracy. The RL

thumb_up_off_alt1,1K

chat_bubble_outline33

repeat191

shareShare

Andrew Carr (e/🤸)

@andrew_n_carr

7 months ago

Language models thinking step by step to do arithmetic...

thumb_up_off_alt9,9K

chat_bubble_outline40

repeat654

shareShare

Sara Hooker

@sarahookr

7 months ago

It is critical for scientific integrity that we trust our measure of progress. The lmarena.ai has become the go-to evaluation for AI progress. Our release today demonstrates the difficulty in maintaining fair evaluations on lmarena.ai, despite best intentions.

It is critical for scientific integrity that we trust our measure of progress.

The <a href="/lmarena_ai/">lmarena.ai</a> has become the go-to evaluation for AI progress.

Our release today demonstrates the difficulty in maintaining fair evaluations on <a href="/lmarena_ai/">lmarena.ai</a>, despite best intentions.

thumb_up_off_alt712

chat_bubble_outline21

repeat132

shareShare

Tejas Khot

@tjskhot

6 months ago

Gemini really loves explaining via analogies, makes its writing easy to detect in the wild

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Kate Olszewska

@olszewskakate

5 months ago

Gemini 2.5 Pro and Flash are 🚀Generally Available🚀 and with the new 2.5 Flash-Lite capture the quality/price pareto frontier. Read more about them in our Tech Report!