Zoro (@younghoax20)'s Twitter Profile
Zoro

@younghoax20

#AI

ID: 1151621377

Joined: 05-02-2013 18:15:04

440 Tweets

479 Followers

7.7K Following

steve hsu (@hsu_steve)

Is Chain-of-Thought Reasoning of LLMs a Mirage?

... Our results reveal that CoT reasoning is a brittle mirage that vanishes when it is pushed beyond training distributions. This work offers a deeper understanding of why and when CoT reasoning fails, emphasizing the ongoing
tetsuo.ai 💹🧲 (@7etsuo)

xAI Grok Updates (Last 24H):
- Grok 4 boosts PDF handling for massive files!
- iOS app v1.1.40 improves sound & Imagine.
- Kids Mode hits Android soon.
- Longer vids in Imagine.
- 44M images created, app #2 in Productivity!
- Art Contest: Most-liked pics in X feed.
- Tesla

Rohan Paul (@rohanpaul_ai)

Fantastic paper from AI at Meta

Reasoning LLMs hallucinate more on long answers, and the authors show why and fix it with a new reward that balances accuracy, detail, and relevance. 

Their online RL recipe cuts hallucinations by 23.1 points, raises factual detail by 23%, and
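As a rough, hypothetical sketch of the kind of composite reward the tweet describes (the component scores, weights, and names below are assumptions for illustration, not the paper's actual recipe):

```python
# Hypothetical sketch: a reward that trades off factual accuracy, level of
# detail, and relevance, as the tweet describes. The weights and the idea that
# each component is a per-answer score in [0, 1] are assumptions.

def composite_reward(accuracy: float, detail: float, relevance: float,
                     w_acc: float = 0.5, w_det: float = 0.25, w_rel: float = 0.25) -> float:
    """Weighted mix of per-answer scores, each assumed to lie in [0, 1]."""
    return w_acc * accuracy + w_det * detail + w_rel * relevance

# Example: a long answer that is detailed and relevant but only partly accurate.
print(composite_reward(accuracy=0.6, detail=0.9, relevance=0.8))  # 0.725
```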
Rohan Paul (@rohanpaul_ai)

Large language models often sound sure even when they are wrong.

This paper teaches a model to treat its own confidence as a training reward, which tightens calibration and improves reasoning without any human labels.

Here is how it works.

The model generates several chain of
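A minimal, hypothetical sketch of the general idea the tweet describes: sample several chain-of-thought answers to the same prompt and use the model's own agreement as a label-free reward signal. The sampling function and the exact reward shaping here are assumptions, not the paper's method.

```python
import collections
import random

def generate_cot_answer(prompt: str) -> str:
    # Placeholder: in practice this would sample the LLM with temperature > 0
    # and extract the final answer from its chain of thought.
    return random.choice(["A", "A", "B"])

def self_confidence_rewards(prompt: str, n_samples: int = 8) -> dict:
    # Sample several chains of thought for the same prompt.
    answers = [generate_cot_answer(prompt) for _ in range(n_samples)]
    counts = collections.Counter(answers)
    # Reward each distinct final answer by the fraction of samples that agree
    # with it, i.e. the model's own confidence, with no human labels involved.
    return {ans: count / n_samples for ans, count in counts.items()}

if __name__ == "__main__":
    print(self_confidence_rewards("What is 17 * 24?"))
```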