Chenglong (@fatedier) Twitter Tweets • TwiCopy

Chenglong

@fatedier

+ Follow

Open-source enthusiast | Creator of frp: a fast reverse proxy | Currently focusing on AI Agents

ID: 1473317059035496448

linkhttps://github.com/fatedier/frp calendar_today21-12-2021 15:39:08

96 Tweet

43 Followers

7 Following

Chenglong

@fatedier

a year ago

Why does switching to o1-preview make ChatGPT respond immediately without thinking process? It doesn’t seem right.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Chenglong

@fatedier

a year ago

The issue was finally resolved after updating to Chrome version 131. It seems like it was a browser-related bug.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

🚀 DeepSeek-R1-Lite-Preview is now live: unleashing supercharged reasoning power! 🔍 o1-preview-level performance on AIME & MATH benchmarks. 💡 Transparent thought process in real-time. 🛠️ Open-source models & API coming soon! 🌐 Try it now at chat.deepseek.com #DeepSeek

thumb_up_off_alt4,4K

chat_bubble_outline265

repeat802

shareShare

Chenglong

@fatedier

a year ago

I’ve tested many models, but so far, only o1-preview can solve this problem: Using the numbers 5, 5, 5, 5, and 5, each number must be used exactly once and no more than once. Can they be combined using addition, subtraction, multiplication, and division to result in 24?

thumb_up_off_alt2

chat_bubble_outline1

repeat0

shareShare

Chenglong

@fatedier

a year ago

In the test results of my agent project, it performed significantly better than gpt-4o-2024-08-06. However, there’s still a noticeable gap compared to claude-3.5-sonnet(new).

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Chenglong

@fatedier

a year ago

QwQ 32B Preview is very impressive, and the price on OpenRouter is 1/100 of o1-preview. Qwen Awesome!

QwQ 32B Preview is very impressive, and the price on OpenRouter is 1/100 of o1-preview. <a href="/Alibaba_Qwen/">Qwen</a> Awesome!

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Chenglong

@fatedier

a year ago

An excellent platform for testing various models, with very fast support for new models.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Chenglong

@fatedier

a year ago

Unfortunately, the full O1 is now almost 100% unable to solve this problem.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Chenglong

@fatedier

a year ago

Best joke of the year: TGA 😂

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Chenglong

@fatedier

a year ago

Surprised to see that in my agent project, claude-3.5-sonnet is still outperforming other models after 6 months! Looking forward to GPT-4.5, Claude 4, and Grok 3, but not fully convinced there will be a massive leap in performance just yet.

thumb_up_off_alt1

chat_bubble_outline1

repeat0

shareShare

Chenglong

@fatedier

a year ago

The advancements of the new ChatGPT-4o in writing have surprised me greatly, far surpassing other models.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Chenglong

@fatedier

a year ago

grok3 #1 across all categories. It turned out better than expected, can’t wait to test it out!

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Chenglong

@fatedier

a year ago

After initial testing, grok3 from xAI delivers surprisingly impressive results! Can't wait to dive deeper with my agent project once the API opens up. Exciting times ahead!

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Chenglong

@fatedier

a year ago

For paid users, using a downgraded model to respond when resources are insufficient, without any notification, is simply fraudulent behavior. Grok is currently the best platform. OpenAI seems more closed off compared to other companies.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Chenglong

@fatedier

a year ago

I’ve noticed that very few people have paid attention to qwq 32b + groq, 400t/s, with reasoning capabilities comparable to top-tier models—enough to change everything.

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare