YI (@y_imjk) Twitter Tweets • TwiCopy

YI

@y_imjk

8 months ago

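Lately I've been genuinely envious of AI: it can run without sleep, it doesn't have to cook meals or do the cleaning and laundry, and its condition is basically constant, so its stability is outstanding... Thinking about all that at 1:30 a.m., I find myself looking back on my own life and figuring I should live a bit more like a human.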

Likes: 1 · Replies: 0 · Reposts: 0

Kosuke Nakago

@corochann

8 months ago

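About the Sakana AI AI Agent study session held on 4/8: many people who kindly applied could not be invited, so the slides used in the second half are now public. The ecosystem has already grown so much that not everything could be covered; the slides give a broad overview of trends from a research-leaning perspective. speakerdeck.com/sakana_ai/2025…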

Likes: 512 · Replies: 0 · Reposts: 85

Sakana AI

@sakanaailabs

7 months ago

Our researchers love to cook 🍰

Likes: 174 · Replies: 4 · Reposts: 12

Sam Altman

@sama

7 months ago

goodbye, GPT-4. you kicked off a revolution. we will proudly keep your weights on a special hard drive to give to some historians in the future.

Likes: 44,44K · Replies: 1,1K · Reposts: 2,2K

YI

@y_imjk

7 months ago

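As an undergrad I used to hand in my lab reports earlier than anyone else, yet now I somehow find myself barely scraping by from one deadline to the next... why...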

Likes: 2 · Replies: 0 · Reposts: 0

Google AI

Gemini Diffusion, our newest research model, is significantly faster than our fastest model so far AND matches its coding performance. By correcting errors as the model thinks, it is extremely fast for editing tasks like math and coding.

Likes: 68 · Replies: 1 · Reposts: 13

Sakana AI

@sakanaailabs

6 months ago

Following our Sudoku-based reasoning benchmark announcement, we've been evaluating the latest models to track improvements in their reasoning capabilities. Today, we’re launching the Sudoku-Bench Leaderboard: pub.sakana.ai/sudoku/ New technical report: arxiv.org/abs/2505.16135

Likes: 357 · Replies: 14 · Reposts: 60

YI

@y_imjk

6 months ago

All sorts of products are free for students these days, and I'm envious...

Likes: 4 · Replies: 0 · Reposts: 0

YI

@y_imjk

6 months ago

Mysterious predatory-journal(?) emails occasionally slip past my spam filter, and I wonder why they address me as Dr. or Prof. ... (probably just because they're automated, though)

Likes: 4 · Replies: 0 · Reposts: 0

DeepSeek

@deepseek_ai

6 months ago

🚀 DeepSeek-R1-0528 is here! 🔹 Improved benchmark performance 🔹 Enhanced front-end capabilities 🔹 Reduced hallucinations 🔹 Supports JSON output & function calling ✅ Try it now: chat.deepseek.com 🔌 No change to API usage — docs here: api-docs.deepseek.com/guides/reasoni… 🔗

Likes: 9,9K · Replies: 386 · Reposts: 1,1K

Luke Darlow

@learningluked

6 months ago

If you’re interested in learning about Continuous Thought Machines (sakana.ai/ctm/), we made interactive notebook tutorials so you can hack around with CTMs ImageNet: github.com/SakanaAI/conti… MNIST Tutorial: github.com/SakanaAI/conti… Let me know if you have any feedback!

Likes: 816 · Replies: 14 · Reposts: 135

Shashank Kotyan

@shashankkotyan

6 months ago

🚨 Excited to present our paper Percept-Lens at the #ReGenAI Workshop at #CVPR2025! We introduce a 36M-image benchmark to test generalization in AI-generated image detection across 26 datasets & 16+ generative models. 🔍 Benchmark: dataverse.harvard.edu/dataverse/perc…

Likes: 11 · Replies: 1 · Reposts: 4

松井研 / Matsui Lab

@utokyo_bunny

6 months ago

I'll present my poster paper at #CVPR2025 on June 15! I propose an extremely fast post-processing module for diverse nearest-neighbor searches🚀 Y. Matsui, "LotusFilter: Fast Diverse Nearest Neighbor Search via a Learned Cutoff Table" arxiv.org/abs/2506.04790

Likes: 20 · Replies: 0 · Reposts: 6

Sakana AI

@sakanaailabs

6 months ago

Introducing ALE-Bench, ALE-Agent! Towards Automating Long-Horizon Algorithm Engineering for Hard Optimization Problems Blog: sakana.ai/ale-bench/ Paper: arxiv.org/abs/2506.09050 ALE-Bench is a coding benchmark primarily focused on hard optimization (NP-hard) problems. We

Likes: 184 · Replies: 0 · Reposts: 53

Takuya Akiba

@iwiwi

6 months ago

AI will soon master Codeforces. So, what's the next challenge? 🚀Introducing ALE-Bench (ALgorithm Engineering Benchmark) 🏆 A new frontier benchmark for algorithmic coding, designed to test long-horizon reasoning on complex problems through trial and error. 🤖What is ALE-Bench?

Likes: 53 · Replies: 0 · Reposts: 20
