Hiroto Kurita (@hiroto_kurita) Twitter Tweets • TwiCopy

Kazuki Fujii

3 months ago

複数データセンター間で大量のデータを転送する際に利用している LFTPの利用方法とhuggingface-cli upload-large-folderの使い分けに関するTipsブログを書きました。大量のデータ (数TB〜数100TB)を別の環境に移送する際に参考にしていただけますと幸いです。 zenn.dev/turing_motors/…

thumb_up_off_alt90

chat_bubble_outline0

repeat26

shareShare

Kotoba Technologies

@kotoba_tech

3 months ago

Kotoba Technologiesはこのたび、アジア太平洋機械翻訳協会(アジア太平洋機械翻訳協会(AAMT))の長尾賞の受賞に選ばれました🏆日本が誇るコンピュータサイエンティストである長尾先生の名を冠したこの賞を受賞できたことを大変光栄に思います。 news.yahoo.co.jp/articles/c0334…

thumb_up_off_alt78

chat_bubble_outline1

repeat16

shareShare

ライブドアニュース

@livedoornews

3 months ago

【日本産】生成AIによる同時翻訳が「予測」の領域にリアルタイム翻訳サービス、遅延は“-0.5秒“ news.livedoor.com/article/detail… サービス「同時通訳」はリアルタイム性と精度の2つで、世界最速レベルを達成。また、相手が次に話す内容を“予知”することで、話す前に文字が表示される“-0.5秒“にまで達した。

thumb_up_off_alt3,3K

chat_bubble_outline47

repeat764

shareShare

Tatsuki Kuribayashi

@ttk_kuribayashi

3 months ago

久々にブログを書きました！（日本語） note.com/kuribya/n/n252…

thumb_up_off_alt62

chat_bubble_outline2

repeat6

shareShare

new paper from our work at Meta! **GPT-style language models memorize 3.6 bits per param** we compute capacity by measuring total bits memorized, using some theory from Shannon (1953) shockingly, the memorization-datasize curves look like this: ___________ / / (🧵)

thumb_up_off_alt3,3K

chat_bubble_outline76

repeat369

shareShare

jack morris

@jxmnop

3 months ago

this gives a pretty good explanation into how models learn in particular, it explains grokking grokking occurs *exactly* when capacity saturates. this is where models can't perfectly fit every training example, so they have to share info bt examples in a smart way

thumb_up_off_alt340

chat_bubble_outline8

repeat19

shareShare

jack morris

@jxmnop

3 months ago

hello twittersphere! i am planning to graduate in a few months, so i am officially ✨ Looking For A Job ✨ if you know of a role that'd be a good fit, or just want to chat, please reach out! here are some projects i've worked on that i'm most proud of 👇

thumb_up_off_alt841

chat_bubble_outline33

repeat51

shareShare

Apple Hub

@theapplehub

3 months ago

Apple introduces backgrounds in Messages #WWDC25

thumb_up_off_alt17,17K

chat_bubble_outline211

repeat1,1K

shareShare

Apple Hub

@theapplehub

3 months ago

Apple introduces live translation in Messages, FaceTime and Calls #WWDC25

thumb_up_off_alt2,2K

chat_bubble_outline36

repeat277

shareShare

Richard Wei

@rxwei

3 months ago

On behalf of the whole team I'm so proud to introduce the Foundation Models framework, an API to access our on-device LLM! Check out the Platforms State of the Union for an introduction and 4 sessions later today! developer.apple.com/news/?id=us98z… #WWDC25

thumb_up_off_alt241

chat_bubble_outline14

repeat46

shareShare

Nathan Lambert

@natolambert

3 months ago

Apple exposing a developer API for their on device AI models is a major step in the right direction for them and the open model ecosystem. This is going to open up new feedback loops on models and a major platform for AI applications.

thumb_up_off_alt259

chat_bubble_outline8

repeat23

shareShare

福島良典 | LayerX

@fukkyy

3 months ago

プロンプトエンジニアリングがいらなくなったという風潮がよくわからない。メタプロンプト的な指示の書き方、Agentic RAG的な外部知識の引き出し方、自己改善的なメモリ更新、外部ツールの使い方を適切に教える…etc

thumb_up_off_alt461

chat_bubble_outline2

repeat42

shareShare

たつお

@tatsuokundayo

3 months ago

iOS26/macOS26のApple Intelligenceで使われてる3b on-device modelとserver modelの詳細が公開されてた！基盤モデルのお話です〜 machinelearning.apple.com/research/apple…

thumb_up_off_alt86

chat_bubble_outline0

repeat17

shareShare

Jungo Kasai 笠井淳吾

@jungokasai

2 months ago

Finally closed our $11M+ funding round! Backed by top Japanese VCs and amazing angel investors including Joi Ito, Thomas Wolf from Hugging Face, Noah A. Smith, Luke Zettlemoyer, and Sasha Rush. Now it’s time to focus on commercialization and tech development!!

thumb_up_off_alt92

chat_bubble_outline6

repeat12

shareShare

Sloth🦥

@sloth65557166

2 months ago

「Core Audio tapを使ったリアルタイム音声処理のお話」という題目で明日のFlutter TokyoでLTします！ macOSデスクトップアプリで、音声アプリの幅が広がるよ〜〜〜な話をします😎 flutter-jp.connpass.com/event/359088/

thumb_up_off_alt22

chat_bubble_outline0

repeat8

shareShare

Ruoming Pang

@ruomingpang

2 months ago

Proud to share our report on AXLearn (github.com/apple/axlearn), the code base for building Apple Foundation Models: arxiv.org/abs/2507.05411.

thumb_up_off_alt340

chat_bubble_outline16

repeat56

shareShare

Tatsuki Kuribayashi

@ttk_kuribayashi

2 months ago

8月から MBZUAI にて助教を務めることになりました。引き続き（NLPと言語学を橋渡しできるような）興味深い仕事ができればと思います。小さなチームも持ち、ポスドク・ビジター探しております。日本との共同研究も強固にしたく、今後ともよろしくお願いいたします！ 👉 kuribayashi4.github.io

thumb_up_off_alt165

chat_bubble_outline4

repeat14

shareShare

Anne Wu

@anne_youw

2 months ago

🗣️We can listen and speak simultaneously when we talk, and so should the spoken dialogue models (SDMs)! 💬Unlike typical "walkie-talkie" voice AIs, full-duplex SDMs let both sides talk at once - more like real, natural conversation. But this makes alignment harder: - No

thumb_up_off_alt63

chat_bubble_outline1

repeat12

shareShare

Kotoba Technologies

@kotoba_tech

2 months ago

久しぶりに Zennで記事を公開しました！ Keisuke Kamahori が Kotoba を支える MLSys周りの最先端技術をまとめてくれました。ぜひご一読ください。 Kotoba では MLSys エンジニアの採用を最強化中です。弊社 X のヘッダーからぜひご応募ください！ zenn.dev/kotoba_tech/ar…

thumb_up_off_alt300

chat_bubble_outline0

repeat52

shareShare

Kazuki Fujii

@okoge_kaz

a month ago

👀 > Each attention head has a learned bias in the denominator of the softmax, similar to off-by-one attention and attention sinks [14][15], which enables the attention mechanism to pay no attention to any tokens. cdn.openai.com/pdf/419b6906-9…

thumb_up_off_alt21

chat_bubble_outline0

repeat2

shareShare