acc-mu3n (@acceleratedmu3n) Twitter Tweets • TwiCopy

NVIDIA AI Developer

a month ago

Dynamo 0.4 is here and delivers 4x inference performance on Blackwell with disaggregated serving. ⚡️ New features include: • SLO-based disaggregated autoscaling • New disaggregated sizing tool • Real time LLM specific observability metrics • Fault tolerance inflight

thumb_up_off_alt93

chat_bubble_outline4

repeat20

shareShare

acc-mu3n

@acceleratedmu3n

a month ago

最近、自分のキャパを超えたものを捌き続けなくてはいけず、悩んでましたが、昨日は特にボロボロでした。今日はもう少し頑張りたい。

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

acc-mu3n

@acceleratedmu3n

a month ago

精神と時の部屋欲しい

thumb_up_off_alt3

chat_bubble_outline1

repeat0

shareShare

acc-mu3n

@acceleratedmu3n

a month ago

時々個人が特定出来るpostをしてしまってはいるものの、アホな投稿をし過ぎていて公式アカとしてBioに晒すのは憚れる

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Kazuki Fujii

@okoge_kaz

a month ago

AWSでH100を1枚単位で借りられるようになったようです👀 従来は、8GPU 1 Instance単位でないと借りられなかったので、利用を控えていた方多いと思いますが、これは朗報ですね。 aws.amazon.com/jp/about-aws/w…

thumb_up_off_alt177

chat_bubble_outline0

repeat34

shareShare

Bryce Adelstein Lelbach

@blelbach

a month ago

Want to learn CUDA? We're teaching tutorials at NDC TechTown in Norway this September: CUDA C++ (Sep 22): nvda.ws/45EITdZ CUDA Python (Sep 23): nvda.ws/45C5uId Through hands-on exercises, we'll teach you how to write, benchmark, profile, and optimize GPU code!

thumb_up_off_alt307

chat_bubble_outline3

repeat39

shareShare

tsuki

@tensorcore

a month ago

100 🌟! github.com/wmmae/wmma_ext…

thumb_up_off_alt27

chat_bubble_outline0

repeat5

shareShare

acc-mu3n

@acceleratedmu3n

a month ago

体がバキバキ過ぎる…

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

CuPy

@cupy_team

a month ago

CuPy v13.6 is out, now with CUDA 13 support! 🚀 Install with: pip install cupy-cuda13x

thumb_up_off_alt23

chat_bubble_outline1

repeat12

shareShare

Oleksii Kuchaiev

@kuchaev

a month ago

We are excited to release Nvidia-Nemotron-Nano-V2 model! This is a 9B hybrid SSM model with open base model and training data. This model also supports runtime "thinking" budget control. HF collection with base and post trained models: huggingface.co/collections/nv…

thumb_up_off_alt286

chat_bubble_outline9

repeat54

shareShare

Pavlo Molchanov

@pavlomolchanov

a month ago

📢New efficient Hybrid-SLM from NVIDIA-Nemotron-Nano-v2-9B: ❗️6x faster than Qwen3-8B because of Hybrid (Mamba2+Attention) design. We tried something new: pretrain & align a 12B reasoning model → compress to 9B. First real stab at reasoning-model compression. Key takeaways

thumb_up_off_alt79

chat_bubble_outline1

repeat16

shareShare

ちいかわ💫アニメ火金

@ngnchiikawa

a month ago

☀️

thumb_up_off_alt311,311K

chat_bubble_outline1,1K

repeat36,36K

shareShare

Shinnosuke Furuya

@sfuruyaz

a month ago

今年の「不老」ユーザ会、NVIDIAからは第2回でも登場した村上が講演します。みなさまぜひご参加を。 LLM開発を支えるエヌビディアの生成AIエコシステム / 村上　真奈（エヌビディア合同会社）

thumb_up_off_alt11

chat_bubble_outline0

repeat4

shareShare

acc-mu3n

@acceleratedmu3n

a month ago

体調が悪かった理由が判明した…そういう事だったのか…私…

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Naoaki Okazaki

@chokkanorg

a month ago

事後学習済みLLM向け評価フレームワーク swallow-evaluation-instruct を開発し、MIT Licenseで公開しました。日本語と英語の高難易度ベンチマークに対応しており、統一された条件のもとで最先端LLMの性能を適切に測定できる新しい評価基盤です。 GitHub: github.com/swallow-llm/sw…

thumb_up_off_alt166

chat_bubble_outline1

repeat50

shareShare

Daisuke Okanohara / 岡野原大輔

@hillbig

a month ago

Nemotron Nano 2 9B-v2はMamba-Transformerのハイブリット型言語モデルで、長い思考トレースの生成コストを抑え、22GB GPU1枚で128kトークン長の推論を実現、同規模モデルと比べ6倍のスループットを実現。事前・事後学習データセットが大幅に改善され、長文理解・数学・コードの性能が特に強い

thumb_up_off_alt49

chat_bubble_outline1

repeat7

shareShare

Daisuke Okanohara / 岡野原大輔

@hillbig

a month ago

Nemotron Nano 2: research.nvidia.com/labs/adlr/NVID… Paper: research.nvidia.com/labs/adlr/file…

thumb_up_off_alt3

chat_bubble_outline0

repeat1

shareShare

acc-mu3n

@acceleratedmu3n

a month ago

あ、esta...

thumb_up_off_alt2

chat_bubble_outline1

repeat0

shareShare

理化学研究所（理研）

@riken_jp

a month ago

理化学研究所、富士通およびNVIDIAとの国際連携による「富岳NEXT」開発体制を始動 riken.jp/pr/news/2025/2…

thumb_up_off_alt130

chat_bubble_outline1

repeat57

shareShare

acc-mu3n

NVIDIA AI Developer

acc-mu3n

acc-mu3n

acc-mu3n

Kazuki Fujii

Bryce Adelstein Lelbach

tsuki

acc-mu3n

CuPy

Oleksii Kuchaiev

Pavlo Molchanov

ちいかわ💫アニメ火金

Shinnosuke Furuya

acc-mu3n

Naoaki Okazaki

Daisuke Okanohara / 岡野原 大輔

Daisuke Okanohara / 岡野原 大輔

acc-mu3n

理化学研究所（理研）

Daisuke Okanohara / 岡野原大輔

Daisuke Okanohara / 岡野原大輔