Toshinori Kitamura (@t_kitamura14) Twitter Tweets • TwiCopy

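This is a summary of selected results from our lab this fiscal year, prepared for the RIKEN AIP progress report meeting on March 14. It features two main results: - Information-theoretic optimality of neural network training by gradient methods - Theory of in-context learning. (Many other interesting results have come out as well, but they are omitted for lack of space.) aip.riken.jp/sympo/sympo202…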

Likes: 247 · Replies: 0 · Reposts: 41

Our recent work "Provably Efficient RL under Episode-Wise Safety in Linear CMDPs" is now on arXiv! We propose the first computationally efficient RL algorithm with √K regret and episode-wise safety guarantees in linear CMDPs. arxiv.org/abs/2502.10138

Likes: 22 · Replies: 0 · Reposts: 4

Masatoshi Uehara

@masa_uehara_1

7 months ago

Test-Time Alignment for Complex Reward Functions? We introduce a test-time, reward-guided iterative refinement algorithm for diffusion models. masatoshiuehara.com/research/rerd

Likes: 198 · Replies: 2 · Reposts: 32

kaitos

@63556poiuytrewq

7 months ago

Ooh, I want this (I pre-ordered it).

Likes: 8 · Replies: 0 · Reposts: 1

Taiji Suzuki

@btreetaiji

7 months ago

Our paper showing that even problems that are hard to learn directly become easy for a Transformer to learn with chain of thought has been accepted to ICLR 2025 as an oral presentation. Kim & Suzuki: Transformers Provably Solve Parity Efficiently with Chain of Thought. ICLR 2025. openreview.net/forum?id=n2Nid…

Likes: 549 · Replies: 2 · Reposts: 89

Toshinori Kitamura

@t_kitamura14

7 months ago

I’ve completed my Ph.D.! I’m deeply grateful to everyone who supported me along the way, from mentors and colleagues to friends and family. Thank you all!

Likes: 82 · Replies: 5 · Reposts: 1

Toshinori Kitamura

@t_kitamura14

7 months ago

I forgot to post this, but my doctoral dissertation received the Department Chair's Award 🏆 Thank you, everyone 🙇‍♂️

Likes: 36 · Replies: 0 · Reposts: 0

ばかなおうじ（あべけんし）

@bakanaouji

7 months ago

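Our paper has been accepted to ICLR! It further develops the equilibrium learning method, applicable to problems such as minimax optimization, that we presented at ICML last year. To everyone attending in person in Singapore, I look forward to seeing you! arxiv.org/abs/2410.02388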

Likes: 66 · Replies: 0 · Reposts: 3

Dylan Foster 🐢

@canondetortugas

7 months ago

Reinforcement learning has led to amazing breakthroughs in reasoning (e.g., R1), but can it discover truly new behaviors not already present in the base model? New paper with Zak Mhammedi and Dhruv Rohatgi: The Computational Role of the Base Model in Exploration thread:

Likes: 705 · Replies: 10 · Reposts: 108

部品（吉岡里帆）

@tjmlab

6 months ago

しっかり学ぶ数理最適化 amzn.to/4iVy0Kk Only 500 yen, today only!!! This is seriously a masterpiece. Right now that is less than 1 yen per page!!! You have to buy it!!!

Likes: 7 · Replies: 0 · Reposts: 3

Tech OMRON / オムロンテクノロジー

@tech_omron

6 months ago

OMRON SINIC X will present its latest research findings at #ICLR2025, an internationally prestigious top conference in the field of machine learning. omron.com/sinicx/activit…

Likes: 24 · Replies: 0 · Reposts: 4

Tadashi Kozuno

@tdash_koz

6 months ago

Our former intern Kitamura-san will present a paper on the theory of safe reinforcement learning. If you are attending ICLR, please stop by the poster.

Likes: 11 · Replies: 0 · Reposts: 4

Toshinori Kitamura

@t_kitamura14

6 months ago

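I will be presenting reinforcement learning research at ICLR this month! For tabular Markov decision processes, we developed a method that achieves robustness and constrained policy design at the same time. It comes with a theoretical guarantee of convergence to a near-optimal solution. If you are there in person, please come by 🙏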

Likes: 46 · Replies: 0 · Reposts: 5

Toshinori Kitamura

@t_kitamura14

6 months ago

I am heading to Singapore tomorrow to attend ICLR 2025. If you are attending in person, I would be happy if you came to see it 🙆‍♂️ iclr.cc/virtual/2025/p…

Likes: 13 · Replies: 0 · Reposts: 2

Toshinori Kitamura

@t_kitamura14

5 months ago

Likes: 6 · Replies: 0 · Reposts: 0

Toshinori Kitamura

@t_kitamura14

5 months ago

I have arrived at Haneda.

Likes: 1 · Replies: 0 · Reposts: 0

や

@syagishita917

5 months ago

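I have released a new preprint. By giving a convergence analysis of proximal gradient methods with a generalized proximal term, without assuming properties such as Lipschitz continuity of the gradient, it makes proximal-gradient-type algorithms efficiently applicable to a variety of problems that were previously out of scope. The framework is more general than the Bregman proximal gradient method.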

Likes: 85 · Replies: 0 · Reposts: 12

鴨井遼

@ryokamoi_ja

5 months ago

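I passed my comprehensive exam and became a PhD candidate, so I wrote a blog post about the first two years of my PhD program. A record of studying abroad: "A Record of the First Two Years of a US CS PhD Program (I Became a PhD Candidate)" ryokamoi.blogspot.com/2025/05/cs-2ph…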

Likes: 65 · Replies: 0 · Reposts: 8
