Chenhui Chu (@knccch) Twitter Tweets • TwiCopy

Anoop Kunchukuttan

2 years ago

How can we extend the capabilities of English LLMs to other languages? Sharing a survey that I did recently of the literature in this area: anoopkunchukuttan.gitlab.io/publications/p…

thumb_up_off_alt148

chat_bubble_outline4

repeat36

shareShare

𝐌𝐮𝐥𝐭𝐢𝐥𝐢𝐧𝐠𝐮𝐚𝐥 𝐋𝐋𝐌𝐬 𝐒𝐮𝐫𝐯𝐞𝐲 Multilingual large language models possess the advantage of comprehensively handling multiple languages. This survey paper presents the recent progress as well as emerging trends in multilingual large language models (MLLMs). The

thumb_up_off_alt20

chat_bubble_outline1

repeat8

shareShare

Rohan Paul

@rohanpaul_ai

2 years ago

BREAKING 🔥🤯 Google releases model with new Griffin architecture that outperforms transformers. Across multiple sizes, Griffin out performs the benchmark scores of transformers baseline in controlled tests in both the MMLU score across different parameter sizes as well as the

thumb_up_off_alt503

chat_bubble_outline6

repeat116

shareShare

Murawaki

@murawaki

2 years ago

5/11 (土) 午後に京大情報学研究科知能情報学コースの大学院入試説明会を開催します。コース全体の説明のあとに研究室見学の時間を設けています。今年も現地開催のみ。要事前登録。ist.i.kyoto-u.ac.jp/news/2024/04/1…

thumb_up_off_alt3

chat_bubble_outline0

repeat3

shareShare

Masahiro Oda

@moda0

2 years ago

「継続的な研究費獲得のための考え方」のスライドです．もっと研究費とれている方が書いた方が良いと思いますが... speakerdeck.com/moda0/ji-sok-d…

thumb_up_off_alt89

chat_bubble_outline0

repeat11

shareShare

jietang

@jietang

2 years ago

Very excited to give a keynote at ICLR'24. The slides are available here keg.cs.tsinghua.edu.cn/jietang/public… Hope it useful.

thumb_up_off_alt100

chat_bubble_outline0

repeat16

shareShare

Chenhui Chu

@knccch

2 years ago

指導に関わっていた毛卓遠さんが今年のAAMT長尾賞学生奨励賞の受賞者に選出されました！昨年の趙宇婷さんから続き2年連続の受賞になります。めでたい！ aamt.info/news/nagao-stu…

thumb_up_off_alt20

chat_bubble_outline1

repeat2

shareShare

Naoaki Okazaki

@chokkanorg

2 years ago

#JSAI2024 で「大規模言語モデルの開発」と題し、チュートリアル講演を行いました。事前学習、インストラクションチューニング、アライメント、評価の４部構成で、最近の研究動向や知見を紹介しました。 speakerdeck.com/chokkan/jsai20…

thumb_up_off_alt910

chat_bubble_outline2

repeat252

shareShare

Lei Li

@lileics

a year ago

Muhao, Chaowei, Huan, Leon, Anima and I are presenting a tutorial on Combating Security and Privacy Issues in the Era of Large Language Models at #NAACL2024 in room Don Alberto 4. luka-group.github.io/tutorials/tuto… 🌴Muhao Chen🌴 Chaowei Xiao Huan Sun (OSU) Leon Derczynski ✍🏻 🌞🏠🌲 Prof. Anima Anandkumar

thumb_up_off_alt63

chat_bubble_outline0

repeat10

shareShare

ACLRollingReview

@reviewacl

a year ago

If you haven't been invited to review for ARR 2024 June but are interested in helping us, please fill out this form by June 19: forms.office.com/pages/response…

thumb_up_off_alt40

chat_bubble_outline3

repeat36

shareShare

ACLRollingReview

@reviewacl

a year ago

If you are not yet in the ARR reviewer/AC pool and would be interested in serving as an area chair for ARR 2024 June, please fill out this form by June 20: forms.office.com/pages/response…

thumb_up_off_alt11

chat_bubble_outline0

repeat9

shareShare

Toshinori Sato

@overlast

a year ago

自前でモデル作らない段階でのあるあるが色々と散りばめられており、途中で何度か笑ってしまいました。生成AI応用に取り組んでる人が半年に一回くらい読むと良さそうな記事でした✨ | [翻訳]LLMで1年間開発して学んだこと〜LLMプロダクト開発を成功に導くための実践的ガイド〜 zenn.dev/seya/articles/…

thumb_up_off_alt191

chat_bubble_outline0

repeat31

shareShare

MT Group at FBK

@fbk_mt

a year ago

🎉Great news! Our paper “Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?” got the outstanding paper & area chair's awards!!! 👏 👇arxiv.org/pdf/2402.12025 #NLProc #ACL2024NLP

thumb_up_off_alt32

chat_bubble_outline4

repeat6

shareShare

elvis

@omarsar0

a year ago

Foundations of LLMs This amazing new LLM book just dropped on arXiv. 200+ pages! It covers areas such as pre-training, prompting, and alignment methods. It looks like a great intro to LLMs for devs and researchers.

thumb_up_off_alt4,4K

chat_bubble_outline36

repeat843

shareShare

Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

a year ago

Okay so this is so far the most important paper in AI of the year

thumb_up_off_alt3,3K

chat_bubble_outline52

repeat352

shareShare

Chenhui Chu

@knccch

9 months ago

Google citationがちょうど3000になりました。 scholar.google.co.jp/citations?user…

thumb_up_off_alt44

chat_bubble_outline3

repeat1

shareShare

ℏεsam

@hesamation

8 months ago

the best researchers from Meta, Yale, Stanford, Google DeepMind, and Microsoft laid out all we know about Agents in a 264-page paper [book], here are some of their key findings:

thumb_up_off_alt8,8K

chat_bubble_outline99

repeat1,1K

shareShare

Aadit Sheth

@aaditsh

8 months ago

OpenAI literally dropped a 32-page masterclass on building AI agents

thumb_up_off_alt14,14K

chat_bubble_outline150

repeat1,1K

shareShare

Ruben Hassid

@rubenhssd

6 months ago

BREAKING: Apple just proved AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini don't actually reason at all. They just memorize patterns really well. Here's what Apple discovered: (hint: we're not as close to AGI as the hype suggests)

thumb_up_off_alt12,12K

chat_bubble_outline750

repeat1,1K

shareShare

Chenhui Chu

@knccch

5 months ago

本日付で、「特定」がなくなり、普通の准教授になりました。

thumb_up_off_alt209

chat_bubble_outline10

repeat6

shareShare

Chenhui Chu

Anoop Kunchukuttan

Kalyan KS

Rohan Paul

Murawaki

Masahiro Oda

jietang

Chenhui Chu

Naoaki Okazaki

Lei Li

ACLRollingReview

ACLRollingReview

Toshinori Sato

MT Group at FBK

elvis

Tanishq Mathew Abraham, Ph.D.

Chenhui Chu

ℏεsam

Aadit Sheth

Ruben Hassid

Chenhui Chu