Chenhui Chu (@knccch) 's Twitter Profile
Chenhui Chu

@knccch

Program-Specific Associate Professor @ Kyoto University

ID: 242113487

calendar_today24-01-2011 00:11:28

676 Tweet

271 Followers

450 Following

Anoop Kunchukuttan (@anoopk) 's Twitter Profile Photo

How can we extend the capabilities of English LLMs to other languages? Sharing a survey that I did recently of the literature in this area: anoopkunchukuttan.gitlab.io/publications/p…

How can we extend the capabilities of English LLMs to other languages? Sharing a survey that I did recently of the literature in this area: 

anoopkunchukuttan.gitlab.io/publications/p…
Kalyan KS (@kalyan_kpl) 's Twitter Profile Photo

𝐌𝐮𝐥𝐭𝐢𝐥𝐢𝐧𝐠𝐮𝐚𝐥 𝐋𝐋𝐌𝐬 𝐒𝐮𝐫𝐯𝐞𝐲 Multilingual large language models possess the advantage of comprehensively handling multiple languages. This survey paper presents the recent progress as well as emerging trends in multilingual large language models (MLLMs). The

𝐌𝐮𝐥𝐭𝐢𝐥𝐢𝐧𝐠𝐮𝐚𝐥 𝐋𝐋𝐌𝐬 𝐒𝐮𝐫𝐯𝐞𝐲

Multilingual large language models possess the advantage of comprehensively handling multiple languages.

This survey paper presents the recent progress as well as emerging trends in multilingual large language models (MLLMs).

The
Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

BREAKING 🔥🤯 Google releases model with new Griffin architecture that outperforms transformers. Across multiple sizes, Griffin out performs the benchmark scores of transformers baseline in controlled tests in both the MMLU score across different parameter sizes as well as the

BREAKING 🔥🤯

Google releases model with new Griffin architecture that outperforms transformers.

Across multiple sizes, Griffin out performs the benchmark scores of transformers baseline in controlled tests in both the MMLU score across different parameter sizes as well as the
Masahiro Oda (@moda0) 's Twitter Profile Photo

「継続的な研究費獲得のための考え方」のスライドです.もっと研究費とれている方が書いた方が良いと思いますが... speakerdeck.com/moda0/ji-sok-d…

jietang (@jietang) 's Twitter Profile Photo

Very excited to give a keynote at ICLR'24. The slides are available here keg.cs.tsinghua.edu.cn/jietang/public… Hope it useful.

Very excited to give a keynote at ICLR'24. The slides are available here keg.cs.tsinghua.edu.cn/jietang/public… Hope it useful.
Chenhui Chu (@knccch) 's Twitter Profile Photo

指導に関わっていた毛卓遠さんが今年のAAMT長尾賞 学生奨励賞の受賞者に選出されました!昨年の趙宇婷さんから続き2年連続の受賞になります。めでたい! aamt.info/news/nagao-stu…

Naoaki Okazaki (@chokkanorg) 's Twitter Profile Photo

#JSAI2024 で「大規模言語モデルの開発」と題し、チュートリアル講演を行いました。事前学習、インストラクションチューニング、アライメント、評価の4部構成で、最近の研究動向や知見を紹介しました。 speakerdeck.com/chokkan/jsai20…

Lei Li (@lileics) 's Twitter Profile Photo

Muhao, Chaowei, Huan, Leon, Anima and I are presenting a tutorial on Combating Security and Privacy Issues in the Era of Large Language Models at #NAACL2024 in room Don Alberto 4. luka-group.github.io/tutorials/tuto… 🌴Muhao Chen🌴 Chaowei Xiao Huan Sun (OSU) Leon Derczynski ✍🏻 🌞🏠🌲 Prof. Anima Anandkumar

Muhao, Chaowei, Huan, Leon, Anima and I are presenting a tutorial on 

Combating Security and Privacy Issues in the Era of Large Language Models

at #NAACL2024 in room Don Alberto 4.  

luka-group.github.io/tutorials/tuto…

<a href="/muhao_chen/">🌴Muhao Chen🌴</a> <a href="/ChaoweiX/">Chaowei Xiao</a> <a href="/hhsun1/">Huan Sun (OSU)</a> <a href="/LeonDerczynski/">Leon Derczynski ✍🏻 🌞🏠🌲</a> <a href="/AnimaAnandkumar/">Prof. Anima Anandkumar</a>
ACLRollingReview (@reviewacl) 's Twitter Profile Photo

If you haven't been invited to review for ARR 2024 June but are interested in helping us, please fill out this form by June 19: forms.office.com/pages/response…

ACLRollingReview (@reviewacl) 's Twitter Profile Photo

If you are not yet in the ARR reviewer/AC pool and would be interested in serving as an area chair for ARR 2024 June, please fill out this form by June 20: forms.office.com/pages/response…

Toshinori Sato (@overlast) 's Twitter Profile Photo

自前でモデル作らない段階でのあるあるが色々と散りばめられており、途中で何度か笑ってしまいました。生成AI応用に取り組んでる人が半年に一回くらい読むと良さそうな記事でした✨ | [翻訳]LLMで1年間開発して学んだこと〜LLMプロダクト開発を成功に導くための実践的ガイド〜 zenn.dev/seya/articles/…

MT Group at FBK (@fbk_mt) 's Twitter Profile Photo

🎉Great news! Our paper “Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?” got the outstanding paper & area chair's awards!!! 👏 👇arxiv.org/pdf/2402.12025 #NLProc #ACL2024NLP

🎉Great news! Our paper “Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?” got the outstanding paper &amp; area chair's awards!!! 👏

👇arxiv.org/pdf/2402.12025

#NLProc #ACL2024NLP
elvis (@omarsar0) 's Twitter Profile Photo

Foundations of LLMs This amazing new LLM book just dropped on arXiv. 200+ pages! It covers areas such as pre-training, prompting, and alignment methods. It looks like a great intro to LLMs for devs and researchers.

Foundations of LLMs

This amazing new LLM book just dropped on arXiv. 

200+ pages!

It covers areas such as pre-training, prompting, and alignment methods. 

It looks like a great intro to LLMs for devs and researchers.
ℏεsam (@hesamation) 's Twitter Profile Photo

the best researchers from Meta, Yale, Stanford, Google DeepMind, and Microsoft laid out all we know about Agents in a 264-page paper [book], here are some of their key findings:

the best researchers from Meta, Yale, Stanford, Google DeepMind, and Microsoft laid out all we know about Agents in a 264-page paper [book],

here are some of their key findings:
Ruben Hassid (@rubenhssd) 's Twitter Profile Photo

BREAKING: Apple just proved AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini don't actually reason at all. They just memorize patterns really well. Here's what Apple discovered: (hint: we're not as close to AGI as the hype suggests)

BREAKING: Apple just proved AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini don't actually reason at all.

They just memorize patterns really well.

Here's what Apple discovered:

(hint: we're not as close to AGI as the hype suggests)