Yao Lu (@yaolu_nlp) 's Twitter Profile
Yao Lu

@yaolu_nlp

PhD Student @ucl_nlp, former member of @UWaterloo @Mila_Quebec and @AmiiThinks

ID: 835838876174295040

Link: http://yaolu.github.io · Joined: 26-02-2017 13:08:12

31 Tweets

229 Followers

546 Following

evolvingstuff (@evolvingstuff) 's Twitter Profile Photo

Transformers as Soft Reasoners over Language

"we explore whether transformers can similarly learn to reason (or emulate reasoning), but using rules expressed in language, thus bypassing a formal representation."

arxiv.org/abs/2002.05867
Datasets and demo: rule-reasoning.apps.allenai.org
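
For context, the task behind this paper states both the facts and the rules in plain English and asks the model for a true/false judgment. A purely illustrative instance (the field names below are made up, not the dataset's actual schema) might look like:

```python
# Illustrative toy instance: facts and rules are plain English, and the
# model must judge the question True/False by chaining them "softly".
example = {
    "context": [
        "Erin is a cat.",
        "If someone is a cat then they are an animal.",
        "All animals can move.",
    ],
    "question": "Erin can move.",
    "answer": True,  # follows by applying the two rules to the first fact
}
print(example["question"], "->", example["answer"])
```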
Russ Salakhutdinov (@rsalakhu) 's Twitter Profile Photo

#ICLR2020 paper on Differentiable Reasoning over a Virtual Knowledge Base: Efficient, end-to-end differentiable framework for doing complex multi-hop QA over a large text corpus.

arxiv.org/abs/2002.10640

with Dhingra, Zaheer, Balachandran, Graham Neubig, William Cohen
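
As a rough intuition for the "virtual knowledge base" idea (a conceptual toy, not the paper's implementation; all matrices below are random stand-ins), one reasoning hop can be written as a soft matrix product over entity/mention co-occurrences, which keeps the whole multi-hop chain differentiable:

```python
import numpy as np

# Conceptual sketch only: the corpus is summarized by entity/mention
# co-occurrence matrices, and a reasoning "hop" is a soft matrix product
# rather than a discrete KB lookup, so multi-hop QA stays differentiable
# end to end.
rng = np.random.default_rng(0)
n_entities, n_mentions = 6, 10
entity_to_mention = rng.random((n_entities, n_mentions))  # entity -> its mentions in text
mention_to_entity = rng.random((n_mentions, n_entities))  # mention -> entities it links to

belief = np.zeros(n_entities)
belief[0] = 1.0                    # start from the entity named in the question
for _ in range(2):                 # two hops of "follow the mentions"
    belief = belief @ entity_to_mention @ mention_to_entity
    belief /= belief.sum()         # keep it a distribution over entities

print("most likely answer entity:", belief.argmax())
```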
Marinka Zitnik (@marinkazitnik) 's Twitter Profile Photo

We will present a tutorial on Machine Learning for Drug Development at #IJCAI2020! Materials to follow on our website: zitniklab.hms.harvard.edu/drugml IJCAIconf #drugs #networks #AI

Mike Lewis (@ml_perception) 's Twitter Profile Photo

Happy to share MARGE, our new work on rethinking pre-training: given a document, we first retrieve related documents, and then paraphrase these to reconstruct the original. MARGE works well for generation and classification in many languages, sometimes without supervision. (1/6)
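
The retrieve-then-reconstruct objective described in the tweet can be sketched roughly as below. This is a toy illustration under assumed stand-in modules (a mean-of-embeddings encoder and a linear decoder head), not the MARGE architecture or code:

```python
# Toy sketch of retrieve-then-reconstruct pre-training (stand-in modules,
# not the MARGE code).
import torch
import torch.nn.functional as F

torch.manual_seed(0)
d, vocab, doc_len, corpus_size = 16, 100, 12, 32

embed = torch.nn.Embedding(vocab, d)   # shared document encoder (mean of token embeddings)
decode = torch.nn.Linear(d, vocab)     # toy decoder head

def encode(doc: torch.Tensor) -> torch.Tensor:
    return embed(doc).mean(dim=0)      # (d,)

target = torch.randint(vocab, (doc_len,))
corpus = torch.randint(vocab, (corpus_size, doc_len))

# 1. Retrieve: score every candidate document against the target document.
q = encode(target)
cand = torch.stack([encode(doc) for doc in corpus])  # (corpus_size, d)
scores = cand @ q
evidence_idx = scores.topk(4).indices                # keep 4 "evidence" documents

# 2. Reconstruct: predict the target's tokens from a retrieval-weighted mix
#    of the evidence, so gradients also reach the retrieval scores.
weights = scores[evidence_idx].softmax(dim=0)                  # (4,)
evidence = (weights[:, None] * cand[evidence_idx]).sum(dim=0)  # (d,)
logits = decode(evidence).expand(doc_len, -1)                  # toy: same logits per position
loss = F.cross_entropy(logits, target)
loss.backward()
print("reconstruction loss:", float(loss))
```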
Yuxiang (Jimmy) Wu (@yuxiangjwu) 's Twitter Profile Photo

Introducing ChatArena 🏟 - a Python library of multi-agent language game environments that facilitates communication and collaboration between multiple large language models (LLMs)! 🌐🤖

Check out our GitHub repo: github.com/chatarena/chat…

#ChatArena #NLP #AI #LLM 1/8 🧵
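
For readers new to the pattern, the sketch below shows the kind of round-robin game loop such an environment coordinates. It is not ChatArena's API; the Agent class is a stub standing in for an LLM-backed player:

```python
# Not ChatArena's API -- a minimal sketch of a round-robin multi-agent
# conversation loop, with a stub Agent in place of an LLM-backed player.
from dataclasses import dataclass, field

@dataclass
class Agent:
    name: str
    history: list[str] = field(default_factory=list)

    def act(self, observation: str) -> str:
        self.history.append(observation)
        # A real player would call an LLM here; we return a canned reply.
        return f"{self.name} responds to: {observation!r}"

def run_game(agents: list[Agent], opening: str, num_rounds: int = 2) -> list[str]:
    transcript, message = [opening], opening
    for _ in range(num_rounds):
        for agent in agents:              # each agent observes the latest message
            message = agent.act(message)  # ...and produces the next one
            transcript.append(message)
    return transcript

for line in run_game([Agent("Alice"), Agent("Bob")], "Let's negotiate a price."):
    print(line)
```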
Oana-Maria Camburu (@oanacamb) 's Twitter Profile Photo

🚨💫We are delighted to have Shishir Patil at our UCL Computer Science NLP Meetup *Monday 1st Nov 6:30pm GMT*

The event will be *hybrid*

Due to room capacity, there are *two links* to sign up depending on whether you attend in person or online 

Details in: meetup.com/ucl-natural-la…
Alex Warstadt (@a_stadt) 's Twitter Profile Photo

LLMs are now trained on >1000x as much language data as a child, so what happens when you train a "BabyLM" on just 100M words?

The proceedings of the BabyLM Challenge are now out along with our summary of key findings from 31 submissions: aclanthology.org/volumes/2023.c…

Some highlights 🧵
Graham Neubig (@gneubig) 's Twitter Profile Photo

Often prompt engineering focuses on the *content* of the prompt, but in reality *formatting* of the prompt can have an equal or larger effect, especially for less powerful models. This is a great deep dive into this phenomenon by Melanie Sclar et al.
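
As a rough illustration of that point (the field names and separators below are arbitrary choices, not the setup from the paper), the snippet renders the exact same content under several surface formats:

```python
# Same content, different surface formats: every variant below carries
# identical information, yet formatting changes like these alone can shift
# a model's accuracy, especially for smaller models.
from itertools import product

separators = [": ", " - ", ":\n"]
field_casings = [str.lower, str.upper, str.title]
passage = "Prompt formatting can matter as much as prompt content."

for sep, case in product(separators, field_casings):
    prompt = f"{case('passage')}{sep}{passage}\n{case('answer')}{sep}"
    print(repr(prompt))
```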

Weco AI (@wecoai) 's Twitter Profile Photo

We're excited to announce AIDE has become the first human-level AI agent for data science!
AIDE outperforms half of human data scientists on a wide range of Kaggle competitions, surpassing conventional AutoML, LangChain agents, and ChatGPT with human assistance. 🏆
Loubna Ben Allal (@loubnabenallal1) 's Twitter Profile Photo

🍷 FineWeb technical report is out and so is 📚 FineWeb-Edu, a 1.3-trillion-token dataset that outperforms all other open web datasets, with remarkable improvements on educational benchmarks such as MMLU, ARC, and OpenBookQA.

Technical report: hf.co/spaces/Hugging…
Dataset:
Jimmy Lin (@lintool) 's Twitter Profile Photo

They say a picture is worth a thousand words... but work led by ralphtang.eth finds words worth a thousand pictures! arxiv.org/abs/2406.08482

Sohee Yang (@soheeyang_) 's Twitter Profile Photo

🚨 New Paper 🚨 Can LLMs perform latent multi-hop reasoning without exploiting shortcuts? We find the answer is yes – they can recall and compose facts not seen together in training, rather than merely guessing the answer, but success greatly depends on the type of the bridge entity (80%+ for