Marcell Fekete (@v4rmer)'s Twitter Profile
Marcell Fekete

@v4rmer

Marcell Fekete, NLP specialist. Check me out on vrmer.github.io

ID: 595847137

Joined: 31-05-2012 21:30:01

322 Tweets

87 Followers

275 Following

Johannes Bjerva (@johannesbjerva)'s Twitter Profile Photo

Interested in a PhD position in NLP, in the beautiful city of Copenhagen? We're hiring for a project on Explainability and Factuality in Language Modelling at AAU Copenhagen. AAU TECH Department of Computer Science, Aalborg University #NLProc #LLM Apply via this link: stillinger.aau.dk/phd-stillinger…

Johannes Bjerva (@johannesbjerva)'s Twitter Profile Photo

Our latest typology #NLProc paper was accepted to eaclmeeting main, with Emi who visited us from Mila - Institut québécois d'IA @McGillU, and Esther. We derive continuous word order features from treebanks, better reflecting the variability of language: arxiv.org/abs/2402.01513 Department of Computer Science, Aalborg University
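The idea of continuous word order features can be illustrated with a toy sketch (this is not the paper's code, and the tiny hand-made "treebank" below is hypothetical): instead of assigning a language a binary OV/VO label, count how often objects actually follow their verbal heads in dependency-annotated sentences.

```python
# Toy sketch: derive a *continuous* word order feature from dependency
# trees rather than a binary OV/VO label. Each sentence is a list of
# (index, head_index, deprel) triples, loosely mimicking the CoNLL-U
# ID, HEAD, and DEPREL columns.

TOY_TREEBANK = [
    # "She reads books": object (id 3) follows its verb (id 2) -> VO
    [(1, 2, "nsubj"), (2, 0, "root"), (3, 2, "obj")],
    # an OV sentence: object (id 2) precedes its verbal head (id 3)
    [(1, 3, "nsubj"), (2, 3, "obj"), (3, 0, "root")],
    # another VO sentence
    [(1, 2, "nsubj"), (2, 0, "root"), (3, 2, "obj")],
]

def object_after_verb_ratio(treebank):
    """Fraction of `obj` dependents that follow their head verb."""
    after = total = 0
    for sent in treebank:
        for idx, head, rel in sent:
            if rel == "obj":
                total += 1
                if idx > head:
                    after += 1
    return after / total if total else float("nan")

print(object_after_verb_ratio(TOY_TREEBANK))  # 2 of 3 objects follow the verb
```

A ratio like 0.67 captures variability that a hard "VO language" label would erase, which is the kind of gradience the tweet describes.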

Edoardo Ponti (@pontiedoardo)'s Twitter Profile Photo

Can open-source LLMs execute *chains of instructions* in a single query? Not so well, we found.

However, they can learn this ability by:
- augmenting examples from public SFT mixtures with chains of instructions automatically
- performing *sequential instruction tuning* on them.

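The augmentation step can be sketched in a few lines. This is a hedged illustration only — the data format and the `chain_examples` helper are hypothetical, not the authors' pipeline: single-instruction SFT examples are composed into one example whose prompt asks for several steps in order.

```python
# Hypothetical sketch of chain-of-instructions augmentation: merge
# single-instruction SFT examples into one sequential example, so the
# model must execute several instructions in a single query.

def chain_examples(examples):
    """Compose (instruction, output) dicts into one chained example."""
    instructions = [f"{i + 1}. {ex['instruction']}" for i, ex in enumerate(examples)]
    outputs = [f"{i + 1}. {ex['output']}" for i, ex in enumerate(examples)]
    return {
        "instruction": "Perform the following steps in order:\n" + "\n".join(instructions),
        "output": "\n".join(outputs),
    }

single_turn = [
    {"instruction": "Translate 'bonjour' to English.", "output": "hello"},
    {"instruction": "Uppercase the previous answer.", "output": "HELLO"},
]

chained = chain_examples(single_turn)
print(chained["instruction"])
print(chained["output"])
```

Tuning on examples like `chained` (rather than on `single_turn` alone) is the "sequential instruction tuning" the tweet refers to.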
Piotr Nawrot (@p_nawrot)'s Twitter Profile Photo

The memory in Transformers grows linearly with the sequence length at inference time. In SSMs it is constant, but often at the expense of performance. We introduce Dynamic Memory Compression (DMC), where we retrofit LLMs to compress their KV cache while preserving performance.

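The linear growth is easy to see with back-of-the-envelope arithmetic. The sketch below uses illustrative 7B-class model dimensions (not figures from the paper), and the 4x compression ratio is just an example of what a DMC-style method divides the cache by:

```python
# Back-of-the-envelope sketch: the KV cache grows linearly with sequence
# length, and a compression ratio divides it. Model dimensions below are
# illustrative (roughly 7B-class, fp16), not taken from the DMC paper.

def kv_cache_bytes(seq_len, n_layers=32, n_kv_heads=32, head_dim=128,
                   bytes_per_elem=2):
    # factor of 2 for keys *and* values, one entry per token per layer
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem

for seq_len in (1024, 4096, 16384):
    full = kv_cache_bytes(seq_len)
    compressed = full / 4  # e.g. a 4x compression ratio
    print(f"{seq_len:>6} tokens: {full / 2**30:.2f} GiB -> {compressed / 2**30:.2f} GiB")
```

Doubling the sequence length doubles the cache, so at long contexts the KV cache, not the weights, dominates memory — which is what makes compressing it attractive.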
Irina Saparina (@irisaparina)'s Twitter Profile Photo

Next week I’ll be in Malta 🇲🇹 to present our work on Improving Generalization in Semantic Parsing by Increasing Natural Language Variation at #EACL2024! 1/3

Benjamin Minixhofer (@bminixhofer)'s Twitter Profile Photo

Introducing Zero-Shot Tokenizer Transfer (ZeTT) ⚡ ZeTT frees language models from their tokenizer, allowing you to use any model with any tokenizer, with little or no extra training. Super excited to (finally!) share the first project of my PhD🧵

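To make the problem ZeTT addresses concrete: when you swap a model's tokenizer, the new vocabulary needs embeddings. The sketch below shows a common *baseline* heuristic — averaging the old embeddings of each new token's pieces — not ZeTT's actual method (which trains a hypernetwork); the toy vocabulary and `old_tokenize` segmenter are hypothetical.

```python
# Toy sketch of the tokenizer-transfer problem: initialize an embedding
# for a token in a *new* vocabulary by averaging the old-tokenizer
# embeddings of its pieces. (ZeTT itself trains a hypernetwork instead
# of using this mean-pooling baseline.)

OLD_EMB = {          # toy 2-d embeddings under the old tokenizer
    "un": [1.0, 0.0],
    "happi": [0.0, 1.0],
    "ness": [1.0, 1.0],
}

def old_tokenize(token):
    """Toy stand-in for the old tokenizer's greedy segmentation."""
    pieces, rest = [], token
    for piece in ("un", "happi", "ness"):
        if rest.startswith(piece):
            pieces.append(piece)
            rest = rest[len(piece):]
    return pieces or [rest]

def init_new_embedding(new_token):
    """Mean of the old embeddings of the token's pieces."""
    vectors = [OLD_EMB[p] for p in old_tokenize(new_token) if p in OLD_EMB]
    n = len(vectors)
    return [sum(dim) / n for dim in zip(*vectors)]

print(init_new_embedding("unhappiness"))  # mean of the three piece vectors
```

Heuristics like this need no training but lose quality; ZeTT's contribution is predicting much better embeddings for arbitrary tokenizers, so "little or no extra training" is needed.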
Edoardo Ponti (@pontiedoardo)'s Twitter Profile Photo

Today I am joining NVIDIA part-time as a visiting professor. I could not imagine a better place to explore new efficient architectures for LLMs and diffusion. I am looking forward to collaborating with so many talented researchers!

Iñigo Alonso (@alonsonlp)'s Twitter Profile Photo

Reimagining table representation! In our new #ACL2024NLP paper we introduce PixT3: a family of image-based Table-to-Text Generation models that scale better at generating text from large tables, outperforming traditional text-based baselines. arxiv.org/abs/2311.09808

Marcell Fekete (@v4rmer)'s Twitter Profile Photo

Completed my research stay at The University of Edinburgh, supported by Otto Mønsteds Fond! Investigated linguistic variation using multi-agent communication. Gained insights, networked with top researchers, and made new friends! Edoardo Ponti Johannes Bjerva AAU TECH Department of Computer Science, Aalborg University EdinburghNLP

babyLM (@babylmchallenge)'s Twitter Profile Photo

BabyLM is looking for organizers to join our team! If you are a mid-stage graduate student, interested in (sample-efficient) language modeling or cognitive science, you could be a great fit! Find out more information, and fill out our interest form here: forms.gle/uCf62Nk6mmQhhx…

Nathaniel R. Robinson (@robinson_n8)'s Twitter Profile Photo

Got to present our work in progress on leveraging adapters for machine translation of Creole languages at MRL #EMNLP2024 🚀 Stay tuned for more on Creole language MT! aclanthology.org/2024.mrl-1.17/ @v4mer @heather_nlp @prajdabre1 Johannes Bjerva

Nathan Godey (@nthngdy)'s Twitter Profile Photo

I'm now officially looking for a post-doc position, starting in Spring! I would be happy to pursue my work on training dynamics, interpretability, or even more specifically on LM representations and character-level models. Feel free to reach out!