HPLT(@hplt_eu) 's Twitter Profile
HPLT

@hplt_eu

Horizon Europe - High Performance Language Technology (HPLT)

ID:1542506822573051910

http://hplt-project.org

30-06-2022 13:54:35

43 Tweets

227 Followers

15 Following

HPLT(@hplt_eu) 's Twitter Profile Photo

Good news for small languages and LLMs. Paper on open Poro 34B model shows how training on Finnish, English and programming languages creates a very strong Finnish model that excels in translation and is competitive in generating English and code: arxiv.org/abs/2404.01856

HPLT(@hplt_eu) 's Twitter Profile Photo

We will be presenting the HPLT datasets HOW-TO and insights at LREC-COLING 2024 in Torino. The paper is already on arXiv: arxiv.org/pdf/2403.14009….

HPLT(@hplt_eu) 's Twitter Profile Photo

HPLT partners have explored the effects of multilingual vs monolingual instruction tuning, under a constrained budget and a self-made machine-translated Alpaca-based dataset. Spoiler: go multilingual! Work will be presented at
Findings: arxiv.org/abs/2309.08958

Konstantin Dobler(@konstantdobler) 's Twitter Profile Photo

Attending the HPLT & NLPL Winter School in Skeikampen, Norway was a blast and highly recommended if you are interested in Large Language Models. Bonus: we had our very own take on building a snowman!

HPLT(@hplt_eu) 's Twitter Profile Photo

HPLT scientific highlights directly from one of the authors:
'LTG-BERT: an efficient LM architecture developed within the HPLT project, won the BabyLM challenge. Look forward to our future release of 75 LTG-BERTs for 75 different languages.'
Thanks David Samuel and congrats!

LTG Oslo(@ltgoslo) 's Twitter Profile Photo

It's snowing large language models this week in Norway!
1st, the 5th NLPL and HPLT Winter School on LLMs is ongoing now in Skeikampen
And 2nd, the LTG has released three fully open generative language models for Norwegian, based on Mistral and BLOOM architectures

HPLT(@hplt_eu) 's Twitter Profile Photo

Right now at Skeikampen, Norway: the HPLT&NLPL Winter School on Large Language Models: Creation, Customization, Evaluation, and Use wiki.nlpl.eu/Community/trai… is starting! Institute of Formal and Applied Linguistics Stephan Oepen LTG Oslo @unioslo European Commission @digitaleu hplt-project.org

HPLT(@hplt_eu) 's Twitter Profile Photo

HPLT project presented at the online Language Data Space Workshop today language-data-space.ec.europa.eu/events/legisla… Institute of Formal and Applied Linguistics, Charles University, DG Connect, CLARIN ERIC, LINDAT/CLARIAH-CZ. Jan Hajic was a panelist at the LDS Legal Issues panel at the workshop led by Thomas Margoni of CiTiP KU Leuven.

HPLT(@hplt_eu) 's Twitter Profile Photo

HPLT LLMs are currently training on the LUMI supercomputer. This blog post is worth a read if you want to know more about it!

HPLT(@hplt_eu) 's Twitter Profile Photo

The 2024 HPLT & NLPL Winter School focus is...

📜Large Language Models: Creation, Customisation, Evaluation, and Use.

🔥Take a look at our fabulous programme! wiki.nlpl.eu/Community/trai…

👏 Big thanks to our presenters afra alishahi, Desmond Elliott, Niklas Muennighoff and Aurélie N.!

HPLT(@hplt_eu) 's Twitter Profile Photo

HPLT scientific news👨‍🏫!
🎙️Collaboration between HPLT, Hugging Face, and Harvard on 'Scaling Data-Constrained Language Models' awarded runner-up outstanding paper at NeurIPS'23. Well done TurkuNLP!

Paper: arxiv.org/pdf/2305.16264…

Institute of Formal and Applied Linguistics(@ufal_cuni) 's Twitter Profile Photo

🚨Happening now! HPLT birds of a feather session at #EMNLP2023. Want to know more about international academic collaboration on training very large open models? Come discuss high-performance language and translation models with us! ✌️

HPLT(@hplt_eu) 's Twitter Profile Photo

We just published version 1.2 of HPLT datasets. What's new?
- we fixed a bug in monolingual dedup, please redownload! 🛠️
- we filtered out very ugly monolingual documents🤮
- we anonymised the bilingual datasets🕵️‍♀️
hplt-project.org/datasets/v1.2

HPLT(@hplt_eu) 's Twitter Profile Photo

HPLT is at EMNLP'2023! Join us for the HPLT BoF meeting that will take place Friday 08, 16:00 in the Aquarius room. See these web pages for more BoF details: 2023.emnlp.org/program/bof/ virtual2023.emnlp.org/socials.html#t…

HPLT(@hplt_eu) 's Twitter Profile Photo

HPLT News and Tools!!! If you are interested in filtering your datasets for quality and using them to train MT and LLMs, you are interested in this thread 👇

Helsinki-NLP(@HelsinkiNLP) 's Twitter Profile Photo

Helsinki-NLP is busy at the FCAI AI day blogs.helsinki.fi/language-techn…
Lots of interesting things including the HPLT project, the shroom2024 shared task and more. See us at the presentations and posters!
