Knut Jägersberg (@JagersbergKnut)'s Twitter Profile
Knut Jägersberg

@JagersbergKnut

Content Strategy & AI

@[email protected]

https://t.co/xnBUK02hWS

ID:1010498049058201600

https://www.linkedin.com/in/knut-jägersberg

Joined 23-06-2018 12:21:23

80,6K Tweets

5,6K Followers

4,7K Following

Bindu Reddy (@bindureddy)

LLMs are plateauing and the gap between closed vs. open is almost closed!!

If you look at MMLU, open source has caught up to closed source, and we are seeing the LLMs plateau

It's time to move on to different benchmarks that measure LLM capabilities on hard problems

The key

Knut Jägersberg (@JagersbergKnut)

How Self-Supervised Reinforcement Learning Combined With Offline Reinforcement Learning (RL) Could Enable Scalable Representation Learning

marktechpost.com/2021/12/19/uc-…

Rohan Paul (@rohanpaul_ai)

New hints about GPT-5 in OpenAI Vivatech Paris Talk

The hints are at:

18:00 where he shows the chart of intelligence of 'GPT-Next' arriving in 2024 - he talks about that model being a 'step function' over GPT-4 and 'become masters students in a blink of an eye',

and also

EleutherAI (@AiEleuther)

Excited to share our new paper, Lessons From The Trenches on Reproducible Evaluation of Language Models!

In it, we discuss common challenges we’ve faced evaluating LMs, and how our library the Evaluation Harness is designed to mitigate them 🧵

arxiv.org/abs/2405.14782

cohere (@cohere)

Why are leading technologists choosing Retrieval-Augmented Generation (RAG) systems for cutting-edge LLM solutions?

RAG connects LLMs with real-world data, tackling challenges like hallucinations and rising costs. Explore the top 5 reasons enterprises are choosing RAG systems

The Kyiv Independent (@KyivIndependent)

⚡️ Putin looking for ceasefire to cement gains in Ukraine, Reuters reports citing sources.

Russian President Vladimir Putin is open to a ceasefire that recognizes the current front lines on the battlefield but will fight on if Ukraine and its allies do not agree, Reuters

Alexander Doria (@Dorialexander)

Very pleased to have presented Common Corpus with Anastasia Stasenko at the Deep Learning for Science day at CNRS 🌍 And, beyond that, the scientific and ethical need for an open-science turn in LLM training.

Jo Kristian Bergum (@jobergum)

It’s weird that big G triggers LLM summarization for low-volume queries and also allows it to quote from low-quality sources.
