treeducate.com (@worldcitizenhds) 's Twitter Profile
treeducate.com

@worldcitizenhds

E-Learning: Machine Learning, Data Science, Deep Learning, AI, Math, Python, R, Julia, Java, Data Warehousing, BI

ID: 35891872

Link: https://www.facebook.com/groups/mathfordatascience · Joined: 27-04-2009 22:49:13

569 Tweets

92 Followers

1.1K Following

Itamar Golan 🤓 (@itakgol) 's Twitter Profile Photo

🎉 The most powerful Open-Source LLM is out! 

Yes! Smaller than LLaMA 🦙, but Stronger than LLaMA (65B)

hands down! 💪

***

The Technology Innovation Institute of the United Arab Emirates released today the most powerful base model ever: FalconLM 🚀

The model is currently
Ethan Mollick (@emollick) 's Twitter Profile Photo

LLMs passed a Turing Test, of a sort, for doctors.

149 actors playing patients texted live with one of 20 primary care doctors or else Google's new medical LLM, AMIE. Specialist human doctors & the "patients" rated the quality of care. AMIE beat the docs. blog.research.google/2024/01/amie-r…
Demis Hassabis (@demishassabis) 's Twitter Profile Photo

Here’s a fun demo of long-context understanding. First, we asked the model to find 3 amusing moments in the 402-page pdf transcription of the iconic Apollo 11 mission. Then we uploaded a simple drawing of a boot and it identified the moment we had in mind: Neil’s one small step!

Shaun.AGI (@agishaun) 's Twitter Profile Photo

⚠️RAG might be dead, after reading the 58 pages of the Gemini 1.5 Pro tech report. Here are my thoughts as an AI founder:

1. Simple RAG systems like similarity search with a vector DB will be dead. But more customized RAG will still live. The goal of RAG is mostly on retrieval relevant
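The "simple RAG" pattern the tweet refers to can be sketched in a few lines of plain Python: embed documents and a query, rank by cosine similarity, and prepend the top hit to the prompt. This is a minimal illustration only — the toy bag-of-words "embedding" stands in for a real embedding model, and the corpus and function names are invented for the sketch, not taken from the tweet.

```python
import math
from collections import Counter

def embed(text):
    # Toy bag-of-words "embedding"; a real system would call an embedding model.
    return Counter(text.lower().split())

def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

docs = [
    "Gemini 1.5 Pro supports very long context windows",
    "Vector databases store embeddings for similarity search",
    "FalconLM is an open-source base model",
]
index = [(d, embed(d)) for d in docs]  # this list plays the role of the vector DB

def retrieve(query, k=1):
    # Rank all documents by similarity to the query and return the top k.
    q = embed(query)
    ranked = sorted(index, key=lambda pair: cosine(q, pair[1]), reverse=True)
    return [d for d, _ in ranked[:k]]

context = retrieve("how does similarity search with a vector db work?")
prompt = f"Context: {context[0]}\n\nQuestion: ..."
```

The tweet's point is that when a model can ingest the whole corpus in its context window, this retrieve-then-prompt step becomes unnecessary for small corpora — though more customized retrieval survives where corpora exceed even long contexts.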
Jim Fan (@drjimfan) 's Twitter Profile Photo

The most epic AI panel in a while! We at NVIDIA have gathered ALL 8 authors of "Attention is All You Need" for a panel at GTC, hosted by none other than the GOAT himself, Jensen Huang.

In 2017, 8 researchers had a flash of genius and invented the Transformer, the seminal work that
Carlos E. Perez (@intuitmachine) 's Twitter Profile Photo

Groq is a Radically Different kind of AI architecture

Among the new crop of AI chip startups, Groq stands out with a radically different approach centered around its compiler technology for optimizing a minimalist yet high-performance architecture. Groq's secret sauce is this
Andrej Karpathy (@karpathy) 's Twitter Profile Photo

Fun LLM challenge that I'm thinking about: take my 2h13m tokenizer video and translate the video into the format of a book chapter (or a blog post) on tokenization. Something like:

1. Whisper the video
2. Chop up into segments of aligned images and text
3. Prompt engineer an LLM
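Step 2 of the pipeline above might look like the sketch below: group transcript segments into fixed-duration chunks, so each chunk of text can later be paired with a frame grabbed at the chunk's start time. The segment format mirrors what Whisper emits (start/end times plus text), but the chunking policy, window size, and names are my own illustration, not Karpathy's.

```python
def chunk_segments(segments, window=60.0):
    """Group Whisper-style segments ({'start', 'end', 'text'}) into
    chunks of roughly `window` seconds of transcript each."""
    chunks, current, chunk_start = [], [], 0.0
    for seg in segments:
        # Close the current chunk once it would exceed the time window.
        if seg["end"] - chunk_start > window and current:
            chunks.append({"start": chunk_start, "text": " ".join(current)})
            current, chunk_start = [], seg["start"]
        current.append(seg["text"].strip())
    if current:
        chunks.append({"start": chunk_start, "text": " ".join(current)})
    return chunks

# Toy transcript segments, shaped like Whisper output.
segments = [
    {"start": 0.0, "end": 40.0, "text": "Tokenization is how text becomes integers."},
    {"start": 40.0, "end": 75.0, "text": "Byte pair encoding merges frequent pairs."},
    {"start": 75.0, "end": 110.0, "text": "Let's implement the merge loop."},
]
chunks = chunk_segments(segments, window=60.0)
```

Each chunk's `start` timestamp is what would align the text with a video frame before handing both to the LLM in step 3.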

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

Have you ever wanted to train LLMs in pure C without 245MB of PyTorch and 107MB of cPython? No? Well now you can! With llm.c: github.com/karpathy/llm.c To start, it implements GPT-2 training on CPU/fp32 in only ~1,000 lines of clean code. It compiles and runs instantly, and exactly

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

# explaining llm.c in layman terms Training Large Language Models (LLMs), like ChatGPT, involves a large amount of code and complexity. For example, a typical LLM training project might use the PyTorch deep learning library. PyTorch is quite complex because it implements a very

Open Source Intel (@osint613) 's Twitter Profile Photo

Bill Clinton On the Palestinians: “And the only time Yasser Arafat didn't tell me the truth was when he promised me he was gonna accept the peace deal that we had worked out, which would have given the Palestinians a state on 96% of the West Bank and 4% of Israel, and they got

Sten Rüdiger (@stenruediger) 's Twitter Profile Photo

I’ve been developing a parameter-efficient FT method that:
– Improves in-domain knowledge uptake
– Minimizes out-of-domain forgetting
– Smaller parameter count than LoRA

Initial QA benchmarks are strong. Today, perplexity-based evaluation confirmed efficiency trends. More soon
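For scale on the "smaller parameter count than LoRA" claim above: LoRA replaces a full d_out × d_in weight update with two low-rank factors, so its trainable count per layer is r·(d_in + d_out) instead of d_in·d_out. A quick back-of-the-envelope in Python — the layer size and rank below are common illustrative values, not figures from the tweet:

```python
def full_ft_params(d_in, d_out):
    # Full fine-tuning updates the entire weight matrix.
    return d_in * d_out

def lora_params(d_in, d_out, r):
    # LoRA trains two factors: A (r x d_in) and B (d_out x r).
    return r * (d_in + d_out)

d = 4096   # hidden size typical of a 7B-class model
r = 8      # a commonly used LoRA rank

full = full_ft_params(d, d)   # parameters per square layer, full fine-tune
lora = lora_params(d, d, r)   # parameters per layer under LoRA
print(f"LoRA trains {lora / full:.3%} of the full layer's parameters")
```

A method that undercuts LoRA's count would thus train fewer than roughly 65K parameters per 4096×4096 layer.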
Sten Rüdiger (@stenruediger) 's Twitter Profile Photo

For a long time, I've struggled with getting deep domain knowledge into LLM chatbots. RAG is powerful, sure, but it often feels like a workaround, not an elegant, integrated solution. I knew there had to be a better way to make LLMs truly learn. 🤔 #LLM #DomainAdaptation #A