Rahul Somani (@rsomani95)'s Twitter Profile
Rahul Somani

@rsomani95

Co-Founder / Leading ML @ ozu.ai

Exploring how Machine Learning can _augment_ human creativity, especially filmmaking.

ID: 4375115294

Link: https://rsomani95.github.io · Joined: 27-11-2015 07:20:11

467 Tweets

438 Followers

1.1K Following

Reid Southen (@rahll)'s Twitter Profile Photo

Latest workaround for getting ChatGPT to spit out copyright protected imagery? Simply knowing another language. What a joke.

Michael Nielsen (@michael_nielsen)'s Twitter Profile Photo

Something that drives me to distraction in discussion of AI alignment: someone will say "Oh, it's crucial we build systems with properties X, Y, Z to ensure safety". And different people have slightly different formulations of what X, Y, and Z ought to be, and argue over it

David Cole (@irondavy)'s Twitter Profile Photo

I struggle to remember most historical dates, even very approximately. I’m a spatial thinker so I’ve tried making a number of different visualizations of history to help me, and this is my latest and favorite: mapping many different time scales to my hand

Aaron Defazio (@aaron_defazio)'s Twitter Profile Photo

Schedule-Free Learning github.com/facebookresear… We have now open sourced the algorithm behind my series of mysterious plots. Each plot was either Schedule-free SGD or Adam, no other tricks!
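
The trick behind schedule-free training can be caricatured in a few lines: gradients are evaluated at an interpolation y between a base iterate z and a running average x, and x (uniformly averaged, hence no learning-rate schedule) is what you evaluate with. A toy pure-Python version on a 1-D quadratic, illustrative only and not the library's actual implementation:

```python
# Toy schedule-free SGD on f(w) = (w - 3)^2; sketch of the z/y/x iterate
# scheme, not the facebookresearch/schedule_free code itself.
def grad(w):
    return 2.0 * (w - 3.0)  # gradient of (w - 3)^2

lr, beta = 0.1, 0.9
z = x = 0.0                # z: base SGD iterate, x: averaged iterate (used at eval)
for t in range(1, 2001):
    y = (1 - beta) * z + beta * x  # gradients are taken at y, not at z or x
    z = z - lr * grad(y)
    c = 1.0 / t                    # uniform averaging weight: no schedule needed
    x = (1 - c) * x + c * z

print(round(x, 4))  # x converges to the minimizer w* = 3
```

The point of the sketch: x is a plain running average of the z iterates, so there is no decay schedule to tune, yet evaluation quality tracks the averaged weights.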

Pedro Cuenca (@pcuenq)'s Twitter Profile Photo

Two new AI releases by Apple today: 🧚‍♀️ OpenELM, a set of small (270M-3B) efficient language models. Weights on the Hub: Pretrained: huggingface.co/collections/ap… Instruct: huggingface.co/collections/ap… 👷‍♀️ CoreNet, a training library used to train OpenELM: github.com/apple/corenet

Sanchit Gandhi (@sanchitgandhi99)'s Twitter Profile Photo

Introducing 🤗 Diarizers: a library for fine-tuning speaker diarization models 🗣️ Improve multilingual diarization performance by 30% with just 10 minutes of GPU compute time! ⚡️ The first release comes with training scripts, datasets and a Google Colab 🚀 Check it out! ⚒️
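
Diarization improvements like the 30% claimed here are usually measured as diarization error rate (DER). A stripped-down, frame-level version of the idea, with made-up labels for illustration (real DER additionally finds an optimal reference-to-hypothesis speaker mapping and applies forgiveness collars):

```python
# Frame-level diarization error sketch: compare reference vs. hypothesized
# speaker labels per frame (None = non-speech). Hypothetical toy labels.
ref = ["A", "A", "B", "B", None, "A"]  # one label per (say) 1-second frame
hyp = ["A", "B", "B", "B", "B",  "A"]

speech = sum(1 for r in ref if r is not None)        # scored reference speech
errors = sum(1 for r, h in zip(ref, hyp) if r != h)  # miss / false alarm / confusion
der = errors / speech
print(round(der, 2))  # 2 errors over 5 speech frames -> 0.4
```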

Yao Fu (@francis_yao_)'s Twitter Profile Photo

From Claude100K to Gemini10M, we are in the era of long-context language models. Why and how can a language model utilize information at any input location within a long context? We discover retrieval heads, a special type of attention head responsible for long-context factuality
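
The detection recipe can be caricatured in a few lines: a head counts as a "retrieval head" if, while the model copies the needle back out, that head's top attention weight repeatedly lands inside the needle span. A toy scorer over hypothetical attention rows:

```python
# Toy "retrieval score" for one attention head: the fraction of generated
# tokens whose top attention weight falls on the needle span. The attention
# rows are made up; a real run extracts them from the model.
needle = range(3, 6)  # context positions holding the needle
# attn[t] = this head's attention distribution over context at decode step t
attn = [
    [0.1, 0.1, 0.1, 0.6, 0.05, 0.05],  # argmax 3 -> inside needle
    [0.2, 0.1, 0.1, 0.1, 0.4,  0.1],   # argmax 4 -> inside needle
    [0.5, 0.1, 0.1, 0.1, 0.1,  0.1],   # argmax 0 -> outside
]
hits = sum(1 for row in attn
           if max(range(len(row)), key=row.__getitem__) in needle)
score = hits / len(attn)
print(round(score, 2))  # a retrieval head scores high across many such probes
```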

Rahul Somani (@rsomani95)'s Twitter Profile Photo

Excited to share that we're looking to hire a Senior Full Stack Eng at Ozu. We're working on cutting edge problems in storytelling with a very passionate and capable team. Take a look below for more details if you'd like to join us!

Jeremy Howard (@jeremyphoward)'s Twitter Profile Photo

We've released a new library, fastdata, for high quality synthetic data generation! 🚀 Check out this deep dive thread and blog post from its creator explaining everything you need to know to get started:

Suhail (@suhail)'s Twitter Profile Photo

1/ We're unwrapping the new architecture + benchmarks behind Playground v3 - our new foundation model focused on graphic design. This is our first step towards making a powerful AI graphic designer. It's state-of-the-art at text rendering, prompt understanding, and color

Michael Tschannen (@mtschannen)'s Twitter Profile Photo

Have you ever wondered how to train an autoregressive generative transformer on text and raw pixels, without a pretrained visual tokenizer (e.g. VQ-VAE)? We have been pondering this during summer and developed a new model: JetFormer 🌊🤖 arxiv.org/abs/2411.19722 A thread 👇 1/

Vaibhav (VB) Srivastav (@reach_vb)'s Twitter Profile Photo

HOLY SHITT, Microsoft dropped an open-source Multimodal (supports Audio, Vision and Text) Phi 4 - MIT licensed! 🔥 > Beats Gemini 2.0 Flash, GPT4o, Whisper, SeamlessM4T v2 > Models on Hugging Face hub, integrated with Transformers! Phi-4-Multimodal: > Modalities: Integrates

Rudy Gilman (@rgilman33)'s Twitter Profile Photo

SigLIP needs registers. For comparison, here's DINO-v2 with registers. It has five extra tokens for the model to work with: one CLS token and four "registers". Look at how smooth those attention maps are! No artifacts.

Peter Tong (@tongpetersb)'s Twitter Profile Photo

Vision models have been smaller than language models; what if we scale them up? Introducing Web-SSL: A family of billion-scale SSL vision models (up to 7B parameters) trained on billions of images without language supervision, using VQA to evaluate the learned representation.

OZU (@ozutechnology)'s Twitter Profile Photo

📣Introducing the new OZU.ai - a new way to search for moments and scenes in film and tv. 🖤 Find themes you love, and moments that are in your head 🩶 Discover new shows, films, directors and actors ❤️ Change how you feel

tomaarsen (@tomaarsen)'s Twitter Profile Photo

Qwen is continuing their habit of state-of-the-art releases with 3 extraordinarily strong embedding models and 3 powerful reranker models, focusing on multilingual text retrieval and more. Details in 🧵
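
These two model families slot into the standard two-stage retrieval pipeline: a bi-encoder embeds query and documents for a cheap first pass, then a cross-encoder reranker reorders the shortlist. A self-contained toy of the pattern, where the character-count "embedding" and word-overlap "reranker" are stand-ins for the real models:

```python
# Two-stage retrieval sketch: embed-and-retrieve, then rerank the shortlist.
# The scorers below are toys; real pipelines swap in embedding and reranker
# models such as the ones announced here.
import math

def embed(text):
    # Toy bag-of-letters "embedding"; a real bi-encoder returns dense vectors.
    vec = [0.0] * 26
    for ch in text.lower():
        if ch.isalpha():
            vec[ord(ch) - 97] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def cosine(u, v):
    return sum(a * b for a, b in zip(u, v))

def rerank_score(query, doc):
    # Stand-in for a cross-encoder reranker: word-overlap fraction.
    q, d = set(query.lower().split()), set(doc.lower().split())
    return len(q & d) / max(len(q), 1)

docs = ["multilingual text retrieval", "cooking pasta at home",
        "text embedding models"]
query = "multilingual retrieval"

q_vec = embed(query)
# Stage 1: cheap vector search keeps the top-k candidates.
candidates = sorted(docs, key=lambda d: cosine(q_vec, embed(d)),
                    reverse=True)[:2]
# Stage 2: the (more expensive) reranker orders the shortlist.
best = max(candidates, key=lambda d: rerank_score(query, d))
print(best)
```

The split matters because the reranker sees query and document jointly and is far more accurate, but too slow to run over the whole corpus.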

Andrej Karpathy (@karpathy)'s Twitter Profile Photo

+1 for "context engineering" over "prompt engineering". People associate prompts with short task descriptions you'd give an LLM in your day-to-day use. When in every industrial-strength LLM app, context engineering is the delicate art and science of filling the context window
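
One concrete slice of that art is simple budgeting: deciding which pieces (system prompt, retrieved snippets, recent turns) make it into the window. A toy packer, counting whitespace "tokens" and using hypothetical inputs; real systems use an actual tokenizer and far richer priorities:

```python
# Toy context packer: fit a system prompt, then retrieved snippets (best
# first), then the most recent history turns, into a fixed token budget.
def n_tokens(text):
    return len(text.split())  # whitespace tokens stand in for a tokenizer

def build_context(system, snippets, history, budget):
    parts, used = [system], n_tokens(system)   # system prompt always included
    for snip in snippets:                      # assumed sorted by relevance
        if used + n_tokens(snip) > budget:
            break
        parts.append(snip)
        used += n_tokens(snip)
    kept = []
    for turn in reversed(history):             # newest turns get priority
        if used + n_tokens(turn) > budget:
            break
        kept.append(turn)
        used += n_tokens(turn)
    return parts + kept[::-1]                  # restore chronological order

ctx = build_context(
    system="You are a helpful assistant",
    snippets=["doc one two three", "doc four five six seven eight"],
    history=["user: hi", "assistant: hello there", "user: summarize the doc"],
    budget=20,
)
print(len(ctx))  # system + both snippets + only the newest turn fit
```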

François Chollet (@fchollet)'s Twitter Profile Photo

GenAI isn't just a technology; it's an informational pollutant—a pervasive cognitive smog that touches and corrupts every aspect of the Internet. It's not just a productivity tool; it's a kind of digital acid rain, silently eroding the value of all information. Every image is no