Aman Arora(@amaarora) 's Twitter Profileg
Aman Arora

@amaarora

Data Science Lead at REA Group | Blog: https://t.co/k0LKBJ9aO7 | Previously: MLE @weights_biases; AI Scientist @Harrison.ai

ID:2582562763

linkhttp://amaarora.github.io calendar_today22-06-2014 17:05:12

2,8K Tweets

5,2K Followers

1,4K Following

Hamel Husain(@HamelHusain) 's Twitter Profile Photo

I WAS SO FRUSTRATED with the way GitHub renders Jupyter Notebooks, esp on mobile (You cannot horizontal scroll code, even with nbviewer)

So I made a thing: nbsanity.com

Source code is here github.com/hamelsmu/nbsan…

I WAS SO FRUSTRATED with the way @github renders Jupyter Notebooks, esp on mobile (You cannot horizontal scroll code, even with nbviewer) So I made a thing: nbsanity.com Source code is here github.com/hamelsmu/nbsan…
account_circle
Gabin MAURY(@csgmaury) 's Twitter Profile Photo

Now 24 hours post release, the model is bad according to every single person who tried it. Turns out the 'secret sauce' was overfiting the benchmarks.

account_circle
Jeremy Howard(@jeremyphoward) 's Twitter Profile Photo

I battled against GPT4 for quite a while trying to encourage it to solve another version of this problem in this video: youtu.be/jkrNMKz9pWU?si…

account_circle
Tanishq Mathew Abraham, Ph.D.(@iScienceLuvr) 's Twitter Profile Photo

The PyTorch team is developing a library for large model training called torchtitan 👀

They have scripts to train Llama-3 from scratch

The library went public today on GitHub but it is still in pre-release state & active development

Check it out → github.com/pytorch/torcht…

The @PyTorch team is developing a library for large model training called torchtitan 👀 They have scripts to train Llama-3 from scratch The library went public today on GitHub but it is still in pre-release state & active development Check it out → github.com/pytorch/torcht…
account_circle
Thomas Wolf(@Thom_Wolf) 's Twitter Profile Photo

This take on the FineWeb release is one of the most interesting feedback and also a reason FineWeb is very different from even larger datasets like RedPajama-V2 (which is double its size!)

Surprisingly, the size of the dataset of 15T tokens is not very important, what is much…

account_circle
LlamaIndex 🦙(@llama_index) 's Twitter Profile Photo

Phi-3 Mini (3.8B) from Microsoft was released today, claiming to match Llama 3 8B's performance! But how does it handle RAG, Routing, Query Planning, Text2SQL, Pydantic Program, and Agentic tasks?

Thanks to Ravi Theja, our benchmark cookbook offers an initial analysis:
✅ RAG…

Phi-3 Mini (3.8B) from @Microsoft was released today, claiming to match Llama 3 8B's performance! But how does it handle RAG, Routing, Query Planning, Text2SQL, Pydantic Program, and Agentic tasks? Thanks to @ravithejads, our benchmark cookbook offers an initial analysis: ✅ RAG…
account_circle
Nate Raw(@_nateraw) 's Twitter Profile Photo

Turns out if you do a cute little hack, you can make musicgen-songstarter-v0.2 work on vocal inputs 👀

🎤 Sing an idea ➡️ AI music sample 🎶

🔊Sound on 🔊

My goal was to make something useful for music producers 🔥

Demo, API, code + detailed info in thread below ⤵️

account_circle
Tanishq Mathew Abraham, Ph.D.(@iScienceLuvr) 's Twitter Profile Photo

LLM Evaluators Recognize and Favor Their Own Generations

abs: arxiv.org/abs/2404.13076

1. Frontier LLMs exhibit self-preference in self-evaluation.
2. LLMs have non-trivial self-recognition capability out of the box.
3. Fine-tuning leads to near-perfect self-recognition.
4.…

LLM Evaluators Recognize and Favor Their Own Generations abs: arxiv.org/abs/2404.13076 1. Frontier LLMs exhibit self-preference in self-evaluation. 2. LLMs have non-trivial self-recognition capability out of the box. 3. Fine-tuning leads to near-perfect self-recognition. 4.…
account_circle
lmsys.org(@lmsysorg) 's Twitter Profile Photo

Exciting update -- Llama-3 full result is out, now reaching top-5 on the Arena leaderboard🔥

We've got stable enough CIs with over 12K votes. No question now Llama-3 70B is the new king of open model. Its powerful 8B variant has also surpassed many larger-size models. What an…

Exciting update -- Llama-3 full result is out, now reaching top-5 on the Arena leaderboard🔥 We've got stable enough CIs with over 12K votes. No question now Llama-3 70B is the new king of open model. Its powerful 8B variant has also surpassed many larger-size models. What an…
account_circle
Maxime Labonne(@maximelabonne) 's Twitter Profile Photo

🎉 Great news for model merging!

Charles Goddard implemented an evolutionary technique à la Sakana AI to MergeKit (cc hardmaru)

He also released an excellent tutorial on how to use it with lm-evaluation-harness and vllm.

📝 Article: blog.arcee.ai/tutorial-tutor…

🎉 Great news for model merging! @chargoddard implemented an evolutionary technique à la @SakanaAILabs to MergeKit (cc @hardmaru) He also released an excellent tutorial on how to use it with lm-evaluation-harness and vllm. 📝 Article: blog.arcee.ai/tutorial-tutor…
account_circle
Jeremy Howard(@jeremyphoward) 's Twitter Profile Photo

A big reason this works so well is because Benjamin Warner has put in many *many* hours over the last few weeks on debugging and fixing complex performance issues deep inside the Transformers library.

It's the kind of work that's rarely appreciated -- so let's do so now! :D

account_circle
Jeremy Howard(@jeremyphoward) 's Twitter Profile Photo

Today at Answer.AI we've got something new for you: FSDP/QDoRA. We've tested it with AI at Meta Llama3 and the results blow away anything we've seen before.

I believe that this combination is likely to create better task-specific models than anything else at any cost. 🧵

Today at @answerdotai we've got something new for you: FSDP/QDoRA. We've tested it with @AIatMeta Llama3 and the results blow away anything we've seen before. I believe that this combination is likely to create better task-specific models than anything else at any cost. 🧵
account_circle
Jeremy Howard(@jeremyphoward) 's Twitter Profile Photo

This is the best tutorial I've seen for fully understanding and implementing Transformers models.

It includes a complete working implementation from scratch of all the key pieces, written in a way to make learning and understanding as easy as possible.

account_circle
Simon Willison(@simonw) 's Twitter Profile Photo

New paper from OpenAI on prompt injection - it's the most detailed evaluation of the problem I've seen from them so far, and has some very interesting details

Posted some of my notes on the paper on my log here: simonwillison.net/2024/Apr/23/th…

account_circle
Jeremy Howard(@jeremyphoward) 's Twitter Profile Photo

FB/Meta senior mgt has cared a lot about AI for many years. By way of example, here's a true story I haven't told before...

Do you remember when Zuck was giving testimony to congress in April 2018? I was watching it live, when the phone rang. It was the CTO of Facebook.

account_circle
Julien Chaumond(@julien_c) 's Twitter Profile Photo

we just shipped HuggingChat on iOS 💬

The app is super polished and gives you access to the community's best open AI models, on the go.

Give it a try!

link to Appstore below ⤵️

we just shipped HuggingChat on iOS 💬 The app is super polished and gives you access to the community's best open AI models, on the go. Give it a try! link to Appstore below ⤵️
account_circle