Aman Arora (@amaarora) Twitter Tweets • TwiCopy

Hamel Husain

6 months ago

I WAS SO FRUSTRATED with the way GitHub renders Jupyter Notebooks, esp on mobile (You cannot horizontal scroll code, even with nbviewer)

So I made a thing: nbsanity.com

Source code is here github.com/hamelsmu/nbsan…

Gabin MAURY

@csgmaury

6 days ago

Now 24 hours post release, the model is bad according to every single person who tried it. Turns out the 'secret sauce' was overfitting the benchmarks.

Jeremy Howard

@jeremyphoward

4 days ago

I battled against GPT4 for quite a while trying to encourage it to solve another version of this problem in this video: youtu.be/jkrNMKz9pWU?si…

Simon Willison

@simonw

4 days ago

'Do stuff and then blog about it' remains one of the most underrated pieces of career advice

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

4 days ago

The PyTorch team is developing a library for large model training called torchtitan 👀

They have scripts to train Llama-3 from scratch

The library went public today on GitHub but it is still in pre-release state & active development

Check it out → github.com/pytorch/torcht…

Thomas Wolf

@Thom_Wolf

1 week ago

This take on the FineWeb release is one of the most interesting pieces of feedback and also a reason FineWeb is very different from even larger datasets like RedPajama-V2 (which is double its size!)

Surprisingly, the size of the dataset of 15T tokens is not very important, what is much…

LlamaIndex 🦙

@llama_index

6 days ago

Phi-3 Mini (3.8B) from Microsoft was released today, claiming to match Llama 3 8B's performance! But how does it handle RAG, Routing, Query Planning, Text2SQL, Pydantic Program, and Agentic tasks?

Thanks to Ravi Theja, our benchmark cookbook offers an initial analysis:
✅ RAG…

Nate Raw

@_nateraw

1 week ago

Turns out if you do a cute little hack, you can make musicgen-songstarter-v0.2 work on vocal inputs 👀

🎤 Sing an idea ➡️ AI music sample 🎶

🔊Sound on 🔊

My goal was to make something useful for music producers 🔥

Demo, API, code + detailed info in thread below ⤵️

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

1 week ago

LLM Evaluators Recognize and Favor Their Own Generations

abs: arxiv.org/abs/2404.13076

1. Frontier LLMs exhibit self-preference in self-evaluation.
2. LLMs have non-trivial self-recognition capability out of the box.
3. Fine-tuning leads to near-perfect self-recognition.
4.…

lmsys.org

@lmsysorg

1 week ago

Exciting update -- Llama-3 full result is out, now reaching top-5 on the Arena leaderboard🔥

We've got stable enough CIs with over 12K votes. No question now Llama-3 70B is the new king of open models. Its powerful 8B variant has also surpassed many larger-size models. What an…

Maxime Labonne

@maximelabonne

6 days ago

🎉 Great news for model merging!

Charles Goddard implemented an evolutionary technique à la Sakana AI to MergeKit (cc hardmaru)

He also released an excellent tutorial on how to use it with lm-evaluation-harness and vllm.

📝 Article: blog.arcee.ai/tutorial-tutor…

merve

@mervenoyann

6 days ago

very nice blog post on training a VLM purely in pytorch huggingface.co/blog/AviSoori1… 🔖

Jeremy Howard

@jeremyphoward

1 week ago

A big reason this works so well is because Benjamin Warner has put in many *many* hours over the last few weeks on debugging and fixing complex performance issues deep inside the Transformers library.

It's the kind of work that's rarely appreciated -- so let's do so now! :D

Jeremy Howard

@jeremyphoward

1 week ago

Today at Answer.AI we've got something new for you: FSDP/QDoRA. We've tested it with AI at Meta Llama3 and the results blow away anything we've seen before.

I believe that this combination is likely to create better task-specific models than anything else at any cost. 🧵

Jeremy Howard

@jeremyphoward

1 week ago

This is the best tutorial I've seen for fully understanding and implementing Transformers models.

It includes a complete working implementation from scratch of all the key pieces, written in a way to make learning and understanding as easy as possible.

Simon Willison

@simonw

1 week ago

New paper from OpenAI on prompt injection - it's the most detailed evaluation of the problem I've seen from them so far, and has some very interesting details

Posted some of my notes on the paper on my blog here: simonwillison.net/2024/Apr/23/th…

Rafal Wilinski

@rafalwilinski

1 week ago

Do yourself a favor and try Llama3 70B with Groq. GPT-4 level answers provided instantly. Insane.

Jeremy Howard

@jeremyphoward

1 week ago

FB/Meta senior mgt has cared a lot about AI for many years. By way of example, here's a true story I haven't told before...

Do you remember when Zuck was giving testimony to congress in April 2018? I was watching it live, when the phone rang. It was the CTO of Facebook.

Maxime Labonne

@maximelabonne

1 week ago

Models dropped on Hugging Face!

huggingface.co/meta-llama/Met…
huggingface.co/meta-llama/Met…

Julien Chaumond

@julien_c

1 week ago

we just shipped HuggingChat on iOS 💬

The app is super polished and gives you access to the community's best open AI models, on the go.

Give it a try!

link to Appstore below ⤵️
