m_ric (@aymericroucher)'s Twitter Profile
m_ric

@aymericroucher

Building Agents at Hugging Face 🤗

ID: 1454785961158746115

Joined: 31-10-2021 12:24:16

465 Tweets

5.5K Followers

217 Following

不鍊金丹不坐禪 (@zzwz)'s Twitter Profile Photo

m_ric Andrew Ng Post: My little homework project (DeepWebSearch AgentKit App, built with the 🤗 Hugging Face smolagents framework) github.com/lwyBZss8924d/D…

m_ric (@aymericroucher)'s Twitter Profile Photo

Who said Transformers couldn't be good at forecasting? Datadog's new open model tops forecasting benchmarks! 💥

And boy did they cook. They followed the playbook to build the best model:

1. The best benchmark

They release a new benchmark named BOOM, based on observability…
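For context on how forecasting benchmarks like this score models, here is a minimal sketch of MASE (mean absolute scaled error), a standard time-series metric. This is generic illustration code, not Datadog's or BOOM's actual evaluation harness; the numbers are toy values.

```python
# MASE: MAE of the forecast, scaled by the MAE of a naive
# one-step-behind forecast on the training series.
def mase(y_true, y_pred, y_train, season=1):
    mae = sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)
    naive_errors = [abs(y_train[i] - y_train[i - season])
                    for i in range(season, len(y_train))]
    scale = sum(naive_errors) / len(naive_errors)
    return mae / scale

train = [10, 12, 11, 13, 12, 14]   # toy history
actual = [13, 15]                  # held-out ground truth
forecast = [13.5, 14.0]            # model predictions
print(round(mase(actual, forecast, train), 3))  # → 0.469 (below 1 beats the naive baseline)
```

A score below 1 means the model beats the naive baseline; leaderboards typically aggregate such scores across many series.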
m_ric (@aymericroucher)'s Twitter Profile Photo

TIL: When distilling reasoning capability from a teacher LLM to a smaller LLM, you should use Agent traces instead of CoT traces.

Advantages are:
1. Increased generalization
Intuitively, this is because your agent can encounter more "surprising" results by interacting with its…
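A hedged sketch of the idea: flattening one agent trace (thought / action / observation steps) into a supervised fine-tuning example for the student model. Unlike a plain CoT trace, each step carries an observation returned by the environment, which is where the "surprising" results come from. The field names and schema below are illustrative assumptions, not any specific framework's format.

```python
# Turn one agent trace into a (prompt, completion) pair for SFT.
def trace_to_sft_example(task, steps, final_answer):
    lines = []
    for step in steps:
        lines.append(f"Thought: {step['thought']}")
        lines.append(f"Action: {step['action']}")
        lines.append(f"Observation: {step['observation']}")  # absent from CoT-only traces
    lines.append(f"Final answer: {final_answer}")
    return {"prompt": f"Task: {task}", "completion": "\n".join(lines)}

example = trace_to_sft_example(
    task="What year was the Eiffel Tower completed?",
    steps=[{"thought": "I should search for this.",
            "action": 'web_search("Eiffel Tower completion year")',
            "observation": "The Eiffel Tower was completed in 1889."}],
    final_answer="1889",
)
```

Distillation would then fine-tune the student on many such pairs collected from the teacher agent's runs.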
Allan (@niemerg)'s Twitter Profile Photo

Huge fan of Claude Code—so I built a python version using smolagents!

Introducing SmolCC 🤖📟🛠️

An open source coding agent with Claude Code style tools (bash, grep, edit…✨) that can be easily customized.
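For flavor, here is a minimal grep-style tool of the kind such an agent would expose. This is a plain-Python stand-in to illustrate the tool idea, not SmolCC's actual implementation.

```python
# A grep-like tool: regex search across files, returning
# "path:lineno:line" hits, roughly like `grep -rn` with a glob filter.
import re
from pathlib import Path

def grep_tool(pattern: str, root: str = ".", glob: str = "*.py") -> list[str]:
    hits = []
    rx = re.compile(pattern)
    for path in sorted(Path(root).rglob(glob)):
        try:
            text = path.read_text(errors="ignore")
        except OSError:
            continue  # skip unreadable files
        for i, line in enumerate(text.splitlines(), start=1):
            if rx.search(line):
                hits.append(f"{path}:{i}:{line.strip()}")
    return hits
```

An agent framework would wrap a function like this as a tool (with its docstring as the tool description) so the LLM can call it by name.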
m_ric (@aymericroucher)'s Twitter Profile Photo

If you didn't yet, you should read the technical report for SmolVLA, published yesterday by the Hugging Face robotics team!
➡️Amongst other ideas, it introduces "Async inference" to boost their robot actions.

Robots have a problem: performing the actions takes time (unlike…
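The async-inference idea can be sketched with asyncio: while the robot executes the current chunk of actions, the policy already computes the next chunk, hiding inference latency behind execution. Timings and function names below are toy assumptions, not SmolVLA's API.

```python
# Toy sketch of async inference: overlap action execution with
# computing the next action chunk.
import asyncio

async def infer_next_chunk(obs):
    await asyncio.sleep(0.05)          # stand-in for model inference
    return [f"action_for_{obs}_{i}" for i in range(3)]

async def execute_chunk(chunk):
    for action in chunk:
        await asyncio.sleep(0.02)      # stand-in for motor execution

async def control_loop(observations):
    executed = []
    pending = asyncio.create_task(infer_next_chunk(observations[0]))
    for obs in observations[1:]:
        chunk = await pending                                 # chunk for the previous obs
        pending = asyncio.create_task(infer_next_chunk(obs))  # start next inference...
        await execute_chunk(chunk)                            # ...while executing this chunk
        executed.append(chunk)
    executed.append(await pending)
    return executed

chunks = asyncio.run(control_loop(["obs0", "obs1", "obs2"]))
```

In a synchronous loop the robot would sit idle during every inference call; here the two overlap, so the robot keeps moving.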
m_ric (@aymericroucher)'s Twitter Profile Photo

Qwen silently dropped the new standard for embeddings on the Hub!
- 0.6B, 4B and 8B versions (I'd probably use only the 0.6B)
- 32k context length 📏
- 100 languages 🌍
- SOTA on MTEB, but like real SOTA, with a 10-point margin over the second best 🤯
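What you would do with an embedding model like this: rank documents by cosine similarity of their vectors. The hand-made toy vectors below stand in for real model outputs (which you would get by running, e.g., the 0.6B checkpoint over your texts).

```python
# Cosine similarity, the core operation behind embedding-based retrieval.
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

query        = [0.9, 0.1, 0.0]   # toy embedding of the query
doc_relevant = [0.8, 0.2, 0.1]   # toy embedding of an on-topic document
doc_offtopic = [0.0, 0.1, 0.9]   # toy embedding of an off-topic document
assert cosine(query, doc_relevant) > cosine(query, doc_offtopic)
```

In practice you would embed a corpus once, then rank documents for each query by this score (or an approximate-nearest-neighbor index over the same geometry).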

m_ric (@aymericroucher)'s Twitter Profile Photo

English Wikipedia is 29.4B characters, with ~1.5 bits of information per character. So that means, at 3.6 bits per parameter, a 12B model could memorize all of English Wikipedia. It seems wild!
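The arithmetic checks out, using only the figures quoted in the tweet:

```python
# 29.4B chars × 1.5 bits/char of information, stored at 3.6 bits/parameter.
chars = 29.4e9
bits_per_char = 1.5
total_bits = chars * bits_per_char           # 44.1e9 bits of information
bits_per_param = 3.6
params_needed = total_bits / bits_per_param  # ≈ 12.25e9 parameters
print(f"{params_needed / 1e9:.2f}B params")  # → 12.25B params
```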

m_ric (@aymericroucher)'s Twitter Profile Photo

Almost didn't take time to celebrate: Smolagents just whooshed past 20k stars on GitHub! ⭐️

Congrats to the team, especially Albert Villanova who spends lots of time improving the library. And thanks a lot to our community members for pushing us forward! 👏

It's still time to…