Sinatras (@myainotez)'s Twitter Profile
Sinatras

@myainotez

Entropy Preservation Officer
Bs CS&EE , AI/ML Engineer in Automotive

ID: 1795764095041302529

Link: http://sinatras.dev · Joined: 29-05-2024 10:28:32

939 Tweets

751 Followers

253 Following

Luis (@lusxvr)'s Twitter Profile Photo

FineVision is not only bigger and more diverse than 3 popular open-source alternatives, models trained on it also perform significantly better.

Check out all the details in the Blog Post: huggingface.co/spaces/Hugging…
Omar Sanseviero (@osanseviero)'s Twitter Profile Photo

Introducing EmbeddingGemma🎉

🔥With only 308M params, this is the top open model under 500M
🌏Trained on 100+ languages
🪆Flexible embeddings (768 to 128 dims) with Matryoshka
🤗Works with your favorite open tools
🤏Runs with as little as 200MB

developers.googleblog.com/en/introducing…
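The Matryoshka trick mentioned above means the leading coordinates of an embedding carry the most information, so you can truncate a 768-dim vector to 128 dims and re-normalize it, rather than retraining a smaller model. A minimal sketch of that truncation step (the 768-dim vector here is a random stand-in, not real model output):

```python
import math
import random

def truncate_embedding(vec, dim):
    """Matryoshka-style truncation: keep the first `dim` coordinates,
    then re-normalize so cosine similarity still behaves."""
    head = vec[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    return [x / norm for x in head] if norm > 0 else head

# Stand-in for a 768-dim embedding from a model like EmbeddingGemma.
random.seed(0)
full = [random.gauss(0, 1) for _ in range(768)]

small = truncate_embedding(full, 128)
print(len(small))  # 128
```

In practice, libraries that support Matryoshka embeddings expose this as an option at encode time; the point is that the truncated vector remains a valid unit-length embedding.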
DailyPapers (@huggingpapers)'s Twitter Profile Photo

Meta FAIR just unveiled VLWM on Hugging Face! This Vision Language World Model is a new foundation for planning with reasoning directly on natural videos. It combines reactive (system-1) and reflective (system-2) planning for SoTA performance!

OpenAI (@openai)'s Twitter Profile Photo

By popular request: you can now branch conversations in ChatGPT, letting you more easily explore different directions without losing your original thread.

Available now to logged-in users on web.
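Conceptually, branching like this just means the conversation is a tree rather than a list: each message points at its parent, so two branches share history up to the fork point and neither mutates the other. A hypothetical sketch (names are illustrative, not OpenAI's actual data model):

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Message:
    text: str
    parent: Optional["Message"] = None

def thread(leaf):
    """Walk parent pointers to recover the full history of one branch."""
    out = []
    while leaf is not None:
        out.append(leaf.text)
        leaf = leaf.parent
    return list(reversed(out))

root = Message("What is entropy?")
a1 = Message("Entropy measures disorder...", parent=root)
# Branch the same thread in two directions without losing the original:
b1 = Message("Explain it thermodynamically.", parent=a1)
b2 = Message("Explain it information-theoretically.", parent=a1)

print(thread(b1))  # shares its first two messages with thread(b2)
```

Because branches only hold parent pointers, forking is O(1) and the original thread stays untouched.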
Qwen (@alibaba_qwen)'s Twitter Profile Photo

Big news: Introducing Qwen3-Max-Preview (Instruct) — our biggest model yet, with over 1 trillion parameters! 🚀

Now available via Qwen Chat & Alibaba Cloud API.

Benchmarks show it beats our previous best, Qwen3-235B-A22B-2507. Internal tests + early user feedback confirm:
Prime Intellect (@primeintellect)'s Twitter Profile Photo

Lean 4 Theorem Proving

Multi-turn formal theorem proving in Lean 4, where models alternate between reasoning, sketching proof code, and receiving feedback. Ideal for search-guided RL, process rewards, and curriculum design.

By Sinatras app.primeintellect.ai/dashboard/envi…
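To give a flavor of a single turn in such an environment: the model is handed a goal, emits a candidate proof term or tactic script, and the Lean checker replies with success or an error message the model can react to. A toy goal of that kind, closed with a core-library lemma:

```lean
-- The environment poses a goal; the model sketches a proof and the
-- Lean 4 kernel checks it, returning errors on a failed attempt.
theorem add_comm' (a b : Nat) : a + b = b + a :=
  Nat.add_comm a b
```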

Guilherme Penedo (@gui_penedo)'s Twitter Profile Photo

> we've hit a data wall
> pretraining is dead

Is it?

Today we are releasing 📄 FinePDFs: 3T tokens of new text data for pre-training that until now had been locked away inside PDFs. 
It is the largest permissively licensed corpus sourced exclusively from PDFs.
elie (@eliebakouch)'s Twitter Profile Photo

Freshly curated open dataset with 3T multilingual tokens from PDFs.

> containing about 3 trillion tokens across 475 million documents in 1,733 languages.
> new source of data, with a knowledge cutoff in february 2025
> sota performance when mixed with fineweb-edu/dclm.
Sinatras (@myainotez)'s Twitter Profile Photo

Test it for free. I got some impressive eval results for its size on a couple of domain-specific tasks this past week. Will revisit it with SFT in the near future.

Sinatras (@myainotez)'s Twitter Profile Photo

Official reminder that your favorite coding assistant will be quantized to 1.58 bits in ~10 minutes. Have fun with the smart model while it lasts. Until next time, cheers.
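The "1.58 bits" in the joke refers to ternary weights: with each weight restricted to {-1, 0, +1}, the information content is log2(3) ≈ 1.585 bits per weight, as in BitNet-style quantization. A minimal absmean-style sketch of that rounding step (illustrative only, not any vendor's actual kernel):

```python
import math

def quantize_ternary(weights):
    """Scale by the mean absolute value, then snap each weight to -1/0/+1."""
    scale = sum(abs(w) for w in weights) / len(weights) or 1.0
    q = [max(-1, min(1, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Approximate reconstruction: each ternary value times the shared scale."""
    return [v * scale for v in q]

w = [0.8, -0.3, 0.05, -1.2]
q, s = quantize_ternary(w)
print(q)  # every entry lies in {-1, 0, 1}
print(round(math.log2(3), 3))  # 1.585 bits per ternary weight
```

Small weights collapse to 0 and everything else to ±1 times a single shared scale, which is exactly why a "smart" full-precision model feels blunter after the squeeze.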