Milan Kryl (@mikr) Twitter Tweets • TwiCopy

Marktechpost AI Research News ⚡

a year ago

LLMs No Longer Require Powerful Servers: Researchers from MIT, KAUST, ISTA, and Yandex Introduce a New AI Approach to Rapidly Compress Large Language Models without a Significant Loss of Quality The Yandex Research team, together with researchers from the Massachusetts Institute

thumb_up_off_alt87

chat_bubble_outline3

repeat22

shareShare

Sergei Nozdrenkov

@nozdrenkov

a year ago

We’re open-sourcing 352GB of Coral Reef pics (13 sites, 90k pics) from Indonesia under CC-BY-4.0 🌏🪸 3D photogrammetry data to accelerate research/conservation, no strings attached 🤗 🔵 Why? Coral reefs are so precious, beautiful, incredibly complex and threatened ecosystems.

thumb_up_off_alt375

chat_bubble_outline6

repeat88

shareShare

Aurick Qiao

@aurickq

a year ago

Excited to share our work on Speculative Decoding Snowflake AI Research! 🚀 4x faster LLM inference for coding agents like OpenHands All Hands AI 💬 2.4x faster LLM inference for interactive chat 💻 Open-source via Arctic Inference as a plugin for vLLM 🧵

Excited to share our work on Speculative Decoding <a href="/Snowflake/">Snowflake</a> AI Research!

🚀 4x faster LLM inference for coding agents like OpenHands <a href="/allhands_ai/">All Hands AI</a>

💬 2.4x faster LLM inference for interactive chat

💻 Open-source via Arctic Inference as a plugin for <a href="/vllm_project/">vLLM</a>

🧵

thumb_up_off_alt164

chat_bubble_outline3

repeat38

shareShare

clem 🤗

@clementdelangue

a year ago

The LeRobot hackathon is now scheduled to happen in 44 different locations at the same time. Which city is missing: London (UK) - Cotono (Benin) - Toulouse, Paris & 2 in Lyon (France) - Antwerp (Belgium) - Santiago (Chile) - Isfahan (Iran) - Anchen, Berlin & Munich (Germany)

thumb_up_off_alt203

chat_bubble_outline34

repeat28

shareShare

Harrison Kinsley

@sentdex

a year ago

Idk who needs to see this, but the Unitree G1 with the back plate off:

thumb_up_off_alt4,4K

chat_bubble_outline330

repeat385

shareShare

Neel Kant

@_neel_kant

a year ago

🎉Factorio Learning Environment 0.2.0 released! 📖Details: jackhopkins.github.io/factorio-learn… New Features: - Multi-agent support - Reasoning models + MCP - Reflection & backtracking - Vision-augmented inputs and more frontier model results! The initial release of FLE was met with great

thumb_up_off_alt208

chat_bubble_outline8

repeat26

shareShare

Crémieux

@cremieuxrecueil

a year ago

Someone slapped together a big dataset of password leaks today. Here's the distribution of pin numbers from a few times those got leaked a while back.

thumb_up_off_alt8,8K

chat_bubble_outline485

repeat1,1K

shareShare

Black Forest Labs

@bfl_ml

a year ago

High quality image editing no longer needs closed models We release FLUX.1 Kontext [dev] - an open weights model for proprietary-level image editing performance. Runs on consumer chips. ✓ Open weights available ✓ Best in-class performance ✓ Self-serve commercial licensing

thumb_up_off_alt1,1K

chat_bubble_outline89

repeat350

shareShare

atharva

@k7agar

10 months ago

its outttt

thumb_up_off_alt4,4K

chat_bubble_outline93

repeat274

shareShare

elie

@eliebakouch

10 months ago

New variant of attention by meta going beyond the standard bilinear form. It's changing the beta coef in scaling laws (which is a big deal) + there is a efficient triton implementation. Huge.

thumb_up_off_alt517

chat_bubble_outline16

repeat48

shareShare

机器之心 JIQIZHIXIN

@synced_global

10 months ago

Another attention! Introducing Power Attention—a breakthrough in efficient attention mechanisms! This novel linear-cost attention layer features tunable state size, completely decoupled from model parameters. Why it stands out: ⚡ Blazing-fast GPU kernels with fused operations

thumb_up_off_alt135

chat_bubble_outline4

repeat20

shareShare

Stanford Online

@stanfordonline

10 months ago

Our latest CS336 Language Modeling from Scratch lectures are now available! View the entire playlist here: youtube.com/playlist?list=…

thumb_up_off_alt1,1K

chat_bubble_outline5

repeat163

shareShare

Philipp Schmid

@_philschmid

10 months ago

Interesting new Memory framework and paper released! MemOS claims to outperform competitors by treating memory as OS-like framework using a three-layer system: Interface, Operation, and Infrastructure. MemOS is an open-source library that claims to differentiates itself by: 1.

thumb_up_off_alt263

chat_bubble_outline8

repeat38

shareShare

Dmitry Krotov

@dimakrotov

10 months ago

In physics there is an elegant method for computing the correlation functions called generating function. The idea is simple - instead of computing correlators one by one - you define a function of a parameter and compute the average of that new function. Individual correlators

thumb_up_off_alt1,1K

chat_bubble_outline24

repeat183

shareShare

elie

@eliebakouch

10 months ago

Kimi team just trained a state of the art open source model 32B active parameter/1T total with 0 training instabilities, thanks to MuonClip, this is amazing

thumb_up_off_alt1,1K

chat_bubble_outline20

repeat101

shareShare

Sukjun (June) Hwang

@sukjun_hwang

10 months ago

Tokenization has been the final barrier to truly end-to-end language models. We developed the H-Net: a hierarchical network that replaces tokenization with a dynamic chunking process directly inside the model, automatically discovering and operating over meaningful units of data

thumb_up_off_alt2,2K

chat_bubble_outline58

repeat355

shareShare

Tiezhen WANG

@xianbao_qian

10 months ago

Why did Kimi.ai switched from closed source to open-source and released K2? - Reputation. If K2 is API only, it might have ended up like Grok-4 - clearly well-built yet still taking a lot of flak. - Work with the whole ecosystem. Less than 24 hours after release, the

Why did <a href="/Kimi_Moonshot/">Kimi.ai</a> switched from closed source to open-source and released K2?

- Reputation. If K2 is API only, it might have ended up like <a href="/grok/">Grok</a>-4 - clearly well-built yet still taking a lot of flak.

- Work with the whole ecosystem. Less than 24 hours after release, the

thumb_up_off_alt137

chat_bubble_outline3

repeat16

shareShare

👋 Jan

@jandotai

10 months ago

Mistral released 2 open-source speech models: Voxtral (24B) & Voxtral Mini (3B). Both beat Whisper v3, GPT-4o-mini, and Scribe on ASR across 7+ languages, while supporting Q&A, summarization, and function-calling directly from voice. huggingface.co/mistralai

thumb_up_off_alt165

chat_bubble_outline2

repeat21

shareShare

Mistral AI

@mistralai

10 months ago

In our continued commitment to open-science, we are releasing the Voxtral Technical Report: arxiv.org/abs/2507.13264 The report covers details on pre-training, post-training, alignment and evaluations. We also present analysis on selecting the optimal model architecture, which

thumb_up_off_alt1,1K

chat_bubble_outline38

repeat194

shareShare

Black Forest Labs

@bfl_ml

9 months ago

Today we are releasing FLUX.1 Krea [dev] - a new state-of-the-art open-weights FLUX model, built for photorealism. Developed in collaboration with KREA AI, this model is focused on images with unique aesthetics. No “AI look”, no blown-out highlights, just natural detail.

thumb_up_off_alt404

chat_bubble_outline18

repeat70

shareShare