SakaiSec (@sksec_) Twitter Tweets • TwiCopy

MiniMax (official)

3 months ago

Q: Why choose CISPO instead of GSPO or GRPO? How well does CISPO adapt to MoE, and does changing the RL algorithm require architectural refactoring? GRPO predates both, but in our attempts to reproduce R1-Zero it proved unreliable: PPO-style clipping caused token-level gradients

thumb_up_off_alt166

chat_bubble_outline1

repeat19

shareShare

Kimi.ai

@kimi_moonshot

3 months ago

Kimi K2.5 tech report just dropped! Quick hits: - Joint text–vision training: pretrained with 15T vision-text tokens, zero-vision SFT (text-only) to activate visual reasoning - Agent Swarm + PARL: dynamically orchestrated parallel sub-agents, up to 4.5× lower latency, 78.4% on

thumb_up_off_alt1,1K

chat_bubble_outline54

repeat289

shareShare

Cerebras

@cerebrassystems

3 months ago

GLM 4.7 is one of the strongest open-source coding models available—but most developers aren't prompting it correctly. We put together 10 rules to help you get the most out of it: - Front-load instructions (it has a strong recency bias) - Use firm language: "must" and

thumb_up_off_alt984

chat_bubble_outline47

repeat63

shareShare

DailyPapers

@huggingpapers

3 months ago

Self-Distillation Enables Continual Learning MIT & ETH Zurich researchers introduce SDFT for on-policy learning from demonstrations. It uses demonstration-conditioned models as their own teacher to reduce catastrophic forgetting, outperforming SFT and enabling sequential skill

thumb_up_off_alt99

chat_bubble_outline1

repeat21

shareShare

SakaiSec

@sksec_

3 months ago

CTF as a Benchmark

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

𝕎𝕠𝕝𝕗 𝕋𝕣𝕒𝕚𝕟𝕖𝕣 牧狼人

@wolftrainer_101

3 months ago

挖国内云厂商漏洞前真心建议多刷刷wiz的题，还有云鼎早期的文章，80万月榜不是梦😄。很多同学都没掌握挖云厂商漏洞精髓，主动放弃了。还有一些被云产品的价格劝退了，说实话舍不得孩子套不着狼😑….

thumb_up_off_alt118

chat_bubble_outline4

repeat17

shareShare

Qoder

@qoder_ai_ide

3 months ago

Qwen-Coder-Qoder is live! We’ve launched a customized model built on Alibaba’s Qwen-Coder, fine-tuned via large-scale RL specifically for the Qoder. Learn more: qoder.com/blog/qwen-code…

thumb_up_off_alt123

chat_bubble_outline20

repeat13

shareShare

Unsloth AI

@unslothai

3 months ago

Qwen releases Qwen3-Coder-Next. 💜 The new 80B MoE model excels at agentic coding & local use. With 256K context, it delivers similar performance to models with 10-20× more active parameters. Run on 46GB RAM or less. Guide: unsloth.ai/docs/models/qw… GGUF: huggingface.co/unsloth/Qwen3-…

thumb_up_off_alt1,1K

chat_bubble_outline47

repeat186

shareShare

Watcher.Guru

@watcherguru

3 months ago

JUST IN: Ethereum Founder Vitalik Buterin sold 2,972 ETH for $6.69 million over the past 3 days.

thumb_up_off_alt11,11K

chat_bubble_outline1,1K

repeat1,1K

shareShare

GMOサイバーセキュリティ byイエラエ株式会社【公式】

@gmo_ierae

3 months ago

【合格体験記】カーネルモードから見る防御 ― CETP合格までの軌跡 gmo-cybersecurity.com/blog/evasion-l…

thumb_up_off_alt76

chat_bubble_outline0

repeat12

shareShare

mitsu 𓃦

@mitsufoppie

3 months ago

you can use gmod to verify your age by the way

thumb_up_off_alt73,73K

chat_bubble_outline194

repeat4,4K

shareShare

a16z

@a16z

3 months ago

We're thrilled to lead Shizuku AI's seed round. In January 2023, while still completing his Ph.D. at UC Berkeley, Akio Kodaira launched an AI VTuber named Shizuku on YouTube. He ran dozens of streams, building a community of thousands of followers who tuned in to talk with an AI

thumb_up_off_alt1,1K

chat_bubble_outline86

repeat298

shareShare

GMOサイバーセキュリティ byイエラエ株式会社【公式】

@gmo_ierae

3 months ago

当社エンジニアの金子孟司が発見した攻撃手法がPortSwigger社の「Top 10 Web Hacking Techniques of 2025」に選出されました gmo-cybersecurity.com/news/20260210/

thumb_up_off_alt69

chat_bubble_outline0

repeat9

shareShare

Z.ai

@zai_org

3 months ago

Introducing GLM-5: From Vibe Coding to Agentic Engineering GLM-5 is built for complex systems engineering and long-horizon agentic tasks. Compared to GLM-4.5, it scales from 355B params (32B active) to 744B (40B active), with pre-training data growing from 23T to 28.5T tokens.

thumb_up_off_alt5,5K

chat_bubble_outline297

repeat778

shareShare

SakaiSec

@sksec_

3 months ago

GLM なのに DeepSeek Sparse Attention

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Ant Ling

@antling20041208

2 months ago

🚀 Unveiling Ring-1T-2.5 The first hybrid linear-architecture 1T thinking model. -Efficient: Hybrid linear breakthrough (10x lower memory) -Gold Tier: IMO25 (35/42) & CMO25 (105/126) -Agentic: Natively with Claude Code & OpenClaw -Open SOTA: IMOAnswerBench，GAIA2-search & more!

thumb_up_off_alt1,1K

chat_bubble_outline36

repeat144

shareShare

MiniMax (official)

@minimax__ai

2 months ago

Introducing M2.5, an open-source frontier model designed for real-world productivity. - SOTA performance at coding (SWE-Bench Verified 80.2%), search (BrowseComp 76.3%), agentic tool-calling (BFCL 76.8%) & office work. - Optimized for efficient execution, 37% faster at complex

thumb_up_off_alt4,4K

chat_bubble_outline262

repeat500

shareShare

Zhengfu He

@zhengfuhe

2 months ago

We built a Complete Replacement Model (CRM) that fully sparsifies a language model. This brings many changes to circuit tracing and global circuits. (1/n)