SakaiSec (@sksec_) 's Twitter Profile
SakaiSec

@sksec_

18 y/o Hacker / Bug Hunter #hackthebox #seccamp #hardening #codeblue #bugbounty

ID: 1729634547913175040

calendar_today28-11-2023 22:53:16

122 Tweet

439 Followers

576 Following

MiniMax (official) (@minimax__ai) 's Twitter Profile Photo

Q: Why choose CISPO instead of GSPO or GRPO? How well does CISPO adapt to MoE, and does changing the RL algorithm require architectural refactoring? GRPO predates both, but in our attempts to reproduce R1-Zero it proved unreliable: PPO-style clipping caused token-level gradients

Kimi.ai (@kimi_moonshot) 's Twitter Profile Photo

Kimi K2.5 tech report just dropped! Quick hits: - Joint text–vision training: pretrained with 15T vision-text tokens, zero-vision SFT (text-only) to activate visual reasoning - Agent Swarm + PARL: dynamically orchestrated parallel sub-agents, up to 4.5× lower latency, 78.4% on

Kimi K2.5 tech report just dropped!

Quick hits:
- Joint text–vision training: pretrained with 15T vision-text tokens, zero-vision SFT (text-only) to activate visual reasoning
- Agent Swarm + PARL: dynamically orchestrated parallel sub-agents, up to 4.5× lower latency, 78.4% on
Cerebras (@cerebrassystems) 's Twitter Profile Photo

GLM 4.7 is one of the strongest open-source coding models available—but most developers aren't prompting it correctly. We put together 10 rules to help you get the most out of it: - Front-load instructions (it has a strong recency bias) - Use firm language: "must" and

DailyPapers (@huggingpapers) 's Twitter Profile Photo

Self-Distillation Enables Continual Learning MIT & ETH Zurich researchers introduce SDFT for on-policy learning from demonstrations. It uses demonstration-conditioned models as their own teacher to reduce catastrophic forgetting, outperforming SFT and enabling sequential skill

Self-Distillation Enables Continual Learning

MIT & ETH Zurich researchers introduce SDFT for on-policy learning from demonstrations. It uses demonstration-conditioned models as their own teacher to reduce catastrophic forgetting, outperforming SFT and enabling sequential skill
𝕎𝕠𝕝𝕗 𝕋𝕣𝕒𝕚𝕟𝕖𝕣 牧狼人 (@wolftrainer_101) 's Twitter Profile Photo

挖国内云厂商漏洞前真心建议多刷刷wiz的题,还有云鼎早期的文章,80万月榜不是梦😄。很多同学都没掌握挖云厂商漏洞精髓,主动放弃了。还有一些被云产品的价格劝退了,说实话舍不得孩子套不着狼😑….

Qoder (@qoder_ai_ide) 's Twitter Profile Photo

Qwen-Coder-Qoder is live! We’ve launched a customized model built on Alibaba’s Qwen-Coder, fine-tuned via large-scale RL specifically for the Qoder. Learn more: qoder.com/blog/qwen-code…

Unsloth AI (@unslothai) 's Twitter Profile Photo

Qwen releases Qwen3-Coder-Next. 💜 The new 80B MoE model excels at agentic coding & local use. With 256K context, it delivers similar performance to models with 10-20× more active parameters. Run on 46GB RAM or less. Guide: unsloth.ai/docs/models/qw… GGUF: huggingface.co/unsloth/Qwen3-…

Qwen releases Qwen3-Coder-Next. 💜

The new 80B MoE model excels at agentic coding & local use. With 256K context, it delivers similar performance to models with 10-20× more active parameters.

Run on 46GB RAM or less.

Guide: unsloth.ai/docs/models/qw…
GGUF: huggingface.co/unsloth/Qwen3-…
a16z (@a16z) 's Twitter Profile Photo

We're thrilled to lead Shizuku AI's seed round. In January 2023, while still completing his Ph.D. at UC Berkeley, Akio Kodaira launched an AI VTuber named Shizuku on YouTube. He ran dozens of streams, building a community of thousands of followers who tuned in to talk with an AI

We're thrilled to lead Shizuku AI's seed round.

In January 2023, while still completing his Ph.D. at UC Berkeley, Akio Kodaira launched an AI VTuber named Shizuku on YouTube. He ran dozens of streams, building a community of thousands of followers who tuned in to talk with an AI
GMOサイバーセキュリティ byイエラエ株式会社【公式】 (@gmo_ierae) 's Twitter Profile Photo

当社エンジニアの金子 孟司が発見した攻撃手法がPortSwigger社の「Top 10 Web Hacking Techniques of 2025」に選出されました gmo-cybersecurity.com/news/20260210/

Z.ai (@zai_org) 's Twitter Profile Photo

Introducing GLM-5: From Vibe Coding to Agentic Engineering GLM-5 is built for complex systems engineering and long-horizon agentic tasks. Compared to GLM-4.5, it scales from 355B params (32B active) to 744B (40B active), with pre-training data growing from 23T to 28.5T tokens.

Introducing GLM-5: From Vibe Coding to Agentic Engineering

GLM-5 is built for complex systems engineering and long-horizon agentic tasks. Compared to GLM-4.5, it scales from 355B params (32B active) to 744B (40B active), with pre-training data growing from 23T to 28.5T tokens.
Ant Ling (@antling20041208) 's Twitter Profile Photo

🚀 Unveiling Ring-1T-2.5 The first hybrid linear-architecture 1T thinking model. -Efficient: Hybrid linear breakthrough (10x lower memory) -Gold Tier: IMO25 (35/42) & CMO25 (105/126) -Agentic: Natively with Claude Code & OpenClaw -Open SOTA: IMOAnswerBench,GAIA2-search & more!

🚀 Unveiling Ring-1T-2.5 The first hybrid linear-architecture 1T thinking model.
-Efficient: Hybrid linear breakthrough (10x lower memory)
-Gold Tier: IMO25 (35/42) & CMO25 (105/126)
-Agentic: Natively with Claude Code & OpenClaw
-Open SOTA: IMOAnswerBench,GAIA2-search & more!
MiniMax (official) (@minimax__ai) 's Twitter Profile Photo

Introducing M2.5, an open-source frontier model designed for real-world productivity. - SOTA performance at coding (SWE-Bench Verified 80.2%), search (BrowseComp 76.3%), agentic tool-calling (BFCL 76.8%) & office work. - Optimized for efficient execution, 37% faster at complex

Introducing M2.5, an open-source frontier model designed for real-world productivity.

- SOTA performance at coding (SWE-Bench Verified 80.2%), search (BrowseComp 76.3%), agentic tool-calling (BFCL 76.8%) & office work.

- Optimized for efficient execution, 37% faster at complex
Zhengfu He (@zhengfuhe) 's Twitter Profile Photo

We built a Complete Replacement Model (CRM) that fully sparsifies a language model. This brings many changes to circuit tracing and global circuits. (1/n)

We built a Complete Replacement Model (CRM) that fully sparsifies a language model.

This brings many changes to circuit tracing and global circuits. (1/n)
moly (@morimolymoly2) 's Twitter Profile Photo

BQTLockとGREENBLOODについてANYRUNより Go製のランサムウェアもある。。。書きやすそうではある。ちゃんと侵入後の検知手法も書いてあるいい記事! any.run/cybersecurity-…