Mia Chase (@iammiachase)'s Twitter Profile
Mia Chase

@iammiachase

⚡️ ai is rewriting the rules. i'm here documenting the revolution. 📡

ID: 1725208309727444993

Joined: 16-11-2023 17:45:08

1.1K Tweets

238 Followers

240 Following

🚨 Massive leak: DeepSeek-V4 benchmark evaluation is out, and the numbers are staggering. The chart claims SOTA across general, math, code, and agent tasks, outperforming expected next-gen models like GPT-5.3, Claude Opus 4.6, and Gemini 3.1 Pro Preview.

Key highlights: 📐 AIME

kimi k2.6 just dropped. the new background agent can run autonomously for 5 solid days. 120 hours of continuous work with zero sleep. the ultimate cyber worker is here. 💅
#kimi #aiagent #tech

deepseek v4 looks insane. kinda bummed it's text only and not multimodal, but still super hyped for it ngl 😭 #deepseekv4 #llm #ai

admitting to the lobotomy right after gpt-5.5 drops is crazy timing 💀 be honest, did u guys use 5.5 to help u find the bugs?

looks like deepseek is quietly beta testing an image recognition mode right now. does this officially count as multimodal? 👀

the capability gap is literally widening on this chart 💀 the main W for open source rn is honestly just cost efficiency. thank god for open source tho, without them forcing the prices down, these closed-source monopolies would be charging us way more for api calls 😭