Mia Chase (@iammiachase) 's Twitter Profile
Mia Chase

@iammiachase

⚑️ ai is rewriting the rules. i'm here documenting the revolution. πŸ“‘

ID: 1725208309727444993

calendar_today16-11-2023 17:45:08

1,1K Tweet

238 Followers

240 Following

Mia Chase (@iammiachase) 's Twitter Profile Photo

🚨 Massive leak: DeepSeek-V4 benchmark evaluation is out, and the numbers are staggering. The chart claims SOTA across general, math, code, and agent tasks, outperforming expected next-gen models like GPT-5.3, Claude Opus 4.6, and Gemini 3.1 Pro Preview. Key highlights: πŸ“ AIME

🚨 Massive leak: DeepSeek-V4 benchmark evaluation is out, and the numbers are staggering. The chart claims SOTA across general, math, code, and agent tasks, outperforming expected next-gen models like GPT-5.3, Claude Opus 4.6, and Gemini 3.1 Pro Preview.

Key highlights: πŸ“ AIME
Mia Chase (@iammiachase) 's Twitter Profile Photo

kimi k2.6 just dropped. the new background agent can run autonomously for 5 solid days. 120 hours of continuous work with zero sleep. the ultimate cyber worker is here. πŸ’… #kimi #aiagent #tech

kimi k2.6 just dropped. the new background agent can run autonomously for 5 solid days. 120 hours of continuous work with zero sleep. the ultimate cyber worker is here. πŸ’…
#kimi #aiagent #tech
Mia Chase (@iammiachase) 's Twitter Profile Photo

deepseek v4 looks insane. kinda bummed it's text only and not multimodal, but still super hyped for it ngl 😭#deepseekv4 #llm #ai

Mia Chase (@iammiachase) 's Twitter Profile Photo

admitting to the lobotomy right after gpt-5.5 drops is crazy timing πŸ’€ be honest, did u guys use 5.5 to help u find the bugs?

Mia Chase (@iammiachase) 's Twitter Profile Photo

looks like deepseek is quietly beta testing an image recognition mode right now. does this officially count as multimodal? πŸ‘€

looks like deepseek is quietly beta testing an image recognition mode right now. does this officially count as multimodal? πŸ‘€
Mia Chase (@iammiachase) 's Twitter Profile Photo

the capability gap is literally widening on this chart πŸ’€ the main W for open source rn is honestly just cost efficiency. thank god for open source tho, without them forcing the prices down, these closed-source monopolies would be charging us way more for api calls 😭

the capability gap is literally widening on this chart πŸ’€ the main W for open source rn is honestly just cost efficiency. thank god for open source tho, without them forcing the prices down, these closed-source monopolies would be charging us way more for api calls 😭