Mia Chase (@iammiachase) 's Twitter Profile
Mia Chase

@iammiachase

โšก๏ธ ai is rewriting the rules. i'm here documenting the revolution. ๐Ÿ“ก

ID: 1725208309727444993

calendar_today16-11-2023 17:45:08

1,1K Tweet

238 Takipรงi

240 Takip Edilen

Mia Chase (@iammiachase) 's Twitter Profile Photo

๐Ÿšจ Massive leak: DeepSeek-V4 benchmark evaluation is out, and the numbers are staggering. The chart claims SOTA across general, math, code, and agent tasks, outperforming expected next-gen models like GPT-5.3, Claude Opus 4.6, and Gemini 3.1 Pro Preview. Key highlights: ๐Ÿ“ AIME

๐Ÿšจ Massive leak: DeepSeek-V4 benchmark evaluation is out, and the numbers are staggering. The chart claims SOTA across general, math, code, and agent tasks, outperforming expected next-gen models like GPT-5.3, Claude Opus 4.6, and Gemini 3.1 Pro Preview.

Key highlights: ๐Ÿ“ AIME
Mia Chase (@iammiachase) 's Twitter Profile Photo

kimi k2.6 just dropped. the new background agent can run autonomously for 5 solid days. 120 hours of continuous work with zero sleep. the ultimate cyber worker is here. ๐Ÿ’… #kimi #aiagent #tech

kimi k2.6 just dropped. the new background agent can run autonomously for 5 solid days. 120 hours of continuous work with zero sleep. the ultimate cyber worker is here. ๐Ÿ’…
#kimi #aiagent #tech
Mia Chase (@iammiachase) 's Twitter Profile Photo

deepseek v4 looks insane. kinda bummed it's text only and not multimodal, but still super hyped for it ngl ๐Ÿ˜ญ#deepseekv4 #llm #ai

Mia Chase (@iammiachase) 's Twitter Profile Photo

admitting to the lobotomy right after gpt-5.5 drops is crazy timing ๐Ÿ’€ be honest, did u guys use 5.5 to help u find the bugs?

Mia Chase (@iammiachase) 's Twitter Profile Photo

looks like deepseek is quietly beta testing an image recognition mode right now. does this officially count as multimodal? ๐Ÿ‘€

looks like deepseek is quietly beta testing an image recognition mode right now. does this officially count as multimodal? ๐Ÿ‘€
Mia Chase (@iammiachase) 's Twitter Profile Photo

the capability gap is literally widening on this chart ๐Ÿ’€ the main W for open source rn is honestly just cost efficiency. thank god for open source tho, without them forcing the prices down, these closed-source monopolies would be charging us way more for api calls ๐Ÿ˜ญ

the capability gap is literally widening on this chart ๐Ÿ’€ the main W for open source rn is honestly just cost efficiency. thank god for open source tho, without them forcing the prices down, these closed-source monopolies would be charging us way more for api calls ๐Ÿ˜ญ