jytan (@jyt4n) Twitter Tweets • TwiCopy

jytan

@jyt4n

+ Follow

building @usetusk | LLMs, coding assistants + verification

ID: 1753031051675602944

linkhttps://jytan.io calendar_today01-02-2024 12:22:38

100 Tweet

122 Takipçi

324 Takip Edilen

Yuchen Jin

@yuchenj_uw

4 months ago

DeepSeek was a side project at High-Flyer Quant. Qwen was a side project at Alibaba. Twitter was a side project at Odeo. Mac was a side project at Apple. Meanwhile: Windows Phone was a core project at Microsoft. Metaverse was a core project at Facebook. Google Glass was a core

thumb_up_off_alt3,3K

chat_bubble_outline137

repeat379

shareShare

jytan

@jyt4n

4 months ago

December saw my highest Cursor usage so far, with ~530M tokens in the past two weeks. Yet it still feels like I'm barely scratching the surface. Tokens should flow 24/7. I am the only bottleneck, my standards, my taste, my agency.

thumb_up_off_alt7

chat_bubble_outline0

repeat0

shareShare

jytan

@jyt4n

4 months ago

Found the perfect spot to bleed out

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

jytan

@jyt4n

3 months ago

I often think about this but for software. The meta problem is that traditional anti-entropy tools themselves decay. Documentation, tests, configs, runbooks go stale. AI can help with maintenance, but we need to invest in harnesses that let AI observe, interpret, and act on our

thumb_up_off_alt7

chat_bubble_outline1

repeat0

shareShare

jytan

@jyt4n

3 months ago

Thanks Cursor Ben Lang for such a cool gift! Time to put my cat to work :)

Thanks <a href="/cursor_ai/">Cursor</a> <a href="/benln/">Ben Lang</a> for such a cool gift!

Time to put my cat to work :)

thumb_up_off_alt40

chat_bubble_outline4

repeat0

shareShare

Tusk

@usetusk

3 months ago

"...the reality is that in production, 89.6% of Tusk Drift requests never reach similarity scoring. They [only go through] input value hash matching in the priority cascade before the best mock is chosen for replay testing." jytan writes about why similarity scoring is

thumb_up_off_alt4

chat_bubble_outline1

repeat3

shareShare

jytan

@jyt4n

3 months ago

Happy to see this grow to ~100 stars! Added SSH support and key fixes the past week, supports Claude/OpenCode/Cursor Agent/Gemini. Give it a shot and let your agents go yolo (safely)

thumb_up_off_alt5

chat_bubble_outline1

repeat1

shareShare

jytan

@jyt4n

3 months ago

One of the greatest joys in life.

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

jytan

@jyt4n

3 months ago

Just got burned by Cursor's unsandboxed agent mode. I instructed the agent to use the github MCP to read an issue and investigate it, but it went ahead and use the gh CLI to push a commit straight to main, created a response for the issue and closed it. But it's so annoying

thumb_up_off_alt9

chat_bubble_outline3

repeat1

shareShare

jytan

@jyt4n

3 months ago

This is my experience as well. Codex 5.3 is less trigger-happy and does a better job judging when to stop and ask and push back vs go ahead and implement - so it seems a lot more trustworthy. Definitely appreciate being able to prompt less defensively.

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare