Viv (@vtrivedy10) Twitter Tweets • TwiCopy

Viv

a month ago

*5 minutes before bed*
- drop in feature .md file
- claude --dangerously-skip-permissions
- let it cook
- wake and start reviewing every line and running tests
hit or miss results, recommend doing this for more fun coding stuff, but can be a pleasant way to start the day

Viv

@vtrivedy10

a month ago

after building an agent builder, it’s clear why everyone exclusively/especially dog foods tf outta their own product for everything:
- find common patterns and wrap them in more determinism with workflows
- find errors and fix them with better prompting/tool design

Viv

@vtrivedy10

a month ago

Context editing + evicting doc dumps: this feels like a rlly underexplored pattern/api for context management, released a month ago by Anthropic
concrete example: you need to make edits to your project using modal’s new documentation. Use a Tool call “load_docs()” to dump the
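
A minimal sketch of the eviction idea, assuming a plain in-memory message history. This is not Anthropic's context editing API: `Message`, `evict_tool_results`, and `PLACEHOLDER` are hypothetical names made up for illustration, and only the `load_docs` tool name comes from the tweet. Once the edits that needed the docs are done, the bulky tool result is swapped for a short stub so it stops taking up context.

```python
from dataclasses import dataclass, replace
from typing import List

# Hypothetical stub left behind once a doc dump is evicted.
PLACEHOLDER = "[docs evicted from context; call load_docs() again if needed]"


@dataclass(frozen=True)
class Message:
    role: str            # "user", "assistant", or "tool"
    content: str
    tool_name: str = ""  # set only on tool results


def evict_tool_results(
    history: List[Message], tool_name: str, keep_last: int = 0
) -> List[Message]:
    """Return a copy of `history` where results of `tool_name` are stubbed out.

    The most recent `keep_last` results of that tool are left untouched.
    """
    idxs = [
        i for i, m in enumerate(history)
        if m.role == "tool" and m.tool_name == tool_name
    ]
    to_evict = set(idxs[:-keep_last] if keep_last else idxs)
    return [
        replace(m, content=PLACEHOLDER) if i in to_evict else m
        for i, m in enumerate(history)
    ]


history = [
    Message("user", "update the project to the new docs"),
    Message("tool", "<very long documentation dump>" * 1000, tool_name="load_docs"),
    Message("assistant", "edits applied"),
]

# After the edits are done, drop the dump but keep a record that it was loaded.
trimmed = evict_tool_results(history, "load_docs")
```

The original list is left unchanged, so the caller decides when the trimmed history replaces it, for example right after the agent reports that the doc-dependent edits are finished.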

Viv

@vtrivedy10

a month ago

“real analysis was hard, prob should do a CS PhD instead of math PhD” *LLMs are injective* 😦

Viv

@vtrivedy10

a month ago

i sleep better at night doing this with codex agent (full access)
at least I know it’ll take its time

Viv

@vtrivedy10

a month ago

fun release
we worked on packaging the DeepAgents Harness into everyone’s new fave interface, the CLI
open harnesses are great bc you get an opinionated quickstart to run in minutes via the CLI, but rlly you have full flexibility (models, prompts, tools, middleware definitions)

Viv

@vtrivedy10

a month ago

agi will just be a workflow generator that doesn’t make mistakes

Viv

@vtrivedy10

a month ago

docs as skills + “Memory Banks” for portable memory: been thinking about this pattern for a bit. In Agent Eng, we can do the pre-work of bringing docs/config into a centralized place (as Skills). Locally loaded with the agent means less errors from finding the right thing via

Viv

@vtrivedy10

a month ago

ask Claude to review my code
“THIS IS BEAUTIFUL”
literally doesn’t work
there’s beauty in pain

Viv

@vtrivedy10

a month ago

Claude Code in the mobile app is fantastic, low friction UX for dev-on-the-go
biased bc most things i do are code, but simple steps:
1. pull down a git repo, spins up a VM running CC
2. chat!
i mainly use it for review, understanding a new repo I’ve forked, planning
nice UX, i

Viv

@vtrivedy10

a month ago

if someone talked to me the way i talk to haiku 4.5 😤👊🏽

Viv

@vtrivedy10

a month ago

cognition out here explicitly calling out that you literally cannot Tab Tab Tab vibecode something good+complex, it's slop
the first time it'll fail is a small misunderstanding in codebase logic, and that just multiplies over time and codebase size till everything's cooked
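
A rough back-of-the-envelope sketch of the "multiplies over time" point, under a loud assumption that is not in the tweet: every accepted edit is independently right with the same probability. Real codebases are messier, and `chance_all_steps_correct` is a hypothetical helper, but the toy model shows how a small per-step error rate compounds.

```python
def chance_all_steps_correct(per_step_accuracy: float, steps: int) -> float:
    """Probability that every one of `steps` independent edits is correct.

    Assumes each edit is right with the same probability and that errors
    are independent, which is a simplification for illustration only.
    """
    return per_step_accuracy ** steps


# Even a 99%-reliable step decays quickly as the number of accepted edits grows.
for steps in (1, 10, 50, 200):
    print(steps, round(chance_all_steps_correct(0.99, steps), 3))
```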

Viv

@vtrivedy10

a month ago

interested in a potential future where the thing that determined success for an Agent Task was largely the search and context preparation step that preceded it (i.e. any model of sufficient intelligence would have solved the task because of the alley-oop it was thrown) makes

Viv

@vtrivedy10

a month ago

when all those PhD hours spent on optimizing tf out of representations for retrieval might come in handy 🥹
everything’s a cycle, there’s a lot of nice tricks and ideas in Information Retrieval but it’s incredibly hard to beat compute:
- bigger model for embedding
- multivector

Viv

@vtrivedy10

a month ago

who’s gonna be the first big player to disavow mcp without beating around the bush 👀

Viv

@vtrivedy10

a month ago

a fun behavior i'm trying to prompt into my agent builder harness... get the agent to interview me:
- figure out roughly what i want, and then dive into the details (planning)
- share thoughts and get my takes/feedback (iterative edits)
- help me define what a successful agent

Viv

@vtrivedy10

a month ago

“be surgical in every file edit” what does it mean? no one knows but it gets the agent going

Viv

@vtrivedy10

a month ago

embedding based retrieval is back in vogue….ready for all the rediscoveries :) my bet: someone renames query augmentation next week

Viv

@vtrivedy10

a month ago

cool paper! We got RLVR GANs and the discriminator has some more agency than classification

Viv

@vtrivedy10

23 days ago

fun release today for the sandbox pilled and not yet sandbox pilled fam :)
the example I walk through in the video is something I do often. pull down a repo somewhere, let the agent do good work with a PR, review later
and right now i’m especially excited about what sandboxes
