Viv (@vtrivedy10) 's Twitter Profile
Viv

@vtrivedy10

Building AI for Creativity ✨ // prev @awscloud

ID: 1222786244

calendar_today26-02-2013 19:26:31

1,1K Tweet

445 Followers

2,2K Following

Viv (@vtrivedy10) 's Twitter Profile Photo

*5 minutes before bed* - drop in feature .md file - claude —dangerously-skip-permissions - let it cook - wake and start reviewing every line and running tests hit or miss results, recommend doing this for more fun coding stuff, but can be a pleasant way to start the day

Viv (@vtrivedy10) 's Twitter Profile Photo

after building an agent builder, it’s clear why everyone exclusively/esepcially dog foods tf outta their own product for everything: - find common patterns and wrap them in more determinism with workflows - find errors and fix them with better prompting/tool design

Viv (@vtrivedy10) 's Twitter Profile Photo

Context editing + evicting doc dumps: this feels like a rlly underexplored pattern/api for context management, released a month ago by Anthropic concrete example: you need to make edits to your project using modal’s new documentation. Use a Tool call “load_docs()” to dump the

Context editing + evicting doc dumps:

this feels like a rlly underexplored pattern/api for context management, released a month ago by Anthropic 

concrete example: you need to make edits to your project using modal’s new documentation.  Use a Tool call “load_docs()” to dump the
Viv (@vtrivedy10) 's Twitter Profile Photo

fun release we worked on packaging the DeepAgents Harness into everyone’s new fave interface, the CLI open harnesses are great bc you get an opinionated quickstart to run in minutes via the CLI, but rlly you have full flexibility (models, prompts, tools, middleware definitions)

Viv (@vtrivedy10) 's Twitter Profile Photo

docs as skills + “Memory Banks” for portable memory: been thinking about this pattern for a bit. In Agent Eng, we can do the pre-work of bringing docs/config into a centralized place (as Skills). Locally loaded with the agent means less errors from finding the right thing via

Viv (@vtrivedy10) 's Twitter Profile Photo

Claude Code in the mobile app is fantastic, low friction UX for dev-on-the-go biased bc most things i do is code but simple steps: 1. pull down a git repo, spins up a VM running CC 2. chat! i mainly use it for review, understanding a new repo I’ve forked, planning nice UX, i

Claude Code in the mobile app is fantastic, low friction UX for dev-on-the-go

biased bc most things i do is code but simple steps:
1. pull down a git repo, spins up a VM running CC
2. chat!  i mainly use it for review, understanding a new repo I’ve forked, planning

nice UX, i
Viv (@vtrivedy10) 's Twitter Profile Photo

cognition out here explicitly calling out that you literally cannot Tab Tab Tab vibecode something good+complex, it's slop the first time it'll fail is a small misunderstanding in codebase logic, and that just multiplies over time and codebase size till everything's cooked

cognition out here explicitly calling out that you literally cannot Tab Tab Tab vibecode something good+complex, it's slop

the first time it'll fail is a small misunderstanding in codebase logic, and that just multiplies over time and codebase size till everything's cooked
Viv (@vtrivedy10) 's Twitter Profile Photo

interested in a potential future where the thing that determined success for an Agent Task was largely the search and context preparation step that precluded it (ie. any model of sufficient intelligence would have solved the task because of the alley-oop it was thrown) makes

Viv (@vtrivedy10) 's Twitter Profile Photo

when all those PhD hours spent on optimizing tf out of representations for retrieval might come in handy 🥹 everything’s a cycle, there’s a lot of nice tricks and ideas in Information Retrieval but it’s incredibly hard to beat compute: - bigger model for embedding - multivector

Viv (@vtrivedy10) 's Twitter Profile Photo

a fun behavior i'm trying to prompt into my agent builder harness...get the agent to interview me: - figure out roughly what i want, and then dive into the details (planning) - share thoughts and gets my takes/feedback (iterative edits) - help me define what a successful agent

Viv (@vtrivedy10) 's Twitter Profile Photo

embedding based retrieval is back in vogue….ready for all the rediscoveries :) my bet: someone renames query augmentation next week

Viv (@vtrivedy10) 's Twitter Profile Photo

fun release today for the sandbox pilled and not yet sandbox pilled fam :) the example I walk through in the video is something I do often. pull down a repo somewhere, let the agent do good work with a PR, review later and right now i’m especially excited about what sandboxes