gooby (@gooby_esq)'s Twitter Profile
gooby

@gooby_esq

The Real “Dan Price” / Attorney @ roedelparsons.com / Founder @ lexmagic.ai / CEO @ HTMX / danprice.eth / Retired "Musician" /

ID: 1534455684934270981

08-06-2022 08:42:13

3.3K Tweets

675 Followers

2.2K Following

Yulu Gan (@yule_gan)

Reinforcement Learning (RL) has long been the dominant method for fine-tuning, powering many state-of-the-art LLMs. Methods like PPO and GRPO explore in action space. But can we instead explore directly in parameter space? YES we can. We propose a scalable framework for

Benjamin Clavié (@bclavie)

hypothetically if someone wanted to hire a person whose main job would be to preach about a great search platform and build demos/video content, handle comms on here and discord (tl;dr: do devrel) while still being technical and talking about colbert, where should they look?

Astropulse (@realastropulse)

Here's a demo of what you can do with the tilesets from retro diffusion, every single asset in this was generated with zero touch ups. Now imagine plugging this into a procedural generation system, where players can ask for *any* landscape or theme.

gooby (@gooby_esq)

So, if you had a subject matter expert willing/available to do the grading work, couldn't you substitute out the scoring and LLM reflection step in GEPA with a human scorer and reflector and potentially get better results? Is this already a thing? Sounds similar to

Lakshya A Agrawal (@lakshyaaagrawal)

Optimizing a data analysis coding agent with GEPA, using execution-guided feedback on real-world workloads. Amazing tutorial by Arslan Shahid: medium.com/firebird-techn…

gooby (@gooby_esq)

Finally got a nice harness wired up over the weekend with DSPy to generate valid interleaved ABC musical notation, so I can go from LLM to MIDI file. Going to see if I can get GEPA or another optimizer to help generate better compositions. Fun problem because all the frontier LLMs