yishu
@yishusee
what the fuck is this place and why am I back
ID: 27190305
https://yishus.dev 28-03-2009 06:15:21
3,3K Tweet
334 Followers
434 Following
As part of Prime Intellect's RL residency program, I've been exploring how to do multi-agent RL using their current stack (from verifiers + prime-rl to lab experiments with hosted training /evals) and thinking about how it could be extended to support these abstractions natively.