Grad (@grad62304977)'s Twitter Profile
Grad

@grad62304977

ID: 1313209072460627976

Joined: 05-10-2020 20:07:03

2.2K Tweets

3.3K Followers

1.1K Following

wh (@nrehiew_)

Let's talk about the GLM 4.5 models.

The latest frontier open weights model out of China (and possibly the best at the moment?) with quite a bit of details in the paper.
Feng Yao (@fengyao1909)

⚡FP8 makes RL faster, but at the cost of performance.

We present FlashRL, the first open-source & working RL recipe that applies INT8/FP8 for rollout without losing performance compared to BF16!

📝 Blog:
Mika Senghaas (@mikasenghaas)

moving from vllm v0 to v1 made our async rl training crash! read how we fixed it

we recently migrated from v0 to v1 as part of a larger refactor of prime-rl to make it easier-to-use, more performant and naturally async. we confirmed correct training dynamics on many
Prime Intellect (@primeintellect)

Introducing the Environments Hub

RL environments are the key bottleneck to the next wave of AI progress, but big labs are locking them down

We built a community platform for crowdsourcing open environments, so anyone can contribute to open-source AGI