Noetic (@noetic_labs)'s Twitter Profile
Noetic

@noetic_labs

The experiential learning company

ID: 1925778757492625408

Website: https://www.noeticlabs.co · Joined: 23-05-2025 05:00:28

17 Tweets

99 Followers

0 Following

Lienid (@0xlienid):

We built a way for models to learn from arbitrary experience, moving past naive SFT and scalar rewards.

The results:
- +17% on a holdout HumanEval set from just seeing printouts
- +33% on the GSM8K training set from natural language feedback

No labels. No reward. Just experience.
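The thread does not publish a training recipe, so the following is only a sketch of what a loop that learns from printouts and natural-language feedback, with no labels or rewards, could look like. Every name in it (Experience, collect_experiences, experiential_update, model.generate, environment.run, model.finetune) is a hypothetical placeholder, not Noetic's API:

```python
# A minimal sketch of an experiential-learning loop, assuming "learning from
# arbitrary experience" means training on transcripts that embed the raw
# feedback (test printouts, natural-language critique) rather than on any
# scalar reward. All names are hypothetical placeholders; this is not
# Noetic's published method.

from dataclasses import dataclass

@dataclass
class Experience:
    prompt: str    # the task the model attempted
    attempt: str   # what the model produced
    feedback: str  # raw experience: stdout of tests, NL critique, etc.

def collect_experiences(model, tasks, environment):
    """Roll the model through tasks, recording unstructured feedback."""
    experiences = []
    for task in tasks:
        attempt = model.generate(task)             # hypothetical call
        feedback = environment.run(task, attempt)  # e.g. test printouts
        experiences.append(Experience(task, attempt, feedback))
    return experiences

def experiential_update(model, experiences):
    """Update the model from transcripts alone: no labels, no reward.

    One plausible reading of the thread: render each experience as a
    transcript and train with an ordinary LM loss, so the model learns
    what the feedback implies about its own attempts.
    """
    transcripts = [
        f"Task:\n{e.prompt}\n\nAttempt:\n{e.attempt}\n\n"
        f"Feedback:\n{e.feedback}\n"
        for e in experiences
    ]
    model.finetune(transcripts)  # hypothetical call
    return model
```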
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) (@teortaxestex):

Noetic is developing an «experiential learning framework», with the goal of moving beyond RLVR or VR-CLI and their dependence on gold-standard answers, making proper use of rich feedback in the general case, and ushering in Richard Sutton's Era of Experience.

Lienid (@0xlienid):

Update:

We've now matched GRPO performance. Again, no scalar rewards, no trajectory filtering for SFT.

Now +36% over the GSM8K baseline with Experiential Learning.
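For contrast, the GRPO baseline being matched does depend on scalar rewards: a group of completions is sampled per prompt, each is scored, and scores are normalized within the group. A minimal sketch of that group-relative advantage (the 0/1 grading in the example is a hypothetical illustration):

```python
import statistics

def grpo_advantages(rewards):
    """Group-relative advantage at the core of GRPO.

    For a group of completions sampled from one prompt, each scalar
    reward r_i is normalized against the group:

        A_i = (r_i - mean(r)) / std(r)

    This is exactly the scalar-reward machinery that the experiential
    setup above claims to do without.
    """
    mean = statistics.mean(rewards)
    std = statistics.pstdev(rewards) or 1.0  # avoid division by zero
    return [(r - mean) / std for r in rewards]

# Hypothetical example: four sampled solutions to one GSM8K problem,
# graded 0/1 on final-answer correctness.
print(grpo_advantages([1.0, 0.0, 0.0, 1.0]))  # -> [1.0, -1.0, -1.0, 1.0]
```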
Lienid (@0xlienid):

I very, very strongly agree. It is incredibly inefficient, and not at all how intelligent creatures learn. Intelligence scales with converting unstructured experience into knowledge and behaviors. noeticlabs.co/el

Noetic (@noetic_labs):

RL is not good for continual learning, because continual learning is not only about forgetting.

Continual learning in any meaningful way requires:
- Learning from outcomes
- In an unstructured output space and environment
- With minimal forgetting

RL is bad at the second.