Guillermo Barbadillo (@guille_bar) 's Twitter Profile
Guillermo Barbadillo

@guille_bar

In a quest to understand intelligence

Talking about AI in Spanish on TERTULia: ironbar.github.io/tertulia_intel…

ID: 961945656544948225

Link: https://www.linkedin.com/in/guillermobarbadillo/
Joined: 09-02-2018 12:51:31

452 Tweets

1.1K Followers

237 Following

Kyle Corbitt (@corbtt) 's Twitter Profile Photo

Big news: we've figured out how to make a *universal* reward function that lets you apply RL to any agent with:
 - no labeled data
 - no hand-crafted reward functions
 - no human feedback!
A 🧵 on RULER
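A minimal sketch of the general idea as described in the thread: an LLM judge scores a group of rollouts for the same task relative to each other, and those scores are used as the RL reward. The `call_llm` helper and the prompt below are hypothetical placeholders, not the actual RULER API.

```python
import json

def call_llm(prompt: str) -> str:
    """Hypothetical stand-in for any chat-completion call."""
    raise NotImplementedError

def score_rollouts(task: str, rollouts: list[str]) -> list[float]:
    """Ask an LLM judge to score a group of agent rollouts for the same task.

    No labels, hand-crafted reward, or human feedback: the judge only compares
    the rollouts against each other and returns a score in [0, 1] for each.
    """
    prompt = (
        f"Task: {task}\n\n"
        + "\n\n".join(f"Rollout {i}:\n{r}" for i, r in enumerate(rollouts))
        + "\n\nScore each rollout from 0 to 1 by how well it completes the task. "
        "Answer with a JSON list of numbers, one per rollout."
    )
    return [float(s) for s in json.loads(call_llm(prompt))]

# The scores can then be used as group-relative rewards, e.g. in a GRPO-style update.
```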

Guillermo Barbadillo (@guille_bar) 's Twitter Profile Photo

Nice paper that in my opinion goes in the right direction to solve ARC. It generates python code to tackle the ARC tasks and combines search and learning in a virtuous cycle. I have summarized the results in the following plot.

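A rough sketch of that search-and-learning cycle, under my own assumptions about how the paper works (the model methods `sample_program` and `finetune` are hypothetical): sample candidate Python programs, keep the ones that reproduce every demonstration pair, and use the verified programs as fine-tuning data for the next round.

```python
def compile_program(code: str):
    """Exec a candidate program and return its transform() function, or None if invalid."""
    namespace = {}
    try:
        exec(code, namespace)
        return namespace.get("transform")
    except Exception:
        return None

def solve_task(task: dict, model, n_candidates: int = 128) -> list[str]:
    """Search half: sample candidate programs and keep those that fit every demo pair."""
    demos = task["train"]  # list of {"input": grid, "output": grid} pairs
    solutions = []
    for _ in range(n_candidates):
        code = model.sample_program(demos)   # hypothetical: LLM writes a transform()
        transform = compile_program(code)
        try:
            if transform and all(transform(d["input"]) == d["output"] for d in demos):
                solutions.append(code)
        except Exception:
            pass
    return solutions

def search_and_learn(tasks: list[dict], model, rounds: int = 3):
    """Learning half: verified programs become fine-tuning data for the next round."""
    for _ in range(rounds):
        verified = {t["id"]: sols for t in tasks if (sols := solve_task(t, model))}
        model.finetune(verified)             # hypothetical fine-tuning call
    return model
```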
Guillermo Barbadillo (@guille_bar) 's Twitter Profile Photo

As far as I understand, this is another case of test-time training, since they use example pairs from both the training and evaluation sets. I'm not sure whether the hierarchical architecture is necessary, or whether we could get similar results with other models.
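For context, a minimal sketch of test-time training as described here (illustrative only; `train_step` and `predict` are hypothetical model methods): before predicting the test output, a copy of the model is briefly fine-tuned on that task's own example pairs.

```python
import copy

def predict_with_ttt(base_model, task, steps: int = 50):
    """Test-time training: adapt a copy of the model on this task's demo pairs,
    then predict the test outputs with the adapted copy."""
    model = copy.deepcopy(base_model)       # keep the base weights untouched
    demos = task["train"]                   # {"input": grid, "output": grid} pairs
    for _ in range(steps):
        for d in demos:
            model.train_step(d["input"], d["output"])   # hypothetical gradient step
    return [model.predict(t["input"]) for t in task["test"]]
```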

Guillermo Barbadillo (@guille_bar) 's Twitter Profile Photo

Giotto.ai is the first team to score above 20% on the ARC25 challenge. Congratulations! We're still far from the 85% goal, but there's time left since the competition ends in November.

Cristóbal Valenzuela (@c_valenzuelab) 's Twitter Profile Photo

Really nice demo of what Runway Aleph can do for complex changes in environments while adding accurate dynamic elements like snow on the shoulders or splashing water as the characters move.

Peter Gostev (@petergostev) 's Twitter Profile Photo

I quite like how well ARC Prize shows the distribution of GPT-5 variant capabilities, from 1.5% (GPT-5 Nano, Minimal) to 65.7% (GPT-5 High).

Some other things that seem interesting:
 - 'Thinking' really matters for GPT-5: 6% for 'Minimal' to 65.7% for 'High'. The difference is
Guillermo Barbadillo (@guille_bar) 's Twitter Profile Photo

Is a masked diffusion model one of the secrets behind the best score on ARC-AGI-2 so far? Or are they just trolling us :)

Diffusion models might have an edge over autoregressive ones since they can capture a more global view of the grids.
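To illustrate the "global view" point: a masked diffusion model predicts every masked cell in parallel conditioned on the whole grid and re-masks the least confident ones for the next step, while an autoregressive model only conditions on the cells generated so far. A toy sketch, where `predict` is a hypothetical model call returning per-cell values and confidences:

```python
import numpy as np

def masked_diffusion_decode(grid: np.ndarray, predict, steps: int = 8):
    """Toy iterative demasking: `grid` holds colour ids with -1 for masked cells.
    `predict(grid)` returns (values, confidences) for every cell, always
    conditioned on the whole grid rather than a left-to-right prefix."""
    grid = grid.copy()
    for step in range(steps, 0, -1):
        masked = grid == -1
        if not masked.any():
            break
        values, conf = predict(grid)               # global conditioning
        conf = np.where(masked, conf, -np.inf)     # only consider masked cells
        k = max(1, int(masked.sum() / step))       # unmask a fraction each step
        flat = np.argsort(conf, axis=None)[-k:]    # most confident masked cells
        rows, cols = np.unravel_index(flat, grid.shape)
        grid[rows, cols] = values[rows, cols]
    return grid
```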
Luma AI (@lumalabsai) 's Twitter Profile Photo

This is Ray3. The world’s first reasoning video model, and the first to generate studio-grade HDR. Now with an all-new Draft Mode for rapid iteration in creative workflows, and state-of-the-art physics and consistency. Available now for free in Dream Machine.