Martin Bel (@__mbel__) Twitter Tweets • TwiCopy

Martin Bel

@__mbel__

a year ago

👏👏👏

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Martin Bel

@__mbel__

a year ago

The alpaca prompt for data generation. Probably their most original idea. If you run it on a GPT, you will understand how they generated the data for finetuning. github.com/tatsu-lab/stan…

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Martin Bel

@__mbel__

a year ago

😅

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Martin Bel

@__mbel__

a year ago

``` import re re.sub(r'data scientist', 'ai engineer', cv, flags=re.IGNORECASE)

thumb_up_off_alt2

chat_bubble_outline1

repeat0

shareShare

Martin Bel

@__mbel__

a year ago

Great project Jeremy! Looking forward to giving it a try

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

spent a few hours with Claude 3.5 sonnet doing some mathematics research. you are underestimating the impact AI will have on research. yes, you. yes, I'm serious. no, it does not replace mathematicians. but the augmentation is about to take off.

thumb_up_off_alt1,1K

chat_bubble_outline53

repeat173

shareShare

Ivan Werning

@ivanwerning

a year ago

I tried Google's NotebookLM "radio conversation generator" on my paper about taxing robots (the irony!). I was blown away by the results. Are robots and AI coming for your jobs journalists? 😱

thumb_up_off_alt278

chat_bubble_outline15

repeat40

shareShare

François Chollet

@fchollet

a year ago

A common misconception about Transformers is to believe that they're a sequence-processing architecture. They're not. They're a *set-processing* architecture. Transformers are 100% order-agnostic (which was the big innovation compared to RNNs, back in late 2016 -- you compute

thumb_up_off_alt2,2K

chat_bubble_outline64

repeat259

shareShare

Martin Bel

@__mbel__

a year ago

Surprisingly such a simple approach works fairly well

thumb_up_off_alt8

chat_bubble_outline0

repeat3

shareShare

naklecha

@naklecha

a year ago

today, i'm excited to release a reinforcement learning guide that carefully explains the intuition and implementation details behind every single fundamental algorithm in the field. enjoy :) naklecha.com/reinforcement-…

thumb_up_off_alt2,2K

chat_bubble_outline84

repeat316

shareShare

Germán Milano

@german_milano

a year ago

1/8 Everyone's doing great work building agents for bug fixing. But I'm curious—does anyone have insights on which types of issues your agents are good at, and which ones pose the biggest challenges? 🧵 Cc: Martin Bel José Lamas Rodolfo Anibal Gaston Milano Millan Marcelo Pérez

thumb_up_off_alt11

chat_bubble_outline1

repeat10

shareShare

Paul Gauthier

@paulgauthier

10 months ago

DeepSeek R1 gets 57% on the aider polyglot benchmark, ranks 2nd behind o1: 62% o1 (high) 57% DeepSeek R1 52% Sonnet 48% DeepSeek Chat V3 Full leaderboard: aider.chat/docs/leaderboa…

thumb_up_off_alt684

chat_bubble_outline25

repeat85

shareShare

Sebastian Raschka

@rasbt

10 months ago

But before I get to the reasoning model space... if you are looking to do some focused offline reading this weekend, I just re-compiled my take on the "noteworthy AI research papers of 2024" into one PDF-export-friendly 47-page mega-post with TOC and all: sebastianraschka.com/blog/2025/llm-…

thumb_up_off_alt794

chat_bubble_outline21

repeat165

shareShare

Martin Bel

@__mbel__

10 months ago

China take my data. LoL

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

José Lamas

@jlamasrios

8 months ago

Congrats to the Data Science team behind Code Fixer at Globant!! After ranking #1 on SWE-Bench Lite in Nov ’24, their major upgrades in multimodal preprocessing, prompt design, and large codebase navigation (especially for complex frontend/backend stacks) made it #1 on SWE-Bench

thumb_up_off_alt8

chat_bubble_outline0

repeat3

shareShare

Martin Bel

@__mbel__

8 months ago

We reached #1 at the SWE-bench-Multimodal benchmark with the Globant Code Fixer Agent. Kudos to team that made it happen! Germán Milano, Martin Bel, Marcelo Pérez, Rodolfo Anibal Gaston Milano Millan José Lamas

We reached #1 at the SWE-bench-Multimodal benchmark with the <a href="/Globant/">Globant</a> Code Fixer Agent.

Kudos to team that made it happen!
<a href="/german_milano/">Germán Milano</a>, <a href="/__mbel__/">Martin Bel</a>, <a href="/mperezjodal/">Marcelo Pérez</a>, <a href="/RodolfoAni72304/">Rodolfo Anibal</a>
<a href="/GMilano/">Gaston Milano Millan</a> <a href="/jlamasrios/">José Lamas</a>

thumb_up_off_alt12

chat_bubble_outline0

repeat6

shareShare

Jeremy Howard

@jeremyphoward

6 months ago

sam mcallister Personally I encourage my team to use other folks' tools too so we have a realistic view of where we stand in the market. I've noticed that most folks at most big labs seem quite unfamiliar with the competition, OTOH. You gotta be an expert user to understand capabilities.

thumb_up_off_alt25

chat_bubble_outline2

repeat2

shareShare

Martin Bel

Martin Bel

Martin Bel

Martin Bel

Martin Bel

Martin Bel

prof-g

Ivan Werning

François Chollet

Martin Bel

naklecha

Germán Milano

Paul Gauthier

Sebastian Raschka

Martin Bel

José Lamas

Martin Bel

Jeremy Howard