smile (@smilex_p) Twitter Tweets • TwiCopy

Aniket Vashishtha

a year ago

Can we teach Transformers Causal Reasoning? We propose Axiomatic Framework, a new paradigm for training LMs. Our 67M-param model, trained from scratch on simple causal chains, outperforms billion-scale LLMs and rivals GPT-4 in inferring cause-effect relations over complex graphs

Likes: 699 · Replies: 15 · Reposts: 133

A meme page to check every time MatLab crashes

@memecrashes

8 months ago

Likes: 2.2K · Replies: 15 · Reposts: 253

ₕₐₘₚₜₒₙ

@hamptonism

8 months ago

statistical inference:

Likes: 684 · Replies: 5 · Reposts: 68

Math Cafe

@riazi_cafe_en

8 months ago

UW–Madison's "Mathematical Techniques for Algorithm Analysis" Lecture Notes: pages.cs.wisc.edu/~cs809-1/lectu… Course Material: pages.cs.wisc.edu/~cs809-1/

Likes: 508 · Replies: 2 · Reposts: 101

Math Cafe

@riazi_cafe_en

8 months ago

Terence Tao's "Linear Algebra" lecture notes PDF: terrytao.wordpress.com/wp-content/upl…

Likes: 805 · Replies: 5 · Reposts: 139

Kirk Borne

@kirkdborne

8 months ago

Graph Data Modeling in #Python — Practical guide to curating, analyzing, & modeling data with graphs: packtpub.com/en-us/product/… from Packt Data Science & Machine Learning #ad — #DataScience #AI #DataScientist — 𝒦𝑒𝓎 𝐹𝑒𝒶𝓉𝓊𝓇𝑒𝓈: 🔵Transform relational data models into graph data model while

Likes: 103 · Replies: 3 · Reposts: 28

首都大の猫『数学のための英語教本』

@shutodainohito

8 months ago

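Copying this book out by hand is, I think, an excellent way to practice writing English. It is a book of only 50 pages, published by the European Mathematical Society. I recommend searching for the author's name, Jerzy Trzeciak.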

Likes: 1.1K · Replies: 4 · Reposts: 190

Swapna Kumar Panda

@swapnakpanda

8 months ago

FREE FREE FREE 10 Python Books, Absolutely FREE

Likes: 1.1K · Replies: 26 · Reposts: 267

結城浩 / Hiroshi Yuki

@hyuki

8 months ago

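"Math Girls: The Riemann Hypothesis" (『数学ガール／リーマン予想』) is finally here. The mathematical coming-of-age story reaches its grand conclusion! Pre-orders are now open! (Scheduled for publication in August 2025) ◆ Hiroshi Yuki, "Math Girls: The Riemann Hypothesis" amzn.to/4lMB9hq #数学ガール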

Likes: 1.1K · Replies: 0 · Reposts: 665

結城浩 / Hiroshi Yuki

@hyuki

8 months ago

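Thank you all for your support! 😭 Publication is still some way off, but pre-orders give the book a "very big tailwind", so I would be grateful for them 🙇 Reposts, likes, and quote posts are of course appreciated too!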

Likes: 247 · Replies: 0 · Reposts: 66

Simone Scardapane

@s_scardapane

8 months ago

*Deep Learning is Not So Mysterious or Different* by Andrew Gordon Wilson. Fantastic paper showing that many interesting phenomena (e.g., double descent) can be understood in the frameworks of PAC-Bayes and "soft inductive biases". Great visuals! 😍 arxiv.org/abs/2503.02113

Likes: 293 · Replies: 4 · Reposts: 52

Oscar Broekema 🇳🇱

@obr2021

8 months ago

The Microarchitecture of Pipelined and Superscalar Computers, by Amos R. Omondi

Likes: 159 · Replies: 0 · Reposts: 23

William Gilpin

@wgilpin0

8 months ago

At #ICLR2025, check out our poster today on forecasting chaos with foundation models. Yuanzhao Zhang 章元肇. Poster #43. Thu 24 Apr, 3 - 5:30 pm in Hall 3 + Hall 2B

Likes: 624 · Replies: 10 · Reposts: 88

Math Cafe

@riazi_cafe_en

8 months ago

"Modern Discrete Probability, An Essential Toolkit" by Sebastien Roch PDF: people.math.wisc.edu/~roch/mdp/roch…

Likes: 938 · Replies: 1 · Reposts: 169

Swapna Kumar Panda

@swapnakpanda

8 months ago

11 FREE Books from MIT for Absolute Beginners - Machine Learning (ML) - Deep Learning (DL) - Reinforcement Learning (RL) - Artificial Intelligence (AI)

Likes: 1.1K · Replies: 22 · Reposts: 295

Andrew Lampinen

@andrewlampinen

8 months ago

How do language models generalize from information they learn in-context vs. via finetuning? We show that in-context learning can generalize more flexibly, illustrating key differences in the inductive biases of these modes of learning — and ways to improve finetuning. Thread: 1/

Likes: 751 · Replies: 7 · Reposts: 146

Calc Consulting

@calccon

7 months ago

You can also solve the classic Double Descent problem (Vallet et al. 1989) using just Random Matrix Theory. Here's an outline of a sketch of the solution. This does not require replicas and instead uses just the properties of the Marchenko-Pastur distribution and its

Likes: 30 · Replies: 1 · Reposts: 6

ₕₐₘₚₜₒₙ

@hamptonism

7 months ago

Crypto & AI:

Likes: 955 · Replies: 21 · Reposts: 97

Timothy Nguyen

@iamtimnguyen

7 months ago

Statistical physics for LLMs. Happy with that description :-) The original tweet thread for my paper: x.com/IAmTimNguyen/s… And my Machine Learning Street Talk interview: youtube.com/watch?v=W485bz…

Likes: 189 · Replies: 13 · Reposts: 20

roadmap.sh

@roadmapsh

7 months ago

Tired of deployment headaches? 😩 Our FREE Cloudflare learning roadmap provides a clear and structured path to understanding and utilizing Cloudflare for your web applications. roadmap.sh/cloudflare

Likes: 6 · Replies: 1 · Reposts: 4