Rex "garbage in" Douglass Ph.D. (@rexdouglass) 's Twitter Profile
Rex "garbage in" Douglass Ph.D.

@rexdouglass

Applied Scientist in Industry. Previously UCSD. Princeton PhD.

Follow me for recreational methods trash talk.

ID: 1882066740

linkhttp://rexdouglass.com calendar_today19-09-2013 06:09:23

12,12K Tweet

5,5K Followers

5,5K Following

ky (@kylefbutts) 's Twitter Profile Photo

joseph francis You're going to make me dust off this research note I care about methods and I spend a lot of my free time trying to write open source code to make more robust methods easier to use. Randomization-based RD is just not an example of a better method IMO

<a href="/joefrancis505/">joseph francis</a> You're going to make me dust off this research note

I care about methods and I spend a lot of my free time trying to write open source code to make more robust methods easier to use. 

Randomization-based RD is just not an example of a better method IMO
Andrew Gelman et al. (@statmodeling) 's Twitter Profile Photo

Bayesian probability, like frequentist probability, is a model-based activity that is mathematically anchored by physical randomization at one end and calibration to a reference set at the other statmodeling.stat.columbia.edu/2025/10/20/bay…

Eddie Yang (@ey_985) 's Twitter Profile Photo

New paper: LLMs are increasingly used to label data in political science. But how reliable are these annotations, and what are the consequences for scientific findings? What are best practices? Some new findings from a large empirical evaluation. Paper: eddieyang.net/research/llm_a…

New paper: LLMs are increasingly used to label data in political science. But how reliable are these annotations, and what are the consequences for scientific findings? What are best practices? Some new findings from a large empirical evaluation.
Paper: eddieyang.net/research/llm_a…
Eddie Yang (@ey_985) 's Twitter Profile Photo

Finding 1: LLM annotations show pretty low intercoder reliability with the original annotations (coded by humans or supervised models). Perhaps surprisingly, reliability among the different LLMs themselves is only moderate (larger models better).

Finding 1: LLM annotations show pretty low intercoder reliability with the original annotations (coded by humans or supervised models). Perhaps surprisingly, reliability among the different LLMs themselves is only moderate (larger models better).
Eddie Yang (@ey_985) 's Twitter Profile Photo

Finding 2: This disagreement has significant downstream consequences. Re-running the original analyses with LLM annotations produced highly variable coefficient estimates, often altering the conclusions of the original studies.

Finding 2: This disagreement has significant downstream consequences. Re-running the original analyses with LLM annotations produced highly variable coefficient estimates, often altering the conclusions of the original studies.
Eddie Yang (@ey_985) 's Twitter Profile Photo

We also developed a new R package, localLLM (CRAN.R-project.org/package=localL…), that enables reproducible annotation using LLM directly in R. More functionalities to follow!

Feodora Teti (@feodorateti) 's Twitter Profile Photo

New data drop! 📊 A new extension of the Global Tariff Database is now available, covering the U.S. trade war (2018–2025) 🇺🇸🌏 It includes bilateral tariffs — those imposed by the U.S. and those faced by U.S. exporters — tracking all changes from Jan 2018 to mid-Aug 2025.

Sayash Kapoor (@sayashk) 's Twitter Profile Photo

I am on the faculty job market this year! I am seeking tenure-track faculty positions to drive my research agenda on rigorous AI evaluation for science and policy. I am applying broadly across disciplines, and would be grateful to hear of relevant positions. Materials: 🧵

I am on the faculty job market this year! I am seeking tenure-track faculty positions to drive my research agenda on rigorous AI evaluation for science and policy.

I am applying broadly across disciplines, and would be grateful to hear of relevant positions. Materials: 🧵
Hynek Kydlíček (@hkydlicek) 's Twitter Profile Photo

We’re releasing the full FinePdfs source code — plus new datasets and models! 🚀 📚 Datasets: • OCR-Annotations — 1.6k PDFs labeled for OCR need • Gemma-LID-Annotation — 20k samples per language (annotated with Gemma3-27B) 🤖 Models: • XGB-OCR — OCR classifier for PDFs

We’re releasing the full FinePdfs source code — plus new datasets and models! 🚀

📚 Datasets:
• OCR-Annotations — 1.6k PDFs labeled for OCR need
• Gemma-LID-Annotation — 20k samples per language (annotated with Gemma3-27B)
🤖 Models:
• XGB-OCR — OCR classifier for PDFs
I4R (@i4replication) 's Twitter Profile Photo

We are hosting virtual Replication Games on Friday November 13th, 2025 with UK Reproducibility Network @ukrepro.bsky.social. This is our 2nd year collaborating with UKRN. Psych, public health, pol sci and econ studies will be reproduced! Register here: surveymonkey.ca/r/I4R_Replicat…

We are hosting virtual Replication Games on Friday November 13th, 2025 with <a href="/ukrepro/">UK Reproducibility Network @ukrepro.bsky.social</a>. This is our 2nd year collaborating with UKRN. Psych, public health, pol sci and econ studies will be reproduced!

Register here: surveymonkey.ca/r/I4R_Replicat…