Kevin Patrick Murphy (@sirbayes) 's Twitter Profile
Kevin Patrick Murphy

@sirbayes

Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.

ID: 788533935886077952

linkhttps://www.cs.ubc.ca/~murphyk/ calendar_today19-10-2016 00:15:15

925 Tweet

55,55K Followers

472 Following

Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

I just got my copy. It’s a good book and fills a unique and valuable niche, namely a code-first / pragmatic approach to causal inference using ML tools like DoWhy and Pyro.

Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

Had a great time at khipu.ai in Santiago, Chile. My talk on Sequential decision making using online variational bayes is here, in case you are interested. (Lots of other cool talks online too) youtube.com/live/s9VJv0GQE…

Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

Gemma 3 is best in class for a VLM that runs on 1 GPU. Should make RL fine tuning feasible. Also Academic researchers can apply for Google Cloud credits (worth $10,000 per award) to accelerate their Gemma 3-based research.

Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

Our BONE paper has been accepted to TMLR. It derives methods for state space inference in non stationary environments, where changes to the DHO can be gradual (eg drift) or sudden (eg change point)

Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

I'm happy to announce that v2 of my RL tutorial is now online. I added a new chapter on multi-agent RL, and improved the sections on 'RL as inference' and 'RL+LLMs' (although latter is still WIP), fixed some typos, etc. arxiv.org/abs/2412.05265…

Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

I am pleased to announce that I have updated the online versions of my 2 textbooks (see probml.github.io/pml-book/): I fixed all issues listed on github, added some new references (esp on LLMs), and made a few other small tweaks.

Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

I had a great time diving at Wakatobi Dive Resort in Indonesia (although unfortunately I got an ear infection and had to skip the last couple of days). Tomorrow off to Singapore for #ICLR2025 (DM me if you want to meet).

I had a great time diving at <a href="/Wakatobi/">Wakatobi Dive Resort</a> in Indonesia (although unfortunately I got an ear infection and had to skip the last couple of days). Tomorrow off to Singapore for #ICLR2025 (DM me if you want to meet).
Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

I dont know why singapore air is rated number 1 in world. their business class beds are much less comfortable than united/ polaris, because they are narrow and not straight. Food is good but not amazing. IMHO Emirates is best, then KLM & United (but grateful not economy class :)

Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

This was a great talk (*) on using (proper multi-turn) RL for training LLM agents to reason and use tools. Very bullish on this "Generative Agents" direction! (* Audio was very bad; fortunately brains are good at source separation :)

Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

I am pleased to announce a new version of my RL tutorial. Major update to the LLM chapter (eg DPO, GRPO, thinking), minor updates to the MARL and MBRL chapters and various sections (eg offline RL, DPG, etc). Enjoy! arxiv.org/abs/2412.05265

I am pleased to announce a new version of my RL tutorial. Major update to the LLM chapter (eg DPO, GRPO, thinking), minor updates to the MARL and MBRL chapters and various sections (eg offline RL, DPG, etc). Enjoy!
arxiv.org/abs/2412.05265
Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

Does anyone know if ChatGPT keeps some kind of context or user profile across sessions? If i ask it to derive mathy things related to online Bayes, it often asks me if I want to see a low-rank version of it, or a Thompson sampling version. How does it know I care? Spooky.

Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

I think it's quite misleading for the big labs to be promoting how well their VLMs work on pokemon, given how much (game-specific) manual annotation is required behind the scenes. Solving general tasks from pixel input is much harder than coding ("Moravec's revenge").

Kevin Patrick Murphy (@sirbayes) 's Twitter Profile Photo

This is a very thought provoking interview with my former student. I do think AI personas (esp multimodal and real time) may be addictive and seem better than humans - but so is heroin (albeit heroin has less useful applications than AI).