Lapsa-Malawski (@munhitsu) 's Twitter Profile
Lapsa-Malawski

@munhitsu

Tweets on Technology and Art. Views my own @[email protected]

ID: 8773392

linkhttps://github.com/munhitsu calendar_today09-09-2007 23:03:19

4,4K Tweet

664 Followers

1,1K Following

martin_casado (@martin_casado) 's Twitter Profile Photo

At this point I feel like we understand pretty well what's going on with LLMs: - Outputs are roughly equivalent to kernel smoothing over positional embeddings (arxiv.org/pdf/1908.11775…) - The learned computation model is *probably* bounded by RASP-L (arxiv.org/pdf/2310.16028…) -

Yann LeCun (@ylecun) 's Twitter Profile Photo

How to be as "smart" as Auto-Regressive LLMs: - memorize lots of problem statements together with recipes on how to solve them. - to solve a new problem, retrieve the recipe whose problem statement superficially matches the new problem. - apply the recipe blindly and declare

Financial Times (@ft) 's Twitter Profile Photo

Yann LeCun says he is working to develop an entirely new generation of AI systems that he hopes will power machines with human-level intelligence. It could take up to 10 years to achieve, he tells the Financial Times in an interview on.ft.com/3KbShLF

Yann LeCun says he is working to develop an entirely new generation of AI systems that he hopes will power machines with human-level intelligence. It could take up to 10 years to achieve, he tells the <a href="/FT/">Financial Times</a> in an interview on.ft.com/3KbShLF
Lapsa-Malawski (@munhitsu) 's Twitter Profile Photo

I'm playing with G-Eval to test the LLM outputs using LLM. It roughly works until it doesn't. How am I supposed to reason with test result: "the actual output's prompt is in Polish which mismatches the language-prompt specified as Polish, aligning correctly" #llm #gpt #deepeval

Ruoming Pang (@ruomingpang) 's Twitter Profile Photo

As Apple Intelligence is rolling out to our beta users today, we are proud to present a technical report on our Foundation Language Models that power these features on devices and cloud: machinelearning.apple.com/research/apple…. 🧵

Gergely Orosz (@gergelyorosz) 's Twitter Profile Photo

Explains why I found myself forced to not just block Musk, but also mute the terms “Elon”, “Musk”, “Elonmusk” to get a Twitter experience where I wouldn’t have every second tweet of his on my timeline. Case study worthy on how you degrade a social network long-term

Lapsa-Malawski (@munhitsu) 's Twitter Profile Photo

Nice, I might be eventually able to use letter “m” in passwords for some, ancient services. But then again if they are already ancient, will their CISO actually care about the new NIST guidance? mastodon.social/@LukaszOlejnik…

Lapsa-Malawski (@munhitsu) 's Twitter Profile Photo

I loved gevent. It was bringing all the benefits of event loop I needed and leaving me with a straightforward API on monkey patched threads. I never could understand why it was treated as an ugly child