Kirill Vishniakov (@kirill_vish)'s Twitter Profile
Kirill Vishniakov

@kirill_vish

ID: 1677840820387414016

Link: https://kirill-vish.github.io/ · Joined: 09-07-2023 00:43:30

43 Tweets

89 Followers

779 Following

Grigory Bartosh (@grigorybartosh)'s Twitter Profile Photo

🔥 Excited to share our new work on Neural Flow Diffusion Models — a general, end-to-end, simulation-free framework that works with arbitrary noising processes and even enables learning them! 📜: arxiv.org/abs/2404.12940 🧵 1/11
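
For context on what a "noising process" means here, below is a minimal background sketch of the standard fixed Gaussian forward process used by conventional diffusion models; NFDM's contribution is generalizing beyond this to arbitrary and even learned processes. The linear schedule `abar(t) = 1 - t` and the function name are illustrative assumptions, not details from the paper.

```python
# Background sketch only: the standard fixed Gaussian forward (noising) process
# used by conventional diffusion models. NFDM generalizes beyond this to
# arbitrary, even learned, noising processes; nothing here is the paper's method.
import torch

def forward_noise(x0: torch.Tensor, t: torch.Tensor) -> torch.Tensor:
    """Sample x_t ~ q(x_t | x_0) = N(sqrt(abar_t) * x_0, (1 - abar_t) * I)."""
    # Illustrative linear schedule abar(t) = 1 - t for t in [0, 1] (an assumption).
    abar = torch.clamp(1.0 - t, min=1e-5).view(-1, *([1] * (x0.dim() - 1)))
    eps = torch.randn_like(x0)
    return abar.sqrt() * x0 + (1.0 - abar).sqrt() * eps

x0 = torch.randn(4, 3, 32, 32)  # batch of clean samples
t = torch.rand(4)               # diffusion times in [0, 1]
xt = forward_noise(x0, t)       # noised samples
```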

François Chollet (@fchollet)'s Twitter Profile Photo

There's a big difference between solving a problem from first principles vs applying a solution template you previously memorized. It's like the difference between a senior software engineer and a script kiddie that can't code. A script kiddie that has a gigantic bank of scripts

MBZUAI (@mbzuai)'s Twitter Profile Photo

Are you going to #ICML2024? These top faculty and researchers from MBZUAI will be there presenting their latest pioneering work!

We are excited to announce that 25 papers from MBZUAI have been accepted for presentation at the 41st International Conference on Machine Learning
AK (@_akhaliq)'s Twitter Profile Photo

Med42-v2

A Suite of Clinical LLMs

discuss: huggingface.co/papers/2408.06…

Med42-v2 introduces a suite of clinical large language models (LLMs) designed to address the limitations of generic models in healthcare settings. These models are built on Llama3 architecture and fine-tuned
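
A minimal usage sketch, under the assumption that the released checkpoints sit on the Hugging Face Hub and load through the standard transformers API; the model id below is a guess for illustration, so check the paper page for the actual names.

```python
# Hedged sketch: loading a Med42-v2-style clinical LLM with transformers.
# The model id is an illustrative assumption, not confirmed by this announcement.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "m42-health/Llama3-Med42-8B"  # assumed checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

prompt = "Summarize first-line management options for type 2 diabetes."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```
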
Biology+AI Daily (@biologyaidaily)'s Twitter Profile Photo

Genomic Foundationless Models: Pretraining Does Not Promise Performance

1. This study challenges the paradigm of pretraining in Genomic Foundation Models (GFMs), revealing that randomly initialized models often match or surpass pretrained ones in fine-tuning tasks.

2. Despite
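
The comparison the thread describes boils down to fine-tuning the same architecture twice: once from the published pretrained weights and once from random initialization. A minimal sketch with the transformers Auto classes follows; the checkpoint name and classification head are hypothetical placeholders, not details from the paper.

```python
# Hedged sketch: pretrained vs. randomly initialized fine-tuning comparison.
# Checkpoint name and task head are illustrative placeholders, not from the paper.
from transformers import AutoConfig, AutoModelForSequenceClassification

checkpoint = "example-org/gfm-base"  # hypothetical genomic foundation model id

# Pretrained baseline: load published weights, then fine-tune on the downstream task.
pretrained = AutoModelForSequenceClassification.from_pretrained(checkpoint, num_labels=2)

# "Foundationless" baseline: identical architecture, randomly initialized weights.
config = AutoConfig.from_pretrained(checkpoint, num_labels=2)
random_init = AutoModelForSequenceClassification.from_config(config)

# Both models then go through the same fine-tuning loop (e.g. transformers.Trainer)
# on the same genomic benchmark, and their downstream scores are compared.
```
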
Nadav Brandes (@brandesnadav)'s Twitter Profile Photo

New preprint claims that most existing DNA language models perform just as well with random weights, suggesting that pretraining does nothing (Mistral & DNABERT-2 look like exceptions).

We need better DNA language models.
Zhuang Liu (@liuzhuang1234)'s Twitter Profile Photo

How different are the outputs of various LLMs, and in what ways do they differ?

Turns out, very very different, up to the point that a text encoding classifier could tell the source LLM with 97% accuracy.

This is classifying text generated by LLMs, between ChatGPT, Claude,
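
A toy sketch of the kind of classifier the thread describes: given texts generated by different LLMs, predict which model wrote each one. The paper's actual classifier and data are not reproduced here; this uses a TF-IDF plus logistic-regression baseline over placeholder examples.

```python
# Hedged sketch: a simple "which LLM wrote this?" classifier.
# Placeholder data; the real setup would use many generations per model on shared prompts.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

texts = [
    "Sure! Here is a concise summary of the main points.",
    "Certainly, let us walk through the reasoning step by step.",
    "Here's a quick rundown of what the code does.",
    "Of course. I'd be happy to elaborate on that topic.",
]
labels = ["chatgpt", "claude", "chatgpt", "claude"]  # source LLM of each text

clf = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LogisticRegression(max_iter=1000))
clf.fit(texts, labels)

# Predict the source LLM of a new generation (held-out text in the real setup).
print(clf.predict(["Sure! Below is a brief overview of the approach."]))
```
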
karthik viswanathan (@nickinack1)'s Twitter Profile Photo

Introducing BioFM, a biologically-informed GFM that:
✅ Outperforms all small GFMs (265M params, trained on just 50 genomes)
✅ Beats Evo2-7B (variant embeddings), Enformer (expression), SpliceTransformer (sQTL). 
No brute-force scaling, just smarter tokenization.