
Jeroen Van Goey
@biogeek
Staff Research Engineer in BioAI at @InstaDeepAI (part of @BioNTech_Group)
ML for de novo peptide sequencing.
bsky.app/profile/jeroen…
ID: 10564
http://jeroen.vangoey.be 26-10-2006 10:16:43
827 Tweet
875 Followers
5,5K Following

Join us on the journey toward improving de novo protein sequencing. 👉Explore their progress in our latest blog post: bit.ly/4lircYB ⌨️ Explore our codebase here: bit.ly/43ExRGl 📝 See the paper on Nature Machine Intelligence: bit.ly/3XBeJFh


As seen in Nature Machine Intelligence, our multimodal conversational agent, ChatNT, fuses biological sequences and natural language into a shared vocabulary. Hear from research scientist and lead author Bernardo Almeida on our first proof-of-concept for this generalist AI.

📖Learn from our blog: bit.ly/4dRlo5c 👉 Read more in Nature Machine Intelligence: go.nature.com/43ZKvPP




Introducing ChatNT: The first biological sequence-language model, published in Nature Machine Intelligence ! 🧬🎉 Inspired by vision-language models, ChatNT's architecture combines biological and language foundation models, using Nucleotide Transformer (NT) and Llama to answer questions

A multimodal conversational agent for DNA, RNA and protein tasks Nature Machine Intelligence 1.ChatNT is a new conversational AI system that understands DNA, RNA and protein sequences, enabling biologists to solve complex genomics tasks simply by asking questions in English. 2.Unlike




ChatNT has been featured on the front cover of Nature Machine Intelligence! This exciting opportunity allows us to share our proof-of-concept for a generalist genomics AI with a wider community of scientists and researchers around the world. 🎊


We made the cover of Nature Machine Intelligence! 🌱 ChatNT is a Conversational Agent analysing genomics sequences to answer key biological questions, assisting scientists in their work 👩🔬 Kudos to Bernardo Almeida, Thomas Pierrot & the InstaDeep research team for this huge milestone!✨


📆 Six months, four publications, one cover. We’re halfway through the year and InstaDeep research is powering ahead with multiple boundary-pushing papers published in the Nature Portfolio! Catch up on the highlights below 🔽

🧬 In March, we published our research for InstaNovo and InstaNovo+ in Nature Machine Intelligence, our diffusion-powered ‘de novo’ peptide sequencing models built to uncover the secrets of the human proteome. bit.ly/3XBeJFh




Thrilled to open-source the dataset behind our Nature Machine Intelligence cover paper! 🧬 The ChatNT training data is now open-source on Hugging Face. It's the first large-scale dataset for training conversational agents on biological sequences. A thread on what's inside 👇
