Alex Gurung (@alexaag1234) 's Twitter Profile
Alex Gurung

@alexaag1234

PhD student at @EdinburghNLP | undergrad+masters @gtcomputing

ID: 2729564050

calendar_today31-07-2014 01:58:14

21 Tweet

410 Takipçi

411 Takip Edilen

Mateusz Klimaszewski (@m_klimasz) 's Twitter Profile Photo

The next EuroLLM model is out 🎉 We support all the 🇪🇺 EU languages (+ more), but now in a 9B size (base and instruct). We are not done yet; stay tuned for more 👀

Edoardo Ponti (@pontiedoardo) 's Twitter Profile Photo

Is sparsity the key to conditional computation, interpretability, long context/generation, and more in foundation models? Find out at my #NeurIPS2024 tutorial on Dynamic Sparsity in Machine Learning with Andre Martins! Followed by a panel with Sara Hooker and Alessandro Sordoni 🧵

Yasmine (@cyousakura) 's Twitter Profile Photo

🎉 Introducing Open Reasoner Zero 🚀 Performance: Matches DeepSeek R1-Zero (32B) in just 1/30 steps! 📚 Full training strategies & technical paper 💻 100% open-source: Code + Data + Model ⚖️ MIT licensed - Use it your way! 🌊 Let the Reasoner-Zero tide rise! 🚢 1/n

🎉 Introducing Open Reasoner Zero

🚀 Performance: Matches DeepSeek R1-Zero (32B) in just 1/30 steps!

📚 Full training strategies & technical paper

💻 100% open-source: Code + Data + Model

⚖️ MIT licensed - Use it your way!

🌊 Let the Reasoner-Zero tide rise!

🚢 1/n
Wenhao Zhu (@wenhao_nlp) 's Twitter Profile Photo

🎉 Excited to share “Generalizing from Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning” 📄 (arxiv.org/pdf/2502.15592) We propose "context synthesis": instead of generating instructions from long texts, we synthesize contexts for instructions—drawing

🎉 Excited to share “Generalizing from Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning” 📄 (arxiv.org/pdf/2502.15592)

We propose "context synthesis": instead of generating instructions from long texts, we synthesize contexts for instructions—drawing
Irina Saparina (@irisaparina) 's Twitter Profile Photo

🔥 New Preprint! 🔥 How should LLMs handle ambiguous questions in text-to-SQL semantic parsing? 👉🏼 Disambiguate First, Parse Later! We propose a plug-and-play approach that explicitly disambiguates the question 💬 Paper: arxiv.org/abs/2502.18448

🔥 New Preprint! 🔥

How should LLMs handle ambiguous questions in text-to-SQL semantic parsing?

👉🏼 Disambiguate First, Parse Later!

We propose a plug-and-play approach that explicitly disambiguates the question 💬

Paper: arxiv.org/abs/2502.18448
Rohit Saxena (@rohit_saxena) 's Twitter Profile Photo

Can multimodal LLMs truly understand research poster images?📊 🚀 We introduce PosterSum—a new multimodal benchmark for scientific poster summarization! 🪧 📂 Dataset: huggingface.co/datasets/rohit… 📜 Paper: arxiv.org/abs/2502.17540

Can multimodal LLMs truly understand research poster images?📊

🚀 We introduce PosterSum—a new multimodal benchmark for scientific poster summarization! 🪧

📂 Dataset: huggingface.co/datasets/rohit…
📜 Paper: arxiv.org/abs/2502.17540
Rohit Saxena (@rohit_saxena) 's Twitter Profile Photo

📣This work will appear at the ICLR 2025 Workshop on Reasoning and Planning for LLMs.🇸🇬 I'm currently on the job market, looking for research scientist roles. Feel free to reach out if you're hiring or know of any opportunities!