Alex Gurung (@alexaag1234) Twitter Tweets • TwiCopy

Alex Gurung

@alexaag1234

+ Follow

PhD student at @EdinburghNLP | undergrad+masters @gtcomputing

ID: 2729564050

calendar_today31-07-2014 01:58:14

21 Tweet

410 Takipçi

411 Takip Edilen

Gate.io

@gate_io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Mateusz Klimaszewski

@m_klimasz

8 months ago

The next EuroLLM model is out 🎉 We support all the 🇪🇺 EU languages (+ more), but now in a 9B size (base and instruct). We are not done yet; stay tuned for more 👀

thumb_up_off_alt22

chat_bubble_outline0

repeat6

shareShare

Is sparsity the key to conditional computation, interpretability, long context/generation, and more in foundation models? Find out at my #NeurIPS2024 tutorial on Dynamic Sparsity in Machine Learning with Andre Martins! Followed by a panel with Sara Hooker and Alessandro Sordoni 🧵

thumb_up_off_alt85

chat_bubble_outline2

repeat25

shareShare

Yasmine

@cyousakura

6 months ago

🎉 Introducing Open Reasoner Zero 🚀 Performance: Matches DeepSeek R1-Zero (32B) in just 1/30 steps! 📚 Full training strategies & technical paper 💻 100% open-source: Code + Data + Model ⚖️ MIT licensed - Use it your way! 🌊 Let the Reasoner-Zero tide rise! 🚢 1/n

thumb_up_off_alt862

chat_bubble_outline27

repeat158

shareShare

Wenhao Zhu

@wenhao_nlp

5 months ago

🎉 Excited to share “Generalizing from Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning” 📄 (arxiv.org/pdf/2502.15592) We propose "context synthesis": instead of generating instructions from long texts, we synthesize contexts for instructions—drawing

thumb_up_off_alt75

chat_bubble_outline1

repeat21

shareShare

Irina Saparina

@irisaparina

5 months ago

🔥 New Preprint! 🔥 How should LLMs handle ambiguous questions in text-to-SQL semantic parsing? 👉🏼 Disambiguate First, Parse Later! We propose a plug-and-play approach that explicitly disambiguates the question 💬 Paper: arxiv.org/abs/2502.18448

thumb_up_off_alt19

chat_bubble_outline1

repeat7

shareShare

Rohit Saxena

@rohit_saxena

5 months ago

Can multimodal LLMs truly understand research poster images?📊 🚀 We introduce PosterSum—a new multimodal benchmark for scientific poster summarization! 🪧 📂 Dataset: huggingface.co/datasets/rohit… 📜 Paper: arxiv.org/abs/2502.17540

thumb_up_off_alt81

chat_bubble_outline2

repeat24

shareShare

Rohit Saxena

@rohit_saxena

5 months ago

📣This work will appear at the ICLR 2025 Workshop on Reasoning and Planning for LLMs.🇸🇬 I'm currently on the job market, looking for research scientist roles. Feel free to reach out if you're hiring or know of any opportunities!

thumb_up_off_alt21

chat_bubble_outline0

repeat11

shareShare

Alex Gurung

Gate.io

Mateusz Klimaszewski

Edoardo Ponti

Yasmine

Wenhao Zhu

Irina Saparina

Rohit Saxena

Rohit Saxena