Hasan Hammoud (@hammh0a) 's Twitter Profile
Hasan Hammoud

@hammh0a

Ph.D. candidate in Computer Vision and Machine Learning @KaustVision; Former Intern at @samsungresearch; Former Intern at @UniofOxford

ID: 1637995601018388481

Joined: 21-03-2023 01:52:46

185 Tweets

764 Followers

615 Following

Aleks Petrov (@aleksppetrov) 's Twitter Profile Photo

If you work on long-context compression for LLMs, you've seen the Gisting approach: add a few "gist tokens" and adjust the attention mask so all context flows into them. Elegant and simple… But we found that it COMPLETELY BREAKS when compressing more than just a few tokens 🤯
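The masking idea described in the tweet (all context flows into a handful of gist tokens, and everything after them can only see the gist tokens) might be sketched roughly as follows. This is a minimal illustration, not the paper's implementation; the function name and block sizes are assumptions.

```python
import numpy as np

def gist_attention_mask(n_context, n_gist, n_suffix):
    """Boolean attention mask (True = may attend) for gist-style compression:
    gist tokens attend causally to the full context, while suffix tokens may
    only attend to the gist tokens and to each other, never the raw context."""
    n = n_context + n_gist + n_suffix
    mask = np.tril(np.ones((n, n), dtype=bool))  # standard causal mask
    gist_start = n_context
    suffix_start = n_context + n_gist
    # Suffix tokens cannot see the raw context; information must flow
    # through the gist tokens, which is what compresses the context.
    mask[suffix_start:, :gist_start] = False
    return mask

m = gist_attention_mask(n_context=4, n_gist=2, n_suffix=3)
```

Under this mask the gist tokens are the only path from the context to later tokens, which is why failures show up once the compressed span grows beyond a few tokens.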

Tong Zhang (@tongzhang9801) 's Twitter Profile Photo

📢Excited to share our new paper "Motion-Aware Concept Alignment for Consistent Video Editing". A training-free framework for video semantic mixing: 
🔁Blend new concepts into specific objects 
🎯Maintain spatial stability & temporal coherence
📊Outperform baselines

A thread🧵
Gordon Guocheng Qian (@guocheng_qian) 's Twitter Profile Photo

📢I am attending #CVPR2025 (Jun 11 - 14). Come to our snap-research.github.io/Omni-ID/ poster to learn how we achieved the highest ID preservation in personalization, and how our follow-up work further enables expression following. See you Fri 4 - 6 pm, ExHall D, Poster #326.

Thao Nguyen (@thao_nguyen26) 's Twitter Profile Photo

Web data, the “fossil fuel of AI”, is being exhausted. What’s next?🤔
We propose Recycling the Web to break the data wall of pretraining via grounded synthetic data. It is more effective than standard data filtering methods, even with multi-epoch repeats!

arxiv.org/abs/2506.04689
Alejandro Pardo (@pardoalejo) 's Twitter Profile Photo

🚀 Our MatchDiffusion was accepted to ICCV 2025 in Hawaii! 🌺 We generate two synchronized videos from text prompts—designed for match-cuts. Results: matchdiffusion.github.io Paper: arxiv.org/abs/2411.18677 #MatchDiffusion #ICCV2025 #DiffusionModels #TextToVideo #GenerativeAI

Hasan Hammoud (@hammh0a) 's Twitter Profile Photo

New paper out! Train Long, Think Less.

We introduce Curriculum GRPO: start with long reasoning chains, then progressively tighten token budgets to train LLMs that think better with fewer tokens.

📈 +Accuracy, 🔻Token usage, across GSM8K, MATH500 & more.

Special thanks to all
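The curriculum described above (generous reasoning budget first, then progressively tighter) might be sketched as a staged budget schedule. The stage count and budget values below are illustrative assumptions, not the paper's actual hyperparameters.

```python
def curriculum_budget(step, total_steps, start_budget=4096, end_budget=512, n_stages=4):
    """Hypothetical token-budget schedule for curriculum-style RL training:
    begin with a long reasoning budget and tighten it in discrete stages,
    so the model first learns to reason, then learns to reason concisely."""
    # Which curriculum stage does this training step fall into?
    stage = min(int(step / total_steps * n_stages), n_stages - 1)
    # Interpolate the budget linearly across stages (tightening over time).
    frac = stage / (n_stages - 1)
    return int(start_budget + frac * (end_budget - start_budget))
```

A trainer would pass the current stage's budget as the generation length limit when sampling rollouts for the GRPO update.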
KAUST (@kaust_news) 's Twitter Profile Photo

AI, decoded in under a minute. Prof. Bernard Ghanem from #KAUST, ranked #1 in the Middle East for producing #AItalent, breaks AI into four pillars. The expertise driving Saudi Arabia's bold #AI future.

Thao Nguyen (@thao_nguyen26) 's Twitter Profile Photo

We released 44B synthetic tokens from our CoT-guided rewriting, offering higher-quality pretraining data than average human-written web text📈 🤗Data: huggingface.co/datasets/faceb… 📜Paper: arxiv.org/abs/2506.04689 (accepted at #COLM2025) Excited to see what the community builds!

Hasan Hammoud (@hammh0a) 's Twitter Profile Photo

We just released Hala: open, state-of-the-art Arabic instruction & translation models!

✨ Includes:
• 1.2B Translation model (very lightweight)
• 4.6M Arabic Instruction Tuning Dataset
• 4 models (350M–9B)

📄 Paper: huggingface.co/papers/2509.14… Don't forget to upvote :)!! 
🤗
DailyPapers (@huggingpapers) 's Twitter Profile Photo

Hala: New Arabic-centric models released on Hugging Face

A family of state-of-the-art instruction and translation models, built with a novel translate-and-tune pipeline.

Achieves SOTA performance in "nano" (≤2B) and "small" (7-9B) categories on Arabic benchmarks.
ChatPaper.ai (@chatpaper_ai) 's Twitter Profile Photo

🔥 Daily AI Paper (2025-09-18) 📄 Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale 🔗 chatpaper.ai/dashboard/pape… #AI #ML #ChatPaper

AI Native Foundation (@ainativef) 's Twitter Profile Photo

1. Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale

🔑 Keywords: Arabic-centric, Hala, translate-and-tune pipeline, lightweight language model, NLP

💡 Category: Natural Language Processing

🌟 Research Objective:
   - The primary goal is