Qingxiu Dong (@qx_dong) 's Twitter Profile
Qingxiu Dong

@qx_dong

PhD student @PKU1898. Research Intern @MSFTResearch Asia.

ID: 1157164788398448642

linkhttp://dqxiu.github.io calendar_today02-08-2019 05:42:39

117 Tweet

1,1K Followers

663 Following

Liang Chen (@liangchen5518) 's Twitter Profile Photo

โœจA Spark of Vision-Language Intelligence! We introduce DnD-Transformer, a new auto-regressive image gen model beats GPT/Llama w/o extra cost. AR gen beats diffusion in joint VL modeling in a self-supervised way! Github: github.com/chenllliang/Dnโ€ฆ Paper: huggingface.co/papers/2410.01โ€ฆ

Heming Xia (@hemingkx) 's Twitter Profile Photo

๐Ÿค”How much potential do LLMs have for self-acceleration through layer sparsity? ๐Ÿš€ ๐Ÿšจ Excited to share our latest work: SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration. Arxiv: arxiv.org/abs/2410.06916 ๐Ÿงต1/n

๐Ÿค”How much potential do LLMs have for self-acceleration through layer sparsity? ๐Ÿš€

๐Ÿšจ Excited to share our latest work:

SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration.

Arxiv: arxiv.org/abs/2410.06916

๐Ÿงต1/n
Yiming Huang (@yeeelow233) 's Twitter Profile Photo

๐Ÿค” Are LLMs Ready for Real-World Data Science Challenges? ๐Ÿš€ Weโ€™ve just open-sourced our #EMNLP2024 work DA-Code, a cutting-edge benchmark designed to push LLMs to their limits in real-world data science tasks. Get involved and challenge your models! da-code-bench.github.io

๐Ÿค” Are LLMs Ready for Real-World Data Science Challenges? 

๐Ÿš€ Weโ€™ve just open-sourced our #EMNLP2024 work DA-Code, a cutting-edge benchmark designed to push LLMs to their limits in real-world data science tasks.

Get involved and challenge your models!
da-code-bench.github.io
Qingxiu Dong (@qx_dong) 's Twitter Profile Photo

(Perhaps a bit late) Excited to announce our survey on ICL has been accepted to #EMNLP2024 main conf and been cited 1,000+ times! Thanks to all collaborators and contributors to this field! We've updated the survey arxiv.org/abs/2301.00234. Excited to keep pushing boundaries!

Hongyu Wang (@realhongyu_wang) 's Twitter Profile Photo

How to deploy a 100B model on your CPU devices? ๐Ÿ”ฅ Excited to introduce bitnet.cpp, our inference framework for BitNet b1.58 ๐Ÿš€๐Ÿš€ github.com/microsoft/bitnโ€ฆ

How to deploy a 100B model on your CPU devices? ๐Ÿ”ฅ

Excited to introduce bitnet.cpp, our inference framework for BitNet b1.58 ๐Ÿš€๐Ÿš€

github.com/microsoft/bitnโ€ฆ
Qingxiu Dong (@qx_dong) 's Twitter Profile Photo

About to arrive in #Miami ๐ŸŒด after a 30-hour flight for #EMNLP2024! Excited to see new and old friends :) Iโ€™d love to chat about data synthesis and deep reasoning for LLMs (or anything else) โ€”feel free to reach out!

Zekun Wang (ZenMoore) ๐Ÿ”ฅ (@zenmoore1) 's Twitter Profile Photo

๐ŸŽ†Survey of the Year: ๐๐ž๐ฑ๐ญ ๐“๐จ๐ค๐ž๐ง ๐๐ซ๐ž๐๐ข๐œ๐ญ๐ข๐จ๐ง ๐“๐จ๐ฐ๐š๐ซ๐๐ฌ ๐Œ๐ฎ๐ฅ๐ญ๐ข๐ฆ๐จ๐๐š๐ฅ ๐ˆ๐ง๐ญ๐ž๐ฅ๐ฅ๐ข๐ ๐ž๐ง๐œ๐ž: ๐€ ๐‚๐จ๐ฆ๐ฉ๐ซ๐ž๐ก๐ž๐ง๐ฌ๐ข๐ฏ๐ž ๐’๐ฎ๐ซ๐ฏ๐ž๐ฒ arXiv: arxiv.org/abs/2412.18619 HugFace: huggingface.co/papers/2412.18โ€ฆ Github: github.com/LMM101/Awesomeโ€ฆ

๐ŸŽ†Survey of the Year:
๐๐ž๐ฑ๐ญ ๐“๐จ๐ค๐ž๐ง ๐๐ซ๐ž๐๐ข๐œ๐ญ๐ข๐จ๐ง ๐“๐จ๐ฐ๐š๐ซ๐๐ฌ ๐Œ๐ฎ๐ฅ๐ญ๐ข๐ฆ๐จ๐๐š๐ฅ ๐ˆ๐ง๐ญ๐ž๐ฅ๐ฅ๐ข๐ ๐ž๐ง๐œ๐ž: ๐€ ๐‚๐จ๐ฆ๐ฉ๐ซ๐ž๐ก๐ž๐ง๐ฌ๐ข๐ฏ๐ž ๐’๐ฎ๐ซ๐ฏ๐ž๐ฒ

arXiv: arxiv.org/abs/2412.18619
HugFace: huggingface.co/papers/2412.18โ€ฆ
Github: github.com/LMM101/Awesomeโ€ฆ
Liang Chen (@liangchen5518) 's Twitter Profile Photo

Proud to introduce our latest work โ€œNext Token Prediction Towards Multimodal Intelligence: A Comprehensive Surveyโ€ as our new year gift for the multimodal learning community! Paper: huggingface.co/papers/2412.18โ€ฆ Github: github.com/LMM101/Awesomeโ€ฆ

Proud to introduce our latest work โ€œNext Token Prediction Towards Multimodal Intelligence: A Comprehensive Surveyโ€ as our new year gift for the multimodal learning community!  

Paper: huggingface.co/papers/2412.18โ€ฆ
Github: github.com/LMM101/Awesomeโ€ฆ
Hongyu Wang (@realhongyu_wang) 's Twitter Profile Photo

Excited to introduce BitNet b1.58 2B4T โ€” the first large-scale, native 1-bit LLM๐Ÿš€๐Ÿš€ BitNet achieves performance on par with leading full-precision LLMs โ€” and itโ€™s blazingly fastโšก๏ธโšก๏ธuses much lower memory๐ŸŽ‰ Everything is open-sourced โ€” run it on GPU or your Macbook ๐Ÿ–ฅ๏ธโš™๏ธ

Excited to introduce BitNet b1.58 2B4T โ€” the first large-scale, native 1-bit LLM๐Ÿš€๐Ÿš€

BitNet achieves performance on par with leading full-precision LLMs โ€” and itโ€™s blazingly fastโšก๏ธโšก๏ธuses much lower memory๐ŸŽ‰

Everything is open-sourced โ€” run it on GPU or your Macbook ๐Ÿ–ฅ๏ธโš™๏ธ
Qingxiu Dong (@qx_dong) 's Twitter Profile Photo

So happy to reunite with old and new friends at ICLR! Had an amazing time exploring Singapore too! ๐ŸŒŸ๐Ÿ‡ธ๐Ÿ‡ฌ #ICLR2025

So happy to reunite with old and new friends at ICLR! 

Had an amazing time exploring Singapore too! ๐ŸŒŸ๐Ÿ‡ธ๐Ÿ‡ฌ #ICLR2025
Qingxiu Dong (@qx_dong) 's Twitter Profile Photo

All my labmates and I from the PKU Computational Linguistics Lab will be in #Suzhou for #EMNLP2025 (Nov 3โ€“9) ! Looking forward to meeting old and new friends. Always happy to grab a coffee and chat ๐Ÿฅฐ

Tianzhu Ye โœˆ๏ธ ICLR Singapore (@ytz2024) 's Twitter Profile Photo

๐Ÿš€ We propose Generative Adversarial Distillation (GAD) ๐Ÿค– Designed to perform on-policy distillation from proprietary black-box LLMs. โžก๏ธ Requires neither access to teacher logits nor alignment of tokenizer vocabularies. (1/n)

๐Ÿš€ We propose Generative Adversarial Distillation (GAD)
๐Ÿค– Designed to perform on-policy distillation from proprietary black-box LLMs.
โžก๏ธ Requires neither access to teacher logits nor alignment of tokenizer vocabularies.

(1/n)