Qingxiu Dong
@qx_dong
PhD student @PKU1898. Research Intern @MSFTResearch Asia.
ID: 1157164788398448642
http://dqxiu.github.io 02-08-2019 05:42:39
117 Tweets
1.1K Followers
663 Following
✨A Spark of Vision-Language Intelligence! We introduce DnD-Transformer, a new auto-regressive image gen model that beats GPT/Llama w/o extra cost. AR gen beats diffusion in joint VL modeling in a self-supervised way! Github: github.com/chenllliang/Dn… Paper: huggingface.co/papers/2410.01…
Survey of the Year: Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey arXiv: arxiv.org/abs/2412.18619 HugFace: huggingface.co/papers/2412.18… Github: github.com/LMM101/Awesome…
Excited to introduce BitNet b1.58 2B4T, the first large-scale, native 1-bit LLM. BitNet achieves performance on par with leading full-precision LLMs, and it's blazingly fast ⚡️⚡️ and uses much less memory. Everything is open-sourced: run it on GPU or your MacBook 🖥️