QingkaiZeng (@qingkaizeng_cs) 's Twitter Profile
QingkaiZeng

@qingkaizeng_cs

CS Ph.D. student in LLMs and KG, advised by @meng_cs | Ex-intern @TencentGlobal AI Lab | Looking for a research intern or full-time position in 2024.

ID: 1669583988032405504

linkhttps://qingkaizeng.github.io/ calendar_today16-06-2023 05:53:47

20 Tweet

254 Followers

460 Following

Zhenwen Liang (@liangzhenwen) 's Twitter Profile Photo

📢Check out our recent work in mathematical reasoning! Our Multi-View Fine-Tuning (MinT🌿) method achieves the SOTA performance of 7B models without LLM teachers, by harnessing different datasets, solution styles, and even noisy data in training. Link: arxiv.org/pdf/2307.07951…

📢Check out our recent work in mathematical reasoning! 

Our Multi-View Fine-Tuning (MinT🌿) method achieves the SOTA performance of 7B models without LLM teachers, by harnessing different datasets, solution styles, and even noisy data in training.

Link: arxiv.org/pdf/2307.07951…
Yue Dong @ NeurIPS 2023 (@yuedongcs) 's Twitter Profile Photo

#LLMs excel in various tasks, yet autoregressive #LLMs have limitations. We invite you to explore alternatives like non-autoregressive editing models with us. Join our tutorial (Google Eric Malmi & UCR) at #KDD2023 Aug 8th 10am-1pm, room 202c. Info: kdd2023-text-editing.github.io

AK (@_akhaliq) 's Twitter Profile Photo

Stabilizing RLHF through Advantage Model and Selective Rehearsal paper page: huggingface.co/papers/2309.10… Large Language Models (LLMs) have revolutionized natural language processing, yet aligning these models with human values and preferences using RLHF remains a significant

Stabilizing RLHF through Advantage Model and Selective Rehearsal

paper page: huggingface.co/papers/2309.10…

Large Language Models (LLMs) have revolutionized natural language processing, yet aligning these models with human values and preferences using RLHF remains a significant
Meng Jiang (@meng_cs) 's Twitter Profile Photo

Instructing LLMs to solve math word prob better: - Mask a condition, Predict the condition with initial answer, Verify the answer, Rectify to avoid wrong answer(s) (AAAI'24): wzy6642.github.io/prp.github.io/ - Identify and Ignore Irrelevant Conditions (NAACL'24): wzy6642.github.io/I3C.github.io/

QingkaiZeng (@qingkaizeng_cs) 's Twitter Profile Photo

The code of our CoL is available now. Feel free to use it in your own project! Thanks to Yuyang Bai for organizing the code package. Code: github.com/QingkaiZeng/Ch… Please contact me (qingkaizeng.github.io) if you have any questions about our work.

Zhenyu Wu (@zhenyu9409) 's Twitter Profile Photo

🔎How to unleash LLM's inherent ability to detect and rectify incorrect responses without external feedback? 👀Check out our latest pre-print and find out how to progressively identify and correct false responses. Link: arxiv.org/abs/2405.14092

🔎How to unleash LLM's inherent ability to detect and rectify incorrect responses without external feedback?

👀Check out our latest pre-print and find out how to progressively identify and correct false responses.

Link: arxiv.org/abs/2405.14092
Han Zhao (@hanzhao_ml) 's Twitter Profile Photo

🚨🚨 We are hiring! RT appreciated! Prof. Rui Song (song-ray.github.io) and I will recruit post-doc scientists through Amazon’s post-doc program (amazon.science/postdoctoral-s…).

Bowen Jin (@bowenjin13) 's Twitter Profile Photo

How do we define 'long' chain-of-thought (CoT) reasoning? Is it about hundreds, thousands, or even more reasoning tokens for a question?

Shangbin Feng (@shangbinfeng) 's Twitter Profile Photo

👀 How to effectively leverage the expertise of diverse models? ✨ Optimize graphs of LLMs with swarm intelligence! 👉🏻 Introducing Heterogeneous Swarms, jointly optimizing the roles and weights of multi-LLM systems for collaborative gains! 📄 Paper: arxiv.org/abs/2502.04510

👀 How to effectively leverage the expertise of diverse models?
✨ Optimize graphs of LLMs with swarm intelligence!

👉🏻 Introducing Heterogeneous Swarms, jointly optimizing the roles and weights of multi-LLM systems for collaborative gains!

📄 Paper: arxiv.org/abs/2502.04510
Lucy Family Institute for Data & Society (@lucy_institute) 's Twitter Profile Photo

Exciting news at the Lucy Family Institute for Data & Society! The Foundation Models and Applications Lab has launched with co-directors Meng Jiang and Xiangliang Zhang from University of Notre Dame's Department of Computer Science and Engineering. Learn more: lucyinstitute.nd.edu/news-events/20…

Exciting news at the <a href="/lucy_institute/">Lucy Family Institute for Data & Society</a>! The Foundation Models and Applications Lab has launched with co-directors <a href="/Meng_CS/">Meng Jiang</a> and <a href="/xiangliangzhang/">Xiangliang Zhang</a> from <a href="/NotreDame/">University of Notre Dame</a>'s Department of Computer Science and Engineering.

Learn more: lucyinstitute.nd.edu/news-events/20…
QingkaiZeng (@qingkaizeng_cs) 's Twitter Profile Photo

Totally agree. Still, I dislike how we split papers into “main vs. findings,” “oral vs. poster,” or “spotlight vs. others.” What’s truly valuable is solid, reproducible, real-world work. Collaboration > competition, always.