Zhengyang Tang (@zhengyang_42)'s Twitter Profile
Zhengyang Tang

@zhengyang_42

PhD candidate @cuhksz, Intern @Alibaba_Qwen. Prev: @MSFTResearch, @TencentGlobal, @AlibabaGroup.

ID: 759762884310159360

Link: https://tangzhy.github.io/ · Joined: 31-07-2016 14:49:22

17 Tweets

76 Followers

202 Following

arXiv Daily (@arxiv_daily):

DPTDR: Deep Prompt Tuning for Dense Passage Retrieval deepai.org/publication/dp… by Zhengyang Tang et al., including Benyou Wang #NaturalLanguageProcessing #Computation
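
For context: DPTDR applies deep prompt tuning to dual-encoder passage retrieval, so the pretrained backbone stays frozen and only continuous prompt vectors are trained. A minimal PyTorch sketch of the input-level variant follows, with a toy backbone and sizes assumed (the deep variant injects prompts at every transformer layer, which is omitted here):

```python
import torch
import torch.nn as nn

class PromptedEncoder(nn.Module):
    """Toy prompt-tuned encoder: frozen backbone, trainable prompt vectors."""
    def __init__(self, hidden=256, prompt_len=16, layers=2):
        super().__init__()
        block = nn.TransformerEncoderLayer(d_model=hidden, nhead=4, batch_first=True)
        self.backbone = nn.TransformerEncoder(block, num_layers=layers)
        for p in self.backbone.parameters():   # backbone is never updated
            p.requires_grad = False
        # the only trainable parameters: continuous prompt embeddings
        self.prompt = nn.Parameter(torch.randn(prompt_len, hidden) * 0.02)

    def forward(self, token_embeds):           # (batch, seq, hidden)
        prompts = self.prompt.unsqueeze(0).expand(token_embeds.size(0), -1, -1)
        out = self.backbone(torch.cat([prompts, token_embeds], dim=1))
        return out[:, 0]                       # first position as the dense vector

query_vec = PromptedEncoder()(torch.randn(2, 32, 256))  # toy query batch
```

Queries and passages each go through such an encoder (shared or separate), and retrieval scores are dot products between the resulting vectors.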

Aran Komatsuzaki (@arankomatsuzaki):

Microsoft presents GLAN (Generalized Instruction Tuning)

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

GLAN excels without using task-specific training data

arxiv.org/abs/2402.13064
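
GLAN's pitch is generating instruction data from a general taxonomy of human knowledge (disciplines, subjects, syllabi, exercises) rather than from task-specific seeds. A toy sketch of that kind of pipeline, where llm() is a stand-in for any chat-model client and the prompts are invented for illustration:

```python
def llm(prompt: str) -> str:
    """Stand-in for a real chat-model call; replace with your own client."""
    return "..."

def generate_instructions(discipline: str, n_subjects=3, n_questions=2):
    # taxonomy first: discipline -> subjects -> syllabus -> exercises
    subjects = llm(f"List {n_subjects} subjects within {discipline}.").splitlines()
    pairs = []
    for subject in subjects:
        syllabus = llm(f"Write a syllabus of key concepts for {subject}.")
        for _ in range(n_questions):
            q = llm(f"Write one exercise testing a concept from:\n{syllabus}")
            a = llm(f"Answer step by step:\n{q}")
            pairs.append({"instruction": q, "response": a})
    return pairs  # generic instruction pairs, no task-specific training data
```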
AK (@_akhaliq):

MathScale

Scaling Instruction Tuning for Mathematical Reasoning

Large language models (LLMs) have demonstrated remarkable capabilities in problem-solving. However, their proficiency in solving mathematical problems remains inadequate.
Zhengyang Tang (@zhengyang_42):

🚀 Launching ORLM: the first open-source Operations Research LLM, powered by our OR-Instruct process! 🛠️

🏆 ORLMs achieve SOTA on NL4OPT, MAMO, & the new IndustryOR benchmarks with different 7B backbones!

📄 Paper: arxiv.org/pdf/2405.17743
💻 Code: github.com/Cardinal-Opera…
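
The task ORLMs are trained for is turning a natural-language operations-research problem into a formal model plus solver code. A hand-written example of that target artifact, with a toy word problem assumed and scipy's linprog used as the solver:

```python
# Word problem (invented): a factory makes products A and B with unit profits
# 3 and 5; each unit needs 4 and 6 machine hours, and 240 hours are available.
from scipy.optimize import linprog

# maximize 3*xA + 5*xB  ->  linprog minimizes, so negate the objective
res = linprog(
    c=[-3, -5],
    A_ub=[[4, 6]],                   # machine-hour usage per unit
    b_ub=[240],                      # total machine hours available
    bounds=[(0, None), (0, None)],   # non-negative production
)
print(res.x, -res.fun)               # optimal plan and profit
```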
Qingxiu Dong (@qx_dong):

OpenAI o1 scores 94.8% on MATH dataset😲
Then...how should we proceed to track and evaluate the next-gen LLMs' math skills? 

👉Omni-Math: a new, challenging benchmark with 4k competition-level problems, where OpenAI o1-mini achieves only 60.54% accuracy
Paper: huggingface.co/papers/2410.07…
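
Scoring such a benchmark reduces to checking each model answer against a reference. A deliberately minimal accuracy sketch with an assumed record schema (real competition-math grading typically needs symbolic or LLM-based answer equivalence, not plain string matching):

```python
def accuracy(records):
    """records: dicts with 'model_answer' and 'reference' strings (assumed schema)."""
    correct = sum(r["model_answer"].strip() == r["reference"].strip() for r in records)
    return 100.0 * correct / len(records)

print(accuracy([{"model_answer": "42", "reference": "42"},
                {"model_answer": "7",  "reference": "9"}]))  # 50.0
```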
Zhengyang Tang (@zhengyang_42):

📢 Introducing SCRIT: A framework enabling LLMs to self-evolve their critique abilities without human annotations or stronger models.

💡 Key features:
• Contrastive self-critic
• Mathematical validity check
• Zero external supervision

🔗 Paper: huggingface.co/papers/2501.05…
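
A rough sketch of how such a self-critique loop can be wired up, with a stand-in llm() helper and invented prompts (not the paper's exact pipeline): the model critiques an attempt while contrasting it against a known-correct reference, and the critique is kept as self-training data only if its verdict agrees with the ground truth.

```python
def llm(prompt: str) -> str:
    """Stand-in for the model being improved; replace with a real client."""
    return "... Conclusion: correct"

def self_critique(problem, reference_solution, attempt, attempt_is_correct):
    # contrastive self-critic: digest a correct reference before judging
    critique = llm(
        f"Problem: {problem}\nReference solution:\n{reference_solution}\n"
        f"Contrast the reference with this attempt and critique it step by step, "
        f"ending with 'Conclusion: correct' or 'Conclusion: incorrect'.\n{attempt}"
    )
    verdict = critique.strip().lower()
    says_correct = verdict.endswith("correct") and not verdict.endswith("incorrect")
    # validity check: keep only critiques whose verdict matches ground truth
    return critique if says_correct == attempt_is_correct else None
```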
Ziniu Li @ ICLR2025 (@ziniuli):

🚀 Critique abilities are key for scaling LLMs, but current open-source models fall short.

We introduce SCRIT: a framework with scalable oversight that enables LLMs to self-improve their critique skills✨

We’ve built a pipeline to generate high-quality synthetic critique data
Zhengyang Tang (@zhengyang_42):

Thrilled to share our paper "ORLM: A Customizable Framework in Training Large Models for Automated Optimization Modeling" has been accepted by Operations Research! 🎉

This is the FIRST LLM paper in the 70+ year history of this prestigious journal. Our framework improves modeling
Zhengyang Tang (@zhengyang_42):

Super excited to have been part of the Qwen3 team! We just dropped our technical report - check it out if you're interested in what's under the hood. Hope it helps with your projects and research. Let us know what you think! #Qwen3 #AI

AK (@_akhaliq):

Learning from Peers in Reasoning Models

Large Reasoning Models often get stuck when they start reasoning incorrectly (the "Prefix Dominance Trap"). The paper proposes LeaP (Learning from Peers), a method where parallel reasoning paths share intermediate summaries to learn from each other.
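
A toy sketch of the peer-sharing idea, with a stand-in llm() helper and invented prompts: several reasoning paths run in parallel, periodically summarize themselves, and see each other's summaries, which can pull a path out of a bad prefix.

```python
def llm(prompt: str) -> str:
    """Stand-in for a reasoning model; replace with a real client."""
    return "..."

def reason_with_peers(question, n_paths=4, n_rounds=3):
    paths = [f"Question: {question}\n" for _ in range(n_paths)]
    for _ in range(n_rounds):
        # each path extends its own chain of thought a little further
        paths = [p + llm(p + "\nContinue reasoning:") for p in paths]
        # each path condenses its progress into a short summary
        summaries = [llm(p + "\nSummarize your reasoning in one sentence:")
                     for p in paths]
        # peer insertion: every path sees the others' summaries
        for i in range(n_paths):
            peers = "\n".join(s for j, s in enumerate(summaries) if j != i)
            paths[i] += f"\n[Peer summaries]\n{peers}\n"
    return [p + llm(p + "\nFinal answer:") for p in paths]
```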
Tanishq Mathew Abraham, Ph.D. (@iscienceluvr):

CoRT: Code-integrated Reasoning within Thinking

"This paper introduces CoRT, a post-training framework for teaching LRMs to leverage Code Interpreter effectively and efficiently."

"We manually create 30 high-quality samples, upon which we post-train models ranging from 1.5B to
Zhengyang Tang (@zhengyang_42):

We’re excited to share our new paper “CoRT: Code-integrated Reasoning within Thinking”!

🤖 A post-training framework that teaches Large Reasoning Models (LRMs) to better leverage Code Interpreters for enhanced mathematical reasoning.

🔍 Key Highlights:

Strategic hint
Zhengyang Tang (@zhengyang_42):

Happy to share that our paper "Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion" has been accepted to #ACL2025 as oral & panel presentation (25 out of 3000 accepted papers = top 0.8%)! 🎉 🚀 We introduce AceGPT with Progressive Vocabulary
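
The tweet is truncated, but the core mechanism, growing the tokenizer's vocabulary in stages while continuing pretraining, maps onto standard HuggingFace APIs. One expansion step might look like the sketch below (the backbone and tokens are placeholders, and the progressive schedule is the paper's contribution, not shown):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("gpt2")   # stand-in backbone
tokenizer = AutoTokenizer.from_pretrained("gpt2")

new_tokens = ["السلام", "عليكم"]                       # example Arabic tokens
tokenizer.add_tokens(new_tokens)
model.resize_token_embeddings(len(tokenizer))          # grow embedding matrix
# ...continue pretraining on Arabic text so the new embeddings are learned,
# then repeat with the next batch of tokens...
```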

Zhengyang Tang (@zhengyang_42):

🚀 Thrilled to announce that our paper "SCRIT: Self-Evolving LLM Critique without Human or Stronger Models" was accepted to #COLM2025! We enable LLMs to self-improve critique abilities — zero human annotations, zero stronger models needed! 🔄✨ Looking forward to meeting

Binyuan Hui (@huybery):

We’ve updated Qwen3 and made excellent progress. The non-reasoning model now delivers significant improvements across a wide range of tasks, and many of its capabilities already rival those of reasoning models. It’s truly remarkable, and we hope you enjoy it!