Guangsheng Bao (@gshbao)'s Twitter Profile
Guangsheng Bao

@gshbao

Ph.D. candidate at @NlpWestlake, @Westlake_Uni, and @ZJU_China, supervised by Prof. Yue Zhang. Previously employed by @Microsoft and @AlibabaGroup.

ID: 1503967797428195329

Link: https://baoguangsheng.github.io/
Joined: 16-03-2022 05:34:29

27 Tweets

52 Followers

120 Following

Guangsheng Bao (@gshbao)'s Twitter Profile Photo

Non-autoregressive models advance document-level machine translation with impressive speedup. Delve into the opportunities and challenges of NAT on extended sequences: aclanthology.org/2023.findings-… #EMNLP2023

Guangsheng Bao (@gshbao)'s Twitter Profile Photo

GEMINI: Controlling The Sentence-Level Summary Style in Abstractive Text Summarization #EMNLP2023 aclanthology.org/2023.emnlp-mai…

Linyi Yang (@linyi_yang)'s Twitter Profile Photo

Thanks for sharing our work. Our primary contribution is the establishment of a framework that enhances the reliability of LLMs as it: 1) generalizes out-of-distribution data, 2) elucidates how LLMs benefit from discriminative models, and 3) minimizes hallucinations.

Guangsheng Bao (@gshbao)'s Twitter Profile Photo

Excited to announce that Fast-DetectGPT made it to #ICLR2024 🎉 WestlakeNLP

In the rebuttal phase, got an unfair 1 amidst 8866. Huge shoutout to Eric for the public support. Appreciate your sense of justice! ⚖️🔍

Paper: openreview.net/forum?id=Bpcgc…

Zhiyang Teng (@zhiyangteng)'s Twitter Profile Photo

I am seeking a research intern to collaborate on the development and application of large language models within real-world industry scenarios at ByteDance, Singapore. If you are interested, please drop me an email.

Jindong Wang (@jd92wang)'s Twitter Profile Photo

GWLS: a general framework to learn from ANY weak supervision! It outperforms existing methods on *11* weak supervision settings, e.g., partial label, multiple instance learning, label prop., multiclass multi-label... 📸Paper: arxiv.org/abs/2402.01922

Hongbo (@hongbo00231523)'s Twitter Profile Photo

[1/5] 🧵 Thrilled to unveil our latest research on "Causal Analysis of CoT in LLMs"! We delve into the intricate dynamics between Chain of Thought reasoning and answer generation in LLMs, revealing some unexpected insights. 🤖💭 📄Read the full paper: arxiv.org/abs/2402.16048

Hongbo (@hongbo00231523)'s Twitter Profile Photo

[2/5] Despite the potential of CoT to enhance task performance in LLMs, our findings show a surprising number of instances where correct answers follow incorrect CoTs and vice versa. This discrepancy raises fundamental questions about LLMs' reasoning capabilities.

Hongbo (@hongbo00231523)'s Twitter Profile Photo

[3/5] Employing causal analysis, we dissect the cause-effect relationships between CoTs/instructions and answers in LLMs. Our analysis exposes the Structural Causal Model (SCM) LLMs mimic, highlighting significant differences from human reasoning processes.

Hongbo (@hongbo00231523)'s Twitter Profile Photo

[4/5] We further explore how ICL, SFT, and RLHF significantly influence the causal structures in LLMs. Our investigation sheds light on how these techniques impact the reasoning process, offering critical insights for future advancements.

Hongbo (@hongbo00231523)'s Twitter Profile Photo

[5/5] Our research contributes to the broader discourse on the role of CoT in LLM reasoning, offering new angles on the extent to which LLMs replicate human-like reasoning steps. Please refer to our paper for detailed results!

Guangsheng Bao (@gshbao)'s Twitter Profile Photo

I just published “No Training Needed, Fast-DetectGPT Boosts Text Detection Speed by 340 Times” link.medium.com/sozGRpDd9Hb

Guangsheng Bao (@gshbao)'s Twitter Profile Photo

I'm attending #ICLR2024 in Vienna from May 7-11. Our posters are at Halle B #256 and #116 on May 8. Looking forward to meeting old and new friends!🥰

Guangsheng Bao (@gshbao)'s Twitter Profile Photo

Exciting news! 🎉 Our online demo for Fast-DetectGPT is now live! 🚀 Experience lightning-fast text detection in action. Give it a try here: [region-9.autodl.pro:21504] Let us know what you think! #FastDetectGPT #AI #TextDetection.

Guangsheng Bao (@gshbao)'s Twitter Profile Photo

⛄️ Excited to share our work on causal analysis of LLMs at COLING 2025! 💖 Hongbo, Linyi Yang, Cunxiang Wang

"How Likely Do LLMs with CoT Mimic Human Reasoning?"

Paper: arxiv.org/pdf/2402.16048

Linyi Yang (@linyi_yang)'s Twitter Profile Photo

Welcome to try our system. The feedback system will not replace any human reviewers. The agent will not write reviews or make automated edits to reviews. Rather, it will serve as an assistant, providing optional feedback that reviewers can incorporate or disregard. #LLM #ICLR #AI

Guangsheng Bao (@gshbao)'s Twitter Profile Photo

LLMs often rely on correlations, not causation. ❤️‍🔥

Our causal analyses show that RLVR-trained LRMs move closer to true causal reasoning — but distilled LRMs and LLMs do not⁉️

🧠 Paper: "Correlation or Causation?"
📘 arxiv.org/pdf/2509.17380

Hongbo (@hongbo00231523)'s Twitter Profile Photo

⛄️ Excited to share our EMNLP 2025 paper: Direct Value Optimization (DVO) 🌲

💡 Instead of pairwise DPO-style tuning, DVO learns directly from value signals in MCTS search data, enabling efficient RL training for reasoning LLMs ⚡️

📘 arxiv.org/pdf/2502.13723
