Guoqing Zheng (@zzzzgq) 's Twitter Profile
Guoqing Zheng

@zzzzgq

Principal Researcher at @MSFTResearch.

ID: 27818641

calendar_today31-03-2009 05:06:30

31 Tweet

72 Followers

103 Following

Graham Neubig (@gneubig) 's Twitter Profile Photo

Just released a new survey on prompting methods, which use language models to solve prediction tasks by providing them with a "prompt" like: "CMU is located in __" We worked really hard to make this well-organized and educational for both NLP experts and beginners, check it out!

Subhabrata Mukherjee (@subho_mpi) 's Twitter Profile Photo

Open to everyone! The first-ever 2021 Microsoft Research Summit, Oct 19 - 21, with over 150 sessions across 16 tracks, provides the global research community with an opportunity learn from experts pushing the frontiers of technology. Register now:…lnkd.in/g798kNvy

Guoqing Zheng (@zzzzgq) 's Twitter Profile Photo

Integrating knowledge to NLG typically requires training/fine-tuning PLMs on knowledge sources. With KID, we show it's viable to infuse knowledge on the fly in decoding phase without requiring modifications to the LMs. Check out our #iclr2022 work on Knowledge Infused Decoding.

Integrating knowledge to NLG typically requires training/fine-tuning PLMs on knowledge sources. With KID, we show it's viable to infuse knowledge on the fly in decoding phase without requiring modifications to the LMs. Check out our #iclr2022 work on Knowledge Infused Decoding.
Guoqing Zheng (@zzzzgq) 's Twitter Profile Photo

Cleanly labeled and weakly labeled data are both crucial to few-shot NLU. WALNUT features a unified setting with both few-shot and weakly supervised learning. We hope to bring more attention to semi-weakly supervised learning for NLU. #NAACL2022 #NLProc Microsoft Research

arindam mitra (@arindam1408) 's Twitter Profile Photo

With Orca, we're excited about the potential of redefining the reasoning capabilities of smaller LLMs. We're still at the beginning phases of this intriguing journey, but our preliminary explorations have yielded encouraging results. 1/7

arindam mitra (@arindam1408) 's Twitter Profile Photo

#Orca I'm thrilled to announce our latest work on Generative Teaching: generating vast amount of diverse high-quality synthetic data for language models to teach a specific skill (e.g. RC, text classification, tool use,math) without the extensive human effort typically required

Dimitris Papailiopoulos (@dimitrispapail) 's Twitter Profile Photo

What is reasoning? Do LLMs use it? Does it help? Is o1 really that better than sonnet? How do you even measure all that? MSR AI Frontiers is working to figure it all out, and we're looking for interns to work on evals to better understand LLMs. Please apply!! Link below:

Ahmed Awadallah (@ahmedhawadallah) 's Twitter Profile Photo

Introducing Phi-4-reasoning, adding reasoning models to the Phi family of SLMs. The model is trained with both supervised finetuning (using a carefully curated dataset of reasoning demonstration) and Reinforcement Learning. 📌Competitive results on reasoning benchmarks with

Introducing Phi-4-reasoning, adding reasoning models to the Phi family of SLMs.

The model is trained with both supervised finetuning (using a carefully curated dataset of reasoning demonstration) and Reinforcement Learning.

📌Competitive results on reasoning benchmarks with
Dimitris Papailiopoulos (@dimitrispapail) 's Twitter Profile Photo

We’ve been cooking... a new open weights 14B Phi-4 reasoning model, SFT’d on ~1.4M carefully curated reasoning demonstrations from o3-mini and RL’d for a tiny bit. This model is a little beast.

We’ve been cooking... a new open weights 14B Phi-4 reasoning model, SFT’d on ~1.4M carefully curated reasoning demonstrations from o3-mini and RL’d for a tiny bit. This model is a little beast.
Bart Czernicki (@bartczernicki) 's Twitter Profile Photo

New Phi-4 reasoning models have been released. Offer performance that is comparable to GPT-4o and o3-mini! azure.microsoft.com/en-us/blog/one… #AIReasoning #GenAI #AI #Phi4 #OpenAI #AzureOpenAI

Mojan Javaheripi (@mojan_jp) 's Twitter Profile Photo

Excited to release our first set of reasoning models Phi-4-reasoning and Phi-4-reasoning-plus, available today on HuggingFace and Azure AI foundry. Some interesting insights below and more deep dives in following days!

Athul Paul Jacob (@apjacob03) 's Twitter Profile Photo

Today marks an important milestone. I’m launching Percepta together with Hemant Taneja, Hirsh Jain, Thomas Mathew, Radha Jain, Michael Rochlin, Constantinos Daskalakis and an incredible team, with the goal of bringing AI to the core industries that run our economy. For AI to deliver

Guoqing Zheng (@zzzzgq) 's Twitter Profile Photo

Join us at NeurIPS for a happy hour on AI transformation, co-hosted by Percepta, GC and NVIDIA: RSVP at invite.generalcatalyst.com/general-cataly…, and solve some fun puzzles at percepta-neurips.web.app to skip the waiting list!