Quyet V. Do (@quyet_azir) 's Twitter Profile
Quyet V. Do

@quyet_azir

PhD@VirginiaTech, supervised by @tuvllms.
Incoming Research Intern at @AdobeResearch.
RIs: Instruction Tuning, AI & Math

ID: 1530918820059426817

linkhttps://dovanquyet.github.io/ calendar_today29-05-2022 14:28:08

204 Tweet

120 Takipçi

271 Takip Edilen

Yann LeCun (@ylecun) 's Twitter Profile Photo

Excellent blog post from Turing Post on JEPA (Joint Embedding Predictive Architecture), my favorite meta-architecture for Self-Supervised Learning of continuous data, such as images, video, and audio. The post includes a list of relevant papers from my collaborators and me, as

homanp (@pelaseyed) 's Twitter Profile Photo

Traditional RAG sucks because it promises "relevant chunks" but in fact returns "similar chunks". Relevancy requires reasoning. Introducing ReAG - Reasoning Augmented Generation

Traditional RAG sucks because it promises "relevant chunks" but in fact returns "similar chunks". 

Relevancy requires reasoning.

Introducing ReAG - Reasoning Augmented Generation
Jaechul Roh (@jaechulroh) 's Twitter Profile Photo

🧠💸 "We made reasoning models overthink — and it's costing them big time." Meet 🤯 #OVERTHINK 🤯 — our new attack that forces reasoning LLMs to "overthink," slowing models like OpenAI's o1, o3-mini & DeepSeek-R1 by up to 46× by amplifying number of reasoning tokens.

🧠💸 "We made reasoning models overthink — and it's costing them big time."

Meet 🤯 #OVERTHINK 🤯 — our new attack that forces reasoning LLMs to "overthink," slowing models like OpenAI's o1, o3-mini & DeepSeek-R1 by up to 46× by amplifying number of reasoning tokens.
Andrej Karpathy (@karpathy) 's Twitter Profile Photo

Agency > Intelligence I had this intuitively wrong for decades, I think due to a pervasive cultural veneration of intelligence, various entertainment/media, obsession with IQ etc. Agency is significantly more powerful and significantly more scarce. Are you hiring for agency? Are

Jonas Pfeiffer (@pfeiffjo) 's Twitter Profile Photo

I am hiring a Student Researcher for our Modularity team at the Google DeepMind office in Zurich🇨🇭 Please fill out the interest form if you would like to work with us! The role would start mid/end 2025 and would be in-person in Zurich with 80-100% at GDM forms.gle/N94ViTmKHCCAcv…

Swaroop Mishra (@swarooprm7) 's Twitter Profile Photo

SWE tip: The importance of software design is higher than ever, given how well AI can code. Highly recommend checking out the following guide:

Dreaming Tulpa 🥓👑 (@dreamingtulpa) 's Twitter Profile Photo

adobe is cooking a new inpainting method that understands context extremely well! ELI5: it can insert stuff with the correct perspective and add/remove connected reflections/shadows 👀

Unsloth AI (@unslothai) 's Twitter Profile Photo

We partnered with @HuggingFace to teach you how to fine-tune LLMs with GRPO! Learn about: • Reward functions + creating them • GRPO Math + Free Reasoning training in Colab • Applying RL to real-world use cases Course: huggingface.co/reasoning-cour… Tutorial: docs.unsloth.ai/basics/reasoni…

We partnered with @HuggingFace to teach you how to fine-tune LLMs with GRPO!

Learn about:
• Reward functions + creating them
• GRPO Math + Free Reasoning training in Colab
• Applying RL to real-world use cases

Course: huggingface.co/reasoning-cour…
Tutorial: docs.unsloth.ai/basics/reasoni…
Tu Vu (@tuvllms) 's Twitter Profile Photo

📢 Research internship Google📢 I am looking for a PhD student researcher to work with me and my colleagues on advanced reasoning and/or RAG factuality this summer Google Mountain View, CA. We will focus on open-source models and benchmarks, and aim to publish our findings.

Andrew Ng (@andrewyng) 's Twitter Profile Photo

Contrary to standard prompting advice that you should give LLMs the context they need to succeed, I find it’s sometimes faster to be lazy and dash off a quick, imprecise prompt and see what happens. The key to whether this is a good idea is whether you can quickly assess the

Tu Vu (@tuvllms) 's Twitter Profile Photo

✨ New paper ✨ 🚨 Scaling test-time compute can lead to inverse or flattened scaling!! We introduce SealQA, a new challenge benchmark w/ questions that trigger conflicting, ambiguous, or unhelpful web search results. Key takeaways: ➡️ Frontier LLMs struggle on Seal-0 (SealQA’s

✨ New paper ✨
🚨 Scaling test-time compute can lead to inverse or flattened scaling!!

We introduce SealQA, a new challenge benchmark w/ questions that trigger conflicting, ambiguous, or unhelpful web search results. Key takeaways:

➡️ Frontier LLMs struggle on Seal-0 (SealQA’s
Shizhe Diao (@shizhediao) 's Twitter Profile Photo

🚨 NVIDIA is launching the Data Filtering Challenge for training edge language models! We believe edge LMs are the future — lightweight, powerful, and ready for real-world tasks like: 🧠 Reasoning 🗣️ Roleplay 🔍 RAG 🔧 Function calling Time to push dataset filtering to the

🚨 NVIDIA is launching the Data Filtering Challenge for training edge language models! 

We believe edge LMs are the future — lightweight, powerful, and ready for real-world tasks like:
🧠 Reasoning
🗣️ Roleplay
🔍 RAG
🔧 Function calling

Time to push dataset filtering to the
Andrew Ng (@andrewyng) 's Twitter Profile Photo

The invention of modern writing instruments like the typewriter made writing easier, but they also led to the rise of writer’s block, where deciding what to write became the bottleneck. Similarly, the invention of agentic coding assistants has led to a new builder’s block, where

Tianqing Fang @ ACL24 (@tfang229) 's Twitter Profile Photo

🚀 We are thrilled to release a new open-source Deep Research Agent, Cognitive Kernel-Pro, from Tencent AI Lab! We focus on building a fully open-source agent with (to the maximum extent) free tools, showcasing impressive performance on GAIA with Claude-3.7-sonnet and surpass the

🚀 We are thrilled to release a new open-source Deep Research Agent, Cognitive Kernel-Pro, from Tencent AI Lab! We focus on building a fully open-source agent with (to the maximum extent) free tools, showcasing impressive performance on GAIA with Claude-3.7-sonnet and surpass the
Sundar Pichai (@sundarpichai) 's Twitter Profile Photo

Excited to make our best AI tools free for college students in the US + other select countries for a year - and to provide $1B in funding for education + research, including free AI and career training for every college student in America.

Excited to make our best AI tools free for college students in the US + other select countries for a year - and to provide $1B in funding for education + research, including free AI and career training for every college student in America.