Zhenda Xie (@zdaxie)'s Twitter Profile
Zhenda Xie

@zdaxie

Researcher @ DeepSeek AI // Pre-training and Scaling of Foundation Models

ID: 1677270954370822147

Joined: 07-07-2023 10:58:59

11 Tweets

610 Followers

122 Following

AK (@_akhaliq):

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

paper page: huggingface.co/papers/2310.16…

We present DreamCraft3D, a hierarchical 3D content generation method that produces high-fidelity and coherent 3D objects. We tackle the problem by leveraging a 2D

DeepSeek (@deepseek_ai):

🚀 DeepSeek Coder 33B is NOW LIVE! Open-source & absolutely FREE! #DeepSeekCoder

💥 Try out here: coder.deepseek.com
🤗 Also on Huggingface: huggingface.co/deepseek-ai
💬 Got questions? Join our Discord fam! discord.gg/Tc7c45Zzu5
🤖 Github page: deepseekcoder.github.io

DeepSeek (@deepseek_ai):

📚Out Now: Technical Report on DeepSeek LLM 67B!  Read more: arxiv.org/abs/2401.02954

🔍Discover our in-depth study on scaling laws and how data quality influences them.

✨In the MT-Bench evaluation, DeepSeek surpassed GPT-3.5-turbo, ranking just behind GPT-4. 
#DeepSeekLLM
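
A note on what the scaling-law study covers: such studies fit a power law relating pretraining loss to model size N and token count D, then use it to split a fixed compute budget. A minimal sketch of that use, with illustrative Chinchilla-style constants rather than the coefficients DeepSeek actually fit:

```python
# Chinchilla-style scaling law: loss(N, D) = E + A/N^alpha + B/D^beta.
# All constants below are illustrative placeholders, NOT DeepSeek's values.
import numpy as np

def loss(N, D, E=1.7, A=400.0, B=410.0, alpha=0.34, beta=0.28):
    """Predicted pretraining loss for N parameters trained on D tokens."""
    return E + A / N**alpha + B / D**beta

# Compute-optimal allocation: under a FLOP budget C ~ 6*N*D, sweep model
# sizes and pick the (N, D) pair that minimizes the predicted loss.
C = 1e21                      # illustrative FLOP budget
Ns = np.logspace(8, 11, 400)  # candidate sizes: 100M .. 100B parameters
Ds = C / (6 * Ns)             # tokens each size could afford under the budget
print(f"compute-optimal size ~ {Ns[np.argmin(loss(Ns, Ds))]:.3g} parameters")
```

The report's data-quality angle corresponds to these fitted coefficients shifting with the corpus, which in turn moves the optimal model/data split.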

DeepSeek (@deepseek_ai):

🌟 Meet #DeepSeekMoE: The Next Gen of Large Language Models!

Performance Highlights:
📈 DeepSeekMoE 2B matches its 2B dense counterpart with 17.5% computation.
🚀 DeepSeekMoE 16B rivals LLaMA2 7B with 40% computation.
🛠 DeepSeekMoE 145B significantly outperforms Gshard,
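
What "17.5% computation" means mechanically: an MoE layer routes each token to only a few experts, so per-token FLOPs scale with the activated experts rather than the total parameter count. Below is a minimal top-k routing sketch; the dimensions and k are illustrative, and DeepSeekMoE's actual design adds fine-grained expert segmentation and shared experts, which this omits:

```python
# Minimal mixture-of-experts layer with top-k token routing (illustrative).
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    def __init__(self, d_model=512, d_ff=1024, n_experts=8, k=2):
        super().__init__()
        self.k = k
        self.gate = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(),
                          nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        ])

    def forward(self, x):  # x: (tokens, d_model)
        probs = F.softmax(self.gate(x), dim=-1)
        topv, topi = probs.topk(self.k, dim=-1)  # k experts per token
        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = topi[:, slot] == e        # tokens routed to expert e
                if mask.any():
                    out[mask] += topv[mask, slot:slot + 1] * expert(x[mask])
        return out

y = TopKMoE()(torch.randn(4, 512))  # only k of the 8 expert FFNs run per token
```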

DeepSeek (@deepseek_ai):

🚀 Just Out: Tech Report On #DeepSeekCoder - An Open-Source Model Competing with #GPT4's Coding Capabilities. Paper Link: arxiv.org/abs/2401.14196

⭐ Highlights:
- Repo-level Data Construction
- Topological Sort for Dependency Parsing
- Fill-In-Middle Pre-training Strategy
-
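
Of the listed highlights, Fill-In-Middle is the easiest to make concrete: the model is trained to generate a missing span given the code on both sides of it. A sketch of building one FIM sample in the prefix-suffix-middle (PSM) layout; the sentinel strings here are placeholders, not necessarily DeepSeek Coder's actual special tokens:

```python
# Fill-In-Middle (FIM) sample construction, PSM layout (sketch).
import random

FIM_BEGIN, FIM_HOLE, FIM_END = "<fim_begin>", "<fim_hole>", "<fim_end>"

def make_fim_sample(code: str, rng: random.Random) -> str:
    """Cut the document at two random points; the middle span becomes the
    training target, conditioned on the surrounding prefix and suffix."""
    i, j = sorted(rng.sample(range(len(code)), 2))
    prefix, middle, suffix = code[:i], code[i:j], code[j:]
    return f"{FIM_BEGIN}{prefix}{FIM_HOLE}{suffix}{FIM_END}{middle}"

print(make_fim_sample("def add(a, b):\n    return a + b\n", random.Random(0)))
```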

DeepSeek (@deepseek_ai):

[1/5] 🚀 Announcing DeepSeek-VL, sota 1.3B and 7B visual-language models!

Paper: arxiv.org/abs/2403.05525
GitHub: github.com/deepseek-ai/De…

📚 Diverse training corpus
👯 Hybrid Vision Encoder
🧠 3-stage training strategy
🆓 Totally free for commercial use and fully open-source
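
A rough sketch of what a hybrid vision encoder can look like: tokens from a low-resolution semantic branch are fused with tokens from a high-resolution detail branch, then projected into the LLM's embedding space. The stand-in modules and dimensions below are assumptions for illustration, not DeepSeek-VL's exact components:

```python
# Hybrid vision encoder sketch: semantic branch (low-res) + detail branch
# (high-res), fused per spatial position and projected for the LLM.
import torch
import torch.nn as nn

class HybridVisionEncoder(nn.Module):
    def __init__(self, d_sem=1024, d_det=256, d_llm=2048):
        super().__init__()
        # Patchify stand-ins for the two real backbones (e.g. a ViT for
        # semantics, a high-res conv encoder for fine detail).
        self.semantic = nn.Conv2d(3, d_sem, kernel_size=16, stride=16)  # 384 -> 24x24
        self.detail = nn.Conv2d(3, d_det, kernel_size=64, stride=64)    # 1536 -> 24x24
        self.proj = nn.Linear(d_sem + d_det, d_llm)  # adapter into the LLM

    def forward(self, low_res, high_res):
        s = self.semantic(low_res).flatten(2).transpose(1, 2)  # (B, 576, d_sem)
        d = self.detail(high_res).flatten(2).transpose(1, 2)   # (B, 576, d_det)
        return self.proj(torch.cat([s, d], dim=-1))            # (B, 576, d_llm)

tok = HybridVisionEncoder()(torch.randn(1, 3, 384, 384),
                            torch.randn(1, 3, 1536, 1536))
```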

DeepSeek (@deepseek_ai):

🚀 Launching DeepSeek-V2: The Cutting-Edge Open-Source MoE Model!

🌟 Highlights:
> Places top 3 in AlignBench, surpassing GPT-4 and close to GPT-4-Turbo.
> Ranks top-tier in MT-Bench, rivaling LLaMA3-70B and outperforming Mixtral 8x22B.
> Specializes in math, code and reasoning.

DeepSeek (@deepseek_ai):

🚀 DeepSeek-R1-Lite-Preview is now live: unleashing supercharged reasoning power!

🔍 o1-preview-level performance on AIME & MATH benchmarks.
💡 Transparent thought process in real-time.
🛠️ Open-source models & API coming soon!

🌐 Try it now at chat.deepseek.com
#DeepSeek
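
The API was still "coming soon" when this was posted; the DeepSeek API as later released is OpenAI-compatible. A sketch of a call that surfaces the transparent thought process alongside the answer; the model name and the reasoning_content field follow DeepSeek's later public docs, so treat them as assumptions here:

```python
# Querying a reasoning model through an OpenAI-compatible endpoint (sketch).
from openai import OpenAI

client = OpenAI(base_url="https://api.deepseek.com", api_key="YOUR_KEY")
resp = client.chat.completions.create(
    model="deepseek-reasoner",  # per DeepSeek's later docs; an assumption here
    messages=[{"role": "user", "content": "What is 17 * 24?"}],
)
msg = resp.choices[0].message
print(msg.reasoning_content)  # the model's visible chain of thought
print(msg.content)            # the final answer
```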

DeepSeek (@deepseek_ai):

🚀 Introducing DeepSeek-V3!

Biggest leap forward yet:
⚡ 60 tokens/second (3x faster than V2!)
💪 Enhanced capabilities
🛠 API compatibility intact
🌍 Fully open-source models & papers

🐋 1/n

DeepSeek (@deepseek_ai):

🚀 Day 6 of #OpenSourceWeek: One More Thing – DeepSeek-V3/R1 Inference System Overview

Optimized throughput and latency via:
🔧 Cross-node EP-powered batch scaling
🔄 Computation-communication overlap
⚖️ Load balancing

Statistics of DeepSeek's Online Service:
⚡ 73.7k/14.8k
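
Of these, computation-communication overlap is the most self-contained to illustrate: the all-to-all that shuttles tokens between expert-parallel ranks is issued on its own stream so it runs concurrently with expert computation. A single-GPU sketch of the pattern, with a host-to-device copy standing in for the cross-node all-to-all:

```python
# Computation-communication overlap via CUDA streams (single-GPU sketch).
import torch

assert torch.cuda.is_available()
comm = torch.cuda.Stream()
x = torch.randn(4096, 4096, device="cuda")
tokens = torch.randn(4096, 4096, pin_memory=True)  # pinned => async H2D copy

with torch.cuda.stream(comm):                      # "communication" stream
    tokens_gpu = tokens.to("cuda", non_blocking=True)
y = x @ x                                          # "computation", overlapped
torch.cuda.current_stream().wait_stream(comm)      # sync before consuming
z = y + tokens_gpu
```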