Nanbeige (@nanbeige)'s Twitter Profile
Nanbeige

@nanbeige

Nanbeige LLM Lab

ID: 1740367494102351872

Link: https://huggingface.co/Nanbeige · Joined: 28-12-2023 13:42:07

45 Tweets

1.1K Followers

202 Following

Nanbeige (@nanbeige)'s Twitter Profile Photo

Thanks for your attention! We updated Nanbeige-16B-Chat (huggingface.co/Nanbeige/Nanbe…) last month; give it a try!

Nanbeige (@nanbeige)'s Twitter Profile Photo

Nanbeige2-8B-Chat (huggingface.co/Nanbeige/Nanbe…) is on the AlpacaEval Leaderboard (tatsu-lab.github.io/alpaca_eval/).

Nanbeige (@nanbeige)'s Twitter Profile Photo

Nanbeige2-8B-Chat (huggingface.co/Nanbeige/Nanbe…) achieved the highest score among models under 10B parameters in FlagEval (flageval.baai.ac.cn/#/trending).

Nanbeige (@nanbeige)'s Twitter Profile Photo

Nanbeige Plus Chat (huggingface.co/spaces/Nanbeig…) achieved a high score on AlpacaEval (tatsu-lab.github.io/alpaca_eval/).

Nanbeige (@nanbeige)'s Twitter Profile Photo

We published our model Nanbeige2-16B-Chat (huggingface.co/Nanbeige/Nanbe…) with an MT-Bench score of 8.6, an AlpacaEval 2.0 LC win rate of 43%, and an AlignBench score of 7.62. A new open-source model with a 1-million-token context window is on the way. Enjoy :-)

Nanbeige (@nanbeige)'s Twitter Profile Photo

Nanbeige2-16B-Chat (huggingface.co/Nanbeige/Nanbe…) is on FlagEval's open-source model leaderboard (flageval.baai.ac.cn/#/leaderboard).

Nanbeige (@nanbeige)'s Twitter Profile Photo

Nanbeige2-16B-Chat (huggingface.co/Nanbeige/Nanbe…) is on the OpenCompass 24-05 open-source model leaderboard (rank.opencompass.org.cn/home).

Nanbeige (@nanbeige)'s Twitter Profile Photo

Nanbeige2-16B-Chat (huggingface.co/Nanbeige/Nanbe…) achieved a high score on the May OpenCompass Leaderboard (Subject Part) compared with other open-source models.

Nanbeige (@nanbeige)'s Twitter Profile Photo

We published our new model Nanbeige4-3B-Thinking-2511 (huggingface.co/Nanbeige/Nanbe…), which achieved state-of-the-art (SOTA) results among models smaller than 32B parameters on Arena-Hard-V2 and BFCL-V4.

Tiezhen WANG (@xianbao_qian)'s Twitter Profile Photo

China’s depth of STEM talent is the ultimate refutation of the "concentration of power."

After Xiaomi, RedNote, Meituan (Chinese DoorDash) and many others, now BOSS Zhipin (a ~$10B mkt cap recruiting app) has also joined the game and open-sourced a small yet powerful model.
Privacy AI - offline models & remote AI client (@best_privacy_ai)'s Twitter Profile Photo

Adina Yakup Nanbeige Tested Nanbeige4-3B-Thinking (Q3_K_S) locally in Privacy AI with on-device tool calling (search_web). Performance on iOS is excellent. At 3B, it’s lightweight enough to serve as a practical daily offline assistant, yet it still handles reasoning and tool use reliably. Congrats to…

ModelScope (@maasai42)'s Twitter Profile Photo

🤖Meet Nanbeige4-3B from Boss Zhipin—a 3B-parameter LLM that outperforms Qwen3-32B on math (AIME), science (GPQA), and tool calling (BFCL-V4), while matching Qwen3-30B-A3B on human preference alignment (Arena-Hard-V2).

How?
✅ 23T tokens of ultra-curated data
✅ Fine-grained WSD
Nanbeige (@nanbeige)'s Twitter Profile Photo

In the Berkeley Function Calling Leaderboard (gorilla.cs.berkeley.edu/leaderboard.ht…), Nanbeige4-3B-Thinking-2511 (huggingface.co/Nanbeige/Nanbe…) ranks 25th overall, placing among the top 10 open-source models and outperforming Qwen3-32B, despite being only a 3B model.
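
For context on what function-calling benchmarks like BFCL actually measure: the model receives JSON tool schemas and must emit well-formed, schema-conformant calls that a harness can parse and validate. A minimal sketch of that round trip in Python; the search_web schema below is illustrative (echoing the tool mentioned in the Privacy AI post), not taken from BFCL or Nanbeige's evaluation:

```python
import json

# An OpenAI-style tool schema, the general shape used by
# function-calling benchmarks. Illustrative, not from BFCL itself.
tool_schema = {
    "type": "function",
    "function": {
        "name": "search_web",
        "description": "Search the web for a query string.",
        "parameters": {
            "type": "object",
            "properties": {"query": {"type": "string"}},
            "required": ["query"],
        },
    },
}

# The model emits a structured call as JSON text...
raw_call = '{"name": "search_web", "arguments": {"query": "Nanbeige4 benchmarks"}}'
call = json.loads(raw_call)

# ...which the harness validates against the schema before executing the tool.
assert call["name"] == tool_schema["function"]["name"]
required = tool_schema["function"]["parameters"]["required"]
missing = [k for k in required if k not in call["arguments"]]
assert not missing, f"missing required arguments: {missing}"
print("valid call:", call["name"], call["arguments"])
```

Scoring on such leaderboards comes down to how often the emitted call parses, names a real tool, and supplies all required arguments with the right types.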

N8 Programs (@n8programs)'s Twitter Profile Photo

Intriguing new model called 'Nanbeige/Nanbeige4.1-3B' released; it appears to be *extremely* SOTA for its size range, so much so that I question whether it's benchmaxxed. But Nanbeige appears to be a small but real lab out of China, so I have faith! Quite exciting; will test.

Privacy AI - offline models & remote AI client (@best_privacy_ai)'s Twitter Profile Photo

Nanbeige Congrats! Now everyone with an iOS device can try the Nanbeige4.1-3B model immediately on their phone. This model excels at tool calling and tends to output many thinking tokens, which requires a large context window. I set a 12K context on my iPhone 16 Pro Max with 8K max output,…
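
For readers who want to reproduce a similar setup on a desktop with llama.cpp rather than the Privacy AI app, a sketch of an equivalent invocation; the GGUF filename is hypothetical, and -c / -n map to the 12K context and 8K max-output settings described above:

```shell
# Hypothetical local run of a Q3_K_S quant with llama.cpp's llama-cli.
# -c 12288: 12K-token context window; -n 8192: cap generation at 8K tokens.
llama-cli -m nanbeige4.1-3b-q3_k_s.gguf -c 12288 -n 8192 \
  -p "Explain what a context window is."
```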

Nanbeige (@nanbeige)'s Twitter Profile Photo

N8 Programs Thank you again for your interest! We hope the model will attract wider attention and be tested by the community to evaluate its performance. The technical report will be released tomorrow—stay tuned! 🌟