Tianyi Zhang (@tianyi_zh) 's Twitter Profile
Tianyi Zhang

@tianyi_zh

iterating ...

I used to train more language models but am working on agents now

ID: 913048778516639744

calendar_today27-09-2017 14:32:47

212 Tweet

1,1K Followers

647 Following

Yann Dubois (@yanndubs) 's Twitter Profile Photo

Major upgrade: AlpacaEval->AlpacaEval2.0 ✅ 7x faster ✅ 3x cheaper ✅ GPT-4 turbo baseline ✅ less length bias ✅ new models (Mixtral, Gemini,...) We've annotated 12 models and hope to add 80. Seeking community support for annotations (~$400 OpenAI credits). Can anyone help?

Major upgrade: AlpacaEval->AlpacaEval2.0

✅ 7x faster
✅ 3x cheaper
✅ GPT-4 turbo baseline
✅ less length bias
✅ new models (Mixtral, Gemini,...)

We've annotated 12 models and hope to add 80. Seeking community support for annotations (~$400 <a href="/OpenAI/">OpenAI</a> credits). Can anyone help?
AGI House SF (@agihousesf) 's Twitter Profile Photo

Calling all CUDA, C++, Rust, Assembly, & low-level hackers! Hardcore Hackathon on Sat will be an intimate & high talent density gathering of systems-level engineers & AI researchers. No LLM wrappers or web apps this time 😉 Apply if you fit the part 🫡 partiful.com/e/fwlokaFbQIAo…

Tianyi Zhang (@tianyi_zh) 's Twitter Profile Photo

I used to be very interested in this topic! Incremental/simultaneous language processing. Reminds me of classic work like aclanthology.org/N16-1111.pdf

prerat (@prerationalist) 's Twitter Profile Photo

guy who plays piano: what's your favorite programming language guy who works at microsoft: look between C and D on your keyboard

Zayd 🧠 (@zaydenam) 's Twitter Profile Photo

STARTING A NEW COMPANY IS A CREATIVE PROCESS. Every detail, even the name, is part of this journey. As with all creative endeavors, it’s crucial to be selective about whose feedback you consider. Accepting feedback from everyone will strip away the uniqueness critical for

STARTING A NEW COMPANY IS A CREATIVE PROCESS.

Every detail, even the name, is part of this journey.

As with all creative endeavors, it’s crucial to be selective about whose feedback you consider. Accepting feedback from everyone will strip away the uniqueness critical for
Tianyi Zhang (@tianyi_zh) 's Twitter Profile Photo

Tried the new Gemini Pro for coding. It's impressive, but every time I ask it to write a simple React to-do app, the output gets blocked by the filter. Any suggestions? What's going on here 🤯

Tried the new Gemini Pro for coding. It's impressive, but every time I ask it to write a simple React to-do app, the output gets blocked by the filter. Any suggestions?

What's going on here 🤯
Brett Kuprel (@kuprel) 's Twitter Profile Photo

Recent talk AGI House The Internet started with MIT, Stanford, and Berkeley. It has since expanded to broader audiences with instagram and TikTok. Chimps can already use instagram with an iPhone. Let's leverage real-time interactive generative video to get chimps,

PIN AI (@pinai_io) 's Twitter Profile Photo

We are excited to announce that PIN AI has raised $10M in pre-seed funding to launch the world's first Open Platform for Personal AI. PIN AI converges AI and Crypto in meaningful ways, offering a privacy-focused, on-device AI assistant that democratizes on-device intelligence

We are excited to announce that PIN AI has raised $10M in pre-seed funding to launch the world's first Open Platform for Personal AI.

PIN AI converges AI and Crypto in meaningful ways, offering a privacy-focused, on-device AI assistant that democratizes on-device intelligence
Bill Sun (@billsun_ai) 's Twitter Profile Photo

Really excited for our announcement, we are confident that our PIN AI team really have a shot to build a more open Ecosystem than Apple Intelligence for Personal AI! Also excited to join a16z crypto in NYC for a16Z crypto's CSX program to build the best Personal AI!

Christopher Manning (@chrmanning) 's Twitter Profile Photo

6 years into the LLM revolution, it’s still Day One in developing the many ways they can help the world. Here: cheap, accurate, automated but human-approved mapping and removal of racially restrictive covenants from all Santa Clara County property deeds. reglab.github.io/racialcovenant…

6 years into the LLM revolution, it’s still Day One in developing the many ways they can help the world.

Here: cheap, accurate, automated but human-approved mapping and removal of racially restrictive covenants from all Santa Clara County property deeds.

reglab.github.io/racialcovenant…
Tianyi Zhang (@tianyi_zh) 's Twitter Profile Photo

I believe we’re close to a future where agents automatically setup environments and learn recursively from every repo online. Check out my early experiments—if you’re working on similar ideas, would love to get in touch! tianyicode.hashnode.dev/scaling-rl-env…

lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

We thank the authors' for their feedback. However, there are a number of factual errors and misleading statements in this writeup: Regarding the statement that some model providers are not treated fairly: - This is not true. Given our capacity, we have always tried to honor all