four (@four) Twitter Tweets • TwiCopy

SpaceX

2 years ago

Starship and Super Heavy are ready at the launch pad in Starbase, Texas. Targeting Saturday, November 18 for Starship’s second integrated flight test → spacex.com/launches

thumb_up_off_alt26,26K

chat_bubble_outline1,1K

repeat4,4K

shareShare

🚨 [New Paper] If you're involved in AI safety or jailbreaking, you don't want to miss this: Techniques from human communication now effectively breach aligned LLMs (Llama-2 Chat, GPT-3.5, GPT-4) with over 92% attack success rate. 👇🧵(1/7 - page link: chats-lab.github.io/persuasive_jai…)

thumb_up_off_alt431

chat_bubble_outline9

repeat98

shareShare

Linus Ekenstam – eu/acc

@linusekenstam

2 years ago

Formula 1 x Apple Vision Pro This will change the sport forever Links below 👇

thumb_up_off_alt2,2K

chat_bubble_outline99

repeat321

shareShare

Josh Miller

@joshm

2 years ago

Our vision for Act II of Arc

thumb_up_off_alt4,4K

chat_bubble_outline256

repeat480

shareShare

Dino A. Dai Zovi

@dinodaizovi

2 years ago

The number one reason why good security is hard is that the feedback loop on decisions is long and the signal is low fidelity. It's not clear how many incidents were prevented or mitigated from which foundational decisions years prior. This wrecks the incentives to be proactive.

thumb_up_off_alt37

chat_bubble_outline5

repeat14

shareShare

LLM Security

@llm_sec

2 years ago

Self-replicating prompt injection w/ Ben Nassi wired.com/story/here-com…

thumb_up_off_alt39

chat_bubble_outline2

repeat10

shareShare

SpaceX

@spacex

2 years ago

Starship completed its rehearsal for launch, loading more than 10 million pounds of propellant on Starship and Super Heavy and taking the flight-like countdown to T-10 seconds

thumb_up_off_alt26,26K

chat_bubble_outline1,1K

repeat3,3K

shareShare

will depue (in singapore for ICLR)

@willdepue

2 years ago

announcing... starlinkmap dot org real-time map of every starlink satellite. tracks upcoming launches, other constellations, orbital updates, etc. finally launching this after a while! more details below.

thumb_up_off_alt2,2K

chat_bubble_outline155

repeat320

shareShare

lmarena.ai (formerly lmsys.org)

@lmarena_ai

a year ago

Congrats Google DeepMind on the new Gemma-2 27B & 9B release! Gemma-2 was tested in the Arena under the codename "*late-june-chatbots" and now out of stealth. Its early result matches the best open models (Llama-3-70B, Nemotron-340B) with only 27B parameters! Impressively,

Congrats <a href="/GoogleDeepMind/">Google DeepMind</a> on the new Gemma-2 27B & 9B release!

Gemma-2 was tested in the Arena under the codename "*late-june-chatbots" and now out of stealth. Its early result matches the best open models (Llama-3-70B, Nemotron-340B) with only 27B parameters!

Impressively,

thumb_up_off_alt528

chat_bubble_outline9

repeat99

shareShare

Demis Hassabis

@demishassabis

a year ago

Advanced mathematical reasoning is a critical capability for modern AI. Today we announce a major milestone in a longstanding grand challenge: our hybrid AI system attained the equivalent of a silver medal at this year’s International Math Olympiad!

thumb_up_off_alt3,3K

chat_bubble_outline166

repeat591

shareShare

lmarena.ai (formerly lmsys.org)

@lmarena_ai

a year ago

Exciting News from Chatbot Arena! Google DeepMind's new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes. For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive

Exciting News from Chatbot Arena!

<a href="/GoogleDeepMind/">Google DeepMind</a>'s new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes.

For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive

thumb_up_off_alt1,1K

chat_bubble_outline83

repeat410

shareShare

Logan Kilpatrick

@officiallogank

a year ago

Yeah, Gemini-exp-1114 is pretty good :)

thumb_up_off_alt1,1K

chat_bubble_outline97

repeat79

shareShare

Logan Kilpatrick

@officiallogank

a year ago

Gemini-exp-1206, our latest Gemini iteration, (with the full 2M token context and much more) is available right now for free in Google AI Studio and the Gemini API. I hope you have enjoyed year 1 of the Gemini era as much as I have. We are just getting started : )

thumb_up_off_alt3,3K

chat_bubble_outline231

repeat301

shareShare

Jeff Dean

@jeffdean

a year ago

What a way to celebrate one year of incredible Gemini progress -- #1🥇across the board on overall ranking, as well as on hard prompts, coding, math, instruction following, and more, including with style control on. Thanks to the hard work of everyone in the Gemini team and

thumb_up_off_alt1,1K

chat_bubble_outline90

repeat314

shareShare

Eli Collins

@elicollins

a year ago

Blown away by our new image and video models. Glad to see others are as well! The mix of creativity and realism is 🤯

thumb_up_off_alt186

chat_bubble_outline9

repeat8

shareShare

Google DeepMind

@googledeepmind

10 months ago

As we make progress towards AGI, developing AI needs to be both innovative and safe. ⚖️ To help ensure this, we’ve made updates to our Frontier Safety Framework - our set of protocols to help us stay ahead of possible severe risks. Find out more → goo.gle/42IuIVf

thumb_up_off_alt515

chat_bubble_outline101

repeat83

shareShare

Logan Kilpatrick

@officiallogank

8 months ago

Introducing Gemini 2.5 Pro, the world's most powerful model, with unified reasoning capabilities + all the things you love about Gemini (long context, tools, etc) Available as experimental and for free right now in Google AI Studio + API, with pricing coming very soon!

thumb_up_off_alt4,4K

chat_bubble_outline269

repeat456

shareShare

Logan Kilpatrick

@officiallogank

8 months ago

Deep Research in the Gemini App is now powered by Gemini 2.5 Pro, and our early tests show users prefer this 2:1 vs “other products” ;) gemini.google.com

thumb_up_off_alt2,2K

chat_bubble_outline207

repeat205

shareShare

Anca Dragan

@ancadianadragan

7 months ago

Per our Frontier Safety Framework, we continue to test our models for critical capabilities. Here’s the updated model card for Gemini 2.5Pro with frontier safety evaluations + explanation of how our safety buffer / alert thresholds approach applies to 2.0, 2.5, and what’s coming.

thumb_up_off_alt79

chat_bubble_outline1

repeat13

shareShare

Demis Hassabis

@demishassabis

6 months ago

cooking up something tasty for tomorrow...

thumb_up_off_alt5,5K

chat_bubble_outline419

repeat298

shareShare

four

SpaceX

Yi Zeng 曾祎

Linus Ekenstam – eu/acc

Josh Miller

Dino A. Dai Zovi

LLM Security

SpaceX

will depue (in singapore for ICLR)

lmarena.ai (formerly lmsys.org)

Demis Hassabis

lmarena.ai (formerly lmsys.org)

Logan Kilpatrick

Logan Kilpatrick

Jeff Dean

Eli Collins

Google DeepMind

Logan Kilpatrick

Logan Kilpatrick

Anca Dragan

Demis Hassabis