Yann Dupis (@yanndupis) 's Twitter Profile
Yann Dupis

@yanndupis

Machine Learning Engineer

ID: 1341057420

Link: https://github.com/yanndupis · Joined: 10-04-2013 05:23:11

256 Tweets

320 Followers

1.1K Following

Cape (@capeprivacy) 's Twitter Profile Photo

Cape is now part of the MPC Alliance. We’re thrilled to join these industry pioneers on the mission to accelerate awareness and adoption of secure multiparty computation (MPC) technology. Together we will improve #dataprivacy and #security efforts. #secureMPC #encryptedlearning

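The core trick behind MPC can be illustrated with additive secret sharing: each party holds a random-looking share of a value, and parties can compute on shares locally without any single party seeing the underlying data. A minimal sketch (illustrative only; `PRIME` and the three-party setup are assumptions, and real MPC protocols are far more involved):

```python
import random

PRIME = 2**61 - 1  # field modulus; an illustrative choice

def share(secret, n=3):
    """Split a secret into n additive shares mod PRIME."""
    shares = [random.randrange(PRIME) for _ in range(n - 1)]
    shares.append((secret - sum(shares)) % PRIME)
    return shares

def reconstruct(shares):
    """Recombine shares; any subset smaller than n reveals nothing."""
    return sum(shares) % PRIME

# Each party holds one share of each input.
a_shares = share(42)
b_shares = share(100)

# Parties add their local shares to obtain shares of the sum,
# without ever seeing 42 or 100 in the clear.
sum_shares = [(x + y) % PRIME for x, y in zip(a_shares, b_shares)]
assert reconstruct(sum_shares) == 142
```

Addition is "free" in this scheme because it happens share-by-share; multiplication is where real MPC protocols spend their communication rounds.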
Dragoș Rotaru (@dragosrotaru) 's Twitter Profile Photo

We are hiring cryptographers at @capeprivacy! Come join us and be part of an awesome and fully remote team capeinc.bamboohr.com/jobs/view.php?…

Cape (@capeprivacy) 's Twitter Profile Photo

Another big step in our journey! 🚀 Our new self-service platform for running #AI predictions in SnowflakeDB is live. #Financialservices organizations can now use #encrypted data for powerful predictive modeling safely in the #cloud. Read more here: globenewswire.com/news-release/2…

Gavin Uhma (@gavinuhma) 's Twitter Profile Photo

::Introducing the Cape API:: Keep sensitive data private while prompting LLMs like GPT-4 and GPT-3.5 Turbo. Easily de-identify sensitive data like financial, legal, and internal docs before sending to OpenAI. Try the playground free: chat.capeprivacy.com How? /🧵
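The general pattern here, de-identifying text before it leaves your environment, can be sketched with simple placeholder substitution. This is not Cape's implementation; the patterns and labels below are illustrative assumptions, and a production system would use NER models rather than regexes:

```python
import re

# Illustrative patterns only; real de-identification needs far broader coverage.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "PHONE": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
}

def deidentify(text):
    """Replace sensitive matches with typed placeholders before prompting an LLM."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text

prompt = "Email jane@acme.com re: SSN 123-45-6789"
print(deidentify(prompt))  # Email [EMAIL] re: SSN [SSN]
```

The placeholders can be mapped back to the originals locally after the LLM responds, so the provider only ever sees redacted text.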

Yann Dupis (@yanndupis) 's Twitter Profile Photo

Making progress on privacy! “California lawmakers pass Delete Act that would force data brokers to eliminate all personal info they possess if people request it” fortune.com/2023/09/15/cal…

Thomas Wolf (@thom_wolf) 's Twitter Profile Photo

Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes? Here comes the first release of 🍷Fineweb. A high-quality, large-scale filtered web dataset outperforming all current datasets of its scale. We trained 200+ ablation

Shreya Shankar (@sh_reya) 's Twitter Profile Photo

i'm having a super fun time collaborating with Eugene Yan, Bryan Bischof fka Dr. Donut, Charles 🎉 Frye, Hamel Husain, & jason liu on a 3-part series on working with LLMs. i learned so much from them that i really think it's the best resource on applied LLMs. here's part 1: oreilly.com/radar/what-we-…

Thomas Wolf (@thom_wolf) 's Twitter Profile Photo

Among the most impressive aspects of the Llama 3.1 release is the accompanying research paper! Close to 100 pages of deep knowledge-sharing on LLMs like we haven't seen very often recently What a treat! It covers everything, pretraining data, filtering, annealing, synthetic

Thomas Wolf (@thom_wolf) 's Twitter Profile Photo

It’s Sunday morning, we have some time with the coffee, so let me tell you about our recent surprising journey in synthetic data and small language models. This post is prompted by the coming release of an instant, in-browser model called SmolLM360 (link at the end) The

Jeremy Howard (@jeremyphoward) 's Twitter Profile Photo

I'll get straight to the point. We trained 2 new models. Like BERT, but modern. ModernBERT. Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc. Real practical stuff. It's much faster, more accurate, longer context, and more useful. 🧵

Kyle Corbitt (@corbtt) 's Twitter Profile Photo

A few weeks ago, OpenAI announced Reinforcement Fine-Tuning (RFT)—a new way to adapt LLMs to complex tasks with very little training data. Here’s a quick rundown of how it works, why it’s a big deal, and when you should use it. 🧵

Philipp Schmid (@_philschmid) 's Twitter Profile Photo

The RLHF method behind the best open models! Both DeepSeek and Qwen use GRPO in post-training! Group Relative Policy Optimization. GRPO was introduced in the DeepSeekMath Paper last year to improve mathematical reasoning capabilities with less memory consumption,

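GRPO's central idea, as described in the DeepSeekMath paper, is to drop PPO's learned value model and instead score each sampled completion against the mean and standard deviation of its own group of samples, which is where the memory savings come from. A minimal sketch of that advantage computation (the reward values are made up for illustration):

```python
def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each completion's reward against its group's statistics.

    GRPO samples several completions per prompt and uses these
    group-relative scores as advantages, so no value network is needed.
    """
    mean = sum(rewards) / len(rewards)
    var = sum((r - mean) ** 2 for r in rewards) / len(rewards)
    std = var ** 0.5
    return [(r - mean) / (std + eps) for r in rewards]

# Four completions sampled for one prompt, scored by a reward model:
advs = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
# Completions above the group mean get positive advantages,
# those below get negative ones.
```

These advantages then weight the usual clipped policy-gradient update; the full algorithm adds a KL penalty against a reference model.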
Thomas Wolf (@thom_wolf) 's Twitter Profile Photo

After 6+ months in the making and burning over a year of GPU compute time, we're super excited to finally release the "Ultra-Scale Playbook" Check it out here: hf.co/spaces/nanotro… A free, open-source book to learn everything about 5D parallelism, ZeRO, fast CUDA kernels,
