Josh Lipe-Melton (@joshlipemelton) 's Twitter Profile
Josh Lipe-Melton

@joshlipemelton

ID: 422825765

Joined: 27-11-2011 18:52:24

59 Tweets

79 Followers

275 Following

Madhav Singhal (@madhavsinghal_) 's Twitter Profile Photo

At Replit, we extensively use Spark on Databricks for all our training data work. We run highly customized transformations on code like parsing, deduping, PII redaction, code filtering, tokenization, and more, written with low-level Spark primitives. High quality data is key.
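
A minimal PySpark sketch of the kind of pipeline described above (illustrative only; the input path, column names such as `content` and `path`, and the specific dedup/redaction/filter rules are assumptions, not Replit's actual setup):

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("code-data-prep").getOrCreate()

# Hypothetical raw code corpus with `path` and `content` columns.
df = spark.read.parquet("s3://example-bucket/raw-code/")

cleaned = (
    df
    # Exact dedup on file contents.
    .dropDuplicates(["content"])
    # Crude PII redaction: mask email addresses.
    .withColumn("content", F.regexp_replace("content", r"[\w.+-]+@[\w-]+\.[\w.-]+", "<EMAIL>"))
    # Simple code filtering: keep reasonably sized Python files only.
    .filter(F.col("path").endswith(".py") & F.length("content").between(64, 100000))
)

cleaned.write.mode("overwrite").parquet("s3://example-bucket/clean-code/")
```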

Josh Lipe-Melton (@joshlipemelton) 's Twitter Profile Photo

POC with big proprietary models, deploy to production with smaller specialized models. Replit is a great example of a smaller, specialized model offering improvements over larger proprietary ones at a better price.

Abhi Venigalla (@ml_hardware) 's Twitter Profile Photo

Back in June we showed that our LLM Foundry training stack runs seamlessly on AMD MI250 GPUs. Today, I'm happy to share that we've scaled up to 128x MI250, with great multi-node performance!

Brian Armstrong (@brian_armstrong) 's Twitter Profile Photo

AI should be decentralized as much as possible. Open source is a great step toward this. Crypto should be decentralized. Self-custodial wallets and protocols help this. Permissionless access creates innovation instead of gatekeepers. No single entity to capture.

Naveen Rao (@naveengrao) 's Twitter Profile Photo

Meet our new AI, #DBRX. DBRX is an advance in what language models can do per $. These economics will have profound impacts on how AI is used, and we've built this to democratize these capabilities! It's the best open model in the world. It closes the gap to closed models in a

Garry Tan (@garrytan) 's Twitter Profile Photo

Perplexity is actually just better than Google for clear, well-cited answers. The quality of results and speed to answer are significantly better.

Ali Ghodsi (@alighodsi) 's Twitter Profile Photo

Databricks to acquire Tabular, a data platform from the original creators of Apache Iceberg. Together, we will bring format compatibility to the lakehouse for Delta Lake and Apache Iceberg. databricks.com/blog/databrick…

Databricks Mosaic Research (@dbrxmosaicai) 's Twitter Profile Photo

Lynx is a new hallucination detection model for #LLMs that is especially suited for real-world applications in industries like healthcare and fintech. PatronusAI trained Lynx on Databricks Mosaic AI using Composer, our open source PyTorch-based training library.
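
For context, a minimal Composer training loop looks roughly like the sketch below (illustrative only; this is not PatronusAI's Lynx training code, and the model, data, and hyperparameters are placeholders):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset
from composer import Trainer
from composer.models import ComposerClassifier

# Tiny stand-in model and synthetic dataset.
module = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(28 * 28, 10))
model = ComposerClassifier(module, num_classes=10)

dataset = TensorDataset(torch.randn(256, 1, 28, 28), torch.randint(0, 10, (256,)))
train_dataloader = DataLoader(dataset, batch_size=32)

trainer = Trainer(
    model=model,
    train_dataloader=train_dataloader,
    max_duration="1ep",  # train for one epoch
    device="cpu",
)
trainer.fit()
```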

Matei Zaharia (@matei_zaharia) 's Twitter Profile Photo

Great post on how AI can be engineered "brick-by-brick" to get the best results, including through the design of compound AI systems. The teams that figure out how to get the right data pipelines and system design are the ones getting transformational results.

Josh Lipe-Melton (@joshlipemelton) 's Twitter Profile Photo

Per insider sources: Inter Miami will receive 37 minutes of extra time if they aren’t winning at the end of regulation tonight

Josh Lipe-Melton (@joshlipemelton) 's Twitter Profile Photo

o3 or its checkpoints have been available to OpenAI developers and PMs internally for a while now. With virtually unlimited PhD-level AI developers and capital, why haven't they shipped more world-class software? Their other product releases were less exciting than Google's, no?

Josh Lipe-Melton (@joshlipemelton) 's Twitter Profile Photo

OpenAI's high percentage of revenue from consumer apps could push them to commoditize their models via open source. Ironically, this would turn ChatGPT into a "thin model wrapper."