piushvaish (@piushvaish) 's Twitter Profile
piushvaish

@piushvaish

Curiosity-Driven Creative Adventurer
Using data and technology to create new opportunities and develop innovative solutions.
#DataScience #MachineLearning #AI

ID: 90139401

Link: https://adataanalyst.com · Joined: 15-11-2009 10:40:53

2.2K Tweets

119 Followers

558 Following

Value Theory (@valueinvestorac) 's Twitter Profile Photo

⚠️ Giveaway

I Built a "Warren Buffett Investing Course" 

Step-by-step video course: 

- How to find high quality stocks
- Value them
- Determine an attractive price
- and much more

Valued at $159

FREE to 20 people, just:

1️⃣ Follow me
2️⃣ RT & Like
3️⃣ Reply 'Buffett' below
Cameron R. Wolfe, Ph.D. (@cwolferesearch) 's Twitter Profile Photo

Open-source LLMs are now commonly used and widely studied, but this area of research saw some initial struggles and criticism due to the poor performance of models like OPT and BLOOM. These four recently-proposed open-source models changed this narrative…

LLaMA. Interest in
Chip Huyen (@chipro) 's Twitter Profile Photo

Open challenges in LLM research

The first two challenges, hallucinations and context learning, are probably the most talked about today.

I’m the most excited about 3 (multimodality), 5 (new architecture), and 6 (GPU alternatives).

Number 5 and number 6, new architectures and
Nando de Freitas (@nandodf) 's Twitter Profile Photo

There appears to be a mismatch between publishing criteria in AI conferences and "what actually works". It is easy to publish new mathematical constructs (e.g. new models, new layers, new modules, new losses), but as Apple's MM1 paper concludes:

1. Encoder Lesson: Image
Matt Shumer (@mattshumer_) 's Twitter Profile Photo

Introducing `claude-journalist` ✍️ The first Claude 3 journalist agent. Just provide a topic, and it will: - Search the web for articles/real-time details - Choose the best sources and read through them - Write a fantastic, *factual* article + edit it And it's open-source!

Jerry Liu (@jerryjliu0) 's Twitter Profile Photo

A nice analogy of RAG vs. finetuning is to compare it to an open vs. closed-book exam

📖RAG == open-book exam without studying. Only use information provided in the page, but hard to discern which information is relevant.
📘Finetuning == closed-book exam. Only use memorized
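
A minimal sketch of the "open-book" side of this analogy: retrieve the most relevant passages at query time and place them in the prompt. Retrieval here is plain TF-IDF for illustration (real systems typically use dense embeddings and a vector store), and the final LLM call is left as a comment.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

documents = [
    "RAG pipelines retrieve passages at query time and add them to the prompt.",
    "Finetuning bakes knowledge into the model weights before inference.",
    "Vector stores index document embeddings for fast similarity search.",
]

def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    """Return the k docs most similar to the query (toy TF-IDF retrieval)."""
    matrix = TfidfVectorizer().fit_transform(docs + [query])
    scores = cosine_similarity(matrix[len(docs)], matrix[:len(docs)]).ravel()
    return [docs[i] for i in scores.argsort()[::-1][:k]]

query = "How does RAG differ from finetuning?"
context = "\n".join(retrieve(query, documents))
prompt = f"Answer using only the context below.\n\nContext:\n{context}\n\nQuestion: {query}"
# answer = call_llm(prompt)  # hypothetical call to whichever model you use
print(prompt)
```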
Databricks (@databricks) 's Twitter Profile Photo

Meet #DBRX: a general-purpose LLM that sets a new standard for efficient open source models. Use the DBRX model in your RAG apps or use the DBRX design to build your own custom LLMs and improve the quality of your GenAI applications. dbricks.co/43xaCMj

AI21 Labs (@ai21labs) 's Twitter Profile Photo

Introducing Jamba, our groundbreaking SSM-Transformer open model!

As the first production-grade model based on Mamba architecture, Jamba achieves an unprecedented 3X throughput and fits 140K context on a single GPU.

🥂Meet Jamba ai21.com/jamba

🔨Build on Hugging Face
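
A minimal sketch of the "build on Hugging Face" route, assuming the announced checkpoint is published under the id ai21labs/Jamba-v0.1; exact arguments and hardware requirements should be checked against the official model card, since the full model is large.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ai21labs/Jamba-v0.1"  # assumed Hub id from the announcement
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # half precision helps fit long contexts on one GPU
    device_map="auto",           # requires the accelerate package
)

inputs = tokenizer(
    "Mamba-style state-space layers differ from attention in that",
    return_tensors="pt",
).to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```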
Carlos E. Perez (@intuitmachine) 's Twitter Profile Photo

The Future Of Foundation Models? Directed Evolution Over Brute Force

Imagine having the power to mix and combine the knowledge and capabilities of different AI models like mixing colors - taking the language understanding of one model, blending it with the math reasoning prowess
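
The directed-evolution idea searches over many merge recipes and keeps the offspring that score best; the toy sketch below shows only the simplest ingredient of such a recipe, a fixed linear blend of two models with identical architectures, not the method from the referenced work.

```python
import torch.nn as nn

def merge_state_dicts(sd_a, sd_b, alpha: float = 0.5):
    """Blend two compatible state dicts: alpha * A + (1 - alpha) * B, key by key."""
    return {k: alpha * sd_a[k] + (1 - alpha) * sd_b[k] for k in sd_a}

# Tiny stand-ins for, say, a language-skilled parent and a math-skilled parent.
model_a, model_b = nn.Linear(8, 8), nn.Linear(8, 8)

child = nn.Linear(8, 8)
child.load_state_dict(
    merge_state_dicts(model_a.state_dict(), model_b.state_dict(), alpha=0.7)
)
```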
Andrew Ng (@andrewyng) 's Twitter Profile Photo

Last week, I described four design patterns for AI agentic workflows that I believe will drive significant progress this year: Reflection, Tool use, Planning and Multi-agent collaboration. Instead of having an LLM generate its final output directly, an agentic workflow prompts
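
A minimal sketch of the first pattern, Reflection: rather than returning its first draft, the model critiques and then revises its own output. call_llm is a hypothetical single-completion helper to be wired to any provider.

```python
def call_llm(prompt: str) -> str:
    raise NotImplementedError("wire this to your LLM provider of choice")

def reflect_and_revise(task: str, rounds: int = 2) -> str:
    """Draft, critique, and revise: a bare-bones Reflection loop."""
    draft = call_llm(f"Complete the following task:\n{task}")
    for _ in range(rounds):
        critique = call_llm(
            f"Task:\n{task}\n\nDraft:\n{draft}\n\n"
            "List concrete problems with this draft (accuracy, clarity, omissions)."
        )
        draft = call_llm(
            f"Task:\n{task}\n\nDraft:\n{draft}\n\nCritique:\n{critique}\n\n"
            "Rewrite the draft, fixing every issue raised in the critique."
        )
    return draft
```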

Eugene Yan (@eugeneyan) 's Twitter Profile Photo

I've been trying several evals to find those that correlate well with use cases and discriminative enough for prod. Here's an opinionated take on what works, focusing on classification, summarization, translation, copyright regurgitation, and toxicity. eugeneyan.com/writing/evals/
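
As a flavour of what a discriminative classification eval can look like (not the post's own code): run the model over a small labelled set and report per-class precision and recall so different prompts or models can be compared. The classify stub below is a keyword placeholder standing in for a real LLM call.

```python
from sklearn.metrics import classification_report

def classify(text: str) -> str:
    # Placeholder: swap in an LLM call that maps text to "positive" / "negative".
    return "negative" if "broken" in text.lower() else "positive"

labeled_examples = [
    ("The package arrived broken and support ignored me.", "negative"),
    ("Setup took two minutes and it just works.", "positive"),
]

y_true = [label for _, label in labeled_examples]
y_pred = [classify(text) for text, _ in labeled_examples]
print(classification_report(y_true, y_pred, zero_division=0))
```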

Cameron R. Wolfe, Ph.D. (@cwolferesearch) 's Twitter Profile Photo

Prompt engineering is one of the most rapidly-evolving research topics in AI, but we can (roughly) group recent research on this topic into four categories…

(1) Reasoning: Simple prompting techniques are effective for many problems, but more sophisticated strategies are
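
A small illustration of the reasoning category: a zero-shot chain-of-thought prompt that asks the model to work step by step before committing to an answer. The wording is a common pattern, not a quote from the thread's references.

```python
question = "A train leaves at 9:40 and the trip takes 2 h 35 min. When does it arrive?"

cot_prompt = (
    f"{question}\n\n"
    "Let's think step by step, then give the final answer on its own line, "
    "prefixed with 'Answer:'."
)
print(cot_prompt)  # send this to whichever model you are prompting
```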
Philipp Schmid (@_philschmid) 's Twitter Profile Photo

An open LLM that can be used for LLM-as-a-Judge evaluation as strong as OpenAI GPT-4 or Anthropic Claude 3? 🤯 Yes, KAIST AI just published PROMETHEUS 2, an open LLM specialized in evaluating other LLMs that correlates highly with human and GPT-4 judgments. 🔥

Implementation:
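
A generic LLM-as-a-judge prompt in the spirit of the announcement, not Prometheus 2's exact format (see its paper and repo for that): the judge gets a rubric, the instruction, and the candidate response, and returns feedback plus a 1-5 score.

```python
def build_judge_prompt(instruction: str, response: str, rubric: str) -> str:
    """Assemble a rubric-based evaluation prompt for a judge model."""
    return (
        "You are an impartial evaluator.\n"
        f"Rubric:\n{rubric}\n\n"
        f"Instruction:\n{instruction}\n\n"
        f"Response to evaluate:\n{response}\n\n"
        "Give brief feedback, then a score from 1 to 5 on the final line as 'Score: N'."
    )

prompt = build_judge_prompt(
    instruction="Summarise the article in two sentences.",
    response="The article says...",
    rubric="5 = faithful and concise; 1 = inaccurate or rambling.",
)
# verdict = judge_model.generate(prompt)  # whichever judge model you load
```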
Daniel Han (@danielhanchen) 's Twitter Profile Photo

Fixed continual finetuning in Unsloth AI! When continuing to finetune LoRA adapters, the loss goes haywire

I accidentally set the tokenizer's padding side to "left", not "right" on new runs! Whoops!

+ save 20mins with multi GGUF options!

Colab for both: colab.research.google.com/drive/1rU4kVb9…
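
The gist of the fix, as a defensive check (the model id is a stand-in): causal-LM training expects right padding, and a resumed run that silently flips the tokenizer to left padding will send the loss haywire.

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # stand-in for your actual base model

# Before continuing to finetune LoRA adapters, make sure padding stays on the right.
if tokenizer.padding_side != "right":
    tokenizer.padding_side = "right"
```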
Benji Hyam (@benjihyam) 's Twitter Profile Photo

We’ve found a 72% correlation between our clients’ ranking on the first page of Google and being mentioned by AI tools, such as ChatGPT and Perplexity. Here’s the approach we take for getting brands to show up in AI search. growandconvert.com/ai/google-seo-…

Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

This paper studies vibe coding, a style where developers build apps by talking with an LLM instead of writing code.

The authors conclude that skill stays vital, only the hands‑on parts move from keyboard to prompt and quick review.

Many assume AI will erase coding grunt work
Sophia Yang, Ph.D. (@sophiamyang) 's Twitter Profile Photo

New Mistral AI cookbook: Self-Supervised Prompt Optimization

❌ Prompt engineering... sucks. It's a non-standard process, heavily relying on trial and error and difficult to standardize
🤩 Luckily, we can automate it using ✨prompt optimization✨, investigated in recent works
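
A generic prompt-optimization loop in the same spirit (not the cookbook's actual code): ask an LLM to propose rewrites of the current prompt, score each candidate on a small eval set, and keep the best. call_llm and score_prompt are hypothetical stand-ins for the model call and your evaluation.

```python
def call_llm(prompt: str) -> str:
    raise NotImplementedError("wire this to your LLM provider")

def score_prompt(prompt: str, eval_set) -> float:
    raise NotImplementedError("e.g. accuracy of the prompted model on eval_set")

def optimize(prompt: str, eval_set, rounds: int = 3, n_candidates: int = 4) -> str:
    """Hill-climb over prompt rewrites proposed by the LLM itself."""
    best, best_score = prompt, score_prompt(prompt, eval_set)
    for _ in range(rounds):
        for _ in range(n_candidates):
            candidate = call_llm(
                "Rewrite this prompt so the model answers more accurately, "
                f"keeping the original intent:\n{best}"
            )
            score = score_prompt(candidate, eval_set)
            if score > best_score:
                best, best_score = candidate, score
    return best
```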
Machina (@exm7777) 's Twitter Profile Photo

Gemini Nano Banana can be incredible if you know what prompts to use...

i've built my own library of presets for ads, logos and all kinds of styles

bookmark these 20 JSON templates: