
Anish Shah
@ash0ts
Making ML Make Sense @ wandb.ai 👾
ID: 1461369766455787523
18-11-2021 16:24:56
62 Tweets
136 Followers
270 Following

Evaluated Llama 3.1 70B (Fireworks AI) as well, and it performs at the same level as Mistral Large 2 128B.
- Llama outperforms on GSM8K (math) while Mistral outperforms on the MATH benchmark 😅
- Llama > Mistral (reasoning tasks)
- Mistral > Llama (Q&A tasks)
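For reference, a head-to-head run like this takes only a few lines with Weave. Below is a minimal sketch assuming the weave.Evaluation API, with a toy dataset, a placeholder model call, and an exact-match scorer standing in for the real GSM8K/MATH harness.

```python
import asyncio

import weave

# Toy examples; the real runs load the full GSM8K / MATH benchmarks.
examples = [
    {"question": "What is 12 * 7?", "expected": "84"},
    {"question": "What is 15 + 27?", "expected": "42"},
]

@weave.op()
def exact_match(expected: str, output: str) -> dict:
    # Older weave releases name this parameter `model_output` instead of `output`.
    return {"correct": output.strip() == expected.strip()}

@weave.op()
def llama_70b(question: str) -> str:
    # Placeholder: swap in a real call to the Fireworks AI (or any other) endpoint.
    return "84"

if __name__ == "__main__":
    weave.init("llama-vs-mistral-evals")  # project name is illustrative
    evaluation = weave.Evaluation(dataset=examples, scorers=[exact_match])
    print(asyncio.run(evaluation.evaluate(llama_70b)))
```

Swapping llama_70b for a Mistral wrapper and re-running the same Evaluation is what makes the side-by-side comparison possible.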



I will be talking tomorrow on:
- the LLM landscape
- the need for structured output: function calling, JSON mode, constrained decoding, and more (quick sketch below)
- RAG
- LLM system evaluation
- Weights & Biases Weave for building LLM applications correctly
If it excites you, consider showing up. 💫⭐🌟
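As a small taste of the structured-output topic, here is a minimal sketch assuming the OpenAI Python client's JSON mode plus Pydantic for validation; the model name and schema are purely illustrative.

```python
import json

from openai import OpenAI
from pydantic import BaseModel, ValidationError

class CityFact(BaseModel):
    city: str
    population_millions: float

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# JSON mode constrains the model to emit valid JSON, which we then
# validate against the schema above.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    response_format={"type": "json_object"},
    messages=[
        {
            "role": "system",
            "content": 'Reply with JSON: {"city": str, "population_millions": float}',
        },
        {"role": "user", "content": "Tell me about Tokyo."},
    ],
)

try:
    fact = CityFact.model_validate(json.loads(response.choices[0].message.content))
    print(fact)
except ValidationError as err:
    # In a real pipeline you would retry or repair the output here.
    print("Schema validation failed:", err)
```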



Join us on September 10 to learn how to build production-ready RAG systems. Learn from Anish Shah about optimizing pipelines, enhancing queries, and scaling solutions for real-world applications. Ideal for tech leads and product managers driving AI innovation. Register now:



⚡️ AI Hacker Cup Lightning Comp
Today we're kicking off a ⚡️ 7-day competition to solve all 5 of the 2023 practice Hacker Cup challenges with Mistral AI models.
Our current baseline is 2/5 with the starter RAG agent (with reflection); a rough sketch of that reflection loop is below.
Mistral AI API access is provided.
Details 👇
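Reflection here typically means: draft a program, run it on the practice input, and feed any mismatch back to the model for another attempt. A bare-bones sketch of that loop follows; call_mistral is a placeholder for a real Mistral AI API call, and none of this is the actual starter-kit code.

```python
import subprocess
import sys

MAX_ROUNDS = 3

def call_mistral(prompt: str) -> str:
    """Placeholder: wire up the Mistral AI client of your choice here."""
    raise NotImplementedError

def run_candidate(code: str, sample_input: str) -> str:
    """Execute the generated program on the practice input and capture stdout."""
    proc = subprocess.run(
        [sys.executable, "-c", code],
        input=sample_input,
        capture_output=True,
        text=True,
        timeout=30,
    )
    return proc.stdout

def solve_with_reflection(problem: str, sample_input: str, expected_output: str) -> str:
    code = call_mistral(f"Write a Python program that solves:\n{problem}")
    for _ in range(MAX_ROUNDS):
        observed = run_candidate(code, sample_input)
        if observed.strip() == expected_output.strip():
            return code  # passes the practice case
        # Reflection: show the model its own output vs. the expected output and ask for a fix.
        code = call_mistral(
            f"Your program produced:\n{observed}\nbut the expected output is:\n{expected_output}\n"
            f"Reflect on the mistake and return a corrected program.\n\nCurrent program:\n{code}"
        )
    return code  # best effort after MAX_ROUNDS
```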


Today I gave LlamaIndex 🦙's workflows a go to tackle the NeurIPS AI Hacker Cup competition. I created a Workflow to iterate from an initial solution, run the generated solution, and check it against the expected output. It is effortless to define steps with the expected inputs and outputs.
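Roughly the shape that Workflow takes, assuming the llama_index.core.workflow API (Workflow, step, Event); the drafting, running, and revision steps are stubbed out rather than being the actual competition code.

```python
import asyncio

from llama_index.core.workflow import (
    Event,
    StartEvent,
    StopEvent,
    Workflow,
    step,
)

MAX_ATTEMPTS = 3

class SolutionEvent(Event):
    """Carries a candidate solution and which attempt produced it."""
    code: str
    attempt: int

class SolveWorkflow(Workflow):
    @step
    async def generate(self, ev: StartEvent) -> SolutionEvent:
        # In the real workflow this step calls an LLM with the problem statement.
        draft = f"# first draft for: {ev.problem}"
        return SolutionEvent(code=draft, attempt=1)

    @step
    async def check(self, ev: SolutionEvent) -> SolutionEvent | StopEvent:
        # Stub: the real step runs the generated program and diffs its output
        # against the expected output from the practice case.
        passed = False
        if passed or ev.attempt >= MAX_ATTEMPTS:
            return StopEvent(result=ev.code)
        # Otherwise loop back with a revised attempt (another LLM call in practice).
        return SolutionEvent(code=ev.code + "\n# revision", attempt=ev.attempt + 1)

async def main() -> None:
    workflow = SolveWorkflow(timeout=120)
    print(await workflow.run(problem="2023 practice problem A"))

if __name__ == "__main__":
    asyncio.run(main())
```

Typed events are what make it effortless to define steps with the expected inputs and outputs: each step's signature declares which event it consumes and which it can emit.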


Defining your grading criteria for LLM outputs is an organic process that evolves the more time you spend with them. In "Who Validates the Validators", Shreya Shankar et al. highlight exactly this. This weekend in our SF office, a good chunk of hackers will learn it firsthand 😃 lu.ma/judge
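One lightweight way to let that evolution happen is to keep the rubric as plain data you keep editing as you read more outputs; a small sketch below, where the judge call itself is a placeholder rather than any specific library.

```python
# The rubric starts small and grows as you read more outputs; version it
# alongside your evals so you can see how the criteria themselves evolved.
GRADING_CRITERIA = [
    "Answers the user's actual question",
    "Cites only facts present in the retrieved context",
    "Is no more than three sentences long",
]

def build_judge_prompt(question: str, answer: str) -> str:
    criteria = "\n".join(f"- {c}" for c in GRADING_CRITERIA)
    return (
        "Grade the answer against each criterion with PASS or FAIL and a one-line reason.\n"
        f"Criteria:\n{criteria}\n\nQuestion: {question}\nAnswer: {answer}"
    )

def judge(question: str, answer: str) -> str:
    """Placeholder: send build_judge_prompt(...) to whichever LLM you use as the judge."""
    raise NotImplementedError
```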




