SRK (@sudalairajkumar) 's Twitter Profile
SRK

@sudalairajkumar

๐Ÿ“ˆ AI Innovations @TigerAnalytics
๐Ÿฆ† 4x @kaggle Grandmaster
๐Ÿ“Š AI / ML Advisor
โšก AI Tinkerer in Web3 โšก
๐Ÿ–ฅ๏ธ Ex - @h2oai @FreshworksInc

ID: 65376897

linkhttps://srkaidaily.substack.com/ calendar_today13-08-2009 14:37:18

510 Tweet

8,8K Followers

586 Following

Ravi Theja (@ravithejads) 's Twitter Profile Photo

๐Ÿšจ๐‘๐ž๐ฅ๐ž๐š๐ฌ๐ข๐ง๐  ๐ญ๐ก๐ž ๐“๐ž๐ฅ๐ฎ๐ ๐ฎ-๐‹๐ฅ๐š๐ฆ๐š-7๐-๐ฏ0-๐ˆ๐ง๐ฌ๐ญ๐ซ๐ฎ๐œ๐ญ ๐Œ๐จ๐๐ž๐ฅ ๐Ÿš€ Last week, we announced Telugu-LLM-Labs, a joint independent collaborative effort by me, and Ramsri Goutham Golla, where we released datasets translated and romanized in Telugu. ๐Ÿ”ฅ Today,

๐Ÿšจ๐‘๐ž๐ฅ๐ž๐š๐ฌ๐ข๐ง๐  ๐ญ๐ก๐ž ๐“๐ž๐ฅ๐ฎ๐ ๐ฎ-๐‹๐ฅ๐š๐ฆ๐š-7๐-๐ฏ0-๐ˆ๐ง๐ฌ๐ญ๐ซ๐ฎ๐œ๐ญ ๐Œ๐จ๐๐ž๐ฅ ๐Ÿš€

Last week, we announced Telugu-LLM-Labs, a joint independent collaborative effort by me, and <a href="/ramsri_goutham/">Ramsri Goutham Golla</a>, where we released datasets translated and romanized in Telugu.

๐Ÿ”ฅ Today,
Ramsri Goutham Golla (@ramsri_goutham) 's Twitter Profile Photo

Open-source models like Llama2 and Mistral are not good with Indian languages because of the scarcity of native tokens in the tokenizer and the lack of a strong presence in training data. One option is to expand the tokenizer vocabulary and pre-train with a lot of native text.

Open-source models like Llama2 and Mistral are not good with Indian languages because of the scarcity of native tokens in the tokenizer and the lack of a strong presence in training data.

One option is to expand the tokenizer vocabulary and pre-train with a lot of native text.
SRK (@sudalairajkumar) 's Twitter Profile Photo

"Machine Learning Engineering Open Book" by Stas Bekman This open book has a lot of information on the Engineering aspects of building DL / ML models specifically LLM and Multi-modal models. This open book is continuously being updated. A pdf version can also be downloaded.

"Machine Learning Engineering Open Book" by Stas Bekman

This open book has a lot of information on the Engineering aspects of building DL / ML models specifically LLM and Multi-modal models.

This open book is continuously being updated. A pdf version can also be downloaded.
SRK (@sudalairajkumar) 's Twitter Profile Photo

ICYMI - Andrej Karpathy has released an excellent video tutorial on "Tokenization" couple of days back. โฆฟ Basics covered: Strings, Unicode code points, and encodings like UTF-8. โฆฟ Byte pair encoding algorithm explained and implemented in Python. โฆฟ Delving into complexities:

ICYMI - Andrej Karpathy has released an excellent video tutorial on "Tokenization" couple of days back.

โฆฟ Basics covered: Strings, Unicode code points, and encodings like UTF-8.
โฆฟ Byte pair encoding algorithm explained and implemented in Python.
โฆฟ Delving into complexities:
Ravi Theja (@ravithejads) 's Twitter Profile Photo

๐Ÿ”ฅ ๐‘๐ž๐ฅ๐ž๐š๐ฌ๐ข๐ง๐  ๐ˆ๐ง๐๐ข๐œ ๐†๐ž๐ฆ๐ฆ๐š 7๐/2๐ ๐ˆ๐ง๐ฌ๐ญ๐ซ๐ฎ๐œ๐ญ๐ข๐จ๐ง ๐ญ๐ฎ๐ง๐ž๐ ๐ฆ๐จ๐๐ž๐ฅ ๐จ๐ง 9 ๐ˆ๐ง๐๐ข๐š๐ง ๐‹๐š๐ง๐ ๐ฎ๐š๐ ๐ž๐ฌ โ€” ๐๐š๐ฏ๐š๐ซ๐š๐ฌ๐š ๐Ÿš€ We are thrilled to share ๐ŸŒŸ ๐๐š๐ฏ๐š๐ซ๐š๐ฌ๐š, a Gemma 7B & 2B instruction-tuned models in 9 Indian Languages - Perhaps

๐Ÿ”ฅ ๐‘๐ž๐ฅ๐ž๐š๐ฌ๐ข๐ง๐  ๐ˆ๐ง๐๐ข๐œ ๐†๐ž๐ฆ๐ฆ๐š 7๐/2๐ ๐ˆ๐ง๐ฌ๐ญ๐ซ๐ฎ๐œ๐ญ๐ข๐จ๐ง ๐ญ๐ฎ๐ง๐ž๐ ๐ฆ๐จ๐๐ž๐ฅ ๐จ๐ง 9 ๐ˆ๐ง๐๐ข๐š๐ง ๐‹๐š๐ง๐ ๐ฎ๐š๐ ๐ž๐ฌ โ€” ๐๐š๐ฏ๐š๐ซ๐š๐ฌ๐š ๐Ÿš€

We are thrilled to share  ๐ŸŒŸ ๐๐š๐ฏ๐š๐ซ๐š๐ฌ๐š, a Gemma 7B &amp; 2B instruction-tuned models in 9 Indian Languages - Perhaps
Ravi Theja (@ravithejads) 's Twitter Profile Photo

๐ŸŒŸ RAG in 2024 with LlamaIndex ๐Ÿฆ™ Super excited to share that I will be speaking about RAG in 2024 with LlamaIndex ๐Ÿฆ™ at Saama Connect March Meetup in Chennai this Saturday. I will be covering a wide range of topics: 1๏ธโƒฃ Retrievers 2๏ธโƒฃ Agents 3๏ธโƒฃ Latest Research in RAG 4๏ธโƒฃ

๐ŸŒŸ RAG in 2024 with <a href="/llama_index/">LlamaIndex ๐Ÿฆ™</a>

Super excited to share that I will be speaking about RAG in 2024 with <a href="/llama_index/">LlamaIndex ๐Ÿฆ™</a> at <a href="/SaamaOfficial/">Saama</a> Connect March Meetup in Chennai this Saturday.

I will be covering a wide range of topics:

1๏ธโƒฃ Retrievers
2๏ธโƒฃ Agents
3๏ธโƒฃ Latest Research in RAG
4๏ธโƒฃ
SRK (@sudalairajkumar) 's Twitter Profile Photo

In case you missed it, 3Blue1Brown has released a video on "But what is a GPT?Visual intro to Transformers" couple of days back. Even if you have a good knowledge on Transformers, this is highly recommended. Visual illustrations help us grasp the underlying concepts easily.

In case you missed it, 3Blue1Brown has released a video on "But what is a GPT?Visual intro to Transformers" couple of days back.

Even if you have a good knowledge on Transformers, this is highly recommended. Visual illustrations help us grasp the underlying concepts easily.
Akshat Metaforms (@ofcakshat) 's Twitter Profile Photo

Yes, Typeform is shit expensive, but that's because they've not upgraded at all. It's exactly the same single-scree interface that it was 10 years ago. Imagine if OpenAI and Typeform had a kidโ€” Introducing Metaforms AI

Vishnu - Jarvislabs.ai (@vishnuvig) 's Twitter Profile Photo

Ola recently announced that they are bringing affordable AI to Indian developers. ๐‰๐š๐ซ๐ฏ๐ข๐ฌ๐ฅ๐š๐›๐ฌ an Indian company has been providing affordable GPUs for developers across the globe since 2020. We are a little known, so I want to share our story here. ๐–๐ก๐จ ๐ฐ๐ž ๐š๐ซ๐ž

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

๐Ÿ“ฝ๏ธ New 4 hour (lol) video lecture on YouTube: "Letโ€™s reproduce GPT-2 (124M)" youtu.be/l8pRSuU81PU The video ended up so long because it is... comprehensive: we start with empty file and end up with a GPT-2 (124M) model: - first we build the GPT-2 network - then we optimize

๐Ÿ“ฝ๏ธ New 4 hour (lol) video lecture on YouTube:
"Letโ€™s reproduce GPT-2 (124M)"
youtu.be/l8pRSuU81PU

The video ended up so long because it is... comprehensive: we start with empty file and end up with a GPT-2 (124M) model:
- first we build the GPT-2 network 
- then we optimize
SRK (@sudalairajkumar) 's Twitter Profile Photo

An insightful blog by Hamel Husain on "Your AI Product Needs Evals" - a must-read for anyone looking to construct robust LLM evaluation systems for their applications. Here are the three levels of creating LLM evaluation systems: โœจ Level 1 - Unit Tests: โ–ช Write scoped tests

An insightful blog by <a href="/HamelHusain/">Hamel Husain</a> on "Your AI Product Needs Evals" - a must-read for anyone looking to construct robust LLM evaluation systems for their applications.

Here are the three levels of creating LLM evaluation systems:
โœจ Level 1 - Unit Tests:
โ–ช Write scoped tests
SRK (@sudalairajkumar) 's Twitter Profile Photo

As multi-agent systems are on the rise, I wrote a simple blog post on building a "News Agent using CrewAI ." This introductory blog post covers: โœฆ Basic building blocks of building an agent system with CrewAI โœฆ Creating a custom tool using DuckDuckGo search to get the

As multi-agent systems are on the rise, I wrote a simple blog post on building a "News Agent using <a href="/crewAIInc/">CrewAI</a> ."

This introductory blog post covers:
โœฆ Basic building blocks of building an agent system with CrewAI
โœฆ Creating a custom tool using DuckDuckGo search to get the
SRK (@sudalairajkumar) 's Twitter Profile Photo

I'll be speaking at the Analytics Vidhya DataHack Summit 2024, on "Agentic Framework for #GenAI Applications" Planning to cover the following topics: ยป Understanding of Multi-agent systems and their frameworks ยป Practical Tools and Techniques: AutoGen, CrewAI, PhiData, Function

I'll be speaking at the <a href="/AnalyticsVidhya/">Analytics Vidhya</a> DataHack Summit 2024, on "Agentic Framework for #GenAI Applications"

Planning to cover the following topics:
ยป Understanding of Multi-agent systems and their frameworks
ยป Practical Tools and Techniques: AutoGen, CrewAI, PhiData, Function
Sanyam Bhutani (@bhutanisanyam1) 's Twitter Profile Photo

Life update: I have moved to Bay Area to work Meta HQ! ๐Ÿ™ The flight from India takes a day but my journey was 2 years to get to Silicon Valley: In 2022, Jeremy Howard gave an advice that took over my mind: โ€œYou should live in Bay Area for a while if you want to meet some

Life update: I have moved to Bay Area to work <a href="/Meta/">Meta</a> HQ! ๐Ÿ™

The flight from India takes a day but my journey was 2 years to get to Silicon Valley: 

In 2022, <a href="/jeremyphoward/">Jeremy Howard</a> gave an advice that took over my mind:

โ€œYou should live in Bay Area for a while if you want to meet some
SRK (@sudalairajkumar) 's Twitter Profile Photo

We (Akash Milton, Cursor and I) hacked together a simple app over the weekend - to get the earnings call summaries of companies in Indian markets. Summaries are created by #LLMs and the deployment is done using Vercel Creating apps are getting breezy! App link:

We (<a href="/mil10akash/">Akash Milton</a>, <a href="/cursor_ai/">Cursor</a> and I) hacked together a simple app over the weekend - to get the earnings call summaries of companies in Indian markets.

Summaries are created by #LLMs and the deployment is done using <a href="/vercel/">Vercel</a> 

Creating apps are getting breezy!

App link: