Sveta (@churinasveta) 's Twitter Profile
Sveta

@churinasveta

NLP, Deep learning

ID: 1732770983365558273

calendar_today07-12-2023 14:36:23

18 Tweet

13 Followers

76 Following

Rowan Cheung (@rowancheung) 's Twitter Profile Photo

AI NEWS: Google just admitted the mind-blowing Gemini AI demo was staged. Plus, significant developments in AI from, Grok, Berkeley AI Research, AI regulation in the EU, NotebookLM, Seattle/UW Medicine, and 9 new AI tools. Here's everything you need to know:

Sebastian Raschka (@rasbt) 's Twitter Profile Photo

LoRA remains one of my favorite techniques for parameter-efficient finetuning of LLMs. Here's an implementation of LoRA (from scratch!) to train a GPT model to achieve 98% accuracy in SPAM classification: github.com/rasbt/LLMs-fro…

LoRA remains one of my favorite techniques for parameter-efficient finetuning of LLMs.
Here's an implementation of LoRA (from scratch!) to train a GPT model to achieve 98% accuracy in SPAM classification: github.com/rasbt/LLMs-fro…
Sveta (@churinasveta) 's Twitter Profile Photo

🎉 Just had an incredible experience attending The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)! 🎉 - via #Whova event app

🎉 Just had an incredible experience attending The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)! 🎉  - via #Whova event app
Google DeepMind (@googledeepmind) 's Twitter Profile Photo

We’re releasing DataGemma: open models that enhance LLM factuality by grounding them with real-world data from Google's Data Commons. 💡 It tackles hallucinations in AI models to generate more accurate and useful responses. Here’s how they work 🧵 dpmd.ai/47nWbvK

We’re releasing DataGemma: open models that enhance LLM factuality by grounding them with real-world data from <a href="/Google/">Google</a>'s Data Commons. 💡

It tackles hallucinations in AI models to generate more accurate and useful responses.

Here’s how they work 🧵 dpmd.ai/47nWbvK
Sebastian Raschka (@rasbt) 's Twitter Profile Photo

The Llama 3.2 1B and 3B models are my favorite LLMs -- small but very capable. If you want to understand how the architectures look like under the hood, I implemented them from scratch (one of the best ways to learn): github.com/rasbt/LLMs-fro…

The Llama 3.2 1B and 3B models are my favorite LLMs -- small but very capable.
If you want to understand how the architectures look like under the hood, I implemented them from scratch (one of the best ways to learn):  github.com/rasbt/LLMs-fro…
Sveta (@churinasveta) 's Twitter Profile Photo

That’s nowadays I think most common problem in Singapore as well, which is not being taken into the account at all unfortunately

Thomas Wolf (@thom_wolf) 's Twitter Profile Photo

Thrilled to finally share what we've been working on for months at Hugging Face 🤝Pollen Robotics Our first robot: Reachy Mini A dream come true: cute and low priced, hackable yet easy to use, powered by open-source and the infinite community. Tiny price, small size, huge

Sveta (@churinasveta) 's Twitter Profile Photo

Uploading a paper to arXiv from Overleaf is like going to therapy: you suddenly realize how many red flags there were that still need fixing 😩