Charmaine Mahachi (@cxmahachi) Twitter Tweets • TwiCopy

Charmaine Mahachi

@cxmahachi

+ Follow

Learning how to build Language Applications with Transformers | Python | PyTorch | FastMLX Docs @https://blaizzy.github.io/fastmlx/

ID: 1695501993379315712

linkhttps://medium.com/@charmainemahachi calendar_today26-08-2023 18:22:51

69 Tweet

93 Followers

46 Following

Charmaine Mahachi

@cxmahachi

22 days ago

Today I learnt: 1. Fine-tuning BERT with some layers frozen (82.1% accuracy achieved) 2. How to create a report on Weights & Biases and how to compare results from different projects 3. CS229 Lecture 1: Introduction to linear algebra. Covered the math behind ml. Lecture 1 focused on

thumb_up_off_alt504

chat_bubble_outline28

repeat17

shareShare

Prince Canuma

@prince_canuma

22 days ago

mlx-vlm v0.4.3 is here 🚀 Day-0 support: 🔥 Gemma 4 (vision, audio, MoE) by Google DeepMind 🦅 Falcon-OCR + Falcon Perception by Technology Innovation Institute 🪨 Granite Vision 4.0 by IBM Research New models: 🎯 SAM 3.1 with Object Multiplex by Facebook 🔍 RF-DETR detection & segmentation by

mlx-vlm v0.4.3 is here 🚀

Day-0 support:
🔥 Gemma 4 (vision, audio, MoE) by <a href="/GoogleDeepMind/">Google DeepMind</a>
🦅 Falcon-OCR + Falcon Perception by <a href="/TIIuae/">Technology Innovation Institute</a>
🪨 Granite Vision 4.0 by <a href="/IBMResearch/">IBM Research</a>

New models:
🎯 SAM 3.1 with Object Multiplex by <a href="/facebook/">Facebook</a>
🔍 RF-DETR detection & segmentation by

thumb_up_off_alt1,1K

chat_bubble_outline76

repeat187

shareShare

Amit Shekhar

@amitiitbhu

22 days ago

x.com/i/article/2039…

thumb_up_off_alt1,1K

chat_bubble_outline5

repeat180

shareShare

Charmaine Mahachi

@cxmahachi

21 days ago

Brain is fuming, so many math equations. But today I’m learning about one of my favourite topics: Tensor decomposition! 😁 I did my bachelor’s thesis on this, titled ‘Low rank compression of CNNs using tensor decomposition methods’

thumb_up_off_alt129

chat_bubble_outline6

repeat2

shareShare

Charmaine Mahachi

@cxmahachi

21 days ago

Today I learnt: 1. Quadratic forms and definitiveness - How the eigen values of the symmetric matrix A determines if they are positive, negative or indefinite 2.SVD and EVD - Decomposition of matrices focusing on singular and eigen value decomposition - Looked at the formulae

thumb_up_off_alt342

chat_bubble_outline13

repeat17

shareShare

Charmaine Mahachi

@cxmahachi

21 days ago

Today's lecture was very information dense and didn't get to go through the BERT paper again as I intended to (make the paper breakdown). Making this P1 for tomorrow and will also revisit the concepts again to master them before continuing

thumb_up_off_alt44

chat_bubble_outline1

repeat2

shareShare

Charmaine Mahachi

@cxmahachi

21 days ago

Out of curiosity, how many times do you read a research paper before implementing it?

thumb_up_off_alt49

chat_bubble_outline17

repeat2

shareShare

Charmaine Mahachi

@cxmahachi

21 days ago

A paper a day, keeps the doctor away

thumb_up_off_alt2,2K

chat_bubble_outline27

repeat150

shareShare

Charmaine Mahachi

@cxmahachi

20 days ago

This is such a great list of papers to start with. I got some questions on where to begin and I would say this is a great starting point if you’re interested in LLMs and Transformers

thumb_up_off_alt14

chat_bubble_outline1

repeat1

shareShare

Charmaine Mahachi

@cxmahachi

20 days ago

These are the books I keep going back to as someone who is interested in LLMs and Transformers. I reach for these gems a few times a day. Book 1: Natural Language Processing with Transformers by Lewis Tunstall Leandro von Werra Thomas Wolf Book 2: Hands On Large Language Models bu

thumb_up_off_alt267

chat_bubble_outline7

repeat25

shareShare

Prince Canuma

@prince_canuma

20 days ago

First time lapse of Gym Geeks 🤣 Where wifey and I train hard, and maybe discuss latest updates in the AI space. Maziyar PANAHI

thumb_up_off_alt17

chat_bubble_outline2

repeat1

shareShare

Charmaine Mahachi

@cxmahachi

18 days ago

Fine-tuning transformers used to feel intimidating to me, but it doesn't have to be. I just published a beginner-friendly guide to fine-tuning BERT using Hugging Face, where I walk through: - What fine-tuning actually is - A quick intro to BERT - A full code walkthrough on the

thumb_up_off_alt34

chat_bubble_outline1

repeat1

shareShare