
ƬⲘ
@tm23twt
19 . ml . engineer . tm23-portfolio.vercel.app . tmwork.vercel.app
ID: 1918285228403253249
02-05-2025 12:43:57
1.1K Tweets
348 Followers
253 Following

nothing, just coding Byte pair encoding from scratch. for those who don't know, this popular method is used as the tokenization algorithm for models like GPT-2 to GPT-4, Llama 3 & many more :)
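
For anyone curious what "from scratch" can look like, here is a minimal byte-level BPE sketch. It is not the code from this post (that hasn't been shared yet), just one common way to do it: start from raw UTF-8 bytes and repeatedly merge the most frequent adjacent pair into a new token id. The names `get_pair_counts`, `merge`, `train_bpe` and `decode` are made up for illustration, and real tokenizers like the ones behind GPT or Llama add pre-tokenization rules and special tokens that are left out here.

```python
from collections import Counter


def get_pair_counts(ids):
    """Count how often each adjacent pair of token ids occurs."""
    return Counter(zip(ids, ids[1:]))


def merge(ids, pair, new_id):
    """Replace each non-overlapping occurrence of `pair` with `new_id`, left to right."""
    out = []
    i = 0
    while i < len(ids):
        if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out


def train_bpe(text, num_merges):
    """Learn up to `num_merges` merges over the UTF-8 bytes of `text` (illustrative helper)."""
    ids = list(text.encode("utf-8"))  # base vocabulary: the 256 byte values
    merges = {}  # (id, id) -> new id, kept in the order the merges were learned
    for step in range(num_merges):
        counts = get_pair_counts(ids)
        if not counts:
            break  # fewer than two tokens left, nothing to merge
        pair = max(counts, key=counts.get)  # most frequent adjacent pair
        new_id = 256 + step  # new ids start right after the byte range
        ids = merge(ids, pair, new_id)
        merges[pair] = new_id
    return ids, merges


def decode(ids, merges):
    """Map token ids back to text by expanding every learned merge."""
    vocab = {i: bytes([i]) for i in range(256)}
    for (a, b), new_id in merges.items():
        vocab[new_id] = vocab[a] + vocab[b]
    return b"".join(vocab[i] for i in ids).decode("utf-8", errors="replace")


if __name__ == "__main__":
    ids, merges = train_bpe("aaabdaaabac", num_merges=3)
    print(ids, merges)
    print(decode(ids, merges))
```

Each merge shortens the sequence and grows the vocabulary by one id, so `num_merges` is the knob that trades vocabulary size against sequence length.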

Done with the Byte Pair Encoding paper, now onto the next paper, on Rotary Positional Embeddings. i'm slowly getting the hang of research papers bro. will share the BPE code later. also weekend is cooked chat, btw what y'all doing :) paper link
