Jonibek Mansurov
@m_jonibek
ID: 1658226525831852035
15-05-2023 21:43:41
2 Tweets
12 Followers
69 Following
Final work promotion in 2024, by Jonibek Mansurov! We managed to achieve ~75% on the challenging GPQA benchmark with only 2 transformer layers (~40M params) that were trained on different data; in our case, MedMCQA. Introducing...
⭐️ Reasoning LLMs trained on English data can think in other languages. Read our paper to learn more! Thank you Yong Zheng-Xin (Yong) for leading the project and team! It was an exciting collab! farid Jonibek Mansurov Ruochen Zhang Niklas Muennighoff Carsten Eickhoff Julia Kreutzer