Marcos Gorgojo
@marcosgorgojo
Building @thevisualizerai Turn your research, notes, and transcripts into easy-to-understand diagrams with AI.
ID: 426549234
http://thevisualizer.ai 02-12-2011 11:24:54
2,2K Tweet
2,2K Takipçi
701 Takip Edilen
You can now train Mistral Ministral 3 with reinforcement learning in our free notebook! You'll GRPO the model to solve sudoku autonomously. Learn about our new reward functions, RL environment & reward hacking. Blog: docs.unsloth.ai/new/ministral-3 Notebook: colab.research.google.com/github/unsloth…