
Tian Jin @ ICLR
@tjingrant
PhD student @MIT_CSAIL, previously @IBMResearch, @haverfordedu .
ID: 3078864701
http://www.tjin.org 08-03-2015 06:52:01
81 Tweet
334 Takipçi
312 Takip Edilen



Haitham Bou Ammar Same thought! To my knowledge, this optimal value was first documented in "Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning" by Greensmith, Bartlett, and Baxter (JMLR 2004). The interesting finding is that this baseline value can achieve global





Happy Throughput Thursday! We’re excited to release Tokasaurus: an LLM inference engine designed from the ground up for high-throughput workloads with large and small models. (Joint work with Ayush Chakravarthy, Ryan Ehrlich, Sabri Eyuboglu, Bradley Brown, Joseph Shetaye,






Check out the 999 open models that Google has released on Hugging Face: huggingface.co/google (Comparative numbers: 387 for Microsoft, 33 for OpenAI, 0 for Anthropic).







