Yogesh (@yogeshbalaji95)'s Twitter Profile
Yogesh

@yogeshbalaji95

Research scientist at Nvidia.

ID: 2604687852

05-07-2014 03:52:23

34 Tweets

262 Followers

282 Following

Charlotte Bunne (@_bunnech)

The Optimal Transport and Machine Learning #OTML workshop NeurIPS Conference #NeurIPS2021 is now open for submissions via @OpenReviewnet. Deadline is September 18, 2021! We look forward to your contributions! otml2021.github.io/call

Ming-Yu Liu (@liu_mingyu)

We are looking for several Ph.D. interns for 2022 spring/summer/fall. We plan to cover several topics, including neural rendering, zero-shot segmentation, text--image modeling, speech/music generation, and denoising diffusion models. If interested, send your CV to me.

Andrej Karpathy (@karpathy)

The ongoing consolidation in AI is incredible. Thread: ➡️ When I started ~decade ago vision, speech, natural language, reinforcement learning, etc. were completely separate; You couldn't read papers across areas - the approaches were completely different, often not even ML based.

Chen-Hsuan Lin (@chenhsuanlin)

Our team at NVIDIA Research has an open PhD internship position to work on 3D reconstruction & view synthesis problems -- please send me your CV (via email) if you're interested!

Polina Kirichenko (@polkirichenko)

Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations. ERM learns multiple features that can be reweighted for SOTA on spurious correlations, reducing texture bias on ImageNet, & more! w/ Pavel Izmailov and Andrew Gordon Wilson arxiv.org/abs/2204.02937 1/11

Mazda Moayeri (@mlmazda)

To hear the whole story, check out our talk at #CVPR2022 oral session 4.2.3 at ⏰1:30pm⏰ in Great Hall A-D (or visit poster 39b). Paper: arxiv.org/abs/2201.10766 *Richly Annotated* Dataset: mmoayeri.github.io/RIVAL10 Joint work with Phil, Yogesh, and Soheil Feizi

Ming-Yu Liu (@liu_mingyu)

I’m looking for researchers with experiences and strong passion in large-scale image-text models to join our research team at CA. Strong knowledge on diffusion models, contrastive learning, or data curation is preferred. Team-work first, extreme hard-core, and perfection-driven.

Karsten Kreis (@karsten_kreis)

📢📢 Our team at NVIDIA (Toronto AI Lab, nv-tlabs.github.io) is looking for strong and motivated PhD research interns! We are working at the cutting edge in diffusion models, 3D generation and much more. Email or message me with your CV and website, if you are interested.

Bryan Catanzaro (@ctnzr)

The Megatron team at NVIDIA is hiring! We're looking for people who can contribute to any aspect of foundation model research and development: nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAEx…

Chen-Hsuan Lin (@chenhsuanlin)

Our team @NVIDIA is looking for PhD research interns for fall/winter 2023 to work on 3D generative AI applications -- please send me your CV (via email) if you're interested!

Ekta Prashnani (@ekta_prashnani)

We are hiring a Ph.D. intern @ NVIDIA for research on multi-modal generative models, reinforcement learning, self-supervised representations, and 3D perception for video conferencing. If you have experience leading insightful contributions on any of these topics, DM / email me.

Yogesh (@yogeshbalaji95)

Interested in a commercially safe image generator? Check out this latest collaboration of Getty Images with NVIDIA. Super excited to be a part of this effort. newsroom.gettyimages.com/en/getty-image…

Jia-Bin Huang (@jbhuang0604)

Excited to share PYoCo! PYoCo is a state-of-the-art text-to-video diffusion model that can generate videos up to 1024^2 resolution.

Songwei Ge (@songwei_ge)

Yogesh and I will be presenting our SOTA text-to-video model PYoCo at ICCV 2023! Check out our poster this afternoon in Room "Foyer Sud" - 092!!! 📽️ Website: research.nvidia.com/labs/dir/pyoco/

Soheil Feizi (@feizisoheil)

🚀 Exciting news! We’re launching RELAI with a mission to make AI reliability accessible and achievable for everyone. Our first release: RELAI agents for real-time hallucination detection in popular LLMs. 👉Try it now for free at: relai.ai

Yogesh (@yogeshbalaji95)

Very excited to share our work on building Edify Image - a family of diffusion models for various image generation applications. Please check it out.

Yogesh (@yogeshbalaji95)

Check out NVIDIA Cosmos - world foundation model platform for advancing physical AI. I am really excited to be a part of this incredible effort. Paper link: research.nvidia.com/publication/20… Try out our models here: github.com/NVIDIA/Cosmos

Yogesh (@yogeshbalaji95)

Catch our #CVPR2025 poster today! 🖼️ “A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation” 📍 Exhibit Hall D, Poster #230 🕓 4:00–6:00 PM We explore how LLMs perform as text encoders for image generation—with some interesting findings! 🔗 Webpage: