Aljosa (@aljosaosep) 's Twitter Profile
Aljosa

@aljosaosep

Senior research scientist (@NVIDIAAI, prior @CMU_Robotics, @TU_Muenchen, @RWTH), working on learning to understand the world from video.

ID: 952532437875265538

linkhttp://aljosaosep.github.io calendar_today14-01-2018 13:26:45

843 Tweet

1,1K Takipçi

1,1K Takip Edilen

Aljosa (@aljosaosep) 's Twitter Profile Photo

This should be of interest to folks working on sfm / (point) tracking. You can literally get point trajectories “for free” with your recorded data!

Franziska Gerken (@franzigrkn) 's Twitter Profile Photo

⭐🚨 Tweeprint! 🚨⭐ Very excited to present my & Alana Darcher (she/her)’s PhD research! What happens in your brain when you watch a movie? How is the information underlying the movie’s content distributed across neurons? biorxiv.org/content/10.110…

⭐🚨 Tweeprint! 🚨⭐ 

Very excited to present my &amp; <a href="/AlanaDarcher/">Alana Darcher (she/her)</a>’s PhD research!

What happens in your brain when you watch a movie?

How is the information underlying the movie’s content distributed across neurons?

biorxiv.org/content/10.110…
Jonathon Luiten (@jonathonluiten) 's Twitter Profile Photo

If you’re at #CVPR2024 #CVPR tomorrow (Tuesday) I will be giving a talk on Dynamic 3D Gaussians + SplaTAM (see 👇) as part of the XRNeRF workshop (sites.google.com/view/xrnerf/). Talk is at 9.30am in room Summit 332. The other talks at the workshop also seem super cool! Def check it out

Tobias Fischer (@tobiasfischer11) 's Twitter Profile Photo

I'll be giving a lightning talk at the DDADS workshop (agents4ad.github.io) at #CVPR about the work that provided the basis for this tomorrow 9:45 am! Multi-Level Neural Scene Graphs for Dynamic Urban Environments - We'll also present on Friday at 10:30 am Poster Session 5.

Aljosa (@aljosaosep) 's Twitter Profile Photo

Y’all need detectors for objects that can move but don’t want to label the data? Come by our poster today! #cvpr2024

Aljosa (@aljosaosep) 's Twitter Profile Photo

Not the best photo, but fascinating work: your (unimodal) LLM already has visual representations “somewhere in there”. Talk to these folks to see how to get them out! #cvpr2024

Not the best photo, but fascinating work: your (unimodal) LLM already has visual representations “somewhere in there”. Talk to these folks to see how to get them out! #cvpr2024
Aljosa (@aljosaosep) 's Twitter Profile Photo

I once did that for a friend, and as soon as I put it on I had a group of folks around waiting for the presentation and I just tagged along 😅 by the time it was turn for my poster I already lost my voice!

Aljosa (@aljosaosep) 's Twitter Profile Photo

Best CVPR so far! There sure were some organizational hiccups, but its not the organization that makes conferences great, its the people. Good night, Seattle!

Best CVPR so far! There sure were some organizational hiccups, but its not the organization that makes conferences great, its the people. Good night, Seattle!
Aljosa (@aljosaosep) 's Twitter Profile Photo

Exactly my experience. Submitting less than half baked papers puts uneccessary strain on reviewers. We desperately need to do something about this. I was reviewing for neurips every year, wheter I submitted or not- I will change this policy.

NVIDIA AI Developer (@nvidiaaidev) 's Twitter Profile Photo

Introducing fVDB, a deep learning framework for large-scale, high-performance spatial intelligence. ➡️ nvda.ws/3WIR59J Explore how fVDB builds AI operators on top of OpenVDB to enable reality-scale, AI-ready #digitaltwins. #SIGGRAPH2024

Boyi Li (@boyiliee) 's Twitter Profile Photo

🚀 Introducing 𝐖𝐨𝐥𝐟 🐺: a mixture-of-experts video captioning framework that outperforms GPT-4V and Gemini-Pro-1.5 in general scenes 🖼️, autonomous driving 🚗, and robotics videos 🤖. 👑: wolfv0.github.io/leaderboard.ht…

🚀 Introducing 𝐖𝐨𝐥𝐟 🐺: a mixture-of-experts video captioning framework that outperforms GPT-4V and Gemini-Pro-1.5 in general scenes 🖼️, autonomous driving 🚗, and robotics videos 🤖.

👑: wolfv0.github.io/leaderboard.ht…
Fei-Fei Li (@drfeifei) 's Twitter Profile Photo

#AI is a civilizational technology that will have profound implications to everyone. "AI policy must encourage innovation, set appropriate restrictions, and mitigate the implications of those restrictions. Policy that doesn't will at best fall short of its goals, and at worst

martin_casado (@martin_casado) 's Twitter Profile Photo

Zoe Lofgren's (ranking member Science, Space and Tech) letter to Senator Scott Wiener voicing deep concerns on SB 1047. "I’m very concerned about the effect this legislation could have on the innovation economy of California without any clear benefit for the public." Please amplify.

Zoe Lofgren's (ranking member Science, Space and Tech) letter to <a href="/Scott_Wiener/">Senator Scott Wiener</a> voicing deep concerns on SB 1047.

"I’m very concerned about the effect this legislation could have on the innovation economy of California without any clear benefit for the public."

Please amplify.
Aleksa Gordić 🍿🤖 (@gordic_aleksa) 's Twitter Profile Photo

[📚 SlovenianGPT 🧠] Happy to announce that I'm open sourcing the best 7B LLM for the Slovenian language! (better than LLaMA3, Mistral, etc.) I hope that the Slovenian NLP community finds this one useful! HuggingFace: huggingface.co/gordicaleksa/S… SlovenianGPT is a base model and

[📚 SlovenianGPT 🧠] Happy to announce that I'm open sourcing the best 7B LLM for the Slovenian language! (better than LLaMA3, Mistral, etc.) I hope that the Slovenian NLP community finds this one useful!

HuggingFace: huggingface.co/gordicaleksa/S…

SlovenianGPT is a base model and
Karim Abou Zeid (@kacodes) 's Twitter Profile Photo

Check out our work on fine-tuning of image-conditional diffusion models for depth and normal estimation. Widely used diffusion models can be improved with single-step inference and task-specific fine-tuning, allowing us to gain better accuracy while being 200x faster!⚡ 🧵(1/6)

Check out our work on fine-tuning of image-conditional diffusion models for depth and normal estimation.

Widely used diffusion models can be improved with single-step inference and task-specific fine-tuning, allowing us to gain better accuracy while being 200x faster!⚡

🧵(1/6)
Bastian Leibe (@bastianleibe) 's Twitter Profile Photo

Great work by my team at RWTH Computer Vision Group. Up to now, the prevailing perception was that diffusion models must be slow or complex for depth estimation. We show that this is not the case. With our training scheme, faster more efficient diffusion-based models are possible.