Anil Ozturk(@anil_ozturkk) 's Twitter Profile Photo

Multimodal LLM'lerdeki halüsinasyon problemi üzerine yapılan çalışmaları derleyen bir repo. Ayrıca bu konu üzerine kendi hazırladıkları bir survey de var.

GitHub: github.com/showlab/Awesom…

Multimodal LLM'lerdeki halüsinasyon problemi üzerine yapılan çalışmaları derleyen bir repo. Ayrıca bu konu üzerine kendi hazırladıkları bir survey de var.

GitHub: github.com/showlab/Awesom…
account_circle
Alex Reibman 🖇️(@AlexReibman) 's Twitter Profile Photo

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Ever since OpenInterpreter, we've all been wondering just how effective agents can be if you give them a computer.

Now we have a proper benchmark. Let's take a look (🧵):

account_circle
David Stutz(@davidstutz92) 's Twitter Profile Photo

Excited to announce Med-Gemini, demonstrating a new SOTA on MedQA, multimodal and long-context abilities - arxiv.org/abs/2404.18416

I particularly want to highlight our full relabeling of MedQA, revealing that 7.4% of questions are unfit for evaluation. A short thread:

Excited to announce Med-Gemini, demonstrating a new SOTA on MedQA, multimodal and long-context abilities - arxiv.org/abs/2404.18416

I particularly want to highlight our full relabeling of MedQA, revealing that 7.4% of questions are unfit for evaluation. A short thread:
account_circle
Poonam Soni(@CodeByPoonam) 's Twitter Profile Photo

Meta just announced multimodal Ray-Ban glasses and it's INSANE

SPOILER: Apple Vision Pro got a huge competition.

Here are 7 powerful things you can do with Ray-Ban smart glasses:

Meta just announced multimodal Ray-Ban glasses and it's INSANE

SPOILER: Apple Vision Pro got a huge competition.

Here are 7 powerful things you can do with Ray-Ban smart glasses:
account_circle
DJ(@DuaneJRich) 's Twitter Profile Photo

New paper surveying multimodal LLM hallucinations: arxiv.org/abs/2404.18930

It creates a taxonomy of the varied ways hallucinations appear, with an intent to reveal causes and explain mitigation strategies.

It's an educational read and an admirable effort. Hallucinations,

New paper surveying multimodal LLM hallucinations: arxiv.org/abs/2404.18930

It creates a taxonomy of the varied ways hallucinations appear, with an intent to reveal causes and explain mitigation strategies.

It's an educational read and an admirable effort. Hallucinations,
account_circle
Brian Roemmele(@BrianRoemmele) 's Twitter Profile Photo

Testing this today… Meet OSWorld a first-of-its-kind scalable, real computer environment for multimodal agents, supporting task setup, execution-based evaluation, and interactive learning across operating systems. It can serve as a unified environment for evaluating open-ended

account_circle
Vensy(@vensykrishna) 's Twitter Profile Photo

Meta x Rayban glasses are now multimodal.

The glasses can translate languages, make video calls, identify objects, capture photos, and even listen to music.

account_circle
Ahmad Al-Dahle(@Ahmad_Al_Dahle) 's Twitter Profile Photo

Multimodal Meta AI is rolling out widely on Ray-Ban Meta starting today! It's a huge advancement for wearables & makes using AI more interactive & intuitive.

Excited to share more on our multimodal work w/ Meta AI (& Llama 3), stay tuned for more updates coming soon.

account_circle
Min Choi(@minchoi) 's Twitter Profile Photo

Ray-Ban Meta smart glasses just got a massive Multimodal upgrade - Meta AI with Vision

It doesn't just take speech input, it can now answer questions about what you are seeing.

Here are 8 features that is now possible

1. Ask about what you are seeing

account_circle
Gradio(@Gradio) 's Twitter Profile Photo

LLaMA Factory is a Gradio UI that helps you in fine-tuning LLMs as well as MLLMs🤯
💪 Fine-tune multimodal LLMs⚡have never been this easy! Links below 👇

LLaMA Factory is a Gradio UI that helps you in fine-tuning LLMs as well as MLLMs🤯  
💪 Fine-tune multimodal LLMs⚡have never been this easy! Links below 👇
account_circle
Tanishq Mathew Abraham, Ph.D.(@iScienceLuvr) 's Twitter Profile Photo

Google announces Med-Gemini, a family of Gemini models fine-tuned for medical tasks! 🔬

Achieves SOTA on 10 of the 14 benchmarks, spanning text, multimodal & long-context applications.

Surpasses GPT-4 on all benchmarks!

This paper is super exciting, let's dive in ↓

Google announces Med-Gemini, a family of Gemini models fine-tuned for medical tasks! 🔬

Achieves SOTA on 10 of the 14 benchmarks, spanning text, multimodal &  long-context applications. 

Surpasses GPT-4 on all benchmarks!

This paper is super exciting, let's dive in ↓
account_circle
Babak Damavandi(@babdam) 's Twitter Profile Photo

Multimodal AI on Ray-Ban Meta

It's finally out!

Today, we're thrilled to announce the launch of Multimodal AI on Ray-Ban Meta ('Meta AI with Vision') in the United States and Canada.

This marks an exciting milestone on a multi-year journey for us. Taking this idea from initial

account_circle
Alan Karthikesalingam(@alan_karthi) 's Twitter Profile Photo

Delighted to share ✨Med-Gemini✨ - our new family of multimodal models for medicine unlocking new possibilities for health - arxiv.org/pdf/2404.18416

More accurate multimodal conversations about medical images🩻, surgical videos📽️, genomics🧬, ultra-long health records📚, ECGs🫀

Delighted to share ✨Med-Gemini✨ - our new family of multimodal models for medicine unlocking new possibilities for health - arxiv.org/pdf/2404.18416

More accurate multimodal conversations about medical images🩻, surgical videos📽️, genomics🧬, ultra-long health records📚, ECGs🫀
account_circle
City of Fairfax, VA(@CityofFairfaxVA) 's Twitter Profile Photo

Tonight, Mayor Catherine Read proclaimed May 2024 as Bike Month in . Chloe Ritter, city multimodal transportation planner, accepted the proclamation. Check out the city’s Bike Month events: fairfaxva.gov/bikemonth.
@BiketoWorkDay

Tonight, Mayor Catherine Read proclaimed May 2024 as Bike Month in #FairfaxCity. Chloe Ritter, city multimodal transportation planner, accepted the proclamation. Check out the city’s Bike Month events: fairfaxva.gov/bikemonth. #LiveLifeConnected
@BiketoWorkDay
account_circle