OmniAI (@getomni_ai) 's Twitter Profile
OmniAI

@getomni_ai

🪄 Next generation document intelligence

ID: 1659317053147283456

linkhttps://getomni.ai calendar_today18-05-2023 21:56:46

314 Tweet

603 Followers

179 Following

OmniAI (@getomni_ai) 's Twitter Profile Photo

We hit 7,000 stars on our OCR library! Only 3k to go till the double digits. It's been awesome seeing the community traction on Zerox. Turns out everyone's got documents! Up next on the roadmap: - Structured schema extraction - Llama 3.2 support - Dockerized deployment

We hit 7,000 stars on our OCR library! Only 3k to go till the double digits.

It's been awesome seeing the community traction on Zerox. Turns out everyone's got documents!

Up next on the roadmap:
- Structured schema extraction
- Llama 3.2 support
- Dockerized deployment
OmniAI (@getomni_ai) 's Twitter Profile Photo

Exciting updates are coming from OmniAI this Q1. Here’s what’s on the way: 1️⃣ Document fine-tuning 2️⃣ Benchmark 3️⃣ Excel & Google Sheets Plugin We’re also growing! We’re hiring for both growth and engineering roles. Can’t wait to share all we’ve been working on. Happy Q1! 🚀

OmniAI (@getomni_ai) 's Twitter Profile Photo

Wow, 7 days since the last post and Zerox went from 8,000 to 9,000 stars! 🚀 We didn't even have time to ship the extra features we promised last week 🤣 Coming soon: - Structured schema extraction - Edge detection & cropping - More model options (including Deepseek and Qwen!)

Wow, 7 days since the last post and Zerox went from 8,000 to 9,000 stars! 🚀 We didn't even have time to ship the extra features we promised last week 🤣

Coming soon:
- Structured schema extraction
- Edge detection & cropping
- More model options (including Deepseek and Qwen!)
OmniAI (@getomni_ai) 's Twitter Profile Photo

New year, new space for OmniAI! Today’s our first official day in the new office 🚚 Super excited to make great memories, close big deals, and host amazing events! We’re still furnishing it, but of course we have all the necessities covered - coffee, monitors, and whiteboards ☕️

New year, new space for OmniAI! Today’s our first official day in the new office 🚚 Super excited to make great memories, close big deals, and host amazing events!

We’re still furnishing it, but of course we have all the necessities covered - coffee, monitors, and whiteboards ☕️
OmniAI (@getomni_ai) 's Twitter Profile Photo

We just added Gemini 2.0 Flash to Zerox! ⚡️ These are early results from our VLM benchmark. While it still has a ways to go on the accuracy side (about ~80%), it easily beats GPT 4o and other traditional OCR providers like AWS Textract and Unstructured. And it's cheap!

We just added Gemini 2.0 Flash to Zerox! ⚡️

These are early results from our VLM benchmark. While it still has a ways to go on the accuracy side (about ~80%), it easily beats GPT 4o and other traditional OCR providers like AWS Textract and Unstructured.

And it's cheap!
OmniAI (@getomni_ai) 's Twitter Profile Photo

What do you do when someone sends you a PDF with 12,640 rows you've got to extract? OmniAI automates it in minutes, no code! Companies spend hundreds of engineering hours figuring out how to parse that data. We can’t stop PDFs, but at least we can turn them into real data!

OmniAI (@getomni_ai) 's Twitter Profile Photo

Introducing the Omni OCR Benchmark, the most comprehensive evaluation of OCR tools. We evaluated traditional OCR providers and multimodal LLMs across 1,000 documents for accuracy, cost, and speed with an open-source approach. See how each provider ranks: getomni.ai/ocr-benchmark

OmniAI (@getomni_ai) 's Twitter Profile Photo

It’s difficult to measure document extraction accuracy! Our benchmark compares the OCR / extraction JSON to the ground truth JSON, and then calculates the number of JSON differences divided by the total fields in the ground truth JSON. Comment with providers you’d like to see.

It’s difficult to measure document extraction accuracy!

Our benchmark compares the OCR / extraction JSON to the ground truth JSON, and then calculates the number of JSON differences divided by the total fields in the ground truth JSON.

Comment with providers you’d like to see.
OmniAI (@getomni_ai) 's Twitter Profile Photo

Excited to welcome Terry as OmniAI’s founding Growth! He built and scaled BuildStream (YC S19) and brings a ton of experience in driving growth from 0 to 1. 💡Fun fact: The twinning at Omni continues. Terry isn’t a twin himself, but he’s the proud dad of twin daughters and son!

Excited to welcome Terry as OmniAI’s founding Growth! He built and scaled BuildStream (YC S19) and brings a ton of experience in driving growth from 0 to 1.

💡Fun fact: The twinning at Omni continues. Terry isn’t a twin himself, but he’s the proud dad of twin daughters and son!
OmniAI (@getomni_ai) 's Twitter Profile Photo

A new round of OpenAI models (4.1 series) came out today and GPT is finally back in the running. But not with the model you'd expect... Turns out 4.1-mini dominates when it comes to document understanding. Gemini 2.5 pro is still the top ranked but at a higher cost and latency.

A new round of OpenAI models (4.1 series) came out today and GPT is finally back in the running. But not with the model you'd expect...

Turns out 4.1-mini dominates when it comes to document understanding.

Gemini 2.5 pro is still the top ranked but at a higher cost and latency.
OmniAI (@getomni_ai) 's Twitter Profile Photo

Bounding boxes for extracted fields are here! 🔍 Now you can see exactly where each extracted field came from. 🔍 It draws a box around the exaction 📄 Auto-scrolls to it ⚙️ Works on all documents Personally, this has made reviewing extractions way easier!

OmniAI (@getomni_ai) 's Twitter Profile Photo

One of the trickier document types we’ve worked on lately are multi-page tables. Header’s on page 1. Data's on page 5. Agentic document processing tracks header context across pages and keeps your table structured from start to finish. Drop a comment to see it on your documents

OmniAI (@getomni_ai) 's Twitter Profile Photo

We just rolled out a new agentic loading experience! And you can actually see it thinking. 🔄 Each document flows through layout correction, contrast enhancement, OCR selection, structure recognition, and more. Check it out, and watch it think before your eyes. 👀

OmniAI (@getomni_ai) 's Twitter Profile Photo

One real estate finance customer cut manual review time by 80% 🤯 Rent rolls are huge multi-page tables with dozens of columns. Most tools fail: ❌ Headers only on the first page ❌ Rows break across pages ❌ Layouts shift mid-doc Agentic document processing solves this!