OpenAdaptAI (@openadaptai) 's Twitter Profile
OpenAdaptAI

@openadaptai

Open source AI that automates tasks in desktop apps by observing human demonstrations. Mac/Win compatible. github.com/OpenAdaptAI/Op…

ID: 1657481712471867394

linkhttps://openadapt.ai/ calendar_today13-05-2023 20:23:34

66 Tweet

412 Followers

1 Following

louis030195 (@louis030195) 's Twitter Profile Photo

within the next year, AI will be able to ingest everything that ever happened on your computer check out this cool video about tools enabling this: OpenAdaptAI @tooluseai Mike Bird Interpreter Ty Richard Abrich and :) youtube.com/watch?v=VgJ0Cg…

Richard Abrich (@abrichr) 's Twitter Profile Photo

OpenAdaptAI Julien Chaumond Microsoft Amazon Web Services Docker (venv) % python client.py http://34.206.53.77:7861 ~/Desktop/screenshot.png Loaded as API: http://34.206.53.77:7861/ ✔ Parsed content: ... 2024-10-29 11:13:07.414 | INFO | __main__:predict:84 - Output image saved to: output_image.png

<a href="/OpenAdaptAI/">OpenAdaptAI</a> <a href="/julien_c/">Julien Chaumond</a> <a href="/Microsoft/">Microsoft</a> <a href="/AWS/">Amazon Web Services</a> <a href="/Docker/">Docker</a> (venv) % python client.py http://34.206.53.77:7861 ~/Desktop/screenshot.png
Loaded as API: http://34.206.53.77:7861/ ✔
Parsed content:
...
2024-10-29 11:13:07.414 | INFO     | __main__:predict:84 - Output image saved to: output_image.png
Richard Abrich (@abrichr) 's Twitter Profile Photo

Another day, another breakthrough: Apply DCT to convert actions into frequency components, quantize them prioritizing low frequencies, then use autoregressive prediction in frequency order (low to high) to generate actions. From Physical Intelligence. May generalize to OpenAdaptAI.

Another day, another breakthrough:

Apply DCT to convert actions into frequency components, quantize them prioritizing low frequencies, then use autoregressive prediction in frequency order (low to high) to generate actions.

From <a href="/physical_int/">Physical Intelligence</a>. May generalize to <a href="/OpenAdaptAI/">OpenAdaptAI</a>.
Richard Abrich (@abrichr) 's Twitter Profile Photo

github.com/deepseek-ai/De… > DeepSeek-R1 achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks. huggingface.co/deepseek-ai/De… We can run frontier models at home now.

Yujia Qin@ICLR2025 (@tsingyoga) 's Twitter Profile Photo

Check out our latest GUI Agent -> UI-TARS 🥳 A vision-language model surpasses GPT-4o & Claude Computer-Use Paper, code, model ckpt, desktop APP are now open-sourced~ github.com/bytedance/UI-T… github.com/bytedance/UI-T…

Check out our latest GUI Agent -&gt; UI-TARS 🥳
A vision-language model surpasses GPT-4o &amp; Claude Computer-Use

Paper, code, model ckpt, desktop APP are now open-sourced~
github.com/bytedance/UI-T…
github.com/bytedance/UI-T…
Richard Abrich (@abrichr) 's Twitter Profile Photo

Qwen2.5-VL is the first open source multimodal model that appears to be able to accurately generate bounding box coordinates 🚀 Thank you Qwen ! Excited to integrate this in OpenAdaptAI x.com/Alibaba_Qwen/s…

Qwen2.5-VL is the first open source multimodal model that appears to be able to accurately generate bounding box coordinates 🚀

Thank you <a href="/Alibaba_Qwen/">Qwen</a> ! Excited to integrate this in <a href="/OpenAdaptAI/">OpenAdaptAI</a> 

x.com/Alibaba_Qwen/s…
Richard Abrich (@abrichr) 's Twitter Profile Photo

I prompted OpenAI's ChatGPT o3-mini-high and deepseek's R1 to implement code to for deploying Qwen's Qwen2.5-VL. Both agree that R1's implementation is "more comprehensive" and better "for production systems".

I prompted <a href="/openai/">OpenAI</a>'s ChatGPT o3-mini-high and <a href="/DeepSeek/">deepseek</a>'s R1 to implement code to  for deploying <a href="/alibaba_qwen/">Qwen</a>'s Qwen2.5-VL.

Both agree that R1's implementation is "more comprehensive" and better "for production systems".
Rico Pagliuca (@pagilgukey) 's Twitter Profile Photo

Anybody looking for a GUI+ICL-->MCP library should definitely check out OmniMCP which puts Microsoft's Omniparser to use in generating GUI tool use APIs. Early days but pretty neat omnimcp.openadapt.ai

Xinyuan Wang (@xywang626) 's Twitter Profile Photo

We are super excited to release OpenCUA — the first from 0 to 1 computer-use agent foundation model framework and open-source SOTA model OpenCUA-32B, matching top proprietary models on OSWorld-Verified, with full infrastructure and data. 🔗 [Paper] arxiv.org/abs/2508.09123 📌

We are super excited to release OpenCUA — the first from 0 to 1 computer-use agent foundation model framework and open-source SOTA model OpenCUA-32B, matching top proprietary models on OSWorld-Verified, with full infrastructure and data.

🔗 [Paper] arxiv.org/abs/2508.09123 
📌
Xinyuan Wang (@xywang626) 's Twitter Profile Photo

🙌 Acknowledgement: We thank Yu Su (Hiring @Neurips), Caiming Xiong , and the anonymous reviewers for their insightful discussions and valuable feedback. We are grateful to Moonshot AI for providing training infrastructure and annotated data. We also sincerely appreciate Jin Zhang, Hao Yang,

JJ (@josephjacks_) 's Twitter Profile Photo

Don’t get me wrong, I love Opus 4.6 But there is no fucking way I’m letting Anthropic control my computer That’s why we have open source