CorneliusWefelscheid (@corny335) 's Twitter Profile
CorneliusWefelscheid

@corny335

Founder of deepkomma.de Founder of plugovr.ai Senior Manager for AI Perception at Aptiv

ID: 16589906

linkhttp://plugovr.ai calendar_today04-10-2008 09:20:26

74 Tweet

13 Takipçi

116 Takip Edilen

AI at Meta (@aiatmeta) 's Twitter Profile Photo

Today we released Meta Spirit LM — our first open source multimodal language model that freely mixes text and speech. Many existing AI voice experiences today use ASR to techniques to process speech before synthesizing with an LLM to generate text — but these approaches

CorneliusWefelscheid (@corny335) 's Twitter Profile Photo

Yesterday we released the first public beta of plugovr.ai. #plugovr easily interacts with all your applications. Use #anthropic haiku model where ever you need it or try llama 3.2 as local model. Let us know what features you would like to get next.

CorneliusWefelscheid (@corny335) 's Twitter Profile Photo

It’s insane how good kosmos 2.5 is on screenshots. I wish someone would provide a #rust or #candle implementation. github.com/microsoft/unil…

CorneliusWefelscheid (@corny335) 's Twitter Profile Photo

We released PlugOvr.ai now also for macOS. PlugOvr.ai can now be used on all major OSes (Windows, Linux and macOS). For macOS you get the full Metal speedup if you use Llama 3.2 1B or 3B as local LLM.

Ai2 (@allen_ai) 's Twitter Profile Photo

Meet Tülu 3 -- a set of state-of-the-art instruct models with fully open data, eval code, and training algorithms. We invented new methods for fine-tuning language models with RL and built upon best practices in the community to scale synthetic instruction and preference data.

Meet Tülu 3 -- a set of state-of-the-art instruct models with fully open data, eval code, and training algorithms.

We invented new methods for fine-tuning language models with RL and built upon best practices in the community to scale synthetic instruction and preference data.
Alex Reibman 🖇️ (@alexreibman) 's Twitter Profile Photo

OpenAI’s biggest competitor just gave AI the ability to control computers We gave 250+ hackers 24 hours to see what it’s capable of Here’s what we saw at the Nexgen Computer Use Agents Hackathon w/ Anthropic + AgentOps 🖇️ Notable Capital at AGI House SF (🧵):

OpenAI’s biggest competitor just gave AI the ability to control computers

We gave 250+ hackers 24 hours to see what it’s capable of

Here’s what we saw at the Nexgen Computer Use Agents Hackathon w/ <a href="/AnthropicAI/">Anthropic</a> + <a href="/AgentOpsAI/">AgentOps 🖇️</a> <a href="/notablecap/">Notable Capital</a> at <a href="/AGIHouseSF/">AGI House SF</a> (🧵):
CorneliusWefelscheid (@corny335) 's Twitter Profile Photo

Thanks to ollama and #ollama-rs it was super easy to integrate #ollama in #plugovr. With the newest release v0.1.65 you can use all ollama models inside plugovr.ai.

Ai2 (@allen_ai) 's Twitter Profile Photo

Meet OLMo 2, the best fully open language model to date, including a family of 7B and 13B models trained up to 5T tokens. OLMo 2 outperforms other fully open models and competes with open-weight models like Llama 3.1 8B — As always, we released our data, code, recipes and more 🎁

Meet OLMo 2, the best fully open language model to date, including a family of 7B and 13B models trained up to 5T tokens. OLMo 2 outperforms other fully open models and competes with open-weight models like Llama 3.1 8B — As always, we released our data, code, recipes and more 🎁
CorneliusWefelscheid (@corny335) 's Twitter Profile Photo

🚀 A sneak peek at a new feature coming soon to PlugOvr.ai: leveraging #Anthropic's #computer_use capabilities for screen understanding to automate form filling. 👉 What features would you love to see next? Drop your suggestions in the comments—we’re all ears! 👇

CorneliusWefelscheid (@corny335) 's Twitter Profile Photo

We just released PlugOvr to the open source community. github.com/PlugOvr-ai/Plu… PlugOvr is an AI Assistant that lets you directly interact with your favorite applications. You can define templates for your own use cases and select individual LLMs per template.

CorneliusWefelscheid (@corny335) 's Twitter Profile Photo

Eine neue KI-Generation soll den Sprung von der Sprache zur Handlung schaffen. Microsoft zeigt mit einem ersten "Large Action Model", wie KI-Systeme Windows-Programme selbstständig bedienen können. the-decoder.de/microsoft-zeig…

Philipp Schmid (@_philschmid) 's Twitter Profile Photo

New LLMs that control UIs! ByteDance Research releases UI-TARS, fine-tuned GUI agent that integrates reasoning, and action capabilities into a single vision-language model. Think of computer use but open. 👀 TL;DR; 3️⃣ Available in 3 sizes: 2B, 7B, and 72B parameters 🧠

Andi Marafioti (@andimarafioti) 's Twitter Profile Photo

Fuck it, today we're open-sourcing the codebase used to train SmolVLM from scratch on 256 H100s🔥 Inspired by our team's effort to open-source DeepSeek's R1 training, we are releasing the training and evaluation code on top of the weights 🫡 Now you can train any of our

Fuck it, today we're open-sourcing the codebase used to train SmolVLM from scratch on 256 H100s🔥
Inspired by our team's effort to open-source DeepSeek's R1 training, we are releasing the training and evaluation code on top of the weights 🫡
Now you can train any of our
Paul Couvert (@itspaulai) 's Twitter Profile Photo

Microsoft just released an impressive tool OmniParser V2 can turn any LLM into an agent capable of using a computer 🔥 You can enable GPT-4o, DeepSeek R1, Sonnet 3.5, Qwen... to understand what's on your screen and take actions. 100% free & open source

Ai2 (@allen_ai) 's Twitter Profile Photo

Today we’re releasing a prototype of Genesys, an autonomous multi-agent LLM discovery system that aims to discover new types of language model architectures. We found Genesys can discover novel architectures competitive with the industry-standard transformer. 🧵

Today we’re releasing a prototype of Genesys, an autonomous multi-agent LLM discovery system that aims to discover new types of language model architectures. We found Genesys can discover novel architectures competitive with the industry-standard transformer. 🧵
Kasey Zhang (@_weexiao) 's Twitter Profile Photo

It’s easy to fine-tune small models w/ RL to outperform foundation models on vertical tasks. We’re open sourcing Osmosis-Apply-1.7B: a small model that merges code (similar to Cursor’s instant apply) better than foundation models. Links to download and try out the model below!

Alibaba Tongyi_Lab (@labtongyi96898) 's Twitter Profile Photo

1/7 We're launching Tongyi DeepResearch, the first fully open-source Web Agent to achieve performance on par with OpenAI's Deep Research with only 30B (Activated 3B) parameters! Tongyi DeepResearch agent demonstrates state-of-the-art results, scoring 32.9 on Humanity's Last Exam,

1/7 We're launching Tongyi DeepResearch, the first fully open-source Web Agent to achieve performance on par with OpenAI's Deep Research with only 30B (Activated 3B) parameters! Tongyi DeepResearch agent demonstrates state-of-the-art results, scoring 32.9 on Humanity's Last Exam,