Emanuele Fenocchi (@manu_fe23) 's Twitter Profile
Emanuele Fenocchi

@manu_fe23

Machine learning engineer | AI & data-driven systems | AGI is still very far away, but it's already here

ID: 1732753813608579072

calendar_today07-12-2023 13:28:10

69 Tweet

24 Takipçi

264 Takip Edilen

Davit (@dbuniatyan) 's Twitter Profile Photo

The Genesis Mission calls for new ways to accelerate scientific discovery. This is our contribution Multimodal search across 25M papers is a step toward science discovery that moves at the speed of curiosity. Releasing, - Visually indexed scientific paper dataset with open

Davit (@dbuniatyan) 's Twitter Profile Photo

Today excited to open-source Deep Lake PG = Postgres + Deep Lake Biggest bottleneck of AI having impact on GDP is unlocking data in Enterprises. Every AI team I know is stitching Postgres → Vector DB → Warehouse → Lakehouse → Catalog. All to give their agents basic memory

Today excited to open-source Deep Lake PG = Postgres + Deep Lake

Biggest bottleneck of AI having impact on GDP is unlocking data in Enterprises.

Every AI team I know is stitching Postgres → Vector DB → Warehouse → Lakehouse → Catalog.

All to give their agents basic memory
Davit (@dbuniatyan) 's Twitter Profile Photo

Supply chain for Memory is disrupted. Consumer RAM is now more expensive than GPUs. In-memory compute (RAM + fast NVMe) is getting expensive thanks to AI datacenter build-out. That makes memory-limited algorithms far more valuable. Most databases heavily rely on in-memory data

Supply chain for Memory is disrupted. Consumer RAM is now more expensive than GPUs.

In-memory compute (RAM + fast NVMe) is getting expensive thanks to AI datacenter build-out.

That makes memory-limited algorithms far more valuable.

Most databases heavily rely on in-memory data
Davit (@dbuniatyan) 's Twitter Profile Photo

Activeloop and Pinkbot achieved 9× faster VLM reasoning throughput with Intel newest chips, unveiled at #CES2026. As Physical AI takes on increasingly complex tasks, vision-language models enable robots not just to see, but to perceive and reason. While perception now runs in

<a href="/activeloop/">Activeloop</a> and Pinkbot achieved 9× faster VLM reasoning throughput with <a href="/intel/">Intel</a> newest chips, unveiled at #CES2026.

As Physical AI takes on increasingly complex tasks, vision-language models enable robots not just to see, but to perceive and reason.

While perception now runs in
Davit (@dbuniatyan) 's Twitter Profile Photo

I let it run for 14 hours autonomously. Output was 83 lines of highly optimized C++ code. 714 lines of tests. It fixed the bottleneck in a large codebase. Improved the benchmark 2x. Verified memory leak using ASAN. Spent $150 of LLM calls. I was not in front of screen whole

Davit (@dbuniatyan) 's Twitter Profile Photo

Was at the Physical AI Hack in SF today. Absurd talent density with hundreds of people. Every team gets an assigned robot. Energy is off the charts. 🤖🔥 Proud to sponsor with Activeloop and enable teams building on multimodal AI with Deep Lake. So much data to capture.

Emanuele Fenocchi (@manu_fe23) 's Twitter Profile Photo

Last weekend, I had the pleasure of taking part as a judge in a Physical AI hackathon that recorded over 800 registrations. Activeloop supported the event by providing Deep Lake, its GPU-based multimodal database, enabling high scalability and low latency for the development

Last weekend, I had the pleasure of taking part as a judge in a Physical AI hackathon that recorded over 800 registrations. 

<a href="/activeloop/">Activeloop</a> supported the event by providing Deep Lake, its GPU-based multimodal database, enabling high scalability and low latency for the development
Davit (@dbuniatyan) 's Twitter Profile Photo

Flash Attention 4 moved to a Python DSL to make life easier. Last night, I rewrote it back into raw C++ CUTE just to get it running on my DGX Spark. Here’s the 5x optimization arc from an sm_121 CUDA mismatch to matching PyTorch's speed from scratch.

Flash Attention 4 moved to a Python DSL to make life easier. Last night, I rewrote it back into raw C++ CUTE just to get it running on my DGX Spark.

Here’s the 5x optimization arc from an sm_121 CUDA mismatch to matching PyTorch's speed from scratch.
Intel Business (@intelbusiness) 's Twitter Profile Photo

The path to solving last-mile delivery is built on real-time perception. With #IntelCoreUltra Series 3 processors and Activeloop’s Deep Lake GPU database, Pinkbot increased VLA throughput by 9x and improved delivery outcomes. Learn more about Intel Core Ultra Series 3 at

The path to solving last-mile delivery is built on real-time perception. With #IntelCoreUltra Series 3 processors and <a href="/activeloop/">Activeloop</a>’s Deep Lake GPU database, Pinkbot increased VLA throughput by 9x and improved delivery outcomes. Learn more about Intel Core Ultra Series 3 at
Davit (@dbuniatyan) 's Twitter Profile Photo

Jensen just announced the start of the GPU-accelerated database era at #GTC26. AI runs on GPUs. But your data still runs on CPUs. That mismatch is breaking the AI stack. For the last two months, we’ve been busy solving this problem. Excited to announce Deeplake becoming the

Jensen just announced the start of the GPU-accelerated database era at #GTC26.

AI runs on GPUs. But your data still runs on CPUs.

That mismatch is breaking the AI stack.

For the last two months, we’ve been busy solving this problem.

Excited to announce Deeplake becoming the
Davit (@dbuniatyan) 's Twitter Profile Photo

Here is a banger article on how to make Postgres serverless for AI Agents. The rules: - spin up per request - scale reads and writes - drop to zero when idle - no state tied to machines - keep Postgres, rethink storage engine What we got: ~14s cold start → ~1s

Here is a banger article on how to make Postgres serverless for AI Agents.
  
The rules: 
- spin up per request 
- scale reads and writes 
- drop to zero when idle 
- no state tied to machines 
- keep Postgres, rethink storage engine  

What we got: 
~14s cold start → ~1s