Ray Guirguis (@rayguirguisai)'s Twitter Profile
Ray Guirguis

@rayguirguisai

ID: 1744646413735059456

Joined: 09-01-2024 09:05:13

323 Tweets

53 Followers

52 Following

Alvaro Cintas (@dr_cintas)

What a crazy week in AI 🤯
- Cursor Phone App
- Krea AI Modify Video
- Google launches Doppl
- Perplexity New Max Tier
- X new AI note taking API
- AI’s Breakthrough in Fertility
- Morphic One-Shot Character
- Meta & OpenAI recruiting drama
Here’s EVERYTHING you need to know:

Eyisha Zyer (@eyishazyer)

Amazon just dropped Kiro. It not only writes code but also creates clear specifications, generates and executes tasks, and even detects bugs. Here are some wild examples (and yes, it’s FREE):

Akshay 🚀 (@akshay_pachaar)

What is context engineering❓ And why is everyone talking about it...👇 Context engineering is rapidly becoming a crucial skill for AI engineers. It's no longer just about clever prompting; it's about the systematic orchestration of context. 🔷 The Problem: Most AI agents

Ray Guirguis (@rayguirguisai)

What’s “context engineering”? Designing how an AI gets the right info, at the right moment, and what it can do with it. 🧠 Why AIs stumble Not (only) “not enough knowledge,” but fuzzy instructions, missing/irrelevant context, no grounding, poor tool wiring, or capability limits.

Nicolas Camara (@nickscamara_)

We're releasing an open source Lovable clone next week 👀 Paste any website URL and AI agents will instantly create a working clone you can build on top of. Powered by Firecrawl, E2B, and Groq Inc👇

Ray Guirguis (@rayguirguisai)

🚀 Big news from OpenAI: GPT-5 shows a massive jump in τ²-bench tool-use accuracy for Telecom 📈 In Telecom, GPT-5 hits 97%, compared to 58% (o3) and 34% (GPT-4.1). For context, τ²-bench is a challenging test of multi-turn, real-world tool usage across Telecom, Retail, and

Alex Prompter (@alex_prompter)

I tested ChatGPT 5 and Grok 4 with the same critical prompts. The results will blow your mind. ChatGPT 5 vs. Grok 4 (video demos are included)

Alex Prompter (@alex_prompter)

1. Realistic Physics Game (Hexagon Test) Prompt: Create a HTML, CSS, and javascript where a ball is inside a rotating hexagon. The ball is affected by Earth’s gravity and friction from the hexagon walls. The bouncing must appear realistic. → Tests physics simulation, code

Alex Reibman 🖇️ (@alexreibman)

Cognition invited SF’s most ambitious builders to push GPT-5, Claude Opus, and Gemini Pro to their limits. And they invited Andrej Karpathy to pick the very best. Here are the top demos from the Cognition Applied AI Hackathon (🧵):

Ray Guirguis (@rayguirguisai)

Exciting topic to learn: LLM orchestration 🚀 It helps Dev teams design complex workflows & Support teams trace/debug issues faster across agents & prompts — including monitoring costs, prompt quality, and stability. Great overview of frameworks (LangChain, AutoGen, LlamaIndex,

OpenAI Developers (@openaidevs)

We’re releasing new Codex features to make it a more effective coding collaborator:
- A new IDE extension
- Easily move tasks between the cloud and your local environment
- Code reviews in GitHub
- Revamped Codex CLI
Powered by GPT-5 and available through your ChatGPT plan.

elvis (@omarsar0)

Better use of retrieval budget With the same latency, REFRAG can process more passages than a baseline model and outperform it across 16 RAG tasks, especially when the retriever is weak (messy or noisy results). Beyond RAG, it boosts multi-turn dialog (keeping more history

Cursor (@cursor_ai)

Cursor can now control your browser. Agent can take screenshots, improve UI, and debug client issues. Try our early preview with Sonnet 4.5.