Pawel Bojkowski (@pbojkowski) Twitter Tweets • TwiCopy

Pawel Bojkowski

a month ago

Active Context Compression: Autonomous Memory Management in LLM Agents Abstract: Large Language Model (LLM) agents struggle with long-horizon software engineering tasks due to “Context Bloat.” As interaction history grows, computational costs explode... arxiv.org/pdf/2601.07190

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Pawel Bojkowski

@pbojkowski

a month ago

SimpleMem: Efficient Lifelong Memory for LLM Agents Abstract: To support reliable long-term interaction in complex environments, LLM agents require memory systems that efficiently manage historical experiences. Existing approaches either retain full.... arxiv.org/pdf/2601.02553

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Pawel Bojkowski

@pbojkowski

25 days ago

Agentic Reasoning for Large Language Models Abstract: Reasoning is a fundamental cognitive process underlying inference, problem-solving, and decision-making. While large language models (LLMs) demonstrate strong reasoning capabilities in closed-world... arxiv.org/pdf/2601.12538

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Pawel Bojkowski

@pbojkowski

25 days ago

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces Abstract: AI agents may soon become capable of autonomously completing valuable, long-horizon tasks in diverse domains. Current benchmarks either do not measure...... arxiv.org/pdf/2601.11868

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Pawel Bojkowski

@pbojkowski

25 days ago

Remapping and navigation of an embedding space via error minimization: a fundamental organizational principle of cognition in natural and artificial systems Abstract: The emerging field of diverse intelligence seeks an integrated view of problem solving arxiv.org/pdf/2601.14096

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Tom Warren

@tomwarren

23 days ago

Anthropic just took a big swipe at OpenAI's decision to put ads in ChatGPT. Anthropic is airing ads mocking ChatGPT ads during the Super Bowl, and they're hilarious 😅 Anthropic is also committing to no ads in Claude theverge.com/ai-artificial-…

thumb_up_off_alt23,23K

chat_bubble_outline658

repeat2,2K

shareShare

Alex Patrascu

@maxescu

23 days ago

Kling 3.0 is here! And it comes with two game-changing updates: Kling 3.0 and Omni 3.0 Features: - 3-15s with multi-shot sequences - Native audio with multiple characters - Upload/record video character as reference + consistent voices Available now on Higgsfield AI 🧩

thumb_up_off_alt1,1K

chat_bubble_outline161

repeat226

shareShare

Claude

@claudeai

22 days ago

Introducing Claude Opus 4.6. Our smartest model got an upgrade. Opus 4.6 plans more carefully, sustains agentic tasks for longer, operates reliably in massive codebases, and catches its own mistakes. It’s also our first Opus-class model with 1M token context in beta.

thumb_up_off_alt26,26K

chat_bubble_outline1,1K

repeat3,3K

shareShare

Pawel Bojkowski

@pbojkowski

22 days ago

Building a C compiler with a team of parallel Claudes We (Anthropic) tasked Opus 4.6 using agent teams to build a C Compiler, and then (mostly) walked away. Here's what it taught us about the future of autonomous software development. anthropic.com/engineering/bu…

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Pawel Bojkowski

@pbojkowski

22 days ago

Scaling Multiagent Systems with Process Rewards Abstract: While multiagent systems have shown promise for tackling complex tasks via specialization, finetuning multiple agents simultaneously faces two key challenges.............. arxiv.org/pdf/2601.23228

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Pawel Bojkowski

@pbojkowski

22 days ago

Nice!

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭

@elder_plinius

20 days ago

PROTECC OPUS 4.6 AT ALL COSTS 🪄 THE MAGIC IS BACK ✨

thumb_up_off_alt656

chat_bubble_outline23

repeat14

shareShare

Pawel Bojkowski

@pbojkowski

19 days ago

Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation Abstract Agent memory systems often adopt the standard Retrieval-Augmented Generation (RAG) pipeline, yet its underlying assumptions differ in this setting. RAG targets large......... arxiv.org/pdf/2602.02007

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Pawel Bojkowski

@pbojkowski

19 days ago

InfMem: Learning System-2 Memory Control for Long-Context Agent Abstract: Reasoning over ultra-long documents requires synthesizing sparse evidence scattered across distant segments under strict memory constraints. While streaming agents enable scalable arxiv.org/pdf/2602.02704

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Pawel Bojkowski

@pbojkowski

19 days ago

A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces Abstract: Frontier language models have demonstrated strong reasoning and long-horizon tool-use capabilities. However, existing RAG systems fail to leverage..... arxiv.org/pdf/2602.03442

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Pawel Bojkowski

@pbojkowski

19 days ago

Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems Abstract: While existing multi-agent systems (MAS) can handle complex problems by enabling collaboration among multiple agents, they are often highly task-specific, relying....... arxiv.org/pdf/2602.03695

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare