
Karpathy says LLM memory features "trying too hard"
In a pair of posts on X on March 25, Andrej Karpathy noted that the memory feature has an issue: models can treat stale, one-off queries as deep ongoing interests.

In a pair of posts on X on March 25, Andrej Karpathy noted that the memory feature has an issue: models can treat stale, one-off queries as deep ongoing interests.

Moonshot AI's Kimi Team has published a technical report proposing a change to a fundamental component of modern AI models called residual connections.

A Sydney tech entrepreneur with no background in biology used ChatGPT, Google DeepMind's AlphaFold, and his own machine learning expertise to design a custom mRNA cancer vaccine for his rescue dog.

Anthropic has moved the 1 million token context window for Claude Opus 4.6 and Sonnet 4.6 from beta to general availability, and eliminated the long-context pricing premium entirely.

OpenAI released GPT-5.4 today, billing it as its most capable model to date for professional work, coding, and automated tasks.

Alibaba's Qwen team has released four new small language models — Qwen3.5-0.8B, 2B, 4B, and 9B — alongside their base model counterparts, extending the Qwen3.5 architecture to compact, resource-efficient deployments.

Google has released Gemini 3.1 Pro, a new version of its Pro model line that it says upgrades the core reasoning capabilities behind recent Gemini 3 advances.

Anthropic released Claude Sonnet 4.6, upgrading its mid-tier AI model with improved coding, computer use, and long-context reasoning capabilities.

Gemini Deep Think is being used to assist with research-level problems in mathematics, computer science, physics, and economics.

OpenAI has rolled out a set of upgrades to Deep Research in ChatGPT, including a new backend model a...

A new study from MIT researchers argues that some popular platforms used to rank large language models can be far mor...

A new "fast mode" for Claude Opus delivers up to 2.5 times higher output token generation speed at a significant premium: six times the standard.

OpenAI on Feb. 5, 2026, released GPT-5.3-Codex<sup referenceindex="1" refer...

Anthropic announced Claude Opus 4.6, the latest...