Back to Home
Joseph Nordqvist

Joseph Nordqvist

Joseph founded AI News Home in 2026. He studied marketing and later completed a postgraduate program in AI and machine learning (business applications) at UT Austin’s McCombs School of Business. He is now pursuing an MSc in Computer Science at the University of York.

71 articles

Models

Gemma 4 on an iPhone: here's what a 2B model can actually do

Google shipped a real LLM to your phone. We tested Gemma 4 E2B fully offline (math, code, ethics, and history) to find where on-device AI shines and where it breaks.

April 8, 20264 min
Research

Netflix VOID: we tested their physics-aware video eraser with our own clips

Netflix's first open-source model isn't a chatbot — it's a production VFX tool that understands physics. We ran it on an H100 to see if the claims hold up.

April 6, 20269 min
Models

Gemma 4 E4B benchmark: 53 tests on a MacBook Pro M4 (24GB)

53 practical tests — vision benchmarks, 14 UI recreations, and 30 text challenges covering math, logic, essays, creative writing, and translation. All local, all free.

April 3, 202610 min
Products

Veo 3.1 Lite vs Fast: what you actually lose at a third of the price

Google launched Veo 3.1 Lite today at $0.05 per second. I ran 7 text-to-video prompts and one image-to-video test on both models to find out where Lite holds up and where it falls apart.

March 31, 20266 min
Products

I cloned my voice with Mistral's Voxtral TTS in under a minute, then tested the quantized local model

I recorded 43 seconds of audio on a MacBook, sent it to Mistral's Voxtral API, and got back natural-sounding voice clones in minutes. Then I ran the same tests on the 6-bit quantized local version and Kokoro 82M to see how they compare.

March 27, 20266 min
Products

Cohere Transcribe, the most accurate open-source speech recognition model currently available

Cohere's first speech recognition model tops the Open ASR Leaderboard as well as our own independent benchmark of four providers.

March 27, 20266 min
Analysis

Search is becoming a conversation, and Google just went first at scale

Google's expansion of Search Live to more than 200 countries this week is easy to frame as a product update. A voice feature got wider availability. But zoom out and the move looks more significant: it is the first time any company has deployed conversational, multimodal search at truly global scale.

March 26, 20267 min
Products

Google's new voice model powers Search Live expansion

Google released Gemini 3.1 Flash Live on March 26, positioning it as the company's most capable real-time audio model to date.

March 26, 20264 min
Research

Machine learning model predicts liver cancer risk from routine blood tests and health records

Researchers have developed a machine learning model that predicts an individual's risk of hepatocellular carcinoma using routine clinical information already collected during standard medical visits.

March 26, 20264 min
Models

Karpathy says LLM memory features "trying too hard"

In a pair of posts on X on March 25, Andrej Karpathy noted that the memory feature has an issue: models can treat stale, one-off queries as deep ongoing interests.

March 26, 20263 min
Industry

Nvidia-backed Reflection AI in talks to raise $2.5 billion at $25 billion valuation

Reflection AI, a startup backed by Nvidia that is building freely available AI models, is in talks to raise $2.5 billion at a pre-money valuation of $25 billion, the Wall Street Journal reported on March 25.

March 26, 20263 min
Products

Anthropic has introduced auto mode for Claude Code

Anthropic has introduced auto mode for Claude Code, a new permissions setting that lets it approve or block actions on the developer's behalf.

March 26, 20265 min
Products

Google's Lyria 3 Pro generates full songs up to three minutes long

Google DeepMind's Lyria 3 Pro music generation model can create complete tracks up to three minutes long, a sixfold increase over the 30-second clips produced by the standard Lyria 3 model that launched in February.

March 25, 20269 min
Products

OpenAI kills Sora as compute costs force a strategic retreat

OpenAI is discontinuing Sora, its AI video generation platform, just six months after the standalone app launched.

March 25, 20266 min
Products

Cursor publishes Composer 2 technical report, formally crediting Kimi K2.5 as base model

Cursor released a technical report on March 24 detailing how Composer 2 was trained, formally crediting Moonshot AI's Kimi K2.5 as the base model for the first time in an official document.

March 25, 20263 min
Research

Google finds a way to shrink AI memory usage by 4.5x without losing accuracy

A new compression technique from Google Research could make AI models cheaper to run, faster to respond, and capable of handling much longer conversations.

March 25, 20265 min
Industry

Cursor's new coding model was built on top of Kimi K2.5, a Chinese open-source base

A developer discovered that Cursor's Composer 2 model was built on top of Kimi K2.5, an open-source model from Beijing-based Moonshot AI, a detail Cursor had not mentioned anywhere in its announcement.

March 23, 20264 min
Products

Nvidia DLSS 5, fusing traditional rendering with gen AI

Nvidia DLSS 5 introduces a real-time neural rendering model that uses generative AI to enhance game visuals with photorealistic lighting and materials.

March 16, 202610 min
Research

Moonshot AI proposes new method for how LLM layers share information, claims 1.25x compute advantage

Moonshot AI's Kimi Team has published a technical report proposing a change to a fundamental component of modern AI models called residual connections.

March 16, 20265 min
Models

How ChatGPT and AlphaFold helped an Australian engineer design a personalized cancer vaccine for his dog

A Sydney tech entrepreneur with no background in biology used ChatGPT, Google DeepMind's AlphaFold, and his own machine learning expertise to design a custom mRNA cancer vaccine for his rescue dog.

March 16, 20266 min
Industry

Musk sets March 21 for Tesla's Terafab chip factory

Elon Musk set a March 21 date for what may well be one of the most ambitious semiconductor ventures ever attempted by a company outside the established chipmaking industry

March 16, 20267 min
Industry

Anthropic commits $100 million to new Claude Partner Network for enterprise adoption

Anthropic announced on March 12 that it is investing $100 million in 2026 to launch the Claude Partner Network, a program designed to help consulting firms, professional services companies, and AI specialists deploy Claude across enterprise customers.

March 15, 20265 min
Models

Claude's 1M context window is now generally available with no long-context pricing premium

Anthropic has moved the 1 million token context window for Claude Opus 4.6 and Sonnet 4.6 from beta to general availability, and eliminated the long-context pricing premium entirely.

March 14, 20265 min
Analysis

Google DeepMind celebrates AlphaGo's 10th anniversary as Pentagon AI crisis consumes the industry

Google DeepMind CEO Demis Hassabis published a warm retrospective about AlphaGo's 10th anniversary in what may very well be an act of narrative positioning at a moment when the rest of the industry is consumed by the question of who AI serves.

March 11, 20262 min
Products

Cursor Automations turns AI agents into background processes

Cursor's new Automations let AI agents run in the background, triggered by PRs, PagerDuty alerts, Slack messages, and timers, with no prompt required.

March 6, 20265 min
Models

GPT-5.4 arrives with native computer use, 1M context, and a new tool search capability

OpenAI released GPT-5.4 today, billing it as its most capable model to date for professional work, coding, and automated tasks.

March 5, 20265 min
Products

NotebookLM launches Cinematic Video Overviews, powered by a three-model AI pipeline

Google's AI research tool upgrades from narrated slides to source-grounded cinematic video using a three-model pipeline of Gemini 3, Nano Banana Pro, and Veo 3. Ultra-tier access is required, with a dedicated 20/day Cinematic limit.

March 5, 20266 min
Products

Google expands Canvas in AI Mode to all US users in Search

Google has made its Canvas feature available to all users in the United States who access AI Mode in Search, the company announced on March 4, 2026.

March 4, 20264 min
Models

Google releases Gemini 3.1 Flash-Lite

Google has released a preview of Gemini 3.1 Flash-Lite, positioning it as the fastest and most cost-efficient model in the Gemini 3 series.

March 3, 20262 min
Products

NotebookLM adds customizable infographic styles

Google's research AI ships ten style presets, a sign the competition for AI tool loyalty is moving beyond accuracy.

March 2, 20262 min
Models

Alibaba launches Qwen 3.5 small model series with sub-1B edge options

Alibaba's Qwen team has released four new small language models — Qwen3.5-0.8B, 2B, 4B, and 9B — alongside their base model counterparts, extending the Qwen3.5 architecture to compact, resource-efficient deployments.

March 2, 20264 min
Analysis

The pressure test: What February 2026 revealed about AI's safety promises

In one week, Anthropic dismantled its hardest safety promise and absorbed a federal blacklisting rather than cross its ethical red lines. Both things are true. That's the story.

March 2, 202611 min
Industry

Claude hits #1 on the US App Store after Pentagon dispute

After the Pentagon banned Anthropic for refusing to allow Claude to be used for mass surveillance and autonomous weapons, users responded by making Claude the #1 app on the US App Store, overtaking ChatGPT for the first time.

March 1, 20262 min
Policy & Ethics

OpenAI shares Pentagon contract language

OpenAI published excerpts of its agreement with the Department of Defense on Saturday morning. The language is more detailed than expected, yet more ambiguous than it first appears.

February 28, 20267 min
Policy & Ethics

OpenAI strikes Pentagon deal hours after Anthropic blacklisted — with seemingly the same terms Anthropic was punished for requesting

OpenAI CEO Sam Altman announced a deal to deploy AI on the Pentagon's classified network just hours after the administration blacklisted Anthropic for requesting the same restrictions OpenAI says it secured.

February 28, 202611 min
Industry

Amazon invests $50 billion in OpenAI as part of $110 billion funding round

OpenAI announced $110 billion in new investment at a $730 billion pre-money valuation, anchored by $50 billion from Amazon, $30 billion from SoftBank, and $30 billion from NVIDIA. The Amazon commitment comes with a broad strategic partnership covering cloud infrastructure, custom silicon, and enterprise AI distribution.

February 27, 20269 min
Policy & Ethics

The Pentagon wants Claude without guardrails. Anthropic said no. Here's where it stands

The U.S. Department of War gave Anthropic a Friday deadline to remove AI safety guardrails from Claude or face contract cancellation, a supply chain risk designation, and invocation of the Defense Production Act. Anthropic refused.

February 27, 202613 min
Products

Claude Code now remembers what it learns between sessions

Anthropic has rolled out an automatic memory feature for Claude Code that writes persistent notes about project patterns, debugging discoveries, and user preferences to a MEMORY.md file, carrying context between sessions without manual effort.

February 27, 20266 min
Industry

Meta signs $60 billion AMD chip deal, gaining a 10% stake in NVIDIA's biggest rival

Meta and AMD announced a five-year, approximately $60 billion agreement to deploy up to six gigawatts of AMD Instinct GPUs, with Meta gaining the option to acquire roughly 10% of AMD through performance-based warrants.

February 27, 20268 min
Products

Google launches Nano Banana 2, bringing pro-level image generation to its Flash model

Google DeepMind releases Nano Banana 2 (Gemini 3.1 Flash Image), delivering Pro-tier image generation quality at Flash speeds. The model rolls out across the Gemini app, Google Search, Flow, and Google Ads with advanced world knowledge, subject consistency, and improved text rendering.

February 26, 20267 min
Products

Anthropic launches Remote Control for Claude Code, enabling mobile access

Claude Code's new 'Remote Control' feature, launched February 24, 2026, allows devs to continue local terminal sessions from their phone, tablet, or any web browser.

February 25, 20264 min
Industry

Claude Code Security, an AI-powered vulnerability scanner

Anthropic announced Claude Code Security on February 20, 2026, a tool that scans codebases for security vulnerabilities and suggests patches for human review.

February 20, 20262 min
Models

Gemini 3.1 Pro claims top-tier reasoning gains

Google has released Gemini 3.1 Pro, a new version of its Pro model line that it says upgrades the core reasoning capabilities behind recent Gemini 3 advances.

February 19, 20264 min
Products

Google rolls out Lyria 3 music generation in the Gemini app

Google has begun rolling out Lyria 3 in the Gemini app, adding a built-in tool for generating short 30-second music tracks.

February 18, 20264 min
Industry

Meta signs multiyear deal for NVIDIA GPUs

Meta and NVIDIA announced a multiyear strategic partnership deploying millions of Blackwell and Rubin GPUs, standalone Grace CPUs, and Spectrum-X networking across Meta's hyperscale data centers.

February 18, 20264 min
Products

Anthropic releases Claude Sonnet 4.6, bringing near-flagship performance to its mid-tier model

Anthropic released Claude Sonnet 4.6, upgrading its mid-tier AI model with improved coding, computer use, and long-context reasoning capabilities.

February 18, 20269 min
Policy & Ethics

OpenAI adds Lockdown Mode and “Elevated Risk” labels

OpenAI introduced Lockdown Mode in ChatGPT and added “Elevated Risk” labels to reduce prompt injection data exfiltration risk.

February 17, 20267 min
Products

Manus brings its AI agent into Telegram chats

Manus has launched “Manus Agents,” a way to use its AI agent directly inside messaging apps, starting with Telegram.

February 16, 20264 min
Industry

Vatican introduces AI-assisted live translations for Mass at St. Peter’s Basilica

The Vatican is rolling out an AI-assisted live translation service intended to help attendees follow principal celebrations at St. Peter’s Basilica in their preferred language.

February 16, 20263 min
Products

OpenClaw creator Peter Steinberger joins OpenAI as OpenClaw shifts to a foundation

Peter Steinberger, the developer behind OpenClaw, announced in a blog post that he is joining OpenAI to work on “brin...

February 15, 20262 min
Industry

Speed, the rising product feature in AI

Developments in early 2026 show a change in how AI companies compete. Response speed appears to be moving from a background engineering concern to a visible product differentiator.

February 15, 20265 min
Products

Manus adds Project Skills to its AI agent platform

Manus has introduced a feature called Project Skills that lets teams curate and lock sets of reusable AI workflows at the project level.

February 14, 20263 min
Products

Google Docs adds Gemini-powered audio summaries

Google is rolling out a new Gemini feature in Google Docs that lets you listen to a short audio summary of a document.

February 14, 20262 min
Industry

Airbnb says AI now handles nearly 30% of English-language support tickets in North America

Airbnb CEO Brian Chesky said the company’s AI agent is now handling nearly 30% of English-language customer support t...

February 14, 20263 min
Models

OpenAI unveils ultra-fast “Codex-Spark” model for real-time coding

OpenAI has launched a research preview of GPT-5.3-Codex-Spark, a new version of its Codex coding model designed to respond quickly enough for real-time software work.

February 13, 20263 min
Research

DeepMind reports research-level results from Gemini Deep Think

Gemini Deep Think is being used to assist with research-level problems in mathematics, computer science, physics, and economics.

February 11, 20264 min
Products

OpenAI upgrades ChatGPT deep research with GPT-5.2 and new controls

OpenAI has rolled out a set of upgrades to Deep Research in ChatGPT, including a new backend model a...

February 10, 20263 min
Products

Alibaba launches Qwen-Image-2.0, a unified image model for 2K generation and editing

Alibaba’s Qwen team has launched Qwen-Image-2.0, a new image generation model positioned as a single system for both text-to-image generation and image editing.

February 10, 20264 min
Research

Study suggests LLM leaderboards may be more fragile than they appear

A new study from MIT researchers argues that some popular platforms used to rank large language models can be far mor...

February 10, 20264 min
Industry

OpenAI begins testing ads in ChatGPT, turning "Go" into the budget tier while Plus stays ad-free

OpenAI said on Feb. 9, 2026 that it is beginning a U.S. test of ads in ChatGPT, positioning ads as a way to support broader access while keeping answers independent.

February 9, 20264 min
Models

"Fast mode" for Claude Opus 4.6, 2.5x speed at 6x the price

A new "fast mode" for Claude Opus delivers up to 2.5 times higher output token generation speed at a significant premium: six times the standard.

February 8, 20266 min
Industry

The Cowork selloff: how an Anthropic plugin update rattled software stocks

A product update billed as a modest expansion triggered the worst week for enterprise software in years, then markets snapped back. Here's what actually happened, and what it means.

February 8, 20265 min
Industry

Goldman Sachs is building AI agents with Anthropic for accounting and compliance

Goldman Sachs has been working with Anthropic for the past six months to co-develop autonomous AI agents for back-off...

February 6, 20264 min
Products

OpenAI drops GPT-5.3-Codex right after Anthropic's Opus 4.6

OpenAI on Feb. 5, 2026, released GPT-5.3-Codex<sup referenceindex="1" refer...

February 6, 202613 min
Products

Anthropic releases Claude Opus 4.6 with 1M context window

Anthropic announced Claude Opus 4.6, the latest...

February 5, 202613 min
Policy & Ethics

International AI Safety Report warns oversight is lagging

A major international report released this week finds that artificial intelligence systems are advancing faster than the methods used to test, monitor, and govern them.

February 5, 20266 min
Products

Roblox launches AI 4D generation beta for game objects

Roblox has launched the beta release of 4D generation, a new AI system that allows creators to generate functional, interactive 3D objects from text prompts inside Roblox experiences.

February 4, 20264 min
Products

Without voice, the ChatGPT macOS app falls short

On January 15, 2026, OpenAI removed the Voice feature from the ChatGPT macOS desktop app. While voice remains available on the web, mobile, and Windows, its absence on macOS has changed how the tool fits into daily academic and professional work.

February 3, 20263 min
Products

OpenAI releases standalone Codex app for macOS

OpenAI released a dedicated macOS application for Codex, its AI-powered coding assistant. The app is designed to serve as a "command center" that makes it easy for software developers to manage multiple AI agents at once.

February 2, 20265 min
Industry

OpenAI shares how its internal data agent works

A look at OpenAI’s in-house data agent, including how it uses context, access controls, and evals to support internal analytics.

January 29, 20263 min
Industry

Inside Microsoft’s Maia 200: Why This Chip Matters More Than You Think

Microsoft’s Maia 200 is not trying to be the fastest chip on paper. It is trying to be the cheapest place to run AI at scale. That distinction matters. While most attention still goes to training massive models, the real cost of AI now sits in inference, the endless stream of everyday queries powering copilots, chatbots, and search tools. Maia 200 is Microsoft’s attempt to own that layer of the stack, quietly reducing dependence on Nvidia while reshaping how Azure prices and delivers AI. If it works, the biggest shift will not be technical, it will be economic.

January 27, 20265 min
Joseph Nordqvist — Author | AI News Home