
The Technical Anatomy of Model Extraction in 2026 (The Great AI Theft of the Century?)
Article Brief
Why this article matters
Model extraction has industrialized over the past two years. Attackers now run phased pipelines of synthetic queries, hydra-style account multiplexing, and targeted logit harvesting to distill proprietary models at scale. This post presents the mathematics behind knowledge distillation and temperature manipulation, explains why watermarking fails under character-level perturbations, and covers the 2026 disclosures from Anthropic, Google GTIG, and NDSS. The defense section goes beyond API rate limits to behavioral fingerprinting, semantic embedding clustering, and optional logit poisoning—giving you a structured mental model for why perimeter controls alone are insufficient.
On February 23, 2026, Anthropic published a disclosure that sent shockwaves through the AI industry: three laboratories—DeepSeek, Moonshot AI (Kimi), and MiniMax—had been running industrial-scale distillation campaigns against Claude. The numbers were staggering: over 16 million exchanges funneled through approximately 24,000 fraudulent accounts, all in violation of Anthropic's terms of service and regional access restrictions. Eleven days earlier, Google's Threat Intelligence Group (GTIG) had published its own AI Threat Tracker, confirming a parallel surge in model extraction attempts targeting Gemini, including a campaign of over 100,000 prompts designed to coerce the model into revealing its internal reasoning traces.
This is no longer theoretical academic research. Model Extraction (or Model Theft) has matured into an industrialized attack vector with nation-state implications.
The Asymmetric Economics of Extraction
"Distillation can be used to acquire powerful capabilities from other labs in a fraction of the time, and at a fraction of the cost, that it would take to develop them independently." — Anthropic, Detecting and Preventing Distillation Attacks, February 2026
To understand how these heists occur, we must move beyond the buzzwords and examine the underlying mathematics, the extraction pipelines, and the cryptographic vulnerabilities of current defensive mechanisms.
1. The Mathematics of Theft: Knowledge Distillation
Model extraction relies heavily on the concept of Knowledge Distillation, originally formalized by Geoffrey Hinton, Oriol Vinyals, and Jeff Dean in their seminal 2015 paper "Distilling the Knowledge in a Neural Network." The goal is to transfer the knowledge from a massive, proprietary "Teacher" model (for example, GPT-4, Gemini, Claude) into a smaller, open-weights "Student" model (like Llama-3 or Mistral) that the attacker controls.
The attacker doesn't just want the final answer (the Hard Label); they want the Soft Labels—the probability distribution across all possible next tokens.
Why Soft Labels Matter
When an LLM generates a response, it calculates a probability for every token in its vocabulary (this is why LLMs are not deterministic machines: probability and entropy are intrinsic to how they generate text). For example, if asked "The capital of France is...", the model might output:
- Paris: 98.1%
- Lyon: 1.2%
- Marseille: 0.5%
These probabilities (derived from the model's logits before the softmax function) contain what Hinton called the "dark knowledge" of the model. They reveal how the model relates concepts to one another. If an API exposes these probabilities (often called logprobs), the attacker's job becomes exponentially easier.
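The distillation objective can be sketched in a few lines. The following NumPy toy (not any lab's actual training code; the logit values are invented) shows how a student is pushed to match the teacher's softened distribution, including the T² scaling Hinton et al. describe:

```python
import numpy as np

def softmax(logits, T=1.0):
    # Temperature-scaled softmax; numerically stabilized by shifting the max.
    z = np.asarray(logits, dtype=float) / T
    e = np.exp(z - z.max())
    return e / e.sum()

def distillation_loss(teacher_logits, student_logits, T=2.0):
    # KL divergence between teacher and student soft labels at temperature T,
    # scaled by T^2 as in Hinton et al. (2015).
    p = softmax(teacher_logits, T)
    q = softmax(student_logits, T)
    return float(T * T * np.sum(p * np.log(p / q)))
```

When the student's logits match the teacher's, the loss is zero; every deviation in the low-probability tail contributes gradient signal. That tail is exactly the "dark knowledge" that logprob-exposing APIs leak.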
The 2025 survey by Zhao et al. formalizes this: Model Extraction Attacks (MEAs) against LLMs fall into three categories—functionality extraction (behavior replication), training data extraction (recovering training examples), and prompt-targeted attacks (stealing valuable system prompts). Distillation attacks like those disclosed by Anthropic target the first category at industrial scale, by far the most sensitive of the three.
The Temperature Scaling Exploit
Attackers often manipulate the temperature parameter in the API to flatten the probability distribution. By increasing the temperature, they force the Teacher model to reveal more information about its lower-probability token choices, exposing the intricate decision boundaries of the proprietary neural network. This is precisely the mechanism Hinton described: a higher temperature T in the softmax function produces softer probability distributions that transfer more information per query.
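A quick numerical illustration (toy logits, not taken from any real model) makes the effect concrete: raising T preserves the ranking but redistributes probability mass into the tail, where the decision-boundary information lives.

```python
import numpy as np

def softmax(logits, T=1.0):
    # Temperature-scaled softmax.
    z = np.asarray(logits, dtype=float) / T
    e = np.exp(z - z.max())
    return e / e.sum()

logits = [9.0, 4.6, 3.7]        # e.g. Paris / Lyon / Marseille
sharp = softmax(logits, T=1.0)  # near one-hot: the tail is barely visible
flat = softmax(logits, T=3.0)   # softened: tail probabilities become measurable
```

At T=1 almost all the mass sits on the top token; at T=3 the ordering is unchanged but each query reveals measurably more about the lower-probability choices.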
2. The Extraction Pipeline: From Theory to Industrial Warfare
A modern model extraction attack is not a simple script; it is a distributed, robust data-engineering pipeline designed to evade detection. The Anthropic disclosure gave us unprecedented visibility into how these campaigns actually operate.
1. Phase 1: Synthetic Query Generation
2. Phase 2: Account Multiplexing & Hydra Clusters
3. Phase 3: Logit Harvesting & Capability Targeting
4. Phase 4: Student Model Fine-Tuning
Phase 1: Synthetic Query Generation
Attackers cannot simply ask random questions. To map the Teacher model effectively, they use a smaller, local LLM to generate millions of highly diverse, edge-case prompts. This technique, known as Self-Instruct or Evol-Instruct, ensures the extraction covers the entire latent space of the target model. Anthropic noted that DeepSeek's prompts specifically asked Claude to "imagine and articulate the internal reasoning behind a completed response and write it out step by step"—effectively generating chain-of-thought training data at scale. Additionally, DeepSeek used Claude to generate censorship-safe alternatives to politically sensitive queries, likely to train their own models to steer conversations away from censored topics.
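The mutation loop can be sketched as follows. This is a minimal illustration (the seed tasks and templates are hypothetical; real campaigns use a local LLM to perform the rewriting step rather than fixed strings):

```python
import random

SEED_TASKS = [
    "Explain the difference between a mutex and a semaphore",
    "Prove that the square root of 2 is irrational",
]

MUTATIONS = [
    "Rewrite your answer as explicit step-by-step reasoning: {}",
    "Add a rare edge case to this task, then solve it: {}",
    "Make this task significantly harder, then answer it: {}",
]

def evolve(prompt: str) -> str:
    # Evol-Instruct-style mutation: wrap the seed task in a template that
    # coerces the target model into emitting richer training signal
    # (e.g. chain-of-thought, as in the prompts Anthropic observed).
    return random.choice(MUTATIONS).format(prompt)
```

Run in a loop over millions of seeds, this is what spreads queries across the target's latent space instead of a narrow slice of it.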
Inside the Attacker's Terminal
Here is a simulated output of a distributed extraction coordinator script, modeled after the real techniques described in the Anthropic and GTIG disclosures:
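The snippet below is an entirely hypothetical reconstruction (account IDs, worker names, and per-hour figures are invented), scaled to match the numbers in the disclosures:

```text
[coordinator] loaded 23,998 active API keys across 14 residential proxy pools
[worker-031]  session acct-7f3a: temperature=1.4, top_logprobs=5
[worker-031]  template: "imagine and articulate the internal reasoning
              behind a completed response and write it out step by step"
[worker-058]  acct-19c2 received HTTP 429, rotating to acct-2b81
[harvest]     +4,112 exchanges this hour; cumulative 16.0M; queue depth 880k
```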
3. The Failure of Cryptographic Watermarking
In response to model theft, the industry invested heavily in LLM Watermarking. The most cited approach is the Kirchenbauer et al. (2023) algorithm, adopted as a baseline by multiple frontier labs.
How Watermarking Was Supposed to Work
During text generation, the watermark algorithm uses a pseudo-random number generator (seeded by the previous token) to divide the vocabulary into a "Green List" and a "Red List." The model is mathematically biased to select words from the Green List. To a human, the text reads normally. To a statistical detector, the unusually high frequency of Green List words proves the text was generated by that specific model. If an attacker trains a Student model on this data, the Student inherits the Green List bias, proving the theft.
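Detection reduces to a hypothesis test. The toy detector below (a stand-in for the Kirchenbauer et al. scheme, with a hash replacing the PRNG-partitioned vocabulary) computes the z-score of the Green List hit rate:

```python
import hashlib
import math

GREEN_FRACTION = 0.5  # gamma: share of the vocabulary on the Green List

def is_green(prev_token: str, token: str) -> bool:
    # The previous token seeds the partition; a token is "green" if its
    # hash lands in the green share of the space.
    digest = hashlib.sha256(f"{prev_token}|{token}".encode()).digest()
    return digest[0] < 256 * GREEN_FRACTION

def watermark_z_score(tokens):
    # How far does the green-token count deviate from what unbiased
    # (human-written) text would produce?
    n = len(tokens) - 1
    hits = sum(is_green(p, t) for p, t in zip(tokens, tokens[1:]))
    expected = GREEN_FRACTION * n
    return (hits - expected) / math.sqrt(n * GREEN_FRACTION * (1 - GREEN_FRACTION))
```

Watermarked text drives the z-score far above zero, and a Student trained on that text inherits the same bias; unbiased text hovers near zero.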
The 2026 Vulnerability: Character-Level Perturbations
Research by Zhang et al. ("Character-Level Perturbations Disrupt LLM Watermarks," published at NDSS 2026, arXiv:2509.09112) demonstrated a devastating and practical flaw in this defense. The attack exploits a fundamental dependency: watermarks rely entirely on the tokenization process.
By introducing Character-Level Perturbations—such as swapping characters, using Cyrillic homoglyphs (e.g., replacing the Latin 'a' with the Cyrillic 'а'), or injecting zero-width Unicode characters into the API prompts—attackers force the Teacher model to alter its tokenization boundaries.
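A two-line experiment shows the root cause: a homoglyph string is visually identical but byte-wise different, so any hash (and hence any Green/Red partition) seeded from it diverges completely. Variable names here are purely illustrative:

```python
import hashlib

latin = "data"
spoofed = "d\u0430ta"  # Cyrillic 'а' (U+0430) in place of Latin 'a'

same_text = (latin == spoofed)  # False: identical glyphs, different code points
seed_a = hashlib.sha256(latin.encode("utf-8")).hexdigest()
seed_b = hashlib.sha256(spoofed.encode("utf-8")).hexdigest()
# seed_a != seed_b: any watermark partition derived from this token is now
# desynchronized from what the detector expects.
```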
Security Alert
Current token-based watermarking algorithms are mathematically sound but practically vulnerable to token-desynchronization attacks via input perturbation (Zhang et al., NDSS 2026).
Technical Details
LoRD: The Reinforcement Learning Attack on Watermarks
Beyond character perturbation, the LoRD algorithm (Li et al., ACL 2025) represents an even more sophisticated threat. Instead of using traditional Maximum Likelihood Estimation or Knowledge Distillation to train the Student model, LoRD uses the divergence between the Student and Teacher (victim) models as an implicit reward signal for reinforcement learning. The authors proved that with a pre-trained local model of only 8 billion parameters, they could steal capabilities from a commercial LLM with 175 billion parameters under a given domain—with the resulting model performing statistically similar to the victim. Critically, LoRD achieves stronger watermark resistance and higher query efficiency than MLE-based approaches, because it is consistent with the alignment optimization procedure used by the victim model itself.
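The core signal can be caricatured in a few lines. This is a toy sketch of the idea, not the paper's actual objective: treat the negative divergence from the victim's next-token distribution as the reward driving the student's policy update.

```python
import numpy as np

def kl_divergence(p, q):
    # KL(p || q) for two discrete probability distributions.
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    return float(np.sum(p * np.log(p / q)))

def policy_reward(victim_probs, student_probs):
    # The smaller the student's divergence from the victim's next-token
    # distribution, the larger the reward for the RL update.
    return -kl_divergence(victim_probs, student_probs)
```

Because the reward depends only on distributional closeness, not on copying watermarked surface text token-for-token, the Green List bias transfers far more weakly than under MLE-style imitation.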
4. Architecting Modern Defenses
If rate limits are bypassed by hydra clusters, and watermarks are destroyed by perturbations and RL-based attacks, how do we defend the crown jewels? Both Anthropic and Google's disclosures point to the same direction: AI-Driven Behavioral Analysis and Semantic Anomaly Detection.
Anthropic disclosed that they have built "several classifiers and behavioral fingerprinting systems designed to identify distillation attack patterns in API traffic." What distinguishes a distillation attack from normal usage is the pattern: massive volume concentrated in narrow capability areas, highly repetitive prompt structures, and content that maps directly onto what is most valuable for training an AI model. When variations of a prompt arrive tens of thousands of times across hundreds of coordinated accounts, all targeting the same narrow capability, the pattern becomes clear—even if each individual prompt looks benign.
Code Implementation: Semantic vs. Naive Defense
Below is a technical comparison of how API gateways have evolved. The legacy approach relies on Redis counters. The modern approach utilizes vector databases to detect distributed semantic probing.
# Legacy: Simple Redis Token Bucket
def check_rate_limit(api_key, ip_address):
    # Attackers bypass this using 24,000 rotating proxies
    # and multiplexed API keys.
    key = f'rate_limit:{api_key}:{ip_address}'
    requests = redis_client.incr(key)
    if requests == 1:
        redis_client.expire(key, 3600)  # 1 hour window
    if requests > 1000:
        raise HTTPException(429, 'Rate limit exceeded')
    return True

# Modern: Vector-based Semantic Anomaly Detection
import numpy as np
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer('all-MiniLM-L6-v2')

def detect_extraction_probe(prompt, session_id, db_client):
    # 1. Convert prompt to dense vector embedding
    prompt_vector = embedder.encode(prompt)
    # 2. Query Vector DB for highly similar recent prompts
    #    across ALL accounts and IPs (defeats multiplexing)
    similar_queries = db_client.query(
        vector=prompt_vector,
        top_k=50,
        time_window='1h'
    )
    # 3. Calculate semantic density
    density_score = calculate_cluster_density(similar_queries)
    if density_score > 0.85:
        # High density indicates systematic boundary probing
        # Action: Poison the logprobs silently
        enable_logit_perturbation(session_id)
        log_security_event('DISTRIBUTED_EXTRACTION_DETECTED')
    return True
The Bottom Line
Model extraction has evolved from an academic exercise to an industrialized attack vector with geopolitical implications. Anthropic noted that illicitly distilled models "lack necessary safeguards, creating significant national security risks"—foreign labs that distill American models can feed unprotected capabilities into military, intelligence, and surveillance systems. If distilled models are open-sourced, dangerous capabilities proliferate beyond any single government's control.
As long as the economic asymmetry exists—spending weeks to steal what took years and hundreds of millions to develop—attackers will continue to refine their distillation pipelines. Security engineering must shift from perimeter defense (WAFs, IP bans) to deep behavioral analysis, semantic anomaly detection, and proactive logit poisoning.
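The proactive logit poisoning mentioned above can be sketched as follows. This is a minimal illustration, assuming a NumPy array of returned log-probabilities; the function name and noise scale are hypothetical:

```python
import numpy as np

def poison_logprobs(logprobs, rng=None, scale=0.5):
    # Add Gaussian noise to the returned log-probabilities while keeping
    # the argmax intact: ordinary callers still see the same top token,
    # but harvested soft labels no longer reflect the true distribution.
    if rng is None:
        rng = np.random.default_rng()
    lp = np.asarray(logprobs, dtype=float)
    noisy = lp + rng.normal(0.0, scale, size=lp.shape)
    top = int(np.argmax(lp))
    noisy[top] = noisy.max() + 0.1  # preserve the original top token
    # Renormalize (log-sum-exp) so values remain valid log-probabilities
    m = noisy.max()
    return noisy - (m + np.log(np.exp(noisy - m).sum()))
```

The design trade-off: honest users consuming only the top answer see no change, while a distillation pipeline trains its student on a corrupted "dark knowledge" signal.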
References
- Anthropic. "Detecting and Preventing Distillation Attacks." February 23, 2026. anthropic.com/news/detecting-and-preventing-distillation-attacks
- Google Threat Intelligence Group (GTIG). "AI Threat Tracker: Distillation, Experimentation, and (Continued) Integration of AI for Adversarial Use." February 12, 2026. cloud.google.com/blog/topics/threat-intelligence/distillation-experimentation-integration-ai-adversarial-use
- Zhang, Z., Zhang, X., Zhang, Y., Zhang, H., Pan, S., Liu, B., Gill, A., & Zhang, L. Y. "Character-Level Perturbations Disrupt LLM Watermarks." NDSS 2026. arXiv:2509.09112.
- Zhao, K., Li, L., Ding, K., Gong, N. Z., Zhao, Y., & Dong, Y. "A Survey on Model Extraction Attacks and Defenses for Large Language Models." arXiv:2506.22521, June 2025.
- Li, H., et al. "LoRD: Language Model Reverse Distillation." ACL 2025. (2025.acl-long.73)
- Birch, J., et al. "Model Leeching: An Extraction Attack Targeting LLMs." arXiv:2309.10544, 2023.
- Hinton, G., Vinyals, O., & Dean, J. "Distilling the Knowledge in a Neural Network." NIPS 2014 Deep Learning Workshop; arXiv:1503.02531, 2015.
- Kirchenbauer, J., Geiping, J., Wen, Y., Katz, J., Miers, I., & Goldstein, T. "A Watermark for Large Language Models." ICML 2023.
Test Your Technical Knowledge
Advanced Model Extraction Assessment
- According to Anthropic's February 2026 disclosure, what was one of the clearest indicators that the Claude campaigns were industrial-scale distillation operations rather than normal customer usage?
- Why are "hydra cluster" architectures effective for model extraction campaigns?
- How do Character-Level Perturbations (e.g., injecting zero-width spaces or Cyrillic homoglyphs) defeat traditional Green/Red list LLM watermarking?

