Everyone wants to know if content was written by AI. But can detection tools actually tell?
I tested the major detectors. The results are concerning.
The Test Setup
I used three types of content:
- Known AI: ChatGPT and Claude outputs
- Known Human: My own writing from before AI existed
- Mixed: Human writing edited by AI
Each sample: 500-1000 words.
Tools Tested
- GPTZero
- Originality.ai
- Copyleaks
- Turnitin
- ZeroGPT
- Writer.com
- Sapling
- Content at Scale
Results Summary
Testing 100% AI-Generated Content
| Tool | Correct Detection | False Negatives |
|---|---|---|
| GPTZero | 78% | 22% |
| Originality.ai | 85% | 15% |
| Copyleaks | 72% | 28% |
| Turnitin | 80% | 20% |
| ZeroGPT | 65% | 35% |
| Writer.com | 70% | 30% |
| Sapling | 68% | 32% |
| Content at Scale | 75% | 25% |
Average: 74% correct on obvious AI content.
26% of clearly AI-generated content passed as human.
Testing 100% Human Content
| Tool | Correct (Human) | False Positives |
|---|---|---|
| GPTZero | 82% | 18% |
| Originality.ai | 78% | 22% |
| Copyleaks | 85% | 15% |
| Turnitin | 88% | 12% |
| ZeroGPT | 70% | 30% |
| Writer.com | 75% | 25% |
| Sapling | 80% | 20% |
| Content at Scale | 76% | 24% |
Average: 79% correct on human content.
21% of clearly human content was flagged as AI.
The Problem
Best case scenario: Tools are right ~75-85% of the time.
That means:
- 1 in 5 humans falsely accused of using AI
- 1 in 4 AI texts pass undetected
This isn’t good enough for high-stakes decisions.
What Affects Detection
Easier to Detect:
- Direct ChatGPT outputs with no editing
- Generic topics
- Certain repetitive patterns
- Longer texts
Harder to Detect:
- Edited AI content
- Technical writing
- Non-English content
- Shorter texts
- Specific domain expertise
Almost Undetectable:
- AI-assisted human writing (person writes, AI polishes)
- Human writing that happens to be formal
- AI content run through multiple edit passes
The False Positive Problem
My pre-2020 writing (definitely human) was flagged as AI:
- 15-30% of the time depending on tool
- More often when writing was formal
- More often in technical content
Real consequence: Students, writers, and professionals get falsely accused.
What Detectors Actually Measure
They don’t detect “AI thinking.” They measure:
- Perplexity (predictability of word choices)
- Burstiness (variation in sentence patterns)
- Statistical patterns
The problem: Good human writing can have similar patterns to AI. AI can be tuned to mimic human patterns.
Why Detection Will Keep Failing
- AI improves faster than detection — Models get more human-like
- Trivial to evade — Minor edits defeat detectors
- Fundamental limitation — Detecting style, not source
- No ground truth — Can’t train on labeled data that doesn’t exist
My Recommendations
For Individuals
- Don’t trust AI detectors to prove you didn’t use AI
- Keep drafts and revision history
- Understand detection is unreliable
For Educators
- Don’t use detection as sole evidence
- Consider changing assessment types
- Accept that detection has limits
- Focus on learning, not policing
For Publishers
- Detection won’t save you
- Editorial judgment still matters
- Consider AI-disclosure policies instead
- Accept the new reality
For Employers
- False positives are a lawsuit waiting to happen
- Don’t fire people based on detection
- Update policies for the AI era
The Honest Truth
AI detection tools:
- Are better than random guessing
- Have unacceptable false positive rates
- Can be trivially defeated
- Will not improve significantly
The AI writing genie is out of the bottle. Detection isn’t the solution.
What Actually Works
Instead of detection, consider:
- Process documentation — Show your work
- Oral assessments — Discuss the content
- Iterative assignments — See work evolve
- Specific requirements — Hard-to-fake details
- Acceptance — AI is a tool; focus on outcomes
Bottom Line
AI detectors don’t work well enough for consequential decisions.
Until accuracy improves dramatically (unlikely), treat detection results as weak signals, not proof.
The future isn’t detecting AI. It’s adapting to AI being everywhere.
Related Articles
- How to Detect AI Writing - Manual methods
- AI Writing Tools Comparison - What’s available
- AI in Education - Broader implications