The Pumping Lemma

Proving Languages Are NOT Regular (or Not Context-Free)

Use ← → arrows to navigate

The Big Picture

The pumping lemma is a tool for proving negative results.

What it tells you

A language is NOT regular
A language is NOT context-free

Key Idea

Every regular language has a "pumping" property. If a language lacks this property, it cannot be regular.

What it does NOT tell you

It does not prove a language IS regular
Passing the pumping lemma ≠ regular

Warning

The pumping lemma is a necessary condition for regularity, NOT sufficient. It's a one-way test.

Intuition: Why Pumping Works

The Pigeonhole Principle meets finite automata.

A DFA with p states reading a string of length ≥ p must visit some state twice — creating a loop.

Analogy

5 pigeonholes, 6 pigeons ⇒ at least one hole has 2 pigeons. p states, p+1 visits ⇒ at least one state is visited twice.

The Pumping Lemma for Regular Languages

Pumping Lemma (Formal Statement)

If L is regular, then ∃ pumping length p such that ∀ s ∈ L with |s| ≥ p, s = xyz satisfying:

1. |y| > 0       (y is not empty)
2. |xy| ≤ p      (loop in first p chars)
3. xyⁱz ∈ L     ∀ i ≥ 0 (pump y any number of times)

The Pumping Game: Prove {aⁿbⁿ} is NOT Regular

Play the adversarial game — you are the prover!

Step 1: Adversary picks p =

Step 2: You pick s = a^pb^p (length ≥ p) ✓

Step 3: Adversary splits s = xyz (|xy|≤p, |y|>0)

Step 4: You pick i to break it!

Pump value i: 0

The Proof as a Flowchart

Every pumping lemma proof follows this exact structure.

Explore: Pumping a^pb^p

Adjust the split and pump value to see why EVERY split fails.

p = 4

|x| = 0

|y| = 2

pump i = 0

Key Insight

Since |xy| ≤ p, y consists entirely of a's. Pumping changes only the count of a's, breaking aⁿ=bⁿ.

Example 2: {ww | w ∈ {0,1}*} is NOT Regular

Prove that the language of doubled strings is not regular.

Assume L = {ww} is regular
Let p be the pumping length
Choose s = 0ᵖ1ᵖ0ᵖ1ᵖ ∈ L
s = xyz with |xy| ≤ p, |y| > 0
y = 0ᵏ (y is all 0's, within first p chars)
Pump i=2: xy²z = 0ᵖ⁺ᵏ1ᵖ0ᵖ1ᵖ
First half ≠ second half ⇒ NOT ww
Contradiction! L is NOT regular. ✓

Example 3: {1^n² | n ≥ 0} is NOT Regular

A language where string lengths are perfect squares.

Assume L = {1^(n²)} is regular
Let p be the pumping length
Choose s = 1^(p²) ∈ L
s = xyz, |xy| ≤ p, |y| > 0
|y| = k where 1 ≤ k ≤ p
Pump i=2: |xy²z| = p² + k
p² < p²+k ≤ p²+p < (p+1)²
p²+k is NOT a perfect square!
Contradiction! L is NOT regular.

Key Insight

Perfect squares get further apart: the gap between n² and (n+1)² is 2n+1. Pumping adds at most p, which is too small to reach the next square.

Common Mistakes in Pumping Proofs

Avoid these pitfalls that trip up most students!

Mistake 1: Fixing the split

You can't choose how xyz is split — the adversary picks the split. Your proof must work for ALL valid splits.

Mistake 2: Only checking i=0

You need to find at least one value of i that breaks it. But the adversary gets to split, so you must handle any split.

Mistake 3: Claiming PL proves regularity

If a language passes the PL, that tells you nothing. The PL can only prove non-regularity.

Correct Approach

You choose: the string s and the pump value i.
Adversary chooses: p and the split xyz.
Your proof must work for ANY p and ANY valid split.

When the Pumping Lemma Fails

Some non-regular languages satisfy the pumping lemma anyway!

Analogy

The PL is like a metal detector: if it beeps, there's metal. But if it doesn't beep, maybe the metal is too deep. Not beeping doesn't prove "no metal."

The classic example:

L = {aⁱb^jc^k | i,j,k ≥ 0 and if i=1 then j=k}

This language is not regular, but it satisfies the pumping lemma! For any string of length ≥ p, you can always pump it and stay in L (because most strings have i ≠ 1, so the j=k constraint doesn't apply).

Takeaway

PL passing ⇒ inconclusive
PL failing ⇒ definitely not regular

Challenge A: Predict the Pump

Challenge

Language: L = {0ⁿ1²ⁿ | n ≥ 0}. We pick s = 0^p1^2p. The adversary splits with x = ε, y = 0², z = 0^p-21^2p.

If we pump with i = 0 (delete y), what is xy°z?

The Pumping Lemma for Context-Free Languages

A more powerful version for proving languages are not context-free.

CFL Pumping Lemma

If L is context-free, then ∃ p such that ∀ s ∈ L with |s| ≥ p, s = uvxyz satisfying:

1. |vy| > 0       (v and y not both empty)
2. |vxy| ≤ p      (middle portion bounded)
3. uvⁱxyⁱz ∈ L    ∀ i ≥ 0 (pump v and y together)

Why CFL Pumping Works: Parse Trees

In a sufficiently long string, the parse tree must repeat a variable.

If string s is long enough, the parse tree is tall enough that some variable A must appear twice on some root-to-leaf path.

Analogy

Like the DFA pigeonhole, but for parse trees: if the tree is tall enough, a variable must repeat. The subtree between the two A's gives us the pumpable portion (v and y).

The Five Pieces

u = before outer A
v = outer A's left yield minus inner A
x = inner A's yield
y = outer A's right yield minus inner A
z = after outer A

Challenge B: Fix the Proof Bug

Spot the Error

A student writes this proof that {aⁿbⁿ} is not regular:

Assume L = {a^n b^n} is regular with pumping length p
Let s = a^p b^p. Pick x=ε, y=ab, z=a^(p-1)b^(p-1)
Pump i=0: xz = a^(p-1)b^(p-1) ∈ L. No contradiction...
Therefore the proof fails??

What's wrong with line 2?

CFL Example: {aⁿbⁿcⁿ} is NOT Context-Free

The classic example requiring the CFL pumping lemma.

Assume L = {a^n b^n c^n} is CFL
Let p be CFL pumping length
Pick s = a^p b^p c^p
s = uvxyz, |vxy|≤p, |vy|>0
vxy can't span all 3 symbols
Case: vxy in a's and b's only
Pump i=2: more a's+b's, same c's
#a or #b ≠ #c ⇒ NOT in L!
ALL cases lead to contradiction!

CFL Example: {ww | w ∈ {0,1}*} is NOT Context-Free

The doubled-string language needs even more power than a PDA.

Assume L = {ww} is CFL
Pick s = 0^p 1^p 0^p 1^p
s = uvxyz, |vxy|≤p
vxy spans at most 2 adjacent blocks
Pumping disrupts the symmetry
First half ≠ second half
Contradiction! L not CFL.

Key Insight

{ww} is the "poster child" of non-CFL languages. It requires matching across non-nested boundaries, which a PDA's stack can't handle.

Comparing the Two Pumping Lemmas

Side-by-side comparison of regular vs CFL pumping.

	Regular PL	CFL PL
Decomposition	s = xyz (3 parts)	s = uvxyz (5 parts)
Non-empty	\|y\| > 0	\|vy\| > 0
Length bound	\|xy\| ≤ p	\|vxy\| ≤ p
Pumping	xyⁱz ∈ L	uvⁱxyⁱz ∈ L
Pump pieces	1 piece (y)	2 pieces (v, y) pumped together
Intuition	DFA loop (pigeonhole)	Parse tree repeated variable
Proves	NOT regular	NOT context-free

Analogy

Regular PL: finding a loop in a circle (DFA). CFL PL: finding a repeated node in a tree (parse tree). More structure = more pieces.

Beyond Pumping

The pumping lemma isn't the only tool. Here are stronger alternatives.

Myhill-Nerode Theorem

A language L is regular iff it has a finite number of equivalence classes under the distinguishability relation.

Key Idea

If you can find infinitely many strings that are pairwise distinguishable (each pair needs a different "suffix" to tell them apart), then L is not regular. This is both necessary AND sufficient!

Ogden's Lemma

A stronger version of the CFL pumping lemma where you mark certain positions, and v and y must contain marked positions.

Key Idea

Ogden's gives you more control over WHERE the pumping happens. Useful when the basic CFL PL fails for a particular language.

Challenge C: Classify the Language

Which pumping lemma do you need?

For each language, decide: Regular PL, CFL PL, or neither (language IS regular/CFL)?

1. L = {aⁿ | n is prime}

2. L = {aⁿb^m | n ≤ m}

3. L = {aⁿbⁿcⁿ}

4. L = (ab)*

Summary & Cheat Sheet

Regular PL Template

Assume L is regular (for contradiction)
Let p be the pumping length
Choose s = [string in L, |s|≥p]
For ANY split s=xyz with |xy|≤p, |y|>0:
  Show xy^i z ∉ L for some i
Contradiction! L is NOT regular.

CFL PL Template

Assume L is CFL (for contradiction)
Let p be the CFL pumping length
Choose s = [string in L, |s|≥p]
For ANY split s=uvxyz, |vxy|≤p, |vy|>0:
  Show uv^i xy^i z ∉ L for some i
Contradiction! L is NOT CFL.

Quiz: Multiple Choice

Q1: In a PL proof, who picks the string s?

The adversary The prover (you) It doesn't matter

Q2: In the CFL PL, how many pieces is the string split into?

3 (xyz) 4 (uvyz) 5 (uvxyz)

Q3: If a language passes the PL, what can we conclude?

It's regular Nothing It's not regular

Quiz: Trace the Proof

Complete the Pumping Proof

Language: L = {aⁿb²ⁿ | n ≥ 0}. Fill in the blanks:

1. Assume L is regular with pumping length p

2. Choose s = a^p b^(2p) ∈ L, |s| = 3p ≥ p ✓

3. s = xyz, |xy| ≤ p, so y = a^k for some k ≥ 1

4. Pump with i =

5. xy^i z = a^() b^(2p)

6. For this to be in L, need #b = 2 × #a

7. But 2p ≠ 2×() since k ≥ 1

8. Contradiction! L is NOT regular.

Quiz: Build Your Own Proof

Prove {0ⁿ1ⁿ2ⁿ} is not context-free

Drag the proof steps into the correct order:

Pump i=2: increases count of at most 2 symbols

Let p be the CFL pumping length

Third symbol count unchanged ⇒ not in L

Assume L = {0^n 1^n 2^n} is CFL

|vxy| ≤ p, so vxy spans at most 2 of {0,1,2}

Contradiction! L is NOT context-free.

Pick s = 0^p 1^p 2^p

Your order: 

The Pumping Lemma

The Big Picture

What it tells you

Key Idea

What it does NOT tell you

Warning

Intuition: Why Pumping Works

Analogy

The Pumping Lemma for Regular Languages

Pumping Lemma (Formal Statement)

The Pumping Game: Prove {anbn} is NOT Regular

The Proof as a Flowchart

Explore: Pumping apbp

Key Insight

Example 2: {ww | w ∈ {0,1}*} is NOT Regular

Example 3: {1n² | n ≥ 0} is NOT Regular

Key Insight

Common Mistakes in Pumping Proofs

Mistake 1: Fixing the split

Mistake 2: Only checking i=0

Mistake 3: Claiming PL proves regularity

Correct Approach

When the Pumping Lemma Fails

Analogy

Takeaway

Challenge A: Predict the Pump

Challenge

The Pumping Lemma for Context-Free Languages

CFL Pumping Lemma

Why CFL Pumping Works: Parse Trees

Analogy

The Five Pieces

Challenge B: Fix the Proof Bug

Spot the Error

CFL Example: {anbncn} is NOT Context-Free

CFL Example: {ww | w ∈ {0,1}*} is NOT Context-Free

Key Insight

Comparing the Two Pumping Lemmas

Analogy

Beyond Pumping

Myhill-Nerode Theorem

Key Idea

Ogden's Lemma

Key Idea

Challenge C: Classify the Language

Which pumping lemma do you need?

Summary & Cheat Sheet

Regular PL Template

CFL PL Template

Quiz: Multiple Choice

Quiz: Trace the Proof

Complete the Pumping Proof

Quiz: Build Your Own Proof

Prove {0n1n2n} is not context-free

The Pumping Game: Prove {aⁿbⁿ} is NOT Regular

Explore: Pumping a^pb^p

Example 3: {1^n² | n ≥ 0} is NOT Regular

CFL Example: {aⁿbⁿcⁿ} is NOT Context-Free

Prove {0ⁿ1ⁿ2ⁿ} is not context-free