Direct naar de hoofdinhoud
eLearner.app
Module 2 · Les 3 van 47/32 in de cursus~10 min
Modulelessen (3/4)

Hebzuchtig versus lui

By default quantifiers are greedy: they consume as much as possible while keeping the pattern valid. Adding ? after a quantifier (*?, +?, ??, {n,m}?) gives you the lazy version: it consumes as little as possible.

Code
Sample: <b>uno</b> e <i>due</i>

Greedy pattern  <.*>     matches:  <b>uno</b> e <i>due</i>   (everything)
Lazy pattern    <.*?>    matches:  <b>           </b>
                                   <i>           </i>        (4 matches with the g flag)

The exact same pattern, a single character of difference (? added to the quantifier), totally different results.

When it matters

  • To extract the content between delimiters (HTML tags, quotes, parentheses) the lazy version is almost always the right one.
  • To match up to end of line you usually want greedy (.*).

Greedy vs Lazy strategies in the engine

A greedy quantifier consumes as much text as possible and backtracks only if forced. By adding ? (lazy), the engine consumes the absolute minimum and advances one character at a time searching for the next match of the pattern.

Try it

Oefening#regex.m2.l3.e1
Pogingen: 0Laden…

Extract every HTML tag from the sample (e.g. `<b>`, `</b>`, `<i>`, `</i>`) using the lazy version `.*?`.

Editor laden…
Toon hint

Greedy <.*> matches from the start of the first tag to the end of the last one. Lazy <.*?> stops at the first > it encounters.

Oplossing beschikbaar na 3 pogingen

Review exercise

Oefening#regex.m2.l3.e2
Pogingen: 0Laden…

Extract every string between double quotes in the text (e.g. "ciao", "mondo"). Use the lazy version of the quantifier to avoid jumping to later closings.

Editor laden…
Toon hint

".*?" stops at the first closing double quote.

Oplossing beschikbaar na 3 pogingen

Additional challenge

Oefening#regex.m2.l3.e3
Pogingen: 0Laden…

Extract all text blocks enclosed in square brackets (e.g. `[text]`), including the brackets, using a lazy quantifier so as not to merge separate blocks.

Editor laden…
Toon hint

Use \[ for the open bracket, then .*? and finally \] for the closed bracket.

Oplossing beschikbaar na 3 pogingen