跳转到主要内容
eLearner.app
模块 3 · 第 2 课(共 4)课程中的10/32~10 min
模块课程(2/4)

字边界:`\b` `\B`

\b is a word boundary anchor: it matches the position between a word character (\w) and a non-word character (\W, or the start/end of the string). Like ^ and $, it does NOT consume characters.

Code
Pattern: \bgatto\b
Sample:  Il gatto e la gattina giocano.
            ^^^^^

gatto matches only as a whole word: inside gattina the sequence gatto is not there (the final o is missing), but it would not match gatti either because the final i is a word character.

\B is the opposite: it matches a position that is NOT a word boundary.

Finding "whole words"

The most typical use of \b is "match the word X only when isolated, not as part of another word":

Code
\bif\b   matches 'if' but not 'sniff', 'gift', 'lifetime'.

Word boundaries and non-word characters

The boundary \\b does not match any physical character; it is a position test. A \\b boundary exists between a \\w character and a non-\\w character (or start/end of text). The negation \\B asserts that the current position is not a word boundary.

Try it

锻炼#regex.m3.l2.e1
尝试:0加载中...

Find every occurrence of the whole word `cat` (case-insensitive). It must NOT match `category`, `concatenate`, `scatter`.

正在加载编辑器...
显示提示

Wrap 'cat' between two \\b: boundary at the start AND boundary at the end.

3 次尝试后可用的解决方案

Review exercise

锻炼#regex.m3.l2.e2
尝试:0加载中...

Find every integer that is NOT part of an identifier (e.g. `42` yes, but neither `var42` nor `42abc`). Use `\\b` on both sides.

正在加载编辑器...
显示提示

\\b\\d+\\b matches only 'isolated' digit sequences. abc42 has 'c' (word) before the 42.

3 次尝试后可用的解决方案

Additional challenge

锻炼#regex.m3.l2.e3
尝试:0加载中...

Find the sequence `cat` only if it starts a longer word, excluding when it appears as a whole word or at the end (e.g. match `catalog` but not `wildcat` or isolated `cat`).

正在加载编辑器...
显示提示

Use \b at the start of cat (word boundary) and \B at the end (non-word boundary).

3 次尝试后可用的解决方案