跳转到主要内容
eLearner.app
模块 7 · 第 4 课(共 4)课程中的28/32~10 min
模块课程(4/4)

用正则表达式分割

String.prototype.split(separator) accepts not only a fixed string but also a regex as separator. This makes it a powerful tool to tokenize structured text.

JS
'uno, due,tre  quattro'.split(/[,\s]+/);
// ["uno", "due", "tre", "quattro"]

The regex /[,\s]+/ matches "one or more of comma or whitespace": split splits on any compound delimiter.

Typical cases

  • Permissive CSV: text.split(/\s*,\s*/) to handle spaces around commas.
  • Naive tokenizer: text.split(/\s+/) to extract words.
  • Keep the separator: if the regex contains capturing groups, the content of the groups is included in the result array.
JS
'a=1; b=2; c=3'.split(/(;)\s*/);
// ["a=1", ";", "b=2", ";", "c=3"]

Without the () group the semicolon would disappear. With (;) you keep it in the result.

Preserving separators in split operations

If you place split separators inside capturing parentheses, the output of String.prototype.split will include the separators themselves as elements in the final array, instead of discarding them.

Try it

锻炼#regex.m7.l4.e1
尝试:0加载中...

Find every permissive CSV separator: a comma with optional spaces around it. This way you could use it in split to tokenize the list.

正在加载编辑器...
显示提示

Use \\s* before and after the comma to absorb any optional spaces.

3 次尝试后可用的解决方案

Review exercise

锻炼#regex.m7.l4.e2
尝试:0加载中...

Find every 'spaces or semicolons' separator (one or more). This way split would tokenize the text into words.

正在加载编辑器...
显示提示

Combine \\s and ; in a class [\\s;] with the + quantifier.

3 次尝试后可用的解决方案

Additional challenge

锻炼#regex.m7.l4.e3
尝试:0加载中...

Write a regex to use in a split that separates numbers while keeping math operators `+`, `-`, `*`, `/` as elements of the array.

正在加载编辑器...
显示提示

Enclose the math operators character class in capture parentheses to preserve them in the split array.

3 次尝试后可用的解决方案