c# - RegEx 提取前 6 到 10 位数字，不包括 8 位数字

Question

我有以下测试文件名：

abc001_20111104_summary_123.txt
abc008_200700953_timeline.txt
abc008_20080402_summary200201573unitf.txt
123456.txt
100101-100102 test.txt
abc008_20110902_summary200110254.txt
abcd 200601141 summary.txt
abc008_summary_200502169_xyz.txt

我需要从每个文件名中提取一个数字。

该号码必须为6、7、9 或 10 位数字（因此，不包括 8 位数字）。

如果找到多个，我想获得第一个数字，如果没有找到，我想获得空字符串。

我设法通过2 个步骤完成此操作，首先删除 8 位数字，然后从我的列表中提取 6 到 10 位数字。

step 1 
  regex:  ([^0-9])([0-9]{8})([^0-9])
  replacement:  \1\3

step 2
  regex: (.*?)([1-9]([0-9]{5,6}|[0-9]{8,9}))([^0-9].*)
  replacement:  \2

我在这 2 个步骤过程后得到的数字正是我正在寻找的：

[]
[200700953]
[200201573]
[123456]
[100101]
[200110254]
[200601141]
[200502169]

现在，问题是： 有没有办法在一步过程中做到这一点？

我已经看到了一个类似问题的很好的解决方案，但是，如果找到多个，它会给我最新的数字。

注意：使用Regex Coach进行测试。

score 8 · Accepted Answer

假设您的正则表达式引擎支持后向断言：

(?<!\d)\d{6}(?:\d?|\d{3,4})(?!\d)

解释：

(?<!\d)   # Assert that the previous character (if any) isn't a digit
\d{6}     # Match 6 digits
(?:       # Either match
 \d?      # 0 or 1 digits
|         # or
 \d{3,4}  # 3 or 4 digits
)         # End of alternation
(?!\d)    # Assert that the next character (if any) isn't a digit

score 0 · Accepted Answer

试试这个：

regex: /(?:^|\D)(\d{6}(?:\d(?:\d{2,3})?)?)(?:\D|$)/
replacement: \1

这将提取六位数字，可选地后跟一位（总共 7 位），可选地后跟 2 或 3 位（9 或 10）。

score 0 · Accepted Answer

对于每个字符串 $subject

$subject = "abc001_20111104_summary_123.txt";
$subject ="abc008_200700953_timeline.txt";
$subject ="abc008_20080402_summary200201573unitf.txt";
$subject ="123456.txt"
$subject ="100101-100102 test.txt"
$subject ="abc008_20110902_summary200110254.txt";
$subject ="abcd 200601141 summary.txt";
$subject ="abc008_summary_200502169_xyz.txt";

$pattern = '*(?<!\d)(\d{6,7}|\d{9,10})(?!\d)*';
preg_match_all($pattern, $subject, $matches);
print_r($matches);

你会得到预期的结果：

空的
200700953
200201573
123456
100101
200110254
200601141
200502169

score 0 · Accepted Answer

匹配单词边界或边缘的非数字[0-9]{6,7}|[0-9]{9,10}应该这样做：

([^0-9]|\<)([0-9]{6,7}|[0-9]{9,10})([^0-9]|\>)

c# - RegEx 提取前 6 到 10 位数字，不包括 8 位数字

4 回答 4

Related

Reference