regex - 从字母数字字符中删除数字

Question

我有一个字母数字字符列表，如下所示：

x <-c('ACO2', 'BCKDHB456', 'CD444')

我想要以下输出：

x <-c('ACO', 'BCKDHB', 'CD')

有什么建议么？

# dput(tmp2)

structure(c(432L, 326L, 217L, 371L, 179L, 182L, 188L, 268L, 255L,..., 
), class = "factor")

score 99 · Accepted Answer

您可以gsub为此使用：

gsub('[[:digit:]]+', '', x)

或者

gsub('[0-9]+', '', x)
# [1] "ACO"    "BCKDHB" "CD"

score 13 · Accepted Answer

如果您的目标只是删除数字，则该removeNumbers()函数会从文本中删除数字。使用它可以降低出错的风险。

library(tm)

x <-c('ACO2', 'BCKDHB456', 'CD444') 

x <- removeNumbers(x)

x

[1] "ACO"    "BCKDHB" "CD"

score 11 · Accepted Answer

使用字符串

大多数 stringr 函数处理正则表达式

str_replace_all会做你需要的

str_replace_all(c('ACO2', 'BCKDHB456', 'CD444'), "[:digit:]", "")

score 6 · Accepted Answer

使用stringi的解决方案：

# your data
x <-c('ACO2', 'BCKDHB456', 'CD444')

# extract capital letters
x <- stri_extract_all_regex(x, "[A-Z]+")

# unlist, so that you have a vector
x <- unlist(x)

一行解决方案：

regex - 从字母数字字符中删除数字

4 回答 4

Related

Reference