我是 Python 新手,找不到删除无用文本的方法。主要目的是保留我想要的单词并删除所有其他单词。在这个阶段,我可以检查我的in_data并找到我想要的单词。如果sentence.find(wordToCheck)是肯定的,则保留它。in_data是每行句子,但当前输出是每行一个单词。我想要的是保留格式,在每一行中找到单词并删除其余部分。
import Orange
import orange
word = ['roaming','overseas','samsung']
out_data = []
for i in range(len(in_data)):
for j in range(len(word)):
sentence = str(in_data[i][0])
wordToCheck = word[j]
if(sentence.find(wordToCheck) >= 0):
print wordToCheck
输出
roaming
overseas
roaming
overseas
roaming
overseas
samsung
samsung
in_data是类似的句子
contacted vodafone about going overseas and asked about roaming charges. The customer support officer says there isn't a charge but while checking my usage overseas.
我希望看到输出就像
overseas roaming overseas