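It sounds like they are asking you to average all of the "X-DSPAM-Confidence" numbers, rather than to find 0.750718518519.
Personally, I would look for the lines that start with the text you want, extract the number from each one, put all of those numbers into a list, and average them at the end.
Something like this: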
# Get the filename from the user
filename = raw_input("Enter file name: ")
# An empty list to contain all our floats
spamflts = []
# Open the file to read ('r'), and loop through each line
for line in open(filename, 'r'):
    # If the line starts with the text we want (with all whitespace stripped)
    if line.strip().startswith('X-DSPAM-Confidence'):
        # Then extract the number from the second half of the line
        # "text:number".split(':') will give you ['text', 'number']
        # So you use [1] to get the second half
        # Then we use .strip() to remove whitespace, and convert to a float
        flt = float(line.split(':')[1].strip())
        print flt
        # We then add the number to our list
        spamflts.append(flt)
print spamflts
# At the end of the loop, we work out the average - the sum divided by the length
average = sum(spamflts)/len(spamflts)
print average
>>> lines = """X-DSPAM-Confidence: 1
X-DSPAM-Confidence: 5
Nothing on this line
X-DSPAM-Confidence: 4"""
>>> for line in lines.splitlines():
...     print line
...
X-DSPAM-Confidence: 1
X-DSPAM-Confidence: 5
Nothing on this line
X-DSPAM-Confidence: 4
Using find():
>>> for line in lines.splitlines():
...     pos = line.find('X-DSPAM-Confidence:')
...     print pos
...
0
0
-1
0
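As we can see, find() only gives us the position of 'X-DSPAM-Confidence:' within each line (or -1 when the text is not there), not the position of the number that follows it.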
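If you did want to stick with find(), you would have to skip past the text yourself. Here is a rough sketch of that, using one of the sample lines from above; the variable names `marker` and `value` are just mine for illustration, and it is written with Python 3's print() rather than the Python 2 print statement used in the session:

```python
line = "X-DSPAM-Confidence: 5"
marker = "X-DSPAM-Confidence:"  # illustrative name, not from the exercise

# find() returns the index where the marker starts, or -1 if it is absent
pos = line.find(marker)
if pos != -1:
    # Step over the marker itself to reach the text that follows it;
    # float() ignores the leading space
    value = float(line[pos + len(marker):])
    print(value)
```

It works, but you end up doing the position arithmetic by hand.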
It is easier to check whether a line starts with 'X-DSPAM-Confidence:' and then extract the number, like this:
>>> for line in lines.splitlines():
...     print line.startswith('X-DSPAM-Confidence')
...
True
True
False
True
>>> for line in lines.splitlines():
...     if line.startswith('X-DSPAM-Confidence'):
...         print line.split(':')
...
['X-DSPAM-Confidence', ' 1']
['X-DSPAM-Confidence', ' 5']
['X-DSPAM-Confidence', ' 4']
>>> for line in lines.splitlines():
...     if line.startswith('X-DSPAM-Confidence'):
...         print float(line.split(':')[1])
...
1.0
5.0
4.0
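To tie this back to the averaging, here is one way the steps could be put together. Treat it as a sketch rather than the expected solution: the function name `average_confidence` is something I made up, it returns None when nothing matches (my own choice, to avoid dividing by zero), and it uses Python 3 syntax, unlike the Python 2 session above.

```python
def average_confidence(lines, marker="X-DSPAM-Confidence:"):
    """Hypothetical helper: mean of the numbers after `marker`, or None if none match."""
    values = [
        float(line.split(":", 1)[1])  # the text after the first colon
        for line in lines
        if line.startswith(marker)
    ]
    if not values:
        return None  # no matching lines, so there is nothing to average
    return sum(values) / len(values)


sample = """X-DSPAM-Confidence: 1
X-DSPAM-Confidence: 5
Nothing on this line
X-DSPAM-Confidence: 4"""

# Averages 1.0, 5.0 and 4.0, so roughly 3.33
print(average_confidence(sample.splitlines()))
```

For a real file you could pass the open file object instead of `sample.splitlines()`, since iterating over a file also yields one line at a time.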