2

我在按某个字段对文本文件中的数据进行排序时遇到了一些问题。以后可能会通过多个字段。.txt 是几千行代码。我是 python 的新手,所以我的代码可能有点乱。例如,这是我要从中读取的文本文件:

stuff
123 1200 id-aaaa stuart@test.com
322 1812 id-wwww machine-switch@test.com
839 1750 id-wwww gary2-da@test.com
500 0545 id-aaaa abc123@test.com
525 1322 id-bbbb zyx321@test.com

到目前为止,我的代码如下:

filelist = open("info.txt").readlines()
splitlist = list()

class data:
    def __init__(self, eventName, time, identity, domain):
        self.evenName = eventName
        self.time = time
        self.identity = identity
        self.domain = domain

for line in filelist:
    filelist = list.split(', ')
    splitlist.append(filelist)

for column in splitlist:
    if (len(column) > 1): #to skip the first line
        eventName = column[0].strip()
        time = column[1].strip()
        identity = column[2].strip()
        domain = column[3].strip()

我想按身份逐行排序 .txt 文件,然后可能按时间排序。我看到这可以通过python教程中的类来完成,所以我正在尝试走这条路。请指教。谢谢!

4

4 回答 4

8
with open("info.txt") as inf:
    data = []
    for line in inf:
        line = line.split()
        if len(line)==4:
            data.append(line)

data.sort(key=lambda s:(s[2],s[1]))

如果你想变得更高级一点,

from collections import namedtuple
Input = namedtuple('Input', ('name', 'time', 'identity', 'domain'))

with open("info.txt") as inf:
    inf.next()  # skip header
    data = [Input(*(line.split()) for line in inf]

data.sort(key=lambda s:(s['identity'],s['time']))

如果您真的非常想使用一个类,请尝试:

import time

class Data(object):
    def __init__(self, event, time_, identity, domain):
        self.event = event
        self.time = time.strptime(time_, "%H%M")
        self.identity = identity
        self.domain = domain

with open("info.txt") as inf:
    data = []
    for line in inf:
        try:
            data.append(Data(*(line.split()))
        except TypeError:
            # wrong number of arguments (ie header or footer)
            pass

data.sort(key=lambda s:(s.identity,s.time))
于 2012-06-10T22:49:38.557 回答
0

以下 Python 代码应将您想要的信息放在一起,然后对其进行排序。

rows = []
for line in open("info.txt"):
    line = line.split()
    if len(line) != 4:
        continue

    eventName, time, identity, domain = line

    # Add them in the order you want to sort by
    rows.append((identity, time, eventName, domain)) 

rows.sort()
于 2012-06-10T22:51:01.570 回答
0

这是一个常见的错误,你所做的它在没有以正确的语法实际读取文件的情况下打开它,这是我的想法:

filelist = open("info.txt", "r")
print filelist
filelist.read() # reads the entire file
splitlist = list()

class data:
    def __init__(self, eventName, time, identity, domain):
        self.evenName = eventName
        self.time = time
        self.identity = identity
        self.domain = domain

for line in filelist:
    filelist = list.split(', ')
    splitlist.append(filelist)

for column in splitlist:
    if (len(column) > 1): #to skip the first line
        eventName = column[0].strip()
        time = column[1].strip()
        identity = column[2].strip()
        domain = column[3].strip()

希望有效!来源:http ://docs.python.org/tutorial/inputoutput.html

于 2012-06-10T22:52:26.447 回答
0

按 id 然后按日期排序:

text = ["123 1200 id-aaaa stuart@test.com",
        "322 1812 id-wwww machine-switch@test.com",
        "839 1750 id-wwww gary2-da@test.com",
        "500 0545 id-aaaa abc123@test.com",
        "525 1322 id-bbbb zyx321@test.com"]
text = [i.split() for i in text]
text.sort(key=lambda line: (line[2],line[1]))
text = [' '.join(i) for i in text]
print text
#Output:
['500 0545 id-aaaa abc123@test.com', 
'123 1200 id-aaaa stuart@test.com', 
'525 1322 id-bbbb zyx321@test.com', 
'839 1750 id-wwww gary2-da@test.com', 
'322 1812 id-wwww machine-switch@test.com']
于 2012-06-10T22:55:36.027 回答