I have a file like the following:

101 start_time
102 start_time
101 end_time
103 start_time
103 end_time
102 end_time
104 start_time
104 end_time
102 start_time
102 end_time

I want an output file like the following:

101 start_time end_time
102 start_time end_time
103 start_time end_time
104 start_time end_time
102 start_time end_time

How can this be done with basic sed or awk operations, or with perl? Please help!

3 Answers

How about:

awk '$1 in a{ print $1, a[$1], $2; delete a[$1]; next} {a[$1] = $2}' input
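
To make the behaviour concrete, here is a small sketch that wraps the same one-liner in a shell function and feeds it sample data. The function name `pair_on_end` and the timestamps are made up for illustration. Note that a pair is printed as soon as its end line arrives, so the output follows the order of the end lines, which is not necessarily the order of the start lines.

```shell
# Hypothetical wrapper around the one-liner above; reads the records from stdin.
pair_on_end() {
    awk '$1 in a { print $1, a[$1], $2; delete a[$1]; next } { a[$1] = $2 }'
}

# Made-up sample data: 102 starts before 103 but ends after it.
pair_on_end <<'EOF'
101 09:00
102 09:05
101 09:10
103 09:12
103 09:20
102 09:30
EOF
# prints:
# 101 09:00 09:10
# 103 09:12 09:20
# 102 09:05 09:30
```

If the output must follow the order of the start lines instead, the pending records have to be queued until all earlier ones are complete.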
answered 2013-03-13T22:42:29.313
perl -anE'say "@F end_time" if $F[1] eq "start_time"'
answered 2013-03-13T23:23:45.473

Here is a Perl approach.

  • Note 1: it is not very well written, but it works
  • Note 2: my answer assumes that "start_time" and "end_time" are not literal strings but timestamps or something similar

Here you go:

#!/usr/bin/perl
use warnings;
use strict;

my @waiting; #here we will keep track of the order
my %previous; #here we will save previous rows that still can't be printed
open (my $IN,'<','file.txt') or die "$!"; #assuming that your data is in file.txt
while (<$IN>) {
    chomp;
    my ($id,$time)=split/ /;
    if (exists $previous{$id}) { #if this is the end_time
        $previous{$id}->[1]=$time;
        if ($waiting[0] eq $id) { #if we are not waiting for another row's end_time (eq also handles non-numeric ids)
            my $count=0;
            for (@waiting) { #print anything you have available
                last if !defined $previous{$_}->[1];
                print join(' ',$_,@{$previous{$_}}),"\n"; #$_ is the id currently being flushed
                delete $previous{$_};
                $count++;
            }
            shift @waiting for 1..$count; 
        }
    }
    else { #if this is the start_time
        push @waiting,$id;
        $previous{$id}=[$time,undef];
    }
}
close $IN;
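
For comparison, the same queueing idea can be sketched in awk. This is only a rough equivalent, not a drop-in replacement: the function name `pair_in_start_order`, the array names, and the sample timestamps are all made up, and like the script above it assumes an id does not start again before its previous pair has been printed.

```shell
# Hypothetical helper (not from the answer): pairs lines per id, printing in start order.
pair_in_start_order() {
    awk '
    BEGIN { head = 0; tail = 0 }
    $1 in first {                       # id already open: this line carries its end value
        last[$1] = $2
        # flush from the front of the queue while the oldest record is complete
        while (head < tail && (queue[head] in last)) {
            id = queue[head]
            print id, first[id], last[id]
            delete first[id]; delete last[id]; delete queue[head]
            head++
        }
        next
    }
    { first[$1] = $2; queue[tail++] = $1 }   # new id: store start value, remember arrival order
    '
}

# Made-up sample data: 102 starts before 103 but ends after it.
pair_in_start_order <<'EOF'
101 09:00
102 09:05
101 09:10
103 09:12
103 09:20
102 09:30
EOF
# prints:
# 101 09:00 09:10
# 102 09:05 09:30
# 103 09:12 09:20
```

Here the pair for 103 is held back until 102 is complete, which is what keeps the output in start order.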
answered 2013-03-14T00:55:18.850