Finding the first and last sublist with a matching attribute in a list of lists

    import csv
    import pprint

    pp = pprint.PrettyPrinter(width=200)

    def clean_data(text, rows):
        # Keep only the columns of interest from each space-delimited row.
        reader = csv.reader(text, delimiter=' ')
        newlist = [row[1:6] + row[9:12] + row[16:18] for row in reader]
        return newlist[:rows]

    with open('E:/Downloads/TNFL09/20120506_m1.so6', 'r') as text:
        data = clean_data(text, 90)
    pp.pprint(data)

3 answers

User
Answer #1 · edited 2024-09-29 19:19:02

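One approach, assuming your list is sorted by sequence number (which it appears to be), is to run it through a generator that aggregates the rows of each flight together:

    def aggregate_flights(flights):
        out = []
        last_id = ''
        for row in flights:
            # A change of flight ID means the previous flight is complete.
            if row[-2] != last_id and len(out) > 0:
                yield (last_id, out)
                out = []
            last_id = row[-2]
            out.append((row[3], row[4]))  # 2-tuple of (start, end)
        yield (last_id, out)

For your example input:

    list(aggregate_flights(agg))
    Out[21]:
    [('156912756', [('083914', '084141')]),
     ('156912546', [('005500', '010051'), ('010051', '010310')])]

It's a bit messy, but you get the idea. For each flight you get a list of (start, end) 2-tuples, which you can process further to get the overall (start, end) for that flight. You could even modify the generator to yield only the overall (start, end), but I prefer to work in smaller, modular pieces that are easy to debug.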
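As a rough sketch of that further processing: assuming each flight's list of (start, end) pairs is non-empty and already in sequence order, the overall pair is simply the first start and the last end. The helper name `overall_span` is hypothetical and not part of the question or the answer.

```python
def overall_span(segments):
    """Collapse [(start, end), ...] into one overall (start, end) pair."""
    # Assumes the segments are in sequence order and the list is non-empty.
    return (segments[0][0], segments[-1][1])

# Sample data shaped like the output of aggregate_flights shown above.
aggregated = [
    ('156912756', [('083914', '084141')]),
    ('156912546', [('005500', '010051'), ('010051', '010310')]),
]
spans = [(flight_id, overall_span(segs)) for flight_id, segs in aggregated]
```

Keeping this as a separate step leaves the generator unchanged, which fits the preference for small pieces that are easy to debug.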
If the input is not sorted, you will need to accumulate the data with a defaultdict. Give it a list factory and append a (start, end) tuple for each row.
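A minimal sketch of the defaultdict approach, assuming the same row layout as the generator (row[-2] is the flight ID, row[3] the start time, row[4] the end time). The function name `aggregate_unsorted` and the sample rows are made up for illustration.

```python
from collections import defaultdict

def aggregate_unsorted(flights):
    """Group (start, end) pairs by flight ID without relying on row order."""
    by_flight = defaultdict(list)  # list factory: missing keys start as []
    for row in flights:
        # Assumed layout: row[-2] is the flight ID,
        # row[3] the start time and row[4] the end time.
        by_flight[row[-2]].append((row[3], row[4]))
    return dict(by_flight)

# Hypothetical rows: only the indexed columns carry meaningful values.
rows = [
    ['a', 'b', 'c', '005500', '010051', 'x', 'y', 'z', '156912546', '1'],
    ['a', 'b', 'c', '083914', '084141', 'x', 'y', 'z', '156912756', '1'],
    ['a', 'b', 'c', '010051', '010310', 'x', 'y', 'z', '156912546', '2'],
]
grouped = aggregate_unsorted(rows)
```

The pairs for each flight stay in input order, so if the rows of a single flight can also arrive out of sequence, they would still need sorting by sequence number before taking the first start and last end.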
Edit: as requested, here is a modification that yields only a single (start, end) pair per flight:

    def aggregate_flights(flights):
        last_id, start, end = None, None, None
        for row in flights:
            if row[-2] != last_id and last_id is not None:
                yield (last_id, (start, end))
                start, end = None, None
            if start is None:
                start = row[3]  # first start time seen for this flight
            last_id = row[-2]
            end = row[4]  # overwritten per row, so it ends up as the last end time
        yield (last_id, (start, end))

At this point I'd note that the output is getting too ugly (an (id, (start, end)) tuple, ugh), so I would move up to a namedtuple to make things nicer:

    from collections import namedtuple

    Flight = namedtuple('Flight', ['id', 'start', 'end'])

Now you have:

    def aggregate_flights(flights):
        last_id, start, end = None, None, None
        for row in flights:
            if row[-2] != last_id and last_id is not None:
                yield Flight(last_id, start, end)
                start, end = None, None
            if start is None:
                start = row[3]
            last_id = row[-2]
            end = row[4]
        yield Flight(last_id, start, end)

    list(aggregate_flights(agg))
    Out[18]:
    [Flight(id='156912756', start='083914', end='084141'),
     Flight(id='156912546', start='005500', end='010310')]

Much better.

User
Answer #2 · edited 2024-09-29 19:19:02

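I can't tell whether your list is already sorted by flight ID and sequence number. To sort it, you can do the following on your list of lists:

    from operator import itemgetter

    # Use sort() if the original list order does not need to be preserved;
    # otherwise use sorted() and assign the result to a new variable.
    flightInfo.sort(key=itemgetter(8, 9))

The sort above orders first by flight number, then by sequence number. To extract what you need, you can do the following:

    prev, startTime = None, None
    results = []
    for i, info in enumerate(flightInfo):
        if prev is None or prev != info[8]:
            if prev is not None:
                # Use a list instead of a tuple if these values need to be modified later.
                results.append((prev, startTime, flightInfo[i - 1][4]))
            startTime = info[3]
            prev = info[8]
    # The loop only records a flight when the next one starts, so add the last flight here.
    if prev is not None:
        results.append((prev, startTime, flightInfo[-1][4]))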
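The same sort-then-scan idea can also be sketched with `itertools.groupby`, which is a swapped-in technique rather than what this answer wrote. The sketch assumes index 8 is the flight ID, index 9 the sequence number, index 3 the start time and index 4 the end time, and it further assumes that sequence numbers are numeric strings so they can be compared as integers. The function name `first_last_times` and the sample rows are hypothetical.

```python
from itertools import groupby
from operator import itemgetter

def first_last_times(flight_info):
    """Return (flight_id, first_start, last_end) for each flight."""
    # Assumed layout: index 8 = flight ID, index 9 = sequence number,
    # index 3 = start time, index 4 = end time.
    # Sequence numbers are compared as integers so that '10' sorts after '2'.
    ordered = sorted(flight_info, key=lambda r: (r[8], int(r[9])))
    results = []
    for flight_id, group in groupby(ordered, key=itemgetter(8)):
        segments = list(group)
        results.append((flight_id, segments[0][3], segments[-1][4]))
    return results

# Hypothetical rows: only the indexed columns carry meaningful values.
sample = [
    ['a', 'b', 'c', '010051', '010310', 'x', 'y', 'z', '156912546', '2'],
    ['a', 'b', 'c', '083914', '084141', 'x', 'y', 'z', '156912756', '1'],
    ['a', 'b', 'c', '005500', '010051', 'x', 'y', 'z', '156912546', '1'],
]
summary = first_last_times(sample)
```

Because `groupby` only groups consecutive items, the sort on the flight ID has to come first; `sorted` is used here so the caller's list is left untouched.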

User
Answer #3 · edited 2024-09-29 19:19:02

You can use the built-in map function, with full_list being the list of flights:

# python.py

def extract(flight):
    """ param flight: list containing one flight's data. """
    # Build one {flight_id: [start, end]} dictionary for this row.
    return {flight[9]: [flight[3], flight[4]]}

# result is a list of dictionaries, one per row of full_list.
# In Python 3 map() is lazy, so wrap it in list() to actually run extract.
result = list(map(extract, full_list))

That should do the trick.
