Python：在两个文件中查找字符串并打印所有行

import pandas as pd #ARRAY my_value = [] cluster_value = [] #READ THE FILES my_data_file = pd.read_csv('my_data.txt', sep=',') log_file = pd.read_csv('log.txt', sep=',') #TAKE THE COLUMN WITH THE CLUSTERS for row in my_data_file[my_data_file.columns[1]]: my_value.append(row) for row in log_file[log_file.columns[0]]: cluster_value.append(row) #Restult print("_______________") print(list(set(my_value) & set(cluster_value))) print("_______________")

1条回答

网友

1楼 · 发布于 2024-06-26 17:40:02

使用正则表达式

不需要熊猫来读取这个简单的文件

代码

import re

def search(key_file, search_file):
    with open(key_file) as kfile:
      keys = '|'.join(line.rstrip().split(',')[0] for line in kfile.readlines())
    # regex for cluster names
    regex = re.compile(keys)

    with open(search_file) as search_data:
      for line in search_data:
        if regex.search(line):
          print(line.rstrip())

search('mydata.txt', 'log.txt')

输入

'mydata.txt'（注'，'无所谓，即忽略）

clusterB,
clusterZ

'log.txt'

2019, clusterB, log
2020, clusterC, log
2017, clusterZ, log

输出

2019, clusterB, log
2017, clusterZ, log

相关问题更多 >

编程相关推荐

热门问题

热门文章