把txt文件读入Python？

2244018 #*OQL[C++]: Extending C++ with an Object Query Capability. #@José A. Blakeley #year1995 #confModern Database Systems #citation14 #index0 #arnetid2 #*Transaction Management in Multidatabase Systems. #@Yuri Breitbart,Hector Garcia-Molina,Abraham Silberschatz #year1995 #confModern Database Systems #citation22 #index1 #arnetid3 #*Overview of the ADDS System. #@Yuri Breitbart,Tom C. Reyes #year1995 #confModern Database Systems #citation-1 #index2 #arnetid4

#* --- paperTitle #@ --- Authors #year ---- Year #conf --- publication venue #citation --- citation number (both -1 and 0 means none) #index ---- index id of this paper #arnetid ---- pid in arnetminer database #% ---- the id of references of this paper (there are multiple lines, with each indicating a reference) #! --- Abstract

1条回答

网友

1楼 · 发布于 2024-09-27 19:29:04

我的正则表达式没有达到应有的速度，但只要数据保持相同的形式，并且列名在其他行中不重复，下面的方法就可以工作：

import re
import pandas as pd

path = r"filepath.txt"

f = open(path, 'r')

year = []
confModern = []
#continue for all columns

for ele in f:
    if len(re.findall('year', ele)) > 0:
       year.append(ele[5:])
    if len(re.findall('confModern', ele)) > 0:
       year.append(ele[12:])
    # continue for all columns with the needed string

df = pd.DataFrame(data={'year' : year ...#continue for each list})

相关问题更多 >

编程相关推荐

热门问题

热门文章