基于密钥的CSV连接

2条回答

网友

1楼 · 编辑于 2024-10-01 19:30:08

您可以使用Pandas并使用age数据更新{}。通过将两个数据帧的索引分别设置为ID和{}，然后更新{}中的age列。之后，重新设置索引，使ID再次成为列。在

from StringIO import StringIO
import pandas as pd

info = StringIO("""Last Name,First Name,ID,phone,adress,age X [Total age: 100] |009076
abc, xyz, 1234, 982-128-0000, pqt,
bcd, uvw, 3124, 813-222-1111, tre, 
poi, ccc, 9087, 123-45607890, weq,""")


age = StringIO("""student_id,age_1
3124,20
9087,21
1234,45""")

info_df = pd.read_csv(info, sep=",", engine='python')
age_df = pd.read_csv(age, sep=",", engine='python')

info_df = info_df.set_index('ID')
age_df = age_df.set_index('student_id')
info_df['age X [Total age: 100] |009076'].update(age_df.age_1)
info_df.reset_index(level=0, inplace=True)
info_df

输出：

^{pr2}$

网友

2楼 · 编辑于 2024-10-01 19:30:08

试试这个。。。在

import csv

info = list(csv.reader(open("info.csv", 'rb')))
age = list(csv.reader(open("age.csv", 'rb')))

def copyCSV(age, info, outFileName = 'out.csv'):
    # put age into dict, indexed by ID
    # assumes no duplicate entries

    # 1 - build a dict ageDict to represent data
    ageDict = dict([(entry[0].replace(' ',''), entry[1]) for entry in age[1:] if entry != []])

    # 2 - setup output
    with open(outFileName, 'wb') as outFile:
        outwriter = csv.writer(outFile)
        # 3 - run through info and slot in ages and write to output
        # nb: had to use .replace(' ','') to strip out whitespaces - these may not be in original .csv
        outwriter.writerow(info[0])
        for entry in info[1:]:
            if entry != []:
                key = entry[2].replace(' ','')
                if key in ageDict: # checks that you have data from age.csv
                    entry[5] = ageDict[key]
            outwriter.writerow(entry)

copyCSV(age, info)

如果有什么不清楚的地方，请告诉我。我使用dict是因为如果你的文件非常大，它应该更快，因为你只需要在年龄.csv一次。在

也许有一种更简单的方法/一些已经实现的东西…但这应该能做到。在

相关问题更多 >

编程相关推荐

热门问题

热门文章

基于密钥的CSV连接

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >