Pandas扔铁轨问题的回答

Pandas扔铁轨

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

我正在尝试对两个数据帧进行更改数据捕获。逻辑是合并两个数据帧并按一个键分组，然后对计数大于1的组运行循环，以查看哪个列“更新”。我犯了个奇怪的错误。感谢任何帮助。代码 <pre><code>import pandas as pd import numpy as np pd.set_option('display.height', 1000) pd.set_option('display.max_rows', 500) pd.set_option('display.max_columns', 500) pd.set_option('display.width', 1000) print("reading wolverine xlxs") # defining metadata df_header = ['DisplayName','StoreLanguage','Territory','WorkType','EntryType','TitleInternalAlias', 'TitleDisplayUnlimited','LocalizationType','LicenseType','LicenseRightsDescription', 'FormatProfile','Start','End','PriceType','PriceValue','SRP','Description', 'OtherTerms','OtherInstructions','ContentID','ProductID','EncodeID','AvailID', 'Metadata', 'AltID', 'SuppressionLiftDate','SpecialPreOrderFulfillDate','ReleaseYear','ReleaseHistoryOriginal','ReleaseHistoryPhysicalHV', 'ExceptionFlag','RatingSystem','RatingValue','RatingReason','RentalDuration','WatchDuration','CaptionIncluded','CaptionExemption','Any','ContractID', 'ServiceProvider','TotalRunTime','HoldbackLanguage','HoldbackExclusionLanguage'] df_w01 = pd.read_excel("wolverine_1.xlsx", names = df_header) df_w02 = pd.read_excel("wolverine_2.xlsx", names = df_header) df_w01['version'] = 'OLD' df_w02['version'] = 'NEW' #print(df_w01) df_m_d = pd.concat([df_w01, df_w02], ignore_index = True) first_pass = df_m_d[df_m_d.duplicated(['StoreLanguage','Territory','TitleInternalAlias','LocalizationType','LicenseType','FormatProfile'], keep=False)] first_pass_keep_duplicate = df_m_d[df_m_d.duplicated(['StoreLanguage','Territory','TitleInternalAlias','LocalizationType','LicenseType','FormatProfile'], keep='first')] group_by_1 = first_pass.groupby(['StoreLanguage','Territory','TitleInternalAlias','LocalizationType','LicenseType','FormatProfile']) for i,rows in group_by_1.iterrows(): print("rownumber", i) print (rows) print(first_pass) </code></pre> 我得到的错误是： ^{pr2}$ 任何帮助都是非常感谢的。在

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

Pandas扔铁轨

1 个回答

相关Python问题