我已经根据一些条件从旧的pandas文件创建了新的pandas文件。问题是新pandas文件中的索引不是从0到大小,索引只是从旧pandas中获取的。 我做错了什么
这是我的代码:
import pandas as pd
import numpy as np
from collections import Counter
import math
df_OnlineRetail = pd.read_csv("OnlineRetailNEW.csv")
#print(df_OnlineRetail.head())
df_size = df_OnlineRetail.shape[0]
#print("No of rows:", df_size)
def get_frequency():
customer_list = df_OnlineRetail['CustomerID'].tolist()
no_of_purchases = Counter(customer_list)
customers = []
purchases = []
for key, value in no_of_purchases.items():
customers.append(key)
purchases.append(value)
frequency_dict = {'CustomerID':customers,
'Purchases':purchases}
df_frequency = pd.DataFrame(frequency_dict)
df_frequency = df_frequency.reset_index(drop=True)
print(df_frequency.sort_values(by="Purchases", ascending = False))
Frequency = get_frequency()
您的新数据帧似乎按
Purchases
降序排序。它所修改的原始数据帧显然不是。因此,索引(除非您reindexdf.reindex()
,否则将保持原始数据帧中的索引)现在不再按升序排列相关问题 更多 >
编程相关推荐