import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
%matplotlib inline
df = pd.read_csv("G:\learning python\medical-data visualizer/medical_examination.csv")
df["overweight"] = (df["weight"]/pow(df["height"]/100, 2) > 25).astype(int)
df["cholesterol"] = (df["cholesterol"] > 1).astype(int)
df["gluc"] = (df["gluc"] > 1).astype(int)
df_cat = pd.melt(df, id_vars =["cardio"], value_vars = ["cholesterol", "gluc", "smoke", "alco", "active", "overweight"])
df_cat = df_cat.groupby(['cardio','variable','value']).size()
print(df_cat)
这是我的系列:
cardio variable value
0 active 0 6378
1 28643
alco 0 33080
1 1941
cholesterol 0 29330
1 5691
gluc 0 30894
1 4127
overweight 0 15915
1 19106
smoke 0 31781
1 3240
1 active 0 7361
1 27618
alco 0 33156
1 1823
cholesterol 0 23055
1 11924
gluc 0 28585
1 6394
overweight 0 10539
1 24440
smoke 0 32050
1 2929
我想将它转换为一个数据帧,其中包含系列中最后一个未命名列的列名cardio、variable、value和total。我尝试使用.to_frame(),但dataframe只使用一个列名,因此我无法正确地放置所有四个列名。我该怎么做?提前谢谢
目前没有回答
相关问题 更多 >
编程相关推荐