python堆叠面积图

Area Courses Year 1900 Agriculture 0.0 1900 Architecture 32.0 1900 Astronomy 10.0 1900 Biology 20.0 1900 Chemistry 25.0 1900 Civil Engineering 21.0 1900 Education 14.0 1900 Engineering Design 10.0 1900 English 30.0 1900 Geography 1.0

Area Aeronautical Engineering ... Visual Design Year ... 1900 NaN ... NaN 1901 NaN ... NaN

df = pd.read_csv(filepath, encoding= 'unicode_escape') df = df.groupby(['Year','GenArea'])['Taught'].sum().to_frame(name = 'Courses').reset_index() plt.stackplot(df['Year'], df['Courses'], labels = df['GenArea']) plt.legend(loc='upper left') plt.show()

1条回答

网友

1楼 · 发布于 2024-09-30 22:19:14

有了额外的信息，我做了这个。希望你喜欢

import pandas as pd
import matplotlib.pyplot as plt

plt.close('all')

df=pd.read_csv('https://query.data.world/s/djx5mi7dociacx7smdk45pfmwp3vjo',
               encoding='unicode_escape')
df=df.groupby(['Year','GenArea'])['Taught'].sum().to_frame(name=
             'Courses').reset_index()
aux1=df.duplicated(subset='GenArea', keep='first').values
aux2=df.duplicated(subset='Year', keep='first').values

n=len(aux1);year=[];courses=[]

for i in range(n):
    if not aux1[i]:
        courses.append(df.iloc[i]['GenArea'])
    if not aux2[i]:
        year.append(df.iloc[i]['Year'])
    else:
        continue

del aux1,aux2
df1=pd.DataFrame(index=year)
s=0

for i in range(len(courses)):
    df1[courses[i]]=0
for i in range(n):
    string=df.iloc[i]['GenArea']
    if any(df1.iloc[s].values==0):
        df1.at[year[s],string]=df.iloc[i]['Courses']
    else:
        s+=1
        df1.at[year[s],string]=df.iloc[i]['Courses']

del year,courses,df
df1=df1[df1.columns[::-1]]
df1.plot.area(legend='reverse')

Example

相关问题更多 >

编程相关推荐

热门问题

热门文章