<p>In the spirit of StackOverflow, here is how I solved my problem.</p>
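<pre><code>import pandas as pd

# DataFrame.from_csv has been removed from pandas; read_csv with
# index_col=0 matches its old default of using the first column as the index.
data = pd.read_csv('MYDATA.csv', index_col=0)
# Count the rows for each (CompanyName, QualityIssue) pair.
byqualityissue = data.groupby(["CompanyName", "QualityIssue"]).size()
# Pivot QualityIssue into columns; pairs that never occur become 0 instead of NaN.
formatted = byqualityissue.unstack(level=-1, fill_value=0)
formatted.to_csv('byqualityissue.csv', header=True)
# Append a per-company row total as a new column.
includingtotals = formatted.assign(Total=formatted.sum(axis=1))
# sort_index(by=...) has been removed; sort_values replaces it.
# The result is not named "sorted", to avoid shadowing the built-in.
sorted_totals = includingtotals.sort_values(by='Total', ascending=False)
</code></pre>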
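<p>I used unstack to reorganize the data, replaced the NaN values with 0, summed each row, appended those sums as a new column, and then sorted by that column.</p>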
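<p>To see what each step does without access to the CSV file, here is a minimal sketch of the same pipeline on a small in-memory frame. The sample rows and the names <code>counts</code>, <code>table</code>, and <code>result</code> are made up for illustration; only the column names CompanyName and QualityIssue come from the code above.</p>

```python
import pandas as pd

# Hypothetical sample rows standing in for MYDATA.csv (values are invented).
data = pd.DataFrame({
    "CompanyName": ["Acme", "Acme", "Acme", "Beta", "Beta", "Gamma"],
    "QualityIssue": ["Late", "Late", "Damaged", "Damaged", "Wrong item", "Late"],
})

# Step 1: count rows per (company, issue) pair; the result is a Series
# with a two-level index.
counts = data.groupby(["CompanyName", "QualityIssue"]).size()

# Step 2: move the issue level into columns, filling missing pairs with 0.
table = counts.unstack(fill_value=0)

# Step 3: sum across each row and store the sum in a new "Total" column.
table["Total"] = table.sum(axis=1)

# Step 4: sort companies by their total, largest first.
result = table.sort_values(by="Total", ascending=False)
print(result)
```

<p>With this sample, Acme comes first with a total of 3, followed by Beta with 2 and Gamma with 1, and pairs that never occur (such as Beta with Late) show 0 rather than NaN.</p>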