使用pandas重塑长列csv文件，以获得适当的dataframe表

df1 = pd.DataFrame(['CompA','$200','$450','10.3x','50.0%' ,'CompB','$300','$50','13.2x','40.0%', 'CompC','$100','$150','2.8x','13.5%', 'CompD','$150','$250','3.8x','53.2%' ])

2条回答

网友

1楼 · 编辑于 2024-09-30 22:20:02

您可以使用column_names命令来^{}数据帧中的值

pd.DataFrame(df1.to_numpy().reshape(-1, len(column_names)), columns=column_names)

输出：

  Company name Revenues Gross Profit P/E Multiple Operating Margin
0        CompA     $200         $450        10.3x            50.0%
1        CompB     $300          $50        13.2x            40.0%
2        CompC     $100         $150         2.8x            13.5%
3        CompD     $150         $250         3.8x            53.2%

网友

2楼 · 编辑于 2024-09-30 22:20:02

你几乎是对的。Pivot可以以这种方式工作，但是，它需要三件事：要透视的值、要透视的列和索引

我不认为有必要在这里手动计数

# Get number of entities in long list
n_entities = int(len(df)/len(column_names))

# Generates n-repetitions of column_names and assign to df for pivot
df['col_name'] = column_names * n_entities 

# Generate and assign an index column
index_vals = []
for i in range(n_entities):
    index_vals.extend([str(i)]*len(column_names))
df['index_val'] = index_vals 

df.pivot(index = 'index_val', columns='col_name', values=0)

相关问题更多 >

编程相关推荐

热门问题

热门文章

使用pandas重塑长列csv文件，以获得适当的dataframe表

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >