<p>If you do not provide (or do not know) the column names, you can use the following more general approach:</p>
<pre><code>import pandas as pd

# separate categorical (object-dtype) columns from the rest
df_numeric = dataset.select_dtypes(exclude=['object'])
df_obj = dataset.select_dtypes(include=['object']).copy()
# now factorize categoricals: each distinct value becomes an integer code (missing values become -1)
for c in df_obj:
    df_obj[c] = pd.factorize(df_obj[c])[0]
# merge back into one dataframe (object columns end up after the others)
df_final = pd.concat([df_numeric, df_obj], axis=1)
df_final.reset_index(inplace=True, drop=True)
</code></pre>
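<p>To see what this does on concrete data, here is a minimal sketch of the same idea wrapped in a helper. The toy DataFrame, its column names, and the function name <code>factorize_object_columns</code> are made up for illustration and are not part of the original answer. Unlike the snippet above, this variant overwrites the object columns in place on a copy, so the original column order is kept.</p>

```python
import pandas as pd

# Hypothetical toy data; column names and values are invented for this example.
# dtype=object is set explicitly so the columns are object-dtype on any pandas version.
dataset = pd.DataFrame({
    "age": [25, 32, 47, 32],
    "city": pd.Series(["Paris", "Lima", "Paris", "Oslo"], dtype=object),
    "plan": pd.Series(["free", "pro", "free", "free"], dtype=object),
})


def factorize_object_columns(df):
    """Return a copy of df with every object-dtype column replaced by integer codes."""
    out = df.copy()
    for col in out.select_dtypes(include=["object"]).columns:
        # pd.factorize returns (codes, uniques); codes follow order of first appearance
        codes, _uniques = pd.factorize(out[col])
        out[col] = codes
    return out


encoded = factorize_object_columns(dataset)
print(encoded)
```

<p>Note that the integer codes only reflect the order in which values first appear, so they carry no meaningful ranking between categories.</p>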