擅长:python、mysql、java
<p>对唯一的国家名称调用<code>guess_country()</code>,并创建一个<code>country_map</code>系列对象,原始名称作为索引,转换后的名称作为值。然后可以使用<code>country_map[df.country]</code>进行转换。在</p>
<pre><code>import pandas as pd
c = ["abc","abc","ade","ade","ccc","bdc","bxy","ccc","ccx","ccb","ccx"]
v = range(len(c))
df = pd.DataFrame({"country":c, "data":v})
def guess_country(c):
return c[0]
uc = df.country.unique()
country_map = pd.Series(list(map(guess_country, uc)), index=uc)
df["country_id"] = country_map[df.country].values
print(df)
</code></pre>