擅长:python、mysql、java
<p>首先,我将在<code>job_industry</code>和<code>job_title</code>之间创建一个映射(Python dict),然后将<code>job_industry</code>列的映射分配给job_title的<code>NaN</code>值</p>
<p>代码如下:</p>
<pre class="lang-py prettyprint-override"><code>df = pd.DataFrame(
columns=["job_title", "job_industry"],
data=[["Quality Engineer", "Financial Services"], ["Quality Engineer", None]]
)
# May be there is a faster way
title_industry_mapping = df.dropna(["job_industry"]).set_index("job_title")["job_industry"].drop_duplicates().to_dict()
isna = df["job_industry"].isna()
df.loc[isna, "job_industry"] = df.loc[isna, "job_title"].replace(title_industry_mapping)
</code></pre>
<p>结果:</p>
<div class="s-table-container">
^{tb1}$
</div>