擅长:python、mysql、java
<p>诀窍是用<code>~df['race'].isin(['a', 'b', c'])</code>检查是否每个元素都(不是)在给定的列表中。下面是一个例子:</p>
<pre class="lang-py prettyprint-override"><code>from io import StringIO as sio
data = sio("""
sex age race
Male 0.204082 Hispanic
Male 0.122449 African-American
Female 0.163265 African-American
Male 0.081633 African-American
Male 0.530612 African-American
""")
import pandas as pd
df = pd.read_csv(data, sep='\s+').astype({'race': 'category'})
df_train_val_scaled = df[~df["race"].isin(["Native American", "Asian"])]
df_train_val_scaled
</code></pre>