擅长:python、mysql、java
<p>您可以使用<code>pd.get_dummies()</code>来实现一个热编码结果,并使用<code>lambda()</code>函数来过滤分数是否为>;3或不是</p>
<pre><code>df = pd.DataFrame({
'project': ['abc', 'Sys', 'Syst' ,'EPS', 'foss', 'opc'],
'location': ['Bangalore' ,'Bangalore', 'Chennai', 'Bangalore', 'Bangalore', 'Noida'],
'category': ['s', 's', 'task', 's', 's', 'task'],
'ftecount': [41, 35.3, 2, 5, 6, 1],
'bpssscore': [3.98, 3.98, 3.98, 3.98, 3.98, 3.98]
})
df['ftecount>3'] = df['ftecount'].apply(lambda x: 1 if x>3 else 0)
df['bpssscore>3'] = df['bpssscore'].apply(lambda x: 1 if x>3 else 0)
df = pd.concat([df['project'],
pd.get_dummies(df['location'], prefix='loc'),
pd.get_dummies(df['category'], prefix='cat'),
df['ftecount>3'],
df['bpssscore>3']
], axis=1)
</code></pre>
<p>输出测向</p>
<p><a href="https://i.stack.imgur.com/LkyRq.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/LkyRq.png" alt="enter image description here"/></a></p>