基于特定列条件从数据帧获取所有行组合？问题的回答

基于特定列条件从数据帧获取所有行组合？

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

您可以使用<a href="https://stackoverflow.com/a/29780529/189418">this answer</a>中描述的方法生成一个新的数据帧，其中包含来自原始数据的三行的所有组合： <pre class="lang-py prettyprint-override"><code>from itertools import combinations import pandas as pd # Using skbrhmn's df df = pd.DataFrame({"Calories": [100, 200, 300, 400, 500], "Protein": [10, 20, 30, 40, 50], "IsBreakfast": [1, 1, 0, 0, 0], "IsLunch": [1, 0, 0, 0, 1], "IsDinner": [1, 1, 1, 0, 1]}) comb_rows = list(combinations(df.index, 3)) comb_rows </code></pre> 输出： <pre><code>[(0, 1, 2), (0, 1, 3), (0, 1, 4), (0, 2, 3), (0, 2, 4), (0, 3, 4), (1, 2, 3), (1, 2, 4), (1, 3, 4), (2, 3, 4)] </code></pre> 然后创建一个新的DataFrame，包含原始帧中所有数值字段的总和，覆盖三行的所有可能组合： <pre><code>combinations = pd.DataFrame([df.loc[c,:].sum() for c in comb_rows], index=comb_rows) print(combinations) Calories Protein IsBreakfast IsLunch IsDinner (0, 1, 2) 600 60 2 1 3 (0, 1, 3) 700 70 2 1 2 (0, 1, 4) 800 80 2 2 3 (0, 2, 3) 800 80 1 1 2 (0, 2, 4) 900 90 1 2 3 (0, 3, 4) 1000 100 1 2 2 (1, 2, 3) 900 90 1 0 2 (1, 2, 4) 1000 100 1 1 3 (1, 3, 4) 1100 110 1 1 2 (2, 3, 4) 1200 120 0 1 2 </code></pre> 最后，您可以应用所需的任何筛选器： <pre class="lang-py prettyprint-override"><code>filtered = combinations[ (combinations.IsBreakfast>0) & (combinations.IsLunch>0) & (combinations.IsDinner>0) & (combinations.Calories>600) & (combinations.Calories<1000) & (combinations.Protein>=80) & (combinations.Protein<120) ] print(filtered) Calories Protein IsBreakfast IsLunch IsDinner (0, 1, 4) 800 80 2 2 3 (0, 2, 3) 800 80 1 1 2 (0, 2, 4) 900 90 1 2 3 </code></pre>

基于特定列条件从数据帧获取所有行组合？

1 个回答

相关Python问题