擅长:python、mysql、java
<p>以下是我的看法:</p>
<pre><code>from itertools import combinations
def combine(batch):
"""Combine all products within one batch into pairs"""
return pd.Series(list(combinations(set(batch), 2)))
edges = df.groupby('Batch_ID')['Product_ID'].apply(combine).value_counts()
edges
#(B, C) 3
#(A, B) 1
#(A, C) 1
#(D, C) 1
</code></pre>
<p>我知道不需要0次出现的边。在</p>
<p>如果需要,可以将索引进一步拆分为源和目标:</p>
^{pr2}$
<p>或者:</p>
<pre><code>c = ['Source', 'Target']
L = edges.index.values.tolist()
edges = pd.DataFrame(L, columns=c).join(edges.reset_index(drop=True))
</code></pre>