<p>我有两个数据帧,df1,df2,我想在其中连接两个不同的表名。但是,我得到的最终输出记录数呈指数级增长</p>
<p><strong>数据</strong></p>
<p>df1</p>
<pre><code>site planq tr unit alias energy
ny q1 22 du1 du_cc 10
ny q1 22 du1 du_cc 10
ny q1 22 du1 du_cc 10
dc q2 22 aa1 aa_cc 8
dc q2 22 aa1 aa_cc 8
dc q2 22 aa1 aa_cc 8
dc q2 22 aa1 aa_cc 8
</code></pre>
<p>df2</p>
<pre><code>serial sku type reason id
34444 d1 d ok ny
34444 d1 d ok ny
34444 d1 d ok ny
555 a1 a np dc
555 a1 a np dc
</code></pre>
<p><strong>所需的</strong></p>
<pre><code>site planqtr unit alias energy serial sku type reason id
ny q1 22 du1 du_cc 10 34444 d1 d ok ny
ny q1 22 du1 du_cc 10 34444 d1 d ok ny
ny q1 22 du1 du_cc 10 34444 d1 d ok ny
dc q2 22 aa1 aa_cc 8 555 a1 a np dc
dc q2 22 aa1 aa_cc 8 555 a1 a np dc
dc q2 22 aa1 aa_cc 8 NaN NaN NaN NaN NaN
dc q2 22 aa1 aa_cc 8 NaN NaN NaN NaN NaN
</code></pre>
<p><strong>做</strong></p>
<pre><code>join = pd.merge(df1, df2, how='left',left_on= ['id'], right_on = ['site'])
</code></pre>
<p>但是,我得到的最终输出记录数呈指数级增长。我不确定为什么输出显示数千行输出。如有任何建议,我们将不胜感激</p>