回答此问题可获得 20 贡献值,回答如果被采纳可获得 50 分。
<p>我想从这个数据帧获取标签的分布:</p>
<pre><code>df=pd.DataFrame([
[43,{"tags":["webcom","start","temp","webcomfoto","dance"],"image":["https://image.com/Kqk.jpg"]}],
[83,{"tags":["yourself","start",""],"image":["https://images.com/test.jpg"]}],
[76,{"tags":["en","webcom"],"links":["http://webcom.webcomdb.com","http://webcom.webcomstats.com"],"users":["otole"]}],
[77,{"tags":["webcomznakomstvo","webcomzhiznx","webcomistoriya","webcomosebe","webcomfotografiya"],"image":["https://images.com/nt4wzguoh/y_a3d735b4.jpg","https://images.com/sucb0u24x/b1sd_Naju.jpg"]}],
[81,{"tags":["webcomfotografiya"],"users":["myself","boattva"],"links":["https://webcom.com/nk"]}],
],columns=["_id","tags"])
</code></pre>
<p>我需要得到一个表,其中的'id'和特定数量的标签。
例如</p>
^{pr2}$
<p>当“tags”是唯一的字段时,我使用了<a href="https://stackoverflow.com/questions/50722056/how-to-split-text-data-and-count-number-of-occurrences-in-pandas-dataframe/50722221">this approach</a>。在这个数据框中,我还有“image”、“users”和其他带值的文本字段。在这种情况下,我应该如何处理数据?在</p>
<p>谢谢你</p>