<p>This works, though I'm not sure how well it would hold up on a huge dataset.</p>
<pre><code>import pandas as pd

csv = r"""dummy,obj,loc,query
bar,6usrg82hwsa3,a,'select * from abc'
bar,b6usrg82hwsa3,a,'select * from abc'
bar,4g9cgbm813czs,a,'select * from abc'
bar,fhf8upax5cxsz,b,'select * from abc'
bar,cnphq355f5rah,b,'select * from abc'
bar,b6usrg82hwsa3,b,'SELECT LIST(HIGHLIGHT, ',') WITHIN GR...'"""
# The smallest field count across all lines is taken as the true column count.
lengths = []
for line in csv.split('\n'):
    lengths.append(line.count(',') + 1)
n_columns = min(lengths)

rows = []
for line in csv.split('\n'):
    temp = line.split(',')
    # The first n_columns - 1 fields can be used as they are.
    row = temp[:(n_columns-1)]
    # Whatever is left belongs to the last column: rejoin it,
    # restoring the commas that split(',') removed.
    temp = temp[(n_columns-1):]
    row.append(','.join(temp))
    rows.append(row)
data = pd.DataFrame(data=rows[1:], columns=rows[0])
print(data)
# dummy obj loc query
# 0 bar 6usrg82hwsa3 a 'select * from abc'
# 1 bar b6usrg82hwsa3 a 'select * from abc'
# 2 bar 4g9cgbm813czs a 'select * from abc'
# 3 bar fhf8upax5cxsz b 'select * from abc'
# 4 bar cnphq355f5rah b 'select * from abc'
# 5 bar b6usrg82hwsa3 b 'SELECT LIST(HIGHLIGHT, ',') WITHIN GR...'
</code></pre>
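<p>On the concern about huge datasets: a possible single-pass variant is sketched below. It leans on the <code>maxsplit</code> argument of <code>str.split</code>, so every comma after the first <code>n_columns - 1</code> stays inside the last field. The helper name <code>read_trailing_comma_csv</code> is made up for illustration, and the sketch assumes the header line defines the column count (rather than the minimum over all lines, as above) and that only the last column can contain commas. I have not benchmarked it.</p>

```python
import io

import pandas as pd


def read_trailing_comma_csv(lines):
    """Parse CSV-like lines where only the LAST column may contain commas.

    `lines` is any iterable of strings, e.g. an open file object.
    Assumption: the header line has no embedded commas, so its field
    count is the real number of columns.
    """
    lines = iter(lines)
    header = next(lines).rstrip('\n').split(',')
    n_columns = len(header)
    rows = [
        # Split at most n_columns - 1 times; the remainder of the line,
        # commas included, becomes the last field.
        line.rstrip('\n').split(',', n_columns - 1)
        for line in lines
        if line.strip()  # skip blank lines
    ]
    return pd.DataFrame(rows, columns=header)


# Usage with an in-memory sample; a real file would be passed as open(path).
sample = io.StringIO(
    "dummy,obj,loc,query\n"
    "bar,6usrg82hwsa3,a,'select * from abc'\n"
    "bar,b6usrg82hwsa3,b,'SELECT LIST(HIGHLIGHT, ',') WITHIN GR...'\n"
)
data = read_trailing_comma_csv(sample)
print(data)
```

<p>This reads the input once instead of twice and avoids the manual rejoin, but it still builds every row in memory before constructing the DataFrame.</p>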