python在whitesp上拆分字符串

2024-05-18 10:09:22 发布

您现在位置:Python中文网/ 问答频道 /正文

我正在尝试将复制/粘贴的文本转换为csv,这样我可以在之后拆分。 问题是里面有空白标签,我似乎无法去掉

复制/粘贴示例:

Amarr Hybrid Tech Decryptor 12  Decryptors - Hybrid         12 m3
Ancient Coordinates Database    23  Sleeper Components          2.30 m3
Caldari Hybrid Tech Decryptor   17  Decryptors - Hybrid         17 m3
Carbon  17  General         34 m3
Cartesian Temporal Coordinator  4   Ancient Salvage         0.04 m3
Central System Controller   2   Ancient Salvage         0.02 m3

现在我想得到这样的东西:

Amarr Hybrid Tech Decryptor,12,Decryptors - Hybrid,12,m3,
Ancient Coordinates Database,23,Sleeper Components,2.30,m3,
Caldari Hybrid Tech Decryptor,17,Decryptors - Hybrid,17,m3,
Carbon,17,General,34,m3,
Cartesian Temporal Coordinator,4,Ancient Salvage,0.04,m3,
Central System Controller,2,Ancient Salvage,0.02,m3,

(将始终是每行5个分隔符

我试着用不同的方法 Split by comma and strip whitespace in Python 但我似乎无法让它工作。

@login_required
def index(request):
    if request.method == "POST":
        form = SellListForm(request.POST)
        if form.is_valid():
            selllist = form.save(commit=False)
            selllist.user = request.user
            string = selllist.sell
            string = [x.strip() for x in string.split(',')] 
            print string
            return HttpResponseRedirect(reverse('processed'))
    else:
        form = SellListForm()
    return render(request, 'index.html', {'form': form})

回报

[u'<<<SULTS STUFF>>>\t\t\tVoucher\t\t\t0 m3\r\nAmarr Hybrid Tech Decryptor\t12\tDecryptors - Hybrid\t\t\t12 m3\r\nAncient Coordinates Database\t23\tSleeper Components\t\t\t2.30 m3\r\nCaldari Hybrid Tech Decryptor\t17\tDecryptors - Hybrid\t\t\t17 m3\r\nCarbon\t17\tGeneral\t\t\t34 m3\r\nCartesian Temporal Coordinator\t4\tAncient Salvage\t\t\t0.04 m3\r\nCentral System Controller\t2\tAncient Salvage\t\t\t0.02 m3']

Tags: formstringrequestcomponentsdatabasetechm3coordinates
2条回答

您可以在选项卡上split

line = line.split('\t')

除非您特别需要逗号分隔的值,否则您只需将文本直接粘贴到一个文件中,open它,split在选项卡上使用数据,而不必引入逗号。

我看到你有时有好几个\t。我将使用re模块正确拆分:

for line in lines:
    linedata = re.split(r'\t+', line)
    print ",".join(linedata)

相关问题 更多 >