在列之间添加，跳过并保留一些行/列

3条回答

网友

1楼 · 编辑于 2024-10-02 14:18:01

您的文件不是“部分csv”（看不到逗号）；它们是（部分）空格分隔的。您可以逐行读取文件，使用Python的.split()方法将相关字符串转换为子字符串列表，然后根据需要重新排列这些片段。拆分和重新组装可能如下所示：

input_line = "'Z1' 30 26 1 1 5 7 /"  # test data
input_items = input_line.split()
output_items = ["'Z_NEW'", '1000']
output_items.append(input_items[1])
output_items.append(input_items[2])
output_items.append(input_items[3])
output_items.append('/')
output_line = ' '.join(output_items)
print(output_line)

最后的print()语句显示结果字符串是

'Z_NEW' 1000 30 26 1 /

网友

2楼 · 编辑于 2024-10-02 14:18:01

下面是使用Perl的一种方法：

#!/usr/bin/perl
use strict;
use warnings;

# initialize output array
my @output = ('KW_NEW');

# proceed first file
open my $fh1, '<', 'in1.txt' or die "unable to open file1: $!";
while(<$fh1>) {
    # consider only lines after KW2
    if (/KW2/ .. eof) {
        # Don't treat KW2 line
        next if /KW2/;
        # split the current line on space and keep only the fifth first element
        my @l = (split ' ', $_)[0..4];
        # change the first element
        $l[0] = 'Z_NEW';
        # insert 1000 at second position
        splice @l,1,0,1000;
        # push into output array
        push @output, "@l";
    }
}

# proceed second file
open my $fh2, '<', 'in2.txt' or die "unable to open file2: $!";
while(<$fh2>) {
    if (/KW2/ .. eof) {
        next if /KW2/;
        my @l = (split ' ', $_)[0..4];
        $l[0] = 'Z_NEW';
        splice @l,1,0,1000;
        push @output, "@l";
    }
}

# write array to output file
open my $fh3, '>', 'out.txt' or die "unable to open file3: $!";
print $fh3 $_,"\n" for @output;

网友

3楼 · 编辑于 2024-10-02 14:18:01

你的文件格式是静态的吗？（顺便说一句，这实际上不是csv:P）您可能需要研究一种标准化的文件格式，如JSON或strict csv来存储数据，以便可以使用现有的工具来解析输入文件。python有很好的JSON和CSV库，可以为您完成所有困难的工作。你知道吗

如果你被这种文件格式困住了，我会尝试类似的方法。你知道吗

path = '<input_path>'
kws = ['KW1', 'KW2']
desired_kw = kws[1]

def parse_columns(line):
    array = line.split()
    if array[-1] is '/':
        # get rid of trailing slash
        array = array[:-1]

def is_kw(cols):
    if len(cols) > 0 and cols[0] in kws:
        return cols[0]

# to parse the section denoted by desired keyword
with open(path, 'r') as input_fp:
    matrix = []
    reading_file = False
    for line in input_fp.readlines:
        cols = parse_columns(line)
        line_is_kw = is_kw(line)
        if line_is_kw:
            if not reading_file:
                if line_is_kw is desired_kw:
                    reading_file = True
                else:
                    continue
            else:
                break

        if reading_file:
            matrix = cols

print matrix

在那里，您可以使用诸如切片表示法和基本列表操作之类的方法来获得所需的数组。祝你好运！你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章

在列之间添加，跳过并保留一些行/列

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >