Python Pandas读取具有可变前导长度的csv文件

HEADER = '#Start' def header_index(file_name): with open(file_name) as fp: for ind, line in enumerate(fp): if line.startswith(HEADER): return ind for row in directories: path2file = '%s%s%s' % (path2data, row, suffix) myDF = pd.read_csv(path2file, skiprows=header_index(path2file), header=0, delimiter='\t')

1条回答

网友

1楼 · 发布于 2024-09-27 07:18:44

现在这是可能的（不知道当时是否可行），如下所示：

pos= 0
oldpos = None

while pos != oldpos:  # make sure we stop reading, in case we reach EOF
    line= fp.readline()
    if line.startswith(HEADER):
        # set the read position to the start of the line
        # so pandas can read the header
        fp.seek(pos)
        break
    oldpos= pos
    pos= fp.tell()    # renenber this position as sthe start of the next line

pd.read_csv(fp, ...your options here...)

编程相关推荐

java IntelliJ IDEA CreativeProcess错误=193，%1不是有效的Win32应用程序
在java中返回多个值（字符串和数组）
我们可以使用java驱动程序。在pom类中查找数据？
java是处理请求后数据的有效方法
用于小文件的java音频缓存安卓 studio
使用Java exec的postgresql额外psql命令行参数
java导入语句代码错误
使用服务上传java Android HTTPS文件（从HTTP转换为HTTPS）
启动配置服务器组织时发生java Microservice错误。springframework。靴子上下文财产。绑定绑定结果
swing Java:无法在JFrame中显示图像

相关问题更多 >

编程相关推荐

热门问题

热门文章

Python Pandas读取具有可变前导长度的csv文件

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >