Scraping 1A. Risk Factors from 10-K filings

Posted 2024-10-02 02:44:32


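I am trying to extract the 1A. Risk Factors section from each 10-K filing. I have downloaded the filings and saved them as txt files. The files look like this: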

```
'/content/drive/My Drive/Colab Notebooks/10/BKR/1.txt'
'/content/drive/My Drive/Colab Notebooks/10/BKR/2.txt'
```

That is, the folder 10 contains multiple subfolders (such as BKR), and each subfolder contains multiple 10-K filings saved as txt files.

I tried to extract the 1A. Risk Factors section with the code below, but it failed. I would appreciate any suggestions.

```
import re
import os, os.path

PATH = '/content/drive/My Drive/Colab Notebooks/10/BKR'

conclusions = []
for file in os.listdir(path):
    with open(os.path.join(PATH, file)) as f:
        data = f.read()

    conclusion = re.search('1a: (.*?)([A-Z]{2,})', data).group(1)
    conclusions.append(conclusion)
```

The error message I get is:

```
---------------------------------------------------------------------------
NotADirectoryError                        Traceback (most recent call last)
<ipython-input-12-051ca10fbeb3> in <module>()
      5 
      6 conclusions = []
----> 7 for file in os.listdir(path):
      8     with open(os.path.join(PATH, file)) as f:
      9         data = f.read()

NotADirectoryError: [Errno 20] Not a directory: '/content/drive/My Drive/Colab Notebooks/10/APA/1.txt'
```
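The traceback suggests that `os.listdir` was handed a file (`.../10/APA/1.txt`) rather than a folder. The loop iterates over lowercase `path`, which is not the `PATH` defined above and was presumably set in an earlier notebook cell. Two further issues would likely surface once that is fixed: `.` does not match newlines unless `re.DOTALL` is passed, and `re.search` returns `None` when nothing matches, so `.group(1)` would raise `AttributeError`. Below is a minimal sketch of one way to handle all three. The helper names (`extract_item_1a`, `iter_txt_files`, `collect_risk_factors`) and the regex are assumptions for illustration, not part of the original post, and the pattern assumes the section starts with an "Item 1A. Risk Factors" heading and ends at "Item 1B" or "Item 2", which not every filing follows.

```python
import re
from pathlib import Path

# Assumed heading pattern: "Item 1A. Risk Factors" up to the next "Item 1B" or "Item 2".
# Real filings vary, so treat this as a heuristic rather than a general parser.
ITEM_1A = re.compile(
    r"item\s*1a\.?\s*risk\s+factors(.*?)(?=item\s*1b\b|item\s*2\b)",
    re.IGNORECASE | re.DOTALL,  # DOTALL lets "." span line breaks
)


def extract_item_1a(text):
    """Return the longest Item 1A candidate in text, or None if none is found."""
    # The table of contents usually matches the heading too, so keep the longest hit.
    candidates = [m.group(1).strip() for m in ITEM_1A.finditer(text)]
    return max(candidates, key=len) if candidates else None


def iter_txt_files(root):
    """Yield every .txt file under root, at any depth, in sorted order."""
    root = Path(root)
    if not root.is_dir():
        # Fail early with a clear message if a file path was passed by mistake.
        raise NotADirectoryError(f"expected a folder, got: {root}")
    yield from sorted(p for p in root.rglob("*.txt") if p.is_file())


def collect_risk_factors(root):
    """Map each file path to its extracted section, skipping files with no match."""
    sections = {}
    for path in iter_txt_files(root):
        text = path.read_text(encoding="utf-8", errors="ignore")
        section = extract_item_1a(text)
        if section is not None:
            sections[str(path)] = section
    return sections
```

Under these assumptions, calling `collect_risk_factors('/content/drive/My Drive/Colab Notebooks/10')` would cover every subfolder (BKR, APA, and so on) in one pass, and files where the heading is not found are skipped instead of raising an exception.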

