“Rasterio”在文件系统中不存在,并且不被识别为受支持的数据集名称

2024-06-14 12:54:33 发布

您现在位置:Python中文网/ 问答频道 /正文

遵循本教程:https://www.usgs.gov/media/files/landsat-cloud-direct-access-requester-pays-tutorial

import boto3
import rasterio as rio
from matplotlib.pyplot import imshow
from rasterio.session import AWSSession

s3 = boto3.client('s3', aws_access_key_id=AWS_KEY_ID,
                  aws_secret_access_key=AWS_SECRET)

resources = boto3.resource('s3', aws_access_key_id=AWS_KEY_ID,
                           aws_secret_access_key=AWS_SECRET)

aws_session = AWSSession(boto3.Session())

cog = 's3://usgs-landsat/collection02/level-2/standard/oli-tirs/2020/026/027/LC08_L2SP_026027_20200827_20200906_02_T1/LC08_L2SP_026027_20200827_20200906_02_T1_SR_B2.TIF'

with rio.Env(aws_session):
    with rio.open(cog) as src:
        profile = src.profile
        arr = src.read(1)
imshow(arr)

我得到以下错误:

rasterio.errors.RasterioIOError: '/vsis3/usgs-landsat/collection02/level-2/standard/oli-tirs/2020/026/027/LC08_L2SP_026027_20200827_20200906_02_T1/LC08_L2SP_026027_20200827_20200906_02_T1_SR_B2.TIF' does not exist in the file system, and is not recognized as a supported dataset name.
在AWS CloudShell中,如果我运行: ``` aws s3 ls s3://usgs陆地卫星/collection02/level-2/standard/oli tirs/2020/026/027/LC08_L2SP_026027_20200827_20200906_02_T1/ ```

我得到:

An error occurred (AccessDenied) when calling the ListObjectsV2 operation: Access Denied

我在EC2实例中运行了cloudshell命令,出现了相同的错误

我需要在文档中指定我是请求者,我有权这样做:

aws s3 ls s3://usgs-landsat/collection02/level-2/standard/oli-tirs/2020/026/027/LC08_L2SP_026027_20200827_20200906_02_T1/ --request-payer requ
ester

使用boto3仍然不起作用

我对与我一起运行boto3的用户具有管理员权限。在CloudShell中获得与boto用户和root用户相同的错误。我以前使用过访问密钥和密钥,它可以从“陆地卫星pds”存储桶(只有L8个图像)和“sentinel-s2-l1c”存储桶下载。似乎只有“美国地质勘探局陆地卫星”桶(https://registry.opendata.aws/usgs-landsat/)有问题

还尝试使用s3.list_对象访问usgs陆地卫星存储桶:

landsat = resources.Bucket("usgs-landsat")
all_objects = s3.list_objects(Bucket = 'usgs-landsat')

得到一个类似的错误:

botocore.exceptions.ClientError: An error occurred (AccessDenied) when calling the ListObjects operation: Access Denied

在查看其他解决方案后,一些用户发现:

os.environ["AWS_REQUEST_PAYER"] = "requester"
os.environ["CURL_CA_BUNDLE"] = "/etc/ssl/certs/ca-certificates.crt"

为了解决他们的问题,这对我不起作用


Tags: keyimportawss3accessboto3levelstandard
2条回答

正如您正确指出的,usgs-landsatS3 bucket是请求者付费的,因此您需要正确配置rasterio以处理该问题

如您所见hererasterio.session.AWSSession有一个requester_pays参数,您可以将其设置为True以便执行此操作

我还可以指出:

s3 = boto3.client('s3', aws_access_key_id=AWS_KEY_ID,
                  aws_secret_access_key=AWS_SECRET)

resources = boto3.resource('s3', aws_access_key_id=AWS_KEY_ID,
                           aws_secret_access_key=AWS_SECRET)

在您的代码段中不需要使用s3resources变量,因为以后不会重用这些变量

事实上,如果您的凭证正确地位于~/.aws/文件夹中——这可以通过运行^{}python包(请参见documentation)提供的命令行实用程序aws configure来完成——您根本不需要导入boto3rasterio为您完成

因此,您的代码段可以修改为:

import rasterio as rio
from matplotlib.pyplot import imshow
from rasterio.session import AWSSession

aws_session = AWSSession(requester_pays=True)

cog = 's3://usgs-landsat/collection02/level-2/standard/oli-tirs/2020/026/027/LC08_L2SP_026027_20200827_20200906_02_T1/LC08_L2SP_026027_20200827_20200906_02_T1_SR_B2.TIF'


with rio.Env(aws_session):
    with rio.open(cog) as src:
        profile = src.profile
        arr = src.read(1)
imshow(arr)

在我的机器上运行正常

这对我有用

s3sr = boto3.resource('s3')
bucket='usgs-landsat'
prefix = 'collection02/'
keys_list = []
paginator = s3sr.meta.client.get_paginator('list_objects_v2')
for page in  paginator.paginate(Bucket=bucket, Prefix=prefix, Delimiter='/', RequestPayer='requester'):
    keys = [content['Key'] for content in page.get('Contents')]
    keys_list.extend(keys)
len(keys_list)

# keys_list 
['collection02/catalog.json',
 'collection02/landsat-c2l1.json',
 'collection02/landsat-c2l2-sr.json',
 'collection02/landsat-c2l2-st.json',
 'collection02/landsat-c2l2alb-bt.json',
 'collection02/landsat-c2l2alb-sr.json',
 'collection02/landsat-c2l2alb-st.json',
 'collection02/landsat-c2l2alb-ta.json']

# getting the catalog.json
response = boto3.client('s3').get_object(Bucket=bucket, Key='collection02/catalog.json', RequestPayer='requester')
jsondata = response['Body'].read().decode()

相关问题 更多 >