A convenient wrapper for connecting to AWS S3 and Redshift



Nordata

Author:

Nick Buker

Introduction:

Nordata is a small collection of utility functions for accessing AWS S3 and AWS Redshift. It was written by a data scientist on the Nordstrom Analytics Team. The goal of Nordata is to be a simple, robust package that eases data workflows. It is not intended to handle every possible need (for example, credential management is largely left to the user), but it aims to streamline common tasks.

Table of contents:

Installing Nordata

Setting up credentials for Nordata

How to use Nordata

Testing

Installing Nordata:

Nordata can be installed via pip. As always, use of a project-level virtual environment is recommended.

Nordata requires Python >= 3.6.

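$ pip install nordata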

Setting up credentials for Nordata:

Redshift:

Nordata is designed to ingest your Redshift credentials as an environment variable in the format shown below. This method gives the user the freedom to handle credentials in a number of ways. As always, best practices are advised: your credentials should never be placed in the code of your project, such as in a script or notebook. Instead, you may wish to place them in your .bash_profile locally or take advantage of a key management service such as the one offered by AWS.

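A sketch of one way to set this variable in your shell profile; the exact key names Nordata expects are an assumption here and should be checked against the package documentation:

# key names (host, database, user, password, port) are assumed for illustration
export REDSHIFT_CREDS='host=my_host database=my_database user=my_user password=my_password port=5439'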

S3:

If the user is running locally, their home directory should contain a .aws directory with a credentials file. The credentials file should look similar to the example below, where the profile name is in brackets. Note that the specific values and region may vary. If the user is running on an EC2 instance, permission to access S3 is handled by the IAM role for the instance.

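A typical ~/.aws/credentials file (the key values below are placeholders):

[default]
aws_access_key_id = XXXXXXXXXXXXXXXXXXXX
aws_secret_access_key = XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX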

Note the profile name in brackets. If the profile name differs in your credentials file, you will likely need to pass this profile name to the S3 functions as an argument.

How to use Nordata:

Redshift:

Importing nordata Redshift functions:

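A sketch; redshift_execute_sql appears in the transfer examples later in this document, while read_sql and redshift_get_conn are assumed from the section headings below:

from nordata import read_sql, redshift_execute_sql, redshift_get_conn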

Reading a SQL script into Python as a string:

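A sketch, assuming read_sql takes the script path as a sql_filename argument:

# the path and argument name are illustrative
sql = read_sql(sql_filename='../sql/my_script.sql')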

Executing a SQL query that does not return data:

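This call matches the usage shown in the transfer examples later in this document:

redshift_execute_sql(
    sql=sql,                   # SQL string, e.g. from read_sql()
    env_var='REDSHIFT_CREDS',  # environment variable holding credentials
    return_data=False,
    return_dict=False)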

Executing a SQL query that returns data as a list of tuples and column names as a list of strings:

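A sketch, assuming the function returns the rows and column names as a tuple when return_data=True:

data, columns = redshift_execute_sql(
    sql=sql,
    env_var='REDSHIFT_CREDS',
    return_data=True,
    return_dict=False)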

Executing a SQL query that returns data as a dict for easy ingestion into a pandas DataFrame:

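A sketch; with return_dict=True the result is assumed to unpack straight into the pandas DataFrame constructor:

import pandas as pd

# the returned dict is assumed to carry 'data' and 'columns' keys
df = pd.DataFrame(**redshift_execute_sql(
    sql=sql,
    env_var='REDSHIFT_CREDS',
    return_data=True,
    return_dict=True))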

Creating a connection object that can be manipulated directly by experienced users:

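A sketch, assuming redshift_get_conn reads the same credentials environment variable:

conn = redshift_get_conn(env_var='REDSHIFT_CREDS')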

S3:

Importing S3 functions:

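A sketch listing the functions used in the S3 sections below (names assumed from the section headings):

from nordata import s3_download, s3_upload, s3_delete, s3_get_bucket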

Downloading a single file from S3:

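A sketch; the bucket, s3_filepath, and local_filepath argument names are assumptions:

s3_download(
    bucket='my_bucket',             # bucket name is illustrative
    s3_filepath='tmp/my_file.csv',  # path within the bucket
    local_filepath='../data/my_file.csv')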

Downloading with a profile name:

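As above, with the profile name from the credentials file passed explicitly (a sketch):

s3_download(
    bucket='my_bucket',
    profile_name='my-profile-name',  # profile name from ~/.aws/credentials
    s3_filepath='tmp/my_file.csv',
    local_filepath='../data/my_file.csv')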

Downloading a list of files from S3 (will not download contents of subdirectories):

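A sketch, assuming both path arguments accept lists of equal length:

s3_download(
    bucket='my_bucket',
    s3_filepath=['tmp/my_file1.csv', 'tmp/my_file2.csv'],
    local_filepath=['../data/my_file1.csv', '../data/my_file2.csv'])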

Downloading files matching a pattern from S3 (will not download contents of subdirectories):

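A sketch, assuming glob-style wildcard matching:

s3_download(
    bucket='my_bucket',
    s3_filepath='tmp/*.csv',    # wildcard pattern is illustrative
    local_filepath='../data/')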

Downloading all files in a directory from S3 (will not download contents of subdirectories):

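A sketch; a bare wildcard is assumed to match every file directly under the directory:

s3_download(
    bucket='my_bucket',
    s3_filepath='tmp/*',
    local_filepath='../data/')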

Uploading a single file to S3:

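A sketch mirroring the download call, with source and destination swapped:

s3_upload(
    bucket='my_bucket',
    local_filepath='../data/my_file.csv',
    s3_filepath='tmp/my_file.csv')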

Uploading with a profile name:

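As above, with an explicit profile name (a sketch):

s3_upload(
    bucket='my_bucket',
    profile_name='my-profile-name',
    local_filepath='../data/my_file.csv',
    s3_filepath='tmp/my_file.csv')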

Uploading a list of files to S3 (will not upload contents of subdirectories):

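A sketch, assuming both path arguments accept lists of equal length:

s3_upload(
    bucket='my_bucket',
    local_filepath=['../data/my_file1.csv', '../data/my_file2.csv'],
    s3_filepath=['tmp/my_file1.csv', 'tmp/my_file2.csv'])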

Uploading files matching a pattern to S3 (will not upload contents of subdirectories):

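A sketch, assuming glob-style wildcard matching of local files:

s3_upload(
    bucket='my_bucket',
    local_filepath='../data/*.csv',
    s3_filepath='tmp/')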

Uploading all files in a directory to S3 (will not upload contents of subdirectories):

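A sketch; a bare wildcard is assumed to match every file directly under the local directory:

s3_upload(
    bucket='my_bucket',
    local_filepath='../data/*',
    s3_filepath='tmp/')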

Deleting a single file in S3:

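A sketch, assuming s3_delete returns the service response:

resp = s3_delete(bucket='my_bucket', s3_filepath='tmp/my_file.csv')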

Deleting with a profile name:

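As above, with an explicit profile name (a sketch):

resp = s3_delete(
    bucket='my_bucket',
    profile_name='my-profile-name',
    s3_filepath='tmp/my_file.csv')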

Deleting a list of files in S3:

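A sketch, assuming s3_filepath accepts a list:

resp = s3_delete(
    bucket='my_bucket',
    s3_filepath=['tmp/my_file1.csv', 'tmp/my_file2.csv'])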

Deleting files matching a pattern in S3:

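A sketch with an assumed glob-style pattern:

resp = s3_delete(bucket='my_bucket', s3_filepath='tmp/*.csv')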

Deleting all files in a directory in S3:

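A sketch; the bare wildcard is assumed to cover every file directly under the directory:

resp = s3_delete(bucket='my_bucket', s3_filepath='tmp/*')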

Creating a bucket object that can be manipulated directly by experienced users:

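A sketch, assuming s3_get_bucket returns a boto3 Bucket object; the profile and region values are illustrative:

bucket = s3_get_bucket(
    bucket='my_bucket',
    profile_name='default',
    region_name='us-west-2')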

Boto3:

Importing boto3 functions:

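A sketch; boto_get_creds appears in the transfer examples below, while boto_create_session is assumed from the session section that follows:

from nordata import boto_get_creds, boto_create_session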

Retrieving Boto3 credentials as a string for use in UNLOAD and COPY SQL statements:

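This call matches the usage in the transfer examples below:

creds = boto_get_creds(
    profile_name='default',
    region_name='us-west-2',
    session=None)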

Creating a boto3 session object that can be manipulated directly by experienced users:

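A sketch, assuming boto_create_session takes the same profile and region arguments:

session = boto_create_session(
    profile_name='default',
    region_name='us-west-2')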

Transferring data between Redshift and S3:

Transferring data from Redshift to S3 using an UNLOAD statement (see the Redshift UNLOAD documentation for more information):

from nordata import boto_get_creds, redshift_execute_sql

creds = boto_get_creds(
    profile_name='default',
    region_name='us-west-2',
    session=None)

sql = f'''
    unload (
        'select
            col1
            ,col2
        from
            my_schema.my_table'
    )
    to
        's3://mybucket/unload/my_table/'
    credentials
        '{creds}'
    parallel off header gzip allowoverwrite;
'''

redshift_execute_sql(
    sql=sql,
    env_var='REDSHIFT_CREDS',
    return_data=False,
    return_dict=False)

Transferring data from S3 to Redshift using a COPY statement (see the Redshift COPY documentation for more information):

from nordata import boto_get_creds, redshift_execute_sql

creds = boto_get_creds(
    profile_name='default',
    region_name='us-west-2',
    session=None)

sql = f'''
    copy
        my_schema.my_table
    from
        's3://mybucket/unload/my_table/'
    credentials
        '{creds}'
    ignoreheader 1 gzip;
'''

redshift_execute_sql(
    sql=sql,
    env_var='REDSHIFT_CREDS',
    return_data=False,
    return_dict=False)

Testing:

For those interested in contributing to Nordata or forking and editing the project, pytest is the testing framework used. To run the tests, create a virtual environment, install the contents of dev-requirements.txt, and run the following command from the root directory of the project. The test scripts can be found in the test/ directory.

$ pytest
