Redshift UDF中的hurry2包导入失败

2024-05-05 11:18:04 发布

您现在位置:Python中文网/ 问答频道 /正文

我正在尝试导入杂音2包作为一个库在红移数据库。我做了以下步骤

  1. 运行模块打包机
    $ ./installPipModuleAsRedshiftLibrary.sh -m murmur2 -s s3://path/to/murmur2/lib

  2. 在redshift上创建库
    CREATE OR REPLACE LIBRARY murmur2 LANGUAGE plpythonu from 's3://path/to/murmur2/lib/murmur2.zip' WITH CREDENTIALS AS 'aws_access_key_id=AAAAAAAAAAAAAAAAAAAA;aws_secret_access_key=SSSSSSSSSSSSSSSSS' region 'us-east-1';

  3. 创建函数和查询

    create OR REPLACE function f_py_kafka_partitioner (s varchar, ps int)
    returns int stable as $$ import murmur2
    m2 = murmur2.murmur64a(s, len(s), 0x9747b28c)
    return m2 % ps
    $$ language plpythonu;

SELECT f_py_kafka_partitioner('jiimit', 100);

这会产生以下错误:

[Amazon](500310) Invalid operation: ImportError: No module named murmur2. Please look at svl_udf_log for more information
Details: 
 -----------------------------------------------
  error:  ImportError: No module named murmur2. Please look at svl_udf_log for more information
  code:      10000
  context:   UDF
  query:     0
  location:  udf_client.cpp:366
  process:   padbmaster [pid=31381]
  -----------------------------------------------;

下面是svl_udf_log的内容

^{pr2}$

文件夹结构如下所示
Murmur2 Directory tree


Tags: ortopathkeypyawslogs3