擅长:python、mysql、java
<p>从博图做这件事的一些参考</p>
<pre><code>import boto.emr.connection as botocon
import boto.emr.step as step
con = botocon.EmrConnection(aws_access_key_id='', aws_secret_access_key='')
step = step.JarStep(name='Find similar items', jar='s3://mahout-core-0.6-job.jar', main_class='org.apache.mahout.cf.taste.hadoop.similarity.item.ItemSimilarityJob', action_on_failure='CANCEL_AND_WAIT', step_args=[' input', 's3://', ' output', 's3://', ' similarityClassname', 'SIMILARITY_PEARSON_CORRELATION'])
con.add_jobflow_steps('jflow', [step])
</code></pre>
<p>显然你需要上传mahout-core-0.6-作业.jar到可进入的s3位置。输入和输出必须是可访问的。在</p>