使用k8s和多个容器处理作业

def read_path(): files = [] suffixes = ('_SUCCESS', 'referen/') files_path = fs.listdir(f"s3://{path}") if files_path is not None: for num, file in enumerate(files_path, start=1): if not file['Key'].endswith(suffixes): files.append(file['Key']) return files def from_containers(container, path): containers = [] for num, file in enumerate(read_path(container, path), start=1): containers.append(client.V1Container(name=f'hcm1-{num}', image='image-python', command=['python3', 'model.py'], env=[client.V1EnvVar(name='model', value='hcm1'), client.V1EnvVar(name='referen', value=f"s3://{file}")])) template = client.V1PodTemplateSpec(metadata=client.V1ObjectMeta(name="hcm1"), spec=client.V1PodSpec(restart_policy="OnFailure", containers=from_containers(containers, path), image_pull_secrets=[client.V1LocalObjectReference(name="secret")])) spec = client.V1JobSpec(template=template, backoff_limit=20) job = client.V1Job(api_version="batch/v1", kind="Job", metadata=client.V1ObjectMeta(name="hcm1"), spec=spec) api_response = batch_v1.create_namespaced_job(body=job, namespace="default") print("Job created. status='%s'" % str(api_response.status))

1条回答

网友

1楼 · 发布于 2024-10-01 07:27:47

你的问题有点不对劲。根据您的情况和方法，您可以找到许多解决方案。但是David Maze提到得很好：

If you have this scripting setup already, why not 500 Jobs? (Generally you want to avoid multiple containers per pod.)

是的，如果您正在编写脚本，这可能是解决问题的一个非常好的方法。它肯定不会那么复杂，因为为1个pod创建1Job。每个吊舱将有一个集装箱

理论上，您还可以定义自己的Custom Resource并编写一个类似于Job controller的控制器，它支持多个pod模板

另见this question.

相关问题更多 >

编程相关推荐

热门问题

热门文章