运行tf.distribute MultiWorkerMirroredStrategy时出现“地址已在使用”错误。如何使用端口终止进程

2024-03-28 19:28:15 发布

您现在位置:Python中文网/ 问答频道 /正文

最近,我无法运行分布式。捕获的异常只是“无法启动gRPC服务器”

其他相关产出:

E0315 13:10:30.924933027    1721 server_chttp2.cc:40]        {"created":"@1615839030.924848216","description":"No address added out of total 1 resolved","file":"external/com_github_grpc_grpc/src/core/ext/transport/chttp2/server/chttp2_server.cc","file_line":395,"referenced_errors":[{"created":"@1615839030.924842411","description":"Failed to add any wildcard listeners","file":"external/com_github_grpc_grpc/src/core/lib/iomgr/tcp_server_posix.cc","file_line":342,"referenced_errors":[{"created":"@1615839030.924819058","description":"Unable to configure socket","fd":7,"file":"external/com_github_grpc_grpc/src/core/lib/iomgr/tcp_server_utils_posix_common.cc","file_line":216,"referenced_errors":[{"created":"@1615839030.924814323","description":"Address already in use","errno":98,"file":"external/com_github_grpc_grpc/src/core/lib/iomgr/tcp_server_utils_posix_common.cc","file_line":189,"os_error":"Address already in use","syscall":"bind"}]},{"created":"@1615839030.924841231","description":"Unable to configure socket","fd":7,"file":"external/com_github_grpc_grpc/src/core/lib/iomgr/tcp_server_utils_posix_common.cc","file_line":216,"referenced_errors":[{"created":"@1615839030.924837099","description":"Address already in use","errno":98,"file":"external/com_github_grpc_grpc/src/core/lib/iomgr/tcp_server_utils_posix_common.cc","file_line":189,"os_error":"Address already in use","syscall":"bind"}]}]}]}
2021-03-15 13:10:30.925019: E tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:533] Unknown: Could not start gRPC server
2021-03-15 13:10:30.925374: E tensorflow/c/c_api_experimental.cc:520] Could not start gRPC server

“修复”是更改gRPC端口。但是,我想知道是什么进程锁定地址/端口并将其删除

如何识别和终止导致问题的进程

  • 我没有管理员权限
  • 我在多台主机上运行
  • TensorFlow 2.4.1

Tags: coresrcgithubcomgrpcserverlibline