如何使用selenium 3和firefox创建lambda层?

2024-09-26 22:54:15 发布

您现在位置:Python中文网/ 问答频道 /正文

我有一个工作的docker容器,希望迁移到firefox上selenium 3.141的lambda层。工作容器的Dockerfile为:

FROM python:3.7-slim

RUN /usr/local/bin/python -m pip install --upgrade pip
RUN pip install selenium==3.141.0

RUN apt-get update
RUN apt-get -y install wget bzip2 firefox-esr #68.12.0esr
WORKDIR /tmp

RUN wget https://github.com/mozilla/geckodriver/releases/download/v0.26.0/geckodriver-v0.26.0-linux64.tar.gz

RUN tar -zxf geckodriver-v0.26.0-linux64.tar.gz -C /usr/local/bin/

当我在此容器中执行以下操作时,将创建一个Firefox对象

from selenium.webdriver import Firefox
from selenium.webdriver.firefox.options   import Options
opts = Options()
opts.headless = True
Firefox(options=opts, service_log_path='/home/geckodriver.log')

对于lambda层,我使用amazonlinux图像进行局部测试。在aws上,我提供的一个zip文件将被提取到/opt,所以我想把所有内容都放在/opt中,看看它是否有效

我的dockerfile是

FROM amazonlinux

WORKDIR /
RUN yum update -y

# Install Python 3.7
RUN yum install python3 zip unzip wget tar bzip2 bzip2-libs gzip -y

RUN pip3.7 install --upgrade pip

RUN pip3.7 install selenium==3.141.0 #-t /opt/python/lib/python3.7/site-packages
  
RUN wget https://github.com/mozilla/geckodriver/releases/download/v0.26.0/geckodriver-v0.26.0-linux64.tar.gz
RUN tar -zxf geckodriver-v0.26.0-linux64.tar.gz -C /opt

RUN wget https://ftp.mozilla.org/pub/firefox/releases/68.9.0esr/linux-x86_64/de/firefox-68.9.0esr.tar.bz2
RUN tar -xvjf firefox-68.9.0esr.tar.bz2 -C /opt

但是当我和你一起跑的时候

sudo docker build -t selenium:fftest -f Dockerfile.ff .
sudo docker run --rm -it -v $(pwd):/data selenium:fftest

执行

from selenium.webdriver.firefox.options   import Options
from selenium.webdriver.firefox.firefox_binary import FirefoxBinary
from selenium.webdriver import Firefox
opts = Options()
opts.headless = True
firefox_dir = '/opt/'
gecko_dir = '/opt/'
binary = FirefoxBinary( firefox_dir + 'firefox/firefox')
Firefox(options=opts, executable_path= gecko_dir + 'geckodriver', firefox_binary=binary)

我明白了

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/local/lib/python3.7/site-packages/selenium/webdriver/firefox/webdriver.py", line 174, in __init__
    keep_alive=True)
  File "/usr/local/lib/python3.7/site-packages/selenium/webdriver/remote/webdriver.py", line 157, in __init__
    self.start_session(capabilities, browser_profile)
  File "/usr/local/lib/python3.7/site-packages/selenium/webdriver/remote/webdriver.py", line 252, in start_session
    response = self.execute(Command.NEW_SESSION, parameters)
  File "/usr/local/lib/python3.7/site-packages/selenium/webdriver/remote/webdriver.py", line 321, in execute
    self.error_handler.check_response(response)
  File "/usr/local/lib/python3.7/site-packages/selenium/webdriver/remote/errorhandler.py", line 242, in check_response
    raise exception_class(message, screen, stacktrace)
selenium.common.exceptions.WebDriverException: Message: invalid argument: can't kill an exited process

我试过了

检查版本兼容性

这个答案https://stackoverflow.com/a/52535654/3014199表明版本不兼容,但根据table,我对selenium 3.141、Geckodriver 22和Firefox 68没有意见

以非root用户身份运行它

链接的答案进一步建议以非root用户身份运行。我终于设法做到了:

FROM amazonlinux

RUN yum update -y
RUN yum install python3 wget tar bzip2 bzip2-libs gzip -y
Run yum install shadow-utils -y
RUN useradd -ms /bin/bash jd
RUN echo "jd:password" | chpasswd
RUN usermod -aG root jd
USER jd
WORKDIR /home/jd

RUN pip3.7 install --user --upgrade pip
RUN pip3.7 install --user selenium==3.141.0
  
COPY downloads/geckodriver-v0.26.0-linux64.tar.gz /tmp/
COPY downloads/firefox-68.9.0esr.tar.bz2 /tmp/

USER root

RUN tar -zxf /tmp/geckodriver-v0.26.0-linux64.tar.gz -C /opt
RUN tar -xvjf /tmp/firefox-68.9.0esr.tar.bz2 -C /opt
RUN chown -R jd:jd /opt/firefox
USER jd

/opt中的所有内容都将jd作为所有者和组,但错误消息是相同的

将所有内容提取到/usr/local/bin

当我将geckodriver和firefox提取到/usr/local/bin中时,我不能仅用Firefox(options=opts)实例化,但得到相同的can't kill an exited process错误

检查二进制文件

$ /opt/firefox/firefox --version
XPCOMGlueLoad error for file /opt/firefox/libmozgtk.so:
libgtk-3.so.0: cannot open shared object file: No such file or directory
Couldn't load XPCOM.

我认为丢失的对象文件是根本原因。但是我不知道从哪里得到它,也不知道把它放在哪里(我只能把它放在/opt

yum install gtk3-devel之后libdbus-glib-1.so.2缺少文件/opt/firefox/libxul.so


Tags: installrunusrlocalseleniumtarwgetfirefox

热门问题