I used Beautiful Soup to extract the following script from an HTML page:
<script>
dataLayer =[{
"pageTitle": "PRODUCT: Macculloch Parka Print( 9512MP )",
"pageCategory": "shop-mens-parkas",
"visitorLoginState": "Guest",
"EmployeeLoginState": false,
"customerEmail": "null",
"customerOrders": "null",
"customerValue": "0",
"Country": "CA",
"State": "ON",
"ecommerce": {
"currencyCode": "CAD",
"detail": {
"actionField": {
"list": "Product Category / Search Results"
},
"products": [
{
"name": "Macculloch Parka Print",
"id": "9512MP",
"price": 1295,
"brand": "Canada Goose",
"category": "shop-mens-parkas"}]}}}];</script>
I want to extract the product-related information (name, id, price, and brand) into a DataFrame. Is there a way to do this without using regex?
You can use regex to pull the JSON out of the script text and then parse it. This is a provisional solution, pending more information about the data format.
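The regex-then-parse approach might look like the sketch below. It assumes the script text is already available as a string (for example from the tag's `.string` attribute in Beautiful Soup) and that the `dataLayer` value is valid JSON, as it is in the sample above. The variable names `script_text`, `wanted`, and `rows` are illustrative, and the sample payload is shortened to the relevant fields.

```python
import json
import re

# Script text as it might come back from Beautiful Soup; shortened sample.
script_text = '''
dataLayer =[{
"pageTitle": "PRODUCT: Macculloch Parka Print( 9512MP )",
"ecommerce": {
"currencyCode": "CAD",
"detail": {
"products": [
{
"name": "Macculloch Parka Print",
"id": "9512MP",
"price": 1295,
"brand": "Canada Goose",
"category": "shop-mens-parkas"}]}}}];
'''

# Capture the array literal between "dataLayer =" and the trailing ";".
# re.DOTALL lets "." span the line breaks inside the payload.
match = re.search(r"dataLayer\s*=\s*(\[.*\])\s*;", script_text, re.DOTALL)
if match is None:
    raise ValueError("dataLayer assignment not found in script text")

# The captured text is plain JSON, so the standard parser can read it.
data_layer = json.loads(match.group(1))
products = data_layer[0]["ecommerce"]["detail"]["products"]

# Keep only the requested fields, one dict per product.
wanted = ("name", "id", "price", "brand")
rows = [{key: product.get(key) for key in wanted} for product in products]
print(rows)
```

Passing `rows` to `pandas.DataFrame(rows)` should then give one row per product with the four columns. If the page ever emits a payload that is not strict JSON (single quotes, trailing commas, unquoted keys), `json.loads` will fail and a more tolerant parser would be needed.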