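<p>If this is the complete file, it is not valid JSON. The objects in curly braces must be separated by commas, and the whole file must start and end with square brackets, like this: <code>[{...},{...}]</code>. For your data, it would look like:</p>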
<pre><code>[{"review_id":"x7mDIiDB3jEiPGPHOmDzyw","user_id":"msQe1u7Z_XuqjGoqhB0J5g","business_id": ...},
{"review_id":"dDl8zu1vWPdKGihJrwQbpw","user_id":"msQe1u7Z_XuqjGoqhB0J5g","business_id": ...}]
</code></pre>
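<p>The file can be cleaned up programmatically by inserting the missing commas between the objects and wrapping everything in square brackets.</p>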
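<p>A minimal sketch of one way to do such a cleanup, assuming the file consists of JSON objects placed one after another with no commas between them (with or without line breaks). The function names <code>parse_concatenated_json</code> and <code>clean_json_file</code> and both path arguments are hypothetical, chosen only for illustration:</p>

```python
import json


def parse_concatenated_json(text):
    """Parse JSON objects that follow one another without separating commas."""
    decoder = json.JSONDecoder()
    records = []
    pos = 0
    while pos < len(text):
        # raw_decode does not skip leading whitespace, so do it by hand
        while pos < len(text) and text[pos].isspace():
            pos += 1
        if pos >= len(text):
            break
        # Decode one object and get the position right after it
        obj, pos = decoder.raw_decode(text, pos)
        records.append(obj)
    return records


def clean_json_file(src_path, dst_path):
    """Rewrite the file at src_path as a valid JSON array at dst_path.

    Both paths are placeholders supplied by the caller.
    """
    with open(src_path, encoding="utf-8") as src:
        records = parse_concatenated_json(src.read())
    with open(dst_path, "w", encoding="utf-8") as dst:
        # json.dump writes the list as [{...}, {...}]
        json.dump(records, dst)
    return len(records)
```

<p>Parsing each object and writing the list back out, rather than patching the text with string replacements, should avoid accidentally touching braces that occur inside string values. For very large files this sketch reads everything into memory, which may not be suitable.</p>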
<p>To read a JSON file properly, you can also consider using the pandas library (<a href="https://pandas.pydata.org/pandas-docs/stable/generated/pandas.read_json.html" rel="nofollow noreferrer">https://pandas.pydata.org/pandas-docs/stable/generated/pandas.read_json.html</a>).</p>
<pre><code>import pandas as pd
# Load the JSON file into a pandas DataFrame
df = pd.read_json("path/to/your/filename.json")
</code></pre>
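<p>If the file turns out to hold exactly one JSON object per line (the "JSON Lines" layout), <code>read_json</code> may be able to load it directly through its <code>lines=True</code> parameter, with no cleanup step. A small sketch under that assumption, using an in-memory string in place of a real file and only the two fields visible in the sample above:</p>

```python
import io

import pandas as pd

# Two records in JSON Lines form: one object per line, no commas, no brackets.
# Only the fields shown in full in the sample data are included here.
raw = (
    '{"review_id":"x7mDIiDB3jEiPGPHOmDzyw","user_id":"msQe1u7Z_XuqjGoqhB0J5g"}\n'
    '{"review_id":"dDl8zu1vWPdKGihJrwQbpw","user_id":"msQe1u7Z_XuqjGoqhB0J5g"}\n'
)

# lines=True tells pandas to parse each line as a separate JSON object
df = pd.read_json(io.StringIO(raw), lines=True)
print(df)
```

<p>With a real file, the <code>io.StringIO(raw)</code> argument would be replaced by the file path.</p>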
<p>If you are not familiar with pandas, here is a quick primer on working with the DataFrame object:</p>
<pre><code>df.head()        # returns the first rows of the DataFrame (5 by default)
df["review_id"]  # returns the column review_id as a Series
df.iloc[1, :]    # returns the complete row at position 1 (the second row)
df.iloc[1, 2]    # returns the item in the row at position 1 and the column at position 2
</code></pre>