<p>Use <a href="https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.sql.functions.translate.html?highlight=translate#pyspark.sql.functions.translate" rel="nofollow noreferrer"><code>translate</code></a>:</p>
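<pre><code>from pyspark.sql.functions import translate

# Assumes an active SparkSession named `spark` (as in the pyspark shell).
data_list = [(1, "'Name 1'"), (2, "'Name 2' and 'Something'")]
df = spark.createDataFrame(data=data_list, schema=["ID", "my_col"])
df.show()
# +---+--------------------+
# | ID|              my_col|
# +---+--------------------+
# |  1|            'Name 1'|
# |  2|'Name 2' and 'Som...|
# +---+--------------------+

# Map every single quote in my_col to a double quote.
df.withColumn('my_col', translate('my_col', "'", '"')).show()
# +---+--------------------+
# | ID|              my_col|
# +---+--------------------+
# |  1|            "Name 1"|
# |  2|"Name 2" and "Som...|
# +---+--------------------+
</code></pre>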
<p>This replaces every occurrence of the single-quote character in the <code>my_col</code> column with a double quote.</p>
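<p>To make the character-by-character mapping concrete, here is a minimal plain-Python sketch of the same idea. The helper <code>translate_like</code> is hypothetical and is not part of PySpark; it only mimics the documented behaviour that each character in <code>matching</code> is replaced by the character at the same position in <code>replace</code>, and it assumes that characters with no counterpart are dropped.</p>

```python
def translate_like(value: str, matching: str, replace: str) -> str:
    """Hypothetical plain-Python analogue of a character-wise translate."""
    # Pair each character in `matching` with the character at the same
    # index in `replace`; characters without a counterpart map to None,
    # which str.translate treats as "delete this character".
    table = {
        ord(src): (replace[i] if i < len(replace) else None)
        for i, src in enumerate(matching)
    }
    return value.translate(table)


# Same substitution as in the DataFrame example: ' becomes "
quoted = translate_like("'Name 2' and 'Something'", "'", '"')
print(quoted)
```

<p>Because the mapping works on single characters, this approach suits one-for-one swaps such as quotes; replacing multi-character substrings would need a different function.</p>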