擅长:python、mysql、java
<p>你的udf方法适合我。此外,您可以将<code>transform</code>与<code>substring</code>一起使用:</p>
<pre><code>import pyspark.sql.functions as f
df.withColumn('new_column', f.expr('transform(col1, x -> substring(x, 0, 5))')).show()
+ + +
| col1| new_column|
+ + +
|[hello-123, abcde...| [hello, abcde]|
|[hello-234, abcde...|[hello, abcde, xy...|
|[hiiii-111, abbbb...|[hiiii, abbbb, xy...|
+ + +
</code></pre>