PySpark – to_date()

PySpark provides the to_date() function to convert a timestamp to a date (DateType). This is typically achieved by truncating the time part of the Timestamp column. …

PySpark – to_timestamp()

Use the to_timestamp() function to convert a String to a Timestamp (TimestampType) in PySpark. The converted time is in the default format yyyy-MM-dd HH:mm:ss.SSS. I will explain how …

PySpark – concat_ws()

In this PySpark article, we will learn how to convert an array-of-String column on a DataFrame to a single String column (separated or concatenated with …

PySpark – split()

PySpark SQL provides the split() function to convert a delimiter-separated String to an Array (StringType to ArrayType) column on a DataFrame. This can be done by splitting a string column based …

PySpark – lit()

The PySpark SQL functions lit() and typedLit() add a new column to a DataFrame by assigning a literal or constant value. Both functions return a Column. …

PySpark – expr()

PySpark expr() is a SQL function that executes SQL-like expressions and lets you use an existing DataFrame column value as an expression argument to PySpark built-in functions. Most …
