package
35.0.0-20241101153557-77eb4bb6fff8
Repository: https://github.com/apache/spark-connect-go.git
Documentation: pkg.go.dev

# Functions

Abs - Computes the absolute value.
Acos - Computes inverse cosine of the input column.
Acosh - Computes inverse hyperbolic cosine of the input column.
AddMonths - Returns the date that is `months` months after `start`.
AesDecrypt - Returns a decrypted value of `input` using AES in `mode` with `padding`.
AesEncrypt - Returns an encrypted value of `input` using AES in given `mode` with the specified `padding`.
ApproxCountDistinct - Aggregate function: returns a new Column for the approximate distinct count of column `col`.
Array - Creates a new array column.
ArrayAgg - Aggregate function: returns a list of objects with duplicates.
ArrayCompact - Collection function: removes null values from the array.
ArrayDistinct - Collection function: removes duplicate values from the array.
ArrayExcept - Collection function: returns an array of the elements in col1 but not in col2, without duplicates.
ArrayIntersect - Collection function: returns an array of the elements in the intersection of col1 and col2, without duplicates.
ArrayJoin - Concatenates the elements of `column` using the `delimiter`.
ArrayMax - Collection function: returns the maximum value of the array.
ArrayMin - Collection function: returns the minimum value of the array.
ArrayRepeat - Collection function: creates an array containing a column repeated count times.
ArraySize - Returns the total number of elements in the array.
ArraysOverlap - Collection function: returns true if the arrays share at least one non-null element; if they do not, returns null when both arrays are non-empty and either contains a null element; returns false otherwise.
ArraysZip - Collection function: Returns a merged array of structs in which the N-th struct contains all N-th values of input arrays.
ArrayUnion - Collection function: returns an array of the elements in the union of col1 and col2, without duplicates.
Asc - Returns a sort expression based on the ascending order of the given column name.
Ascii - Computes the numeric value of the first character of the string column.
AscNullsFirst - Returns a sort expression based on the ascending order of the given column name, and null values return before non-null values.
AscNullsLast - Returns a sort expression based on the ascending order of the given column name, and null values appear after non-null values.
Asin - Computes inverse sine of the input column.
Asinh - Computes inverse hyperbolic sine of the input column.
Atan - Compute inverse tangent of the input column.
Atan2 - Computes the inverse tangent of `col1`/`col2`, using the signs of both arguments to determine the quadrant of the result.
Atanh - Computes inverse hyperbolic tangent of the input column.
Avg - Aggregate function: returns the average of the values in a group.
Base64 - Computes the BASE64 encoding of a binary column and returns it as a string column.
Bin - Returns the string representation of the binary value of the given column.
BitAnd - Aggregate function: returns the bitwise AND of all non-null input values, or null if none.
BitCount - Returns the number of bits that are set in the argument expr as an unsigned 64-bit integer, or NULL if the argument is NULL.
BitGet - Returns the value of the bit (0 or 1) at the specified position.
BitLength - Calculates the bit length for the specified string column.
BitmapBitPosition - Returns the bit position for the given input column.
BitmapBucketNumber - Returns the bucket number for the given input column.
BitmapConstructAgg - Returns a bitmap with the positions of the bits set from all the values from the input column.
BitmapCount - Returns the number of set bits in the input bitmap.
BitmapOrAgg - Returns a bitmap that is the bitwise OR of all of the bitmaps from the input column.
BitOr - Aggregate function: returns the bitwise OR of all non-null input values, or null if none.
BitwiseNot - Computes bitwise not.
BitwiseNOT - Computes bitwise not.
BitXor - Aggregate function: returns the bitwise XOR of all non-null input values, or null if none.
BoolAnd - Aggregate function: returns true if all values of `col` are true.
BoolOr - Aggregate function: returns true if at least one value of `col` is true.
Bround - Rounds the given value to `scale` decimal places using HALF_EVEN rounding mode if `scale` >= 0, or at the integral part when `scale` < 0.
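The HALF_EVEN ("banker's") rounding that Bround describes can be sketched in plain Go; `bround` here is a hypothetical helper illustrating the semantics, not the spark-connect-go API:

```go
package main

import (
	"fmt"
	"math"
)

// bround rounds x to `scale` decimal places using HALF_EVEN rounding.
// A negative scale rounds at the integral part, as described above.
func bround(x float64, scale int) float64 {
	p := math.Pow(10, float64(scale))
	return math.RoundToEven(x*p) / p
}

func main() {
	// Ties round to the even neighbor: 2.5 -> 2, 3.5 -> 4.
	fmt.Println(bround(2.5, 0), bround(3.5, 0))
}
```

Contrast with Round, which uses HALF_UP and would send both 2.5 and 3.5 away from zero.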
Btrim - Remove the leading and trailing `trim` characters from `str`.
CallFunction - Call a SQL function.
Cardinality - Collection function: returns the length of the array or map stored in the column.
Cbrt - Computes the cube-root of the given value.
Ceil - Computes the ceiling of the given value.
Ceiling - Computes the ceiling of the given value.
Char - Returns the ASCII character having the binary equivalent to `col`.
CharacterLength - Returns the character length of string data or number of bytes of binary data.
CharLength - Returns the character length of string data or number of bytes of binary data.
Coalesce - Returns the first column that is not null.
CollectList - Aggregate function: returns a list of objects with duplicates.
CollectSet - Aggregate function: returns a set of objects with duplicate elements eliminated.
Concat - Concatenates multiple input columns together into a single column.
ConcatWs - Concatenates multiple input string columns together into a single string column, using the given separator.
Contains - Returns a boolean: true if the second string argument is found inside the first.
Conv - Convert a number in a string column from one base to another.
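The base conversion Conv describes maps closely onto Go's `strconv`; a sketch of the semantics (Spark's version also handles unsigned 64-bit wraparound, this hypothetical helper uses signed int64):

```go
package main

import (
	"fmt"
	"strconv"
)

// conv converts the string representation of a number from one base to
// another, mirroring the Conv description above for the common case.
func conv(num string, fromBase, toBase int) (string, error) {
	v, err := strconv.ParseInt(num, fromBase, 64)
	if err != nil {
		return "", err
	}
	return strconv.FormatInt(v, toBase), nil
}

func main() {
	s, _ := conv("100", 2, 10)
	fmt.Println(s) // binary 100 is decimal 4
}
```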
Corr - Returns a new Column for the Pearson correlation coefficient of `col1` and `col2`.
Cos - Computes cosine of the input column.
Cosh - Computes hyperbolic cosine of the input column.
Cot - Computes cotangent of the input column.
Count - Aggregate function: returns the number of items in a group.
CountDistinct - Returns a new Column for the distinct count of `col` or `cols`.
CountIf - Returns the number of `TRUE` values for the `col`.
CountMinSketch - Returns a count-min sketch of a column with the given eps, confidence and seed.
CovarPop - Returns a new Column for the population covariance of `col1` and `col2`.
CovarSamp - Returns a new Column for the sample covariance of `col1` and `col2`.
Crc32 - Calculates the cyclic redundancy check value (CRC32) of a binary column and returns the value as a bigint.
CreateMap - Creates a new map column.
Csc - Computes cosecant of the input column.
CumeDist - Window function: returns the cumulative distribution of values within a window partition, i.e. the fraction of rows that are below the current row.
Curdate - Returns the current date at the start of query evaluation as a date column.
CurrentCatalog - Returns the current catalog.
CurrentDatabase - Returns the current database.
CurrentDate - Returns the current date at the start of query evaluation as a date column.
CurrentSchema - Returns the current database.
CurrentTimestamp - Returns the current timestamp at the start of query evaluation as a timestamp column.
CurrentTimezone - Returns the current session local timezone.
CurrentUser - Returns the current user.
Dateadd - Returns the date that is `days` days after `start`.
DateAdd - Returns the date that is `days` days after `start`.
Datediff - Returns the number of days from `start` to `end`.
DateDiff - Returns the number of days from `start` to `end`.
DateFormat - Converts a date/timestamp/string to a value of string in the format specified by the date format given by the second argument.
DateFromUnixDate - Create date from the number of `days` since 1970-01-01.
Datepart - Extracts a part (`field`) of the date/timestamp or interval `source`.
DatePart - Extracts a part (`field`) of the date/timestamp or interval `source`.
DateSub - Returns the date that is `days` days before `start`.
DateTrunc - Returns timestamp truncated to the unit specified by the format.
Day - Extract the day of the month of a given date/timestamp as integer.
Dayofmonth - Extract the day of the month of a given date/timestamp as integer.
Dayofweek - Extract the day of the week of a given date/timestamp as integer.
Dayofyear - Extract the day of the year of a given date/timestamp as integer.
Days - Partition transform function: A transform for timestamps and dates to partition data into days.
Decode - Computes the first argument into a string from a binary using the provided character set (one of 'US-ASCII', 'ISO-8859-1', 'UTF-8', 'UTF-16BE', 'UTF-16LE', 'UTF-16').
Degrees - Converts an angle measured in radians to an approximately equivalent angle measured in degrees.
DenseRank - Window function: returns the rank of rows within a window partition, without any gaps.
Desc - Returns a sort expression based on the descending order of the given column name.
DescNullsFirst - Returns a sort expression based on the descending order of the given column name, and null values appear before non-null values.
DescNullsLast - Returns a sort expression based on the descending order of the given column name, and null values appear after non-null values.
E - Returns Euler's number.
Elt - Returns the `n`-th input, e.g., returns `input2` when `n` is 2.
Encode - Computes the first argument into a binary from a string using the provided character set (one of 'US-ASCII', 'ISO-8859-1', 'UTF-8', 'UTF-16BE', 'UTF-16LE', 'UTF-16').
Endswith - Returns a boolean: true if the first string argument ends with the second.
EqualNull - Returns the same result as the EQUAL(=) operator for non-null operands, but returns true if both are null, false if one of them is null.
Every - Aggregate function: returns true if all values of `col` are true.
Exp - Computes the exponential of the given value.
Explode - Returns a new row for each element in the given array or map.
ExplodeOuter - Returns a new row for each element in the given array or map.
Expm1 - Computes the exponential of the given value minus one.
Extract - Extracts a part of the date/timestamp or interval source.
Factorial - Computes the factorial of the given value.
FindInSet - Returns the index (1-based) of the given string (`str`) in the comma-delimited list (`strArray`).
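The 1-based lookup FindInSet describes can be sketched in plain Go (a hypothetical helper, not the spark-connect-go API); like the SQL function, it returns 0 when the string is absent or itself contains a comma:

```go
package main

import (
	"fmt"
	"strings"
)

// findInSet returns the 1-based index of str within the comma-delimited
// strArray, or 0 if str is absent or contains a comma itself.
func findInSet(str, strArray string) int {
	if strings.Contains(str, ",") {
		return 0
	}
	for i, s := range strings.Split(strArray, ",") {
		if s == str {
			return i + 1
		}
	}
	return 0
}

func main() {
	fmt.Println(findInSet("ab", "abc,b,ab,c,def")) // 3
}
```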
Flatten - Collection function: creates a single array from an array of arrays.
Floor - Computes the floor of the given value.
FormatNumber - Formats the number X to a format like '#,###,###.##', rounded to d decimal places with HALF_EVEN round mode, and returns the result as a string.
FormatString - Formats the arguments in printf-style and returns the result as a string column.
FromUnixtime - Converts the number of seconds from unix epoch (1970-01-01 00:00:00 UTC) to a string representing the timestamp of that moment in the current system time zone in the given format.
FromUtcTimestamp - This is a common function for databases supporting TIMESTAMP WITHOUT TIMEZONE.
Get - Collection function: Returns element of array at given (0-based) index.
Getbit - Returns the value of the bit (0 or 1) at the specified position.
GetJsonObject - Extracts json object from a json string based on json `path` specified, and returns json string of the extracted json object.
Greatest - Returns the greatest value of the list of column names, skipping null values.
Grouping - Aggregate function: indicates whether a specified column in a GROUP BY list is aggregated or not, returns 1 for aggregated or 0 for not aggregated in the result set.
GroupingId - Aggregate function: returns the level of grouping, equal to (grouping(c1) << (n-1)) + (grouping(c2) << (n-2)) + ... + grouping(cn).
Hash - Calculates the hash code of given columns, and returns the result as an int column.
Hex - Computes the hex value of the given column, which could be of string, binary, integer or long type.
HistogramNumeric - Computes a histogram on numeric 'col' using nb bins.
HllSketchEstimate - Returns the estimated number of unique values given the binary representation of a Datasketches HllSketch.
Hour - Extract the hours of a given timestamp as integer.
Hours - Partition transform function: A transform for timestamps to partition data into hours.
Hypot - Computes `sqrt(a^2 + b^2)` without intermediate overflow or underflow.
Ifnull - Returns `col2` if `col1` is null, or `col1` otherwise.
Initcap - Translate the first letter of each word to upper case in the sentence.
Inline - Explodes an array of structs into a table.
InlineOuter - Explodes an array of structs into a table.
InputFileBlockLength - Returns the length of the block being read, or -1 if not available.
InputFileBlockStart - Returns the start offset of the block being read, or -1 if not available.
InputFileName - Creates a string column for the file name of the current Spark task.
Instr - Locate the position of the first occurrence of substr column in the given string.
Isnan - An expression that returns true if the column is NaN.
Isnotnull - Returns true if `col` is not null, or false otherwise.
Isnull - An expression that returns true if the column is null.
JavaMethod - Calls a method with reflection.
JsonArrayLength - Returns the number of elements in the outermost JSON array.
JsonObjectKeys - Returns all the keys of the outermost JSON object as an array.
JsonTuple - Creates a new row for a json column according to the given field names.
Kurtosis - Aggregate function: returns the kurtosis of the values in a group.
LastDay - Returns the last day of the month which the given date belongs to.
Lcase - Returns `str` with all characters changed to lowercase.
Least - Returns the least value of the list of column names, skipping null values.
Left - Returns the leftmost `len` characters from the string `str` (`len` can be string type); if `len` is less than or equal to 0 the result is an empty string.
Length - Computes the character length of string data or number of bytes of binary data.
Levenshtein - Computes the Levenshtein distance of the two given strings.
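The edit distance that Levenshtein computes (minimum insertions, deletions, and substitutions) can be sketched with the classic two-row dynamic program in plain Go; this is an illustrative implementation, not the library's:

```go
package main

import "fmt"

// levenshtein computes the edit distance between two strings, working
// over runes so multi-byte characters count as single edits.
func levenshtein(a, b string) int {
	ra, rb := []rune(a), []rune(b)
	prev := make([]int, len(rb)+1)
	for j := range prev {
		prev[j] = j // distance from empty prefix of a
	}
	for i := 1; i <= len(ra); i++ {
		cur := make([]int, len(rb)+1)
		cur[0] = i
		for j := 1; j <= len(rb); j++ {
			cost := 1
			if ra[i-1] == rb[j-1] {
				cost = 0
			}
			// deletion, insertion, substitution (or match)
			cur[j] = min(cur[j-1]+1, prev[j]+1, prev[j-1]+cost)
		}
		prev = cur
	}
	return prev[len(rb)]
}

func main() {
	fmt.Println(levenshtein("kitten", "sitting")) // 3
}
```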
Ln - Returns the natural logarithm of the argument.
Localtimestamp - Returns the current timestamp without time zone at the start of query evaluation as a timestamp without time zone column.
Locate - Locate the position of the first occurrence of substr in a string column, after position pos.
Log - Returns the first argument-based logarithm of the second argument.
Log10 - Computes the logarithm of the given value in Base 10.
Log1p - Computes the natural logarithm of the "given value plus one".
Log2 - Returns the base-2 logarithm of the argument.
Lower - Converts a string expression to lower case.
Lpad - Left-pad the string column to width `len` with `pad`.
Ltrim - Trim the spaces from left end for the specified string value.
MakeDate - Returns a column with a date built from the year, month and day columns.
MakeDtInterval - Make DayTimeIntervalType duration from days, hours, mins and secs.
MakeInterval - Make interval from years, months, weeks, days, hours, mins and secs.
MakeTimestamp - Create timestamp from years, months, days, hours, mins, secs and timezone fields.
MakeTimestampLtz - Create the current timestamp with local time zone from years, months, days, hours, mins, secs and timezone fields.
MakeTimestampNtz - Create local date-time from years, months, days, hours, mins, secs fields.
MakeYmInterval - Make year-month interval from years, months.
MapConcat - Returns the union of all the given maps.
MapEntries - Collection function: Returns an unordered array of all entries in the given map.
MapFromArrays - Creates a new map from two arrays.
MapFromEntries - Collection function: Converts an array of entries (key value struct types) to a map of values.
MapKeys - Collection function: Returns an unordered array containing the keys of the map.
MapValues - Collection function: Returns an unordered array containing the values of the map.
Mask - Masks the given string value.
Max - Aggregate function: returns the maximum value of the expression in a group.
MaxBy - Returns the value associated with the maximum value of ord.
Md5 - Calculates the MD5 digest and returns the value as a 32 character hex string.
Mean - Aggregate function: returns the average of the values in a group.
Median - Returns the median of the values in a group.
Min - Aggregate function: returns the minimum value of the expression in a group.
MinBy - Returns the value associated with the minimum value of ord.
Minute - Extract the minutes of a given timestamp as integer.
Mode - Returns the most frequent value in a group.
MonotonicallyIncreasingId - A column that generates monotonically increasing 64-bit integers.
Month - Extract the month of a given date/timestamp as integer.
Months - Partition transform function: A transform for timestamps and dates to partition data into months.
NamedStruct - Creates a struct with the given field names and values.
Nanvl - Returns col1 if it is not NaN, or col2 if col1 is NaN.
Negate - Returns the negative value.
Negative - Returns the negative value.
NextDay - Returns the first date which is later than the value of the date column based on second `week day` argument.
Now - Returns the current timestamp at the start of query evaluation.
Ntile - Window function: returns the ntile group id (from 1 to `n` inclusive) in an ordered window partition.
Nullif - Returns null if `col1` equals to `col2`, or `col1` otherwise.
Nvl - Returns `col2` if `col1` is null, or `col1` otherwise.
Nvl2 - Returns `col2` if `col1` is not null, or `col3` otherwise.
OctetLength - Calculates the byte length for the specified string column.
Overlay - Overlay the specified portion of `src` with `replace`, starting from byte position `pos` of `src` and proceeding for `len` bytes.
ParseUrl - Extracts a part from a URL.
PercentRank - Window function: returns the relative rank (i.e. percentile) of rows within a window partition.
Pi - Returns Pi.
Pmod - Returns the positive value of dividend mod divisor.
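Pmod's result differs from Go's `%` operator, which keeps the sign of the dividend; the positive modulo can be sketched as:

```go
package main

import "fmt"

// pmod returns the non-negative remainder of a mod n, matching the
// Pmod semantics above rather than Go's sign-preserving % operator.
func pmod(a, n int64) int64 {
	return ((a % n) + n) % n
}

func main() {
	fmt.Println(-7%3, pmod(-7, 3)) // Go's % gives -1, pmod gives 2
}
```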
Posexplode - Returns a new row for each element with position in the given array or map.
PosexplodeOuter - Returns a new row for each element with position in the given array or map.
Position - Returns the position of the first occurrence of `substr` in `str` after position `start`.
Positive - Returns the value.
Pow - Returns the value of the first argument raised to the power of the second argument.
Printf - Formats the arguments in printf-style and returns the result as a string column.
Product - Aggregate function: returns the product of the values in a group.
Quarter - Extract the quarter of a given date/timestamp as integer.
Radians - Converts an angle measured in degrees to an approximately equivalent angle measured in radians.
Rand - Generates a random column with independent and identically distributed (i.i.d.) samples uniformly distributed in [0.0, 1.0).
Randn - Generates a column with independent and identically distributed (i.i.d.) samples from the standard normal distribution.
Rank - Window function: returns the rank of rows within a window partition.
Reflect - Calls a method with reflection.
Regexp - Returns true if `str` matches the Java regex `regexp`, or false otherwise.
RegexpCount - Returns a count of the number of times that the Java regex pattern `regexp` is matched in the string `str`.
RegexpExtract - Extract a specific group matched by the Java regex `regexp`, from the specified string column.
RegexpLike - Returns true if `str` matches the Java regex `regexp`, or false otherwise.
RegexpSubstr - Returns the substring that matches the Java regex `regexp` within the string `str`.
RegrAvgx - Aggregate function: returns the average of the independent variable for non-null pairs in a group, where `y` is the dependent variable and `x` is the independent variable.
RegrAvgy - Aggregate function: returns the average of the dependent variable for non-null pairs in a group, where `y` is the dependent variable and `x` is the independent variable.
RegrCount - Aggregate function: returns the number of non-null number pairs in a group, where `y` is the dependent variable and `x` is the independent variable.
RegrIntercept - Aggregate function: returns the intercept of the univariate linear regression line for non-null pairs in a group, where `y` is the dependent variable and `x` is the independent variable.
RegrR2 - Aggregate function: returns the coefficient of determination for non-null pairs in a group, where `y` is the dependent variable and `x` is the independent variable.
RegrSlope - Aggregate function: returns the slope of the linear regression line for non-null pairs in a group, where `y` is the dependent variable and `x` is the independent variable.
RegrSxx - Aggregate function: returns REGR_COUNT(y, x) * VAR_POP(x) for non-null pairs in a group, where `y` is the dependent variable and `x` is the independent variable.
RegrSxy - Aggregate function: returns REGR_COUNT(y, x) * COVAR_POP(y, x) for non-null pairs in a group, where `y` is the dependent variable and `x` is the independent variable.
RegrSyy - Aggregate function: returns REGR_COUNT(y, x) * VAR_POP(y) for non-null pairs in a group, where `y` is the dependent variable and `x` is the independent variable.
Repeat - Repeats a string column n times, and returns it as a new string column.
Replace - Replaces all occurrences of `search` with `replace`.
Reverse - Collection function: returns a reversed string or an array with reverse order of elements.
Right - Returns the rightmost `len` characters from the string `str` (`len` can be string type); if `len` is less than or equal to 0 the result is an empty string.
Rint - Returns the double value that is closest in value to the argument and is equal to a mathematical integer.
Rlike - Returns true if `str` matches the Java regex `regexp`, or false otherwise.
Round - Rounds the given value to `scale` decimal places using HALF_UP rounding mode if `scale` >= 0, or at the integral part when `scale` < 0.
RowNumber - Window function: returns a sequential number starting at 1 within a window partition.
Rpad - Right-pad the string column to width `len` with `pad`.
Rtrim - Trim the spaces from right end for the specified string value.
Sec - Computes secant of the input column.
Second - Extract the seconds of a given date as integer.
Sentences - Splits a string into arrays of sentences, where each sentence is an array of words.
Sequence - Generate a sequence of integers from `start` to `stop`, incrementing by `step`.
Sha - Returns a SHA-1 hash value of `col` as a hex string.
Sha1 - Returns the hex string result of SHA-1.
Sha2 - Returns the hex string result of SHA-2 family of hash functions (SHA-224, SHA-256, SHA-384, and SHA-512).
Shiftleft - Shift the given value numBits left.
ShiftLeft - Shift the given value numBits left.
Shiftright - (Signed) shift the given value numBits right.
ShiftRight - (Signed) shift the given value numBits right.
Shiftrightunsigned - Unsigned shift the given value numBits right.
ShiftRightUnsigned - Unsigned shift the given value numBits right.
Shuffle - Collection function: Generates a random permutation of the given array.
Sign - Computes the signum of the given value.
Signum - Computes the signum of the given value.
Sin - Computes sine of the input column.
Sinh - Computes hyperbolic sine of the input column.
Size - Collection function: returns the length of the array or map stored in the column.
Skewness - Aggregate function: returns the skewness of the values in a group.
Slice - Collection function: returns an array containing all the elements in `x` from index `start` (array indices start at 1, or from the end if `start` is negative) with the specified `length`.
Some - Aggregate function: returns true if at least one value of `col` is true.
Soundex - Returns the SoundEx encoding for a string.
SparkPartitionId - A column for partition ID.
Split - Splits str around matches of the given pattern.
SplitPart - Splits `str` by delimiter and return requested part of the split (1-based).
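SplitPart's 1-based part selection (with negative parts counting from the end, and an empty string for out-of-range parts) can be sketched in plain Go:

```go
package main

import (
	"fmt"
	"strings"
)

// splitPart returns the part-th (1-based) field of str split by delim.
// Negative part counts from the end; out-of-range parts yield "".
// Spark additionally raises an error for part == 0; here it yields "".
func splitPart(str, delim string, part int) string {
	parts := strings.Split(str, delim)
	if part < 0 {
		part = len(parts) + part + 1
	}
	if part < 1 || part > len(parts) {
		return ""
	}
	return parts[part-1]
}

func main() {
	fmt.Println(splitPart("11.12.13", ".", 3)) // 13
}
```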
Sqrt - Computes the square root of the specified float value.
Stack - Separates `col1`, ..., `colk` into `n` rows.
Startswith - Returns a boolean: true if the first string argument starts with the second.
Std - Aggregate function: alias for stddev_samp.
Stddev - Aggregate function: alias for stddev_samp.
StddevPop - Aggregate function: returns population standard deviation of the expression in a group.
StddevSamp - Aggregate function: returns the unbiased sample standard deviation of the expression in a group.
StrToMap - Creates a map after splitting the text into key/value pairs using delimiters.
Struct - Creates a new struct column.
Substr - Returns the substring of `str` that starts at `pos` and is of length `len`, or the slice of byte array that starts at `pos` and is of length `len`.
Substring - Substring starts at `pos` and is of length `len` when str is String type or returns the slice of byte array that starts at `pos` in byte and is of length `len` when str is Binary type.
SubstringIndex - Returns the substring from string str before count occurrences of the delimiter delim.
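The SubstringIndex behavior (keep everything before `count` occurrences of the delimiter; a negative count keeps everything after the |count|-th delimiter from the end) can be sketched in plain Go, with edge cases simplified relative to Spark's version:

```go
package main

import (
	"fmt"
	"strings"
)

// substringIndex returns the substring of str before count occurrences
// of delim; negative count keeps the part to the right of the
// |count|-th delimiter counted from the end.
func substringIndex(str, delim string, count int) string {
	if count == 0 || delim == "" {
		return ""
	}
	parts := strings.Split(str, delim)
	if count > 0 {
		if count >= len(parts) {
			return str
		}
		return strings.Join(parts[:count], delim)
	}
	n := -count
	if n >= len(parts) {
		return str
	}
	return strings.Join(parts[len(parts)-n:], delim)
}

func main() {
	fmt.Println(substringIndex("www.apache.org", ".", 2))  // www.apache
	fmt.Println(substringIndex("www.apache.org", ".", -2)) // apache.org
}
```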
Sum - Aggregate function: returns the sum of all values in the expression.
SumDistinct - Aggregate function: returns the sum of distinct values in the expression.
Tan - Computes tangent of the input column.
Tanh - Computes hyperbolic tangent of the input column.
TimestampMicros - Creates timestamp from the number of microseconds since UTC epoch.
TimestampMillis - Creates timestamp from the number of milliseconds since UTC epoch.
TimestampSeconds - Converts the number of seconds from the Unix epoch (1970-01-01T00:00:00Z) to a timestamp.
ToBinary - Converts the input `col` to a binary value based on the supplied `format`.
ToChar - Convert `col` to a string based on the `format`.
ToDate - Converts a Column into a date column using the optionally specified format.
ToDegrees - Converts an angle measured in radians to an approximately equivalent angle measured in degrees.
ToNumber - Convert string 'col' to a number based on the string format 'format'.
ToRadians - Converts an angle measured in degrees to an approximately equivalent angle measured in radians.
ToTimestamp - Converts a Column into a timestamp column using the optionally specified format.
ToTimestampLtz - Parses the `timestamp` with the `format` to a timestamp with local time zone.
ToTimestampNtz - Parses the `timestamp` with the `format` to a timestamp without time zone.
ToUnixTimestamp - Returns the UNIX timestamp of the given time.
ToUtcTimestamp - This is a common function for databases supporting TIMESTAMP WITHOUT TIMEZONE.
ToVarchar - Convert `col` to a string based on the `format`.
Translate - Translates each character in `srcCol` that appears in `matching` to the corresponding character in the replacement string.
Trim - Trim the spaces from both ends for the specified string column.
Trunc - Returns date truncated to the unit specified by the format.
TryAdd - Returns the sum of `left` and `right`; the result is null on overflow.
TryAesDecrypt - This is a special version of `aes_decrypt` that performs the same operation, but returns a NULL value instead of raising an error if the decryption cannot be performed.
TryAvg - Returns the mean calculated from values of a group and the result is null on overflow.
TryDivide - Returns `dividend`/`divisor`; the result is null if `divisor` is 0.
TryElementAt - (array, index) - Returns element of array at given (1-based) index.
TryMultiply - Returns `left`*`right` and the result is null on overflow.
TrySubtract - Returns `left`-`right` and the result is null on overflow.
TrySum - Returns the sum calculated from values of a group and the result is null on overflow.
TryToBinary - This is a special version of `to_binary` that performs the same operation, but returns a NULL value instead of raising an error if the conversion cannot be performed.
TryToNumber - Convert string 'col' to a number based on the string format `format`.
TryToTimestamp - Parses the `col` with the `format` to a timestamp.
Typeof - Returns the DDL-formatted type string for the data type of the input.
Ucase - Returns `str` with all characters changed to uppercase.
Unbase64 - Decodes a BASE64 encoded string column and returns it as a binary column.
Unhex - Inverse of hex.
UnixDate - Returns the number of days since 1970-01-01.
UnixMicros - Returns the number of microseconds since 1970-01-01 00:00:00 UTC.
UnixMillis - Returns the number of milliseconds since 1970-01-01 00:00:00 UTC.
UnixSeconds - Returns the number of seconds since 1970-01-01 00:00:00 UTC.
UnixTimestamp - Converts a time string with the given pattern ('yyyy-MM-dd HH:mm:ss' by default) to a Unix timestamp in seconds, using the default timezone and the default locale; returns null on failure.
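The conversion UnixTimestamp describes can be sketched in plain Go; note that Go expresses the pattern via its reference time rather than `yyyy`/`MM`/`dd` letters, and that Spark uses the session time zone where this sketch assumes UTC:

```go
package main

import (
	"fmt"
	"time"
)

// unixTimestamp parses a time string in Spark's default pattern
// 'yyyy-MM-dd HH:mm:ss' (Go layout "2006-01-02 15:04:05") and
// returns epoch seconds, assuming UTC.
func unixTimestamp(s string) (int64, error) {
	t, err := time.Parse("2006-01-02 15:04:05", s)
	if err != nil {
		return 0, err
	}
	return t.Unix(), nil
}

func main() {
	v, _ := unixTimestamp("2015-04-08 13:08:15")
	fmt.Println(v)
}
```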
Upper - Converts a string expression to upper case.
UrlDecode - Decodes a `str` in 'application/x-www-form-urlencoded' format using a specific encoding scheme.
UrlEncode - Translates a string into 'application/x-www-form-urlencoded' format using a specific encoding scheme.
User - Returns the current user.
Variance - Aggregate function: alias for var_samp.
VarPop - Aggregate function: returns the population variance of the values in a group.
VarSamp - Aggregate function: returns the unbiased sample variance of the values in a group.
Version - Returns the Spark version.
Weekday - Returns the day of the week for date/timestamp (0 = Monday, 1 = Tuesday, ..., 6 = Sunday).
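Weekday's numbering (0 = Monday) differs from Go's `time.Weekday`, which starts at Sunday == 0; the shift can be sketched as:

```go
package main

import (
	"fmt"
	"time"
)

// weekday maps a calendar date to Spark's weekday numbering
// (0 = Monday ... 6 = Sunday) by rotating Go's Sunday-first numbering.
func weekday(y, m, d int) int {
	t := time.Date(y, time.Month(m), d, 0, 0, 0, 0, time.UTC)
	return (int(t.Weekday()) + 6) % 7
}

func main() {
	fmt.Println(weekday(2024, 1, 1)) // 2024-01-01 was a Monday: 0
}
```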
Weekofyear - Extract the week number of a given date as integer.
WidthBucket - Returns the bucket number into which the value of this expression would fall after being evaluated.
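The equal-width bucketing WidthBucket describes can be sketched in plain Go; this simplified helper assumes `lo < hi` and a positive bucket count, and like the SQL function returns 0 below the range and numBuckets+1 at or above it:

```go
package main

import "fmt"

// widthBucket assigns v to one of numBuckets equal-width buckets over
// [lo, hi): bucket 0 is below the range, numBuckets+1 is at or above it.
func widthBucket(v, lo, hi float64, numBuckets int) int {
	if v < lo {
		return 0
	}
	if v >= hi {
		return numBuckets + 1
	}
	width := (hi - lo) / float64(numBuckets)
	return int((v-lo)/width) + 1
}

func main() {
	fmt.Println(widthBucket(5.3, 0.2, 10.6, 5)) // 3
}
```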
Window - Bucketize rows into one or more time windows given a timestamp specifying column.
WindowTime - Computes the event time from a window column.
Xpath - Returns a string array of values within the nodes of xml that match the XPath expression.
XpathBoolean - Returns true if the XPath expression evaluates to true, or if a matching node is found.
XpathDouble - Returns a double value, the value zero if no match is found, or NaN if a match is found but the value is non-numeric.
XpathFloat - Returns a float value, the value zero if no match is found, or NaN if a match is found but the value is non-numeric.
XpathInt - Returns an integer value, or zero if no match is found or a match is found but the value is non-numeric.
XpathLong - Returns a long integer value, or zero if no match is found or a match is found but the value is non-numeric.
XpathNumber - Returns a double value, the value zero if no match is found, or NaN if a match is found but the value is non-numeric.
XpathShort - Returns a short integer value, or zero if no match is found or a match is found but the value is non-numeric.
XpathString - Returns the text contents of the first xml node that matches the XPath expression.
Xxhash64 - Calculates the hash code of given columns using the 64-bit variant of the xxHash algorithm, and returns the result as a long column.
Year - Extract the year of a given date/timestamp as integer.
Years - Partition transform function: A transform for timestamps and dates to partition data into years.