
Allow Column type for timezone argument in pyspark.sql.functions #380

Closed
charlietsai opened this issue Mar 11, 2020 · 4 comments · Fixed by #383

Comments

@charlietsai
Contributor

In the following functions in pyspark.sql.functions:

def from_utc_timestamp(timestamp: ColumnOrName, tz: str) -> Column: ...
def to_utc_timestamp(timestamp: ColumnOrName, tz: str) -> Column: ...

we currently have tz: str, but the timezone can also be specified as a Column.

Example:

>>> from pyspark.sql import functions
>>> df = spark.sql("SELECT CAST(0 AS TIMESTAMP) AS timestamp, 'Asia/Tokyo' AS tz")
>>> df.select(functions.from_utc_timestamp(df.timestamp, df.tz)).collect()
[Row(from_utc_timestamp(timestamp, tz)=datetime.datetime(1970, 1, 1, 18, 0))]

I think this could be expanded to tz: ColumnOrName?
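For illustration, under the current stubs a type checker rejects the (perfectly valid) call above. Roughly what mypy would report (exact message wording approximate):

# tz is annotated as str, so passing a Column for it is flagged:
functions.from_utc_timestamp(df.timestamp, df.tz)
# error: Argument 2 to "from_utc_timestamp" has incompatible type
# "Column"; expected "str"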

@zero323
Owner

zero323 commented Mar 13, 2020

Thanks for reporting @charlietsai. Care to open a PR for this?

I think this could be expanded to tz: ColumnOrName?

Indeed, although in this particular case I'd prefer an explicit Union[Column, str] to stress that its meaning is different from ColumnOrName. It won't make any difference to the type checker, but it can make a big one for anyone who reads the annotation.
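For concreteness, a minimal sketch of how the revised stubs might read (assuming ColumnOrName is the alias Union[Column, str] used in the stubs, where the str is interpreted as a column name):

from typing import Union

from pyspark.sql.column import Column

ColumnOrName = Union[Column, str]  # as defined in the stubs; str = column name

# For tz, a str is a literal timezone ID rather than a column name, so
# spelling out the Union documents that difference even though the two
# annotations are identical to the type checker.
def from_utc_timestamp(timestamp: ColumnOrName, tz: Union[Column, str]) -> Column: ...
def to_utc_timestamp(timestamp: ColumnOrName, tz: Union[Column, str]) -> Column: ...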

@charlietsai
Contributor Author

Makes sense to me! Opening a PR.

@zero323
Owner

zero323 commented Apr 10, 2020

Thank you @charlietsai. Merged into master and backported to branch-3.0 and branch-2.4.

@charlietsai
Contributor Author

Thanks @zero323 for this work!
