bug - switching between 0 and 1 based indexes #68

staubda · 2022-02-02T18:27:22Z

Hive uses 0-based indexing, while Presto uses 1-based indexing, however in the following example sqlglot doesn't properly translate between the two:

print(sqlglot.transpile("""
SELECT
    SPLIT(str_col_with_space, ' ')[0]
FROM
    my_db.my_table
""", read='hive', write='presto', pretty=True)[0])

returns the query unchanged, but it should be

SELECT
    SPLIT(str_col_with_space, ' ')[1]
FROM
    my_db.my_table

and conversely

print(sqlglot.transpile("""
SELECT
    SPLIT(str_col_with_space, ' ')[1]
FROM
    my_db.my_table
""", read='presto', write='hive', pretty=True)[0])

returns the query unchanged, but it should be

SELECT
    SPLIT(str_col_with_space, ' ')[0]
FROM
    my_db.my_table

This is particularly dangerous when going from presto --> hive, as the wrongly translated code will always still be syntactically correct.

The text was updated successfully, but these errors were encountered:

staubda · 2022-02-02T18:30:55Z

this is using version 1.22.0

fixes #68 triggers warnings because it may not be correct for certain cases like maps with integer indicies

tobymao added a commit that referenced this issue Feb 7, 2022

allow for array index offsets

96e3afd

fixes #68 triggers warnings because it may not be correct for certain cases like maps with integer indicies

tobymao mentioned this issue Feb 7, 2022

allow for array index offsets #71

Merged

tobymao closed this as completed in #71 Feb 7, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bug - switching between 0 and 1 based indexes #68

bug - switching between 0 and 1 based indexes #68

staubda commented Feb 2, 2022 •

edited

staubda commented Feb 2, 2022

bug - switching between 0 and 1 based indexes #68

bug - switching between 0 and 1 based indexes #68

Comments

staubda commented Feb 2, 2022 • edited

staubda commented Feb 2, 2022

staubda commented Feb 2, 2022 •

edited