Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Zero delays as nulls #3

Open
rjendoubi opened this issue Sep 29, 2023 · 0 comments
Open

Zero delays as nulls #3

rjendoubi opened this issue Sep 29, 2023 · 0 comments

Comments

@rjendoubi
Copy link

Flights without delays have no values in certain "delay" columns in the CSV.
When those are loaded, they are treated as null values in Hive, and that persists to the ORC tables as well.
However, this then causes problems for certain visualizations created at the Data Viz part of the lab.

We should either

  • Fix the CSV to have zeros in those columns instead of no value, or
  • Update the CTAS queries to explicitly put zeros in those columns IF NULL, or
  • Include instructions on how to deal with this in CDV

Opening this to track, will aim to come back with more details and/or a PR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant