Support of temporal column types #21
Comments
Hi Siegfried, Thanks for your suggestions. A bridge for Commons CSV is a great idea to improve file format support. |
Hi Alexander, drop me a note when there is something to integrate - seems that I have some unexpected free time in summer due to COVID-19 :-) Siegfried |
Hi Siegfried, Help is always greatly appreciated :) I hope that I can work on custom value type support this weekend. |
I'm ready to kick into action - https://issues.apache.org/jira/browse/FREEMARKER-144 |
So I started working on custom value type support in this branch: value-type-abstraction and created issue #22. This approach would allow the creation of a Date ValueType that can support different formats for parsing and writing.
I got a proof of concept running integrating "nRo/Dataframe" into Apache FreeMarker CLI |
That's great! Regarding temporal column types. |
You make the decision - both approaches sound good to me :-) What I want to do:
So I guess I come from the side of defining the DataFrame and then populating it |
Okay, thanks for the feedback. |
Mostly finished but not really happy with the code :-) |
Is the feature branch stable enough to do some preliminary integration? |
The feature branch is pretty stable. |
Sorry for the delay, the custom value feature branch is now merged into master (#22). I will now start looking into temporal column types.
Each should be able to handle different formats.
Any other suggestions? |
Hi, I'm a little bit confused by "Timespan" - is this a time-only value, e.g. "13:42:23", or do you mean "3:24h"? Regarding operations - little to no suggestions from my side - I would mostly filter / sort / query on temporal columns coming from CSV and Excel. |
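The two readings of "Timespan" raised here can be told apart with plain `java.time` types. The following is only a sketch of the distinction, not how DataFrame models it: the class name `TimespanSketch` and both helper methods are hypothetical, and the `"3:24h"` parsing rule (hours, colon, minutes, trailing `h`) is an assumption taken from the example in the comment.

```java
import java.time.Duration;
import java.time.LocalTime;

public class TimespanSketch {

    // Reading 1: "13:42:23" is a point on the clock (time of day).
    static LocalTime asTimeOfDay(String value) {
        // LocalTime.parse uses the ISO local time format (HH:mm:ss) by default.
        return LocalTime.parse(value);
    }

    // Reading 2: "3:24h" is an amount of time (3 hours and 24 minutes).
    // The "H:mmh" layout is an assumption for this sketch.
    static Duration asDuration(String value) {
        String[] parts = value.replace("h", "").split(":");
        long hours = Long.parseLong(parts[0]);
        long minutes = Long.parseLong(parts[1]);
        return Duration.ofHours(hours).plusMinutes(minutes);
    }

    public static void main(String[] args) {
        System.out.println(asTimeOfDay("13:42:23"));
        System.out.println(asDuration("3:24h"));
    }
}
```

A time of day can be sorted and filtered like any other temporal value, while a duration is what you would expect as the result of subtracting two dates.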
Hi, |
Ack - assuming that I understood the things correctly
A few questions along the line
|
sorry for the delay again
exactly. some points I am still not sure about:
Yes, that's how I would do it. So that if two dates are subtracted you get information about how many hours, minutes, seconds,... passed between those dates.
Timestamps could be supported by adding parsing hints to the CSV parser. Autodetection for timestamps is more difficult. A timestamp could also be a Long column. |
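The ideas in this comment (a format hint for parsing, subtraction yielding a timespan, and timestamps stored as epoch milliseconds) can be illustrated with standard `java.time` calls. This is a minimal sketch under assumptions: `TemporalSketch` and its methods are hypothetical names, and nothing here reflects the actual DataFrame or CSV parser API.

```java
import java.time.Duration;
import java.time.Instant;
import java.time.LocalDate;
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

public class TemporalSketch {

    // A "parsing hint" is modelled here as a plain pattern string, e.g. "dd.MM.yyyy".
    static LocalDate parseDate(String value, String pattern) {
        return LocalDate.parse(value, DateTimeFormatter.ofPattern(pattern));
    }

    // Subtracting two date-times yields a Duration (the "timespan").
    static Duration between(LocalDateTime start, LocalDateTime end) {
        return Duration.between(start, end);
    }

    // A Long cell is only a timestamp if the caller says so; the same value
    // could equally be an ordinary number, which is why autodetection is hard.
    static Instant fromEpochMillis(long millis) {
        return Instant.ofEpochMilli(millis);
    }

    public static void main(String[] args) {
        LocalDate date = parseDate("14.10.2019", "dd.MM.yyyy");
        LocalDateTime start = LocalDateTime.parse("2019-10-14T12:00:00");
        LocalDateTime end = LocalDateTime.parse("2019-10-15T13:30:00");
        Duration span = between(start, end);
        System.out.println(date);
        System.out.println(span.toHours() + "h " + span.toMinutesPart() + "m");
        System.out.println(fromEpochMillis(0L));
    }
}
```

Note that `Duration.toMinutesPart()` requires Java 9 or later.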
Hi Alexander,
Don't worry about the delays - it is an open-source project after all :-)
Thanks in advance,
Siegfried Goeschl
… On 22.08.2020, at 12:57, Alexander Grün ***@***.***> wrote:
sorry for the delay again
A column to be read would consist of either Date (e.g. 14.10.2019) or DateTime in some format (e.g. 2019-10-14T12:00:00)
exactly. some points I am still not sure about:
how to pass format information to the CSV parser (should be rather simple)
implement autodetection for temporal columns
How to handle Timezones
Subtraction of dates would result in a timespan
Yes, that's how I would do it. So that if two dates are subtracted you get information about how many hours, minutes, seconds,... passed between those dates.
Is it possible to read timespans from CSV? Usually they don't have a qualifier such as seconds or day as cell values
Would timestamps be supported, e.g. "12:01:31"?
Timestamps could be supported by adding parsing hints to the CSV parser.
They could either just be epoch milliseconds (Long) or a format like you described.
Autodetection for timestamps is more difficult. A timestamp could also be a Long column.
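One way the autodetection of temporal columns mentioned above might work is to try a fixed list of candidate patterns and accept the first one that parses every value in the column. This is only a sketch: `DateDetectSketch`, `detectPattern`, and the candidate list are hypothetical and not part of DataFrame.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;
import java.util.List;
import java.util.Optional;

public class DateDetectSketch {

    // Candidate patterns are an assumption; a real implementation would
    // likely make them configurable.
    static final List<String> CANDIDATES = List.of("yyyy-MM-dd", "dd.MM.yyyy", "MM/dd/yyyy");

    // Returns the first pattern that parses all values, or empty if none does.
    static Optional<String> detectPattern(List<String> values) {
        if (values.isEmpty()) {
            return Optional.empty();
        }
        for (String pattern : CANDIDATES) {
            DateTimeFormatter formatter = DateTimeFormatter.ofPattern(pattern);
            if (values.stream().allMatch(v -> parses(v, formatter))) {
                return Optional.of(pattern);
            }
        }
        return Optional.empty();
    }

    static boolean parses(String value, DateTimeFormatter formatter) {
        try {
            LocalDate.parse(value, formatter);
            return true;
        } catch (DateTimeParseException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(detectPattern(List.of("14.10.2019", "01.02.2020")));
        // A column of epoch milliseconds matches no date pattern, so it would
        // fall through to Long unless the caller supplies a parsing hint.
        System.out.println(detectPattern(List.of("1571054400000")));
    }
}
```

Ambiguous columns (for example values that fit both `dd/MM/yyyy` and `MM/dd/yyyy`) would still need an explicit hint, since the first matching candidate wins.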
|
Hi Alexander,
I'm looking at "DataFrame" to use it in one of my pet projects
Siegfried