-
Notifications
You must be signed in to change notification settings - Fork 81
Conversation
TSDataset
TSDataset.describe
Codecov Report
@@ Coverage Diff @@
## master #409 +/- ##
==========================================
- Coverage 87.22% 87.08% -0.15%
==========================================
Files 99 99
Lines 5003 5047 +44
==========================================
+ Hits 4364 4395 +31
- Misses 639 652 +13
Continue to review full report at Codecov.
|
etna/datasets/tsdataset.py
Outdated
start_date end_date length num_missing num_segments num_exogs num_regressors freq | ||
segments | ||
segment_0 2021-06-01 2021-06-30 30 0 2 1 1 D | ||
segment_1 2021-06-01 2021-06-30 30 0 2 1 1 D |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks like there are only four segment-specific columns: start_date
, end_date
, num_missing
, length
; another ones duplicate each other, don't they?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we should change *_date
-> *_timestamp
# Conflicts: # CHANGELOG.md # tests/test_datasets/test_dataset.py
etna/datasets/tsdataset.py
Outdated
Information about individual segments: | ||
* start_timestamp: beginning of the segment, missing values in the beginning are ignored | ||
* length: length according to start_date and end_date | ||
* num_missing: number of missing variables between start_date and end_date |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
* num_missing: number of missing variables between start_date and end_date | |
* num_missing: number of missing variables between start_timestamp and end_timestamp |
etna/datasets/tsdataset.py
Outdated
|
||
Information about individual segments: | ||
* start_timestamp: beginning of the segment, missing values in the beginning are ignored | ||
* length: length according to start_date and end_date |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
* length: length according to start_date and end_date | |
* length: length according to start_timestamp and end_timestamp |
etna/datasets/tsdataset.py
Outdated
>>> ts = TSDataset(df_ts_format, df_exog=df_exog_ts_format, freq="D") | ||
>>> ts.info() | ||
<class 'etna.datasets.TSDataset'> | ||
end_timestamp: 2021-06-30 00:00:00 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
actually end_timestamp
can be different for different series
I can run smth like
ts = TSDataset(...)
ts.info()
and should get valid result
etna/datasets/tsdataset.py
Outdated
* start_timestamp: beginning of the segment, missing values in the beginning are ignored | ||
* end_timestamp: ending of the segment, missing values are not ignored, common for all segments | ||
* length: length according to start_date and end_date | ||
* num_missing: number of missing variables between start_date and end_date |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
* num_missing: number of missing variables between start_date and end_date | |
* num_missing: number of missing variables between start_timestamp and end_timestamp |
etna/datasets/tsdataset.py
Outdated
Method describes dataset in segment-wise fashion. Description columns: | ||
* start_timestamp: beginning of the segment, missing values in the beginning are ignored | ||
* end_timestamp: ending of the segment, missing values are not ignored, common for all segments | ||
* length: length according to start_date and end_date |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
* length: length according to start_date and end_date | |
* length: length according to start_timestamp and end_timestamp |
# Conflicts: # CHANGELOG.md
…> end_timestamps in docstrings
IMPORTANT: Please do not create a Pull Request without creating an issue first.
Before submitting (must do checklist)
Type of Change
Proposed Changes
Look #347
Related Issue
#347
Closing issues
Closes #347