Documentation #71

gorold · 2024-06-12T07:37:57Z

This branch/PR compiles the efforts to enhance the documentation of the uni2ts library.

Co-authored-by: Chenghao Liu <74166079+chenghaoliu89@users.noreply.github.com>

liu-jc · 2024-06-14T09:16:17Z

Are we good to merge this? We can create a new one when we have more documentation?

HALF111 · 2024-06-14T13:19:05Z

Hello, I'm sorry to bother you. When reading the code, I think the following parts may be relatively difficult to understand, and it is possible to add some comments in the code or additional documentation to further enhance the understanding of the project.

Regarding Data Formats:

The definitions for wide, long, and long-wide data formats appear unclear, leaving uncertainty about their distinct characteristics in https://github.com/SalesforceAIResearch/uni2ts/tree/main/src/uni2ts/data/builder/simple.py .
'offset' and 'date_offset' Clarification:

It would be helpful to have a brief explanation for the terms 'offset' and 'date_offset' within the function definitions in the same file.
Understanding Column Types and getitem Methods:

It would be helpful to state the distinction between 'seqs' and 'non-seqs' columns, along with a detailed explanation of the methods '_getitem_int', '_getitem_iterable', and '_getitem_slice' in https://github.com/SalesforceAIResearch/uni2ts/tree/main/src/uni2ts/data/indexer/hf_dataset_indexer.py .
MultiSampleTimeSeriesDataset Functionality:

Clarification on the purpose of 'MultiSampleTimeSeriesDataset' and how it differs from 'TimeSeriesDataset' in https://github.com/SalesforceAIResearch/uni2ts/tree/main/src/uni2ts/data/dataset.py would enhance the understanding of dataset handling in the project.
Sequence Packing Explanation:

It would be better if documentation for the 'PackCollate' function as well as how Sequence Packing works in https://github.com/SalesforceAIResearch/uni2ts/tree/main/src/uni2ts/data/loader.py can be provided, particularly how it uses queues to pack data and the process of forming batches, seems not that clear.
DistrParamProj and DistributionOutput Classes:

The definition and operationals of the DistrParamProj class, along with the DistributionOutput class, which involve nested functions and lambdas, are not clearly outlined. An explanation of their roles within the model definition context would be better. (https://github.com/SalesforceAIResearch/uni2ts/tree/main/src/uni2ts/distribution/_base.py and https://github.com/SalesforceAIResearch/uni2ts/tree/main/src/uni2ts/model/moirai/module.py)
Explanation of PackedLoss Functions, etc:

It would be helpful to provide a comprehensive explanation of the computation and application of 'PackedLoss', 'PackedPointLoss', and 'PackedDistributionLoss' defined in https://github.com/SalesforceAIResearch/uni2ts/tree/main/src/uni2ts/loss/packed/_base.py .
Explanation of train_transform_map in Model Definition:

A brief explanation of specific transformations applied within 'train_transform_map' in https://github.com/SalesforceAIResearch/uni2ts/tree/main/src/uni2ts/model/moirai/pretrain.py would be invaluable, which can help understand how each transform contributes to the processing of training data in the 'MoiraiPretrain` model.

Thank you again for your great work, and hope that you can find these suggestions/information useful!

chenghaoliu89 · 2024-06-15T04:46:32Z

@gorold could you add clarification for load_dataset and _get_transform from lotsa_v1/_base.py?

chenghaoliu89 · 2024-06-19T12:27:51Z

better to add code comment about batch, packed_batch, merged_batch in line 109-116 from loader.py, and show their format, shape, the label of padding used for bin_package etc.

gorold · 2024-06-24T02:17:59Z

better to add code comment about batch, packed_batch, merged_batch in line 109-116 from loader.py, and show their format, shape, the label of padding used for bin_package etc.

You can see the typehints for format and shape of arrays and tensors, but yeah, I'll add the docstrings for loader.py

chenghaoliu89 · 2024-07-02T06:21:06Z

src/uni2ts/data/loader.py

@@ -141,6 +161,14 @@ def first_fit_decreasing_bin_packing(
    def get_sample_id(
        self, batch: list[list[Sample]], bin_spaces: Int[np.ndarray, "batch"]
    ) -> Int[torch.Tensor, "batch seq"]:
+        """
+        Create an array of integers representing the sample id in a sequence.
+        Sample id starts from 1, and 0 represents padding.


This is very helpful, could you also explain the what it means when the variate_id, time_id value is 1 or 0 in transform?

simple comments on model/moirai in source code (#69)

048ff5d

Co-authored-by: Chenghao Liu <74166079+chenghaoliu89@users.noreply.github.com>

salesforce-cla bot added the cla:signed label Jun 12, 2024

gorold added the documentation Improvements or additions to documentation label Jun 12, 2024

gorold added 3 commits June 12, 2024 17:56

update contributed docstrings

db72959

Add Indexer documentation

1ee0790

Add documentation for datasets and builders

79061ce

gorold added 3 commits June 28, 2024 15:34

add loader documentation

e5f30c0

add packedloss doc

c869da0

add some comments for distr

c449083

chenghaoliu89 reviewed Jul 2, 2024

View reviewed changes

add docs for distributions base class and refactor for readability

07656c4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Documentation #71

Documentation #71

gorold commented Jun 12, 2024

liu-jc commented Jun 14, 2024

HALF111 commented Jun 14, 2024

chenghaoliu89 commented Jun 15, 2024 •

edited

Loading

chenghaoliu89 commented Jun 19, 2024 •

edited

Loading

gorold commented Jun 24, 2024

chenghaoliu89 Jul 2, 2024

Documentation #71

Are you sure you want to change the base?

Documentation #71

Conversation

gorold commented Jun 12, 2024

liu-jc commented Jun 14, 2024

HALF111 commented Jun 14, 2024

chenghaoliu89 commented Jun 15, 2024 • edited Loading

chenghaoliu89 commented Jun 19, 2024 • edited Loading

gorold commented Jun 24, 2024

chenghaoliu89 Jul 2, 2024

Choose a reason for hiding this comment

chenghaoliu89 commented Jun 15, 2024 •

edited

Loading

chenghaoliu89 commented Jun 19, 2024 •

edited

Loading