Currently, demo data can only be obtained through the function sdgx.utils.io.csv_utils.get_demo_single_table, and only one dataset (adult) is supported. In this issue, please implement a more systematic demo data management module.
🏕Solution
We recommend moving this module out of the script sdgx/utils/io/csv_utils.py and implementing it as a separate script in the sdgx/utils/io/ directory.
We recommend creating a file demo_data.py and implementing the functions or class in this file.
🍰 Example
We provide a class example for your reference:
import pandas as pd

# ISSUE DESCRIPTION A DemoData example
class DemoData(object):
    def __init__(self, dataset_name) -> None:
        # ISSUE DESCRIPTION
        # the dataset name should be checked
        pass

    def get_data(self, offline_path=None) -> pd.DataFrame:
        # ISSUE DESCRIPTION
        # if offline_path is not None,
        # read data from the input path
        pass

    def download_data(self) -> None:
        # download the target dataset from the Internet
        pass
⚙️ Detail
Some operations that enhance user experience are also worthwhile, such as:
When we support many datasets, it is unreasonable to put every dataset in the dataset/ directory. We expect to support downloading the target dataset from the Internet, which helps reduce the size of the git repository.
Because download speeds in mainland China can be slow, you can ask the development team to host some of the larger datasets and provide download links for them. These links should be faster than the datasets' original links.
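The download behaviour described above could be sketched roughly as follows. This is only an illustration under assumptions, not the project's implementation: the function name `fetch_demo_dataset`, the `DEMO_DATASETS` registry, the cache location, and both URLs are hypothetical placeholders. The idea is to check the dataset name, reuse a cached copy if one exists, and otherwise try a faster mirror link before falling back to the original link.

```python
import shutil
import urllib.request
from pathlib import Path

# Hypothetical registry: dataset name -> candidate URLs, tried in order.
# Both URLs are placeholders; a real mirror link would come from the dev team.
DEMO_DATASETS = {
    "adult": [
        "https://mirror.example.com/demo/adult.csv",  # assumed faster mirror
        "https://origin.example.com/demo/adult.csv",  # assumed original link
    ],
}


def fetch_demo_dataset(name, cache_dir=None, registry=None, timeout=30):
    """Return a local path to the dataset, downloading it if it is not cached."""
    registry = DEMO_DATASETS if registry is None else registry
    if name not in registry:
        raise ValueError(f"Unknown demo dataset {name!r}; known: {sorted(registry)}")

    # Assumed default cache location outside the git repository.
    cache_dir = Path(cache_dir) if cache_dir else Path.home() / ".cache" / "sdgx_demo"
    cache_dir.mkdir(parents=True, exist_ok=True)
    target = cache_dir / f"{name}.csv"
    if target.exists():
        return target  # already downloaded, skip the network

    errors = []
    for url in registry[name]:
        tmp = target.with_suffix(".part")
        try:
            # Write to a temporary file first so a failed download
            # never leaves a truncated dataset in the cache.
            with urllib.request.urlopen(url, timeout=timeout) as resp, open(tmp, "wb") as out:
                shutil.copyfileobj(resp, out)
            tmp.replace(target)
            return target
        except OSError as exc:  # URLError and HTTPError are OSError subclasses
            errors.append(f"{url}: {exc}")
            tmp.unlink(missing_ok=True)

    raise RuntimeError("All download links failed:\n" + "\n".join(errors))
```

A `DemoData.get_data` method could then pass the returned path to `pd.read_csv`, or read `offline_path` directly when the caller supplies one.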
🚅Search before asking
I have searched for issues similar to this one.