SyntheticDataGenerator: A Python library for data analysts to generate synthetic datasets
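# Note: generate_synthetic_data must first be imported from the library;
# the import path is not shown in this excerpt.

# Each spec is a (column name, column type) pair.
column_specs = [
    ('Column1', 'int'),
    ('Column2', 'float'),
    ('Column3', 'str'),
    ('Column4', 'date'),
    ('Column5', 'int_uniq'),
    ('Column6', 'bool'),
]
num_rows = 10           # number of rows to generate
include_nulls = False   # no null values are requested in this example
str_categories = ['apple', 'banana', 'cherry', 'orange']  # values for 'str' columns
start_date = '2022-01-01'  # bounds for 'date' columns
end_date = '2022-12-31'

df = generate_synthetic_data(column_specs, num_rows, include_nulls, str_categories, start_date, end_date)
print(df)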
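
To make the column type names in the usage example more concrete, here is a minimal standard-library sketch of how specs such as `'int_uniq'` or `'date'` could be turned into values. It is not the library's implementation: the function name `sketch_generate_rows`, the `seed` parameter, the value ranges, the 10% null rate, and the list-of-dicts return type are all assumptions made for illustration.

```python
import random
from datetime import date, timedelta


def sketch_generate_rows(column_specs, num_rows, include_nulls=False,
                         str_categories=None, start_date="2022-01-01",
                         end_date="2022-12-31", seed=None):
    """Hypothetical stand-in for illustration; returns a list of row dicts."""
    rng = random.Random(seed)  # assumed: seeding for reproducible output
    start = date.fromisoformat(start_date)
    span_days = (date.fromisoformat(end_date) - start).days
    categories = str_categories or ["a", "b", "c"]

    columns = {}
    for name, kind in column_specs:
        if kind == "int":
            values = [rng.randint(0, 100) for _ in range(num_rows)]
        elif kind == "int_uniq":
            # Sampling without replacement guarantees distinct integers.
            values = rng.sample(range(num_rows * 10), num_rows)
        elif kind == "float":
            values = [rng.uniform(0.0, 1.0) for _ in range(num_rows)]
        elif kind == "str":
            values = [rng.choice(categories) for _ in range(num_rows)]
        elif kind == "date":
            # Pick a random day offset inside the inclusive date range.
            values = [start + timedelta(days=rng.randint(0, span_days))
                      for _ in range(num_rows)]
        elif kind == "bool":
            values = [rng.random() < 0.5 for _ in range(num_rows)]
        else:
            raise ValueError(f"unknown column type: {kind!r}")

        # Assumed behavior: blank out roughly 10% of values when nulls are
        # requested, but keep unique-integer columns fully populated.
        if include_nulls and kind != "int_uniq":
            values = [None if rng.random() < 0.1 else v for v in values]
        columns[name] = values

    # Transpose the column lists into one dict per row.
    return [dict(zip(columns, row)) for row in zip(*columns.values())]
```

Calling `sketch_generate_rows(column_specs, num_rows, include_nulls, str_categories, start_date, end_date)` with the values from the example yields ten row dicts; the column lists can be passed to a DataFrame constructor if a tabular object is preferred.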
Natalia Chernikova
Stanislav Kaliberda
bantichsasha