Synthetic dataset generation

- Read my article on Medium about synthetic data generation and why it is critical for self-driven data science
- Synthetic datasets for regression, classification, and clustering using scikit-learn functions
- Synthetic random regression and classification problems from symbolic expressions (using the sympy package)
- Synthetic categorical data (name, address, phone number, office title, license plate, etc.) generated using the pydbgen package
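
As a rough illustration of the scikit-learn and sympy items above, here is a minimal sketch. It is not the code from the notebooks in this repo: the sample sizes, the noise levels, the seed, and the expression `x1**2 + 3*sin(x2)` are all arbitrary assumptions chosen for the example.

```python
import numpy as np
import sympy as sp
from sklearn.datasets import make_blobs, make_classification, make_regression

SEED = 0  # arbitrary seed, used only to make the sketch reproducible

# Regression: 200 samples, 5 features, Gaussian noise added to the target
X_reg, y_reg = make_regression(
    n_samples=200, n_features=5, noise=10.0, random_state=SEED
)

# Classification: binary labels, 3 of the 5 features are informative
X_clf, y_clf = make_classification(
    n_samples=200,
    n_features=5,
    n_informative=3,
    n_redundant=0,
    n_classes=2,
    random_state=SEED,
)

# Clustering: 3 isotropic Gaussian blobs in 2 dimensions
X_blob, y_blob = make_blobs(
    n_samples=200, centers=3, n_features=2, random_state=SEED
)

# Symbolic expression: turn a sympy expression into a NumPy function,
# evaluate it on random inputs, and add noise to get a regression target
x1, x2 = sp.symbols("x1 x2")
expr = x1**2 + 3 * sp.sin(x2)  # hypothetical generating function
f = sp.lambdify((x1, x2), expr, "numpy")

rng = np.random.default_rng(SEED)
X_sym = rng.uniform(-5, 5, size=(200, 2))
y_sym = f(X_sym[:, 0], X_sym[:, 1]) + rng.normal(0, 0.5, size=200)

# Thresholding the noiseless expression gives a classification target
y_sym_class = (f(X_sym[:, 0], X_sym[:, 1]) > 0).astype(int)
```

Because the generating function is known in the symbolic case, a fitted model can be compared against the true relationship, which is not possible with most real-world data.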