A list of TrainingData-datasets found in the industry and their descriptions are shown below. Clicking the link will take you to the data description page with all information anout the dataset and saample of data for you to try it.
This block includes collected datasets with a large array of biometric data (photos & videos of real people) for deep learning and computer vision, as well as for improving safety and identification systems.
- Face Masks Detection Dataset: Selfies of people wearing medical masks in 4 ways (no mask on the face, mask does not cover the nose or mouth, mask covers the mouth, mask covers the nose and mouth).
- Anti Spoofing Real Dataset: Selfie photos and selfie videos of people. The dataset solves the tasks of training algorithms to distinguish real users from scammers.
- Anti Spoofing Replay Dataset: Replay video attacks from the phone screens to train biometric systems to recognize a real person from a prepared photos and videos.
- Selfies and Video Dataset: Photos and videos of people taken on a phone and on a webcam under different lighting conditions.
- Selfie and ID Dataset: Selfies of people and photos of their ID. The dataset is used to train neural networks to create biometric technologies and reindentification systems.
- Portrait and 26 Photos Dataset: Dataset for training a neural network in face recognition and identification tasks using selfie and 26 photos of a person.
Data generation in machine learning involves creating or manipulating data to train and evaluate machine learning models. The purpose of data generation is to provide diverse and representative examples that cover a wide range of scenarios, ensuring the model's robustness and generalization. TrainingData can generate data on the following projects according to your requirments.
-
GENERATED USA Passports Augmentation: Dataset with replicas of official USA passports but with randomly generated details and various augmentations.
-
GENERATED Vietnamese Passports: Dataset with replicas of official Vietnamese passports but with randomly generated details.
-
GENERATED E-mail Spam: Dataset with e-mail spam messages generated using Davinci Open AI.