GESIS Improved Gender Inference Deep Learning

Requirements

Keras (Using TensorFlow backend)

About

GIGIDL takes vision inputs specifically image of a person or web address of a image. Generally GIGIDL identify faces from the image and take this single or multiple faces as a input and provides the gender of that face or faces with the probability of accuracy if it’s human otherwise it skip the prediction of image data.

Training

To train our model we have used full dataset of IMDB and WIKI together (after cleaning total 36,356 images of human). We have used 10-fold cross-validation to minimize the bias and got 98.28 % training accuracy.

How to Use:

This model takes an input image and gives the prediction as an output. You need to a) Copy or clone this package (git clone https://github.com/gesiscss/Improved-Gender-Identifier.git) and follow the instructions. b) Preprocess (for a bunch of images) like resize to get them reshaped. Use the preprocess script to make the images right.

You can use this model to predict the gender using image or a bunch of images (same extension e.g., only jpg or png and so on ) as data in any format but need to mention the extension (e.g., jpg, png etc.).

Image could be inputed mostly 2 different ways:

Web-link

Simply provide the web address of the image or images as web link in the system (e.g., web_link = “https://github.com/gesiscss/Improved-Gender -Identifier/blob/master/GIGIDL/Data/a.jpeg”).

Directory

You need to create a directory called ‘data’ in your default directory or individually and keep your image or images in that directory in the system (e.g., [default directory] /data/).

Install

Datasets

IMDB Dataset

What is in the data: The IMDB dataset contains 460,723 facial images (with gender and age labels) of film stars, predominantly Hollywood actors and actresses. We have taken only the frontal faces which has the face score 4.5 or above and get 33,147 facial images.

Number of images: 33,147

Male images: 14370

Female images: 18777

Wikipedia Dataset

What is in the data: Wikipedia dataset includes 62,328 (with gender and age labels) of celebrities from various fields, such as sports, politics, social events, and the film industry. We have taken only the frontal faces which has the face score 5 or above and get 3,209 facial images.

Number of images: 3,209

Male images: 1830

Female images: 1379

Scholars Dataset (manual labeled at GESIS)

What is in the data: Images of xx scholars which have been manually labeled by the coders at GESIS.

Number of images: 3,324

Male images: 1,953

Female images: 1,371

Every performance have huge differences between male and female images. We have checked the reasons and got some problems which need to be minimized to get better performance for female and that is,
```
  a) Bad performance for coloured data 
  b) Better performance when the canvas is not coloured i.e. noisy 
  c) More better when the canvas is perfectly white.
```

Note: Quality images means low resolution, background is noisy, face is not frontal, side view or multiple objects in the image, have eye glass, cap or cloths top to the head and so on.

Limitations

We have some limitations in our training model i.e.,

    a) Accuracy for females is lower than the accuracy for males.
    b) Predictions for dark, low resolutions and less contrast images is lower.
    c) For the people of east asia prediction accuracy is also lower.

FAQs

If you have any questions then contact with the team members.

Name		Name	Last commit message	Last commit date
Latest commit History 230 Commits
CNN_baseline		CNN_baseline
DeepFace		DeepFace
Gender_classification_M3		Gender_classification_M3
Gender_classification_Names		Gender_classification_Names
Notebooks		Notebooks
retrieve_data		retrieve_data
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
install.R		install.R
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GESIS Improved Gender Inference Deep Learning

Requirements

About

Training

How to Use:

Install

Datasets

Limitations

FAQs

More Questions

License

About

Releases

Packages

Contributors 3

Languages

gesiscss/Improved-Gender-Identifier

Folders and files

Latest commit

History

Repository files navigation

GESIS Improved Gender Inference Deep Learning

Requirements

About

Training

How to Use:

Install

Datasets

Limitations

FAQs

More Questions

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages