## Inspiration behind this work:
* I want an attendance management system driven by computer vision.
* It should mark people present just by recognizing their faces.
* If the system cannot recognize a person with sufficient confidence, a tagging feature lets the user add a name, ID, etc. for the detected face.
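
The fallback described in the last bullet can be sketched as a simple decision rule. This is only an illustration, not the project's implementation: it assumes confidence is expressed as a face distance (smaller means more similar), and the helper name `decide_attendance`, the `Decision` tuple, and the 0.6 cut-off (the default tolerance of `face_recognition.compare_faces`) are assumptions.

```python
from typing import NamedTuple, Optional, Sequence


class Decision(NamedTuple):
    name: Optional[str]   # recognised person, or None if unsure
    needs_tagging: bool   # True when the user should be asked for a name/ID


def decide_attendance(distances: Sequence[float],
                      names: Sequence[str],
                      max_distance: float = 0.6) -> Decision:
    """Pick the closest known face, or request manual tagging.

    distances[i] is assumed to be the distance between the detected face
    and the known face names[i] (smaller means more similar).
    """
    if len(distances) != len(names):
        raise ValueError("distances and names must have the same length")
    if len(distances) == 0:
        # Nothing enrolled yet, so every face has to be tagged by the user.
        return Decision(None, True)
    best = min(range(len(distances)), key=lambda i: distances[i])
    if distances[best] <= max_distance:
        return Decision(names[best], False)
    return Decision(None, True)


print(decide_attendance([0.42, 0.71], ["Amit", "Priti"]))
print(decide_attendance([0.65, 0.71], ["Amit", "Priti"]))
```

The first call returns a confident match; the second has no candidate within the cut-off, so the face would be handed to the tagging feature.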

## References:
* The [face_recognition](https://github.com/code4thought/face_recognition) package, used for easier face processing and storage

## The pseudo code
**Step-1:** Import the required libraries
   * numpy
   * opencv (imported as cv2)
   * face_recognition (it has a set of dependencies that must be installed for it to work). Refer to [this guide](https://www.pyimagesearch.com/2018/01/22/install-dlib-easy-complete-guide/) for installation steps. Specifically, look for the steps below:
![pic](./docs/steps-for-dlib.png)
and then,
![pic](./docs/steps-for-dlib-01.png)
**Please note:** The above steps are valid for installation on Ubuntu. Steps for macOS are also provided in the reference link. Windows installation is a bit of a challenge. However, there are references for that too. (*Not tried though!*)

In [11]:
import numpy as np
import face_recognition
import cv2

In [12]:
# define the base location where all the images to be face-cropped are kept
# iterate through all the images, identify faces in them and crop the faces
# store all cropped faces into another folder

image = face_recognition.load_image_file('amit.jpg')
print(type(image))
# load_image_file returns RGB; OpenCV's imshow/imwrite expect BGR, so swap the channels
image = cv2.cvtColor(image, cv2.COLOR_RGB2BGR)

<class 'numpy.ndarray'>
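
Because `face_recognition.face_locations` returns each box as `(top, right, bottom, left)`, the cropping step planned in the comments above can be kept separate from detection. A minimal sketch under that assumption: `crop_faces` is a hypothetical helper, and the zero-filled array merely stands in for a loaded image.

```python
import numpy as np


def crop_faces(image, boxes):
    """Return one cropped array per (top, right, bottom, left) box."""
    height, width = image.shape[:2]
    crops = []
    for top, right, bottom, left in boxes:
        # Clamp to the image so a box touching the border stays in range
        top, bottom = max(0, top), min(height, bottom)
        left, right = max(0, left), min(width, right)
        if bottom > top and right > left:
            crops.append(image[top:bottom, left:right].copy())
    return crops


# Stand-in for an image returned by face_recognition.load_image_file(...)
image = np.zeros((480, 640, 3), dtype=np.uint8)
# Boxes in the order produced by face_recognition.face_locations(image)
boxes = [(150, 322, 305, 167), (253, 184, 408, 29)]
faces = crop_faces(image, boxes)
print([face.shape for face in faces])
```

Each crop could then be written to the output folder with `cv2.imwrite`, one file per face.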


In [None]:
# apply clustering algorithms to group similar faces together
# this would save a lot of time labelling the images one-by-one
# barring few erroneous cases, if the 
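
One simple way to approach the grouping idea noted above is a greedy pass: each encoding joins the nearest existing cluster if that cluster's representative lies within a distance threshold, and otherwise starts a new cluster. This is a sketch, not necessarily the algorithm intended here. `greedy_cluster` and the 0.6 threshold are assumptions, and the synthetic vectors stand in for the 128-dimensional encodings returned by `face_recognition.face_encodings`.

```python
import numpy as np


def greedy_cluster(encodings, threshold=0.6):
    """Assign a cluster id to every encoding (rows of a 2-D array)."""
    representatives = []   # first encoding seen for each cluster
    labels = []
    for encoding in encodings:
        if representatives:
            distances = np.linalg.norm(np.array(representatives) - encoding, axis=1)
            nearest = int(np.argmin(distances))
            if distances[nearest] <= threshold:
                labels.append(nearest)
                continue
        # No cluster is close enough: this encoding starts a new one
        representatives.append(encoding)
        labels.append(len(representatives) - 1)
    return labels


# Synthetic data: two "people", three slightly different encodings each
rng = np.random.default_rng(0)
person_a = rng.normal(size=128)
person_b = person_a + 2.0   # deliberately far away from person_a
order = (person_a, person_a, person_b, person_a, person_b, person_b)
samples = np.array([p + rng.normal(scale=0.01, size=128) for p in order])
print(greedy_cluster(samples))
```

With faces grouped this way, only one label per cluster has to be entered by hand, and the occasional wrong assignment can be corrected afterwards.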

In [13]:
cv2.imshow("test",image)
cv2.waitKey(0)
cv2.destroyAllWindows()

In [14]:
face_locations = face_recognition.face_locations(image)

In [15]:
for (top, right, bottom, left) in face_locations:
    print(top,right,bottom,left)
    #cv2.rectangle(image,(left,top),(right,bottom),(0,255,0),2)
    frame = image[top:bottom,left:right]
    cv2.imwrite('img_'+str(top)+'.jpg',frame)
#cv2.imwrite('drawn.jpg',image)

150 322 305 167
253 184 408 29


In [None]:
cv2.rectangle(image,(167,150),(322,305),(0,255,0),1)
#cv2.rectangle(image,(29,253),(184,408),(0,255,0),1)
cv2.imwrite('amit5.jpg',image)

In [13]:
video_capture = cv2.VideoCapture(0)

In [14]:
amit_image = face_recognition.load_image_file("amit/img-01.jpg")
amit_face_encoding = face_recognition.face_encodings(amit_image)[0]

In [15]:
priti_image = face_recognition.load_image_file("priti/img-01.jpg")
priti_face_encoding = face_recognition.face_encodings(priti_image)[0]

In [16]:
known_face_encodings = [
    amit_face_encoding,
    priti_face_encoding
]
known_face_names = [
    "Amit",
    "Priti"
]
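
The paths used above (`amit/img-01.jpg`, `priti/img-01.jpg`) suggest one folder per person. Under that assumption, the two lists could be built by walking the folders rather than writing one cell per person. A hedged sketch: `collect_reference_images` and the `faces` root directory are hypothetical, and the encoding step itself is only mentioned in a comment because it needs real images on disk.

```python
from pathlib import Path


def collect_reference_images(root, extensions=(".jpg", ".jpeg", ".png")):
    """Map each person's display name to the sorted image paths in their folder."""
    references = {}
    for person_dir in sorted(Path(root).iterdir()):
        if not person_dir.is_dir():
            continue
        images = sorted(p for p in person_dir.iterdir()
                        if p.suffix.lower() in extensions)
        if images:
            references[person_dir.name.capitalize()] = images
    return references


if __name__ == "__main__":
    root = Path("faces")   # hypothetical folder containing amit/, priti/, ...
    if root.is_dir():
        for name, paths in collect_reference_images(root).items():
            # Each path could then go through face_recognition.load_image_file
            # and face_recognition.face_encodings, as in the cells above.
            print(name, [p.name for p in paths])
```

Keeping names and image paths in one mapping avoids the two parallel lists drifting out of sync when a person is added.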

In [17]:
# Initialize some variables
face_locations = []
face_encodings = []
face_names = []
process_this_frame = True

while True:
    # Grab a single frame of video; stop if the camera returns nothing
    ret, frame = video_capture.read()
    if not ret:
        break
    # Resize frame of video to 1/4 size for faster face recognition processing
    small_frame = cv2.resize(frame, (0, 0), fx=0.25, fy=0.25)
    # Convert the image from BGR color (which OpenCV uses) to RGB color (which face_recognition uses).
    # cvtColor returns a contiguous array, which a reversed slice ([:, :, ::-1]) does not.
    rgb_small_frame = cv2.cvtColor(small_frame, cv2.COLOR_BGR2RGB)

    # Only process every other frame of video to save time
    if process_this_frame:
        # Find all the faces and face encodings in the current frame of video
        face_locations = face_recognition.face_locations(rgb_small_frame)
        face_encodings = face_recognition.face_encodings(rgb_small_frame, face_locations)

        face_names = []
        for face_encoding in face_encodings:
            # See if the face is a match for the known face(s)
            matches = face_recognition.compare_faces(known_face_encodings, face_encoding)
            name = "Unknown"

            # # If a match was found in known_face_encodings, just use the first one.
            # if True in matches:
            #     first_match_index = matches.index(True)
            #     name = known_face_names[first_match_index]

            # Or instead, use the known face with the smallest distance to the new face
            face_distances = face_recognition.face_distance(known_face_encodings, face_encoding)
            best_match_index = np.argmin(face_distances)
            if matches[best_match_index]:
                name = known_face_names[best_match_index]

            face_names.append(name)

    process_this_frame = not process_this_frame


    # Display the results
    for (top, right, bottom, left), name in zip(face_locations, face_names):
        # Scale back up face locations since the frame we detected in was scaled to 1/4 size
        top *= 4
        right *= 4
        bottom *= 4
        left *= 4

        # Draw a box around the face
        cv2.rectangle(frame, (left, top), (right, bottom), (0, 0, 255), 2)

        # Draw a label with a name below the face
        cv2.rectangle(frame, (left, bottom - 35), (right, bottom), (0, 0, 255), cv2.FILLED)
        font = cv2.FONT_HERSHEY_DUPLEX
        cv2.putText(frame, name, (left + 6, bottom - 6), font, 1.0, (255, 255, 255), 1)

    # Display the resulting image
    cv2.imshow('Video', frame)

    # Hit 'q' on the keyboard to quit!
    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

# Release handle to the webcam
video_capture.release()
cv2.destroyAllWindows()
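
To turn the recognised names into an attendance record, the `face_names` produced for each processed frame could feed a small register that notes when each person is first confirmed. This is a sketch of one possible design, not part of the original notebook: the `AttendanceRegister` class, the `min_sightings` parameter, and the CSV layout are assumptions.

```python
import csv
import io
from datetime import datetime


class AttendanceRegister:
    """Marks a person present once they were seen in enough frames."""

    def __init__(self, min_sightings=3):
        self.min_sightings = min_sightings
        self._sightings = {}   # name -> number of frames seen so far
        self.present = {}      # name -> time the person was marked present

    def update(self, face_names, now=None):
        """Feed the names recognised in one frame; 'Unknown' is ignored."""
        now = now or datetime.now()
        for name in set(face_names):
            if name == "Unknown" or name in self.present:
                continue
            self._sightings[name] = self._sightings.get(name, 0) + 1
            if self._sightings[name] >= self.min_sightings:
                self.present[name] = now

    def write_csv(self, stream):
        """Write one row per person marked present, sorted by name."""
        writer = csv.writer(stream)
        writer.writerow(["name", "marked_present_at"])
        for name, seen_at in sorted(self.present.items()):
            writer.writerow([name, seen_at.isoformat(timespec="seconds")])


register = AttendanceRegister(min_sightings=2)
for frame_names in (["Amit", "Unknown"], ["Amit", "Priti"], ["Amit"]):
    register.update(frame_names)

buffer = io.StringIO()
register.write_csv(buffer)
print(buffer.getvalue())
```

Inside the video loop, `register.update(face_names)` would be called after `face_names` is filled. Requiring several sightings limits the effect of a single false match.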