Tên đề tài: Xác định số CMND trên giấy chứng minh nhân dân

Nhóm đã xác định được vùng chứa số CMND trên ảnh CMND với độ chính xác 100% (Điều kiện thử nhiệm là CMND được để trên nền trắng và ánh sáng có thể nhìn rõ được CMND), và nhận diện đúng số CMND chính xác 90% trên 30 ảnh CMND của 30 nhân viên trong công ty

Thực hiện bởi Nguyễn Duy Diệu và Nguyễn Văn Phong

Chi tiết Google Colab.

NhanDienCMND.ipynb

Kết quả đạt được

Đầu vào (Hình ảnh cmnd):

Dữ liệu tiền xử lý (Hình ảnh khu vực chứa số CMND)

Đầu ra (Kết quả nhận dạng từ hình sang chuổi text):

174572276

Cách thức hoạt động.

Bước 1: Nhận diện khu vực chưa CMND bằng cách phát hiện 4 góc
Bước 2: Bóc tách CMND ra khỏi bức ảnh
Bước 3: Nhận diện khu vực chứa số CMND trên bức hình đã bóc tách
Bước 4: Chuyển bức hình thành dữ liệu Text.

Cài đặt

Chương trình họat động trơn tru trên Python3.6, pytesseract-0.3.7, tesseract-ocr_4.00. các phiên bản khác nhóm chưa thử. Được thực hiện trên google colab

Cài đặt tesseract-ocr

!apt-get install tesseract-ocr

Cài đặt pytesseract

!pip install pytesseract

Tải các model và thư viện

!git clone https://github.com/dieund/CS2225.CH1501/

Di chuyển đến thư mục chứa các model và thư viện

%cd CS2225.CH1501

Chạy mã lệnh để phát hiện số cmnd

import pytesseract
from PIL import Image
from google.colab.patches import cv2_imshow
import os
import numpy as np
import tensorflow as tf
import cv2
from utils import load_label_map
from utils.image_utils import non_max_suppression_fast

class Detector(object):
    def __init__(self, path_to_model, path_to_labels, nms_threshold=0.15, score_threshold=0.3):
        self.path_to_model = path_to_model
        self.path_to_labels = path_to_labels
        self.category_index = load_label_map.create_category_index_from_labelmap(path_to_labels, use_display_name=True)
        self.nms_threshold = nms_threshold
        self.score_threshold = score_threshold

        # load model
        self.interpreter = self.load_model()

        # Get input and output tensors.
        self.input_details = self.interpreter.get_input_details()
        self.output_details = self.interpreter.get_output_details()

        self.detection_scores = None
        self.detection_boxes = None
        self.detection_classes = None

    def load_model(self):
        # Load the TFLite model and allocate tensors.
        interpreter = tf.lite.Interpreter(model_path=self.path_to_model)
        interpreter.allocate_tensors()

        return interpreter

    def predict(self, img):
        original = img
        height = self.input_details[0]['shape'][1]
        width = self.input_details[0]['shape'][2]
        img = cv2.resize(img, (width, height), interpolation=cv2.INTER_AREA)
        img = np.expand_dims(img, axis=0)

        # Normalize input data
        input_mean = 127.5
        input_std = 127.5
        input_data = (np.float32(img) - input_mean) / input_std
        self.interpreter.set_tensor(self.input_details[0]['index'], input_data)

        self.interpreter.invoke()

        # Retrieve detection results
        self.detection_boxes = self.interpreter.get_tensor(self.output_details[0]['index'])[
            0]  # Bounding box coordinates of detected objects
        self.detection_classes = self.interpreter.get_tensor(self.output_details[1]['index'])[
            0]  # Class index of detected objects
        self.detection_scores = self.interpreter.get_tensor(self.output_details[2]['index'])[
            0]  # Confidence of detected objects

        mask = np.array(self.detection_scores) > self.score_threshold
        self.detection_boxes = np.array(self.detection_boxes)[mask]
        self.detection_classes = np.array(self.detection_classes)[mask]

        self.detection_classes += 1

        # Convert coordinate to original coordinate
        h, w, _ = original.shape
        self.detection_boxes[:, 0] = self.detection_boxes[:, 0] * h
        self.detection_boxes[:, 1] = self.detection_boxes[:, 1] * w
        self.detection_boxes[:, 2] = self.detection_boxes[:, 2] * h
        self.detection_boxes[:, 3] = self.detection_boxes[:, 3] * w

        # Apply non-max suppression
        self.detection_boxes, self.detection_classes = non_max_suppression_fast(boxes=self.detection_boxes,
                                                                                labels=self.detection_classes,
                                                                                overlapThresh=self.nms_threshold)
        return self.detection_boxes, np.array(self.detection_classes).astype("int"), self.category_index

    def draw(self, image):
        self.detection_boxes, self.detection_classes, self.category_index = self.predict(image)
        height, width, _ = image.shape

        for i in range(len(self.detection_classes)):            
            label = str(self.category_index[self.detection_classes[i]]['name'])
            if label == 'id':
              real_ymin = int(max(1, self.detection_boxes[i][0]))
              real_xmin = int(max(1, self.detection_boxes[i][1]))
              real_ymax = int(min(height, self.detection_boxes[i][2]))
              real_xmax = int(min(width, self.detection_boxes[i][3]))

              real_height = real_ymax - real_ymin
              real_width =  real_xmax - real_xmin

              crop_img = image[real_ymin:real_ymin+real_height, real_xmin:real_xmin+real_width]

              return crop_img

detection_model = Detector(path_to_model='./config_text_detection/model.tflite',
                           path_to_labels='./config_text_detection/label_map.pbtxt',
                           nms_threshold=0.2, score_threshold=0.3)
img = cv2.imread('cmnd1.jpg')
cv2_imshow(img)
IDImg = detection_model.draw(img)
print('============ Hinh anh khu vuc ID =================')
cv2_imshow(IDImg)


gray = cv2.cvtColor(IDImg, cv2.COLOR_BGR2GRAY) #convert to grey to reduce detials
gray = cv2.bilateralFilter(gray, 11, 17, 17) #Blur to reduce noise

filename = "{}.png".format(os.getpid())
cv2.imwrite(filename, gray)

text = pytesseract.image_to_string(Image.open(filename))
# Xóa ảnh tạm sau khi nhận dạng
os.remove(filename)
# In dòng chữ nhận dạng được
print('================ Ket qua =============')
print(text)
if text == "174572276":
  print('Duoc vao')
else:
  print('Khong co trong danh sach')

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
Data/Train		Data/Train
config_text_detection		config_text_detection
config_text_recognition		config_text_recognition
utils		utils
CS2225.CH1501.FinalReport.CH1902003.pdf		CS2225.CH1501.FinalReport.CH1902003.pdf
FaceDetectionFromWebCam.ipynb		FaceDetectionFromWebCam.ipynb
FaceDetectionFromWebCam_Phong.ipynb		FaceDetectionFromWebCam_Phong.ipynb
GroupIntro.ipynb		GroupIntro.ipynb
NhanDienCMND.ipynb		NhanDienCMND.ipynb
README.md		README.md
Tuoi.ipynb		Tuoi.ipynb
cmnd.jpg		cmnd.jpg
cmnd1.jpg		cmnd1.jpg
khuvuccmnd.png		khuvuccmnd.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tên đề tài: Xác định số CMND trên giấy chứng minh nhân dân

Chi tiết Google Colab.

Kết quả đạt được

Cách thức hoạt động.

Cài đặt

About

Releases

Packages

Contributors 3

Languages

dieund/CS2225.CH1501

Folders and files

Latest commit

History

Repository files navigation

Tên đề tài: Xác định số CMND trên giấy chứng minh nhân dân

Chi tiết Google Colab.

Kết quả đạt được

Cách thức hoạt động.

Cài đặt

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages