The objective of this project is to detect and recognise the faces from the scene and give the person name as voice output to the user.
It has three modules:
- Creator
- Trainer
- Recognizer
This module is used to create datasets i.e. we save faces and their names. It first takes the id and name of the person and stores in a file (datatext file). Then it will detect the faces from scene using a pre-executed xml file called haarcascade_frontalface_default (included in OpenCV file). It takes photos in range 50-100 (this range is based on my personal experience but you may take more if you want) and saves all photos in grayscale for better feature extraction.
Datasets are used for training the model. It does by first loading the LBPHFaceRecognizer which is an algorithm called LBPH (Local Binary Patterns Histograms) used for finding patterns in the image and remembering them. After loading, it then separates the name and associated image and gives it to train() for training the model to remember the image for the given name. The trained data is saved in yml format.
For windows users : You may face a problem like
IOError: cannot identify image file '*Data\\Thumbs.db*
Solution you can either delete or ignore it by slicing the imgpaths array.
This module is responsible for recognizing and giving voice output. It first detects the faces from the scene and gives to the predict() function which returns id and confidence with which it thinks the detected face(s) belong to that id(s). The received id(s) are examined to find out to which name(s) it belongs and gives to say() which is a function of pyttsx for voice output to the user. It continues until user quits.
- Python 2.7.14
- OpenCV 2.4.13
- Pyttsx
- Numpy
- Pillow