spotify

PDF Download

A METHOD FOR AUGMENTING A PLURALITY OF FACE IMAGES WO2021060971A1

The patent WO2021060971A1 describes a method for augmenting face images, particularly for use in video surveillance systems. The invention addresses the limitations of traditional surveillance, where cameras capture face images from limited angles, leading to incomplete or substandard images. The method involves acquiring face images from both cameras and the internet, applying data augmentation techniques to increase the number of images, and using a Generative Adversarial Network (GAN) to create additional face images. This process helps generate better-quality face images that can improve facial recognition systems.

The system includes several components, such as an image acquisition module connected to a camera, data input and augmentation modules, and a GAN module. The data augmentation modules apply transformations like rotations and flips to increase the variety of face images, while the GAN module generates new images based on trained models. A fuzzy logic module is also employed to evaluate the quality of the generated images, ensuring that only the best images are selected and stored for training a deep learning module. This deep learning module further refines the facial recognition process by extracting higher-level features from the images.

The method also emphasizes the importance of image quality in improving recognition accuracy. By using a combination of data augmentation and GAN-generated images, the system can produce a diverse set of high-quality face images, which are essential for training deep learning models in surveillance and security applications. The invention claims a more effective way to enhance face images from different viewpoints and conditions, ultimately aiding in better identification and recognition in security systems.

Main Topic: A Method for Augmenting a Plurality of Face Images

  1. Image Acquisition
  2. Data Augmentation
  3. Generative Adversarial Network (GAN)
  4. Fuzzy Logic for Image Selection
  5. Storage and Learning
  6. Deep Learning Training