- Introduction
- Related Works
- Project Description
- Contribute
- References
- Overall Service [Github]
While working on the final project at 'Naver Boost Camp AITech', I had to implement a feature that shows users the key characters in a video.
The feature had to satisfy the following two conditions:
- It has to perform well on a wide variety of videos.
- To be applicable to arbitrary videos, it must require no training process. (Unsupervised method)
To satisfy both conditions, I used the following two approaches:
- Use face landmarks and clothing image features together
- Use HAC (Hierarchical Agglomerative Clustering), an unsupervised clustering method
In other words, the project effectively clusters the characters that appear in a single video.
When clustering performance was quantified using the Face feature alone, the Cloth feature alone, and Face+Cloth combined, using the combined Face+Cloth features improved performance significantly.
NMI was used as the metric; for an explanation of NMI, see here.
Additionally, when post-processing such as cluster merging or `min_csize` filtering is applied, clustering reaches an NMI of 0.9 or higher for the key characters.
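As a quick, made-up illustration (not from this repo), NMI between ground-truth character IDs and predicted cluster IDs can be computed with scikit-learn:

```python
from sklearn.metrics import normalized_mutual_info_score

# Made-up labels: ground-truth character IDs vs. predicted cluster IDs.
true_ids = [0, 0, 0, 1, 1, 2, 2, 2]
pred_ids = [1, 1, 1, 0, 0, 2, 2, 2]

# NMI is permutation-invariant, so a perfect clustering scores 1.0
# even though the cluster IDs themselves differ.
print(normalized_mutual_info_score(true_ids, pred_ids))  # -> 1.0
```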
Clustering Method
Clustering techniques can be divided into supervised and unsupervised methods.
Supervised clustering performs better, but securing training data is a problem; unsupervised clustering is generally applicable because it uses no training data, but it performs relatively poorly and is sensitive to hyperparameters.
This project uses an unsupervised clustering technique so that it applies generally, but combines two features, face and clothes, to improve performance.
Face feature extraction
Face landmarks are used to extract facial feature values that distinguish one face from another. A landmark model uses at least 5 points (eyes, nose, corners of the mouth), and some use more than 50.
The 'dlib' library makes it easy to go all the way from face detection to feature extraction.
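For example, a minimal dlib pipeline from detection to a 128-d face descriptor looks roughly like this (the two pretrained model files must be downloaded from dlib separately, and the frame filename is made up):

```python
import dlib

detector = dlib.get_frontal_face_detector()
# Pretrained models, downloadable from http://dlib.net/files/
shape_predictor = dlib.shape_predictor("shape_predictor_68_face_landmarks.dat")
face_encoder = dlib.face_recognition_model_v1("dlib_face_recognition_resnet_model_v1.dat")

img = dlib.load_rgb_image("frame_0001.jpg")  # hypothetical frame image
for det in detector(img, 1):                 # upsample once to find small faces
    landmarks = shape_predictor(img, det)    # facial landmark points
    descriptor = face_encoder.compute_face_descriptor(img, landmarks)  # 128-d vector
```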
Image feature extraction
Image clustering is an area of steady, ongoing research. The image itself is clustered using the feature map obtained by passing it through a deep learning model.
Although high-performance clustering techniques have emerged recently, they are difficult to apply to person clustering because they often require partial training and use the entire image.
reference: [Papers with code]
As far as I know, no GitHub repository has previously attempted to extract face and clothing features separately, concatenate them, and then cluster. I hope this project is useful to anyone who has a similar idea or needs it.
Typical clustering tasks include face clustering and image clustering: face clustering uses face landmarks, while image clustering uses image feature maps.
This project uses face landmark and image feature map information simultaneously to improve performance.
First, face detection is performed on each frame image, and a 128-d face landmark feature vector is extracted using the 'dlib' library.
At the same time, the clothing location is computed from the face location, and a 128-d clothing feature vector is extracted. Here 'ResNet-18' is used as the feature extractor: the feature map of the conv3_x block is taken, and GAP (Global Average Pooling) is applied to obtain a 128-d feature vector.
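For illustration only, here is a rough sketch of this step with torchvision, where the conv3_x block corresponds to `layer2` of `resnet18`. The cloth-box geometry and the pretrained weights are my assumptions; the project's actual code lives in `face_extractor.py`:

```python
import torch
from torchvision.models import ResNet18_Weights, resnet18

def cloth_box_from_face(face_box, img_w, img_h):
    """Hypothetical geometry: take the region just below the face box.
    The offsets are illustrative assumptions, not the project's values."""
    left, top, right, bottom = face_box
    face_h = bottom - top
    return (max(0, left), min(img_h, bottom),
            min(img_w, right), min(img_h, bottom + 2 * face_h))

# Truncate ResNet-18 after the conv3_x block ("layer2" in torchvision),
# whose feature map has 128 channels.
backbone = resnet18(weights=ResNet18_Weights.IMAGENET1K_V1)
extractor = torch.nn.Sequential(
    backbone.conv1, backbone.bn1, backbone.relu, backbone.maxpool,
    backbone.layer1, backbone.layer2,
).eval()

@torch.no_grad()
def cloth_feature(cloth_img: torch.Tensor) -> torch.Tensor:
    """cloth_img: (1, 3, H, W) normalized crop -> 128-d feature vector."""
    fmap = extractor(cloth_img)               # (1, 128, H/8, W/8)
    return fmap.mean(dim=(2, 3)).squeeze(0)   # GAP over spatial dims -> (128,)
```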
Next, the two extracted 128-d feature vectors are each normalized to unit length, then concatenated and used as the feature value for one character.
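A minimal sketch of this fusion step (the per-feature weights mirror the `face_cloth_weights` parameter shown later in the demo):

```python
import torch
import torch.nn.functional as F

def fuse(face_vec: torch.Tensor, cloth_vec: torch.Tensor, weights=(1.0, 1.0)):
    """L2-normalize each 128-d vector, weight it, and concatenate -> 256-d."""
    face_part = weights[0] * F.normalize(face_vec, dim=0)
    cloth_part = weights[1] * F.normalize(cloth_vec, dim=0)
    return torch.cat([face_part, cloth_part])
```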
An unsupervised clustering method is used here so that the approach applies generally to various videos.
Among techniques such as K-Means, HAC (Hierarchical Agglomerative Clustering), and DBSCAN, HAC was selected because the number of clusters does not need to be determined in advance and it showed the best performance.
In addition, the clustering results were further improved by post-processing: merging similar clusters and eliminating clusters with fewer than 10 members.
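Conceptually, the clustering plus minimum-size filtering can be sketched with SciPy as follows; this is a simplified stand-in for what `calc.cluster` does, with assumed parameter semantics:

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage
from scipy.spatial.distance import pdist

def hac_cluster(features: np.ndarray, sim: float = 0.63, min_csize: int = 10):
    """features: (N, 256) fused face+cloth vectors -> list of index arrays."""
    # Average-linkage HAC on cosine distances; no cluster count is required.
    Z = linkage(pdist(features, metric="cosine"), method="average")
    # Cut the dendrogram where cosine similarity drops below `sim`.
    labels = fcluster(Z, t=1.0 - sim, criterion="distance")
    clusters = [np.where(labels == k)[0] for k in np.unique(labels)]
    # Post-processing: drop clusters smaller than min_csize.
    return [c for c in clusters if len(c) >= min_csize]
```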
```
├── clustering
│   ├── calc.py
│   ├── exceptions.py
│   ├── icio.py
│   └── postproc.py
│
├── outdated_files
│   ├── face_classifier.py
│   └── person_clustering.ipynb
│
├── demo.ipynb
└── face_extractor.py
```
- `clustering` : Files required for clustering
  - `calc.py` : File with some util and feature-computing functions
  - `exceptions.py` : File with core operation functions (such as clustering, feature extraction, etc.)
  - `icio.py` : File with IO (Input/Output) related functions
  - `postproc.py` : File with post-processing functions
- `outdated_files` : Files used in previous versions
- `demo.ipynb` : Demo file
- `face_extractor.py` : Main code file (frame extraction, face detection, cloth detection, feature extraction, etc.)
```
pip install -r requirements.txt
```
- Run `demo.ipynb`
Create Face Extractor Instance
```python
# Create FaceExtractor instance
video_num = 0
video_path = video_paths[video_num]
result_dir = video_path[:-4] + '_result'
face_extractor = FaceExtractor(
    video_path=video_path,
    result_dir=result_dir,
)
```
Run Face Extractor
parameters
- `face_cnt`: Number of faces to be extracted from the video
- `frame_batch_size`: Number of frames to process at a time
- `face_cloth_weights`: Weight ratio of face feature to cloth feature
- `use_scene_transition`: If True, use scene-transition detection when extracting frames
- `capture_interval_sec`: If `use_scene_transition=False`, extract a frame from the video every `capture_interval_sec` seconds (must be > 0)
- `stop_sec`: If > 0, stop processing the video after `stop_sec` seconds
- `skip_sec`: If > 0, skip the first `skip_sec` seconds of the video
- `resizing_resolution`: Resize videos whose width is greater than `resizing_resolution`
- `resizing_ratio`: Resize ratio (a value in [0, 1])
```python
# Run face extractor
fingerprints = face_extractor.run(
    face_cnt=350,
    frame_batch_size=16,
    face_cloth_weights=[1.0, 1.0],
    use_scene_transition=False,
    capture_interval_sec=3,
    stop_sec=0,
    skip_sec=0,
    resizing_resolution=1000,
    resizing_ratio=None,
)
```
Clustering & Print result
parameters
- `sim`: Similarity threshold (the higher, the stricter)
- `min_csize`: Eliminate clusters with fewer than `min_csize` members
```python
# Cluster fingerprints and print the result
clusters = calc.cluster(fingerprints, sim=0.63, min_csize=6)  # higher sim = stricter
postproc.make_links(clusters, os.path.join(result_dir, 'imagecluster/clusters'))
images = icio.read_images(result_dir, size=(224, 224))
fig, ax = postproc.plot_clusters(clusters, images)
fig.savefig(os.path.join(result_dir, 'imagecluster/_cluster.png'))
postproc.plt.show()
```
Cluster merging & Print result
```python
# Merge similar clusters and print the result
FACE_THRESHOLD_HARD = 0.18
CLOTH_THRESHOLD_HARD = 0.12
merged_clusters = postproc.merge_clusters(clusters, fingerprints, FACE_THRESHOLD_HARD, CLOTH_THRESHOLD_HARD, iteration=1)
postproc.make_links(merged_clusters, os.path.join(result_dir, 'imagecluster/merged_clusters'))
images = icio.read_images(result_dir, size=(224, 224))
fig, ax = postproc.plot_clusters(merged_clusters, images)
fig.savefig(os.path.join(result_dir, 'imagecluster/_merged_cluster.png'))
postproc.plt.show()
```
- How to contribute?
Make issue & Create new branch
- Create an issue describing what you want to contribute (specify `assignees` and `labels`)
- The issue title has to start with one of the tags below
- `[DOCS]` : Working with documents
- `[DEV]` : Development (same functionality as before, but performance may differ)
- `[FEAT]` : Add or request a new feature
- `[REFACT]` : Update code in a way that does not affect results/performance
- `[EXT]` : Etc.
- If you want to commit code you wrote yourself, create a branch named `[tag]/[issue_num]` as shown below and push to it
  - e.g. `dev/13`, `feat/20`, `ext/27`
Commit convention
- Start with one of the tags below, the same as when creating an issue
  - Use the same tag if there is a corresponding issue
- Make the title clear and understandable
  - e.g. `[FEAT] Add scene transition code`
- When you push, push to the branch you worked on
  - e.g. `git push origin dev/13`
Pull request
- Write the PR description according to the template when creating a pull request