This is the implementation of our paper: From Still to Moving: Enhancing Real-Time Metal Corrosion Detection with Dual Feature Aggregation.
Links will be added in subsequent updates.
For our TJNC dataset, see TJNC Dataset.
Pretrained Models for YOLOV: YOLOV
Pretrained Models for YOLOX: YOLOX
git clone https://github.com/returnlsz/ViSCAN.git
cd ViSCAN
conda create -n yolov python=3.7
conda activate yolov
pip install -r requirements.txt
pip3 install -v -e .
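As a quick post-install sanity check, you can verify that PyTorch and the locally installed package import correctly (a minimal sketch; adjust to your environment):

```python
# Minimal sanity check after the editable install above.
import torch
import yolox  # the package installed by `pip3 install -v -e .`

print("torch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())
```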
We use the ILSVRC2015 dataset format, as follows:
|-ILSVRC2015
|----Annotations
|----|----VID
|----|----|----train
|----|----|----|----ILSVRC2015_VID_train_0000
|----|----|----|----|----ILSVRC2015_train_00000000
|----|----|----|----|----|----00000.xml
|----|----|----|----|----|----00001.xml
|----|----|----|----|----|----...
|----|----|----|----|----ILSVRC2015_train_00000001
|----|----|----|----|----...
|----|----|----|----ILSVRC2015_VID_train_0000_augmented
|----|----|----val
|----|----|----|----ILSVRC2015_val_00003003
|----|----|----|----...
|----|----|----val_augmented
|----Data
|----|----VID
|----|----|----train
|----|----|----|----ILSVRC2015_VID_train_0000
|----|----|----|----|----ILSVRC2015_train_00000000
|----|----|----|----|----|----00000.jpg
|----|----|----|----|----|----00001.jpg
|----|----|----|----|----|----...
|----|----|----|----|----ILSVRC2015_train_00000001
|----|----|----|----|----...
|----|----|----|----ILSVRC2015_VID_train_0000_augmented
|----|----|----val
|----|----|----|----ILSVRC2015_val_00003003
|----|----|----|----...
|----|----|----val_augmented
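If you want to confirm the layout before training, a small script along the following lines checks that every annotation file under Annotations/VID has a matching frame under Data/VID. This is an illustrative sketch; the ILSVRC2015 root path is an assumption you should adjust:

```python
import os

root = "ILSVRC2015"  # adjust to where your dataset actually lives

# Walk the annotation tree and check that each .xml has a matching .jpg frame.
ann_root = os.path.join(root, "Annotations", "VID")
missing = 0
for dirpath, _, filenames in os.walk(ann_root):
    for name in filenames:
        if not name.endswith(".xml"):
            continue
        rel = os.path.relpath(os.path.join(dirpath, name), ann_root)
        jpg = os.path.join(root, "Data", "VID", rel[:-4] + ".jpg")
        if not os.path.isfile(jpg):
            missing += 1
            print("missing frame for annotation:", rel)
print("annotations without a matching frame:", missing)
```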
For ViSCAN training we use the TJNC video dataset. After organizing our TJNC dataset in the format above, run the following command:
python generate_npy.py
This will generate the train_seq.npy and val_seq.npy files used for ViSCAN training and testing. (Files with the .npy suffix cannot be opened directly; you can use the view_npy.py script to convert a .npy file to a .txt file and inspect its contents.) You need to review the instructions in the generate_npy.py script and change the corresponding paths.
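If you prefer not to rely on view_npy.py, the conversion it performs can be approximated with a few lines of NumPy. This is an illustrative sketch only; the exact contents of the .npy files are defined by generate_npy.py:

```python
import numpy as np

# Load the sequence file (allow_pickle is needed if it stores Python objects
# such as lists of frame paths rather than a plain numeric array).
seq = np.load("train_seq.npy", allow_pickle=True)

# Dump one entry per line so the file can be inspected in a text editor.
with open("train_seq.txt", "w") as f:
    for item in seq:
        f.write(str(item) + "\n")
```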
We sample the TJNC video dataset to obtain a corresponding image dataset, which is used for YOLOX training. After preparing the train_seq.npy and val_seq.npy files, run the following command:
python xml2coco.py
This will generate the vid_train_coco.json and vid_val_coco.json files used for YOLOX training and testing (placed under the annotations folder by default). You need to modify the values of data_dir and jsonname at the bottom of the xml2coco.py script.
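For reference, the edit at the bottom of xml2coco.py usually amounts to pointing these two variables at your own layout (the values shown are examples, not defaults; the surrounding code is defined by the script itself):

```python
# At the bottom of xml2coco.py (illustrative values; adjust to your layout):
data_dir = "ILSVRC2015"            # root of the dataset laid out as shown above
jsonname = "vid_train_coco.json"   # output COCO file, written under annotations/
```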
Our ViSCAN training is divided into two stages:
- Use the pre-trained yoloxl_vid weights to train on the image dataset with the following command:
python tools/train.py -f exps/yolov/yoloxl_vid.py -b 16 --fp16 -o -c weights/yoloxl_vid.pth
Here, weights/yoloxl_vid.pth is the YOLOX pre-trained weights that we provide, and exps/yolov/yoloxl_vid.py is the training configuration file in which you can modify the training parameters (we recommend the defaults); a generic illustration of such a configuration file appears after this list. You can view other optional training configurations in train.py.
- Use the YOLOX_L model trained in step 1 to train on our TJNC video dataset (training on the TJNC dataset directly with YOLOV's pre-trained weights does not achieve the expected results). Use the following command:
python tools/vid_train.py -f exps/yolov/yolov_l.py -c weights/yolov_l.pth --fp16 -b 32
Here, weights/yolov_l.pth is the trained YOLOX model obtained in step 1, on which ViSCAN training is based, and exps/yolov/yolov_l.py is the training configuration file.
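The configuration files follow the standard YOLOX Exp pattern, so adjusting hyperparameters usually means overriding fields of an Exp class. The sketch below is illustrative only; the actual fields in exps/yolov/yoloxl_vid.py and exps/yolov/yolov_l.py may differ, and the class-count value is an assumption:

```python
# Illustrative YOLOX-style experiment file (not the repository's actual config).
import os
from yolox.exp import Exp as MyExp


class Exp(MyExp):
    def __init__(self):
        super(Exp, self).__init__()
        # YOLOX-L scaling factors.
        self.depth = 1.0
        self.width = 1.0
        self.exp_name = os.path.split(os.path.realpath(__file__))[1].split(".")[0]

        # Typical knobs to adjust for your own runs (values are examples).
        self.num_classes = 1          # e.g. a single corrosion class (assumption)
        self.max_epoch = 30
        self.basic_lr_per_img = 0.01 / 64.0
        self.input_size = (576, 576)
        self.test_size = (576, 576)
```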
Use the following command for evaluation:
python tools/vid_eval.py -f exps/yolov/yolov_l.py --mode 'lc' -d 0 --lframe 32 --gframe 0 -c path_to_your_weights.pth --tnum 500 --fp16
Here, mode is the frame sampling strategy (described in our paper); lc is proximity sampling, and lframe and gframe are the numbers of frames sampled — an illustrative sketch of this sampling follows below. See vid_eval.py for more optional parameters.
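For intuition, proximity ('lc') sampling can be thought of as taking a window of lframe consecutive frames from the video, optionally combined with gframe frames drawn from across the whole video. The sketch below only illustrates that idea under these assumptions; it is not the implementation in vid_eval.py:

```python
import random

def sample_frames(frame_ids, lframe=32, gframe=0, start=0):
    """Illustration of proximity sampling: a window of consecutive local
    frames plus, optionally, frames drawn at random from the rest of the video."""
    local = frame_ids[start:start + lframe]                 # consecutive window
    remaining = [f for f in frame_ids if f not in local]
    global_ = random.sample(remaining, min(gframe, len(remaining)))
    return local, global_

# Example: 32 consecutive frames, no global frames (matches the command above).
local, global_ = sample_frames(list(range(1000)), lframe=32, gframe=0)
print(len(local), len(global_))  # 32 0
```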
Use the following command for inference:
python tools/vid_demo.py -f exps/yolov/yolov_l.py -c path_to_your_weights --path path_to_your_video.mp4 --conf 0.25 --nms 0.5 --tsize 576 --save_result true --device gpu
Here, path_to_your_video.mp4 is your video file; see vid_demo.py for more inference options.