[TPAMI2024] Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection

CSer-Tang-hao/ConTriNet_RGBT-SOD

(TPAMI 2024) Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection IEEE Page Arxiv Page

1Nanjing University of Science and Technology, Nanjing, China
2Hong Kong University of Science and Technology, Hong Kong, China
3Singapore Management University, Singapore 

Framework

[Figure: ConTriNet framework]

Overview of the proposed Confluent Triple-Flow Network (ConTriNet), which adopts an efficient "Divide-and-Conquer" strategy. ConTriNet comprises three flows: a modality-complementary flow that predicts a modality-complementary saliency map, and two modality-specific flows that predict RGB-specific and thermal-specific saliency maps, respectively.

VT-IMAG Dataset

[Figure: VT-IMAG]

VT-IMAG is constructed to drive the advancement of RGB-T SOD methods and to facilitate their deployment in real-world scenarios. For a fair comparison, all models are trained solely on clear data and simple scenes (i.e., the training set of VT5000) and are then evaluated for zero-shot robustness on the challenging real-world cases in VT-IMAG. Download Dataset (Google Drive)

Benchmark Datasets

Saliency Maps

The prediction results of existing RGB-T SOD methods and our ConTriNet on benchmark datasets are now available for download, enabling researchers to easily compare their methods with existing SOTA methods and directly incorporate these results into their studies.

The prediction results of existing RGB-T SOD methods on VT-IMAG are now available for download, enabling researchers to easily compare their methods with existing SOTA methods and directly incorporate these results into their studies.

How to run

  • After downloading the datasets, run train.py to train the model and test.py to generate the final prediction maps.
python train.py --rgb_label_root [path_of_training_rgb_images] --thermal_label_root [path_of_training_thermal_images] --gt_label_root [path_of_training_gt_images] --gpu_id 
python test.py --test_path [path_of_test_images] --model_path [path_of_model_checkpoint] --gpu_id 
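
Before launching train.py, it can help to confirm that every training sample has an RGB image, a thermal image, and a ground-truth mask. The following is a minimal sketch, not part of the repository: the helper name `find_unpaired_samples` is hypothetical, and it assumes that the three folders use the same file stem for each sample (for example `1.jpg`, `1.jpg`, and `1.png`), which may not match your copy of the data.

```python
from pathlib import Path


def find_unpaired_samples(rgb_dir, thermal_dir, gt_dir):
    """Return {stem: [folders missing that stem]} for incomplete samples.

    Assumption (not from the repository): a sample is identified by its
    file stem, and that stem is identical in all three folders.
    """
    folders = {
        "rgb": Path(rgb_dir),
        "thermal": Path(thermal_dir),
        "gt": Path(gt_dir),
    }
    # Collect the set of file stems found in each folder.
    stems = {
        name: {p.stem for p in folder.iterdir() if p.is_file()}
        for name, folder in folders.items()
    }
    all_stems = set().union(*stems.values())

    missing = {}
    for stem in sorted(all_stems):
        absent = [name for name in folders if stem not in stems[name]]
        if absent:
            missing[stem] = absent
    return missing
```

An empty dictionary means every sample is complete. The three arguments would be the same paths passed to `--rgb_label_root`, `--thermal_label_root`, and `--gt_label_root`.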

The original model weights are unavailable because of the long paper review process. We have reorganized the code and released it together with newly reproduced model weights (Google Drive) for reference. The reproduced results closely match those reported in the paper and in some cases exceed them.

Evaluation

We use the Saliency-Evaluation-Toolbox to evaluate all RGB-T SOD results.
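
For a quick sanity check of a single prediction before running the full toolbox, two metrics widely reported in salient object detection, mean absolute error (MAE) and the F-measure, can be sketched in plain Python. This is an illustrative sketch only and does not reproduce the toolbox: the function names are hypothetical, the fixed binarization threshold of 0.5 is a simplifying assumption, and `beta_sq=0.3` follows the convention common in the SOD literature. Maps are assumed to be 2-D lists of values in [0, 1] with identical shapes.

```python
def mean_absolute_error(pred, gt):
    """Average per-pixel absolute difference between two maps in [0, 1]."""
    total = 0.0
    count = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            total += abs(p - g)
            count += 1
    return total / count


def f_measure(pred, gt, threshold=0.5, beta_sq=0.3):
    """F-measure at one fixed threshold (an assumption of this sketch).

    The prediction is binarized at `threshold`; the ground truth is
    binarized at 0.5. `beta_sq` weights precision more than recall.
    """
    tp = fp = fn = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            p_fg = p >= threshold
            g_fg = g >= 0.5
            if p_fg and g_fg:
                tp += 1
            elif p_fg and not g_fg:
                fp += 1
            elif g_fg and not p_fg:
                fn += 1
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = beta_sq * precision + recall
    if denom == 0:
        return 0.0
    return (1 + beta_sq) * precision * recall / denom
```

Numbers intended for comparison with published results should still come from the toolbox, since it may use different thresholding and additional measures.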

Citation

Please cite our paper if you find the work useful, thanks!

@article{tang2024divide,
  author={Tang, Hao and Li, Zechao and Zhang, Dong and He, Shengfeng and Tang, Jinhui},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence}, 
  title={Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection}, 
  year={2024},
  pages={1-17},
  doi={10.1109/TPAMI.2024.3511621}}
