Debiasing Global Workspace: A Cognitive Neural Framework for Learning Debiased and Interpretable Representations
This is the official implementation of Debiasing Global Workspace (DGW). This work has been accepted to:
- (Short paper) NeurIPS 2024 Workshop on Behavioral Machine Learning
- (Full paper) NeurIPS 2024 Workshop on UniReps: 2nd Edition of the Workshop on Unifying Representations in Neural Models
conda create --name py38DGW python=3.8
conda activate py38DGW
pip install -r requirements.txt
Please check the repo of Learning Debiased Represntations via Disentangled Feature Augmentation (LFA). You can download all dataests via the link.
- For
[vanilla, lfa, dgw]
, check script files in the folderscripts
to execute models. You can execute[vanilla, lfa, dgw]
. - For
[ReBias, LfF]
, we re-implemented them based on their repos. Please checkdev
branch in our repo to see our implementations of the two baselines.
You can download our pretrained models of DGW via the following links to check their test accuracies in our paper.
Our source codes are based on:
- Concept-Centric Transformers [WACV2023]: We implemented our method referring to this repo and used this as one of our baselines.
- Learning Debiased Represntations via Disentangled Feature Augmentation (LFA) [NeurIPS2021]: We implemented our method based on this repo and compare our model with this as one of our baselines.
- Learning De-biased Representations with Biased Representations (ReBias) [ICML2020]: This is one of our baselines.
- Learning from Failure: De-biasing Classifier from Biased Classifier (LfF) [NeurIPS2020]: This is one of our baselines.