Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About experiment setting #13

Open
jhyukjang opened this issue May 23, 2023 · 5 comments
Open

About experiment setting #13

jhyukjang opened this issue May 23, 2023 · 5 comments

Comments

@jhyukjang
Copy link

jhyukjang commented May 23, 2023

Thx for your great work!
I have some questions about your code.

  1. Did you set the random seed to 0 for all experiments?
  2. I reproduced as you did (NUM_GPUS=2, BS_FITS_YOUR_GPU=2) like below

ARROW_ROOT=./datasets/mmimdb
NUM_GPUS=2
NUM_NODES=1
BS_FITS_YOUR_GPU=2
PRETRAINED_MODEL_PATH=./pretrained_weight/vilt_200k_mlm_itm.ckpt
EXP_NAME=mmimdb

python run.py with data_root=${ARROW_ROOT}
num_gpus=${NUM_GPUS}
num_nodes=${NUM_NODES}
per_gpu_batchsize=${BS_FITS_YOUR_GPU}
task_finetune_mmimdb
load_path=${PRETRAINED_MODEL_PATH}
exp_name=${EXP_NAME}

and I got 40.65 (paper: 42.66) on test set with same setting. Can I reproduce the paper's results without changing parameters like a learning rate or are there some optimized hyperparameters for each dataset?

@jhyukjang jhyukjang changed the title Random seed About experiment setting May 23, 2023
@Trae1ounG
Copy link

Trae1ounG commented Oct 22, 2023

Thx for your great work! I have some questions about your code.

  1. Did you set the random seed to 0 for all experiments?
  2. I reproduced as you did (NUM_GPUS=2, BS_FITS_YOUR_GPU=2) like below

ARROW_ROOT=./datasets/mmimdb NUM_GPUS=2 NUM_NODES=1 BS_FITS_YOUR_GPU=2 PRETRAINED_MODEL_PATH=./pretrained_weight/vilt_200k_mlm_itm.ckpt EXP_NAME=mmimdb

python run.py with data_root=${ARROW_ROOT} num_gpus=${NUM_GPUS} num_nodes=${NUM_NODES} per_gpu_batchsize=${BS_FITS_YOUR_GPU} task_finetune_mmimdb load_path=${PRETRAINED_MODEL_PATH} exp_name=${EXP_NAME}

and I got 40.65 (paper: 42.66) on test set with same setting. Can I reproduce the paper's results without changing parameters like a learning rate or are there some optimized hyperparameters for each dataset?

Hi.How many epochs do you trained?

@LinnHY
Copy link

LinnHY commented Dec 1, 2023

same problem.Hope the author can answer and help us

@herkerser
Copy link

same problem, it would be helpful if the authors could provide more details on getting the results on paper

@singhayush27
Copy link

lots of version issue how did you handle guys?

@Xfavor
Copy link

Xfavor commented Apr 27, 2024

Is the code complete, and it can't be run directly using the training command given by the author?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants