Skip to content

Commit b7136bb

Browse files
committed
srun
1 parent 5cfd133 commit b7136bb

File tree

1 file changed

+3
-2
lines changed

1 file changed

+3
-2
lines changed

srun_rodrigo_voxceleb_image.sh

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ export NCCL_SOCKET_IFNAME=ens32
1616
export HYDRA_FULL_ERROR=1
1717
export CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
1818
cd /data/home/rs2517/code/generative-models
19-
srun python main.py --base configs/example_training/svd_image.yaml --wandb True lightning.trainer.num_nodes 4 \
19+
srun python main.py --resume logs/2024-05-09T14-27-30_example_training-svd_image/checkpoints --base configs/example_training/svd_image.yaml --wandb True lightning.trainer.num_nodes 4 \
2020
lightning.strategy=deepspeed_stage_1 lightning.trainer.precision=32 model.base_learning_rate=1.e-5 \
2121
data.params.train.datapipeline.filelist=/fsx/rs2517/data/lists/voxceleb2_proper.txt \
2222
data.params.train.datapipeline.video_folder=/fsx/behavioural_computing_data/voxceleb2 \
@@ -25,4 +25,5 @@ srun python main.py --base configs/example_training/svd_image.yaml --wandb True
2525
data.params.train.datapipeline.audio_in_video=True \
2626
data.params.train.datapipeline.load_all_possible_indexes=False \
2727
data.params.train.loader.num_workers=4 \
28-
lightning.trainer.devices=4 lightning.trainer.accumulate_grad_batches=1 data.params.train.datapipeline.virtual_increase=10 \
28+
lightning.trainer.devices=4 lightning.trainer.accumulate_grad_batches=1 data.params.train.datapipeline.virtual_increase=10 \
29+
data.params.train.loader.batch_size=28 model.params.network_config.params.audio_cond_method=to_time_emb_image \

0 commit comments

Comments
 (0)