Name		Name	Last commit message	Last commit date
parent directory ..
config		config
ddppo		ddppo
models		models
ppo		ppo
pretraining		pretraining
README.md		README.md
__init__.py		__init__.py
run.py		run.py
slurm.sh		slurm.sh

README.md

Semantic Audio-Visual Navigation (SAVi) Model

Details

This folder provides the code of the model as well as the training/evaluation configurations used in the Semantic Audio-Visual Navigation paper. Use of this model is the similar as described in the usage section of the main README file. Simply replace av_nav with savi in the command.

Note that the numbers in the paper were initially reported on Habitat-Lab v0.1.5. Later versions of Habitat-Lab seed the random seeds a bit differently. The difference of performance should be within 1%. Pretrained weights are provided.

Usage

Pretrain the label predictor (or use the pretrained model weights from this repo):

python ss_baselines/savi/pretraining/audiogoal_trainer.py --run-type train --model-dir data/models/savi --predict-label

Train the SAVi model with the trained label predictor (location predictor is better trained online) with DDPPO. Submit slurm.sh to your slurm cluster for training. If clusters are not available, use the following training command to train with PPO.
SAVi is first trained with the external memory size of 1, which only uses the last observation. It is then fine-tuned with the whole external memory with encoders freezed. Please update the pretrained_weights path in savi.yaml with the best pretrained checkpoint when finetuning.

python ss_baselines/savi/run.py --exp-config ss_baselines/savi/config/semantic_audionav/savi_pretraining.yaml --model-dir data/models/savi
python ss_baselines/savi/run.py --exp-config ss_baselines/savi/config/semantic_audionav/savi.yaml --model-dir data/models/savi

Evaluating pretrained model

python ss_baselines/savi/run.py --run-type eval --exp-config ss_baselines/savi/config/semantic_audionav/savi.yaml EVAL_CKPT_PATH_DIR data/pretrained_weights/semantic_audionav/savi/best_val.pth EVAL.SPLIT test USE_SYNC_VECENV True RL.DDPPO.pretrained False

Citation

If you use this model in your research, please cite the following paper:

@inproceedings{chen21semantic,
  title     =     {Semantic Audio-Visual Navigation,
  author    =     {Changan Chen and Ziad Al-Halah and Kristen Grauman},
  booktitle =     {CVPR},
  year      =     {2021}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

savi

savi

README.md

Semantic Audio-Visual Navigation (SAVi) Model

Details

Usage

Citation

Files

savi

Directory actions

More options

Directory actions

More options

Latest commit

History

savi

Folders and files

parent directory

README.md

Semantic Audio-Visual Navigation (SAVi) Model

Details

Usage

Citation