Skip to content

my-yy/vfal_papers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 

Repository files navigation

Voice Face Association Learning Papers

Other Keywords:

  • voice-face cross-modal biometric matching
  • voice-face representation learning
  • voice-face cross-modal mapping

Please raise an issue if there is anything mistake or missing. :-)

Abbr. Title Year Conf Code
SVHF Seeing voices and hearing faces: Cross-modal biometric matching 2018 CVPR code
,model
FVCME Face-voice matching using cross-modal embeddings 2018 MM
Pins Learnable pins: Crossmodal embeddings for person identity 2018 ECCV official, pytorch
LAFV On learning associations of faces and voices 2018 ACCV
SSNet Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals 2019 DICTA
DIMNet Disjoint mapping network for cross-modal matching of voices and faces 2019 ICLR
EmNet A Novel Distance Learning for Elastic Cross-Modal Audio-Visual Matching 2019 ICME-Workshop
VFMR Voice-Face Cross-modal Matching and Retrieval- A Benchmark 2019 -
Learning Discriminative Joint Embeddings for Efficient Face and Voice Association 2020 SIGIR
Hearing like Seeing: Improving Voice-Face Interactions and Associations via Adversarial Deep Semantic Matching Network 2020 MM
VFNet Audio-visual Speaker Recognition with a Cross-modal Discriminative Network 2020 Interspeech
AML Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching 2021 TMM official, copy
Seeking the Shape of Sound- An Adaptive Framework for Learning Voice-Face Association 2021 CVPR code
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective 2021 CVPR-Workshop
Disentangled Representation Learning for Cross-Modal Biometric Matching 2021 TMM
FOP Fusion and Orthogonal Projection for Improved Face-Voice Association 2022 ICASSP code
Self-Lifting Self-Lifting: A Novel Framework for Unsupervised Voice-Face Association Learning 2022 ICMR code
CMPC Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast 2022 IJCAI code
Detach and Enhance: Learning Disentangled Cross-modal Latent Representation for Efficient Face-Voice Association and Matching 2022 ICDM
Looking and Hearing into Details: Dual-enhanced Siamese Adversarial Network for Audio-Visual Matching 2022 TMM
SBNet Single-branch Network for Multimodal Training 2023 ICASSP code

Benchmarks

Voice-Face Association Learning Evaluation

https://github.com/my-yy/vfal-eva

  • Reproduce bunches of works based on unified standards 😃
  • High-speed training and testing ⚡
  • Easy to extend 💭

Releases

No releases published

Packages