Voice Face Association Learning Papers

Other Keywords：

voice-face cross-modal biometric matching
voice-face representation learning
voice-face cross-modal mapping

Please raise an issue if there is anything mistake or missing. :-)

Abbr.	Title	Year	Conf	Code
SVHF	Seeing voices and hearing faces: Cross-modal biometric matching	2018	CVPR	code ,model
FVCME	Face-voice matching using cross-modal embeddings	2018	MM	❎
Pins	Learnable pins: Crossmodal embeddings for person identity	2018	ECCV	official, pytorch
LAFV	On learning associations of faces and voices	2018	ACCV	❎
SSNet	Deep Latent Space Learning for Cross-modal Mapping of Audio and Visual Signals	2019	DICTA	❎
DIMNet	Disjoint mapping network for cross-modal matching of voices and faces	2019	ICLR	❎
EmNet	A Novel Distance Learning for Elastic Cross-Modal Audio-Visual Matching	2019	ICME-Workshop	❎
VFMR	Voice-Face Cross-modal Matching and Retrieval- A Benchmark	2019	-	❎
	Learning Discriminative Joint Embeddings for Efficient Face and Voice Association	2020	SIGIR	❎
	Hearing like Seeing: Improving Voice-Face Interactions and Associations via Adversarial Deep Semantic Matching Network	2020	MM	❎
VFNet	Audio-visual Speaker Recognition with a Cross-modal Discriminative Network	2020	Interspeech	❎
AML	Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching	2021	TMM	official, copy
	Seeking the Shape of Sound- An Adaptive Framework for Learning Voice-Face Association	2021	CVPR	code
	Cross-modal Speaker Veriﬁcation and Recognition: A Multilingual Perspective	2021	CVPR-Workshop	❎
	Disentangled Representation Learning for Cross-Modal Biometric Matching	2021	TMM	❎
FOP	Fusion and Orthogonal Projection for Improved Face-Voice Association	2022	ICASSP	code
Self-Lifting	Self-Lifting: A Novel Framework for Unsupervised Voice-Face Association Learning	2022	ICMR	code
CMPC	Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast	2022	IJCAI	code
	Detach and Enhance: Learning Disentangled Cross-modal Latent Representation for Efficient Face-Voice Association and Matching	2022	ICDM	❎
	Looking and Hearing into Details: Dual-enhanced Siamese Adversarial Network for Audio-Visual Matching	2022	TMM	❎
SBNet	Single-branch Network for Multimodal Training	2023	ICASSP	code

Benchmarks

Voice-Face Association Learning Evaluation

https://github.com/my-yy/vfal-eva

Reproduce bunches of works based on unified standards 😃
High-speed training and testing ⚡
Easy to extend 💭

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voice Face Association Learning Papers

Benchmarks

Voice-Face Association Learning Evaluation

About

Releases

Packages

my-yy/vfal_papers

Folders and files

Latest commit

History

Repository files navigation

Voice Face Association Learning Papers

Benchmarks

Voice-Face Association Learning Evaluation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages