Speaker Recognition
12,456 Followers
Recent papers in Speaker Recognition
We present a method for speaker recognition that uses the duration patterns of speech units to aid speaker classification. The approach represents each word and/or phone by a feature vector comprised of either the durations of the... more
This paper presents the deep neural networks to classification of children with voice impairments from speech signals. In the analysis of speech signals, 6,373 static acoustic features are extracted from many kinds of... more
Most elderly people monitoring systems include the detection of abnormal situations, in particular distress situations, as one of their main goals. In order to reach this objective, many solutions end up combining several modalities such... more
The paper describes a multisensorial personidentification system: visual and acoustic cues are usedjointly for person identification. A simple approach,based on the fusion of the lists of scores produced independentlyby a speaker... more
Recently satisfactory results have been obtained in NIST speaker recognition evaluations. These results are mainly due to accurate modeling of a very large development dataset provided by LDC. However, for many realistic scenarios the use... more
— An automatic verification of person's identity from its voice is a part of modern telecommunication services. In order to execute a verification task, a speech signal has to be transmitted to a remote server. So, a performance of the... more
This paper describes a new identity authentication technique by a synergetic use of lip-motion and speech. The lip-motion is defined as the distribution of apparent velocities in the movement of brightness patterns in an image and is... more
One characteristic that distinguishes speaker recognition (identification, verification, classification, tracking, etc.) from other biometrics is that it is designed to operate with devices and over channels that were created for other... more
In the meeting case scenario, audio is often recorded using Multiple Distance Microphones (MDM) in a non-intrusive manner. Typically a beamforming is performed in order to obtain a single enhanced signal out of the multiple channels. This... more
Il progetto verte sullo sviluppo di un sistema di riconoscimento del parlatore per l'esecuzione di comandi vocali. Il sistema è stato implementato in Python e si occupa del riconoscimento sia del linguaggio parlato che del parlatore. Per... more
Speaker recognition is the computing task of validating a user's claimed identity using characteristics extracted from their voices. Voice -recognition is combination of the two where it uses learned aspects of a speaker’s voice to... more
Identification of non-native personnel is a critical piece of information for making crucial on-the-spot decisions for security purposes. Identification of a non-native speaker is often readily apparent in normal conversation with a... more
Device, language and environmental mismatch adversely affect speaker verification (SV) performance. We investigate such effects empirically based on the M3 (multibiometric, multilingual and multi-device) Corpus [1]. Device mismatch (among... more
An audio-assisted system is investigated that detects if a movie scene is a dialogue or not. The system is based on actor indicator functions. That is, functions which define if an actor speaks at a certain time instant. In particular,... more
Availability of databases is a necessity in the speech processing field. The publically available databases in Arabic language are few. In this paper we describe a rich database for Arabic language. The database is rich in many... more
Feeling of knowing (or expressed confidence) reflects a speaker's certainty or commitment to a statement and can be associated with one's trustworthiness or persuasiveness in social interaction. We investigated the perceptual-acoustic... more
Pre-processing of Speech Signal serves various purposes in any speech processing application. It includes Noise Removal, Endpoint Detection, Pre-emphasis, Framing, Windowing, Echo Canceling etc. Out of these, silence/unvoiced portion... more
Reliable identity management must be built with an accurate user identity recognition method. This recognition usually is the core of the authentication method which is the essential part of any identity management system. The... more
It's me!" This pronouncement is usually made over the telephone or at an entryway out of sight of the intended hearer. It embodies the expectation that the sound of one's voice is sufficient for the hearer to recognize the speaker. In... more
Recent studies show that Gaussian mixture model (GMM) weights carry less, yet complementary, information to GMM means for language and dialect recognition. However, state-of-the-art language recognition systems usually do not use this... more
In this study, we present a binaural scene analyzer that is able to simultaneously localize, detect and identify a known number of target speakers in the presence of spatially positioned noise sources and reverberation. In contrast to... more
Speaker recognition is the process of recognizing a speaker’s identity by his or her voice. Humans sound differently and there are features in our speaking voice which differentiate us from other people. In this paper, we show an... more
This paper presents the feature analysis and design of compensators for speaker recognition under stressed speech conditions. Any condition that causes a speaker to vary his or her speech production from normal or neutral condition is... more
Standard speaker recognition system employs a pre-processed form of an acoustic signal, which provides information about the distribution of signal energy across time and frequency. However, different signal representations may be... more
Speaker recognition is the process of automatically recognizing who is speaking on the basis of individual information included in speech waves. This technique makes it possible to use the speaker's voice to verify their identity and... more
SPEAKER RECOGNATION SYSTEM (SRS)
Dedicated to my parents who sacrificed their today for our better tomorrow and my mentors who have guided me throughout my research and helped me to improve professionally and personally. ACKNOWLEDGMENTS I would like to gratefully... more
A simple yet complex approach to modern sophistication. In this project we used the MFCC approach to build a unique and accurate coefficients extracting processor to extract feature from the voice stored in the database, then on the next... more
The paper discusses Derrida's concept of hospitality which perfectly describes the experience of loosing the sense of feeling at home and reveals the disintegrating entrance of the Otherness into a coherent home space. Jacques Derrida's... more
ABSTRACT: Defining the vowel system in comparison with the previous similar works for standard Turkish by using a wider database and determining the rate of speaker specific invariance are two aims of this study. In the previous similar... more
Biometric system performance can be improved by means of data fusion. Several kinds of information can be fused in order to obtain a more accurate classification (identification or verification) of an input sample. In this paper we... more
A wide variety of systems require reliable personal recognition schemes to either confirm or determine the identity of an individual requesting their services. The purpose of such schemes is to ensure that the rendered services are... more
This paper aims at inscribing Forensic Linguistics within the variegate field of Forensic sciences. After a deep and meticulous description of the state of art of Forensic Linguistics from 1960s until now, we propose all of the... more
Superbisor sa Programang Edukasyon sa Filipino at Mother Tongue Based-Multi Lingual Education ng Kagawaran ng Edukasyon, Lungsod ng Maynila. DepEd Concept Paper Writer, Translator, Editor and National Lead Trainer. International Book... more
Tracking speakers in multiparty conversations constitutes a fundamental task for automatic meeting analysis. In this paper, we present a novel probabilistic approach to jointly track the location and speaking activity of multiple speakers... more