Skip to main content
We present a method for speaker recognition that uses the duration patterns of speech units to aid speaker classification. The approach represents each word and/or phone by a feature vector comprised of either the durations of the... more
    • by 
    •   3  
      Speaker RecognitionBackground modelingMixture of Gaussians
This paper presents the deep neural networks to classification of children with voice impairments from speech signals. In the analysis of speech signals, 6,373 static acoustic features are extracted from many kinds of... more
    • by 
    •   23  
      Information SystemsArtificial IntelligenceAcousticsGraph Theory
Most elderly people monitoring systems include the detection of abnormal situations, in particular distress situations, as one of their main goals. In order to reach this objective, many solutions end up combining several modalities such... more
    • by 
    •   9  
      Computer ScienceArtificial IntelligenceGeriatricsSpeaker Recognition
The paper describes a multisensorial personidentification system: visual and acoustic cues are usedjointly for person identification. A simple approach,based on the fusion of the lists of scores produced independentlyby a speaker... more
    • by 
    •   4  
      Speaker RecognitionFace RecognitionSpeaker IdentificationNetwork Architecture
Recently satisfactory results have been obtained in NIST speaker recognition evaluations. These results are mainly due to accurate modeling of a very large development dataset provided by LDC. However, for many realistic scenarios the use... more
    • by 
    •   4  
      Speaker RecognitionSpeaker VerificationSpeaker IdentificationDomain Adaptation
    • by 
    •   17  
      EngineeringHarmonic AnalysisSpeaker RecognitionSpeech Processing
— An automatic verification of person's identity from its voice is a part of modern telecommunication services. In order to execute a verification task, a speech signal has to be transmitted to a remote server. So, a performance of the... more
    • by 
    •   31  
      Telecommunications EngineeringComputer ScienceHuman Computer InteractionForensics
    • by 
    •   8  
      Speaker RecognitionA Priori KnowledgeGaussian processesSignal and information processing
In this paper we present an adapted UBM-GMM based privacy preserving speaker verification (PPSV) system, where the system is not able to observe the speech data provided by the user and the user does not observe the models trained by the... more
    • by  and +1
    •   11  
      Approximation TheoryPrivacySpeaker RecognitionSupport Vector Machines
This paper describes a new identity authentication technique by a synergetic use of lip-motion and speech. The lip-motion is defined as the distribution of apparent velocities in the movement of brightness patterns in an image and is... more
    • by 
    •   23  
      Cognitive ScienceImage ProcessingSignal ProcessingSpeaker Recognition
One characteristic that distinguishes speaker recognition (identification, verification, classification, tracking, etc.) from other biometrics is that it is designed to operate with devices and over channels that were created for other... more
    • by 
    •   6  
      Distributed ComputingSpeaker RecognitionSpeaker VerificationSpeaker Identification
In the meeting case scenario, audio is often recorded using Multiple Distance Microphones (MDM) in a non-intrusive manner. Typically a beamforming is performed in order to obtain a single enhanced signal out of the multiple channels. This... more
    • by 
    •   12  
      Speaker RecognitionSpeech AcousticsClustering AlgorithmsSpeech
Il progetto verte sullo sviluppo di un sistema di riconoscimento del parlatore per l'esecuzione di comandi vocali. Il sistema è stato implementato in Python e si occupa del riconoscimento sia del linguaggio parlato che del parlatore. Per... more
    • by 
    •   4  
      Speaker RecognitionSecurity StudiesAuthenticationSound Perception and Speech Recognition
This article deals with a technique of voice forgery using the ALISP (Automatic Language Independent Speech Processing) approach. Such a technique allows to transform the voice of an arbitrary person (the impostor), forging the identity... more
    • by  and +1
    •   6  
      Speaker RecognitionAutomatic Speaker RecognitionSpeech ProcessingSpeaker Verification
Speaker recognition is the computing task of validating a user's claimed identity using characteristics extracted from their voices. Voice -recognition is combination of the two where it uses learned aspects of a speaker’s voice to... more
    • by 
    •   2  
      Speaker RecognitionSpeech Recognition
Identification of non-native personnel is a critical piece of information for making crucial on-the-spot decisions for security purposes. Identification of a non-native speaker is often readily apparent in normal conversation with a... more
    • by 
    •   8  
      Natural Language ProcessingSpeaker RecognitionNeural Networkhidden Markov model
Device, language and environmental mismatch adversely affect speaker verification (SV) performance. We investigate such effects empirically based on the M3 (multibiometric, multilingual and multi-device) Corpus [1]. Device mismatch (among... more
    • by 
    •   5  
      Speaker RecognitionEnglishBiometricsSpeaker Verification
    • by 
    •   6  
      Machine LearningSpeaker RecognitionAugmented RealityHuman behavior
An audio-assisted system is investigated that detects if a movie scene is a dialogue or not. The system is based on actor indicator functions. That is, functions which define if an actor speaks at a certain time instant. In particular,... more
    • by 
    •   2  
      Speaker RecognitionAudio Signal Processing
"Implementation of an Automatic Algorithm for syllabic division in Portuguese Language A new algorithm for automatic syllabic splitting in the Portuguese language is proposed, which is based on the envelope of the speech signal of an... more
    • by  and +1
    •   6  
      Speaker RecognitionAudio EngineeringAudio Signal ProcessingSpeech Processing
Availability of databases is a necessity in the speech processing field. The publically available databases in Arabic language are few. In this paper we describe a rich database for Arabic language. The database is rich in many... more
    • by 
    •   13  
      AcousticsNatural Language ProcessingSpeaker RecognitionSpeech Recognition
    • by 
    •   19  
      Machine LearningSpeaker RecognitionFuzzy LogicFace Recognition
Feeling of knowing (or expressed confidence) reflects a speaker's certainty or commitment to a statement and can be associated with one's trustworthiness or persuasiveness in social interaction. We investigated the perceptual-acoustic... more
    • by 
    •   8  
      Machine LearningSpeaker RecognitionSpeech CommunicationSocial Perception
Pre-processing of Speech Signal serves various purposes in any speech processing application. It includes Noise Removal, Endpoint Detection, Pre-emphasis, Framing, Windowing, Echo Canceling etc. Out of these, silence/unvoiced portion... more
    • by 
    •   6  
      Speaker RecognitionSpeech ProcessingNoise RemovalProbability Density Function
Making no claim of being exhaustive, a review of the most popular MFCC (Mel Frequency Cepstral Coefficients) implementations is made. These differ mainly in the particular approximation of the nonlinear pitch perception of human, the... more
    • by  and +1
    •   3  
      Speaker RecognitionSpeech ProcessingFeature Extraction
Reliable identity management must be built with an accurate user identity recognition method. This recognition usually is the core of the authentication method which is the essential part of any identity management system. The... more
    • by 
    •   7  
      Artificial IntelligenceSpeaker RecognitionBiometrics And IdentityAuthentication
It's me!" This pronouncement is usually made over the telephone or at an entryway out of sight of the intended hearer. It embodies the expectation that the sound of one's voice is sufficient for the hearer to recognize the speaker. In... more
    • by 
    •   4  
      Signal ProcessingSpeaker RecognitionAutomatic Speaker RecognitionLiterature survey
Recent studies show that Gaussian mixture model (GMM) weights carry less, yet complementary, information to GMM means for language and dialect recognition. However, state-of-the-art language recognition systems usually do not use this... more
    • by 
    •   10  
      Signal ProcessingSpeaker RecognitionAudio Signal ProcessingDigital Signal Processing
In this study, we present a binaural scene analyzer that is able to simultaneously localize, detect and identify a known number of target speakers in the presence of spatially positioned noise sources and reverberation. In contrast to... more
    • by 
    •   11  
      EngineeringSpeaker RecognitionA Priori KnowledgeAutomatic Speaker Recognition
Speaker recognition is the process of recognizing a speaker’s identity by his or her voice. Humans sound differently and there are features in our speaking voice which differentiate us from other people. In this paper, we show an... more
    • by 
    •   2  
      Speaker RecognitionSupport Vector Machines
This paper presents the feature analysis and design of compensators for speaker recognition under stressed speech conditions. Any condition that causes a speaker to vary his or her speech production from normal or neutral condition is... more
    • by 
    •   12  
      Cognitive ScienceSpeaker RecognitionSpeech ProductionLinguistics
Conventional multimodal biometric identification systems tend to have larger memory footprint, slower processing speeds and a higher implementation and operational cost. In this paper we propose a state of the art framework for multimodal... more
    • by  and +1
    •   12  
      Speaker RecognitionSupport Vector MachinesDatabasesSpeech
Standard speaker recognition system employs a pre-processed form of an acoustic signal, which provides information about the distribution of signal energy across time and frequency. However, different signal representations may be... more
    • by 
    •   7  
      EngineeringSpeaker RecognitionMFCCMultilayer Perceptron
Speaker recognition is the process of automatically recognizing who is speaking on the basis of individual information included in speech waves. This technique makes it possible to use the speaker's voice to verify their identity and... more
    • by 
    • Speaker Recognition
SPEAKER RECOGNATION SYSTEM (SRS)
    • by 
    •   5  
      Speaker RecognitionGrowth Mixture Models GMMText to SpeechHMM speech recognition
SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the research and development of neural speech processing technologies by being simple, flexible, user-friendly, and well-documented. This paper... more
    • by  and +3
    •   10  
      Machine LearningSignal ProcessingSpeaker RecognitionAudio Signal Processing
Dedicated to my parents who sacrificed their today for our better tomorrow and my mentors who have guided me throughout my research and helped me to improve professionally and personally. ACKNOWLEDGMENTS I would like to gratefully... more
    • by 
    •   4  
      Speaker RecognitionHomeland SecurityAccess ControlHigher Order Thinking
A simple yet complex approach to modern sophistication. In this project we used the MFCC approach to build a unique and accurate coefficients extracting processor to extract feature from the voice stored in the database, then on the next... more
    • by 
    •   6  
      Speaker RecognitionAutomatic Speech RecognitionSpeech RecognitionMatlab
The paper discusses Derrida's concept of hospitality which perfectly describes the experience of loosing the sense of feeling at home and reveals the disintegrating entrance of the Otherness into a coherent home space. Jacques Derrida's... more
    • by 
    •   253  
      Critical TheoryLanguagesBiochemistryBioinformatics
    • by 
    •   9  
      Signal ProcessingSpeaker RecognitionAudio Signal ProcessingPattern Recognition
ABSTRACT: Defining the vowel system in comparison with the previous similar works for standard Turkish by using a wider database and determining the rate of speaker specific invariance are two aims of this study. In the previous similar... more
    • by 
    •   4  
      Speaker RecognitionAcoustic PhoneticsTurkish phonetics, phonologyFormant Frequencies
Biometric system performance can be improved by means of data fusion. Several kinds of information can be fused in order to obtain a more accurate classification (identification or verification) of an input sample. In this paper we... more
    • by 
    •   22  
      Cognitive ScienceSignal ProcessingSpeaker RecognitionPattern Recognition
A wide variety of systems require reliable personal recognition schemes to either confirm or determine the identity of an individual requesting their services. The purpose of such schemes is to ensure that the rendered services are... more
    • by 
    •   7  
      Speaker RecognitionFace RecognitionGesture RecognitionData Privacy
This paper aims at inscribing Forensic Linguistics within the variegate field of Forensic sciences. After a deep and meticulous description of the state of art of Forensic Linguistics from 1960s until now, we propose all of the... more
    • by 
    •   6  
      Languages and LinguisticsForensic LinguisticsSociolinguisticsSpeaker Recognition
Superbisor sa Programang Edukasyon sa Filipino at Mother Tongue Based-Multi Lingual Education ng Kagawaran ng Edukasyon, Lungsod ng Maynila. DepEd Concept Paper Writer, Translator, Editor and National Lead Trainer. International Book... more
    • by 
    •   6  
      EducationLanguages and LinguisticsSpeaker RecognitionSupervisory Control of Discrete Event Systems
Learning good representations without supervision is still an open issue in machine learning, and is particularly challenging for speech signals, which are often characterized by long sequences with a complex hierarchical structure. Some... more
    • by  and +1
    •   8  
      Speaker RecognitionAutomatic Speech RecognitionSpeech RecognitionUnsupervised Learning Techniques
The QUT-NOISE-SRE protocol is designed to mix the large QUT-NOISE database, consisting of over 10 hours of background noise, collected across 10 unique locations covering 5 common noise scenarios, with commonly used speaker recognition... more
    • by  and +1
    •   7  
      Speaker RecognitionSpeaker VerificationSpeaker IdentificationNoise
Tracking speakers in multiparty conversations constitutes a fundamental task for automatic meeting analysis. In this paper, we present a novel probabilistic approach to jointly track the location and speaking activity of multiple speakers... more
    • by 
    •   69  
      EngineeringSocial PsychologySignal ProcessingSpeaker Recognition