ATE401644T1 - Verfahren zur spracherkennung - Google Patents

Verfahren zur spracherkennung

Info

Publication number
ATE401644T1
ATE401644T1 AT06250864T AT06250864T ATE401644T1 AT E401644 T1 ATE401644 T1 AT E401644T1 AT 06250864 T AT06250864 T AT 06250864T AT 06250864 T AT06250864 T AT 06250864T AT E401644 T1 ATE401644 T1 AT E401644T1
Authority
AT
Austria
Prior art keywords
speech
voice recognition
recognized
user
importation
Prior art date
Application number
AT06250864T
Other languages
English (en)
Inventor
Toshiaki Fukada
Original Assignee
Canon Kk
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Kk filed Critical Canon Kk
Application granted granted Critical
Publication of ATE401644T1 publication Critical patent/ATE401644T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Machine Translation (AREA)
  • User Interface Of Digital Computer (AREA)
  • Electric Clocks (AREA)
AT06250864T 2005-03-09 2006-02-17 Verfahren zur spracherkennung ATE401644T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2005065355A JP4667082B2 (ja) 2005-03-09 2005-03-09 音声認識方法

Publications (1)

Publication Number Publication Date
ATE401644T1 true ATE401644T1 (de) 2008-08-15

Family

ID=36250777

Family Applications (1)

Application Number Title Priority Date Filing Date
AT06250864T ATE401644T1 (de) 2005-03-09 2006-02-17 Verfahren zur spracherkennung

Country Status (8)

Country Link
US (1) US7634401B2 (de)
EP (1) EP1701338B1 (de)
JP (1) JP4667082B2 (de)
KR (1) KR100742888B1 (de)
CN (1) CN100587806C (de)
AT (1) ATE401644T1 (de)
DE (1) DE602006001764D1 (de)
ES (1) ES2310893T3 (de)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4282704B2 (ja) * 2006-09-27 2009-06-24 株式会社東芝 音声区間検出装置およびプログラム
JP4950930B2 (ja) * 2008-04-03 2012-06-13 株式会社東芝 音声/非音声を判定する装置、方法およびプログラム
KR20130133629A (ko) * 2012-05-29 2013-12-09 삼성전자주식회사 전자장치에서 음성명령을 실행시키기 위한 장치 및 방법
US8577671B1 (en) 2012-07-20 2013-11-05 Veveo, Inc. Method of and system for using conversation state information in a conversational interaction system
US9799328B2 (en) * 2012-08-03 2017-10-24 Veveo, Inc. Method for using pauses detected in speech input to assist in interpreting the input during conversational interaction for information retrieval
CN103971685B (zh) * 2013-01-30 2015-06-10 腾讯科技(深圳)有限公司 语音命令识别方法和系统
ES2751484T3 (es) * 2013-05-07 2020-03-31 Veveo Inc Interfaz de entrada de voz incremental con retroalimentación en tiempo real
US20160063990A1 (en) * 2014-08-26 2016-03-03 Honeywell International Inc. Methods and apparatus for interpreting clipped speech using speech recognition
US9854049B2 (en) 2015-01-30 2017-12-26 Rovi Guides, Inc. Systems and methods for resolving ambiguous terms in social chatter based on a user profile
JP6972287B2 (ja) * 2016-09-15 2021-11-24 東芝テック株式会社 音声認識装置、音声認識方法及び音声認識プログラム
JP6804909B2 (ja) * 2016-09-15 2020-12-23 東芝テック株式会社 音声認識装置、音声認識方法及び音声認識プログラム
US10283117B2 (en) * 2017-06-19 2019-05-07 Lenovo (Singapore) Pte. Ltd. Systems and methods for identification of response cue at peripheral device
US10586529B2 (en) 2017-09-14 2020-03-10 International Business Machines Corporation Processing of speech signal
JP7092708B2 (ja) * 2019-05-20 2022-06-28 ヤフー株式会社 情報処理プログラム、情報処理装置及び情報処理方法
JP7404664B2 (ja) * 2019-06-07 2023-12-26 ヤマハ株式会社 音声処理装置及び音声処理方法
US12118984B2 (en) 2020-11-11 2024-10-15 Rovi Guides, Inc. Systems and methods to resolve conflicts in conversations
US11545143B2 (en) 2021-05-18 2023-01-03 Boris Fridman-Mintz Recognition or synthesis of human-uttered harmonic sounds

Family Cites Families (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4761815A (en) * 1981-05-01 1988-08-02 Figgie International, Inc. Speech recognition system based on word state duration and/or weight
US4712242A (en) * 1983-04-13 1987-12-08 Texas Instruments Incorporated Speaker-independent word recognizer
US5774851A (en) * 1985-08-15 1998-06-30 Canon Kabushiki Kaisha Speech recognition apparatus utilizing utterance length information
US4882757A (en) * 1986-04-25 1989-11-21 Texas Instruments Incorporated Speech recognition system
JP2882791B2 (ja) * 1986-10-03 1999-04-12 株式会社リコー パターン比較方式
JP2829014B2 (ja) 1989-01-12 1998-11-25 株式会社東芝 音声認識装置及び方法
JP2708566B2 (ja) * 1989-09-06 1998-02-04 株式会社日立製作所 音声認識制御装置
DE4031421C2 (de) * 1989-10-05 1995-08-24 Ricoh Kk Musteranpassungssystem für eine Spracherkennungseinrichtung
JP3004749B2 (ja) * 1990-05-14 2000-01-31 株式会社リコー 標準パターン登録方法
DE69128990T2 (de) * 1990-09-07 1998-08-27 Toshiba Kawasaki Kk Sprecherkennungsvorrichtung
US5692104A (en) * 1992-12-31 1997-11-25 Apple Computer, Inc. Method and apparatus for detecting end points of speech activity
DE4306508A1 (de) * 1993-03-03 1994-09-08 Philips Patentverwaltung Verfahren und Anordnung zum Ermitteln von Wörtern in einem Sprachsignal
US5765130A (en) * 1996-05-21 1998-06-09 Applied Language Technologies, Inc. Method and apparatus for facilitating speech barge-in in connection with voice recognition systems
US5835890A (en) * 1996-08-02 1998-11-10 Nippon Telegraph And Telephone Corporation Method for speaker adaptation of speech models recognition scheme using the method and recording medium having the speech recognition method recorded thereon
JP3588929B2 (ja) 1996-08-27 2004-11-17 日産自動車株式会社 音声認識装置
US6167374A (en) * 1997-02-13 2000-12-26 Siemens Information And Communication Networks, Inc. Signal processing method and system utilizing logical speech boundaries
DE69831991T2 (de) 1997-03-25 2006-07-27 Koninklijke Philips Electronics N.V. Verfahren und Vorrichtung zur Sprachdetektion
JPH10319991A (ja) * 1997-05-20 1998-12-04 Sony Corp 電子機器の音声認識起動方法及び装置
EP1083545A3 (de) * 1999-09-09 2001-09-26 Xanavi Informatics Corporation Eigennamen Spracherkennung in einem Navigationssystem
JP4520555B2 (ja) * 1999-09-09 2010-08-04 クラリオン株式会社 音声認識装置および音声認識ナビゲーション装置
US6389394B1 (en) * 2000-02-09 2002-05-14 Speechworks International, Inc. Method and apparatus for improved speech recognition by modifying a pronunciation dictionary based on pattern definitions of alternate word pronunciations
JP4880136B2 (ja) * 2000-07-10 2012-02-22 パナソニック株式会社 音声認識装置および音声認識方法
US7277853B1 (en) * 2001-03-02 2007-10-02 Mindspeed Technologies, Inc. System and method for a endpoint detection of speech for improved speech recognition in noisy environments
US7308404B2 (en) * 2001-09-28 2007-12-11 Sri International Method and apparatus for speech recognition using a dynamic vocabulary
JP2003330491A (ja) * 2002-05-10 2003-11-19 Nec Corp 音声認識装置および音声認識方法ならびにプログラム
KR100474253B1 (ko) * 2002-12-12 2005-03-10 한국전자통신연구원 단어의 첫 자음 발성을 이용한 음성인식 방법 및 이를 저장한 기록 매체
US7024360B2 (en) * 2003-03-17 2006-04-04 Rensselaer Polytechnic Institute System for reconstruction of symbols in a sequence
US7343289B2 (en) * 2003-06-25 2008-03-11 Microsoft Corp. System and method for audio/video speaker detection
CA2473195C (en) * 2003-07-29 2014-02-04 Microsoft Corporation Head mounted multi-sensory audio input system
US20050033571A1 (en) * 2003-08-07 2005-02-10 Microsoft Corporation Head mounted multi-sensory audio input system
KR100577387B1 (ko) 2003-08-06 2006-05-10 삼성전자주식회사 음성 대화 시스템에서의 음성 인식 오류 처리 방법 및 장치
JP3890326B2 (ja) * 2003-11-07 2007-03-07 キヤノン株式会社 情報処理装置、情報処理方法ならびに記録媒体、プログラム
JP4516863B2 (ja) * 2005-03-11 2010-08-04 株式会社ケンウッド 音声合成装置、音声合成方法及びプログラム
TWI319152B (en) * 2005-10-04 2010-01-01 Ind Tech Res Inst Pre-stage detecting system and method for speech recognition
JP4282704B2 (ja) * 2006-09-27 2009-06-24 株式会社東芝 音声区間検出装置およびプログラム

Also Published As

Publication number Publication date
KR20060097647A (ko) 2006-09-14
EP1701338B1 (de) 2008-07-16
US20060206326A1 (en) 2006-09-14
JP2006251147A (ja) 2006-09-21
CN1831939A (zh) 2006-09-13
DE602006001764D1 (de) 2008-08-28
EP1701338A1 (de) 2006-09-13
KR100742888B1 (ko) 2007-07-25
CN100587806C (zh) 2010-02-03
ES2310893T3 (es) 2009-01-16
JP4667082B2 (ja) 2011-04-06
US7634401B2 (en) 2009-12-15

Similar Documents

Publication Publication Date Title
ATE401644T1 (de) Verfahren zur spracherkennung
TW200601263A (en) Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition
WO2006023631A3 (en) Document transcription system training
DE602005001125D1 (de) Erlernen der Aussprache neuer Worte unter Verwendung eines Aussprachegraphen
WO2007015869A3 (en) Spoken language proficiency assessment by computer
TW200638337A (en) Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
ATE524777T1 (de) Automatische aktualisierung eines sprachmodells
WO2007027989A3 (en) Dynamic speech sharpening
WO2009025356A1 (ja) 音声認識装置および音声認識方法
WO2011133766A3 (en) Methods and systems for training dictation-based speech-to-text systems using recorded samples
EP0865032A3 (de) Spracherkenner mit Rauschadaptierung
ATE457510T1 (de) Spracherkennungssystem mit riesigem vokabular
ATE405919T1 (de) Spracherkennungssystem und verfahren auf phonetischer basis
DE60111329D1 (de) Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung
ATE343197T1 (de) Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells
ATE395685T1 (de) Spracherkennung durch wort-in-phrase-befehl
DE60026637D1 (de) Verfahren zur Erweiterung des Wortschatzes eines Spracherkennungssystems
WO2008042511A3 (en) Personalizing a voice dialogue system
WO2004100638A3 (en) Source-dependent text-to-speech system
WO2007047587A3 (en) Method and device for recognizing human intent
WO2007034478A3 (en) System and method for correcting speech
WO2008005711A3 (en) Non-enrolled continuous dictation
WO2004049305A3 (en) Discriminative training of hidden markov models for continuous speech recognition
ATE394773T1 (de) Verfahren zur spracherkennung mit zeitabhängiger interpolation und verborgenen dynamischen wertklassen
ATE487212T1 (de) Verstekte bedingte zufallfeldermodelle für phonetische klassifizierung und spracherkennung

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties