ATE401644T1 - Verfahren zur spracherkennung - Google Patents
Verfahren zur spracherkennungInfo
- Publication number
- ATE401644T1 ATE401644T1 AT06250864T AT06250864T ATE401644T1 AT E401644 T1 ATE401644 T1 AT E401644T1 AT 06250864 T AT06250864 T AT 06250864T AT 06250864 T AT06250864 T AT 06250864T AT E401644 T1 ATE401644 T1 AT E401644T1
- Authority
- AT
- Austria
- Prior art keywords
- speech
- voice recognition
- recognized
- user
- importation
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Artificial Intelligence (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
- Electrically Operated Instructional Devices (AREA)
- Machine Translation (AREA)
- User Interface Of Digital Computer (AREA)
- Electric Clocks (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2005065355A JP4667082B2 (ja) | 2005-03-09 | 2005-03-09 | 音声認識方法 |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE401644T1 true ATE401644T1 (de) | 2008-08-15 |
Family
ID=36250777
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT06250864T ATE401644T1 (de) | 2005-03-09 | 2006-02-17 | Verfahren zur spracherkennung |
Country Status (8)
Country | Link |
---|---|
US (1) | US7634401B2 (de) |
EP (1) | EP1701338B1 (de) |
JP (1) | JP4667082B2 (de) |
KR (1) | KR100742888B1 (de) |
CN (1) | CN100587806C (de) |
AT (1) | ATE401644T1 (de) |
DE (1) | DE602006001764D1 (de) |
ES (1) | ES2310893T3 (de) |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4282704B2 (ja) * | 2006-09-27 | 2009-06-24 | 株式会社東芝 | 音声区間検出装置およびプログラム |
JP4950930B2 (ja) * | 2008-04-03 | 2012-06-13 | 株式会社東芝 | 音声/非音声を判定する装置、方法およびプログラム |
KR20130133629A (ko) * | 2012-05-29 | 2013-12-09 | 삼성전자주식회사 | 전자장치에서 음성명령을 실행시키기 위한 장치 및 방법 |
US8577671B1 (en) | 2012-07-20 | 2013-11-05 | Veveo, Inc. | Method of and system for using conversation state information in a conversational interaction system |
US9799328B2 (en) * | 2012-08-03 | 2017-10-24 | Veveo, Inc. | Method for using pauses detected in speech input to assist in interpreting the input during conversational interaction for information retrieval |
CN103971685B (zh) * | 2013-01-30 | 2015-06-10 | 腾讯科技(深圳)有限公司 | 语音命令识别方法和系统 |
ES2751484T3 (es) * | 2013-05-07 | 2020-03-31 | Veveo Inc | Interfaz de entrada de voz incremental con retroalimentación en tiempo real |
US20160063990A1 (en) * | 2014-08-26 | 2016-03-03 | Honeywell International Inc. | Methods and apparatus for interpreting clipped speech using speech recognition |
US9854049B2 (en) | 2015-01-30 | 2017-12-26 | Rovi Guides, Inc. | Systems and methods for resolving ambiguous terms in social chatter based on a user profile |
JP6972287B2 (ja) * | 2016-09-15 | 2021-11-24 | 東芝テック株式会社 | 音声認識装置、音声認識方法及び音声認識プログラム |
JP6804909B2 (ja) * | 2016-09-15 | 2020-12-23 | 東芝テック株式会社 | 音声認識装置、音声認識方法及び音声認識プログラム |
US10283117B2 (en) * | 2017-06-19 | 2019-05-07 | Lenovo (Singapore) Pte. Ltd. | Systems and methods for identification of response cue at peripheral device |
US10586529B2 (en) | 2017-09-14 | 2020-03-10 | International Business Machines Corporation | Processing of speech signal |
JP7092708B2 (ja) * | 2019-05-20 | 2022-06-28 | ヤフー株式会社 | 情報処理プログラム、情報処理装置及び情報処理方法 |
JP7404664B2 (ja) * | 2019-06-07 | 2023-12-26 | ヤマハ株式会社 | 音声処理装置及び音声処理方法 |
US12118984B2 (en) | 2020-11-11 | 2024-10-15 | Rovi Guides, Inc. | Systems and methods to resolve conflicts in conversations |
US11545143B2 (en) | 2021-05-18 | 2023-01-03 | Boris Fridman-Mintz | Recognition or synthesis of human-uttered harmonic sounds |
Family Cites Families (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4761815A (en) * | 1981-05-01 | 1988-08-02 | Figgie International, Inc. | Speech recognition system based on word state duration and/or weight |
US4712242A (en) * | 1983-04-13 | 1987-12-08 | Texas Instruments Incorporated | Speaker-independent word recognizer |
US5774851A (en) * | 1985-08-15 | 1998-06-30 | Canon Kabushiki Kaisha | Speech recognition apparatus utilizing utterance length information |
US4882757A (en) * | 1986-04-25 | 1989-11-21 | Texas Instruments Incorporated | Speech recognition system |
JP2882791B2 (ja) * | 1986-10-03 | 1999-04-12 | 株式会社リコー | パターン比較方式 |
JP2829014B2 (ja) | 1989-01-12 | 1998-11-25 | 株式会社東芝 | 音声認識装置及び方法 |
JP2708566B2 (ja) * | 1989-09-06 | 1998-02-04 | 株式会社日立製作所 | 音声認識制御装置 |
DE4031421C2 (de) * | 1989-10-05 | 1995-08-24 | Ricoh Kk | Musteranpassungssystem für eine Spracherkennungseinrichtung |
JP3004749B2 (ja) * | 1990-05-14 | 2000-01-31 | 株式会社リコー | 標準パターン登録方法 |
DE69128990T2 (de) * | 1990-09-07 | 1998-08-27 | Toshiba Kawasaki Kk | Sprecherkennungsvorrichtung |
US5692104A (en) * | 1992-12-31 | 1997-11-25 | Apple Computer, Inc. | Method and apparatus for detecting end points of speech activity |
DE4306508A1 (de) * | 1993-03-03 | 1994-09-08 | Philips Patentverwaltung | Verfahren und Anordnung zum Ermitteln von Wörtern in einem Sprachsignal |
US5765130A (en) * | 1996-05-21 | 1998-06-09 | Applied Language Technologies, Inc. | Method and apparatus for facilitating speech barge-in in connection with voice recognition systems |
US5835890A (en) * | 1996-08-02 | 1998-11-10 | Nippon Telegraph And Telephone Corporation | Method for speaker adaptation of speech models recognition scheme using the method and recording medium having the speech recognition method recorded thereon |
JP3588929B2 (ja) | 1996-08-27 | 2004-11-17 | 日産自動車株式会社 | 音声認識装置 |
US6167374A (en) * | 1997-02-13 | 2000-12-26 | Siemens Information And Communication Networks, Inc. | Signal processing method and system utilizing logical speech boundaries |
DE69831991T2 (de) | 1997-03-25 | 2006-07-27 | Koninklijke Philips Electronics N.V. | Verfahren und Vorrichtung zur Sprachdetektion |
JPH10319991A (ja) * | 1997-05-20 | 1998-12-04 | Sony Corp | 電子機器の音声認識起動方法及び装置 |
EP1083545A3 (de) * | 1999-09-09 | 2001-09-26 | Xanavi Informatics Corporation | Eigennamen Spracherkennung in einem Navigationssystem |
JP4520555B2 (ja) * | 1999-09-09 | 2010-08-04 | クラリオン株式会社 | 音声認識装置および音声認識ナビゲーション装置 |
US6389394B1 (en) * | 2000-02-09 | 2002-05-14 | Speechworks International, Inc. | Method and apparatus for improved speech recognition by modifying a pronunciation dictionary based on pattern definitions of alternate word pronunciations |
JP4880136B2 (ja) * | 2000-07-10 | 2012-02-22 | パナソニック株式会社 | 音声認識装置および音声認識方法 |
US7277853B1 (en) * | 2001-03-02 | 2007-10-02 | Mindspeed Technologies, Inc. | System and method for a endpoint detection of speech for improved speech recognition in noisy environments |
US7308404B2 (en) * | 2001-09-28 | 2007-12-11 | Sri International | Method and apparatus for speech recognition using a dynamic vocabulary |
JP2003330491A (ja) * | 2002-05-10 | 2003-11-19 | Nec Corp | 音声認識装置および音声認識方法ならびにプログラム |
KR100474253B1 (ko) * | 2002-12-12 | 2005-03-10 | 한국전자통신연구원 | 단어의 첫 자음 발성을 이용한 음성인식 방법 및 이를 저장한 기록 매체 |
US7024360B2 (en) * | 2003-03-17 | 2006-04-04 | Rensselaer Polytechnic Institute | System for reconstruction of symbols in a sequence |
US7343289B2 (en) * | 2003-06-25 | 2008-03-11 | Microsoft Corp. | System and method for audio/video speaker detection |
CA2473195C (en) * | 2003-07-29 | 2014-02-04 | Microsoft Corporation | Head mounted multi-sensory audio input system |
US20050033571A1 (en) * | 2003-08-07 | 2005-02-10 | Microsoft Corporation | Head mounted multi-sensory audio input system |
KR100577387B1 (ko) | 2003-08-06 | 2006-05-10 | 삼성전자주식회사 | 음성 대화 시스템에서의 음성 인식 오류 처리 방법 및 장치 |
JP3890326B2 (ja) * | 2003-11-07 | 2007-03-07 | キヤノン株式会社 | 情報処理装置、情報処理方法ならびに記録媒体、プログラム |
JP4516863B2 (ja) * | 2005-03-11 | 2010-08-04 | 株式会社ケンウッド | 音声合成装置、音声合成方法及びプログラム |
TWI319152B (en) * | 2005-10-04 | 2010-01-01 | Ind Tech Res Inst | Pre-stage detecting system and method for speech recognition |
JP4282704B2 (ja) * | 2006-09-27 | 2009-06-24 | 株式会社東芝 | 音声区間検出装置およびプログラム |
-
2005
- 2005-03-09 JP JP2005065355A patent/JP4667082B2/ja not_active Expired - Fee Related
-
2006
- 2006-02-17 ES ES06250864T patent/ES2310893T3/es active Active
- 2006-02-17 AT AT06250864T patent/ATE401644T1/de not_active IP Right Cessation
- 2006-02-17 DE DE602006001764T patent/DE602006001764D1/de active Active
- 2006-02-17 EP EP06250864A patent/EP1701338B1/de not_active Not-in-force
- 2006-03-06 US US11/368,986 patent/US7634401B2/en not_active Expired - Fee Related
- 2006-03-08 KR KR1020060021863A patent/KR100742888B1/ko not_active IP Right Cessation
- 2006-03-09 CN CN200610057222A patent/CN100587806C/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
KR20060097647A (ko) | 2006-09-14 |
EP1701338B1 (de) | 2008-07-16 |
US20060206326A1 (en) | 2006-09-14 |
JP2006251147A (ja) | 2006-09-21 |
CN1831939A (zh) | 2006-09-13 |
DE602006001764D1 (de) | 2008-08-28 |
EP1701338A1 (de) | 2006-09-13 |
KR100742888B1 (ko) | 2007-07-25 |
CN100587806C (zh) | 2010-02-03 |
ES2310893T3 (es) | 2009-01-16 |
JP4667082B2 (ja) | 2011-04-06 |
US7634401B2 (en) | 2009-12-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE401644T1 (de) | Verfahren zur spracherkennung | |
TW200601263A (en) | Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition | |
WO2006023631A3 (en) | Document transcription system training | |
DE602005001125D1 (de) | Erlernen der Aussprache neuer Worte unter Verwendung eines Aussprachegraphen | |
WO2007015869A3 (en) | Spoken language proficiency assessment by computer | |
TW200638337A (en) | Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system | |
ATE524777T1 (de) | Automatische aktualisierung eines sprachmodells | |
WO2007027989A3 (en) | Dynamic speech sharpening | |
WO2009025356A1 (ja) | 音声認識装置および音声認識方法 | |
WO2011133766A3 (en) | Methods and systems for training dictation-based speech-to-text systems using recorded samples | |
EP0865032A3 (de) | Spracherkenner mit Rauschadaptierung | |
ATE457510T1 (de) | Spracherkennungssystem mit riesigem vokabular | |
ATE405919T1 (de) | Spracherkennungssystem und verfahren auf phonetischer basis | |
DE60111329D1 (de) | Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung | |
ATE343197T1 (de) | Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells | |
ATE395685T1 (de) | Spracherkennung durch wort-in-phrase-befehl | |
DE60026637D1 (de) | Verfahren zur Erweiterung des Wortschatzes eines Spracherkennungssystems | |
WO2008042511A3 (en) | Personalizing a voice dialogue system | |
WO2004100638A3 (en) | Source-dependent text-to-speech system | |
WO2007047587A3 (en) | Method and device for recognizing human intent | |
WO2007034478A3 (en) | System and method for correcting speech | |
WO2008005711A3 (en) | Non-enrolled continuous dictation | |
WO2004049305A3 (en) | Discriminative training of hidden markov models for continuous speech recognition | |
ATE394773T1 (de) | Verfahren zur spracherkennung mit zeitabhängiger interpolation und verborgenen dynamischen wertklassen | |
ATE487212T1 (de) | Verstekte bedingte zufallfeldermodelle für phonetische klassifizierung und spracherkennung |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |