CN105791931A - Smart television and voice control method of the smart television - Google Patents

Smart television and voice control method of the smart television Download PDF

Info

Publication number
CN105791931A
CN105791931A CN201610109679.7A CN201610109679A CN105791931A CN 105791931 A CN105791931 A CN 105791931A CN 201610109679 A CN201610109679 A CN 201610109679A CN 105791931 A CN105791931 A CN 105791931A
Authority
CN
China
Prior art keywords
sound
phonetic order
intelligent television
user
sub
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610109679.7A
Other languages
Chinese (zh)
Inventor
汪斯涛
王云华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen TCL New Technology Co Ltd
Original Assignee
Shenzhen TCL New Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen TCL New Technology Co Ltd filed Critical Shenzhen TCL New Technology Co Ltd
Priority to CN201610109679.7A priority Critical patent/CN105791931A/en
Priority to PCT/CN2016/084869 priority patent/WO2017143692A1/en
Publication of CN105791931A publication Critical patent/CN105791931A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/34Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/54Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Selective Calling Equipment (AREA)
  • User Interface Of Digital Computer (AREA)
  • Television Systems (AREA)

Abstract

The invention discloses a voice control method of a smart television. The method comprises following processes of determining the user identifier of a user in a voice control mode according to a received voice construction when the voice instruction input by the user is received; searching a sub-voice library associated with the user identifier from a prebuilt standard voice library; matching voice templates stored in the searched sub-voice library with the received voice instruction; and controlling the smart television to execute corresponding operations according to the control instruction corresponding to the voice template matching with the voice instruction when a voice template stored in the sub-voice library matches with the voice instruction. The invention also provides the smart television. According to the method and the smart television, the technical problem that the voice identification speed is low when the smart television is controlled by voices is solved.

Description

Intelligent television and sound control method thereof
Technical field
The present invention relates to voice control technology field, particularly relate to a kind of intelligent television and sound control method thereof.
Background technology
nullDevelopment along with intelligent television technology,The function that intelligent television has increasingly is come,Such as,There are video display,Music,Intelligent television program,Game,The functions such as mail,In order to adapt to the various functions on intelligent television,Button on remote controller gets more and more,The demand of user can not be met gradually,Therefore,The intelligent television with voice control function is developed,Existing voice recognition mode mainly has two kinds,A kind of voice recognition mode relies on cloud network,Intelligent television is after the voice command receiving user,The sound bank needed to high in the clouds carries out contrasting to get control instruction,But this mode is bad in network environment、Do not connect network or high in the clouds storage received pronunciation too much time,Cause that speech recognition speed is slow,Even cannot be carried out speech recognition,Another way is when intelligent television dispatches from the factory,Storage received pronunciation storehouse carries out local voice identification for user,But,Pronunciation characteristic due to different user、Language is not equal,Equally exist the slow-footed technical problem of speech recognition.
Summary of the invention
The present invention provides a kind of intelligent television and sound control method thereof, and its main purpose is in that to solve speech recognition slow-footed technical problem during Voice command intelligent television in prior art.
For achieving the above object, the present invention provides the sound control method of a kind of intelligent television, and the sound control method of this intelligent television includes:
Under Voice command pattern, when receiving the phonetic order of user's input, determine the ID of described user according to the described phonetic order received;
From the received pronunciation storehouse pre-build, search the sub-sound bank associated with described ID, and the sound template of storage in the described sub-sound bank found is mated with the described phonetic order received;
When described sub-sound bank exists the sound template mated with described phonetic order, control intelligent television according to the control instruction corresponding with the described sound template that described phonetic order mates and perform corresponding operation.
Preferably, the sound control method of described intelligent television further comprises the steps of:
Under speech data memory module, when receiving the speech data of user's input, described speech data is resolved, obtains the speech speed of described speech data;
According to the speech speed got described speech data stored corresponding with described speech speed in sound bank.
Preferably, described under Voice command pattern, when receiving the phonetic order of user's input, determine that according to the described phonetic order received the step of the ID of described user includes:
When receiving the phonetic order of user's input under Voice command pattern, calculate described user and send the speech speed of described phonetic order;
Determine that the speech speed that described speech speed is corresponding is interval, obtain the ID that described speech speed interval is corresponding.
Preferably, it is described when described sub-sound bank exists the sound template mated with described phonetic order, after controlling, according to the control instruction corresponding with the described sound template that described phonetic order mates, the step that described intelligent television performs corresponding operation, the sound control method of described intelligent television further comprises the steps of:
Update the access times of the sound template mated with described phonetic order, and the access times after described renewal are associated storage with described sound template.
Preferably, the described sub-sound bank associated with described ID of searching from the received pronunciation storehouse pre-build, and the step that the sound template of storage in the described sub-sound bank found carries out mating with the described phonetic order received is included:
The sub-sound bank associated with described ID is searched, by the sound template in described sub-sound bank according to access times by how to be at least ranked up from the received pronunciation storehouse pre-build;
According to access times by many orders at least, the sound template of storage in described sub-sound bank is mated one by one with the described phonetic order received.
Additionally, for achieving the above object, the present invention also provides for a kind of intelligent television, and this intelligent television includes:
Mark determines module, for, under Voice command pattern, when receiving the phonetic order of user's input, determining the ID of described user according to the described phonetic order received;
Voice match module, for searching the sub-sound bank associated with described ID from the received pronunciation storehouse pre-build, and mates the sound template of storage in the described sub-sound bank found with the described phonetic order received;
Instruction performs module, for when there is the sound template mated with described phonetic order in described sub-sound bank, controlling intelligent television according to the control instruction corresponding with the described sound template that described phonetic order mates and performing corresponding operation.
Preferably, described intelligent television also includes:
Word speed acquisition module, for, under speech data memory module, when receiving the speech data of user's input, resolving described speech data, obtain the speech speed of described speech data;
Data memory module is corresponding with described speech speed in sound bank for being stored by described speech data according to the speech speed that gets.
Preferably, described mark determines that module includes:
Word speed computing unit, when being used for the phonetic order receiving user's input under Voice command pattern, calculates described user and sends the speech speed of described phonetic order;
Mark acquiring unit, interval for determining the speech speed that described speech speed is corresponding, obtain the ID that described speech speed interval is corresponding.
Preferably, described intelligent television also includes:
Number of times is new module more, for updating the access times of the sound template mated with described phonetic order, and with described sound template, the access times after described renewal is associated storage.
Preferably, described voice match module includes:
Template sequencing unit, for searching the sub-sound bank associated with described ID, by the sound template in described sub-sound bank according to access times by how to be at least ranked up from the received pronunciation storehouse pre-build;
Voice match unit, is used for according to access times by many orders at least, is mated one by one with the described phonetic order received by the sound template of storage in described sub-sound bank.
nullThe intelligent television of present invention proposition and sound control method thereof,When receiving the phonetic order of user's input,The ID of user is determined according to the phonetic order received,The sub-sound bank associated with this ID is searched from the received pronunciation storehouse pre-build,And the sound template of storage in the sub-sound bank found is mated with the phonetic order received,When group sound bank has sound template to mate with phonetic order,Control intelligent television according to the control instruction corresponding with the sound template that phonetic order mates and perform corresponding operation,The present invention is in advance for using each user of this intelligent television to set up sub-sound bank,And associate with the ID of this user,When sound template in carrying out sound bank mates with phonetic order,Have only to after determining the user that the phonetic order received is corresponding,Obtain the ID of this user,The sub-sound bank directly associated to this ID carries out voice match,Decrease the amount of calculation of voice match,And contrast without the sound bank to high in the clouds,Drastically increase the speed of speech recognition,And then accelerate the response speed of phonetic order.
Accompanying drawing explanation
Fig. 1 is the flow chart of the sound control method first embodiment of intelligent television of the present invention;
Fig. 2 be intelligent television of the present invention sound control method first embodiment in obtain the refinement schematic flow sheet of step of ID;
Fig. 3 is the flow chart of sound control method second embodiment of intelligent television of the present invention;
Fig. 4 is the refinement schematic flow sheet of the step in sound control method second embodiment of intelligent television of the present invention, phonetic order mated;
Fig. 5 is the high-level schematic functional block diagram of intelligent television first embodiment of the present invention;
Fig. 6 is the refinement high-level schematic functional block diagram of identifier acquisition module in intelligent television first embodiment of the present invention;
Fig. 7 is the high-level schematic functional block diagram of intelligent television the second embodiment of the present invention;
Fig. 8 is the refinement high-level schematic functional block diagram of voice match module in intelligent television the second embodiment of the present invention.
The realization of the object of the invention, functional characteristics and advantage will in conjunction with the embodiments, are described further with reference to accompanying drawing.
Detailed description of the invention
Should be appreciated that specific embodiment described herein is only in order to explain the present invention, is not intended to limit the present invention.
The present invention provides the sound control method of a kind of intelligent television.
With reference to shown in Fig. 1, for the flow chart of the sound control method first embodiment of intelligent television of the present invention.
In the first embodiment, the sound control method of this intelligent television includes:
Step S10, under Voice command pattern, when receiving the phonetic order of user's input, determines the ID of described user according to the described phonetic order received;
Step S20, searches the sub-sound bank associated with described ID from the received pronunciation storehouse pre-build, and is mated with the described phonetic order received by the sound template of storage in the described sub-sound bank found;
On the remote control unit of intelligent television or the control triggering Voice command pattern is directly set on intelligent television, when using hands machine, the mobile terminals such as panel computer are set up with intelligent television when being connected, Voice command pattern can also be triggered by above-mentioned mobile terminal, under Voice command pattern, when user distance intelligent television farther out time, in order to ensure the definition of the voice signal collected, the phonetic order that user sends can be gathered by remote control unit or mobile terminal, intelligent television is sent it to by the mode of wireless telecommunications, when user distance intelligent television is nearer, the phonetic order that user sends can also be gathered either directly through the mike of intelligent television.When receiving the phonetic order of user's input, determine the ID of user according to this phonetic order.
Generally, the user of same intelligent television is used to be likely to more than one, but the characteristic voice of each user differing, for instance, it is possible to different according to tone color, speech speed etc. judge it is which user is carrying out Voice command.
In one embodiment, with reference to shown in Fig. 2, step S10 can include following refinement step:
Step S11, when receiving the phonetic order of user's input under Voice command pattern, calculates described user and sends the speech speed of described phonetic order;
Step S12, it is determined that the speech speed that described speech speed is corresponding is interval, obtains the ID that described speech speed interval is corresponding.
In this embodiment, the user of correspondence is determined by speech speed, user stores the voice of oneself in advance as sound template on intelligent television, the sound template of user is analyzed determining the speech speed of this user by intelligent television, and it is interval to divide speech speed in advance, for instance, it is Rapid Speech that unit says 1.5-2 character for 1 second, it is middling speed voice that unit says 1-1.5 character for 1 second, and it is low rate speech that unit says 0.5-1 character for 1 second;Judge that the speech speed belonging to this speech speed is interval, interval for this speech speed ID with this user is associated, simultaneously, Criterion sound bank, received pronunciation storehouse is divided into many sub-sound banks, distribute a sub-sound bank for each user and associate with the ID of this user, above-mentioned sound template user recorded in advance stores in the sub-sound bank that this ID is corresponding, and the control instruction of correspondence is set for each sound template, such as channel adds, channel down, weather lookup, application open command, volume adjusting instruction etc., user can be configured according to the needs of oneself, when inputting phonetic order, have only to the voice content that input is identical with sound template.
When intelligent television has carried out system update, being provided with new function, or after being mounted with new application, the sound template in sub-sound bank corresponding to the ID of oneself can be updated by user at any time, for instance, amendment, add or deletion etc..
Under Voice command pattern, after intelligent television receives the phonetic order of user's input, according to the phonetic order received, calculate this user and send the speech speed of phonetic order, wherein, in how having about the calculation of speech speed, for example, it is possible to the phonetic order that user inputs is converted into word, obtain the duration of this phonetic order simultaneously, thus obtaining the number of characters that each second, user sent, as speech speed;Or the voice signal received is carried out waveform analysis, to obtain the speech speed of user, if the signal received is digital signal, then after converting digital signals into analogue signal, carries out waveform analysis, to obtain the speech speed of user.
After obtaining ID, from the received pronunciation storehouse pre-build, search the sub-sound bank associated with ID, and the sound template of storage in the sub-sound bank found is mated with the phonetic order received.
Step S30, when there is the sound template mated with described phonetic order in described sub-sound bank, controlling intelligent television according to the control instruction corresponding with the described sound template that described phonetic order mates and performing corresponding operation.
When finding, in the sub-sound bank corresponding in this ID, the sound template that the phonetic order sent with user mates, control intelligent television according to the control instruction that sound template is corresponding and perform corresponding operation.
Further, after step S20, the sound control method of this intelligent television further comprises the steps of:
The text message that display is corresponding with the sound template of described phonetic order coupling on the display interface of described intelligent television;
After step S30, the sound control method of this intelligent television is further comprising the steps of:
The display interface of described intelligent television shows result after the intelligent television described control instruction of execution in the form of text.In other embodiments, it is also possible to perform the result after described control instruction with speech form feedback intelligent TV.
nullThe sound control method of the intelligent television that the present embodiment proposes,When receiving the phonetic order of user's input,The ID of user is determined according to the phonetic order received,The sub-sound bank associated with this ID is searched from the received pronunciation storehouse pre-build,And the sound template of storage in the sub-sound bank found is mated with the phonetic order received,When group sound bank has sound template to mate with phonetic order,Control intelligent television according to the control instruction corresponding with the sound template that phonetic order mates and perform corresponding operation,The present invention is in advance for using each user of this intelligent television to set up sub-sound bank,And associate with the ID of this user,When sound template in carrying out sound bank mates with phonetic order,Have only to after determining the user that the phonetic order received is corresponding,Obtain the ID of this user,The sub-sound bank directly associated to this ID carries out voice match,Decrease the amount of calculation of voice match,And contrast without the sound bank to high in the clouds,Drastically increase the speed of speech recognition,And then accelerate the response speed of phonetic order.
With reference to shown in Fig. 3, the second embodiment of the sound control method of intelligent television of the present invention is proposed based on the first embodiment of the sound control method of intelligent television of the present invention.In the present embodiment, described method and first embodiment are distinctive in that, the sound control method of this intelligent television also includes:
Step S40, under speech data memory module, when receiving the speech data of user's input, resolves described speech data, obtains the speech speed of described speech data;
Step S50, stores corresponding with described speech speed in sound bank according to the speech speed got by described speech data.
User can record in the locally stored space that the voice of oneself stores intelligent television as sound template, when carrying out Voice command, intelligent television can be made directly local voice identification, contrast without the sound bank to high in the clouds, improve the speed of speech recognition, further, the voice in local voice storehouse can also be uploaded to high in the clouds as backup by user.
When setting up sub-sound bank, user controls intelligent television and enters speech data memory module, when receiving the speech data of user's input, speech speed according to speech data, namely the word speed of user is allocated, so, when the word speed of active user is middling speed, then corresponding with speech speed in sound bank by automatically the speech data of this user being stored.In other embodiments, can also be which user according to other mode identification, and its speech data stored correspondence in sound bank, such as, not equal by tone color, or, when user sends speech data by mobile terminals such as mobile phones, distinguish user according to the mark of mobile terminal.
Or, in other examples, it is also possible to set up sub-sound bank in such a way:
Under speech data memory module, set up sub-sound bank, the ID that the described sub-sound bank set up inputs with user is associated;When receiving the speech data of user's input, the speech data received is stored in the described sub-sound bank associated with the described ID set up as sound template.
Speech data memory module is set, carrying out when arranging of sound template, enter into speech data memory module, set up sub-sound bank, the ID that the described sub-sound bank set up inputs with user is associated, receive the speech data of user's input, the speech data received is stored in the described sub-sound bank associated with the described ID set up as sound template.
The sound control method of the intelligent television that the present embodiment proposes, for using each user of this intelligent television to set up sub-sound bank, and associate with the ID of this user, when sound template in carrying out sound bank mates with phonetic order, can according to the speech speed of this user, the speech data of its correspondence is assigned in the sub-sound bank of correspondence, drastically increases the speed of speech recognition, and then accelerate the response speed of phonetic order.
3rd embodiment of sound control method of intelligent television of the present invention is proposed based on the first embodiment of the sound control method of intelligent television of the present invention.In the present embodiment, described method and first embodiment are distinctive in that, after the step s 40, the method is further comprising the steps of:
Update the access times of the sound template mated with described phonetic order, and the access times after described renewal are associated storage with described sound template.
With reference to shown in Fig. 4, carry out the renewal of access times of sound template and associate storage basis on, step S20 can include following refinement step:
Step S21, searches the sub-sound bank associated with described ID, by the sound template in described sub-sound bank according to access times by how to be at least ranked up from the received pronunciation storehouse pre-build;
Step S22, according to access times by many orders at least, mates with the described phonetic order received one by one by the sound template of storage in described sub-sound bank.
After the phonetic order that successful match inputs to user each time, all the sound template matched is carried out the renewal of access times, each sound template in sub-sound bank is storing after on intelligent television, its initial access times are zero, every successful match is once, this number of times is increased by 1, count user in this manner and use the number of times of each sound template, when carrying out the coupling of phonetic order, by the sound template in sub-sound bank according to access times by how to be at least ranked up, then according to access times are by many orders at least, the sound template of storage in sub-sound bank is mated one by one with the described phonetic order received.
The sound control method of the intelligent television that the present embodiment proposes, carry out the renewal of access times of sound template and associate storage basis on, by when in sub-sound bank, the sound template of storage mates with the phonetic order received, mate one by one by many orders at least according to access times, so, some commonly used phonetic orders just can match the sound template of correspondence more rapidly, further increase the speed of speech recognition, and then accelerate the response speed of phonetic order.
The present invention also proposes a kind of intelligent television.
With reference to shown in Fig. 5, for the high-level schematic functional block diagram of intelligent television first embodiment of the present invention.
In this embodiment, this intelligent television includes mark and determines that module 10, voice match module 20 and instruction perform module 30.
Concrete, mark determines module 10, for, under Voice command pattern, when receiving the phonetic order of user's input, determining the ID of described user according to the described phonetic order received;
Voice match module 20, for searching the sub-sound bank associated with described ID from the received pronunciation storehouse pre-build, and mates the sound template of storage in the described sub-sound bank found with the described phonetic order received;
On the remote control unit of intelligent television or the control triggering Voice command pattern is directly set on intelligent television, when using hands machine, the mobile terminals such as panel computer are set up with intelligent television when being connected, Voice command pattern can also be triggered by above-mentioned mobile terminal, under Voice command pattern, when user distance intelligent television farther out time, in order to ensure the definition of the voice signal collected, the phonetic order that user sends can be gathered by remote control unit or mobile terminal, intelligent television is sent it to by the mode of wireless telecommunications, when user distance intelligent television is nearer, the phonetic order that user sends can also be gathered either directly through the mike of intelligent television.When mark determines the phonetic order that module 10 receives user's input, determine the ID of user according to this phonetic order.
Generally, the user of same intelligent television is used to be likely to more than one, but the characteristic voice of each user differing, for instance, it is possible to different according to tone color, speech speed etc. judge it is which user is carrying out Voice command.
In one embodiment, with reference to shown in Fig. 6, mark determines that module 10 can include following refinement unit:
Word speed computing unit 11, when being used for the phonetic order receiving user's input under Voice command pattern, calculates described user and sends the speech speed of described phonetic order;
Mark acquiring unit 12, interval for determining the speech speed that described speech speed is corresponding, obtain the ID that described speech speed interval is corresponding.
In this embodiment, the user of correspondence is determined by speech speed, user stores the voice of oneself in advance as sound template on intelligent television, the sound template of user is analyzed determining the speech speed of this user, and it is interval to divide speech speed in advance, for instance, it is Rapid Speech that unit says 1.5-2 character for 1 second, it is middling speed voice that unit says 1-1.5 character for 1 second, and it is low rate speech that unit says 0.5-1 character for 1 second;Judge that the speech speed belonging to this speech speed is interval, interval for this speech speed ID with this user is associated, simultaneously, Criterion sound bank, received pronunciation storehouse is divided into many sub-sound banks, distribute a sub-sound bank for each user and associate with the ID of this user, above-mentioned sound template user recorded in advance stores in the sub-sound bank that this ID is corresponding, and the control instruction of correspondence is set for each sound template, such as channel adds, channel down, weather lookup, application open command, volume adjusting instruction etc., user can be configured according to the needs of oneself, when inputting phonetic order, have only to the voice content that input is identical with sound template.
When intelligent television has carried out system update, being provided with new function, or after being mounted with new application, the sound template in sub-sound bank corresponding to the ID of oneself can be updated by user at any time, for instance, amendment, add or deletion etc..
Under Voice command pattern, after intelligent television receives the phonetic order of user's input, word speed computing unit 11, according to the phonetic order received, calculates this user and sends the speech speed of phonetic order, wherein, in how having about the calculation of speech speed, for example, it is possible to the phonetic order that user inputs is converted into word, obtain the duration of this phonetic order simultaneously, thus obtaining the number of characters that each second, user sent, as speech speed;Or the voice signal received is carried out waveform analysis, to obtain the speech speed of user, if the signal received is digital signal, then after converting digital signals into analogue signal, carries out waveform analysis, to obtain the speech speed of user.
After obtaining ID, voice match module 20 searches the sub-sound bank associated with ID from the received pronunciation storehouse pre-build, and is mated with the phonetic order received by the sound template of storage in the sub-sound bank found.
Instruction performs module 30, for when there is the sound template mated with described phonetic order in described sub-sound bank, controlling intelligent television according to the control instruction corresponding with the described sound template that described phonetic order mates and performing corresponding operation.
When finding, in the sub-sound bank corresponding in this ID, the sound template that the phonetic order sent with user mates, instruction performs module 30 and controls, according to the control instruction that sound template is corresponding, the operation that intelligent television execution is corresponding.
Further, this intelligent television also includes:
Display module, the text message that display is corresponding with the sound template of described phonetic order coupling on the display interface at described intelligent television;And on the display interface of described intelligent television, show the result after the intelligent television described control instruction of execution in the form of text.In other embodiments, intelligent television can also perform the result after described control instruction with speech form feedback intelligent TV.
nullThe intelligent television that the present embodiment proposes,When receiving the phonetic order of user's input,The ID of user is determined according to the phonetic order received,The sub-sound bank associated with this ID is searched from the received pronunciation storehouse pre-build,And the sound template of storage in the sub-sound bank found is mated with the phonetic order received,When group sound bank has sound template to mate with phonetic order,Control intelligent television according to the control instruction corresponding with the sound template that phonetic order mates and perform corresponding operation,The present invention is in advance for using each user of this intelligent television to set up sub-sound bank,And associate with the ID of this user,When sound template in carrying out sound bank mates with phonetic order,Have only to after determining the user that the phonetic order received is corresponding,Obtain the ID of this user,The sub-sound bank directly associated to this ID carries out voice match,Decrease the amount of calculation of voice match,And contrast without the sound bank to high in the clouds,Drastically increase the speed of speech recognition,And then accelerate the response speed of phonetic order.
With reference to shown in Fig. 7, based on the second embodiment of the first embodiment proposition intelligent television of the present invention of intelligent television of the present invention.In the present embodiment, described method and first embodiment are distinctive in that, this intelligent television also includes with lower module:
Word speed acquisition module 40, for, under speech data memory module, when receiving the speech data of user's input, resolving described speech data, obtain the speech speed of described speech data;
Data memory module 50 is corresponding with described speech speed in sound bank for being stored by described speech data according to the speech speed that gets.
User can record in the locally stored space that the voice of oneself stores intelligent television as sound template, when carrying out Voice command, intelligent television can be made directly local voice identification, contrast without the sound bank to high in the clouds, improve the speed of speech recognition, further, the voice in local voice storehouse can also be uploaded to high in the clouds as backup by user.
When setting up sub-sound bank, user controls intelligent television and enters speech data memory module, when receiving the speech data of user's input, data memory module 50 is according to the speech speed of speech data, namely the word speed of user is allocated, so, when the word speed of active user is middling speed, then data memory module 50 is corresponding with speech speed in sound bank by automatically being stored by the speech data of this user.In other embodiments, can also be which user according to other mode identification, and its speech data stored correspondence in sound bank, such as, not equal by tone color, or, when user sends speech data by mobile terminals such as mobile phones, distinguish user according to the mark of mobile terminal.
Or, in other examples, DTV can also include:
Sound bank sets up module, and voice, under speech data memory module, sets up sub-sound bank, the ID that the described sub-sound bank set up inputs with user is associated;Data memory module 50, is additionally operable to when receiving the speech data of user's input, is stored by the speech data received in the described sub-sound bank associated with the described ID set up as sound template.
Speech data memory module is set, carrying out when arranging of sound template, enter into speech data memory module, sound bank is set up module and is set up sub-sound bank, the ID that the described sub-sound bank set up inputs with user is associated, data memory module 50 receives the speech data of user's input, is stored by the speech data received in the described sub-sound bank associated with the described ID set up as sound template.
The intelligent television that the present embodiment proposes, for using each user of this intelligent television to set up sub-sound bank, and associate with the ID of this user, when sound template in carrying out sound bank mates with phonetic order, can according to the speech speed of this user, the speech data of its correspondence is assigned in the sub-sound bank of correspondence, drastically increases the speed of speech recognition, and then accelerate the response speed of phonetic order.
The 3rd embodiment based on the first embodiment proposition intelligent television of the present invention of intelligent television of the present invention.In the present embodiment, described method and first embodiment are distinctive in that, this intelligent television also includes with lower module:
Number of times is new module more, for updating the access times of the sound template mated with described phonetic order, and with described sound template, the access times after described renewal is associated storage.
With reference to shown in Fig. 8, in the renewal carrying out the access times of sound template in number of times more new module the basis associating storage, voice match module 20 can include following refinement unit:
Template sequencing unit 21, for searching the sub-sound bank associated with described ID, by the sound template in described sub-sound bank according to access times by how to be at least ranked up from the received pronunciation storehouse pre-build;
Voice match unit 22, is used for according to access times by many orders at least, is mated one by one with the described phonetic order received by the sound template of storage in described sub-sound bank.
After the phonetic order that successful match inputs to user each time, all the sound template matched is carried out the renewal of access times, each sound template in sub-sound bank is storing after on intelligent television, its initial access times are zero, every successful match is once, this number of times is increased by 1, count user in this manner and use the number of times of each sound template, when carrying out the coupling of phonetic order, template sequencing unit 21 by the sound template in sub-sound bank according to access times by how to be at least ranked up, then voice match unit 22 according to access times by many orders at least, the sound template of storage in sub-sound bank is mated one by one with the described phonetic order received.
The intelligent television that the present embodiment proposes, carry out the renewal of access times of sound template and associate storage basis on, by when in sub-sound bank, the sound template of storage mates with the phonetic order received, mate one by one by many orders at least according to access times, so, some commonly used phonetic orders just can match the sound template of correspondence more rapidly, further increases the speed of speech recognition, and then accelerates the response speed of phonetic order.
These are only the preferred embodiments of the present invention; not thereby the scope of the claims of the present invention is limited; every equivalent structure utilizing description of the present invention and accompanying drawing content to make or equivalence flow process conversion; or directly or indirectly it is used in other relevant technical fields, all in like manner include in the scope of patent protection of the present invention.

Claims (10)

1. the sound control method of an intelligent television, it is characterised in that the sound control method of described intelligent television includes:
Under Voice command pattern, when receiving the phonetic order of user's input, determine the ID of described user according to the described phonetic order received;
From the received pronunciation storehouse pre-build, search the sub-sound bank associated with described ID, and the sound template of storage in the described sub-sound bank found is mated with the described phonetic order received;
When described sub-sound bank exists the sound template mated with described phonetic order, control intelligent television according to the control instruction corresponding with the described sound template that described phonetic order mates and perform corresponding operation.
2. the sound control method of intelligent television according to claim 1, it is characterised in that the sound control method of described intelligent television further comprises the steps of:
Under speech data memory module, when receiving the speech data of user's input, described speech data is resolved, obtains the speech speed of described speech data;
According to the speech speed got described speech data stored corresponding with described speech speed in sound bank.
3. the sound control method of intelligent television according to claim 1, it is characterized in that, described under Voice command pattern, when receiving the phonetic order of user's input, determine that according to the described phonetic order received the step of the ID of described user includes:
When receiving the phonetic order of user's input under Voice command pattern, calculate described user and send the speech speed of described phonetic order;
Determine that the speech speed that described speech speed is corresponding is interval, obtain the ID that described speech speed interval is corresponding.
4. the sound control method of intelligent television according to claim 1, it is characterized in that, it is described when described sub-sound bank exists the sound template mated with described phonetic order, after controlling, according to the control instruction corresponding with the described sound template that described phonetic order mates, the step that described intelligent television performs corresponding operation, the sound control method of described intelligent television further comprises the steps of:
Update the access times of the sound template mated with described phonetic order, and the access times after described renewal are associated storage with described sound template.
5. the sound control method of intelligent television according to claim 4, it is characterized in that, the described sub-sound bank associated with described ID of searching from the received pronunciation storehouse pre-build, and the step that the sound template of storage in the described sub-sound bank found carries out mating with the described phonetic order received is included:
The sub-sound bank associated with described ID is searched, by the sound template in described sub-sound bank according to access times by how to be at least ranked up from the received pronunciation storehouse pre-build;
According to access times by many orders at least, the sound template of storage in described sub-sound bank is mated one by one with the described phonetic order received.
6. an intelligent television, it is characterised in that described intelligent television includes:
Mark determines module, for, under Voice command pattern, when receiving the phonetic order of user's input, determining the ID of described user according to the described phonetic order received;
Voice match module, for searching the sub-sound bank associated with described ID from the received pronunciation storehouse pre-build, and mates the sound template of storage in the described sub-sound bank found with the described phonetic order received;
Instruction performs module, for when there is the sound template mated with described phonetic order in described sub-sound bank, controlling intelligent television according to the control instruction corresponding with the described sound template that described phonetic order mates and performing corresponding operation.
7. intelligent television according to claim 6, it is characterised in that described intelligent television also includes:
Word speed acquisition module, for, under speech data memory module, when receiving the speech data of user's input, resolving described speech data, obtain the speech speed of described speech data;
Data memory module is corresponding with described speech speed in sound bank for being stored by described speech data according to the speech speed that gets.
8. intelligent television according to claim 6, it is characterised in that described mark determines that module includes:
Word speed computing unit, when being used for the phonetic order receiving user's input under Voice command pattern, calculates described user and sends the speech speed of described phonetic order;
Mark acquiring unit, interval for determining the speech speed that described speech speed is corresponding, obtain the ID that described speech speed interval is corresponding.
9. intelligent television according to claim 6, it is characterised in that described intelligent television also includes:
Number of times is new module more, for updating the access times of the sound template mated with described phonetic order, and with described sound template, the access times after described renewal is associated storage.
10. intelligent television according to claim 9, it is characterised in that described voice match module includes:
Template sequencing unit, for searching the sub-sound bank associated with described ID, by the sound template in described sub-sound bank according to access times by how to be at least ranked up from the received pronunciation storehouse pre-build;
Voice match unit, is used for according to access times by many orders at least, is mated one by one with the described phonetic order received by the sound template of storage in described sub-sound bank.
CN201610109679.7A 2016-02-26 2016-02-26 Smart television and voice control method of the smart television Pending CN105791931A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201610109679.7A CN105791931A (en) 2016-02-26 2016-02-26 Smart television and voice control method of the smart television
PCT/CN2016/084869 WO2017143692A1 (en) 2016-02-26 2016-06-04 Smart television and voice control method therefor

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610109679.7A CN105791931A (en) 2016-02-26 2016-02-26 Smart television and voice control method of the smart television

Publications (1)

Publication Number Publication Date
CN105791931A true CN105791931A (en) 2016-07-20

Family

ID=56402906

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610109679.7A Pending CN105791931A (en) 2016-02-26 2016-02-26 Smart television and voice control method of the smart television

Country Status (2)

Country Link
CN (1) CN105791931A (en)
WO (1) WO2017143692A1 (en)

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106384592A (en) * 2016-11-22 2017-02-08 浙江圣奥家具制造有限公司 Smart voice controlled table and control method thereof
CN106778927A (en) * 2016-12-30 2017-05-31 深圳Tcl新技术有限公司 Update TV semantics recognition dictionary method and device
CN106997763A (en) * 2017-03-17 2017-08-01 浙江大学 A kind of air conditioning control device based on the processing of voice signal frequency domain
WO2018027843A1 (en) * 2016-08-11 2018-02-15 张焰焰 Data acquisition method and television set of television instruction input technology
CN108172221A (en) * 2016-12-07 2018-06-15 广州亿航智能技术有限公司 The method and apparatus of manipulation aircraft based on intelligent terminal
CN108319829A (en) * 2017-01-11 2018-07-24 中兴通讯股份有限公司 A kind of voice print verification method and apparatus
CN108932942A (en) * 2018-06-26 2018-12-04 四川斐讯信息技术有限公司 A kind of interactive system and method for realization intelligent sound box
CN109036424A (en) * 2018-08-30 2018-12-18 出门问问信息科技有限公司 Audio recognition method, device, electronic equipment and computer readable storage medium
CN109065056A (en) * 2018-09-26 2018-12-21 珠海格力电器股份有限公司 Method and device for controlling air conditioner through voice
CN109089140A (en) * 2017-06-14 2018-12-25 北京优朋普乐科技有限公司 A kind of sound control method and device
CN109119076A (en) * 2018-08-02 2019-01-01 重庆柚瓣家科技有限公司 A kind of old man user exchanges the collection system and method for habit
CN109741738A (en) * 2018-12-10 2019-05-10 平安科技(深圳)有限公司 Sound control method, device, computer equipment and storage medium
CN110010131A (en) * 2019-04-04 2019-07-12 深圳市语芯维电子有限公司 A kind of method and apparatus of speech signal analysis
CN110020219A (en) * 2017-11-09 2019-07-16 北京京东尚科信息技术有限公司 Information processing method and device for server
CN110136700A (en) * 2019-03-15 2019-08-16 湖北亿咖通科技有限公司 A kind of voice information processing method and device
CN110459222A (en) * 2019-09-06 2019-11-15 Oppo广东移动通信有限公司 Sound control method, phonetic controller and terminal device
CN110570846A (en) * 2018-06-05 2019-12-13 青岛海信移动通信技术股份有限公司 Voice control method and device and mobile phone
CN111081248A (en) * 2019-12-27 2020-04-28 安徽仁昊智能科技有限公司 Artificial intelligence speech recognition device
CN111105798A (en) * 2018-10-29 2020-05-05 宁波方太厨具有限公司 Equipment control method based on voice recognition
CN111192573A (en) * 2018-10-29 2020-05-22 宁波方太厨具有限公司 Equipment intelligent control method based on voice recognition
CN111312253A (en) * 2018-12-11 2020-06-19 青岛海尔洗衣机有限公司 Voice control method, cloud server and terminal equipment
CN111916084A (en) * 2020-09-09 2020-11-10 深圳创维-Rgb电子有限公司 Smart home voice control method and device, equipment and storage medium
CN112017653A (en) * 2020-07-13 2020-12-01 武汉戴美激光科技有限公司 Laser treatment handle with voice recognition function and adjusting method
CN112516584A (en) * 2020-12-21 2021-03-19 上海连尚网络科技有限公司 Control method and device for game role
CN112885354A (en) * 2021-01-25 2021-06-01 海信视像科技股份有限公司 Display device, server and display control method based on voice
CN113593554A (en) * 2021-07-21 2021-11-02 深圳市芯中芯科技有限公司 Voice recognition offline command word awakening application method and system

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108419109A (en) * 2018-03-06 2018-08-17 杭州政信金服互联网科技有限公司 A kind of meeting live streaming sound adjusting method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120226502A1 (en) * 2011-03-01 2012-09-06 Kabushiki Kaisha Toshiba Television apparatus and a remote operation apparatus
CN102750126A (en) * 2012-06-27 2012-10-24 深圳Tcl新技术有限公司 Speech input method and terminal
CN103456303A (en) * 2013-08-08 2013-12-18 四川长虹电器股份有限公司 Method for controlling voice and intelligent air-conditionier system
CN103903621A (en) * 2012-12-26 2014-07-02 联想(北京)有限公司 Method for voice recognition and electronic equipment
CN104778946A (en) * 2014-01-10 2015-07-15 中国电信股份有限公司 Voice control method and system

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004053742A (en) * 2002-07-17 2004-02-19 Matsushita Electric Ind Co Ltd Speech recognition device
CN101478648B (en) * 2008-10-17 2012-08-22 康佳集团股份有限公司 Voice control method for television set
CN101894553A (en) * 2010-07-23 2010-11-24 四川长虹电器股份有限公司 Realization method of television voice control
KR20120080069A (en) * 2011-01-06 2012-07-16 삼성전자주식회사 Display apparatus and voice control method thereof
CN102708858A (en) * 2012-06-27 2012-10-03 厦门思德电子科技有限公司 Voice bank realization voice recognition system and method based on organizing way

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120226502A1 (en) * 2011-03-01 2012-09-06 Kabushiki Kaisha Toshiba Television apparatus and a remote operation apparatus
CN102750126A (en) * 2012-06-27 2012-10-24 深圳Tcl新技术有限公司 Speech input method and terminal
CN103903621A (en) * 2012-12-26 2014-07-02 联想(北京)有限公司 Method for voice recognition and electronic equipment
CN103456303A (en) * 2013-08-08 2013-12-18 四川长虹电器股份有限公司 Method for controlling voice and intelligent air-conditionier system
CN104778946A (en) * 2014-01-10 2015-07-15 中国电信股份有限公司 Voice control method and system

Cited By (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018027843A1 (en) * 2016-08-11 2018-02-15 张焰焰 Data acquisition method and television set of television instruction input technology
CN106384592A (en) * 2016-11-22 2017-02-08 浙江圣奥家具制造有限公司 Smart voice controlled table and control method thereof
CN108172221A (en) * 2016-12-07 2018-06-15 广州亿航智能技术有限公司 The method and apparatus of manipulation aircraft based on intelligent terminal
CN106778927A (en) * 2016-12-30 2017-05-31 深圳Tcl新技术有限公司 Update TV semantics recognition dictionary method and device
CN108319829A (en) * 2017-01-11 2018-07-24 中兴通讯股份有限公司 A kind of voice print verification method and apparatus
CN106997763A (en) * 2017-03-17 2017-08-01 浙江大学 A kind of air conditioning control device based on the processing of voice signal frequency domain
CN109089140A (en) * 2017-06-14 2018-12-25 北京优朋普乐科技有限公司 A kind of sound control method and device
CN110020219A (en) * 2017-11-09 2019-07-16 北京京东尚科信息技术有限公司 Information processing method and device for server
CN110570846A (en) * 2018-06-05 2019-12-13 青岛海信移动通信技术股份有限公司 Voice control method and device and mobile phone
CN108932942A (en) * 2018-06-26 2018-12-04 四川斐讯信息技术有限公司 A kind of interactive system and method for realization intelligent sound box
CN109119076A (en) * 2018-08-02 2019-01-01 重庆柚瓣家科技有限公司 A kind of old man user exchanges the collection system and method for habit
CN109036424A (en) * 2018-08-30 2018-12-18 出门问问信息科技有限公司 Audio recognition method, device, electronic equipment and computer readable storage medium
CN109065056A (en) * 2018-09-26 2018-12-21 珠海格力电器股份有限公司 Method and device for controlling air conditioner through voice
CN109065056B (en) * 2018-09-26 2021-05-11 珠海格力电器股份有限公司 Method and device for controlling air conditioner through voice
CN111192573B (en) * 2018-10-29 2023-08-18 宁波方太厨具有限公司 Intelligent control method for equipment based on voice recognition
CN111105798B (en) * 2018-10-29 2023-08-18 宁波方太厨具有限公司 Equipment control method based on voice recognition
CN111105798A (en) * 2018-10-29 2020-05-05 宁波方太厨具有限公司 Equipment control method based on voice recognition
CN111192573A (en) * 2018-10-29 2020-05-22 宁波方太厨具有限公司 Equipment intelligent control method based on voice recognition
CN109741738A (en) * 2018-12-10 2019-05-10 平安科技(深圳)有限公司 Sound control method, device, computer equipment and storage medium
CN111312253A (en) * 2018-12-11 2020-06-19 青岛海尔洗衣机有限公司 Voice control method, cloud server and terminal equipment
CN110136700B (en) * 2019-03-15 2021-04-20 湖北亿咖通科技有限公司 Voice information processing method and device
CN110136700A (en) * 2019-03-15 2019-08-16 湖北亿咖通科技有限公司 A kind of voice information processing method and device
CN110010131B (en) * 2019-04-04 2022-01-04 深圳市语芯维电子有限公司 Voice information processing method and device
CN110010131A (en) * 2019-04-04 2019-07-12 深圳市语芯维电子有限公司 A kind of method and apparatus of speech signal analysis
CN110459222A (en) * 2019-09-06 2019-11-15 Oppo广东移动通信有限公司 Sound control method, phonetic controller and terminal device
CN111081248A (en) * 2019-12-27 2020-04-28 安徽仁昊智能科技有限公司 Artificial intelligence speech recognition device
CN112017653A (en) * 2020-07-13 2020-12-01 武汉戴美激光科技有限公司 Laser treatment handle with voice recognition function and adjusting method
CN111916084A (en) * 2020-09-09 2020-11-10 深圳创维-Rgb电子有限公司 Smart home voice control method and device, equipment and storage medium
CN112516584A (en) * 2020-12-21 2021-03-19 上海连尚网络科技有限公司 Control method and device for game role
CN112516584B (en) * 2020-12-21 2024-06-04 上海连尚网络科技有限公司 Game role control method and device
CN112885354A (en) * 2021-01-25 2021-06-01 海信视像科技股份有限公司 Display device, server and display control method based on voice
CN112885354B (en) * 2021-01-25 2022-09-23 海信视像科技股份有限公司 Display device, server and display control method based on voice
CN113593554A (en) * 2021-07-21 2021-11-02 深圳市芯中芯科技有限公司 Voice recognition offline command word awakening application method and system

Also Published As

Publication number Publication date
WO2017143692A1 (en) 2017-08-31

Similar Documents

Publication Publication Date Title
CN105791931A (en) Smart television and voice control method of the smart television
CN102842306B (en) Sound control method and device, voice response method and device
CN108831469B (en) Voice command customizing method, device and equipment and computer storage medium
CN107644638B (en) Audio recognition method, device, terminal and computer readable storage medium
CN106782526B (en) Voice control method and device
US9218052B2 (en) Framework for voice controlling applications
CN106205615B (en) Control method and system based on voice interaction
CN106098063B (en) Voice control method, terminal device and server
CN107655154A (en) Terminal control method, air conditioner and computer-readable recording medium
CN109378006B (en) Cross-device voiceprint recognition method and system
CN105469789A (en) Voice information processing method and voice information processing terminal
CN110827826B (en) Method for converting words by voice and electronic equipment
CN104123938A (en) Voice control system, electronic device and voice control method
CN103699530A (en) Method and equipment for inputting texts in target application according to voice input information
CN103092928B (en) Voice inquiry method and system
CN110992955A (en) Voice operation method, device, equipment and storage medium of intelligent equipment
CN109074804B (en) Accent-based speech recognition processing method, electronic device, and storage medium
CN104808794A (en) Method and system for inputting lip language
CN107155121B (en) Voice control text display method and device
CN111161731A (en) Intelligent off-line voice control device for household electrical appliances
CN109215660A (en) Text error correction method and mobile terminal after speech recognition
CN110660391A (en) Method and system for customizing voice control of large-screen terminal based on RPA (resilient packet Access) interface
CN108897517B (en) Information processing method and electronic equipment
CN105529025B (en) Voice operation input method and electronic equipment
CN107894882B (en) Voice input method of mobile terminal

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160720

RJ01 Rejection of invention patent application after publication