CN101051323A - Character input method, input method system and method for updating word stock - Google Patents

Info

Publication number
CN101051323A
Authority
CN
China
Prior art keywords
dictionary
cell
word set
information
input method
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2007100994746A
Other languages
Chinese (zh)
Other versions
CN100483416C (en)
Inventor
马占凯
杨磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed: https://patents.darts-ip.com/?family=38782735&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=CN101051323(A) ("Global patent litigation dataset" by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.)
Application filed by Beijing Sogou Technology Development Co Ltd
Priority to CNB2007100994746A (CN100483416C)
Publication of CN101051323A
Priority to PCT/CN2008/071027 (WO2008141583A1)
Application granted
Publication of CN100483416C
Ceased (current legal status)
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02 Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023 Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233 Character input methods

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)

Abstract

A character input system comprises an input interface unit, an information conversion unit, a display output unit, a system dictionary and a cell word set. The system dictionary records basic words and their related information; the cell word set records extended words and their related information. The cell word set is obtained from at least one of a plurality of cell dictionaries stored on a server, and the words in each cell dictionary share at least one common attribute.

Description

A character input method, an input method system, and a dictionary updating method
Technical Field
The present invention relates to the field of character input, and in particular to a character input method, an input method system, a dictionary updating method, and a dictionary delivery system.
Background
With the spread and development of computer and Internet technology, users from different professions, with different interests and usage habits, expect input method systems to be ever more intelligent.
When evaluating how intelligent an input method is, the accuracy of the first candidate is a very important criterion, and the ordering of the candidates also matters greatly. The input method dictionary, which records entries and word-frequency information, is one of the key factors affecting both. If the word the user wants is present in the dictionary and its word frequency closely matches the user's habits, the first-candidate accuracy and the candidate ordering for that user will be satisfactory.
At present, however, an input method dictionary generally covers only part of the vocabulary people use. It mainly contains the words common to everyone and cannot include every word. Existing input method dictionaries are standardized and target all users; if the vocabulary of every user were added, the dictionary would grow to millions of entries. With an oversized dictionary there are too many homophones and more candidates, so users who never need those words are disturbed. Such a huge dictionary also inevitably consumes significant computing resources such as CPU and memory, which is unacceptable on a PC.
For example, besides many common words (such as "now", "time" and "how much"), each person also enters words used by only a small group of people, for example game terms such as "Azeroth" and "Druid", or the title of a recent film such as "The Knot". Such words are entered frequently by small groups, for example World of Warcraft players, professional chemical engineers, or biology teachers. Across all users, however, their usage rate is extremely low, so an input method dictionary of the existing kind cannot include them all. As a result, in the prior art the first-candidate accuracy is very low when a user enters words that are common only within such a small group, which seriously harms the user experience and the expression of the user's ideas.
In short, the technical problem that those skilled in the art urgently need to solve is how to improve the input method dictionary so that it both fits within the resources of existing computing devices and greatly improves each user's input efficiency.
Summary of the Invention
The technical problem to be solved by the present invention is to provide a new input method dictionary model and a complete input solution that fits within the resources of existing computing devices, does not occupy additional computing resources, and significantly improves each user's input efficiency.
To solve the above problem, the invention discloses an input method system comprising an input interface unit, an information conversion unit and a display output unit, and further comprising:
A system dictionary, used to record basic words and their related information;
A cell word set, used to record extended words and their related information. The cell word set is obtained from at least one cell dictionary selected from a plurality of cell dictionaries stored on the server; the words in each cell dictionary share at least one common attribute.
Preferably, the input method system may further comprise an automatic update module, used to obtain the required update data from the server according to the list of existing cell dictionaries.
Preferably, the types of related information stored in the cell word set are fewer than or equal to those stored in the system dictionary. At least one of the plurality of cell dictionaries is generated manually.
Further, the input method system may also comprise a user dictionary.
Further, the input method system may also comprise an adding module, used to add the obtained cell dictionary entry information to the cell word set. The cell word set is either a single independent dictionary or a collection of several dictionaries stored side by side. Preferably, the adding process is carried out in a separate buffer dictionary.
Further, the input method system may also comprise a cell dictionary deactivation module, used to receive a user instruction and remove from the cell word set the entry records belonging to the cell dictionary the user selected.
According to an embodiment of the invention, a character input method is also disclosed, comprising:
Loading the system dictionary and the cell word set, where the cell word set records extended words and their related information and is obtained from at least one cell dictionary selected from a plurality of cell dictionaries stored on the server, and the words in each cell dictionary share at least one common attribute;
Receiving the user's input information;
Searching the system dictionary and the cell word set according to the received input information to obtain the corresponding candidates;
Receiving the user's selection and committing the specified candidate to the screen.
The loading may merge the cell word set and the system dictionary into one dictionary and place it in the cache. Alternatively, the loading may place the cell word set and the system dictionary in the cache as two or more independent dictionaries and set dictionary priorities according to a preset rule; the priorities are used to order the displayed candidates.
Preferably, the cell word set records, for each entry, the cell dictionary it belongs to and the priority of that cell dictionary; the priority is used to order the displayed candidates.
Further, the method may also comprise dynamically adjusting the cell dictionary priorities during loading according to the environment in which the input method is used.
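A minimal sketch of priority-based candidate ordering across independently loaded dictionaries may help here. The data layout, the function name `lookup`, the sample frequencies and the convention that a smaller number means a higher priority are all assumptions made for illustration; the source does not specify them.

```python
from typing import Dict, List, Tuple

# Hypothetical layout: a dictionary maps an input code to (word, frequency) pairs.
Lexicon = Dict[str, List[Tuple[str, int]]]

def lookup(code: str, lexicons: List[Tuple[int, Lexicon]]) -> List[str]:
    """Collect candidates for `code` from every loaded dictionary.

    `lexicons` holds (priority, dictionary) pairs; a smaller number is
    assumed to mean a higher priority. Candidates are ordered by priority
    first, then by descending frequency; a duplicate word keeps its best slot.
    """
    scored = []
    for priority, lexicon in lexicons:
        for word, freq in lexicon.get(code, []):
            scored.append((priority, -freq, word))
    scored.sort()
    seen, result = set(), []
    for _, _, word in scored:
        if word not in seen:
            seen.add(word)
            result.append(word)
    return result

# Illustrative data only: English glosses stand in for the Chinese words.
system_dictionary = {"yanjiu": [("research", 900), ("tobacco and wine", 300)]}
daily_life_cell = {"yanjiu": [("tobacco and wine", 50)]}
```

Adjusting priorities to the usage environment then amounts to passing different priority numbers: giving `daily_life_cell` priority 0 and `system_dictionary` priority 1 moves "tobacco and wine" to the front, while the opposite assignment keeps "research" first.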
According to another embodiment of the invention, a dictionary updating method is also disclosed. The dictionary being updated involves a cell word set that records extended words and their related information; the cell word set is obtained from at least one cell dictionary selected from a plurality of cell dictionaries stored on the server, and the words in each cell dictionary share at least one common attribute. The method comprises:
Upon being triggered, comparing the list of existing cell dictionaries with the server-side list of cell dictionaries to obtain the list of dictionaries that need updating;
Downloading the entry information of the cell dictionaries that need updating and adding it to the cell word set.
Further, the method may also comprise manually or automatically upgrading the cell dictionaries stored on the server and changing the corresponding version information. Preferably, the adding process is carried out in a separate buffer dictionary.
According to another embodiment of the invention, a dictionary delivery system is also disclosed, comprising:
A cell dictionary generation unit, comprising: an interface module, used to receive input information; a generation module, used to generate a cell dictionary from the received information; and an identification module, used to assign an identifier and version information to each cell dictionary; where the words in each cell dictionary share at least one common attribute;
A communication unit, used to transmit the corresponding cell dictionary entry information to the client upon being triggered.
Further, the cell dictionary generation unit may also comprise a modification and update module, used to modify and update the stored information of a cell dictionary and to notify the identification module to generate new version information for that cell dictionary.
Further, the dictionary delivery system may also comprise a recognition module, used to compare the server-side list of cell dictionaries with the client's list of cell dictionaries; the comparison result is used to transmit the required update data to the client.
Preferably, the cell dictionary generated from the received information stores a plurality of entries; alternatively, it stores index information that refers to other cell dictionaries.
Further, the dictionary delivery system may also comprise a merging module, used to merge the entry information of several cell dictionaries into one download dictionary and to notify the communication unit to transmit that download dictionary to the client.
Compared with the prior art, the present invention has the following advantages:
The invention changes the standard input method dictionary of the prior art, which targets all users, into two parts: a system dictionary and a cell word set. The system dictionary still targets all users and consists mainly of general-purpose words. For the cell word set, the server provides a plurality of cell dictionaries; each user selects those that suit them best, and the selected dictionaries are then merged. The input method dictionary a user finally works with therefore stays at the capacity level of existing dictionaries, while each user's personalized selection lets it cover nearly all of that user's vocabulary with fairly accurate word-frequency information. This can greatly improve first-candidate accuracy and yield a candidate ordering that better matches the user's habits.
The invention realizes a dynamic, cell-based dictionary at the capacity level of existing input method dictionaries. Small dictionaries are added manually by the user or automatically by the computer; through each user's personalized selection or customization, through automatic updating, and through combined use with the system dictionary, nearly all of the user's vocabulary can be covered. The user can thus enter almost any word or sentence, and the first-candidate accuracy of the input method can be raised significantly. In theory, an individual's dictionary coverage is extended to the maximum, so typing accuracy improves considerably.
By using a plurality of cell dictionaries that can be upgraded automatically, the invention keeps an individual's dictionary up to date. The vocabulary stays fresh without any effort by the user, so even as the Internet changes rapidly, first-candidate accuracy improves, which in turn raises typing speed noticeably, reduces the occurrence of unrecognized new words, and reduces the number of page turns.
The invention also provides a dictionary delivery system that helps users manually create cell dictionaries for the groups they belong to, and update and modify them. An automatic update function is added on the client, so accurately classified cell dictionaries can be obtained and updated automatically, keeping the user's vocabulary current.
Brief Description of the Drawings
Fig. 1 is a structural block diagram of an embodiment of the input method system;
Fig. 2 is a flow chart of the steps of an embodiment of the character input method;
Fig. 3 is a structural block diagram of an embodiment of the dictionary delivery system;
Fig. 4 is a flow chart of the steps of an embodiment of the automatic dictionary updating method.
Detailed Description
To make the above objects, features and advantages of the invention clearer, the invention is described in further detail below with reference to the drawings and specific embodiments.
The invention can be applied to input method platforms with various input modes, including keyboard symbols, handwriting and voice input. That is, the input information may be a coded character string, or it may be handwriting or voice input, since these input modes also rely on a dictionary to order candidates. Because the information conversion in these input modes is known technology, it is not described in detail here. The following description uses coded character string input only as an example.
Referring to Fig. 1, an embodiment of an input method system of the invention is shown, which may specifically comprise:
An input interface unit 101, used to receive the information entered by the user;
An information conversion unit 102, used to convert the user's input into the corresponding candidates, for example by receiving keyboard characters and performing code conversion;
A display output unit 103, used to display the candidates, receive the user's selection, and commit the result to the screen;
A system dictionary 104, used to record basic words and their related information;
A cell word set 105, representing a set of cell dictionaries and used to record extended words and their related information. The cell word set is obtained from at least one cell dictionary selected from a plurality of cell dictionaries stored on the server; the words in each cell dictionary share at least one common attribute.
During character input, the system dictionary and the cell word set are searched according to a preset strategy, completing an input process that meets the user's personalized needs.
A cell dictionary is a dictionary with some common characteristic, used by a particular group, a particular individual or certain people (that is, the words in each cell dictionary share at least one common attribute). Examples include a dictionary of recent films, a dictionary of the latest song titles, a World of Warcraft dictionary, a biology dictionary, a dictionary of the names of everyone at Tsinghua University, a dictionary of the names of everyone at a given company, and a dictionary of place names in Haidian District. Cell dictionaries may be obtained by having a managing organization or server cluster classify and parse data automatically. Alternatively, a server platform may be provided on which users manually create the cell dictionaries of their own groups, to better serve personalized groups. Preferably, therefore, at least one of the plurality of cell dictionaries in this embodiment is generated manually by a user.
In the prior art, input method platforms run on many kinds of computing devices, such as PCs, personal digital assistants and mobile terminals. The invention is likewise applicable to all such devices, and its operating environment need not be restricted.
The following briefly introduces character input for languages that require code conversion, such as Chinese, Korean and Japanese, using Chinese input as the example:
In Chinese, the basic unit of the language, the Chinese character, has no direct correspondence with the keys on a keyboard, so an input method is needed to perform the conversion. Chinese characters must first be encoded as letters, digits and other symbols that can be typed directly. The most commonly used code is pinyin (including variants such as abbreviated pinyin, double pinyin and fuzzy pinyin). The user enters the coded string for the Chinese characters with the keyboard (in some cases with the mouse, for example on a soft keyboard). The operating system passes the keyboard input to the input method, which decodes it. Because different Chinese character sequences (words, sentences) may share the same code, the input method usually presents a candidate list for the user to choose from. For a pinyin input method, for example, the process may include the following steps:
A. Pinyin parsing: the input string is segmented into pinyin syllables, for example zhuanli → [zhuan] [li]. The segmentation is sometimes not unique, for example fangan → [fang] [an] or [fan] [gan] (corresponding to 方案 "scheme" and 反感 "dislike" respectively). Preferably, the input method supports abbreviated pinyin, allowing input such as zl, zhl, zhuanl, zhli and so on. Since some users' pronunciation is non-standard, fuzzy pinyin such as zuanli may also be supported. Forms such as double pinyin may be used as well.
B. Chinese character decoding: the corresponding words are looked up in the dictionary according to the pinyin sequence obtained by segmentation, or a corresponding sentence is generated by some algorithm.
C. The user selects the desired content, which is committed to the screen (possibly followed by word creation or sentence creation).
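The pinyin parsing in step A can be sketched as a recursive search that enumerates every way of splitting the input string into known syllables. This is only an illustration: the syllable inventory below is deliberately tiny and hypothetical (a real table has a few hundred syllables), and abbreviated or fuzzy pinyin is not handled.

```python
from typing import List

# Tiny illustrative inventory; a real pinyin table is far larger.
SYLLABLES = {"zhuan", "li", "fang", "an", "fan", "gan"}

def segment(code: str) -> List[List[str]]:
    """Return every way to split `code` into syllables from SYLLABLES."""
    if not code:
        return [[]]  # exactly one way to split the empty string
    results = []
    for end in range(1, len(code) + 1):
        head = code[:end]
        if head in SYLLABLES:
            # Keep this syllable and segment the remainder recursively.
            for rest in segment(code[end:]):
                results.append([head] + rest)
    return results
```

With this inventory, `segment("zhuanli")` yields the single split `[["zhuan", "li"]]`, while `segment("fangan")` yields both `["fan", "gan"]` and `["fang", "an"]`, which is the ambiguity that step B has to resolve with the dictionary.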
Because different Chinese character sequences may share the same code, the input method has to guess the user's real intention for a given coded string, and doing so requires the support of a dictionary.
For the invention, the dictionary may contain various kinds of language information, for example:
(1) Entries
An input method could be built on single characters, but because the word is the smallest unit of meaning in Chinese, modern input methods make heavy use of word entries. For example, when the user enters the pinyin "zhuan" alone, it is hard to tell which of the characters read "zhuan" (such as those meaning "turn", "special", "earn" or "brick") is intended. Likewise, when the user enters "li" alone, it is hard to tell which of the many characters read "li" is intended. But if the user enters the two syllables "zhuanli" together, it is almost certain that the intended word is "patent". This greatly improves the accuracy of the input method's first candidate.
(2) Word frequency
Homophonous characters exist in large numbers, and homophonous words exist too. In such cases all the options can only be listed for the user to choose from, but a candidate's position strongly affects the usability of the input method. In general, placing commonly used words near the front benefits the user; that is, word frequency is an important basis for ordering candidates.
In addition, many existing input methods integrate automatic sentence construction, and here too word-frequency information is an important basis.
The two kinds of language information above are indispensable in an input method dictionary. The dictionary of the invention may also contain other information that helps improve accuracy, for example:
Language connection relations. When constructing a sentence, the input method needs to consider not only word frequency but also the connection relations between words. For example, the particle 的 usually follows adjectives, nouns and pronouns, while the particle 地 usually follows adverbs. In this case, if the user has entered "de", it is not enough to look only at which of the two has the higher word frequency.
The dictionary stores the language information the input method needs, and with it the user can complete character input. But different users need different language information. For example:
(1) Different entries. Almost every industry has its own specialized vocabulary that is seldom used in other fields, which can be taken into account when building an input method dictionary; the computing term "cache" is one example.
(2) Different entry importance. Different users may need the same words, but the importance of a word differs from user to user. Take the homophones "research" (研究) and "tobacco and wine" (烟酒): the former is used more in academia, the latter more in daily life, yet either may be used. When a user enters the pinyin "yanjiu", both therefore appear in the candidate list, and because their importance differs, their relative positions directly affect the user's experience.
The importance of an entry to a user can be represented in the dictionary in various ways, used alone or in combination, for example:
Word-frequency information. This is usually a number expressing how often the word is used; in general, the more frequently a word is used, the higher its word frequency.
Word-order information. This is usually also a number, but it expresses only the relative importance of the entry.
Or positional information. For convenience, such data may be omitted and the importance of an entry expressed by its relative position in the dictionary. For example, words nearer the front of the dictionary may be treated as more important than those further back, and placed earlier in the candidate list.
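The three ways of expressing entry importance can be sketched with one ranking function. The `Entry` type, the field names, the sample numbers and the convention that a lower order number means a more important entry are assumptions for illustration; the sketch also assumes that all entries of one dictionary use the same representation.

```python
from typing import List, NamedTuple, Optional

class Entry(NamedTuple):
    word: str
    frequency: Optional[int] = None  # higher = used more often
    order: Optional[int] = None      # lower = more important (assumed convention)

def importance_key(position: int, entry: Entry) -> int:
    """Smaller keys sort first."""
    if entry.frequency is not None:
        return -entry.frequency      # word-frequency information
    if entry.order is not None:
        return entry.order           # word-order information
    return position                  # positional information only

def rank(entries: List[Entry]) -> List[str]:
    """Order candidates by whichever importance data the entries carry."""
    indexed = sorted(enumerate(entries), key=lambda pair: importance_key(*pair))
    return [entry.word for _, entry in indexed]
```

For instance, two entries with frequencies 300 and 900 come back with the 900 entry first, while entries that carry no numbers at all simply keep their stored order, which is the positional scheme described above.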
Since a dedicated input method dictionary cannot be generated for each user, the invention proposes dividing the input method dictionary into two parts, a system dictionary and a cell word set. The system dictionary records common words and satisfies the input needs of most people in most situations; the individual needs of particular users are recorded in the cell word set. To make the cell word set fit each user closely, a large number of cell dictionaries are generated manually or automatically, and each user selects the cell dictionaries they need to obtain their cell word set. Because the personalized part is chosen by the user, the cell word set fits each user very well.
If the user has selected a single cell dictionary, that cell dictionary directly constitutes the cell word set.
If the user has selected several cell dictionaries, the cell word set may take several forms. For example: (1) the cell dictionaries are merged into one dictionary on the client, so the cell word set exists as a single independent dictionary; the source of each entry (that is, its cell dictionary) may or may not be stored in this dictionary. (2) The cell dictionaries are stored side by side on the client, so the cell word set exists as several independent dictionaries, which are simply scanned in turn. (3) Some of the cell dictionaries (for example, those with closely related attributes) are merged on the client, so the cell word set exists as several independent dictionaries, some of which result from merging several cell dictionaries.
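Form (1), a single merged dictionary that keeps the source of each entry, can be sketched as follows. The nested-mapping layout, the function name `merge_cell_dictionaries` and the sample codes and dictionary names are hypothetical; they only illustrate one way of recording which cell dictionaries contributed an entry.

```python
from typing import Dict, List, Set

CellDictionary = Dict[str, List[str]]        # input code -> words
MergedSet = Dict[str, Dict[str, Set[str]]]   # input code -> word -> source names

def merge_cell_dictionaries(selected: Dict[str, CellDictionary]) -> MergedSet:
    """Merge the user's selected cell dictionaries into one cell word set.

    Each word remembers every cell dictionary it came from, so a word shared
    by two dictionaries is stored once with two sources.
    """
    merged: MergedSet = {}
    for name, dictionary in selected.items():
        for code, words in dictionary.items():
            for word in words:
                merged.setdefault(code, {}).setdefault(word, set()).add(name)
    return merged

# Illustrative selection; codes and names are made up for the example.
selected = {
    "game": {"deluyi": ["Druid"]},
    "film": {"yunshuiyao": ["The Knot"], "deluyi": ["Druid"]},
}
cell_word_set = merge_cell_dictionaries(selected)
```

Keeping the source set costs some memory, but it is what later makes it possible to remove only the entries of one cell dictionary without rebuilding the whole set.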
Some language information is rather complex, for example language connection relations: it is hard to obtain and hard to store. Preferably, therefore, the cell word set (and in effect each cell dictionary) stores fewer types of language information than the system dictionary. Of course, the cell word set may also store more types of language information than the system dictionary; for example, word-order or positional information is typically stored in cell dictionaries but typically not in the system dictionary.
Further, the input method system of this embodiment may also include a user dictionary 106, used to record the user's input habits so as to better meet that user's individual needs.
The platform provided by the server hosts a large number of cell dictionaries, and a large number of users modify and update them to improve them. How to deliver the latest and best version of a cell dictionary to the input method users who have selected it is therefore another technical problem the invention addresses.
Preferably, this embodiment may also include an automatic update module 107, used to download the required update data from the server, upon being triggered, according to the list of existing cell dictionaries. For example, the user's input method system stores a list of information about the cell dictionaries in use and compares it with the information on the server; if an update is needed, the download is carried out according to a preset update strategy. The update data may be an entire cell dictionary: once it is known that a cell dictionary needs updating, all of its entries are downloaded directly. The update data may also be only part of a cell dictionary's entries: once it is known that a cell dictionary needs updating, entries are compared and only those that changed are downloaded. The server may also merge the changed entries of several cell dictionaries into one new dictionary that serves as the update data.
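The comparison performed by the automatic update module can be sketched in two small functions: one compares dictionary lists, the other compares entries. Representing version information as an integer that grows with each revision is an assumption, as are the function names and the sample data; no network transfer is shown.

```python
from typing import Dict, List, Set, Tuple

def dictionaries_needing_update(local: Dict[str, int], server: Dict[str, int]) -> List[str]:
    """Names of installed cell dictionaries whose server version is newer.

    Only dictionaries the user already uses are considered; a dictionary
    missing from the server list is left alone.
    """
    return sorted(
        name for name, version in local.items()
        if server.get(name, version) > version
    )

def changed_entries(old: Set[str], new: Set[str]) -> Tuple[Set[str], Set[str]]:
    """Entry-level comparison: (entries to add, entries to remove)."""
    return new - old, old - new
```

Downloading the whole dictionary corresponds to acting on the first function's result directly; downloading only the changed entries corresponds to applying the second function to each dictionary that needs updating.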
If the user has selected several cell dictionaries, the server may merge them into one dictionary and send it to the client as the cell word set; in that case the server performs the task of adding the cell dictionary data.
If the user has selected several cell dictionaries and the input method system itself adds the cell dictionary data, this embodiment may also include an adding module 108, used to add the downloaded cell dictionary entry information to the cell word set. The adding module 108 may use any feasible adding strategy. For example, each cell dictionary may be added to the cell word set as soon as its update has been downloaded; alternatively, the additions may be made only after all cell dictionaries to be updated have been downloaded.
The adding module 108 may be used when the cell word set is first formed or when its dictionaries are updated. It may be used when an entire cell dictionary is downloaded and also when only part of a cell dictionary's entries are downloaded.
Preferably, if the adding process can be completed in a short time (say, no more than 1 second), its impact is small and it can be performed directly within the user's input process. If it cannot be completed in a short time and might therefore affect the user's experience, the adding process should be carried out in a separate buffer dictionary. The input method's original dictionary is unaffected during this process, and the user can continue to use it normally. Once the buffer dictionary has been built, it simply replaces the original dictionary. Because the replacement is very fast, interference with the user's normal use is kept to a minimum.
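The buffer-dictionary approach can be sketched as "copy, modify the copy, then swap one reference". The class name `DictionaryHolder` and its methods are hypothetical, and the sketch relies on the assumption that rebinding a single attribute is fast enough that lookups never observe a half-built dictionary.

```python
import threading
from typing import Dict, List

class DictionaryHolder:
    """Holds the active dictionary and replaces it only when a buffer is complete."""

    def __init__(self, entries: Dict[str, List[str]]):
        self._active = {code: list(words) for code, words in entries.items()}
        self._writer_lock = threading.Lock()  # serializes concurrent updates

    def lookup(self, code: str) -> List[str]:
        # Readers always see a complete dictionary: the old one or the new one.
        return list(self._active.get(code, []))

    def add_entries(self, new_entries: Dict[str, List[str]]) -> None:
        with self._writer_lock:
            # Build the buffer dictionary without touching the active one.
            buffer = {code: list(words) for code, words in self._active.items()}
            for code, words in new_entries.items():
                bucket = buffer.setdefault(code, [])
                for word in words:
                    if word not in bucket:
                        bucket.append(word)
            # The replacement itself is a single reference swap.
            self._active = buffer
```

The slow part (copying and inserting) happens entirely on the buffer, so `add_entries` can run on a background thread while `lookup` keeps serving the original dictionary.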
Preferably, in order further to improve the management of user to dictionary, present embodiment can also comprise: the cell dictionary module 109 of stopping using, (for example be used to receive user instruction, by clicking modes such as menu item), from the cell word set, remove the entry record that belongs to user-selected cell dictionary, reach the purpose that certain or some cell dictionaries are stopped using.
The removal process may be: receive the user instruction, delete the user-selected cell dictionary from the list, and add the cell dictionaries remaining in the list again to obtain a new cell word set. Because the deleted cell dictionary no longer exists in the list, the newly obtained cell word set does not contain its words, which has the same effect as deleting the dictionary. For a cell dictionary that exists independently within the cell word set, deleting it directly or marking it with a delete flag achieves deactivation.
Alternatively, the removal process may be: receive the user instruction and delete from the cell word set the entry records that belong to the user-selected cell dictionary, where the cell word set records the cell dictionary to which each entry belongs. Alternatively, the removal process may be: receive the user instruction and, in the cell word set, add a delete flag to the entry records that belong to the user-selected cell dictionary, where the cell word set records the cell dictionary to which each entry belongs.
That is, the large dictionary serving as the cell word set records the source of each entry. When the user asks to delete a certain cell dictionary (or to make it inactive), the input method system is notified to remove the entries that came from that dictionary. The removal can delete the entry from the data structure and release its space, or it can be implemented with a delete flag. Entries carrying a delete flag are ignored in subsequent use (space is not freed, but this is easier to implement). The benefit of this approach is that, when there are many cell dictionaries, deleting a small number of them incurs relatively little system overhead.
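The two removal variants that rely on a recorded source can be sketched as below. The `Entry` record, its field names, and the `free_space` switch are hypothetical names chosen for the illustration; the text only requires that each entry records the cell dictionary it belongs to.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    word: str
    source: str            # ID of the cell dictionary the entry came from
    deleted: bool = False  # delete flag; flagged entries are ignored later


def deactivate(cell_word_set, dictionary_id, free_space=False):
    """Remove the entries of one cell dictionary from the cell word set."""
    if free_space:
        # Variant 1: physically delete the entries and release their space.
        return [e for e in cell_word_set if e.source != dictionary_id]
    # Variant 2: only mark the entries; nothing is freed.
    for entry in cell_word_set:
        if entry.source == dictionary_id:
            entry.deleted = True
    return cell_word_set


def active_words(cell_word_set):
    # Entries carrying a delete flag are skipped during retrieval.
    return [e.word for e in cell_word_set if not e.deleted]
```

With the flag variant, the cost of deactivating one dictionary is a single pass over the word set, independent of how many other cell dictionaries it contains.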
Referring to Fig. 2, a method embodiment for character input is shown, which may specifically comprise:
Step 201: load the system dictionary and the cell word set. The cell word set is obtained from at least one cell dictionary selected from a plurality of cell dictionaries stored on the server; the words in each cell dictionary share at least one common attribute;
Step 202: receive the user's input information;
Step 203: according to the received input information, search the system dictionary and the cell word set to obtain the corresponding candidate items;
Step 204: receive the user's selection information and output the specified candidate item to the screen.
An important problem in this embodiment is how to retrieve candidate items when multiple dictionaries coexist. The loading process in step 201 may be: merge the cell word set and the system dictionary into one dictionary and place it in the cache.
When the input method starts, it scans the system dictionary and the cell word set present in the input method system, merges the two into one dictionary, and loads it into the cache, so that in subsequent operation the user can use it in the same way as the system dictionary. The loading of the system dictionary and the loading of the cell word set can be carried out separately. For example, in the simple case only the system dictionary needs to be loaded; in some cases, a user selection or the input method system itself (for example, when a preset policy is satisfied) triggers the loading of the cell word set, which is then merged into the system dictionary and placed in the cache for retrieval during user input.
Further, the loading process in step 201 may also be: place the cell word set and the system dictionary in the cache as two or more independent dictionaries, and set dictionary priorities according to a preset rule; the priorities are used to order the displayed candidate items.
That is, during loading, the cell word set is placed in a designated space outside the system dictionary, and the cell word set is searched whenever the system dictionary is searched. Preferably, the priorities of the system dictionary and the cell word set need to be specified in this case. For example, if by default the cell word set has a higher priority than the system dictionary, then when candidate items are output, all words belonging to the cell word set are placed ahead of the words belonging to the system dictionary.
When the cell word set is a single large dictionary, the cache holds two independent dictionaries. When the cell word set is formed by multiple cell dictionaries that each exist independently, the cache may hold several independent dictionaries. In that case the priority of each dictionary needs to be set; the priorities are used to order the displayed candidate items.
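Retrieval across independent dictionaries with priorities might look like the following sketch. The function name `retrieve`, the dictionary names, and the integer priority scale are assumptions made for the example; the text does not fix a data layout.

```python
def retrieve(code, dictionaries, priority):
    """Collect candidates for an input code from several independent dictionaries.

    dictionaries: name -> {input code: [words]}
    priority:     name -> int, where a higher value is shown first
    """
    candidates = []
    seen = set()
    # Visit dictionaries from highest to lowest priority.
    for name in sorted(dictionaries, key=lambda n: priority[n], reverse=True):
        for word in dictionaries[name].get(code, []):
            if word not in seen:  # show each word only once
                seen.add(word)
                candidates.append(word)
    return candidates


dictionaries = {
    "system": {"wow": ["哇哦"]},
    "cell": {"wow": ["魔兽世界", "哇哦"]},
}
# Assumed default: the cell word set outranks the system dictionary.
print(retrieve("wow", dictionaries, {"cell": 2, "system": 1}))
```

Swapping the two priority values puts the system dictionary's words first without touching the dictionaries themselves.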
Preferably, when the cell word set is a single large dictionary, the cell word set may also record, for each entry, the cell dictionary to which it belongs and the corresponding cell dictionary priority, so as to reflect the differences between cell dictionaries.
Where a priority is set for each cell dictionary (whether the cell dictionaries exist independently or are merged into a single large dictionary), the cell dictionary priorities can preferably be adjusted dynamically during loading according to the environment in which the input method is used. For example, the cell word set contains two cell dictionaries, "office terms" and "internet slang", whose priorities are normally the same. When the input method system recognizes that the current application is the Word word processor, it can give extra weight to the "office terms" cell dictionary; when the user switches to the QQ chat program, it can give extra weight to the "internet slang" cell dictionary.
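The context-dependent weighting in this example can be sketched as follows. The numeric weights, the application categories `word_processor` and `chat`, and the function names are assumptions for illustration only; the text does not specify how the weighting is computed.

```python
# Assumed base priorities: both cell dictionaries are equal by default.
BASE_PRIORITY = {"office terms": 1.0, "internet slang": 1.0}

# Assumed extra weight per kind of foreground application.
CONTEXT_BOOST = {
    "word_processor": {"office terms": 1.0},
    "chat": {"internet slang": 1.0},
}


def effective_priority(app_kind):
    boost = CONTEXT_BOOST.get(app_kind, {})
    return {name: base + boost.get(name, 0.0) for name, base in BASE_PRIORITY.items()}


def rank(candidates, app_kind):
    """candidates: list of (word, source cell dictionary) pairs."""
    priority = effective_priority(app_kind)
    # sorted() is stable, so equal priorities keep their original order.
    ordered = sorted(candidates, key=lambda c: priority[c[1]], reverse=True)
    return [word for word, _ in ordered]


candidates = [("meeting minutes", "office terms"), ("lol", "internet slang")]
print(rank(candidates, "chat"))
```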
Referring to Fig. 3, an embodiment of a dictionary delivery system applicable to the aforementioned input method system (referred to below as the input method client, for clarity) is shown. The dictionary delivery system can be used when the input method client downloads cell dictionaries from the server for the first time to obtain the cell word set, and also when existing cell dictionaries are updated.
The dictionary delivery system shown in Fig. 3 may specifically comprise:
A cell dictionary generation unit 301, comprising an interface module 3011 for receiving input information, a generation module 3012 for obtaining a cell dictionary from the received information, and an identification module 3013 for assigning an identifier and version information to each cell dictionary; the words in each cell dictionary share at least one common attribute;
A communication unit 302, generally located on the server, used to accept trigger information and transmit the corresponding cell dictionary entry information to the client.
The cell dictionary generation unit 301 is generally located on the server, where it is used for unified management and maintenance of the cell dictionaries. Of course, some or all of the modules in the cell dictionary generation unit 301 may also be located on a client (which may be a client other than the input method client). For example, if the interface module 3011 and the generation module 3012 are located on a client, the user can send the generated cell dictionary file directly to the server, and the server assigns the identifier and version information.
The trigger information may be a user selection operation, a trigger sent automatically by the input method client, or a trigger detected automatically on the server. For example, the server or client detects the IP address or the current input environment and automatically recommends corresponding cell dictionaries to the user; an update message sent by the client is also a kind of trigger information.
Cell dictionaries can be generated manually, automatically, or in other ways. The manual generation process is briefly described below:
The person generating the dictionary provides the following information through the interface module 3011 (for example, a dictionary editing page): title, category, number of entries, version, description, dictionary author, sample entries, entries (including pronunciation information), and so on. After the submit button is clicked, this information is saved to a database, and the dictionary generation program is then started immediately. In the simplest case, the dictionary generation program saves this information directly to a text file for users to download.
For example, a cell dictionary is a file containing the following data:
Dictionary serial number: 00015214
Link address: http://abc.com/dict/00015214
Title: World of Warcraft
Category: Recreation
Number of entries: 188
Version: 0008
Date: 2006.12.6
Description: A cell dictionary that I made.
Dictionary author: Zhang, Li, or anyone
Sample entries: Azeroth ...
Entries, pronunciation, word frequency data: (concrete data)
To improve the efficiency of adding a cell dictionary, the format of the cell dictionary can also be processed as necessary, for example by sorting its entries. This work can be done in the generation module 3012, and the data file with sorted entries is then offered to users for download as the cell dictionary file.
For purposes such as copyright protection, the cell dictionary can also be encrypted. Correspondingly, it needs to be decrypted when the cell dictionary is installed. That is, preferably, the server may also comprise an encryption module, and the input method client may also comprise a decryption module.
To facilitate updating, the identification module 3013 can assign both a unique ID and a version number to each cell dictionary.
The cell dictionary in the embodiment shown in Fig. 3 can take several forms. In the general case, a cell dictionary directly stores a plurality of entries. Alternatively, a cell dictionary may store only index information, where the index information refers to other cell dictionaries. A cell dictionary that stores index information is typically used as follows: the server stores a number of cell dictionaries obtained from received information and then, according to some commonality among these cell dictionaries, generates a new cell dictionary (that is, it uses the received information indirectly). For simplicity of implementation, the new cell dictionary stores only index information; when a user needs this dictionary, the server merges the corresponding dictionaries and then transmits the result.
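An index-only cell dictionary that is resolved on the server at request time might be sketched as follows. The record layout (`entries` versus `index` keys), the dictionary IDs, and the function name are hypothetical; the text only states that the index information corresponds to other cell dictionaries and that the server merges them before transmission.

```python
def resolve_entries(dict_id, store, _seen=None):
    """Return the entries of a cell dictionary, following index references."""
    seen = set() if _seen is None else _seen
    if dict_id in seen:           # guard against circular references
        return []
    seen.add(dict_id)
    record = store[dict_id]
    if "entries" in record:       # ordinary dictionary: entries stored directly
        return list(record["entries"])
    merged = []
    for ref in record["index"]:   # index dictionary: merge the referenced ones
        for word in resolve_entries(ref, store, seen):
            if word not in merged:
                merged.append(word)
    return merged


store = {
    "wow": {"entries": ["Azeroth", "Horde"]},
    "sc": {"entries": ["Zerg", "Horde"]},
    "games": {"index": ["wow", "sc"]},  # stores index information only
}
print(resolve_entries("games", store))
```

Because the index dictionary holds no entries of its own, changes to the referenced dictionaries are picked up the next time it is resolved.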
Further, to allow cell dictionaries to be updated quickly, the cell dictionary generation unit 301 of the dictionary delivery system in this embodiment may also comprise: a modification and update module 3014, used to modify and update the stored information of a cell dictionary and to notify the identification module to generate new version information for that cell dictionary. The modification may be done manually, or by adjusting the cell dictionary according to a preset policy. For example: other users add new entries to a cell dictionary; or, according to a preset policy, the entries of two cell dictionaries are merged into one cell dictionary; or, according to Internet word frequency statistics, entries in a cell dictionary whose Internet word frequency does not meet a preset condition are deleted or reordered.
The embodiment shown in Fig. 3 can complete the addition of cell dictionary data in at least the following two ways.
The first is to download the cell dictionary to the local machine and then open the file by double-clicking it, which completes the addition of data. The cell dictionary is a file with a specific suffix, for example .scd. When the input method system is installed, the .scd suffix can be associated with a specific application through the registry. When the user double-clicks a file with the .scd suffix, the operating system starts the corresponding application module (for example, the adding module in the embodiment shown in Fig. 1) according to this association, and the module completes the addition of the cell dictionary data.
The second is to click a link on a page and complete the addition of the cell dictionary data directly online. After the user clicks a cell dictionary link on the page, there are two options: save and run. If the user saves the cell dictionary file, the procedure is the same as the first way. If the user chooses to run it, the system saves the cell dictionary file in the system's temporary folder and then runs it. The internal mechanism is the same as in the first way; the difference is that the file is downloaded to the system's temporary folder, so the user does not need to specify a download location. The system also cleans up the temporary directory when necessary, so although the cell dictionary has been downloaded to the temporary directory, it is in effect invisible to the user.
Preferably, the process of adding a downloaded cell dictionary to the cell word set may also include a conversion step, for example sorting the originally unordered entries in the dictionary to improve the efficiency of addition. If this conversion step exists, the converted dictionary file is used; otherwise the original dictionary file is used directly. Of course, if the server has already performed the conversion and sorting during dictionary generation, the client does not need to repeat it when adding data.
During data addition, the input method system (that is, the input method client) needs to maintain a list of the cell dictionaries currently in use. The cell dictionary list can take any feasible form: for example, all active cell dictionaries are copied into a designated directory, or a list of file names is kept (this list can be placed in a local disk file, in the registry, or at a remote location such as on the network).
The data of a cell dictionary can be added to the cell word set immediately after the download completes (for example, the input method client is notified to start the add operation). Alternatively, the add operation can start when the input method itself discovers the update (for example, the next time the user starts the input method): the cell dictionary list is scanned, and each cell dictionary is read in turn and added to the cell word set.
Taking a cell word set that takes the form of a single independent large dictionary as an example, the addition can be carried out in two ways: incremental and batch.
Batch mode merges the words of all cell dictionaries into one large temporary dictionary in a single pass and then adds it to the cell word set in a single operation. This is relatively simple to implement, but the user must wait until all dictionaries have been merged before the newly added cell dictionaries can be used. Incremental mode adds entries to the cell word set as soon as a few entries have been read in. If the merge takes a long time, the user can use the dictionary while merging is in progress, but this places higher demands on system design.
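The difference between the two modes can be illustrated with the sketch below, which models the cell word set as a Python `set` of words. The function names and the `chunk_size` parameter are assumptions for the example, not terms from the text.

```python
def add_batch(cell_word_set, cell_dictionaries):
    """Batch mode: merge everything into one temporary dictionary, then add once."""
    temporary = []
    for dictionary in cell_dictionaries:
        temporary.extend(dictionary)
    cell_word_set.update(temporary)  # the single add operation


def add_incremental(cell_word_set, cell_dictionaries, chunk_size=2):
    """Incremental mode: add a few entries at a time.

    This is a generator; the word set is usable between steps, and each
    step yields the current size of the word set.
    """
    for dictionary in cell_dictionaries:
        for start in range(0, len(dictionary), chunk_size):
            cell_word_set.update(dictionary[start:start + chunk_size])
            yield len(cell_word_set)
```

A caller drives the incremental version step by step (for example with `for size in add_incremental(...)`) and may serve lookups between steps, which is where the extra design effort mentioned above comes in.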
With incremental merging, the dictionary can be used during the merge, so the input method system does not need to be notified when the merge completes. With batch merging, the input method system needs to be notified after the merge completes that the new dictionary is available. An alternative is to access the input method's storage space directly and update the data there; although the input method receives no notification, the data has been updated, so the new data can in fact be used.
Preferably, the data addition process may also include an optimization step that handles duplicate words in the dictionaries, for example by merging duplicate entries. To record such a word accurately, its source attribute can record information such as the identifiers of the cell dictionaries to which it belongs. Further, the different priorities of those cell dictionaries can also be recorded for the word, so that the priorities of different cell dictionaries can be used to order candidate items in different input environments.
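One possible shape for the merged record is shown below: each word appears once and carries a mapping from source cell dictionary to that dictionary's priority. The function name and the input layout are hypothetical.

```python
def merge_duplicates(cell_dictionaries):
    """Merge duplicate words while keeping every source and its priority.

    cell_dictionaries: dictionary ID -> (priority, [words])
    returns:           word -> {dictionary ID: priority}
    """
    merged = {}
    for dict_id, (priority, words) in cell_dictionaries.items():
        for word in words:
            merged.setdefault(word, {})[dict_id] = priority
    return merged


merged = merge_duplicates({
    "wow": (2, ["Azeroth", "Horde"]),
    "sc": (1, ["Zerg", "Horde"]),
})
print(merged["Horde"])
```

Keeping the per-source priorities on the merged entry lets the ranking step pick whichever source is most relevant to the current input environment.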
To help the input method client complete the update task, the dictionary delivery system in this embodiment can perform the identification of updates on the server. That is, preferably, the dictionary delivery system in this embodiment may also comprise: an identification module 303, used to compare the cell dictionary list stored on the server with the cell dictionary list sent by the client; the comparison result is used to transmit the required update data to the client. For example, the changed cell dictionaries can be compiled into a list and sent to the client, which confirms the list and initiates download requests; alternatively, the server can push the changed cell dictionaries directly to the client and thereby complete the update. The update data may be a whole cell dictionary: for example, once identification determines that a cell dictionary needs updating, all entry information of that cell dictionary is transmitted. The update data may also be part of the entry information of a cell dictionary: for example, once identification determines that a cell dictionary needs updating, the entries are compared further and only the entry information that has changed is transmitted.
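The entry-level comparison mentioned at the end of this paragraph can be sketched as a simple diff. The entry layout (word mapped to a pronunciation and frequency pair) and the separate list of removed words are assumptions; the text only says that the changed entry information is transmitted.

```python
def entry_delta(old_entries, new_entries):
    """Compute the partial update for one cell dictionary.

    Both arguments map word -> (pronunciation, word frequency).
    Returns the added or changed entries and the words that disappeared.
    """
    changed = {
        word: info
        for word, info in new_entries.items()
        if old_entries.get(word) != info
    }
    removed = sorted(word for word in old_entries if word not in new_entries)
    return changed, removed


old = {"Azeroth": ("ai ze la si", 10), "Horde": ("bu luo", 5)}
new = {"Azeroth": ("ai ze la si", 12), "Alliance": ("lian meng", 7)}
changed, removed = entry_delta(old, new)
```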
To further improve the efficiency of dictionary delivery, this embodiment may also comprise: a merging module 304, used to merge the entry information of multiple cell dictionaries into one download dictionary and to notify the communication unit 302 to transmit the download dictionary to the client. The merging module can be used in various scenarios. For example, multiple cell dictionaries selected by the user are merged into one dictionary and then transmitted; or the changed entry information from multiple cell dictionaries that need updating is merged into a new dictionary and then transmitted; or the cell dictionaries referenced by the index information in a cell dictionary are merged into a new dictionary and then transmitted.
Referring to Fig. 4, a method embodiment of dictionary updating is shown. The dictionary to be updated relates to the cell word set that records extended words and their related information in the input method system; the cell word set is obtained from at least one cell dictionary selected from a plurality of cell dictionaries stored on the server; the words in each cell dictionary share at least one common attribute.
The method embodiment may specifically comprise:
Step 401: accept a trigger, compare the existing cell dictionary list with the server-side cell dictionary list, and obtain the list of dictionaries that need updating; the trigger may be manual or automatic;
Step 402: download the entry information of the cell dictionaries that need updating, and add it to the cell word set.
Preferably, the method embodiment may also comprise step 403: manually or automatically upgrade the cell dictionaries stored on the server, and change the corresponding version information. The upgrade may be done manually, or by adjusting the cell dictionary according to a preset policy. For example: other users add new entries to a cell dictionary; or, according to a preset policy, the entries of two cell dictionaries are merged into one cell dictionary; or, according to Internet word frequency statistics, entries in a cell dictionary whose Internet word frequency does not meet a preset condition are deleted or reordered.
To facilitate updating, each cell dictionary has a unique ID. The unique ID may be a naturally increasing integer, a network address, or other information (as long as two different cell dictionaries are guaranteed to have different IDs). Each cell dictionary may also have version information, which may be a serial number or the time of last modification. If the version information has changed, the dictionary file needs to be updated. For example, if the last update time is used as version information and the update time held by the client differs from that of the file stored on the server, the dictionary file needs to be updated.
The comparison in step 401 can be implemented in several ways, for example:
(1) The input method client sends its existing cell dictionary list to the server, over TCP/IP or over HTTP; the server judges whether the cell dictionaries corresponding to the IDs in the list need updating.
(2) The input method client initiates an update request; the server sends back the list information of all cell dictionaries, and the input method client judges which existing dictionaries need updating.
(3) The input method client sends its existing cell dictionary list to the server; the server sends back the version information of the cell dictionaries corresponding to the IDs in the list, and the input method client judges which existing dictionaries need updating.
These approaches place different demands on bandwidth and on the computing load of the devices; those skilled in the art can choose among them according to actual needs.
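Whichever side performs it, the comparison itself reduces to matching IDs and checking version information. A minimal sketch, assuming both lists are represented as a mapping from cell dictionary ID to version (the function name and the sample IDs other than 00015214 are made up for illustration):

```python
def dictionaries_to_update(client_list, server_list):
    """Return the IDs of installed cell dictionaries whose version has changed.

    client_list: ID -> version held by the input method client
    server_list: ID -> version currently stored on the server
    """
    return sorted(
        dict_id
        for dict_id, version in client_list.items()
        if dict_id in server_list and server_list[dict_id] != version
    )


client_list = {"00015214": "0008", "00000042": "0003"}
server_list = {"00015214": "0009", "00000042": "0003", "00000077": "0001"}
print(dictionaries_to_update(client_list, server_list))
```

Dictionaries that exist only on the server are not reported, since the comparison covers only the dictionaries the client already has.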
Where identification is done by the server, the server can compile the changed cell dictionaries into a list and send it to the client, which confirms it and initiates download requests (for example, selecting only some of the dictionaries to update); alternatively, the server can push the changed cell dictionaries directly to the client and thereby complete the update.
The data downloaded in step 402 may be a whole dictionary, or part of the entry information of a cell dictionary, for example the entry information that has changed.
The data addition in step 402 can use incremental mode, batch mode, or a combination of the two. For example, the adding mode may be: each time the update download of one cell dictionary completes, its entry information is added to the cell word set; alternatively, the adding mode may be: the entry information is added to the cell word set only after all cell dictionaries to be updated have been downloaded.
In incremental mode, each dictionary is installed as soon as it has been updated. The advantage is that a downloaded dictionary takes effect without being affected by dictionaries not yet downloaded. However, when many dictionaries are downloaded, this can cause frequent add operations and increase the system load. Batch mode adds the dictionaries only after all of them have been downloaded to the local machine. Because there are fewer add operations, the system load is lower. However, when the download takes a long time, and particularly when a download fails partway, the dictionaries already downloaded may remain unusable for a long time. In practice the two modes can be combined: for example, each time a dictionary is downloaded successfully, the system checks whether a predefined time interval has elapsed since the last add operation, and if so, performs the add operation.
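The combined mode can be sketched as follows. The class name, the injected `clock` function, and the choice to add immediately on the very first download are assumptions made for the example; the text specifies only the interval check.

```python
class CombinedAdder:
    """Queue downloaded dictionaries and add them at most once per interval."""

    def __init__(self, min_interval, clock):
        self.min_interval = min_interval  # predefined interval between adds
        self.clock = clock                # function returning the current time
        self.last_add = None              # time of the last add operation
        self.pending = []                 # downloaded but not yet added
        self.cell_word_set = set()

    def on_download(self, entries):
        """Call each time one cell dictionary has been downloaded successfully."""
        self.pending.append(entries)
        now = self.clock()
        # Assumption: with no earlier add operation, add right away.
        if self.last_add is None or now - self.last_add >= self.min_interval:
            self.flush(now)

    def flush(self, now=None):
        """Add everything pending; also call this once all downloads finish."""
        for entries in self.pending:
            self.cell_word_set.update(entries)
        self.pending.clear()
        self.last_add = self.clock() if now is None else now
```

A final `flush()` after the last download ensures that dictionaries queued within the interval are not left waiting.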
If the dictionary adding process can be completed within a short time (for example, no more than 1 second), it can be inserted directly into the user's input process, because its impact is small. If it cannot be completed within a short time and may therefore affect the user experience, the adding process should be carried out in a separate cache dictionary. The original dictionary of the input method is unaffected during this process, and the user can continue to use it normally. Once the cache dictionary has been built, it directly replaces the original dictionary of the input method. Because this replacement is very fast, interference with the user's normal use can be avoided.
The input method system, the character input method, the dictionary update method, and the dictionary delivery system provided by the present invention have been described in detail above. Specific examples have been used herein to explain the principles and embodiments of the present invention; the description of the above embodiments is intended only to help in understanding the method of the present invention and its core idea. At the same time, those of ordinary skill in the art may, following the idea of the present invention, make changes to the specific embodiments and the scope of application. In summary, the content of this description should not be construed as limiting the present invention.

Claims (22)

1. An input method system, comprising an input interface unit, an information conversion unit, and a display output unit, characterized by further comprising:
a system dictionary, used to record basic words and their related information;
a cell word set, used to record extended words and their related information; the cell word set is obtained from at least one cell dictionary obtained from a plurality of cell dictionaries stored on a server; the words in each cell dictionary share at least one common attribute.
2. The input method system of claim 1, characterized by further comprising:
an automatic update module, used to obtain the required update data from the server according to the existing cell dictionary list.
3. The input method system of claim 1, characterized in that the types of related information stored in the cell word set are fewer than or equal to the types of related information stored in the system dictionary; and at least one of the plurality of cell dictionaries is generated manually.
4. The input method system of claim 1, characterized by further comprising a user dictionary.
5. The input method system of claim 1, characterized by further comprising:
an adding module, used to add the obtained cell dictionary entry information to the cell word set; the cell word set is a single independent dictionary or a set of multiple dictionaries existing side by side.
6. The input method system of claim 5, characterized in that
the adding mode is: each time the update download of one cell dictionary completes, the entry information of that cell dictionary is added to the cell word set;
or, the adding mode is: the entry information is added to the cell word set only after all cell dictionaries to be updated have been downloaded.
7. The input method system of claim 6, characterized in that
the adding process is carried out in a separate cache dictionary.
8. The input method system of claim 1, characterized by further comprising:
a cell dictionary deactivation module, used to receive a user instruction and remove from the cell word set the entry records that belong to the cell dictionary selected by the user.
9. The input method system of claim 8, characterized in that
the removal process is: receive the user instruction, delete the user-selected cell dictionary from the list, and add the cell dictionaries in the list again to obtain a new cell word set;
or, the removal process is: receive the user instruction and delete from the cell word set the entry records that belong to the user-selected cell dictionary, where the cell word set records the cell dictionary to which each entry belongs;
or, the removal process is: receive the user instruction and, in the cell word set, add a delete flag to the entry records that belong to the user-selected cell dictionary, where the cell word set records the cell dictionary to which each entry belongs.
10. A character input method, characterized by comprising:
loading a system dictionary and a cell word set; the cell word set is used to record extended words and their related information; the cell word set is obtained from at least one cell dictionary obtained from a plurality of cell dictionaries stored on a server; the words in each cell dictionary share at least one common attribute;
receiving the user's input information;
searching the system dictionary and the cell word set according to the received input information to obtain the corresponding candidate items;
receiving the user's selection information and outputting the specified candidate item to the screen.
11. The method of claim 10, characterized in that
the loading is: merging the cell word set and the system dictionary into one dictionary and placing it in the cache;
or, the loading is: placing the cell word set and the system dictionary in the cache as two or more independent dictionaries, and setting dictionary priorities according to a preset rule; the priorities are used to order the displayed candidate items.
12. The method of claim 10, characterized in that the cell word set records, for each entry, the cell dictionary to which it belongs and the corresponding cell dictionary priority; the priority is used to order the displayed candidate items.
13. The method of claim 12, characterized by further comprising:
during loading, dynamically adjusting the cell dictionary priorities according to the environment in which the input method is used.
14. A dictionary update method, characterized in that the dictionary to be updated relates to a cell word set used to record extended words and their related information; the cell word set is obtained from at least one cell dictionary selected from a plurality of cell dictionaries stored on a server; the words in each cell dictionary share at least one common attribute;
the method comprises:
accepting a trigger, comparing the existing cell dictionary list with the server-side cell dictionary list, and obtaining the list of dictionaries that need updating;
downloading the entry information of the cell dictionaries that need updating, and adding it to the cell word set.
15. The method of claim 14, characterized by further comprising:
manually or automatically upgrading the cell dictionaries stored on the server, and changing the corresponding version information.
16. The method of claim 14, characterized in that:
the adding mode is: each time one cell dictionary has been downloaded, the entry information of that cell dictionary is added to the cell word set;
or, the adding mode is: the entry information is added to the cell word set only after all cell dictionaries to be updated have been downloaded.
17. The method as claimed in claim 16, characterized in that the adding process is carried out in an independent cached dictionary.
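The two adding modes of claim 16 and the independent cached dictionary of claim 17 can be contrasted in a short sketch. The function names, the `download` callable, and the use of `OSError` to model a failed download are assumptions for illustration only.

```python
def add_one_by_one(staging, pending, download):
    """Mode 1: add each cell dictionary as soon as its download finishes."""
    for name in pending:
        try:
            staging[name] = download(name)
        except OSError:
            break  # dictionaries added so far remain in place
    return staging


def add_after_all(staging, pending, download):
    """Mode 2: add nothing until every pending download has finished."""
    try:
        downloaded = {name: download(name) for name in pending}
    except OSError:
        return staging  # nothing was added
    staging.update(downloaded)
    return staging


def update_in_staging(live_word_set, pending, download, add):
    """Run the chosen adding mode on an independent copy of the word set.

    The live word set is never modified here; the caller decides when to
    swap in the returned copy (a hypothetical policy, not from the claims).
    """
    staging = {name: list(entries) for name, entries in live_word_set.items()}
    return add(staging, pending, download)
```

Working on a separate copy lets the input method keep serving candidates from the live word set while the update is in progress.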
18. A dictionary delivery system, characterized in that it comprises:
a cell dictionary generation unit, comprising: an interface module for receiving input information; a generation module for generating cell dictionaries according to the received information; and an identification module for assigning an identifier and version information to each cell dictionary; wherein the words in each cell dictionary share at least one common attribute;
a communication unit for receiving a trigger and transmitting the corresponding cell dictionary entry information to a client.
19. The dictionary delivery system as claimed in claim 18, characterized in that the cell dictionary generation unit further comprises:
a modification and update module for modifying and updating the pre-stored information of a cell dictionary, and for notifying the identification module to generate new version information for that cell dictionary.
20. The dictionary delivery system as claimed in claim 18, characterized in that it further comprises:
an identification module for comparing the server-side cell dictionary list with the client's cell dictionary list; the comparison result is used to transmit the required update data to the client.
21. The dictionary delivery system as claimed in claim 18, characterized in that:
the cell dictionary generated according to the received information stores a plurality of entries;
or, the cell dictionary generated according to the received information stores index information, the index information corresponding to other cell dictionaries.
22. The dictionary delivery system as claimed in claim 18, characterized in that it further comprises:
a merging module for merging the entry information of a plurality of cell dictionaries into one download dictionary, and for notifying the communication unit to transmit the download dictionary to the client.
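The merging module of claim 22 can be sketched as a function that packs several cell dictionaries into one download dictionary. The package layout (a `versions` map plus a flat `entries` list tagged with the source dictionary name) is a hypothetical format chosen for this sketch, not one described in the patent.

```python
def build_download_dictionary(cell_dictionaries, requested):
    """Merge the entries of the requested cell dictionaries into one package.

    `cell_dictionaries` maps a name to {"version": int, "entries": [(code, word)]}.
    Each merged entry keeps the name of the cell dictionary it came from, so
    the client can still apply per-dictionary priorities.
    """
    package = {"versions": {}, "entries": []}
    for name in requested:
        cell = cell_dictionaries[name]
        package["versions"][name] = cell["version"]
        for code, word in cell["entries"]:
            package["entries"].append((code, word, name))
    return package
```

Sending one merged package replaces several separate transfers with a single one, at the cost of re-sending the whole package if any part of it fails.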
CNB2007100994746A 2007-05-22 2007-05-22 Character input method, input method system and method for updating word stock Ceased CN100483416C (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CNB2007100994746A CN100483416C (en) 2007-05-22 2007-05-22 Character input method, input method system and method for updating word stock
PCT/CN2008/071027 WO2008141583A1 (en) 2007-05-22 2008-05-21 Character input method, input system and method for updating word lexicon

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2007100994746A CN100483416C (en) 2007-05-22 2007-05-22 Character input method, input method system and method for updating word stock

Publications (2)

Publication Number Publication Date
CN101051323A true CN101051323A (en) 2007-10-10
CN100483416C CN100483416C (en) 2009-04-29

Family

ID=38782735

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2007100994746A Ceased CN100483416C (en) 2007-05-22 2007-05-22 Character input method, input method system and method for updating word stock

Country Status (2)

Country Link
CN (1) CN100483416C (en)
WO (1) WO2008141583A1 (en)

Cited By (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008141583A1 (en) * 2007-05-22 2008-11-27 Beijing Sogou Technology Development Co., Ltd. Character input method, input system and method for updating word lexicon
CN101256557B (en) * 2008-04-16 2010-06-23 腾讯科技(深圳)有限公司 Self-defining word management apparatus and method
CN101178741B (en) * 2007-12-24 2010-06-23 腾讯科技(深圳)有限公司 Method and device for updating user's word stock
CN101840418A (en) * 2010-03-31 2010-09-22 北京搜狗科技发展有限公司 User word library synchronous update method, update server and input method system
CN101995963A (en) * 2010-11-19 2011-03-30 哈尔滨工业大学 Vocabulary self-adaption Chinese input method
CN102346743A (en) * 2010-08-02 2012-02-08 腾讯科技(深圳)有限公司 Method and device for generating candidate words
CN102346557A (en) * 2010-07-28 2012-02-08 腾讯科技(深圳)有限公司 Input method and input method system
CN102566774A (en) * 2010-12-26 2012-07-11 上海量明科技发展有限公司 Method and system for measuring user input characters to adjust levels
CN102682031A (en) * 2011-03-17 2012-09-19 新奥特(北京)视频技术有限公司 Method and system of Chinese Pin Yin search suggest based on relational database
CN101710326B (en) * 2009-12-03 2012-10-03 腾讯科技(深圳)有限公司 Word stock substitution method, device and input method system
CN102789317A (en) * 2011-05-20 2012-11-21 腾讯科技(深圳)有限公司 Method and device for accelerating text input
CN103108012A (en) * 2011-11-15 2013-05-15 腾讯科技(深圳)有限公司 Synchronized method and synchronized server for user word bank
CN103246355A (en) * 2012-02-06 2013-08-14 百度在线网络技术(北京)有限公司 On-line input method evaluation method, system and device
CN103248551A (en) * 2012-02-03 2013-08-14 腾讯科技(深圳)有限公司 Information presentation method and system
CN103389979A (en) * 2012-05-08 2013-11-13 腾讯科技(深圳)有限公司 System, device and method for recommending classification lexicon in input method
CN103473313A (en) * 2013-09-11 2013-12-25 百度在线网络技术(北京)有限公司 Establishment method and device for name dictionary of input method
CN104423621A (en) * 2013-08-22 2015-03-18 北京搜狗科技发展有限公司 Pinyin string processing method and device
CN101645065B (en) * 2008-08-05 2016-02-24 北京搜狗科技发展有限公司 Determine the method for the auxiliary lexicon needing loading, device and input method system
CN105718071A (en) * 2016-01-19 2016-06-29 努比亚技术有限公司 Terminal and method for recommending associational words in input method
CN105824436A (en) * 2015-01-06 2016-08-03 阿里巴巴集团控股有限公司 Character input method and input method system
CN105955495A (en) * 2016-04-29 2016-09-21 百度在线网络技术(北京)有限公司 Information input method and device
CN102346743B (en) * 2010-08-02 2016-12-14 深圳市世纪光速信息技术有限公司 Generate the method and device of candidate word
CN106873795A (en) * 2016-12-29 2017-06-20 北京五八信息技术有限公司 A kind of character input method, device and terminal
CN106896937A (en) * 2017-02-28 2017-06-27 百度在线网络技术(北京)有限公司 Method and apparatus for being input into information
CN106933801A (en) * 2017-02-13 2017-07-07 北京奇虎科技有限公司 The update method and device of a kind of dictionary
CN107832035A (en) * 2017-11-13 2018-03-23 赵桂银 A kind of pronunciation inputting method of intelligent terminal
CN108228620A (en) * 2016-12-14 2018-06-29 北京搜狗科技发展有限公司 A kind of Word library updating method and device
CN108256051A (en) * 2018-01-15 2018-07-06 中企动力科技股份有限公司 Website product generation method and device
CN108376129A (en) * 2018-01-24 2018-08-07 北京奇艺世纪科技有限公司 A kind of error correction method and device
CN108399013A (en) * 2018-03-16 2018-08-14 北京搜狗科技发展有限公司 A kind of user's word adding method and device
CN108628461A (en) * 2017-03-16 2018-10-09 北京搜狗科技发展有限公司 A kind of input method and device, a kind of method and apparatus of update dictionary
CN109240511A (en) * 2017-07-04 2019-01-18 北京搜狗科技发展有限公司 It is a kind of to update the method for dictionary, system and a kind of for updating the device of dictionary
CN109284228A (en) * 2018-09-25 2019-01-29 北京金山安全软件有限公司 Input method evaluation method and device, electronic equipment and storage medium
US10210148B2 (en) 2010-08-02 2019-02-19 Lenovo (Beijing) Limited Method and apparatus for file processing
CN109408815A (en) * 2018-10-09 2019-03-01 苏州思必驰信息科技有限公司 Dictionary management method and system for voice dialogue platform
CN109542248A (en) * 2018-11-16 2019-03-29 上海二三四五网络科技有限公司 A kind of control method and control device of incremental update dictionary data
CN114637407A (en) * 2020-12-15 2022-06-17 中国联合网络通信集团有限公司 Chinese character input method, system, computer device and storage medium

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9600566B2 (en) 2010-05-14 2017-03-21 Microsoft Technology Licensing, Llc Identifying entity synonyms
US10032131B2 (en) 2012-06-20 2018-07-24 Microsoft Technology Licensing, Llc Data services for enterprises leveraging search system data assets
US9594831B2 (en) 2012-06-22 2017-03-14 Microsoft Technology Licensing, Llc Targeted disambiguation of named entities
CN103825952B (en) * 2014-03-04 2017-07-04 百度在线网络技术(北京)有限公司 Cell dictionary method for pushing and server
CN109725736B (en) * 2017-10-27 2023-02-28 北京搜狗科技发展有限公司 Candidate sorting method and device and electronic equipment
CN112987941B (en) * 2019-12-17 2024-02-13 北京搜狗科技发展有限公司 Method and device for generating candidate words
CN111581971B (en) * 2020-06-04 2024-01-23 腾讯科技(深圳)有限公司 Word stock updating method, device, terminal and storage medium
CN114594861B (en) * 2020-12-03 2024-10-29 北京搜狗科技发展有限公司 Recommendation method and device and electronic equipment
CN115407882A (en) * 2022-07-13 2022-11-29 穆运洋 Visualization-based big data analysis and arrangement system

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1114066A (en) * 1994-05-08 1995-12-27 刘树根 Sense sgroup input, editing and word code
CN1494025A (en) * 2002-10-31 2004-05-05 英业达股份有限公司 Input method of Chinese character having classification thesaurus and its system
US7421386B2 (en) * 2003-10-23 2008-09-02 Microsoft Corporation Full-form lexicon with tagged data and methods of constructing and using the same
CN1560767A (en) * 2004-02-24 2005-01-05 珠海市汉易通信息科技有限公司 Automatic fully adding method for word input
CN100424703C (en) * 2006-08-23 2008-10-08 北京搜狗科技发展有限公司 Method for obtaining newly encoded character string, input method system and word stock generation device
CN100483416C (en) * 2007-05-22 2009-04-29 北京搜狗科技发展有限公司 Character input method, input method system and method for updating word stock

Cited By (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008141583A1 (en) * 2007-05-22 2008-11-27 Beijing Sogou Technology Development Co., Ltd. Character input method, input system and method for updating word lexicon
CN101178741B (en) * 2007-12-24 2010-06-23 腾讯科技(深圳)有限公司 Method and device for updating user's word stock
CN101256557B (en) * 2008-04-16 2010-06-23 腾讯科技(深圳)有限公司 Self-defining word management apparatus and method
CN101645065B (en) * 2008-08-05 2016-02-24 北京搜狗科技发展有限公司 Determine the method for the auxiliary lexicon needing loading, device and input method system
CN101645088B (en) * 2008-08-05 2016-06-01 北京搜狗科技发展有限公司 Determine the method for auxiliary lexicon, device and the input method system that need to load
CN101710326B (en) * 2009-12-03 2012-10-03 腾讯科技(深圳)有限公司 Word stock substitution method, device and input method system
CN101840418A (en) * 2010-03-31 2010-09-22 北京搜狗科技发展有限公司 User word library synchronous update method, update server and input method system
CN102346557A (en) * 2010-07-28 2012-02-08 腾讯科技(深圳)有限公司 Input method and input method system
CN102346557B (en) * 2010-07-28 2016-08-03 深圳市世纪光速信息技术有限公司 A kind of input method and input method system
US10210148B2 (en) 2010-08-02 2019-02-19 Lenovo (Beijing) Limited Method and apparatus for file processing
CN102346743A (en) * 2010-08-02 2012-02-08 腾讯科技(深圳)有限公司 Method and device for generating candidate words
CN102346743B (en) * 2010-08-02 2016-12-14 深圳市世纪光速信息技术有限公司 Generate the method and device of candidate word
CN101995963A (en) * 2010-11-19 2011-03-30 哈尔滨工业大学 Vocabulary self-adaption Chinese input method
CN102566774A (en) * 2010-12-26 2012-07-11 上海量明科技发展有限公司 Method and system for measuring user input characters to adjust levels
CN102682031A (en) * 2011-03-17 2012-09-19 新奥特(北京)视频技术有限公司 Method and system of Chinese Pin Yin search suggest based on relational database
CN102789317A (en) * 2011-05-20 2012-11-21 腾讯科技(深圳)有限公司 Method and device for accelerating text input
CN102789317B (en) * 2011-05-20 2016-05-25 腾讯科技(深圳)有限公司 A kind of method and apparatus of accelerating text input
CN103108012B (en) * 2011-11-15 2019-11-19 深圳市世纪光速信息技术有限公司 A kind of user thesaurus synchronous method and user thesaurus sync server
CN103108012A (en) * 2011-11-15 2013-05-15 腾讯科技(深圳)有限公司 Synchronized method and synchronized server for user word bank
CN103248551A (en) * 2012-02-03 2013-08-14 腾讯科技(深圳)有限公司 Information presentation method and system
CN103246355A (en) * 2012-02-06 2013-08-14 百度在线网络技术(北京)有限公司 On-line input method evaluation method, system and device
CN103246355B (en) * 2012-02-06 2017-04-05 百度在线网络技术(北京)有限公司 On-line input method evaluating method, system and device
WO2013166949A1 (en) * 2012-05-08 2013-11-14 Shenzhen Shi Ji Guang Su Information Technology Co., Ltd. System, apparatus and method for recommending thesaurus in input method
US9984328B2 (en) 2012-05-08 2018-05-29 Shenzhen Shi Ji Guang Su Information Technology Co., Ltd. System, apparatus and method for recommending thesaurus in input method
CN103389979A (en) * 2012-05-08 2013-11-13 腾讯科技(深圳)有限公司 System, device and method for recommending classification lexicon in input method
CN103389979B (en) * 2012-05-08 2018-10-12 深圳市世纪光速信息技术有限公司 Recommend system, the device and method of classified lexicon in input method
CN104423621A (en) * 2013-08-22 2015-03-18 北京搜狗科技发展有限公司 Pinyin string processing method and device
CN103473313A (en) * 2013-09-11 2013-12-25 百度在线网络技术(北京)有限公司 Establishment method and device for name dictionary of input method
CN103473313B (en) * 2013-09-11 2017-01-18 百度在线网络技术(北京)有限公司 Establishment method and device for name dictionary of input method
CN105824436A (en) * 2015-01-06 2016-08-03 阿里巴巴集团控股有限公司 Character input method and input method system
CN105718071A (en) * 2016-01-19 2016-06-29 努比亚技术有限公司 Terminal and method for recommending associational words in input method
CN105955495A (en) * 2016-04-29 2016-09-21 百度在线网络技术(北京)有限公司 Information input method and device
CN108228620A (en) * 2016-12-14 2018-06-29 北京搜狗科技发展有限公司 A kind of Word library updating method and device
CN106873795A (en) * 2016-12-29 2017-06-20 北京五八信息技术有限公司 A kind of character input method, device and terminal
CN106933801A (en) * 2017-02-13 2017-07-07 北京奇虎科技有限公司 The update method and device of a kind of dictionary
CN106896937A (en) * 2017-02-28 2017-06-27 百度在线网络技术(北京)有限公司 Method and apparatus for being input into information
CN108628461B (en) * 2017-03-16 2022-07-08 北京搜狗科技发展有限公司 Input method and device and method and device for updating word stock
CN108628461A (en) * 2017-03-16 2018-10-09 北京搜狗科技发展有限公司 A kind of input method and device, a kind of method and apparatus of update dictionary
CN109240511A (en) * 2017-07-04 2019-01-18 北京搜狗科技发展有限公司 It is a kind of to update the method for dictionary, system and a kind of for updating the device of dictionary
CN107832035A (en) * 2017-11-13 2018-03-23 赵桂银 A kind of pronunciation inputting method of intelligent terminal
CN107832035B (en) * 2017-11-13 2021-03-12 深圳市矽昊智能科技有限公司 Voice input method of intelligent terminal
CN108256051A (en) * 2018-01-15 2018-07-06 中企动力科技股份有限公司 Website product generation method and device
CN108376129A (en) * 2018-01-24 2018-08-07 北京奇艺世纪科技有限公司 A kind of error correction method and device
CN108376129B (en) * 2018-01-24 2022-04-22 北京奇艺世纪科技有限公司 Error correction method and device
CN108399013A (en) * 2018-03-16 2018-08-14 北京搜狗科技发展有限公司 A kind of user's word adding method and device
CN108399013B (en) * 2018-03-16 2022-08-09 北京搜狗科技发展有限公司 User word adding method and device
CN109284228A (en) * 2018-09-25 2019-01-29 北京金山安全软件有限公司 Input method evaluation method and device, electronic equipment and storage medium
CN109408815A (en) * 2018-10-09 2019-03-01 苏州思必驰信息科技有限公司 Dictionary management method and system for voice dialogue platform
CN109542248A (en) * 2018-11-16 2019-03-29 上海二三四五网络科技有限公司 A kind of control method and control device of incremental update dictionary data
CN114637407A (en) * 2020-12-15 2022-06-17 中国联合网络通信集团有限公司 Chinese character input method, system, computer device and storage medium

Also Published As

Publication number Publication date
WO2008141583A1 (en) 2008-11-27
CN100483416C (en) 2009-04-29

Similar Documents

Publication Publication Date Title
CN101051323A (en) Character input method, input method system and method for updating word stock
EP1450267B1 (en) Methods and systems for language translation
CN1180369C (en) Equipment and method for input of character string
CN1871603A (en) System and method for processing a query
CN1197962A (en) Speech recognition device, method and recording medium for storing program of the speech recognition device
CN1815477A (en) Method and system for providing semantic subjects based on mark language
CN1924858A (en) Method and device for fetching new words and input method system
CN1290901A (en) Method and system for text substitute mode formed by random input source
US20110231411A1 (en) Topic Word Generation Method and System
CN101038550A (en) Information processing apparatus and information processing method
CN1282928A (en) Method and system for providing national language inquiry service
CN1841362A (en) Method and system for processing electronic forms
CN1530860A (en) Data searching system and method by mere label in whole
CN1441371A (en) Character input device
CN1784680A (en) Progressive relaxation of search criteria
CN101035125A (en) Method and system for processing Web service requests
CN1855039A (en) Method and system for creating, storing, managing and consuming culture-dedicated data
CN1679022A (en) Systems and methods of building and using custom word lists
CN1685341A (en) Blinking annotation callouts highlighting cross language search results
CN1328668A (en) System and method for specifying www site
CN101055588A (en) Method for catching limit word information, optimizing output and input method system
CN104428770A (en) Contextual query adjustments using natural action input
CN1877583A (en) Accessing identification index system and accessing identification index library generation method
CN1492367A (en) Inquire/response system and inquire/response method
CN1510595A (en) Dictionary updating system, updating processing servo, terminal, controlling method, program, recording medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
IW01 Full invalidation of patent right

Decision date of declaring invalidation: 20161020

Decision number of declaring invalidation: 30260

Granted publication date: 20090429
