CN112650846B - Question and answer intention knowledge base construction system and method based on question frame - Google Patents

Question and answer intention knowledge base construction system and method based on question frame Download PDF

Info

Publication number
CN112650846B
CN112650846B CN202110040888.1A CN202110040888A CN112650846B CN 112650846 B CN112650846 B CN 112650846B CN 202110040888 A CN202110040888 A CN 202110040888A CN 112650846 B CN112650846 B CN 112650846B
Authority
CN
China
Prior art keywords
question
sentence
frame
answer
target word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110040888.1A
Other languages
Chinese (zh)
Other versions
CN112650846A (en
Inventor
侯志强
柳晶晶
刘锋
谭培波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Zhitong Yunlian Technology Co ltd
Original Assignee
Beijing Zhitong Yunlian Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Zhitong Yunlian Technology Co ltd filed Critical Beijing Zhitong Yunlian Technology Co ltd
Priority to CN202110040888.1A priority Critical patent/CN112650846B/en
Publication of CN112650846A publication Critical patent/CN112650846A/en
Application granted granted Critical
Publication of CN112650846B publication Critical patent/CN112650846B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Physics (AREA)
  • Human Computer Interaction (AREA)
  • Machine Translation (AREA)

Abstract

The invention provides a question and answer intention knowledge base construction system and method based on a question frame. Comprising the following steps: the data layer comprises a question corpus, a frame element dictionary and a question and answer intention knowledge base; the method is used for storing files, reading and writing files and modifying files; the processing layer comprises a frame element processing module and a question and answer rewriting module and is used for rewriting a question; the application layer comprises a question analysis module and is used for outputting candidate target word strings formed by rewriting the questions. The question and answer intention knowledge base construction system and method based on the question frame improve the problems that in the prior art, the frame elements are difficult to identify and the question and answer form of question analysis cannot be automatically obtained.

Description

Question and answer intention knowledge base construction system and method based on question frame
Technical Field
The invention relates to the technical field of construction of question and answer intention knowledge bases, in particular to a question and answer intention knowledge base construction system and method based on a question frame.
Background
Sentence intent (sentence frame) is the meaning of sentences in the real world, which is a semantic meaning, and a method of frame semantics (FrameNet) is generally adopted, the name and frame elements of a frame are determined according to the scene where the frame is located, and the target words of the frame are defined according to predicates or verbs in the sentences. This definition of the target word of the whole sentence in a part of the predicate or verb and the determination of the frame element in practice presents the following problems:
(1) The entity ambiguity cannot be resolved and the frame element cannot be identified
Such as "what is the dam 3 well depth", where "dam 3" and "dam 3 well" are both 2 truly existing but disparate types of entities, then the entity in question is exactly "dam 3 well depth" or "dam 3 well depth? This ambiguity problem cannot be resolved at the word level, and the actual intent and elements of the questioner can be corrected out only at the higher sentence level through the knowledge base.
(2) Non-verb sentence unrecognizable frame
English is a language based on verbs, so that the frame and the frame element are successfully identified based on verbs, but the method is clear, but target words cannot be defined when the intention (frame) of a question is identified, so that the frame and the frame element of the question cannot be identified, and analysis of the question and the answer cannot be performed.
For example, "depth of hole of dam 3", the meaning of this sentence in question and answer scene is very clear, namely "depth of hole" of each well that this gas gathering station of question "dam 3" contains, but there is no verb but noun in question, NLP can't discern the frame and frame element of this sentence.
(3) The frame cannot be identified without the sequence of the participatory words
The method of slot position is used for removing the entity in the sentence, and the left virtual word sequence is used as the target word to carry out frame recognition, so that the result only uses half of information, and therefore, the frame and frame elements of the sentence cannot be recognized.
For sentences with the works such as ' dam 3 well depth ' what is ' can be obtained by removing the entity words ' dam 3 ', ' well depth ' and reserving the slot position positions ', what is ' is, the frame of the sentence can be identified through the target word, but for sentences without works such as ' dam 3 well depth ', the works cannot be identified by the works sequence method.
(4) Parsing of questions cannot automatically obtain forms of answers
Because the question and the answer are presented in pairs, the question and the answer are different, so that the consistency of the words, the language and the semantics of the question and the answer is maintained. However, if the question is independently processed without considering the answer, a smooth answer form conforming to the scene and the semantics cannot be obtained.
Disclosure of Invention
The invention aims to provide a question and answer intention knowledge base construction method based on a question frame, which can solve the problems that in the prior art, frame elements are difficult to identify and question analysis answer forms cannot be automatically obtained.
In order to achieve the above object, the present invention provides the following technical solutions:
A question and answer intention knowledge base construction system based on a question framework, comprising: the data layer comprises a question corpus, a frame element dictionary and a question and answer intention knowledge base; the method is used for storing files, reading and writing files and modifying files;
the processing layer comprises a frame element processing module and a question and answer rewriting module and is used for rewriting sentences;
and the application layer comprises a question analysis module and is used for outputting candidate target word strings formed by rewriting sentences.
Based on the technical scheme, the invention can also be improved as follows:
Further, the question corpus comprises sequence numbers, question sources and questions, and the questions are used for recording relevant information of the questions.
Further, the format of the frame element dictionary comprises the name of the frame and the code number of the frame element; the framework element dictionary comprises question analysis, wherein the question analysis comprises a first layer and a second layer, the first layer is used for sequence analysis, and the second layer is used for implication relation and hierarchical structure analysis.
Further, the question-answering intention knowledge base comprises a question target word string and a question-answering intention analysis, wherein the question-answering intention analysis comprises a first part and a second part, the first part is a name of a frame, and the second part is an answer template.
Further, the frame element processing module is used for searching out word strings from the frame element dictionary.
Further, the rewrite module is used for replacing the word strings in the sentences to finish the rewrite of the sentences, and each rewritten sentence is added into the rewritten sentence set as a new original sentence to be accumulated until all the frame element character strings are used, so as to obtain a rewritten sentence set.
Further, the question analysis module is used for establishing a read-in question list, and outputting the frame target word strings formed by the rewritten sentences in a reverse ordering mode according to the word string length.
A method for constructing a question and answer intention knowledge base based on a question frame specifically comprises the following steps:
s101, constructing a sentence frame element dictionary according to the frame element dictionary and the question file;
s102, circulating the sentence frame element dictionary;
s103, circulating the existing sentence target word string set to form a new candidate target word string set, and reserving sentences in the new candidate target word string set;
s104, replacing corresponding words in the target word strings with the sentence frame element dictionary, and updating the candidate target word string set;
S105, sorting according to the lengths of the candidate target word strings, and outputting the candidate target word strings.
Further, constructing the sentence frame element dictionary specifically includes: and searching each sentence in the question file, and collecting the sentences in the frame element dictionary to form a sentence frame element dictionary when the words in the frame element dictionary appear in the sentences.
The invention has the following advantages:
According to the question and answer intention knowledge base construction system and method based on the question frame, information of all words and word sequences in the question is adopted to the greatest extent, the semantics of the target word strings of the question are reserved to the greatest extent, word-level object ambiguity can be effectively eliminated, recognition of the frame name of the question and analysis of frame elements and generation of answer sentences can be realized through the target word strings of the question, and analysis of the question and answer intention in the question and answer system is solved; the method solves the problems that in the prior art, the frame elements are difficult to identify and the answer sentence form of question analysis cannot be obtained automatically.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings that are needed in the description of the embodiments or the prior art will be briefly described, and it is obvious that the drawings in the description below are some embodiments of the present invention, and other drawings can be obtained according to the drawings without inventive effort for a person skilled in the art.
FIG. 1 is a schematic diagram of a knowledge base construction system for question and answer purposes in an embodiment of the invention;
FIG. 2 is a flowchart of a method for constructing a knowledge base of question and answer intents in an embodiment of the invention;
FIG. 3 is a schematic diagram of the definition of the frame element and the encoding rule of the question and answer frame name in the embodiment of the invention;
FIG. 4 is a schematic diagram of a format of a corpus of questions in an embodiment of the present invention;
FIG. 5 is a diagram of a framework element dictionary format in an embodiment of the present invention;
Fig. 6 is a schematic diagram of a knowledge base format of a question and answer chart in an embodiment of the invention.
Reference numerals illustrate:
The system comprises a data layer 10, a question corpus 101, a frame element dictionary 102, a question and answer intention knowledge base 103, a processing layer 20, a frame element processing module 201, a question and answer rewrite module 202, an application layer 30 and a question analysis module 301.
Detailed Description
The technical solutions of the present invention will be clearly and completely described in connection with the embodiments, and it is apparent that the described embodiments are some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
As shown in fig. 1, the present invention provides a question and answer intention knowledge base construction system based on a question framework, comprising:
a data layer 10 including a corpus of questions 101, a dictionary of frame elements 102, and a knowledge base of questions and answers 103; the method is used for storing files, reading and writing files and modifying files;
The processing layer 20 comprises a frame element processing module 201 and a question and answer sentence rewriting module 202, and is used for rewriting sentences;
The application layer 30 includes a question analysis module 301 for outputting candidate target word strings formed by rewritten sentences.
Sentence intent (sentence frame) is the meaning of sentences in the real world, which is a semantic meaning, and a method of frame semantics (FrameNet) is generally adopted, the name and frame elements of a frame are determined according to the scene where the frame is located, and the target words of the frame are defined according to predicates or verbs in the sentences.
For example, a frame named "talk and communicate" is defined in a social scenario, and generally has frame elements such as "sender", "receiver", "information", "medium", etc., and verbs included in sentences such as "say", "talk", "command", "tell", "discuss", "remind", "ask", "promise", "warn", "threat", etc. are all target words of the frame.
For a specific sentence, the frame to which the sentence belongs is determined by identifying the target word. The target words such as "Zhang Sanzhu Liqu airport where", "tell" indicate that the sentence belongs to the "talk-around interaction" frame, and "Zhang Sanzhu", "Liqu", "airport where" etc. are frame elements of the frame.
The entity words in the question sentence are replaced by the frame elements to form a complete word string combining virtual and real, so that the information of all words in the sentence is utilized to the maximum extent, and the complete sentence frame target word string is formed.
The specific operation is that firstly, the whole question-answer (including question and answer) frame and the question-answer frame element codes (such as frame number F111, frame element code T, time, O object and P parameter) are determined according to the question.
Secondly, constructing a frame element dictionary 102 according to the frames and the frame element codes of sentences; then, the corresponding entity words in the original question and answer are replaced by the frame element codes, and the question and answer are rewritten (for example, "how much" is rewritten to "how much the TOP is the F111# TOP is the Q" in month 3 in 2020).
Defining the rewritten question as a target word under the selected frame for identifying the frame, and forming a question-answer frame corresponding to the sentence by taking the rewritten answer as a question-answer template and the question; and putting the question-answering frameworks of all the questions together to form the whole question-answering knowledge base.
Further, as shown in fig. 4, the corpus of questions 101 includes a sequence number, a source of the questions, and questions for recording related information of the questions.
The method is used for recording question related information, and the information can be expanded, such as adding information of regions, questioners and the like, so as to prepare for more accurate questions and answers in the future.
Further, as shown in fig. 5, the format of the frame element dictionary 102 includes the frame name and the frame element code; the framework element dictionary 102 includes question parsing including a first layer for sequence parsing and a second layer for implication relationship and hierarchical parsing in consideration of granularity of time in chinese natural language and implication relationship of natural language such as "how much oil is produced". The definition of the frame names and the frame elements of the questions and answers are shown in fig. 2, wherein the questions and answers define TOPVM elements, and the answers define TOPVMQ and other 6 elements.
Further, as shown in the question-answer diagram knowledge base format of fig. 6, the question-answer knowledge base 103 includes a question target word string and a question-answer intention parse including a first portion and a second portion, where the question-answer intention parse includes a first portion and a second portion separated by "@ @" where the first portion is a frame name and the second portion is a question template.
Further, the frame element processing module 201 is configured to find a word string from the frame element dictionary 102.
Further, the rewrite module is used for replacing the word strings in the sentences to finish the rewrite of the sentences, and each rewritten sentence is added into the rewritten sentence set as a new original sentence to be accumulated until all the frame element character strings are used, so as to obtain a rewritten sentence set.
The target word string set of the original question sentence after being rewritten not only comprises a sentence pattern constructed by the virtual word part of the original sentence, but also comprises a frame or a slot constructed by the real word part, so that the method utilizes the information in the sentence and is the most complete construction mode of the target word of the sentence subframe.
Further, the question analysis module 301 is configured to establish a read-in question list, and output a frame target word string formed by rewriting the question in a reverse order according to the word string length. For people to select and verify proper frame target word strings. For a simple question, "what the gas production of 401-1 well is", because the number of dictionaries is large, typically on the order of 100 tens of thousands, 163 possible frame target word strings will be output, with only "what the OP is" being the correct question frame target word string. The more complex the question, the more frame elements are included, and the greater the number of frame target word strings that are output.
A method for constructing a question and answer intention knowledge base based on a question frame specifically comprises the following steps:
s101, constructing a sentence frame element dictionary 102;
In this step, a sentence frame element dictionary 102 is constructed from the frame element dictionary 102 and the question file; the frame element dictionary 102 is opened and the entire dictionary is read into memory to speed up processing. Opening an input question file to process each sentence in the question file; sentences in the question file are read in terms of periods. The frame elements corresponding to each question in the sentence frame element dictionary 102dic are constructed differently, and therefore, each sentence needs to be searched separately.
Whenever a word in the frame element dictionary 102 appears in a sentence, it is collected in the sentence frame element dictionary 102. This sentence-frame element dictionary 102 is large in number and may have complex inclusion relationships, such as "1" may be month data or may be physical well number, which requires full revenue into the sentence-frame element dictionary 102.
S102, cycling the sentence frame element dictionary 102;
in this step, the sentence frame element dictionary 102 is cycled; cycling according to the sentence frame element dictionary 102 dic; a loop of the entire element dictionary is constituted. Each occurrence in the dictionary should be exhausted, and the possibility of each sentence rewriting construction candidate target word string is checked.
S103, circulating the existing sentence target word string set;
In the step, circulating the existing sentence target word string set to form a new candidate target word string set, and reserving sentences in the new candidate target word string set; cycling the existing sentence target word string set; and updating the single element dictionary by the target word string set. The existing set of sentence target word strings is a variable, increasing set because the number of candidate target word string sets will increase every time a sentence replaces a word in an element dictionary.
Here, all the target word strings are updated once. Retaining sentences in the new candidate target word string set; the original sentence is always kept in the target candidate word string set, and expansion of the candidate word string set is realized. And the automatic duplication removal of sentences is realized by adopting a new collection mode.
S104, updating the candidate target word string set;
In this step, the sentence frame element dictionary 102 is used to replace the corresponding word in the target word string, and the candidate target word string set is updated; replacing corresponding words in the target word string with the sentence-frame element dictionary 102 dic; sentences with new word substitutions are also added to the target word string set. It should be noted that if there are several identical element words in the sentence, only one element word can be replaced at a time, and the rest element words to be replaced later can be replaced.
The processing of the next element word is formed together with the step S102, and the key point is that the target word string set is an updated new set, and all non-repeated candidate target word strings processed from the original sentence to each dictionary in the middle are accumulated;
s105, outputting candidate target word strings;
In the step, the candidate target word strings are output according to the length sequence of the candidate target word strings.
Further, constructing the sentence frame element dictionary 102 specifically includes: each sentence in the question file is searched, and when a word in the frame element dictionary 102 appears in the sentence, the sentence is collected in the frame element dictionary 102 to form a sentence frame element dictionary 102.
Sequencing according to the length of the candidate target word strings; the candidate target word strings are ordered from small to large in length because, in general, the longer word semantics are more definite the greater the number of words, and therefore, after substitution, the shorter the entire target word string is, the more likely it is that the correct target word string is required. Such ordering may reduce the time for a person to find the correct target word string. Outputting all candidate target word string files; and constructing a target word string sequence for each question, and then recording the target word string sequence in a file for outputting for manual inspection.
The construction method of the question and answer intention knowledge base 103 based on the question frame comprises the following steps:
When in use, an operator constructs a sentence frame element dictionary 102 according to the frame element dictionary 102 and the question file; cycling the sentence frame element dictionary 102; cycling the existing sentence target word string set to form a new candidate target word string set, and reserving sentences in the new candidate target word string set; replacing corresponding words in the target word strings with the sentence frame element dictionary 102 to update the candidate target word string set; and sequencing according to the lengths of the candidate target word strings, and outputting the candidate target word strings.
In the description of the present invention, it should be understood that the terms "center", "longitudinal", "lateral", "length", "width", "thickness", "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", "clockwise", "counterclockwise", etc. indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings are merely for convenience in describing the present invention and simplifying the description, and do not indicate or imply that the device or element referred to must have a specific orientation, be configured and operated in a specific orientation, and thus should not be construed as limiting the present invention.
Furthermore, the terms "first," "second," and the like, are used for descriptive purposes only and are not to be construed as indicating or implying a relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include one or more of the stated features. In the description of the present invention, the meaning of "a plurality" is two or more, unless explicitly defined otherwise. Furthermore, the terms "mounted," "connected," "coupled," and "connected" are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the above terms in the present invention will be understood in specific cases by those of ordinary skill in the art.
Finally, it should be noted that: the above embodiments are only for illustrating the technical solution of the present invention, and not for limiting the same; although the invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some or all of the technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit of the invention.

Claims (8)

1. A question and answer intention knowledge base construction system based on a question framework, comprising:
The data layer comprises a question corpus, a frame element dictionary and a question and answer intention knowledge base; the method is used for storing files, reading and writing files and modifying files;
The format of the frame element dictionary comprises the name of the frame and the code number of the frame element; the frame element dictionary comprises question analysis, wherein the question analysis comprises a first layer and a second layer, the first layer is used for sequence analysis, and the second layer is used for implication relation and hierarchical structure analysis;
the processing layer comprises a frame element processing module and a question and answer rewriting module and is used for rewriting sentences;
and the application layer comprises a question analysis module and is used for outputting candidate target word strings formed by rewriting sentences.
2. The question-and-answer knowledge base construction system based on a question framework according to claim 1, wherein the question corpus comprises sequence numbers, question sources and questions for recording relevant information of the questions.
3. The question and answer knowledge base construction system based on a question framework according to claim 1, wherein the question and answer knowledge base comprises a question target word string and a question and answer analysis, the question and answer analysis comprises a first part and a second part, the first part is a name of the framework, and the second part is a question template.
4. The question and answer knowledge base construction system based on a question framework according to claim 1, wherein the framework element processing module is configured to find a word string from the framework element dictionary.
5. The system for constructing a knowledge base of question and answer based on a question and sentence framework according to claim 4, wherein the rewrite module is configured to replace a word string in the sentence with a character to complete the rewriting of the sentence, and each rewritten sentence is added as a new original sentence to a rewritten sentence set for accumulation until all the framework element character strings are used, so as to obtain a rewritten sentence set.
6. The question-answering intention knowledge base construction system based on a question framework according to claim 5, wherein the question analysis module is configured to build a read-in question list, and output a frame target word string formed by the rewritten sentences in a reverse order according to a word string length.
7. A method for constructing a question and answer intention knowledge base based on a question frame is characterized by comprising the following steps:
s101, constructing a sentence frame element dictionary according to the frame element dictionary and the question file;
s102, circulating the sentence frame element dictionary;
s103, circulating the existing sentence target word string set to form a new candidate target word string set, and reserving sentences in the new candidate target word string set;
s104, replacing corresponding words in the target word strings with the sentence frame element dictionary, and updating the candidate target word string set;
S105, sorting according to the lengths of the candidate target word strings, and outputting the candidate target word strings.
8. The method for constructing a knowledge base of question and answer intentions based on a frame of questions as claimed in claim 7, wherein constructing a dictionary of sentence frame elements comprises: and searching each sentence in the question file, and collecting the sentences in the frame element dictionary to form a sentence frame element dictionary when the words in the frame element dictionary appear in the sentences.
CN202110040888.1A 2021-01-13 2021-01-13 Question and answer intention knowledge base construction system and method based on question frame Active CN112650846B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110040888.1A CN112650846B (en) 2021-01-13 2021-01-13 Question and answer intention knowledge base construction system and method based on question frame

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110040888.1A CN112650846B (en) 2021-01-13 2021-01-13 Question and answer intention knowledge base construction system and method based on question frame

Publications (2)

Publication Number Publication Date
CN112650846A CN112650846A (en) 2021-04-13
CN112650846B true CN112650846B (en) 2024-08-23

Family

ID=75367975

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110040888.1A Active CN112650846B (en) 2021-01-13 2021-01-13 Question and answer intention knowledge base construction system and method based on question frame

Country Status (1)

Country Link
CN (1) CN112650846B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114626384A (en) * 2022-03-01 2022-06-14 京东科技信息技术有限公司 Dialogue processing method, question rewriting model training method and device

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111428483A (en) * 2020-03-31 2020-07-17 华为技术有限公司 Voice interaction method and device and terminal equipment

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101097573B (en) * 2006-06-28 2010-06-09 腾讯科技(深圳)有限公司 Automatically request-answering system and method
CN101470700A (en) * 2007-12-28 2009-07-01 日电(中国)有限公司 Text template generator, text generation equipment, text checking equipment and method thereof
JP5829000B2 (en) * 2008-08-20 2015-12-09 株式会社ユニバーサルエンターテインメント Conversation scenario editing device
CN103020040A (en) * 2011-09-27 2013-04-03 富士通株式会社 Rewriting processing method and equipment of source languages, and machine translation system
JP5439606B1 (en) * 2012-09-07 2014-03-12 株式会社東芝 Structured document management apparatus, method and program
US9436288B2 (en) * 2013-05-17 2016-09-06 Leap Motion, Inc. Cursor mode switching
CN104715168B (en) * 2015-02-13 2018-10-09 陈佳阳 A kind of file security management and control based on digital finger-print and the method and system traced to the source
JP6767224B2 (en) * 2016-09-29 2020-10-14 株式会社東芝 Communication devices, communication methods, and communication programs
JP6652045B2 (en) * 2016-12-26 2020-02-19 トヨタ自動車株式会社 Self-driving vehicle
CN109002432B (en) * 2017-06-07 2022-01-04 北京京东尚科信息技术有限公司 Synonym mining method and device, computer readable medium and electronic equipment
CN107748757B (en) * 2017-09-21 2021-05-07 北京航空航天大学 Question-answering method based on knowledge graph
CN108073708A (en) * 2017-12-20 2018-05-25 北京百度网讯科技有限公司 Information output method and device
CN109408627B (en) * 2018-11-15 2021-03-02 众安信息技术服务有限公司 Question-answering method and system fusing convolutional neural network and cyclic neural network
CN110765277B (en) * 2019-10-22 2023-06-09 河海大学常州校区 Knowledge-graph-based mobile terminal online equipment fault diagnosis method
CN111949774A (en) * 2020-07-08 2020-11-17 深圳鹏锐信息技术股份有限公司 Intelligent question answering method and system
CN111930906A (en) * 2020-07-29 2020-11-13 北京北大软件工程股份有限公司 Knowledge graph question-answering method and device based on semantic block
CN112069298B (en) * 2020-07-31 2024-06-14 杭州远传新业科技股份有限公司 Man-machine interaction method, device and medium based on semantic web and intention recognition
CN111708874B (en) * 2020-08-24 2020-11-13 湖南大学 Man-machine interaction question-answering method and system based on intelligent complex intention recognition
CN112035610B (en) * 2020-08-28 2023-04-07 康键信息技术(深圳)有限公司 Medical field question and answer pair generation method and device, computer equipment and medium

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111428483A (en) * 2020-03-31 2020-07-17 华为技术有限公司 Voice interaction method and device and terminal equipment

Also Published As

Publication number Publication date
CN112650846A (en) 2021-04-13

Similar Documents

Publication Publication Date Title
US11151179B2 (en) Method, apparatus and electronic device for determining knowledge sample data set
US5559693A (en) Method and apparatus for efficient morphological text analysis using a high-level language for compact specification of inflectional paradigms
US7620538B2 (en) Constructing a translation lexicon from comparable, non-parallel corpora
CN117076653B (en) Knowledge base question-answering method based on thinking chain and visual lifting context learning
US9047275B2 (en) Methods and systems for alignment of parallel text corpora
CN106537370A (en) Method and system for robust tagging of named entities in the presence of source or translation errors
Neme et al. Pattern-and-root inflectional morphology: the Arabic broken plural
JP2005535007A (en) Synthesizing method of self-learning system for knowledge extraction for document retrieval system
KR100835706B1 (en) System and method for korean morphological analysis for automatic indexing
CN107656921B (en) Short text dependency analysis method based on deep learning
Cahill et al. Wide-coverage deep statistical parsing using automatic dependency structure annotation
CN112395425A (en) Data processing method and device, computer equipment and readable storage medium
CN113535977A (en) Knowledge graph fusion method, device and equipment
CN112650846B (en) Question and answer intention knowledge base construction system and method based on question frame
CN115390806A (en) Software design mode recommendation method based on bimodal joint modeling
Schick Transition-Based Generation from Abstract Meaning Representations
Perez-Beltrachini et al. Semantic parsing for conversational question answering over knowledge graphs
JP2001318792A (en) Intrinsic expression extraction rule generation system and method, recording medium recorded with processing program therefor, and intrinsic expression extraction device
CN111782773B (en) Text matching method and device based on cascade mode
CN115831117A (en) Entity identification method, entity identification device, computer equipment and storage medium
CN115906878A (en) Machine translation method based on prompt
CN115033706A (en) Method for automatically complementing and updating knowledge graph
CN112784019A (en) Classroom note-taking system based on knowledge graph
CN114021589A (en) Sample generation method and device, computer equipment and storage medium
RU2785207C1 (en) Method and system for automatic search and correction of errors in texts in natural language

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant