CN114416947A

CN114416947A - Relation perception similar problem identification and evaluation method, system, equipment and storage medium

Info

Publication number: CN114416947A
Application number: CN202210047822.XA
Authority: CN
Inventors: 陈恩红; 刘淇; 陈彦敏; 王皓; 黄振亚
Original assignee: University of Science and Technology of China USTC
Current assignee: University of Science and Technology of China USTC
Priority date: 2022-01-17
Filing date: 2022-01-17
Publication date: 2022-04-29
Anticipated expiration: 2042-01-17
Also published as: CN114416947B

Abstract

The invention discloses a relation perception similar problem identification and evaluation method, a system, equipment and a storage medium. The results of the prediction are improved to some extent by a plurality of evaluation indexes.

Description

Relation perception similar problem identification and evaluation method, system, equipment and storage medium

Technical Field

The invention relates to the field of natural language processing, in particular to a method, a system, equipment and a storage medium for identifying and evaluating relationship perception similar problems.

Background

Similar question identification is a core question in the field of intelligent customer service question-and-answer research. When a user proposes a new question, the intelligent customer service needs to understand the new question of the user, finds out similar questions possibly matched with the user question, obtains the best matched question through a matching model, and feeds back the best matched question to the corresponding answer of the user. The user affinity question recognition task can also be modeled as an affinity text matching task. Similar question recognition tasks are applied to many fields, such as community question and answer queries, information retrieval and intelligent customer service systems, and similar question recognition is researched as a core question of the application field. Therefore, how to solve the identification of similar problems becomes a very important basic problem.

Around this research topic, researchers have proposed a variety of solutions, and most of the related researches mainly focus on lexical, syntactic or problem structures between two problems, and the similarity of the two problems is determined by modeling the semantic relationship of the two problems.

However, similar problem matching faces a diverse challenge due to the short length of the problem and the flexibility and breadth of natural language expressions. In order to solve the diversity of the problem, some researches have introduced external knowledge such as knowledge graph, problem answer, etc. to solve the situation of insufficient diversity. However, these external knowledge have a wide range of fields or are not targeted, and cannot completely meet the supplement of problem diversity expression, so that the recognition accuracy still needs to be improved.

Disclosure of Invention

The invention aims to provide a relation perception similar problem identification and evaluation method, a system, equipment and a storage medium, which can fully utilize semantic relation information among a plurality of problem pairs to solve similar problem identification among problems and have higher prediction precision.

The purpose of the invention is realized by the following technical scheme:

a relation-aware similar problem identification and evaluation method comprises the following steps:

extracting matched semantically related question data from the data set under the condition of multiple semantic relations, wherein each matched semantically related question data is text data and comprises a verification question pair Q^uAnd Q^aAnd problem Q^aA plurality of semantic related problem sets T under different corresponding matching relations;

constructing a relation perception neural network-based similar problem recognition model, performing joint representation on each matched semantic-related problem data, and utilizing the obtained problem Q^aIs associated with the problem Q^uThe expression vector is identified and evaluated, in a training stage, a loss function is constructed by using an identification and evaluation result and an identification label, and model parameters are updated;

and in the testing stage, for a given problem pair, the trained recognition model based on the relation perception neural network similarity problem is used for recognition and evaluation.

A relationship-aware similarity problem identification and evaluation system for implementing the foregoing method, the system comprising:

a data extraction unit for extracting matched semantically related question data from the data set under multiple semantic relations, wherein each matched semantically related question data is text data and contains verification question pair Q^uAnd Q^aQuestion Q^aAnd problem Q^aA plurality of semantic related problem sets T under different corresponding matching relations;

a model construction and training unit for constructing a relation perception neural network-based similar problem recognition model, performing joint characterization on each matched semantically related problem data, and utilizing the obtained problem Q^aIs associated with the problem Q^uThe representative vector of (a) is subjected to an identification evaluation,in the training stage, a loss function is constructed by using the identification evaluation result and the identification label, and model parameters are updated;

and the recognition evaluation testing unit is used for performing recognition evaluation on a given problem pair by using the trained relation perception-based neural network similarity problem recognition model.

A processing device, comprising: one or more processors; a memory for storing one or more programs;

wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the aforementioned methods.

A readable storage medium, storing a computer program, characterized in that the computer program realizes the aforementioned method when executed by a processor.

According to the technical scheme provided by the invention, the relation perception neural network-based similar problem recognition model is used for recognizing the similar problems of the problem pairs, and compared with the traditional model, the semantic relation related information of a plurality of semantic matches is used. The results of the prediction are improved to some extent by a plurality of evaluation indexes.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on the drawings without creative efforts.

Fig. 1 is a flowchart of a method for identifying and evaluating relationship-aware similarity problems according to an embodiment of the present invention;

fig. 2 is a schematic diagram of a relationship-aware similarity problem identification and evaluation system according to an embodiment of the present invention;

fig. 3 is a schematic diagram of a processing apparatus according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention are clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.

Terms that may be used herein are first described as follows.

The terms "comprising," "including," "containing," "having," or other similar terms of meaning should be construed as non-exclusive inclusions. For example: including a feature (e.g., material, component, ingredient, carrier, formulation, material, dimension, part, component, mechanism, device, process, procedure, method, reaction condition, processing condition, parameter, algorithm, signal, data, product, or article of manufacture), is to be construed as including not only the particular feature explicitly listed but also other features not explicitly listed as such which are known in the art.

Next, the existing records of similar results are analyzed and explained.

The analysis shows that the existing data set has the following condition that the same problem and a plurality of problems have semantic relations. For example, as shown in table 1, there is a semantic relationship of such multiple matching problems in a customer service problem data set of a certain bank. Wherein Q₁Respectively and 4 questions (i.e., Q in Table 1)₂～Q₅) There is a semantic relationship. A classification label of 1 indicates that the two questions have the same semantics, and a classification label of 0 indicates that the two questions are similar, but the semantics are not the same. Specifically, Q₂And Q₃And Q₁The label of (1) is given, and Q is obtained by analyzing the question content₂And Q₃Although the expressions are different in terms of sentence patterns, the expressed basic semantics are the same and are all information about the "telephone audit time", so that Q₂And Q₃Is Q₁Multiple semantic matching problems. Q₄And Q₅And Q₁Is labeled 0, assay Q₄And Q₅It is known that Q₄And Q₅These two problems and Q₁Having the same key words but different semantics of expression, so Q₄And Q₅Is Q₁Multiple semantically similar but not matching problems.

TABLE 1 multiple semantic matching problem examples

Thus, by observing the data set, it can be seen that under a label-based classification relationship, there is a pair of Q₁A plurality of semantically matched semantically related questions. The semantic influence of the semantic matching relations on the basic problem is reflected on finer granularity by the semantic matching situations under the two different relations. In contrast to matching labels used only to distinguish problems in previous studies, in fact the matching labels of the problems reflect the implicit semantic relationship that the problem objects have between them. The matching relations have strong semantic relevance and can help the semantic understanding of the problem. In order to show the positive effects of multiple matching semantic relations on similar problem identification, the matching semantic relations are used for representing semantic information between problem pairs, and the method is the key point of research.

The relationship-aware similar problem identification and evaluation method provided by the invention is described in detail below. Details which are not described in detail in the embodiments of the invention belong to the prior art which is known to the person skilled in the art. Those not specifically mentioned in the examples of the present invention were carried out according to the conventional conditions in the art or conditions suggested by the manufacturer. The instruments used in the examples of the present invention are not indicated by manufacturers, and are all conventional products that can be obtained by commercial purchase.

As shown in fig. 1, a method for identifying and evaluating similar problems of relationship perception mainly includes the following steps:

step 1, extracting matched semantically related problem data from a data set under the condition of a plurality of semantic relations, wherein each matched semantically related problem data is text dataIncluding verification problem pair Q^uAnd Q^aAnd problem Q^aAnd a plurality of semantic related problem sets T under corresponding different matching relations.

The preferred embodiment of this step is as follows:

1) the problem of finding that there are multiple semantic matches.

In the embodiment of the invention, problems related to matching are extracted from collected data, each problem and a plurality of other problems form a similar problem matching relation, the similar problem matching relation comprises two categories, and a first category label is 1, which indicates that the two problems have the same semantic meaning and belong to a semantic matching relation; the second classification label is 0, which indicates that the two questions have the same keywords but different semantics and belong to a semantic similarity relationship.

2) The data set is partitioned.

In the embodiment of the invention, in order to further divide a plurality of semantic matching problem sets and verification model data sets, the existing data is divided into two parts: the first part is used as a matching problem knowledge base; the other part is used as the training data of the verification model; wherein the matching problem knowledge base provides Q^aWith set T, the training data of the verification model, provide problem Q^u(ii) a During training, the verification data finds a plurality of semantic matching problem sets corresponding to the problems from the matching problem knowledge base to realize semantic uniform expression of the corresponding problems.

Specifically, the method comprises the following steps: problem Q in training data^uAnd matching individual questions Q in the question knowledge base^aWhen matching, for problem Q^aIf the matching problem knowledge base M ═ S (S)₁,S₂,..) of any other problem or problem Q^aWith a classification label of 1, put the corresponding question into the set

T^pRepresents a problem Q^aN is the set T^pThe number of questions in; if the classification label is 0, put into the set

T^qRepresents a problem Q^aM is a set T^qThe number of questions in; set T ═ T^p,T^qGet it as the question Q^aAnd a plurality of semantically related problem sets under corresponding different matching relations.

3) And processing text content.

In the embodiment of the invention, each matched semantic related question data obtained in the process is participled to obtain a verification question pair Q^uAnd Q^aThe above two types of word segmentation results are used as input of model training and used for evaluation of similar problem identification of each problem pair.

Step 2, constructing a relation perception neural network-based similar problem recognition model, performing joint characterization on each matched semantically related problem data, and utilizing the obtained problem Q^aIs associated with the problem Q^uAnd in the training stage, a loss function is constructed by using the identification and evaluation result and the identification label, and the model parameters are updated.

The preferred embodiment of this step is as follows:

1) and constructing a relation-aware neural network-based similar problem identification model.

In the embodiment of the invention, the relation perception neural network-based similar problem identification model mainly comprises the following steps: the system comprises a problem representation layer, a relation perception representation layer and a problem identification evaluation layer; wherein: a) the question representation layer is used for extracting each verification question pair Q^uAnd Q^aIs represented vector, and question Q^aRepresenting vectors of each question in a plurality of semantically related question sets under corresponding different matching relations; illustratively, the processing stages may use the sequence-BERT model. b) The relation perception representation layer is used for combining the question Q^aObtaining the semantic relation with each question in the question set T to obtain the question Q^aThe relationship-aware representation vector of (a) and, in particular,this stage is to integrate multiple semantic matching relationship information into question Q^aThe expression vector (2) is a unified expression vector for obtaining the problem of relation perception. c) The question recognition and evaluation layer is used for utilizing the question Q^uAnd problem Q^aIs represented vector, and question Q^aThe relationship perception of (1) represents a vector, for a verification problem pair Q^uAnd Q^aAnd (4) performing identification and evaluation on the semantic relationship, and predicting a similar problem identification label of the verification problem pair.

The following is a description of the principles and associated processes for each layer.

a) A problem presentation layer.

All questions need to be input to the question representation layer, including the question Q mentioned earlier^uQuestion matching problem Q in the knowledge base^aAnd about problem Q^aProblem set T ═ T^p,T^q}. At the question representation level, the question is represented as a dense vector in d-dimension by having pre-trained the initial word vector. Thus, for any question to capture the context between question words, the Sennce BERT model is used to encode each question independently and map them to a dense vector, generating a semantic representation of the question.

h＝Sentence-BERT(Q)

Wherein the sequence-BERT represents a sequence BERT model, h is a representation vector of the problem Q,

| Q | is the length of the problem Q, d_hIs the dimension of the output.

For verification problem pair Q^uAnd Q^aObtaining a corresponding representation vector h by a problem representation layer_uAnd h_a(ii) a For problem set T ═ T^p,T^qObtaining semantic matching relation vector set through problem representation layer

Vector set with semantic similarity relation

b) A relationship-aware representation layer.

In the relation perception representation layer, the plurality of semantic relations comprise semantic matching relations and semantic similar relations, and each semantic relation is related to the question Q^aMultiple semantic matching problems. For multiple semantic matching problems, RNs (relational networks) are used first to capture the relationship vectors of two problems, and then RNs are expanded to discuss the integration of multiple relationship labels to semantic matching and semantic similarity.

In order to clarify the relation semantics in the question and its associated sentences, an RNs relation network structure is used to represent the relation between two objects. Expressing question Q by RNs with question as relational object^aMultiple relation vector h corresponding to the problem_k. According to the RNs model, the function of model relationship characterization:

RNs(H_r)＝f_φ(∑g_θ(h_a,h_k))

wherein, g_θAnd f_φIs a Multi-Layer Perceptron network (MLP), h_aIs a problem Q^aRepresents a vector of h_kA representative vector representing a single problem in the set of problems T; h is_k∈H^por H^q，H^p、H^qAre respectively a set T^p、T^qThe set of representation vectors of the question.

To obtain h_aAnd h_kThe interaction characteristics between the elements are realized by adopting an element-wise product method and utilizing h_aAnd h_kAnd (3) performing product operation on corresponding position elements:

thus, the function of the relational representation of the problem can be rewritten as:

wherein H_rObtaining Q about a problem through an RNs network^aThe multi-semantic matching vector obtains similar semantic features of corresponding words between two sentences by using dot product operation of elements, and obtains core interactive features expressed by a plurality of relations by using an RNs network structure.

In order to embody the implicit semantics of the matching relationship between different labels, the part extends the function of the basic problem relationship to the multi-relationship semantic matching representation. Combining the label of each question in the question set T, calculating the following matching expression function under multiple labels:

wherein,

o∈|N^r|，N^rset of semantic vectors for multiple semantic matching problems under the r-th relationship, | N^rI represents a semantic vector set N^rR represents a set of categories of labels. Intuitively, the above formula encodes the label feature vectors of multiple sentences by normalization.

Set H of vectors based on representation^pAnd H^qAnd expanding the matching expression function under the multiple labels to obtain:

wherein, W_piAnd W_qjThe learning weights under two relationships, b_piAnd b_qjIs a deviation parameter;

each being a representative vector of the corresponding problem.

Computing by adopting an element-wise product method to respectively obtain h_aAnd

h between_aAnd

the interaction characteristics between:

obtaining:

wherein v is_rIs a problem Q^aThe relationship perception of (a) represents a vector. From the above principle, the problem Q^aSemantic matching characterizations of multiple relationships are obtained through the RNs network.

c) And a problem identification and evaluation layer.

The goal of the question recognition evaluation layer is to evaluate the language match of each question pair. First, question Q^uAnd problem Q^aIs represented vector, and question Q^aRepresents a vector join, represented as:

wherein h is_u、h_a、v_rSequentially represents the problem Q^uIs a representative vector, question Q^aIs a representative vector, question Q^aThe relationship perception of (a) represents a vector.

Then, by RELU excitationOperation of live function and sigmoid function to obtain verification problem pair Q^uAnd Q^aSemantic relation R (Q)^u,Q^a) Expressed as:

o_au＝ReLU(W₁z_au+b₁),

R(Q^u,Q^a)＝σ(W₂o_au+b₂)

wherein o is_auσ () is sigmoid function, W, which is the result of the RELU activation function₁，W₂，b₁，b₂Are network parameters. Semantic relationship R (Q)^u,Q^a) I.e. the identification and evaluation result, and the similar label of the verification problem pair.

2) And (5) training a model.

In the embodiment of the present invention, all W and b parameter matrices or vectors (i.e., model parameters) involved in all the above descriptions in the last part of the constructed relationship-aware-based neural network model are trained, and a cross entropy loss function is used as a final optimization target, where the loss function is expressed as:

where θ is the model parameter to be updated, y_lIs the true identification label for the ith validation question pair, N is the number of training instances (i.e., validation question pairs);

indicating the ith pair of verification questions,

representing the identification evaluation result of l verification problem pairs, and adding l 2-norm as a training parameter considering the complexity of the model, wherein lambda is_θIs a regularization hyper-parameter.

By optimizing the above loss function, an optimum state can be learned. Illustratively, in the whole training process, Adam is used as an optimizer, and the learning rate is 0.0005; in the training process, firstly, obtained data is divided into a training data and a problem matching knowledge base according to the principle of 20% and 80%, then, in the training data, a training set, a verification set and a test set are divided according to the proportion of 60%, 20% and 20%, the training set and the verification set are used for optimizing parameters of a model, and the test set is used for verifying the model.

And 3, in the testing stage, for a given problem pair, carrying out identification evaluation by using a trained recognition model based on the relation perception neural network similarity problem.

After model training is completed through the foregoing steps, Q is paired for a given new problem^u’And Q^a’Likewise, for problem Q^a’Searching a plurality of semantically related problem sets T' under different matching relations through a matching problem knowledge base, processing text contents, inputting the processed text contents into a trained recognition model based on relation perception neural network similarity problems, and processing the processed text contents according to the above mode. And obtaining the similar labels of the new problem pairs, and realizing the similar problem identification and evaluation of the problem pairs.

According to the scheme of the embodiment of the invention, the message is subjected to problem pair similarity problem identification based on the relation perception neural network similarity problem identification model, and compared with the traditional model, a plurality of semantic relation related information matched with semantics are used. The results of the prediction are improved to some extent by a plurality of evaluation indexes.

Another embodiment of the present invention further provides a relationship-aware similar problem identification and evaluation system, which is mainly used to implement the method provided in the foregoing embodiment, as shown in fig. 2, the system mainly includes:

a data extraction unit for extracting matched semantically related question data from the data set under multiple semantic relations, wherein each matched semantically related question data is text data and contains verification question pair Q^uAnd Q^aAnd problem Q^aA plurality of semantic related problem sets T under different corresponding matching relations;

a model construction and training unit for constructing a recognition model based on the relation perception neural network similarity problemJointly characterizing each matching semantically related question data and utilizing the obtained question Q^aIs associated with the problem Q^uThe expression vector is identified and evaluated, in a training stage, a loss function is constructed by using an identification and evaluation result and an identification label, and model parameters are updated;

It will be clear to those skilled in the art that, for convenience and simplicity of description, the foregoing division of the functional modules is merely used as an example, and in practical applications, the above function distribution may be performed by different functional modules according to needs, that is, the internal structure of the system is divided into different functional modules to perform all or part of the above described functions.

In addition, the specific technical details related to each unit of the system have been described in detail in the previous embodiment of the method, and thus are not described again.

Another embodiment of the present invention further provides a processing apparatus, as shown in fig. 3, which mainly includes: one or more processors; a memory for storing one or more programs; wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the methods provided by the foregoing embodiments.

Further, the processing device further comprises at least one input device and at least one output device; in the processing device, a processor, a memory, an input device and an output device are connected through a bus.

In the embodiment of the present invention, the specific types of the memory, the input device, and the output device are not limited; for example:

the input device can be a touch screen, an image acquisition device, a physical button or a mouse and the like;

the output device may be a display terminal;

the Memory may be a Random Access Memory (RAM) or a non-volatile Memory (non-volatile Memory), such as a disk Memory.

Another embodiment of the present invention further provides a readable storage medium, which stores a computer program, and when the computer program is executed by a processor, the computer program implements the method provided by the foregoing embodiment.

The readable storage medium in the embodiment of the present invention may be provided in the foregoing processing device as a computer readable storage medium, for example, as a memory in the processing device. The readable storage medium may be various media that can store program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a magnetic disk, or an optical disk.

The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. A relation-aware similar problem identification and evaluation method is characterized by comprising the following steps:

2. The relationship-aware similar problem identification and assessment method according to claim 1, wherein said extracting matching semantically related problem data under multiple semantic relationship conditions from a data set comprises:

extracting problems related to matching from collected data, wherein each problem and a plurality of other problems form a similar problem matching relation, the similar problem matching relation comprises two categories, and a first category label is 1, indicates that the two problems have the same semantic meaning and belong to a semantic matching relation; the second classification label is 0, which indicates that the two problems have the same keywords but different semantics and belong to a semantic similarity relationship;

the extracted problem is divided into two parts: the first part is used as a matching problem knowledge base; the other part is used as the training data of the verification model; extracting problem Q in training data^uAnd matching individual questions Q in the question knowledge base^aForm a verification question pair for question Q^aIf any other questions in the question knowledge base are matched with Q^aIf the classification label of (1) is set, the set T will be put in^pIf the classification label is 0, put into the set T^q(ii) a Set T ═ T^p,T^qGet it as the question Q^aA plurality of semantic related problem sets under corresponding different matching relations;

then, each matched semantically related question data is participled to obtain a verification question pair Q^uAnd Q^aAnd the text content word segmentation result of each question in the question set T.

3. The relationship-aware similar problem identification and assessment method according to claim 1, wherein the relationship-aware neural network-based similar problem identification model comprises: the system comprises a problem representation layer, a relation perception representation layer and a problem identification evaluation layer; wherein:

the question representation layer is used for extracting each verification question pair Q^uAnd Q^aIs represented vector, and question Q^aRepresenting vectors of each question in a plurality of semantically related question sets under corresponding different matching relations;

the relation perception representation layer is used for combining the question Q^aObtaining the semantic relation with each question in the question set T to obtain the question Q^aThe relationship-aware representation vector of (1);

the question recognition and evaluation layer is used for utilizing the question Q^uAnd problem Q^aIs represented vector, and question Q^aThe relationship perception of (1) represents a vector, for a verification problem pair Q^uAnd Q^aThe semantic relationship of the user is identified and evaluated.

4. The relationship-aware similar problem identification and evaluation method according to claim 3, wherein the problem representation layer performs independent coding on each problem to generate a representation vector of the problem:

h＝Sentence-BERT(Q)

wherein h is a representative vector of the problem Q; the sequence-BERT represents the sequence BERT model;

for verification problem pair Q^uAnd Q^aObtaining a corresponding representation vector h by a problem representation layer_uAnd h_a(ii) a For problem set T ═ T^p,T^qObtaining a semantic matching relationship vector set H through a problem representation layer^pVector set H with semantic similarity relation^q(ii) a Wherein, the set T^pAnd T^qProblem of and Q^aThe classification labels of (1) and (0) respectively represent that two problems belong to a semantic matching relationship, and the classification label of (1) represents that the two problems belong to a semantic similar relationship.

5. The relation-aware similar problem identification and evaluation method according to claim 3, wherein the combined problem Q is^aExtracting the semantic relation with each question in the question set T to obtain a question Q^aStep package of relationship-aware representation vectorComprises the following steps:

representing a network characterization problem Q using RNs relationships^aThe relationship to each question in the set of questions T represents:

wherein, g_θIs a multi-layer perceptron network and is,

represents h obtained by adopting element-wise product method to carry out operation_aAnd h_kInter-working characteristics between h_aIs a problem Q^aRepresents a vector of h_kA representative vector representing a single problem in the set of problems T; h is_k∈H^por H^q，H^p、H^qAre respectively a set T^p、T^qSet of representation vectors, set T, of question^pEach problem in (1) and problem Q^aThe classification label of (1) represents that the two problems belong to a semantic matching relationship; set T^qEach problem in (1) and problem Q^aThe classification label of (2) is 0, which represents that the two problems belong to semantic similarity relation;

combining the label of each question in the question set T, calculating the following matching expression function under multiple labels:

wherein,

N^ra semantic vector set which is a plurality of semantic matching questions under the r-th relation; | N^rI represents a semantic vector set N^rR represents a set of categories of labels;

set H of vectors based on representation^pAnd H^qPerforming the matching representation function under the multiple labelsAnd (3) opening to obtain:

each being a representative vector of the respective problem;

h between_aAnd

the interaction characteristics between:

obtaining:

wherein v is_rIs a problem Q^aThe relationship perception of (a) represents a vector.

6. According to claim 3The relation-aware similar problem identification and evaluation method is characterized in that the problem Q is used^uAnd problem Q^aIs represented vector, and question Q^aThe relationship perception of (1) represents a vector, for a verification problem pair Q^uAnd Q^aThe identification and evaluation of the semantic relationship comprises the following steps:

will question Q^uAnd problem Q^aIs represented vector, and question Q^aRepresents a vector join, represented as:

wherein h is_u、h_a、v_rSequentially represents the problem Q^uIs a representative vector, question Q^aIs a representative vector, question Q^aThe relationship-aware representation vector of (1);

obtaining verification problem pairs Q through calculation of RELU activation function and sigmoid function^uAnd Q^aSemantic relation R (Q)^u,Q^a) Expressed as:

o_au＝ReLU(W₁z_au+b₁),

R(Q^u,Q^a)＝σ(W₂o_au+b₂)

wherein o is_auσ () is sigmoid function, W, which is the result of the RELU activation function₁，W₂，b₁，b₂Are network parameters.

7. The relationship-aware similarity problem identification and assessment method according to any one of claims 1 to 6, wherein said loss function is expressed as:

wherein, y_lIs the true identification tag for the ith verification problem pair, N is the trainingVerifying the number of problem pairs;

indicating the ith pair of verification questions,

denotes the result of the identification evaluation of the l verification problem pairs, lambda_θTo regularize the hyper-parameters, θ is the model parameter to be updated.

8. A relationship-aware similarity problem identification and assessment system for implementing the method of any one of claims 1 to 7, the system comprising:

a model construction and training unit for constructing a relation perception neural network-based similar problem recognition model, performing joint characterization on each matched semantically related problem data, and utilizing the obtained problem Q^aIs associated with the problem Q^uThe expression vector is identified and evaluated, in a training stage, a loss function is constructed by using an identification and evaluation result and an identification label, and model parameters are updated;

9. A processing device, comprising: one or more processors; a memory for storing one or more programs;

wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method of any of claims 1-7.

10. A readable storage medium, storing a computer program, characterized in that the computer program, when executed by a processor, implements the method according to any of claims 1 to 7.