WO2019085585A1 - Device control processing method and apparatus - Google Patents

Device control processing method and apparatus Download PDF

Info

Publication number
WO2019085585A1
WO2019085585A1 (PCT/CN2018/100489, CN2018100489W)
Authority
WO
WIPO (PCT)
Prior art keywords
user
photo
level
sound
emotion
Prior art date
Application number
PCT/CN2018/100489
Other languages
French (fr)
Chinese (zh)
Inventor
刘质斌
王九飚
周文斌
石秋成
王红霞
王琳
Original Assignee
格力电器(武汉)有限公司
珠海格力电器股份有限公司
Priority date
Filing date
Publication date
Application filed by 格力电器(武汉)有限公司 and 珠海格力电器股份有限公司
Publication of WO2019085585A1 publication Critical patent/WO2019085585A1/en

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803Home automation networks
    • H04L12/2805Home Audio Video Interoperability [HAVI] networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/63Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for estimating an emotional state
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803Home automation networks
    • H04L12/2816Controlling appliance services of a home automation network by calling their functionalities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803Home automation networks
    • H04L2012/2847Home automation networks characterised by the type of home appliance used
    • H04L2012/2849Audio/video appliances

Definitions

  • the present application relates to the field of smart homes, and in particular to a device control processing method and apparatus.
  • the embodiment of the present application provides a device control processing method and device, so as to at least solve the technical problem that the home system in the related art cannot meet the user's demand for the intelligence degree of the home system.
  • a device control processing method, including: acquiring information of a user, where the information includes at least one of: a photo of the user captured by an imaging device, and a voice of the user received by an audio device; using a model to evaluate an emotion level of the user corresponding to the information, wherein the model is trained on multiple sets of data, each set including: a photo and/or a sound of the user, and a label identifying the emotion level represented by that photo and/or sound; and sending a control command according to the emotion level, wherein the control command instructs the device to perform a predetermined operation.
  • the method further includes: sending a photo and/or a sound of the user to another user, and acquiring the label added by the other user for the photo and/or sound of the user.
  • acquiring the label added by the other user for the photo and/or sound of the user includes at least one of: sending the photo and/or sound of the user together with a plurality of selectable emotion levels to the other user, and receiving as the label the emotion level the other user selects from the plurality of emotion levels; or acquiring the other user's evaluation of the photo and/or sound of the user and extracting the emotion level from that evaluation as the label, wherein the evaluation includes at least one of: a natural-language evaluation and a voice evaluation.
  • the method further includes: after obtaining the photo and/or sound of the user, asking the user a question based on the photo and/or sound, and extracting the emotion level corresponding to the photo and/or sound from the user's answer to that question.
  • sending the control command according to the emotion level includes: sending the control command if the emotion level matches a predetermined level, wherein the control command is used to control the device to perform at least one of the following operations: playing music corresponding to the emotion level, or playing a video corresponding to the emotion level.
  • a device control processing apparatus, including: a first acquiring unit, configured to acquire information of a user, where the information includes at least one of the following: a photo of the user captured by the imaging device, and the voice of the user received through the audio device; an evaluation unit, configured to evaluate, using a model, the emotion level of the user corresponding to the information, wherein the model is trained on multiple sets of data, each set including: a photo and/or a sound of the user, and a label identifying the emotion level represented by that photo and/or sound; and a first sending unit, configured to send a control command according to the emotion level, wherein the control command instructs the device to perform a predetermined operation.
  • the apparatus further includes: a second sending unit, configured to send the photo and/or sound of the user to other users before the model evaluates the emotion level of the user corresponding to the information; and a second acquiring unit, configured to acquire the label added by the other users for the photo and/or sound of the user.
  • the second acquiring unit includes at least one of: a first sending module, configured to send the photo and/or sound of the user together with a plurality of selectable emotion levels to the other user, and to receive as the label the emotion level the other user selects from the plurality of emotion levels; and an extracting module, configured to acquire the other user's evaluation of the photo and/or sound of the user and to extract the emotion level from that evaluation as the label, wherein the evaluation includes at least one of: a natural-language evaluation and a voice evaluation.
  • the apparatus further includes: a questioning unit, configured to ask the user a question based on the photo and/or sound after the photo and/or sound is obtained and before the model evaluates the emotion level of the user corresponding to the information; and an extracting unit, configured to extract the emotion level corresponding to the photo and/or sound from the user's answer to that question.
  • the first sending unit includes: a second sending module, configured to send the control command if the emotion level matches a predetermined level, where the control command is used to control the device to perform at least one of the following operations: playing music corresponding to the emotion level, or playing a video corresponding to the emotion level.
  • a storage medium, including a stored program, wherein, when run, the program performs the device control processing method of any one of the above.
  • a processor, configured to run a program, wherein, when executed, the program performs the device control processing method of any one of the above.
  • in the embodiments of the present application, the information of the user may be acquired, where the information includes at least one of the following: a photo of the user captured by the imaging device, and a voice of the user received by the audio device; a model is used to evaluate the emotion level of the user corresponding to the information; and a control command is sent according to the emotion level, wherein the control command instructs the device to perform a predetermined operation.
  • the device control processing method provided by the embodiments of the present application thus controls the smart home system according to the user's acquired emotion, letting the user experience the convenience of modern technology and improving quality of life. This solves the technical problem that home systems in the related art cannot meet users' demands for the degree of intelligence of the home system, and improves the user experience.
  • FIG. 1 is a flowchart of a device control processing method according to an embodiment of the present application.
  • FIG. 2 is a flowchart of an adjustment mechanism of a smart home system according to an embodiment of the present application.
  • FIG. 3 is a schematic diagram of a device control apparatus according to an embodiment of the present application.
  • Pixel: the smallest unit that can be displayed on a computer screen, used as the unit of an image; a displayed image is an array of horizontal and vertical pixels. The more pixels a screen has, the higher the resolution of the image and the more delicate and realistic it appears.
  • Pixel value: the numeric value of a pixel.
  • Binarization: most pictures taken by a camera are color images, and a color image contains a huge amount of information. The color picture is therefore processed first so that it retains only foreground and background information; the foreground can simply be defined as black and the background as white, which yields the binarized image.
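The binarization step above can be sketched in a few lines of Python. This is a minimal illustration only: it assumes a grayscale image represented as a 2D list of intensities and a fixed threshold of 128, neither of which the patent specifies.

```python
# Minimal binarization sketch: every pixel below the threshold becomes black
# foreground (0), everything else becomes white background (255). The
# threshold of 128 is an illustrative assumption.

def binarize(gray_image, threshold=128):
    """Map each pixel of a 2D grayscale image to 0 (black) or 255 (white)."""
    return [
        [0 if pixel < threshold else 255 for pixel in row]
        for row in gray_image
    ]

image = [
    [12, 200, 199],
    [30,  40, 250],
]
print(binarize(image))  # [[0, 255, 255], [0, 0, 255]]
```

A production system would binarize with an image library (e.g. an adaptive threshold) rather than a fixed cutoff, but the structure of the operation is the same.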
  • Neural network algorithm: here, the process of reasoning according to logic rules. Information is first converted into concepts and represented by symbols; logical reasoning is then performed on the symbols in serial mode, and this process can be written as serial instructions for a computer to execute.
  • Voiceprint: the spectrum of a sound wave, displayed with electro-acoustic instruments, that carries speech information.
  • Voiceprint recognition: a kind of biometric technology, also known as speaker recognition, of which there are two types: speaker identification and speaker confirmation. Different tasks and applications use different voiceprint recognition techniques; for example, identification technology may be needed when narrowing the scope of a criminal investigation, whereas other applications call for confirmation technology.
  • the following embodiments may be applied to various electrical devices; the types of devices are not specifically limited and include, but are not limited to: a washing machine, an air conditioner, a refrigerator, etc. Together, these electrical devices constitute the smart home system in the embodiments of the present application.
  • the embodiments of the present application are described in detail below.
  • a method embodiment of a device control processing method is provided. It should be noted that the steps shown in the flowcharts of the accompanying drawings may be executed in a computer system, such as a set of computer-executable instructions, and that, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in an order different from the one described herein.
  • FIG. 1 is a flowchart of a device control processing method according to an embodiment of the present application. As shown in FIG. 1 , the device control processing method includes the following steps:
  • Step S102: acquire information of the user, where the information includes at least one of the following: a photo of the user captured by the imaging device, and the voice of the user received by the audio device.
  • one or more cameras may be installed in the user's home for taking a photo of the user.
  • the placement of the cameras is not specifically limited and may include, but is not limited to, the doorway, ceiling, etc. of each room in the user's home; photos of the user can be collected through cameras installed in these different locations.
  • when the user's photo is taken with the one or more cameras described above, the user may be photographed at predetermined intervals, and the user's emotion in each image may then be analyzed.
  • the category of the captured image is not specifically limited in the embodiment of the present application, and may include, but is not limited to, a black and white image (grayscale image) and a color image (RGB image).
  • the information in the image can be analyzed by binarized image processing. Specifically, multiple pixel points in the image can be compared, position by position, with the pixels of a historical image to determine the pixel points where differences exist; those differing pixel points are then separated out, so that the user's information can be extracted from the image captured by the imaging device.
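The position-by-position comparison just described can be sketched as follows. The frame representation (2D lists of pixel values) and the function name are illustrative assumptions, not details from the patent:

```python
# Frame-differencing sketch: compare the current frame with a stored
# historical frame pixel by pixel and collect the coordinates that differ,
# i.e. the region where something (e.g. the user) changed.

def changed_pixels(current, historical):
    """Return (row, col) coordinates where two equally sized frames differ."""
    return [
        (r, c)
        for r, row in enumerate(current)
        for c, pixel in enumerate(row)
        if pixel != historical[r][c]
    ]

historical = [[255, 255], [255, 255]]
current    = [[255,   0], [  0, 255]]
print(changed_pixels(current, historical))  # [(0, 1), (1, 0)]
```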
  • one or more audio devices may be installed in the user's home for receiving the voice of the user.
  • the installation location of the audio device is not specifically limited and may include, but is not limited to, the doorway, ceiling, etc. of each room in the user's home.
  • the audio device may be installed at a position where the user is frequently active, at approximately the height of a human body.
  • the voice device includes a voice model library, where the voice model library stores voiceprints of each member of the family.
  • Each member of a family can speak to the voice device, and the voice device performs feature extraction to store the voiceprints of the different members in the voice model library.
  • when a family member speaks, the voice device can extract features from the member's voice to obtain the member's voiceprint, match that voiceprint against the voiceprints stored in the voice model library to identify the corresponding family member, and then obtain the information associated with that member.
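The matching step might look roughly like the following sketch, in which each member's voiceprint is reduced to a plain feature vector and cosine similarity with a minimum score stands in for the matcher. The vectors, member names, and the 0.85 threshold are all assumptions for illustration; a real voiceprint would be, for example, an embedding derived from spectral features.

```python
import math

# Voiceprint-matching sketch: find the library member whose stored feature
# vector is most similar to the extracted one, requiring a minimum score.

def cosine(a, b):
    """Cosine similarity between two non-zero feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def identify_member(voiceprint, model_library, min_score=0.85):
    """Return the best-matching member name, or None if nothing is close enough."""
    best_name, best_score = None, min_score
    for name, stored in model_library.items():
        score = cosine(voiceprint, stored)
        if score > best_score:
            best_name, best_score = name, score
    return best_name

library = {"parent": [0.9, 0.1, 0.3], "child": [0.1, 0.8, 0.5]}
print(identify_member([0.88, 0.12, 0.31], library))  # parent
```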
  • Step S104: use the model to evaluate the emotion level of the user corresponding to the information, wherein the model is trained on multiple sets of data, each set including: a photo and/or a sound of the user, and a label identifying the emotion level that the photo and/or sound represents.
  • the above model may be trained from images captured by the camera over a predetermined period, the user's voice received by the audio device over a predetermined period, and labels identifying the emotion level represented by each photo and/or sound.
  • Step S106: send a control command according to the emotion level, wherein the control command is used to instruct the device to perform a predetermined operation.
  • when the smart home system is running, the model can be used to evaluate the user's information to obtain the user's emotion level, and a control command is then sent to the smart home system according to the evaluated emotion level.
  • the device control processing method provided by the embodiments of the present application thus controls the smart home system according to the user's acquired emotion, letting the user experience the convenience of modern technology and improving quality of life. This solves the technical problem that home systems in the related art cannot meet users' demands for the degree of intelligence of the home system, and improves the user experience.
  • the camera captures the user's current facial expressions and limb movements and, combined with image recognition technology, the system records them and uses a neural network algorithm to compare, judge, give feedback, and learn; at the same time, a sound sensor receives the user's voice, records changes in it, and likewise uses the neural network algorithm to compare, judge, give feedback, and learn.
  • the relationship can be written as: emotion level = f(facial expression, limb movement, sound, ... external stimuli). The independent variables are the facial expressions, limb movements, sounds, and other external stimuli; the dependent variable is the user's current emotion level. Different degrees of facial expressions, physical movements, sounds, and other external stimuli correspond to different current emotion levels of the user.
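One way to make the function f above concrete is to train a classifier on labelled feature vectors. The sketch below uses a simple nearest-centroid model; the numeric feature encoding (smile intensity, gesture energy, voice pitch) and the two-level label set are illustrative assumptions, and the patent itself describes neural-network-based learning rather than this method.

```python
# Nearest-centroid sketch of "emotion level = f(features)": average the
# training vectors per label, then classify a new vector by the closest
# centroid (squared Euclidean distance).

def train(samples):
    """samples: list of (feature_vector, emotion_level). Returns level -> centroid."""
    sums, counts = {}, {}
    for features, level in samples:
        acc = sums.setdefault(level, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
        counts[level] = counts.get(level, 0) + 1
    return {level: [v / counts[level] for v in acc] for level, acc in sums.items()}

def predict(model, features):
    """Return the emotion level whose centroid is closest to the features."""
    def dist(centroid):
        return sum((a - b) ** 2 for a, b in zip(features, centroid))
    return min(model, key=lambda level: dist(model[level]))

# features: (smile intensity, gesture energy, voice pitch) -- all illustrative
data = [
    ([0.9, 0.7, 0.8], "happy"),
    ([0.8, 0.6, 0.9], "happy"),
    ([0.1, 0.2, 0.3], "depressed"),
    ([0.2, 0.1, 0.2], "depressed"),
]
model = train(data)
print(predict(model, [0.85, 0.65, 0.7]))  # happy
```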
  • the label used to identify the emotion level represented by the photo and/or sound can be obtained in several ways.
  • the device control processing method may further include: sending the user's photo and/or voice to other users, and acquiring the tags those users add to the user's photos and/or sounds.
  • acquiring a tag added by other users for the user's photo and/or sound may include at least one of: sending the user's photo and/or sound together with a plurality of selectable emotion levels to the other users, and receiving as the label the emotion level the other user selects from those levels; or obtaining the other users' evaluations of the user's photos and/or sounds and extracting the emotion level from the evaluation as the label, wherein the evaluation includes at least one of the following: a natural-language evaluation and a speech evaluation.
  • the smart home system sends the user's photo or voice to the user's relatives and friends (that is, the "other users" above); the relatives and friends compare the received photo or sound with the user's emotions over a historical period and then tag the user's photos and/or sounds, for example noting that the user is depressed due to work pressure or troubled by something unpleasant outside the home.
  • the user's friends and relatives can also directly evaluate the received photo and/or voice of the user, and the smart home system extracts the emotion level from that evaluation as the label.
  • the evaluation may include, but is not limited to, evaluative text sent by the user's friends and relatives (i.e., the natural-language evaluation above) and evaluative speech (i.e., the voice evaluation above).
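Extracting an emotion level from such a free-text evaluation might be sketched with a keyword lookup, as below. The keyword table and level names are invented for illustration; a real system would use natural-language processing rather than substring matching.

```python
# Keyword-lookup sketch for turning a relative's free-text evaluation into
# an emotion-level tag. Both the levels and the keywords are illustrative.

EMOTION_KEYWORDS = {
    "depressed": ["depressed", "pressure", "stressed", "down"],
    "irritated": ["irritating", "angry", "annoyed"],
    "happy": ["happy", "cheerful", "glad"],
}

def extract_level(evaluation):
    """Return the first emotion level whose keyword appears in the text, else None."""
    text = evaluation.lower()
    for level, keywords in EMOTION_KEYWORDS.items():
        if any(word in text for word in keywords):
            return level
    return None

print(extract_level("He seems quite stressed by work lately"))  # depressed
```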
  • the device control processing method may further include: after obtaining the photo and/or sound of the user, asking the user a question based on the photo and/or sound, and extracting the emotion level corresponding to the photo and/or sound from the user's answer to that question.
  • the smart home system can obtain the user's emotional level through a conversational manner.
  • after receiving the user's photo and/or voice, the smart home system asks the user a question, for example: "How is your mood today?"; the user might reply: "The work pressure is too great, and it is quite irritating"; the smart home system then extracts the user's corresponding emotion level from the answer.
  • in order to better understand and serve the user, the smart home system can also self-correct. For example, if the emotion level obtained from the user's friends and relatives differs from the emotion level of "happy" obtained through dialogue with the user, the smart home system combines the emotion level obtained from others with the emotion level obtained from the user through dialogue, judges the deviation, and makes corrections, continuously learning and improving.
  • sending the control command according to the emotion level may include: sending the control command when the emotion level matches a predetermined level, wherein the control command is used to control the device to perform at least one of the following operations: playing music corresponding to the emotion level, or playing a video corresponding to the emotion level.
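The dispatch step above can be sketched as follows. The command format and the level-to-playlist mapping are assumptions for illustration; the patent only specifies that a matching level triggers music or video playback.

```python
# Dispatch sketch: when the evaluated emotion level matches a predetermined
# level, return a control command telling the device what to play; otherwise
# return None (no action). RESPONSES and the command dict shape are invented.

RESPONSES = {
    "depressed": {"action": "play_music", "target": "soothing-playlist"},
    "irritated": {"action": "play_video", "target": "relaxing-scenery"},
}

def control_command(emotion_level, predetermined_levels=("depressed", "irritated")):
    """Return a control command if the level matches a predetermined one, else None."""
    if emotion_level in predetermined_levels:
        return RESPONSES[emotion_level]
    return None

print(control_command("depressed"))  # {'action': 'play_music', 'target': 'soothing-playlist'}
print(control_command("happy"))     # None
```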
  • the user turns on the smart home system, which observes and records the user's daily life, communicates with the user, and compares, judges, gives feedback on, and learns the user's emotions; the modes of communication may include, but are not limited to: voice dialogue with the system (such as inner monologues), transcripts, facial expressions, and body movements.
  • the smart home system then adjusts the user's emotions according to its own decisions through external means (for example, playing music or videos).
  • the above smart home system can also find the cause of a user's negative emotion by querying, and can then take targeted measures to relieve that emotion. For example, when a historical emotion similar to the user's current emotion is stored in the smart home system, the historical solution corresponding to that emotion can be looked up, and the user's emotion can be relieved by consulting or directly applying that solution. When no historical emotion similar to the current emotion is found, the smart home system holds no stored reference solution for the user's current emotion.
  • in that case, the smart home system can search the network for emotions whose similarity to the user's current emotion reaches a certain threshold, along with the solutions used for them, and can then relieve the user's current emotion by referring to the solutions found online.
  • FIG. 2 is a flowchart of an adjustment mechanism of a smart home system according to an embodiment of the present application. As shown in FIG. 2, the facial expressions, body movements, sounds, etc. that the user may present can be stored in the smart home system during use or before the system leaves the factory.
  • when the user presents those expressions, movements, or sounds, the above information can be recorded and compared with the facial expressions, body movements, and sounds previously stored in the smart home system; through judgment, processing, and learning, the system arrives at a solution for relieving the user's emotion, for example playing corresponding music or a video.
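The fallback logic above (reuse a stored solution when a sufficiently similar historical emotion exists, otherwise fall back to a network search) can be sketched as below. The set-overlap similarity measure, the descriptor sets, and the 0.8 threshold are all assumptions for illustration.

```python
# Similarity-search sketch: compare the current emotion (a set of observed
# descriptors) against stored historical emotions; reuse the recorded
# solution only when similarity reaches the threshold.

def similarity(a, b):
    """Jaccard-style overlap between two sets of emotion descriptors."""
    return len(a & b) / len(a | b) if a | b else 0.0

def find_solution(current, history, threshold=0.8):
    """history: list of (descriptor_set, solution). Returns a solution or None."""
    best = max(history, key=lambda item: similarity(current, item[0]), default=None)
    if best and similarity(current, best[0]) >= threshold:
        return best[1]
    return None  # nothing similar enough stored; fall back to a network search

history = [
    ({"low", "tired", "quiet"}, "play soft piano music"),
    ({"angry", "loud"}, "play calming nature video"),
]
print(find_solution({"low", "tired", "quiet"}, history))  # play soft piano music
print(find_solution({"excited"}, history))                # None
```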
  • FIG. 3 is a schematic diagram of a device control apparatus according to an embodiment of the present application. As shown in FIG. 3, the device control apparatus includes: a first acquiring unit 31, an evaluation unit 33, and a first sending unit 35. The device control apparatus is described in detail below.
  • the first obtaining unit 31 is configured to acquire information of the user, where the information includes at least one of the following: a photo of the user captured by the imaging device, and a voice of the user received by the audio device.
  • the evaluation unit 33 is connected to the first acquiring unit 31 and is configured to evaluate, using the model, the emotion level of the user corresponding to the information, wherein the model is trained on multiple sets of data, each set including: the user's photo and/or sound, and a label identifying the emotion level that the photo and/or sound represents.
  • the first sending unit 35 is connected to the evaluation unit 33 for transmitting a control command according to an emotion level, wherein the control command is used to instruct the device to perform a predetermined operation.
  • the first acquiring unit 31 is configured to acquire information of the user, where the information includes at least one of the following: a photo of the user captured by the camera device, and the voice of the user received by the audio device;
  • the evaluation unit 33 is configured to use the model to evaluate the user's emotion level corresponding to the information, wherein the model is trained on multiple sets of data, each set including: the user's photo and/or sound, and a label identifying the emotion level represented by the photo and/or sound;
  • the first transmitting unit 35 is configured to send a control command according to the emotion level, wherein the control command is used to instruct the device to perform a predetermined operation.
  • the device control apparatus provided by the embodiments of the present application thus controls the smart home system according to the user's acquired emotion, letting the user experience the convenience of modern technology and improving quality of life. This solves the technical problem that home systems in the related art cannot meet users' demands for the degree of intelligence of the home system, and improves the user experience.
  • the device control processing apparatus further includes: a second sending unit, configured to send the photo and/or sound of the user to other users before the model evaluates the emotion level of the user corresponding to the information; and a second acquiring unit, configured to acquire the tags added by the other users for the photos and/or sounds of the user.
  • the second acquiring unit includes at least one of the following: a first sending module, configured to send the photo and/or sound of the user together with a plurality of selectable emotion levels to the other users, and to receive as the label the emotion level the other user selects from those levels; and an extraction module, configured to obtain the other users' evaluations of the user's photo and/or sound and to extract the emotion level from the evaluation as the label, wherein the evaluation includes at least one of the following: a natural-language evaluation and a speech evaluation.
  • the device control processing apparatus further includes: a questioning unit, configured to ask the user a question based on the photo and/or sound after the photo and/or sound is acquired and before the model evaluates the emotion level of the user corresponding to the information; and an extracting unit, configured to extract the emotion level corresponding to the user's photo and/or sound from the user's answer to that question.
  • the first sending unit includes: a second sending module, configured to send the control command when the emotion level matches a predetermined level, where the control command is used to control the device to perform at least one of the following operations: playing music corresponding to the emotion level, or playing a video corresponding to the emotion level.
  • a storage medium, including a stored program, wherein, when run, the program executes the device control processing method of any of the above.
  • a processor, configured to run a program, wherein, when executed, the program executes the device control processing method of any one of the above.
  • the disclosed technical contents may be implemented in other manners.
  • the device embodiments described above are merely illustrative. The division into units may be a division by logical function; in actual implementation there may be other ways of dividing them: for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, unit or module, and may be electrical or otherwise.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • such a computer-readable storage medium includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the methods described in the various embodiments of the present application.
  • the foregoing storage medium includes: a USB flash drive, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, an optical disk, and the like.


Abstract

The present application discloses a device control processing method and apparatus. The method comprises: acquiring information of a user, the information comprising at least one of a photo of the user captured by an imaging device and a voice of the user received by an audio device; using a model to evaluate the emotion level of the user corresponding to the information, the model being trained on multiple sets of data, each set comprising a photo and/or voice of the user and a label identifying the emotion level represented by that photo and/or voice; and sending a control command according to the emotion level, the control command instructing a device to perform a predetermined operation. The present application addresses the technical problem that home systems in the related art cannot meet users' requirements for the degree of intelligence of such systems.

Description

Device Control Processing Method and Apparatus

Related Application

This application claims priority to Chinese Patent Application No. 201711062745.0, filed on October 31, 2017 and entitled "Device Control Processing Method and Apparatus", the entire contents of which are incorporated herein by reference.
Technical Field

The present application relates to the field of smart homes, and in particular to a device control processing method and apparatus.
Background

With the development of science and technology, and in particular the rapid development of artificial intelligence, AI is quickly being combined with various fields; one way people experience this in daily life is through the increasing intelligence of household devices. For example, a user can control a television with gestures, or turn it on or off by voice. However, the degree of intelligence of household devices currently on the market does not, to a certain extent, meet users' needs.

In view of the above problem in the related art, namely that home systems cannot meet users' requirements for the degree of intelligence of such systems, no effective solution has yet been proposed.
Summary

Embodiments of the present application provide a device control processing method and apparatus, so as to at least solve the technical problem that home systems in the related art cannot meet users' requirements for the degree of intelligence of such systems.
According to one aspect of the embodiments of the present application, a device control processing method is provided, including: acquiring information of a user, where the information includes at least one of the following: a photo of the user captured by an imaging device, and a voice of the user received by an audio device; using a model to evaluate the emotion level of the user corresponding to the information, where the model is trained on multiple sets of data, each set including a photo and/or voice of the user and a label identifying the emotion level represented by that photo and/or voice; and sending a control command according to the emotion level, where the control command instructs a device to perform a predetermined operation.
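The claimed flow of acquiring user information, evaluating an emotion level with a trained model, and sending a control command can be sketched as follows. This is only an illustrative sketch; the function names, the command format, and the callable-model interface are assumptions, not part of the claims:

```python
# Illustrative sketch of the claimed control flow. The model is assumed to be
# any callable that maps a list of (kind, data) features to an emotion level.

def evaluate_emotion_level(model, photo=None, voice=None):
    """Collect the available user information and ask the model for a level."""
    features = []
    if photo is not None:
        features.append(("photo", photo))
    if voice is not None:
        features.append(("voice", voice))
    return model(features)

def control_device(model, photo=None, voice=None, send_command=print):
    """Evaluate the user's emotion level and send a control command for it."""
    level = evaluate_emotion_level(model, photo, voice)
    send_command({"command": "adjust", "emotion_level": level})
    return level
```

In this sketch the device-specific behavior lives entirely in `send_command`, mirroring the claim's separation between evaluating the emotion level and instructing the device.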
Further, before using the model to evaluate the emotion level of the user corresponding to the information, the method further includes: sending the photo and/or voice of the user to other users; and acquiring the labels added by the other users to the photo and/or voice of the user.
Further, acquiring the labels added by the other users to the photo and/or voice of the user includes at least one of the following: sending the photo and/or voice of the user, together with multiple selectable emotion levels, to the other users, and receiving the emotion level selected by the other users from the multiple emotion levels as the label; or acquiring the other users' evaluations of the photo and/or voice of the user and extracting an emotion level from the evaluations as the label, where an evaluation includes at least one of a natural-language evaluation and a voice evaluation.
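One possible realization of extracting an emotion level from a natural-language evaluation is simple keyword matching; the level names and vocabulary below are invented for illustration:

```python
# Hypothetical keyword-based label extraction from another user's free-text
# evaluation; the emotion levels and keywords are illustrative only.
EMOTION_KEYWORDS = {
    "happy": ["happy", "cheerful", "delighted"],
    "depressed": ["depressed", "gloomy", "down"],
    "annoyed": ["annoyed", "irritated", "stressed"],
}

def label_from_evaluation(text):
    """Return the first emotion level whose keywords appear in the text."""
    lowered = text.lower()
    for level, words in EMOTION_KEYWORDS.items():
        if any(word in lowered for word in words):
            return level
    return None  # no recognizable emotion level in the evaluation
```

A production system would more likely use a trained sentiment or emotion classifier, but the keyword table keeps the labeling idea visible.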
Further, before using the model to evaluate the emotion level of the user corresponding to the information, the method further includes: after acquiring the photo and/or voice of the user, asking the user questions based on the photo and/or voice; and extracting the emotion level corresponding to the photo and/or voice of the user from the user's answers to those questions.
Further, sending the control command according to the emotion level includes: when the emotion level matches a predetermined level, sending the control command, where the control command is used to control the device to perform at least one of the following operations: playing music corresponding to the emotion level, and playing a video corresponding to the emotion level.
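The level-matching dispatch can be sketched as a lookup from emotion level to a predetermined action; the mapping below is hypothetical, not taken from the application:

```python
# Hypothetical mapping from matched emotion levels to soothing actions.
SOOTHING_ACTIONS = {
    "annoyed": {"action": "play_music", "playlist": "calm"},
    "depressed": {"action": "play_video", "channel": "comedy"},
}

def maybe_send_command(level, send):
    """Send a control command only when the level matches a predetermined one."""
    action = SOOTHING_ACTIONS.get(level)
    if action is not None:
        send(action)
        return True
    return False
```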
According to another aspect of the embodiments of the present application, a device control processing apparatus is further provided, including: a first acquiring unit, configured to acquire information of a user, where the information includes at least one of the following: a photo of the user captured by an imaging device, and a voice of the user received by an audio device; an evaluation unit, configured to use a model to evaluate the emotion level of the user corresponding to the information, where the model is trained on multiple sets of data, each set including a photo and/or voice of the user and a label identifying the emotion level represented by that photo and/or voice; and a first sending unit, configured to send a control command according to the emotion level, where the control command instructs a device to perform a predetermined operation.
Further, the apparatus further includes: a second sending unit, configured to send the photo and/or voice of the user to other users before the model is used to evaluate the emotion level of the user corresponding to the information; and a second acquiring unit, configured to acquire the labels added by the other users to the photo and/or voice of the user.
Further, the second acquiring unit includes at least one of the following: a first sending module, configured to send the photo and/or voice of the user, together with multiple selectable emotion levels, to the other users, and to receive the emotion level selected by the other users from the multiple emotion levels as the label; and an extracting module, configured to acquire the other users' evaluations of the photo and/or voice of the user and to extract an emotion level from the evaluations as the label, where an evaluation includes at least one of a natural-language evaluation and a voice evaluation.
Further, the apparatus further includes: a questioning unit, configured to ask the user questions based on the photo and/or voice of the user after the photo and/or voice is acquired and before the model is used to evaluate the emotion level of the user corresponding to the information; and an extracting unit, configured to extract the emotion level corresponding to the photo and/or voice of the user from the user's answers to those questions.
Further, the first sending unit includes: a second sending module, configured to send the control command when the emotion level matches a predetermined level, where the control command is used to control the device to perform at least one of the following operations: playing music corresponding to the emotion level, and playing a video corresponding to the emotion level.
According to yet another aspect of the embodiments of the present application, a storage medium is further provided, where the storage medium includes a stored program, and the program, when run, performs the device control processing method of any one of the above.
According to yet another aspect of the embodiments of the present application, a processor is further provided, where the processor is configured to run a program, and the program, when run, performs the device control processing method of any one of the above.
In the embodiments of the present application, information of a user may be acquired, where the information includes at least one of the following: a photo of the user captured by an imaging device, and a voice of the user received by an audio device; a model is used to evaluate the emotion level of the user corresponding to the information, where the model is trained on multiple sets of data, each set including a photo and/or voice of the user and a label identifying the emotion level represented by that photo and/or voice; and a control command is sent according to the emotion level to instruct a device to perform a predetermined operation. The device control processing method provided by the embodiments of the present application achieves the purpose of controlling the smart home system according to the acquired emotion of the user, achieves the technical effect of letting users experience the benefits of modern technology and improving their quality of life, thereby solving the technical problem that home systems in the related art cannot meet users' requirements for the degree of intelligence of such systems, and improves the user experience.
Brief Description of the Drawings

The drawings described herein are provided for a further understanding of the present application and constitute a part of this application; the illustrative embodiments of the present application and their descriptions are used to explain the present application and do not constitute an improper limitation of it. In the drawings:
Fig. 1 is a flowchart of a device control processing method according to an embodiment of the present application;

Fig. 2 is a flowchart of an adjustment mechanism of a smart home system according to an embodiment of the present application; and

Fig. 3 is a schematic diagram of a device control apparatus according to an embodiment of the present application.
Detailed Description

To enable those skilled in the art to better understand the solutions of the present application, the technical solutions in the embodiments of the present application are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only some, rather than all, of the embodiments of the present application. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present application without creative effort shall fall within the scope of protection of the present application.
It should be noted that the terms "first", "second", and the like in the specification, claims, and drawings of the present application are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence. It should be understood that data so used may be interchanged where appropriate, so that the embodiments of the present application described herein can be implemented in an order other than that illustrated or described here. In addition, the terms "comprising" and "having", and any variations thereof, are intended to cover a non-exclusive inclusion; for example, a process, method, system, product, or device that comprises a series of steps or units is not necessarily limited to the steps or units explicitly listed, but may include other steps or units not explicitly listed or inherent to such a process, method, product, or device.
To facilitate understanding of the present application, some terms used in the embodiments of the present application are explained below:
Pixel: the smallest unit that can be displayed on a computer screen, used as the unit of an image; it refers to the array of horizontal and vertical picture elements that can be displayed. The more pixels on the screen, the higher the resolution of the image, and the more detailed and realistic the picture.

Pixel point: refers to the numerical value of a pixel.

Binarization: most pictures taken by a camera are color images, which contain a huge amount of information. The content of a picture can be simply divided into foreground and background. The color image is processed first so that the picture contains only foreground and background information; the foreground can simply be defined as black and the background as white, which yields the binarized image.

Neural network algorithm: refers to a process of reasoning according to logical rules. Information is first turned into concepts and represented by symbols, and then logical reasoning is performed in a serial mode according to symbolic operations; this process can be written as serial instructions for a computer to execute.

Voiceprint: the spectrum of sound waves carrying speech information, as displayed by electro-acoustic instruments.

Voiceprint recognition: a kind of biometric technology, also known as speaker recognition. There are two types, speaker identification and speaker verification. Different tasks and applications use different voiceprint recognition techniques; for example, identification techniques may be needed to narrow the scope of a criminal investigation, while verification techniques are needed for bank transactions.
The following embodiments can be used in various electrical appliances, and the types of such appliances are not specifically limited; they include, but are not limited to, washing machines, air conditioners, refrigerators, and the like. The above appliances together constitute the smart home system in the embodiments of the present application. The embodiments of the present application are described in detail below.
According to an embodiment of the present application, a method embodiment of a device control processing method is provided. It should be noted that the steps shown in the flowchart of the accompanying drawings may be executed in a computer system such as one running a set of computer-executable instructions, and that, although a logical order is shown in the flowchart, in some cases the steps shown or described may be performed in an order different from that given here.
Fig. 1 is a flowchart of a device control processing method according to an embodiment of the present application. As shown in Fig. 1, the device control processing method includes the following steps:

Step S102: acquire information of a user, where the information includes at least one of the following: a photo of the user captured by an imaging device, and a voice of the user received by an audio device.
In step S102, in one aspect, one or more cameras may be installed in the user's home for taking photos of the user. The placement of the cameras is not specifically limited in the embodiments of the present application; they may be placed, for example, at the doorway or on the ceiling of each room, so that cameras at different positions can separately capture photos of the user. When the one or more cameras are used to photograph the user, the user may be photographed at every predetermined time interval, and the user's emotion in the image is then analyzed. The type of captured image is not specifically limited in the embodiments of the present application and may include, but is not limited to, black-and-white (grayscale) images and color (RGB) images. When analyzing an image, the information in the image can be analyzed by means of binarized image processing. Specifically, multiple pixels in the image can be compared with the corresponding pixel positions in a historical image to determine which pixels differ; the differing pixels are then separated out, and the user's information can thus be extracted from the image captured by the imaging device.
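The binarize-then-compare idea described above can be sketched with plain lists standing in for image buffers; the threshold value is illustrative:

```python
# Sketch of binarization and pixel-wise comparison with a historical image.

def binarize(gray, threshold=128):
    """Map a grayscale image (rows of 0-255 values) to a 0/1 image."""
    return [[1 if px >= threshold else 0 for px in row] for row in gray]

def diff_pixels(current, historical):
    """Return (row, col) coordinates where two binary images differ."""
    return [
        (r, c)
        for r, row in enumerate(current)
        for c, px in enumerate(row)
        if px != historical[r][c]
    ]
```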
In another aspect, one or more audio devices (that is, the sound sensors referred to below) may also be installed in the user's home to receive the user's voice. The installation position of the audio devices is not specifically limited in the embodiments of the present application; they may be installed, for example, at the doorway or on the ceiling of each room, although, to facilitate receiving the user's voice, an audio device may be installed where the user is frequently active, at a height comparable to that of a person. The audio device contains a voice model library, which stores the voiceprints of the members of a household. Each member of the household can speak to the audio device, which performs feature extraction and stores the voiceprints of the different members in the voice model library. When a member of the household speaks, the audio device can perform feature extraction on that member's speech to obtain the member's voiceprint, match it against the voiceprints stored in the voice model library, identify the household member corresponding to the voiceprint, and thus obtain the information corresponding to that member.
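The enrollment-and-matching flow of the voice model library can be sketched as nearest-neighbor matching over voiceprint feature vectors. Real voiceprint systems extract such features with dedicated speaker-recognition models; here the feature vectors and the distance threshold are assumptions:

```python
import math

def euclidean(a, b):
    """Distance between two voiceprint feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def identify_speaker(voiceprint, model_library, max_distance=1.0):
    """Match a voiceprint against enrolled household members, or return None."""
    best_name, best_dist = None, float("inf")
    for name, enrolled in model_library.items():
        d = euclidean(voiceprint, enrolled)
        if d < best_dist:
            best_name, best_dist = name, d
    return best_name if best_dist <= max_distance else None
```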
Step S104: use a model to evaluate the emotion level of the user corresponding to the information, where the model is trained on multiple sets of data, each set including a photo and/or voice of the user and a label identifying the emotion level represented by that photo and/or voice.

The above model may be obtained by training on images captured by the cameras during historical predetermined time periods, the user's voice received by the audio devices during historical predetermined time periods, and the labels identifying the emotion levels represented by those photos and/or voices.
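As a minimal stand-in for training on labeled (photo/voice features, emotion level) pairs, a nearest-centroid classifier over numeric feature vectors shows the shape of the training data; extracting such vectors from raw photos or audio is outside this sketch:

```python
# Nearest-centroid "model" trained on (feature_vector, emotion_label) pairs.

def train(samples):
    """samples: list of (feature_vector, emotion_label) pairs."""
    sums, counts = {}, {}
    for vec, label in samples:
        acc = sums.setdefault(label, [0.0] * len(vec))
        for i, value in enumerate(vec):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}

def predict(centroids, vec):
    """Return the label of the closest centroid."""
    def sq_dist(label):
        return sum((a - b) ** 2 for a, b in zip(vec, centroids[label]))
    return min(centroids, key=sq_dist)
```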
Step S106: send a control command according to the emotion level, where the control command instructs a device to perform a predetermined operation.

Through the above steps, while the smart home system is running, the model can be used to evaluate the user's information to obtain the user's corresponding emotion level, and a control command can then be sent to the smart home system according to the evaluated emotion level. The device control processing method provided by the embodiments of the present application achieves the purpose of controlling the smart home system according to the acquired emotion of the user, achieves the technical effect of letting users experience the benefits of modern technology and improving their quality of life, thereby solving the technical problem that home systems in the related art cannot meet users' requirements for the degree of intelligence of such systems, and improves the user experience.
In an optional embodiment of the present application, at the artificial intelligence terminal of the smart home system, a camera captures the user's current facial expressions and body movements and, combined with image recognition technology, records them; a neural network algorithm performs comparison, judgment, feedback, and learning. At the same time, a sound sensor receives the user's voice and records changes in it, again with comparison, judgment, feedback, and learning performed by the neural network algorithm. By observing and recording the facial expressions, body movements, voice changes, judgments, feedback, and learning exhibited in the user's daily life, an emotion state equation for the user in various situations is obtained and used as the criterion for judgment. The emotion state equation is: f(facial expression, body movement, voice, ..., external stimuli) = user emotion level, where the independent variables are the facial expression, body movement, voice, external stimuli, and so on, and the dependent variable is the user's current emotion level; different degrees of facial expression, body movement, voice, external stimuli, and so on correspond to different current emotion levels of the user.
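Read as code, the emotion state equation is a function from observed variables to a discrete level. The weights and cutoffs below are invented purely to make that mapping concrete:

```python
# Illustrative emotion state equation: f(face, body, voice, stimuli) -> level.
# Weights and thresholds are hypothetical, not taken from the application.

def emotion_level(face_score, body_score, voice_score, stimulus_score=0.0):
    total = (0.4 * face_score + 0.2 * body_score
             + 0.3 * voice_score + 0.1 * stimulus_score)
    if total >= 0.7:
        return "very negative"
    if total >= 0.4:
        return "negative"
    return "normal"
```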
It should be noted that the label identifying the emotion level represented by a photo and/or voice can be obtained in several ways.
In one respect, in an optional embodiment of the present application, before the model is used to evaluate the emotion level of the user corresponding to the information, the device control processing method may further include: sending the photo and/or voice of the user to other users; and acquiring the labels added by the other users to the photo and/or voice of the user.
In the above embodiment, acquiring the labels added by the other users to the user's photo and/or voice may include at least one of the following: sending the user's photo and/or voice, together with multiple selectable emotion levels, to the other users and receiving the emotion level selected by the other users from the multiple emotion levels as the label; or acquiring the other users' evaluations of the user's photo and/or voice and extracting an emotion level from the evaluations as the label, where an evaluation includes at least one of a natural-language evaluation and a voice evaluation. For example, when necessary, the smart home system sends the user's photo or voice to the user's family and friends (that is, the other users referred to here), who compare the received photo or voice with the user's emotions in past periods and then add labels to the photo and/or voice, for example that the user is depressed because of heavy work pressure, or troubled because of something unpleasant that happened outside. Of course, the user's family and friends can also directly evaluate the received photo and/or voice, and the smart home system extracts an emotion level from their evaluations as the label, where the evaluations may include, but are not limited to, a piece of evaluative text sent by the family or friends (that is, the natural-language evaluation referred to here) or an evaluative voice message (that is, the voice evaluation referred to here).
In another respect, before the model is used to evaluate the emotion level of the user corresponding to the information, the device control processing method may further include: after acquiring the user's photo and/or voice, asking the user questions based on the photo and/or voice; and extracting the emotion level corresponding to the user's photo and/or voice from the user's answers. For example, the smart home system may obtain the user's emotion level in a conversational way: after receiving the user's photo and/or voice, it asks the user a question. Suppose the smart home system asks, "How are you feeling today?" and the user answers, "Work pressure is too high; I'm rather irritable." The smart home system then extracts the user's corresponding emotion level from the answer.
In an optional embodiment of the present application, to better understand and serve the user, the smart home system can also perform self-correction. For example, if the emotion level obtained from the user's family and friends is "troubled" while the emotion level obtained conversationally is "happy", the smart home system combines the emotion level obtained from other people with the emotion level obtained from the user in conversation, judges the deviation, corrects it, and continually learns and improves.
In an optional embodiment of the present application, sending the control command according to the emotion level may include: when the emotion level matches a predetermined level, sending the control command, where the control command is used to control the device to perform at least one of the following operations: playing music corresponding to the emotion level, and playing a video corresponding to the emotion level.
A complete embodiment of the present application is described in detail below.
First, the user activates the smart home system. The system observes and records the user's daily life, interacts with the user, and compares, judges, gives feedback on, and learns the user's emotions. The modes of interaction may include, but are not limited to, voice dialogue with the system, inner monologues, written records, facial expressions, and body movements. One precondition is that the user is open with the smart home system and provides it with personal information that is as comprehensive as possible. When the user's emotional state becomes abnormal, the smart home system adjusts the user's mood through external means (for example, playing music or videos) according to its own decisions. In addition, the smart home system can determine, by asking questions, the cause of the user's negative emotion, and can then take targeted measures to relieve that emotion. For example, when a historical emotion similar to the user's current emotion is stored in the smart home system, the historical solution corresponding to that historical emotion can be looked up in the system, and that solution can then be consulted or applied directly to relieve the user's emotion. When no historical emotion similar to the user's current emotion is found in the smart home system, the system naturally stores no reference solution for resolving the user's current emotion; in this case, the smart home system can search the network for a similar emotion whose similarity to the user's current emotion reaches a certain threshold, search the network for a solution to that similar emotion, and then consult the solution found on the network when taking measures to relieve the user's current emotion. FIG. 2 is a flowchart of the adjustment mechanism of the smart home system according to an embodiment of the present application. As shown in FIG. 2, facial expressions, body movements, sounds, and the like that the user may present can be stored in the smart home system during use or before the system leaves the factory. In addition, while the smart home system is in use, when it recognizes user feedback such as emotional changes or exchanged information, it can record that information and compare it with the facial expressions, body movements, and sounds previously stored in the system; through this comparison, judgment, and learning, it arrives at a solution for relieving the user's emotion, for example by playing corresponding music or a video.
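The lookup order described above (reuse a stored historical solution if a sufficiently similar emotion exists locally, otherwise fall back to a network search) can be sketched as follows. The similarity metric, threshold value, and record shape are illustrative assumptions; the application does not specify them.

```python
# Hypothetical sketch: check locally stored historical emotions first, and
# fall back to a network search when no stored emotion is similar enough.

SIMILARITY_THRESHOLD = 0.8  # assumed "certain threshold" from the description

def similarity(a, b):
    # stand-in metric: fraction of feature keys on which the two emotion
    # records agree; a real system would compare learned features instead
    keys = set(a) | set(b)
    return sum(a.get(k) == b.get(k) for k in keys) / len(keys) if keys else 0.0

def find_solution(current_emotion, history, network_search):
    """Return a stored historical solution when a similar enough emotion
    exists in `history`; otherwise consult `network_search`."""
    best = max(history,
               key=lambda rec: similarity(current_emotion, rec["emotion"]),
               default=None)
    if best and similarity(current_emotion, best["emotion"]) >= SIMILARITY_THRESHOLD:
        return best["solution"]              # reuse the stored solution
    return network_search(current_emotion)   # no similar history: go online
```

Passing `network_search` as a callable keeps the fallback pluggable; an empty history always routes to the network path.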
According to another aspect of the embodiments of the present application, a device control processing apparatus is also provided. FIG. 3 is a schematic diagram of the device control apparatus according to an embodiment of the present application. As shown in FIG. 3, the device control apparatus includes: a first acquiring unit 31, an evaluation unit 33, and a first sending unit 35. The device control apparatus is described in detail below.
The first acquiring unit 31 is configured to acquire information of a user, where the information includes at least one of the following: a photo of the user captured by an imaging device, and the voice of the user received by an audio device.
The evaluation unit 33 is connected to the first acquiring unit 31 and is configured to use a model to evaluate the emotion level of the user corresponding to the information, where the model is trained on multiple groups of data, and each group of data includes: a photo and/or voice of a user, and a label identifying the emotion level represented by that photo and/or voice.
The first sending unit 35 is connected to the evaluation unit 33 and is configured to send a control command according to the emotion level, where the control command is used to instruct a device to perform a predetermined operation.
In the above embodiment, while the smart home system is running, the first acquiring unit 31 acquires information of the user, where the information includes at least one of the following: a photo of the user captured by an imaging device and the voice of the user received by an audio device; the evaluation unit 33 uses a model to evaluate the emotion level of the user corresponding to the information, where the model is trained on multiple groups of data, each of which includes a photo and/or voice of a user and a label identifying the emotion level represented by that photo and/or voice; and the first sending unit 35 sends a control command according to the emotion level, where the control command is used to instruct a device to perform a predetermined operation. The device control apparatus provided by this embodiment of the present application achieves the purpose of controlling the smart home system according to the user's detected emotion, achieves the technical effect of letting the user experience the pleasure brought by modern technology and improve quality of life, and thereby solves the technical problem in the related art that home systems cannot meet users' demand for intelligent home systems, improving the user experience.
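As an illustrative sketch only, the three units of FIG. 3 might be wired into a pipeline as below. All class and method names, the stub model, and the trigger level are hypothetical assumptions, not details from the application.

```python
# Hypothetical sketch of the acquire -> evaluate -> send pipeline in FIG. 3.

class FirstAcquiringUnit:
    """Acquires the user's photo and/or voice (unit 31)."""
    def __init__(self, camera, microphone):
        self.camera, self.microphone = camera, microphone

    def acquire(self):
        return {"photo": self.camera(), "voice": self.microphone()}

class EvaluationUnit:
    """Evaluates the emotion level with a trained model (unit 33)."""
    def __init__(self, model):
        self.model = model  # trained on (photo/voice, emotion-level label) pairs

    def evaluate(self, info):
        return self.model(info)

class FirstSendingUnit:
    """Sends a control command according to the emotion level (unit 35)."""
    def send(self, emotion_level):
        if emotion_level == "sad":  # predetermined level (assumption)
            return {"command": "play_music", "mood": emotion_level}
        return None  # no predetermined level matched, no command sent

def run_pipeline(acquiring, evaluating, sending):
    info = acquiring.acquire()
    level = evaluating.evaluate(info)
    return sending.send(level)
```

Keeping the units as separate objects mirrors the apparatus claims, where each unit is an independently claimed component.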
In an optional embodiment of the present application, the device control processing apparatus further includes: a second sending unit, configured to send the photo and/or voice of the user to other users before the model is used to evaluate the emotion level of the user corresponding to the information; and a second acquiring unit, configured to acquire the labels added by the other users to the photo and/or voice of the user.
In an optional embodiment of the present application, the second acquiring unit includes at least one of the following: a first sending module, configured to send the photo and/or voice of the user, together with multiple selectable emotion levels, to the other users, and to receive, as the label, an emotion level selected by the other users from the multiple emotion levels; and an extraction module, configured to acquire the other users' evaluations of the photo and/or voice of the user and to extract an emotion level from the evaluations as the label, where the evaluations include at least one of the following: a natural-language evaluation and a voice evaluation.
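Extracting an emotion-level label from another user's natural-language evaluation, as the extraction module above describes, could be sketched with simple keyword matching. The keyword lists and level names are illustrative assumptions; the application does not specify an extraction method.

```python
# Hypothetical sketch of extracting an emotion-level label from a
# natural-language evaluation left by another user.

LEVEL_KEYWORDS = {
    "happy": ["happy", "cheerful", "smiling"],
    "sad": ["sad", "down", "crying"],
    "angry": ["angry", "furious", "annoyed"],
}

def extract_label(evaluation_text):
    """Return the first emotion level whose keywords appear in the
    evaluation text, or None when no keyword matches."""
    text = evaluation_text.lower()
    for level, words in LEVEL_KEYWORDS.items():
        if any(w in text for w in words):
            return level
    return None
```

A production system would more likely use a trained text classifier; keyword matching only illustrates the data flow from free-text evaluation to discrete label.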
In an optional embodiment of the present application, the device control processing apparatus further includes: a questioning unit, configured to ask the user questions based on the photo and/or voice of the user, after the photo and/or voice of the user is acquired and before the model is used to evaluate the emotion level of the user corresponding to the information; and an extraction unit, configured to extract the emotion level corresponding to the photo and/or voice of the user based on the user's answers to the questions.
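The question-and-answer labeling step above might look like the following sketch. The question text, answer keywords, and level names are hypothetical assumptions used only to illustrate mapping an answer to an emotion level.

```python
# Hypothetical sketch of the questioning unit: ask the user a question
# about the captured photo/voice and map the answer to an emotion level.

def ask_and_extract(ask):
    """`ask` is a callable that poses a question to the user and returns
    the answer as text (e.g. via a speech-to-text front end)."""
    answer = ask("You look a bit tired in this photo. How are you feeling?")
    answer = answer.lower()
    if "fine" in answer or "good" in answer:
        return "calm"
    if "tired" in answer or "stressed" in answer:
        return "low"
    return "unknown"  # answer did not match any assumed keyword
```

Injecting the `ask` callable keeps the unit testable without real audio hardware.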
In an optional embodiment of the present application, the first sending unit includes: a second sending module, configured to send the control command when the emotion level matches a predetermined level, where the control command is used to control the device to perform at least one of the following operations: playing music corresponding to the emotion level, and playing a video corresponding to the emotion level.
According to another aspect of the embodiments of the present application, a storage medium is also provided. The storage medium includes a stored program, where the program, when executed, performs the device control processing method of any of the above.
According to another aspect of the embodiments of the present application, a processor is also provided. The processor is configured to run a program, where the program, when run, performs the device control processing method of any of the above.
The serial numbers of the above embodiments of the present application are for description only and do not indicate the relative merits of the embodiments.
In the above embodiments of the present application, the description of each embodiment has its own emphasis; for parts not detailed in one embodiment, reference may be made to the related descriptions of other embodiments.
In the several embodiments provided in the present application, it should be understood that the disclosed technical content may be implemented in other ways. The apparatus embodiments described above are merely illustrative. For example, the division into units may be a division by logical function; in actual implementation there may be other ways of dividing them: multiple units or components may be combined or integrated into another system, or some features may be omitted or not executed. Furthermore, the mutual couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through interfaces, units, or modules, and may be electrical or take other forms.
The units described as separate components may or may not be physically separate, and components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, the functional units in the embodiments of the present application may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application, in essence, or the part contributing to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes a number of instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of the present application. The foregoing storage medium includes various media capable of storing program code, such as a USB flash drive, a read-only memory (ROM), a random access memory (RAM), a removable hard disk, a magnetic disk, or an optical disc.
The above is only a preferred implementation of the present application. It should be noted that those of ordinary skill in the art may make several improvements and refinements without departing from the principles of the present application, and such improvements and refinements shall also be regarded as falling within the scope of protection of the present application.

Claims (12)

1. A device control processing method, comprising:
    acquiring information of a user, wherein the information of the user comprises at least one of the following: a photo of the user captured by an imaging device, and a voice of the user received by an audio device;
    evaluating, by using a model, an emotion level of the user corresponding to the information of the user, wherein the model is trained on multiple groups of data, and each group of data in the multiple groups of data comprises: a photo and/or voice of the user, and a label identifying the emotion level represented by the photo and/or voice; and
    sending a control command according to the emotion level, wherein the control command is used to instruct a device to perform a predetermined operation.
2. The method according to claim 1, wherein before the model is used to evaluate the emotion level of the user corresponding to the information of the user, the method further comprises:
    sending the photo and/or voice of the user to other users; and
    acquiring the label added by the other users to the photo and/or voice of the user.
3. The method according to claim 2, wherein acquiring the label added by the other users to the photo and/or voice of the user comprises at least one of the following:
    sending the photo and/or voice of the user, together with multiple selectable emotion levels, to the other users, and receiving, as the label, an emotion level selected by the other users from the multiple emotion levels; and
    acquiring the other users' evaluations of the photo and/or voice of the user, and extracting an emotion level from the evaluations as the label, wherein the evaluations comprise at least one of the following: a natural-language evaluation and a voice evaluation.
4. The method according to claim 1, wherein before the model is used to evaluate the emotion level of the user corresponding to the information of the user, the method further comprises:
    after the photo and/or voice of the user is acquired, asking the user questions based on the photo and/or voice of the user; and
    extracting the emotion level corresponding to the photo and/or voice of the user based on the user's answers to the questions.
5. The method according to claim 1, wherein sending the control command according to the emotion level comprises:
    sending the control command when the emotion level matches a predetermined level, wherein the control command is used to control the device to perform at least one of the following operations: playing music corresponding to the emotion level, and playing a video corresponding to the emotion level.
6. A device control processing apparatus, comprising:
    a first acquiring unit, configured to acquire information of a user, wherein the information of the user comprises at least one of the following: a photo of the user captured by an imaging device, and a voice of the user received by an audio device;
    an evaluation unit, configured to evaluate, by using a model, an emotion level of the user corresponding to the information of the user, wherein the model is trained on multiple groups of data, and each group of data in the multiple groups of data comprises: a photo and/or voice of the user, and a label identifying the emotion level represented by the photo and/or voice; and
    a first sending unit, configured to send a control command according to the emotion level, wherein the control command is used to instruct a device to perform a predetermined operation.
7. The apparatus according to claim 6, further comprising:
    a second sending unit, configured to send the photo and/or voice of the user to other users before the model is used to evaluate the emotion level of the user corresponding to the information of the user; and
    a second acquiring unit, configured to acquire the label added by the other users to the photo and/or voice of the user.
8. The apparatus according to claim 7, wherein the second acquiring unit comprises at least one of the following:
    a first sending module, configured to send the photo and/or voice of the user, together with multiple selectable emotion levels, to the other users, and to receive, as the label, an emotion level selected by the other users from the multiple emotion levels; and
    an extraction module, configured to acquire the other users' evaluations of the photo and/or voice of the user, and to extract an emotion level from the evaluations as the label, wherein the evaluations comprise at least one of the following: a natural-language evaluation and a voice evaluation.
9. The apparatus according to claim 6, further comprising:
    a questioning unit, configured to ask the user questions based on the photo and/or voice of the user, after the photo and/or voice of the user is acquired and before the model is used to evaluate the emotion level of the user corresponding to the information of the user; and
    an extraction unit, configured to extract the emotion level corresponding to the photo and/or voice of the user based on the user's answers to the questions.
10. The apparatus according to claim 6, wherein the first sending unit comprises:
    a second sending module, configured to send the control command when the emotion level matches a predetermined level, wherein the control command is used to control the device to perform at least one of the following operations: playing music corresponding to the emotion level, and playing a video corresponding to the emotion level.
11. A storage medium, wherein the storage medium comprises a stored program, and the program, when executed, performs the device control processing method according to any one of claims 1 to 5.
12. A processor, wherein the processor is configured to run a program, and the program, when run, performs the device control processing method according to any one of claims 1 to 5.
PCT/CN2018/100489 2017-10-31 2018-08-14 Device control processing method and apparatus WO2019085585A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711062745.0A CN108039988B (en) 2017-10-31 2017-10-31 Equipment control processing method and device
CN201711062745.0 2017-10-31

Publications (1)

Publication Number Publication Date
WO2019085585A1 true WO2019085585A1 (en) 2019-05-09

Family

ID=62093587

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/100489 WO2019085585A1 (en) 2017-10-31 2018-08-14 Device control processing method and apparatus

Country Status (2)

Country Link
CN (1) CN108039988B (en)
WO (1) WO2019085585A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114446325A (en) * 2022-03-11 2022-05-06 平安普惠企业管理有限公司 Information pushing method and device based on emotion recognition, computer equipment and medium
CN115209048A (en) * 2022-05-19 2022-10-18 广东逸动科技有限公司 Image data processing method and device, electronic equipment and storage medium

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108039988B (en) * 2017-10-31 2021-04-30 珠海格力电器股份有限公司 Equipment control processing method and device
CN109118626B (en) * 2018-08-08 2022-09-13 腾讯科技(深圳)有限公司 Lock control method and device, storage medium and electronic device
KR20200035887A (en) * 2018-09-27 2020-04-06 삼성전자주식회사 Method and system for providing an interactive interface
CN109634129B (en) * 2018-11-02 2022-07-01 深圳慧安康科技有限公司 Method, system and device for realizing active care
CN109766776A (en) * 2018-12-18 2019-05-17 深圳壹账通智能科技有限公司 Operation executes method, apparatus, computer equipment and storage medium
CN109948780A (en) * 2019-03-14 2019-06-28 江苏集萃有机光电技术研究所有限公司 Aid decision-making method, device and equipment based on artificial intelligence
CN110096707B (en) * 2019-04-29 2020-09-29 北京三快在线科技有限公司 Method, device and equipment for generating natural language and readable storage medium
CN110197677A (en) * 2019-05-16 2019-09-03 北京小米移动软件有限公司 A kind of control method for playing back, device and playback equipment
CN110262413A (en) * 2019-05-29 2019-09-20 深圳市轱辘汽车维修技术有限公司 Intelligent home furnishing control method, control device, car-mounted terminal and readable storage medium storing program for executing
CN110491425A (en) * 2019-07-29 2019-11-22 恒大智慧科技有限公司 A kind of intelligent music play device
CN110412885A (en) * 2019-08-30 2019-11-05 北京青岳科技有限公司 A kind of household intelligent control system based on computer vision
JP7248615B2 (en) * 2020-03-19 2023-03-29 ヤフー株式会社 Output device, output method and output program
CN112631137A (en) * 2020-04-02 2021-04-09 张瑞华 Intelligent household control method and intelligent control equipment applied to biological feature recognition
CN113589697A (en) * 2020-04-30 2021-11-02 青岛海尔多媒体有限公司 Control method and device for household appliance and intelligent household appliance
CN112180747A (en) * 2020-09-28 2021-01-05 上海连尚网络科技有限公司 Method and equipment for adjusting intelligent household equipment
CN112464018A (en) * 2020-12-10 2021-03-09 山西慧虎健康科技有限公司 Intelligent emotion recognition and adjustment method and system
CN115047824A (en) * 2022-05-30 2022-09-13 青岛海尔科技有限公司 Digital twin multimodal device control method, storage medium, and electronic apparatus

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103024521A (en) * 2012-12-27 2013-04-03 深圳Tcl新技术有限公司 Program screening method, program screening system and television with program screening system
US20140192229A1 (en) * 2013-01-04 2014-07-10 Samsung Electronics Co., Ltd. Apparatus and method for providing user's emotional information in electronic device
CN106919821A (en) * 2015-12-25 2017-07-04 阿里巴巴集团控股有限公司 User authentication method and device
CN107272607A (en) * 2017-05-11 2017-10-20 上海斐讯数据通信技术有限公司 A kind of intelligent home control system and method
CN108039988A (en) * 2017-10-31 2018-05-15 珠海格力电器股份有限公司 Equipment control processing method and device



Also Published As

Publication number Publication date
CN108039988B (en) 2021-04-30
CN108039988A (en) 2018-05-15

Similar Documents

Publication Publication Date Title
WO2019085585A1 (en) Device control processing method and apparatus
KR101803081B1 (en) Robot for store management
Chen et al. Hierarchical cross-modal talking face generation with dynamic pixel-wise loss
CN110291489B (en) Computationally efficient human identification intelligent assistant computer
CN106295313B (en) Object identity management method and device and electronic equipment
TWI661363B (en) Smart robot and human-computer interaction method
WO2021077382A1 (en) Method and apparatus for determining learning state, and intelligent robot
KR20100001928A (en) Service apparatus and method based on emotional recognition
TW201220216A (en) System and method for detecting human emotion and appeasing human emotion
US9661208B1 (en) Enhancing video conferences
CN109986553B (en) Active interaction robot, system, method and storage device
US11852357B2 (en) Method for controlling air conditioner, air conditioner
CN109241336A (en) Music recommendation method and device
WO2021200503A1 (en) Learning system and data collection device
KR20200012355A (en) Online lecture monitoring method using constrained local model and Gabor wavelets-based face verification process
JP2010224715A (en) Image display system, digital photo-frame, information processing system, program, and information storage medium
Błażek et al. An unorthodox view on the problem of tracking facial expressions
CN115867948A (en) Method for identifying hygiene condition of object and related electronic equipment
TW202303444A (en) Image processing based emotion recognition system and method
CN115988164A (en) Conference room multimedia control method, system and computer equipment
CN113591550B (en) Method, device, equipment and medium for constructing personal preference automatic detection model
JP2005199373A (en) Communication device and communication method
JP2021033359A (en) Emotion estimation device, emotion estimation method, program, information presentation device, information presentation method and emotion estimation system
Zhang et al. Quantification of advanced dementia patients’ engagement in therapeutic sessions: An automatic video based approach using computer vision and machine learning
Miao et al. Study of detecting behavioral signatures within DeepFake videos

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18872426

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18872426

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 02.10.2020)
