CN108885699A - Character identifying method, device, storage medium and electronic equipment - Google Patents

Character identifying method, device, storage medium and electronic equipment Download PDF

Info

Publication number
CN108885699A
CN108885699A CN201880001125.2A CN201880001125A CN108885699A CN 108885699 A CN108885699 A CN 108885699A CN 201880001125 A CN201880001125 A CN 201880001125A CN 108885699 A CN108885699 A CN 108885699A
Authority
CN
China
Prior art keywords
image
character
text
identified
correction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201880001125.2A
Other languages
Chinese (zh)
Other versions
CN108885699B (en
Inventor
梁昊
南冰
南一冰
廉士国
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Cloudminds Shanghai Robotics Co Ltd
Original Assignee
Cloudminds Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cloudminds Inc filed Critical Cloudminds Inc
Publication of CN108885699A publication Critical patent/CN108885699A/en
Application granted granted Critical
Publication of CN108885699B publication Critical patent/CN108885699B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/62Text, e.g. of license plates, overlay texts or captions on TV images
    • G06V20/63Scene text, e.g. street names
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Character Input (AREA)

Abstract

This disclosure relates to a kind of character identifying method, device, storage medium and electronic equipment, the method includes:It is possible, firstly, to determine the corresponding image category of target image including character to be identified;Then, processing is corrected to the target image by the corresponding correction process mode of described image classification;Then, at least one line of text image is extracted from the target image after correction process;Finally, identifying the character to be identified at least one described line of text image by preset characters identification model.Since different image categories corresponds to different correction process modes, in this way, the image of different images classification can be corrected processing according to corresponding correction process mode, and character recognition is carried out to the image after correction process, the disclosure, which can satisfy, carries out character recognition to text image and scene image, so as to avoid the problem that the versatility of character recognition algorithm in the prior art is poor.

Description

Character identifying method, device, storage medium and electronic equipment
Technical field
This disclosure relates to field of image processing, and in particular, to a kind of character identifying method, device, storage medium and electricity Sub- equipment.
Background technique
With computer technology and multimedia fast development, more and more information are propagated with image format, and are schemed Information as in can be descriptive text, currently, text image can be divided into file and picture and scene image, wherein The character quantity that file and picture generally includes is more, the character regularity of distribution, and image background is single;It is different from file and picture, scene The character quantity that image generally includes is less, and character types are abundant, and arbitrarily, image background is complicated for character distribution.
In view of file and picture and scene image have above-mentioned different characteristics of image, and current character recognition algorithm It is for specific text image, so that file and picture and scene image need to carry out respectively by different character recognition algorithms Character recognition, so that the versatility for causing character recognition algorithm is poor.
Summary of the invention
To solve the above-mentioned problems, the disclosure provides a kind of character identifying method, device, storage medium and electronic equipment.
According to the disclosure in a first aspect, provide a kind of character identifying method, the method includes:
Determine the corresponding image category of target image including character to be identified;Wherein, different image categories is corresponding not Same correction process mode;
Processing is corrected to the target image by described image classification corresponding correction process mode;
At least one line of text image is extracted from the target image after correction process;
The character to be identified at least one described line of text image is identified by preset characters identification model.
According to the second aspect of the disclosure, a kind of character recognition device is provided, described device includes:
Determining module, for determining the corresponding image category of target image including character to be identified;Wherein, different figure As classification corresponds to different correction process modes;
Correction module, for being corrected by the corresponding correction process mode of described image classification to the target image Processing;
Extraction module, for extracting at least one line of text image from the target image after correction process;
Identification module, for described in being identified at least one described line of text image by preset characters identification model to Identify character.
According to the third aspect of the disclosure, a kind of computer readable storage medium is provided, computer program is stored thereon with, The program realizes the step of above-mentioned first aspect the method when being executed by processor.
According to the fourth aspect of the disclosure, a kind of electronic equipment is provided, including:
Memory is stored thereon with computer program;
Processor, for executing the computer program in the memory, to realize side described in above-mentioned first aspect The step of method.
In the above-mentioned technical solutions, it is possible, firstly, to determine the corresponding image category of target image including character to be identified; Then, processing is corrected to the target image by the corresponding correction process mode of described image classification;Then, from correction At least one line of text image is extracted in treated target image;Finally, passing through preset characters identification model identification at least one The character to be identified in a line of text image.Since different image categories corresponds to different correction process modes, In this way, the image of different images classification can be corrected processing according to corresponding correction process mode, and to correction process Image afterwards carries out character recognition, and the disclosure, which can satisfy, carries out character recognition to text image and scene image, to avoid The poor problem of the versatility of character recognition algorithm in the prior art.
Other feature and advantage of the disclosure will the following detailed description will be given in the detailed implementation section.
Detailed description of the invention
Attached drawing is and to constitute part of specification for providing further understanding of the disclosure, with following tool Body embodiment is used to explain the disclosure together, but does not constitute the limitation to the disclosure.In the accompanying drawings:
Fig. 1 is a kind of flow diagram of character identifying method shown according to an exemplary embodiment;
Fig. 2 is the block diagram of the first character recognition device shown according to an exemplary embodiment;
Fig. 3 is the block diagram of second of character recognition device shown according to an exemplary embodiment;
Fig. 4 is the block diagram of the third character recognition device shown according to an exemplary embodiment;
Fig. 5 is the block diagram of the 4th kind of character recognition device shown according to an exemplary embodiment;
Fig. 6 is the block diagram of the 5th kind of character recognition device shown according to an exemplary embodiment;
Fig. 7 is the block diagram of the 6th kind of character recognition device shown according to an exemplary embodiment;
Fig. 8 is the block diagram of a kind of electronic equipment shown according to an exemplary embodiment.
Specific embodiment
Firstly, the application scenarios to the disclosure are illustrated, the disclosure can be applied to the scene of character recognition, at this Under scape, character recognition algorithm mainly includes two steps of character machining and character recognition.Currently, character machining can be divided into single word Symbol detection and line of text extract two ways, wherein single character machining be directly to the single character in target image into Row detection, line of text extract the character zone for mainly extracting distribution of embarking on journey.For above two mode, single character machining Easily there is a situation where missing inspections, i.e., one or more characters in target image are not detected, to influence character recognition Accuracy rate;It is that will the embark on journey character of distribution is not susceptible to missing inspection as entirety, but needs after detecting line of text that line of text, which is extracted, Each character in line of text is split, to have higher requirement to the accuracy rate of segmentation.For above-mentioned different word Detection mode is accorded with, character recognition mode is also different:It, can be directly to the single character of extraction point when using single character machining It is not identified, and permutation and combination is carried out to all single characters according to the character location information of single character, to generate most Whole recognition result;Using line of text extract when, need first to be split the character in each line of text, then to segmentation after Character is identified, and carries out arrangement group according to character identification result of the location information of each line of text to each line of text It closes, to generate final recognition result.
Since current text image can be divided into file and picture and scene image, wherein what file and picture generally included Character quantity is more, the character regularity of distribution, and image background is single;Characters different from file and picture, that scene image generally includes Negligible amounts, character types are abundant, and arbitrarily, image background is complicated for character distribution.For file and picture and scene image, due to tool Standby above-mentioned different characteristics of image, so that current character recognition algorithm can not carry out word to file and picture and scene image simultaneously Symbol identification, and need to carry out character recognition respectively by different character recognition algorithms, to cause character recognition algorithm Versatility is poor.
To solve the above-mentioned problems, the present disclosure proposes a kind of character identifying method, device, storage medium and electronics to set It is standby, it is possible, firstly, to determine that the image category of target image then determines the corresponding correction of the target image according to image category Then processing mode is corrected processing to the target image according to the corresponding correction process mode of the target image, secondly, At least one line of text image can be extracted from the target image after correction process, finally, identifying according to character recognition model Character to be identified at least one line of text image.Since different image categories corresponds to different correction process modes, this The image of different images classification can be corrected processing according to corresponding correction process mode by sample, and to correction process after Image carry out character recognition, the disclosure, which can satisfy, carries out character recognition to text image and scene image, so as to avoid The poor problem of the versatility of character recognition algorithm in the prior art.
The disclosure is described in detail below with reference to specific embodiment.
Fig. 1 is a kind of flow diagram of character identifying method shown according to an exemplary embodiment.As shown in Figure 1, The method includes:
S101, the corresponding image category of target image including character to be identified is determined.
In this step, which may include file and picture and scene image, wherein file and picture generally includes Character quantity it is more, the character regularity of distribution, image background is single;Words different from file and picture, that scene image generally includes Accord with negligible amounts, character types are abundant, and arbitrarily, image background is complicated for character distribution, it is contemplated that file and picture and scene image it Between have above-mentioned different characteristics of image, therefore, different images classification corresponds to different correction process modes, above-mentioned image category It is merely illustrative, the disclosure is not construed as limiting this.
In one possible implementation, the available image pattern for having determined that image category, and according to the image Sample determines the corresponding image category of the target image, and further, which may include file and picture sample and field Scape image pattern, and the difference between the quantity of the document image pattern and the quantity of the scene image sample is less than or waits In preset threshold, in this way, can be default by file and picture sample and scene image sample training based on the method for deep learning Classifier obtains object classifiers, thus when the target image is input in the object classifiers, which can be with Export the corresponding image category of the target image.
S102, processing is corrected to the target image by the corresponding correction process mode of the image category.
When the image category is file and picture, since character to be identified in file and picture is generally in dense distribution, this Sample may influence whether the accurate of character recognition if the character to be identified in file and picture has inclination and/or distortion Rate, in order to avoid the problem, the disclosure can be corrected processing to the document image, which includes direction school Positive processing and/or distortion correction processing, at this point, being carried out by the corresponding correction process mode of the image category to the target image Correction process may comprise steps of:
The first tilt angle between the character and trunnion axis to be identified in S11, acquisition the document image.
In one possible implementation, this can be obtained by projective analysis method or Hough transform method etc. first to incline Rake angle, it is, of course, also possible to which carrying out Threshold segmentation to the document image obtains binary document image, and according to binary document image In the pixel acquisition of information of character to be identified first tilt angle, detailed process can refer to the prior art, no longer superfluous It states.
S12, determine whether first tilt angle is more than or equal to predetermined angle.
When first tilt angle is more than or equal to the predetermined angle, step S13 and S14 are executed;
When first tilt angle is less than the predetermined angle, step S14 is executed.
S13, correction for direction processing is carried out to the document image.
Wherein, correction for direction processing, which can be, constantly rotates the target image, until the word to be identified in text image The first tilt angle between symbol and trunnion axis is less than the predetermined angle.
S14, determine that the character to be identified in the document image whether there is distortion.
When using scanner or camera acquisition text image, if text inclination itself and bending or shooting visual angle Inclination etc. then will lead to text image and there is distortion, in this way, line of text originally horizontally or vertically is become bended, from And cause the presence of interference between the line of text in text image, influence the final recognition result of character to be identified.
When character to be identified in the document image has distortion, step S15 is executed;
Character to be identified in the document image determines there is no when distortion and completes correction process.
S15, distortion correction processing is carried out to the document image.
Wherein, distortion correction processing can be by being corrected, so that line of text using the blank position between line of text Horizontal distribution or vertical distribution are reverted to, detailed process can refer to the prior art, repeat no more.
It should be noted that for simple description, therefore, it is stated as a series of dynamic for above method embodiment It combines, but those skilled in the art should understand that, the disclosure is not limited by the described action sequence, because of foundation The disclosure, some steps may be performed in other sequences or simultaneously, for example, step S14 and S15 can step S11 it Preceding execution, at this point it is possible to first distortion correction processing, then carry out correction for direction processing;Secondly, those skilled in the art should also know It knows, the embodiments described in the specification are all preferred embodiments, the related actions and modules not necessarily disclosure It is necessary.
To sum up, based on the characteristics of image of text image, step S11 to S15 can be by character to be identified in text image First tilt angle and distortion are corrected, to improve the accuracy rate of the character recognition in subsequent step.
When the image category is scene image, since character to be identified in scene image is generally in sparse distribution, and And often there is a small amount of line of text for being arbitrarily distributed, in this way, influenced between line of text in scene image it is smaller, without into Line distortion correction process, therefore, for scene image, corresponding correction process mode is correction for direction processing, specifically, is passed through The corresponding correction process mode of the image category is corrected processing to the target image and includes the following steps:
S21, at least one character area is obtained to scene image progress word area detection.
Wherein, word area detection may include based on edge detection, be based on region detection, based on skin texture detection or base In any one of study detection, it is, of course, also possible to be two kinds, three kinds or four kinds of knot in above-mentioned four kinds of detection methods It closes, above-mentioned example is merely illustrative, and the disclosure is not construed as limiting this.
S22, the second inclination between the character and trunnion axis to be identified at least one character area is successively obtained Angle.
May be used also certainly likewise it is possible to obtain second tilt angle by projective analysis method or Hough transform method etc. Two-value scene image is obtained to carry out Threshold segmentation to the scene image, and according to the character to be identified in two-value scene image Pixel acquisition of information second tilt angle, detailed process can refer to the prior art, repeat no more.
When second tilt angle is more than or equal to the predetermined angle, step S23 is executed;
When second tilt angle is less than the predetermined angle, determines and complete correction process.
S23, correction for direction processing is carried out at least one character area.
Wherein, correction for direction processing, which can be, constantly rotates the character area, until the word to be identified in this article one's respective area The second tilt angle between symbol and trunnion axis is less than the predetermined angle.
To sum up, based on the characteristics of image of scene image, step S21 to S23 can be by character to be identified in scene image Second tilt angle is corrected, to improve the accuracy rate of the character recognition in subsequent step.
S103, at least one line of text image is extracted from the target image after correction process.
In this step, at least one line of text image can be extracted based on the method for deep learning specifically can wrap Include following steps:
S31, the space characteristics that target image is extracted by the multilayer convolutional layer in line of text detection model.
Wherein, which can be the correlativity in the target image between pixel.
S32, the Recognition with Recurrent Neural Network layer that the space characteristics are input in line of text detection model is obtained into the target image Sequence signature.
In this step, which can be LSTM (long memory network in short-term;Long Short Term Memory Network), BLSTM (two-way length memory network in short-term;Bi-directional Long Short Term Memory Network) or GRU (Gated Recurrent Unit, LSTM variant) etc., above-mentioned example is merely illustrative, The disclosure is not construed as limiting this.
S33, the candidate text box in the target image is obtained according to preset rules, and based on the sequence signature to the candidate Text box is classified.
It in one possible implementation, can be sliding in the target image using the sliding window of default size and ratio Dynamic, to intercept candidate's text box, detailed process refers to the prior art, and the disclosure repeats no more.
Wherein, which can be completed by the classification layer in this article current row detection model, illustratively, the classification layer Can be softmax layers, and the softmax layers of input is consistent with the dimension exported, the softmax layers of input with it is defeated When dimension out is inconsistent, need to increase full articulamentum before softmax layers, to reach softmax layers of input and output Dimension it is consistent.
S34, the text box location information for returning convolutional layer and obtaining candidate text box in line of text detection model is used.
S35, using NMS, (non-maximum value inhibits;Non maximum suppression) method, according to text frame position Confidence breath and classification results screen candidate text box to obtain line of text image.
S104, the character to be identified at least one this article current row image is identified by preset characters identification model.
Usual character recognition step is handled as unit of character, then carries out Character prediction using character classifier, still, In line of text image complexity, Character segmentation is relatively difficult, may destroy charcter topology, since the precision of Character segmentation is direct The final recognition result of character is influenced, in order to avoid the low problem of recognition accuracy caused by Character segmentation, the disclosure can be with As a whole by line of text image, the character to be identified in this article current row image is not cut, Direct Recognition text Whole character to be identified in row image, so as to make full use of character context relation to be identified.
It should be noted that further including before this step:The location information of at least one this article current row image is obtained, In, after determining line of text image in step s 103, this article current row image pair can be determined according to text box location information The location information answered, at this point, identifying at least one this article current row image by the preset characters identification model and the location information In the character to be identified, which includes deep learning layer, circulating net network layers and coding layer, specifically Ground, character recognition process may comprise steps of:
S41, character feature extraction is carried out at least one this article current row image according to the deep learning layer.
Wherein, which can be CNN (convolutional neural networks;Convolutional Neural Networks), in this way, at least one this article current row image can be formed multiple slices along horizontal direction by CNN, each Slice has corresponded to a character feature, since there may be overlappings between the contiguous slices, so that the character feature includes Certain context relation.
S42, the character feature of extraction is input to the circulating net network layers, and to obtain at least one this article current row image corresponding Feature vector.
Wherein, which can be LSTM, BLSTM or GRU etc., in this way, passing through the neural net layer The character feature can further be learnt, to obtain being sliced corresponding feature vector, above-mentioned example is merely illustrative, this It is open that this is not construed as limiting.
S43, it this feature vector is input to the coding layer obtains the coding result of at least one this article current row image, and root The text information of at least one this article current row image is obtained according to the coding result.
In this step, which can be CTC (timing sorting algorithm;Connectionist Temporal Classification) layer, in this way, coding result can be obtained according to CTC layers, due to may include more in this article current row image Therefore a character to be identified may include multiple codings in the coding result, in this way, by each coding in the coding result Matched to obtain the corresponding character of each coding with pre-arranged code corresponding relationship, it will be every according to the coded sequence of multiple coding The corresponding character of a coding carries out ordered arrangement and obtains the text information of this article current row image, wherein the pre-arranged code is corresponding to close It is the corresponding relationship between coded samples and character sample, above-mentioned example is merely illustrative, and the disclosure is not construed as limiting this.
S44, it is somebody's turn to do according to text information progress ordered arrangement of the location information at least one this article current row image The target identification result of target image.
In this step, can be obtained according to the location information at least one line of text image in this article current row image it Between sequencing, know to be ranked up to obtain target according to sequencing for the text information of at least one line of text image Other result.
It should be noted that the disclosure be by the character to be identified in target image be it is horizontally arranged for be illustrated , when the character to be identified is vertical arrangement, at least one text column image in the target image can be extracted, and pass through Preset characters identification model identifies the character to be identified at least one text column image, and detailed process can refer to above-mentioned The narration of line of text image, repeats no more.
The above method is used, it is possible, firstly, to determine the image category of target image, then, determining according to image category should The corresponding correction process mode of target image, then, according to the corresponding correction process mode of the target image to the target image It is corrected processing, secondly, at least one line of text image can be extracted from the target image after correction process, finally, root The character to be identified at least one line of text image is identified according to character recognition model.Since different image categories is corresponding different Correction process mode, in this way, the image of different images classification can be corrected place according to corresponding correction process mode Reason, and character recognition is carried out to the image after correction process, the disclosure, which can satisfy, carries out word to text image and scene image Symbol identification, so as to avoid the problem that the versatility of character recognition algorithm in the prior art is poor.
Fig. 2 is the block diagram of character recognition device 20 shown according to an exemplary embodiment, as shown in Fig. 2, including:
Determining module 201, for determining the corresponding image category of target image including character to be identified;Wherein, different Image category correspond to different correction process modes;
Correction module 202, for being corrected by the corresponding correction process mode of the image category to the target image Processing;
Extraction module 203, for extracting at least one line of text image from the target image after correction process;
Identification module 204, for by preset characters identification model identify at least one this article current row image should be to Identify character.
Optionally, which includes file and picture and scene image.
Fig. 3 is the block diagram of determining module 201 shown according to an exemplary embodiment, as shown in figure 3, the determining module 201 include:
First acquisition submodule 2011, for obtaining the image pattern for having determined that image category;
First determines submodule 2012, for determining the corresponding image category of the target image according to the image pattern.
Fig. 4 is the block diagram of correction module 202 shown according to an exemplary embodiment, as shown in figure 4, in the image category When for file and picture, which includes correction for direction processing and/or distortion correction processing;In the correction process mode When handling including direction correction process and the distortion correction, which includes:
Second acquisition submodule 2021, for obtaining between the character and trunnion axis to be identified in text image One tilt angle;
First correction module 2022 is used for when first tilt angle is more than or equal to predetermined angle, to this article This image carries out correction for direction processing;
Second determines submodule 2023, for determining the character to be identified in text image with the presence or absence of distortion;
Second correction module 2024, when there is distortion for the character to be identified in text image, to this article This image carries out distortion correction processing.
Fig. 5 is the block diagram of correction module 202 shown according to an exemplary embodiment, as shown in figure 5, in the image category When for scene image, which includes correction for direction processing;The correction module 202 includes:
Detection sub-module 2025 obtains at least one character area for carrying out word area detection to the scene image;
Third acquisition submodule 2026, for successively obtaining the character and water to be identified at least one character area The second tilt angle between flat axis;
Third correction module 2027, be greater than for second tilt angle at least one character area or When equal to predetermined angle, correction for direction processing is carried out at least one character area.
Fig. 6 is the block diagram of character recognition device 20 shown according to an exemplary embodiment, as shown in fig. 6, further including:
Module 305 is obtained, for identifying being somebody's turn to do at least one this article current row image by preset characters identification model Before character to be identified, the location information of at least one this article current row image is obtained;
The identification module 304, for identifying at least one this article by the preset characters identification model and the location information The character to be identified in current row image.
Fig. 7 is the block diagram of identification module 304 shown according to an exemplary embodiment, as shown in fig. 7, the preset characters are known Other model includes deep learning layer, circulating net network layers and coding layer, which includes:
Extracting sub-module 3041, for carrying out character feature at least one this article current row image according to the deep learning layer It extracts;
4th acquisition submodule 3042 obtains at least one for the character feature of extraction to be input to the circulating net network layers The corresponding feature vector of this article current row image;
5th acquisition submodule 3043 obtains at least one this article current row for this feature vector to be input to the coding layer The coding result of image, and the text information of at least one this article current row image is obtained according to the coding result;
6th acquisition submodule 3044, for the text information according to the location information at least one this article current row image It carries out ordered arrangement and obtains the target identification result of the target image.
Above-mentioned apparatus is used, it is possible, firstly, to determine the image category of target image, then, determining according to image category should The corresponding correction process mode of target image, then, according to the corresponding correction process mode of the target image to the target image It is corrected processing, secondly, at least one line of text image can be extracted from the target image after correction process, finally, root The character to be identified at least one line of text image is identified according to character recognition model.Since different image categories is corresponding different Correction process mode, in this way, the image of different images classification can be corrected place according to corresponding correction process mode Reason, and character recognition is carried out to the image after correction process, the disclosure, which can satisfy, carries out word to text image and scene image Symbol identification, so as to avoid the problem that the versatility of character recognition algorithm in the prior art is poor.
About the device in above-described embodiment, wherein modules execute the concrete mode of operation in related this method Embodiment in be described in detail, no detailed explanation will be given here.
Fig. 8 is the block diagram of a kind of electronic equipment 800 shown according to an exemplary embodiment.As shown in figure 8, the electronics is set Standby 800 may include:Processor 801, memory 802.The electronic equipment 800 can also include multimedia component 803, input/ Export one or more of (I/O) interface 804 and communication component 805.
Wherein, processor 801 is used to control the integrated operation of the electronic equipment 800, to complete above-mentioned character recognition side All or part of the steps in method.Memory 802 is for storing various types of data to support the behaviour in the electronic equipment 800 To make, these data for example may include the instruction of any application or method for operating on the electronic equipment 800, with And the relevant data of application program, such as contact data, the message of transmitting-receiving, picture, audio, video etc..The memory 802 It can be realized by any kind of volatibility or non-volatile memory device or their combination, such as static random-access is deposited Reservoir (Static Random Access Memory, abbreviation SRAM), electrically erasable programmable read-only memory (Electrically Erasable Programmable Read-Only Memory, abbreviation EEPROM), erasable programmable Read-only memory (Erasable Programmable Read-Only Memory, abbreviation EPROM), programmable read only memory (Programmable Read-Only Memory, abbreviation PROM), and read-only memory (Read-Only Memory, referred to as ROM), magnetic memory, flash memory, disk or CD.Multimedia component 803 may include screen and audio component.Wherein Screen for example can be touch screen, and audio component is used for output and/or input audio signal.For example, audio component may include One microphone, microphone is for receiving external audio signal.The received audio signal can be further stored in storage Device 802 is sent by communication component 805.Audio component further includes at least one loudspeaker, is used for output audio signal.I/O Interface 804 provides interface between processor 801 and other interface modules, other above-mentioned interface modules can be keyboard, mouse, Button etc..These buttons can be virtual push button or entity button.Communication component 805 is for the electronic equipment 800 and other Wired or wireless communication is carried out between equipment.Wireless communication, such as Wi-Fi, bluetooth, near-field communication (Near Field Communication, abbreviation NFC), 2G, 3G or 4G or they one or more of combination, therefore corresponding communication Component 805 may include:Wi-Fi module, bluetooth module, NFC module.
In one exemplary embodiment, electronic equipment 800 can be by one or more application specific integrated circuit (Application Specific Integrated Circuit, abbreviation ASIC), digital signal processor (Digital Signal Processor, abbreviation DSP), digital signal processing appts (Digital Signal Processing Device, Abbreviation DSPD), programmable logic device (Programmable Logic Device, abbreviation PLD), field programmable gate array (Field Programmable Gate Array, abbreviation FPGA), controller, microcontroller, microprocessor or other electronics member Part is realized, for executing above-mentioned character identifying method.
In a further exemplary embodiment, a kind of computer readable storage medium including program instruction is additionally provided, it should The step of above-mentioned character identifying method is realized when program instruction is executed by processor.For example, the computer readable storage medium It can be the above-mentioned memory 802 including program instruction, above procedure instruction can be executed by the processor 801 of electronic equipment 800 To complete above-mentioned character identifying method.
The preferred embodiment of the disclosure is described in detail in conjunction with attached drawing above, still, the disclosure is not limited to above-mentioned reality The detail in mode is applied, in the range of the technology design of the disclosure, a variety of letters can be carried out to the technical solution of the disclosure Monotropic type, these simple variants belong to the protection scope of the disclosure.
It is further to note that specific technical features described in the above specific embodiments, in not lance In the case where shield, it can be combined in any appropriate way.In order to avoid unnecessary repetition, the disclosure to it is various can No further explanation will be given for the combination of energy.
In addition, any combination can also be carried out between a variety of different embodiments of the disclosure, as long as it is without prejudice to originally Disclosed thought equally should be considered as disclosure disclosure of that.

Claims (16)

1. a kind of character identifying method, which is characterized in that the method includes:
Determine the corresponding image category of target image including character to be identified;Wherein, different image categories corresponds to different Correction process mode;
Processing is corrected to the target image by described image classification corresponding correction process mode;
At least one line of text image is extracted from the target image after correction process;
The character to be identified at least one described line of text image is identified by preset characters identification model.
2. the method according to claim 1, wherein described image classification includes file and picture and scene image.
3. method according to claim 1 or 2, which is characterized in that the determination includes the target image of character to be identified Corresponding image category includes:
Obtain the image pattern for having determined that image category;
The corresponding image category of the target image is determined according to described image sample.
4. according to the method described in claim 2, it is characterized in that, described image classification be file and picture when, the correction Processing mode includes correction for direction processing and/or distortion correction processing;It include the correction for direction in the correction process mode Processing and the distortion correction processing when, it is described by the corresponding correction process mode of described image classification to the target image Being corrected processing includes:
Obtain the character to be identified in the file and picture and the first tilt angle between trunnion axis;
When first tilt angle is more than or equal to predetermined angle, correction for direction processing is carried out to the file and picture;
Determine the character to be identified in the file and picture with the presence or absence of distortion;
When the character to be identified in the file and picture has distortion, the file and picture is carried out at distortion correction Reason.
5. according to the method described in claim 2, it is characterized in that, described image classification be scene image when, the correction Processing mode includes correction for direction processing;It is described by the corresponding correction process mode of described image classification to the target image Being corrected processing includes:
Word area detection is carried out to the scene image and obtains at least one character area;
Successively obtain the character to be identified at least one described character area and the second tilt angle between trunnion axis;
When second tilt angle at least one described character area is more than or equal to predetermined angle, at least one A character area carries out correction for direction processing.
6. method according to claim 1 or 2, which is characterized in that it is described by preset characters identification model identify to Before the character to be identified in a few line of text image, further include:
Obtain the location information of at least one line of text image;
It is described to identify that the character to be identified at least one described line of text includes by preset characters identification model:
Described in being identified at least one described line of text image by the preset characters identification model and the location information Character to be identified.
7. according to the method described in claim 6, it is characterized in that, the preset characters identification model include deep learning layer, Circulating net network layers and coding layer, it is described that at least one institute is identified by the preset characters identification model and the location information The character to be identified stated in line of text image includes:
Character feature extraction is carried out at least one described line of text image according to the deep learning layer;
The character feature of extraction is input to the circulating net network layers and obtains the corresponding feature of at least one described line of text image Vector;
Described eigenvector is input to the coding layer and obtains the coding result of at least one line of text image, and according to The coding result obtains the text information of at least one line of text image;
Ordered arrangement is carried out the text information of at least one line of text image according to the positional information and obtains the mesh The target identification result of logo image.
8. a kind of character recognition device, which is characterized in that described device includes:
Determining module, for determining the corresponding image category of target image including character to be identified;Wherein, different image class Different correction process modes is not corresponded to;
Correction module, for being corrected place to the target image by the corresponding correction process mode of described image classification Reason;
Extraction module, for extracting at least one line of text image from the target image after correction process;
Identification module, it is described to be identified at least one described line of text image for being identified by preset characters identification model Character.
9. device according to claim 8, which is characterized in that described image classification includes file and picture and scene image.
10. device according to claim 8 or claim 9, which is characterized in that the determining module includes:
First acquisition submodule, for obtaining the image pattern for having determined that image category;
First determines submodule, for determining the corresponding image category of the target image according to described image sample.
11. device according to claim 9, which is characterized in that when described image classification is file and picture, the correction Processing mode includes correction for direction processing and/or distortion correction processing;It include the correction for direction in the correction process mode When processing and distortion correction processing, the correction module includes:
Second acquisition submodule is inclined for obtaining the character to be identified in the file and picture and first between trunnion axis Rake angle;
First correction module is used for when first tilt angle is more than or equal to predetermined angle, to the document map As carrying out correction for direction processing;
Second determines submodule, for determining the character to be identified in the file and picture with the presence or absence of distortion;
Second correction module, when there is distortion for the character to be identified in the file and picture, to the document Image carries out distortion correction processing.
12. device according to claim 9, which is characterized in that when described image classification is scene image, the correction Processing mode includes correction for direction processing;The correction module includes:
Detection sub-module obtains at least one character area for carrying out word area detection to the scene image;
Third acquisition submodule, for successively obtaining the character to be identified and trunnion axis at least one described character area Between the second tilt angle;
Third correction module is more than or equal to for second tilt angle at least one described character area When predetermined angle, correction for direction processing is carried out at least one described character area.
13. device according to claim 8 or claim 9, which is characterized in that further include:
Obtain module, for described in identifying at least one described line of text image by preset characters identification model wait know Before malapropism symbol, the location information of at least one line of text image is obtained;
The identification module, for identifying at least one described text by the preset characters identification model and the location information The character to be identified in current row image.
14. device according to claim 13, which is characterized in that the preset characters identification model includes deep learning Layer, circulating net network layers and coding layer, the identification module include:
Extracting sub-module is mentioned for carrying out character feature at least one described line of text image according to the deep learning layer It takes;
4th acquisition submodule obtains at least one described text for the character feature of extraction to be input to the circulating net network layers The corresponding feature vector of current row image;
5th acquisition submodule obtains at least one described line of text figure for described eigenvector to be input to the coding layer The coding result of picture, and the text information of at least one line of text image is obtained according to the coding result;
6th acquisition submodule is carried out for the text information according to the positional information at least one line of text image Ordered arrangement obtains the target identification result of the target image.
15. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the program is by processor The step of any one of claim 1-7 the method is realized when execution.
16. a kind of electronic equipment, which is characterized in that including:
Memory is stored thereon with computer program;
Processor, for executing the computer program in the memory, to realize described in any one of claim 1-7 The step of method.
CN201880001125.2A 2018-07-11 2018-07-11 Character recognition method, device, storage medium and electronic equipment Active CN108885699B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/095295 WO2020010547A1 (en) 2018-07-11 2018-07-11 Character identification method and apparatus, and storage medium and electronic device

Publications (2)

Publication Number Publication Date
CN108885699A true CN108885699A (en) 2018-11-23
CN108885699B CN108885699B (en) 2020-06-26

Family

ID=64325024

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880001125.2A Active CN108885699B (en) 2018-07-11 2018-07-11 Character recognition method, device, storage medium and electronic equipment

Country Status (2)

Country Link
CN (1) CN108885699B (en)
WO (1) WO2020010547A1 (en)

Cited By (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110490190A (en) * 2019-07-04 2019-11-22 贝壳技术有限公司 A kind of structured image character recognition method and system
CN110674811A (en) * 2019-09-04 2020-01-10 广东浪潮大数据研究有限公司 Image recognition method and device
CN111126273A (en) * 2019-12-24 2020-05-08 珠海奔图电子有限公司 Image processing method, image processing apparatus, electronic device, and storage medium
CN111242083A (en) * 2020-01-21 2020-06-05 腾讯云计算(北京)有限责任公司 Text processing method, device, equipment and medium based on artificial intelligence
CN111353493A (en) * 2020-03-31 2020-06-30 中国工商银行股份有限公司 Text image direction correction method and device
CN111444908A (en) * 2020-03-25 2020-07-24 腾讯科技(深圳)有限公司 Image recognition method, device, terminal and storage medium
CN111444834A (en) * 2020-03-26 2020-07-24 同盾控股有限公司 Image text line detection method, device, equipment and storage medium
CN111563502A (en) * 2020-05-09 2020-08-21 腾讯科技(深圳)有限公司 Image text recognition method and device, electronic equipment and computer storage medium
CN111639566A (en) * 2020-05-19 2020-09-08 浙江大华技术股份有限公司 Method and device for extracting form information
CN111695377A (en) * 2019-03-13 2020-09-22 杭州海康威视数字技术股份有限公司 Text detection method and device and computer equipment
CN111723627A (en) * 2019-03-22 2020-09-29 北京搜狗科技发展有限公司 Image processing method and device and electronic equipment
CN111753850A (en) * 2020-06-29 2020-10-09 珠海奔图电子有限公司 Document processing method and device, computer equipment and computer readable storage medium
CN111832371A (en) * 2019-04-23 2020-10-27 珠海金山办公软件有限公司 Text picture correction method and device, electronic equipment and machine-readable storage medium
CN111985465A (en) * 2020-08-17 2020-11-24 中移(杭州)信息技术有限公司 Text recognition method, device, equipment and storage medium
CN112115944A (en) * 2020-09-18 2020-12-22 北京搜狗科技发展有限公司 Data processing method and device and recording equipment
CN112132762A (en) * 2020-09-18 2020-12-25 北京搜狗科技发展有限公司 Data processing method and device and recording equipment
CN112132003A (en) * 2020-09-18 2020-12-25 北京搜狗科技发展有限公司 Data processing method and device and recording equipment
WO2021051527A1 (en) * 2019-09-19 2021-03-25 平安科技(深圳)有限公司 Image segmentation-based text positioning method, apparatus and device, and storage medium
CN112949638A (en) * 2019-11-26 2021-06-11 金毛豆科技发展(北京)有限公司 Certificate image uploading method and device
CN113033377A (en) * 2021-03-16 2021-06-25 北京有竹居网络技术有限公司 Character position correction method, character position correction device, electronic equipment and storage medium
CN113128306A (en) * 2020-01-10 2021-07-16 北京字节跳动网络技术有限公司 Vertical text line recognition method, device, equipment and computer readable storage medium
CN113392756A (en) * 2021-06-11 2021-09-14 北京猿力未来科技有限公司 Method and device for identifying picture book
CN113554558A (en) * 2020-04-26 2021-10-26 北京金山数字娱乐科技有限公司 Image processing method and device
CN113688927A (en) * 2021-08-31 2021-11-23 中国平安人寿保险股份有限公司 Picture sample generation method and device, computer equipment and storage medium
CN114155546A (en) * 2022-02-07 2022-03-08 北京世纪好未来教育科技有限公司 Image correction method and device, electronic equipment and storage medium
CN114387432A (en) * 2022-01-13 2022-04-22 平安普惠企业管理有限公司 Character direction detection method and device, electronic equipment and storage medium
CN115983938A (en) * 2022-12-13 2023-04-18 北京京东拓先科技有限公司 Online medicine purchasing management method and device
CN117237957A (en) * 2023-11-16 2023-12-15 新视焰医疗科技(杭州)有限公司 Method and system for detecting direction of document and correcting inclined or malformed document
WO2024078304A1 (en) * 2022-10-12 2024-04-18 华为技术有限公司 Document detection and correction method and terminal

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11651604B2 (en) * 2020-03-31 2023-05-16 Boe Technology Group Co., Ltd. Word recognition method, apparatus and storage medium
CN111611933B (en) * 2020-05-22 2023-07-14 中国科学院自动化研究所 Information extraction method and system for document image
CN111814538B (en) * 2020-05-25 2024-03-05 北京达佳互联信息技术有限公司 Method and device for identifying category of target object, electronic equipment and storage medium
CN111832558A (en) * 2020-06-15 2020-10-27 北京三快在线科技有限公司 Character image correction method, device, storage medium and electronic equipment
CN111695566B (en) * 2020-06-18 2023-03-14 郑州大学 Method and system for identifying and processing fixed format document
CN111767859A (en) * 2020-06-30 2020-10-13 北京百度网讯科技有限公司 Image correction method and device, electronic equipment and computer-readable storage medium
CN111914840A (en) * 2020-07-31 2020-11-10 中国建设银行股份有限公司 Text recognition method, model training method, device and equipment
CN112001331B (en) * 2020-08-26 2024-06-18 上海高德威智能交通系统有限公司 Image recognition method, device, equipment and storage medium
CN114429632B (en) * 2020-10-15 2023-12-12 腾讯科技(深圳)有限公司 Method, device, electronic equipment and computer storage medium for identifying click-to-read content
CN112364834A (en) * 2020-12-07 2021-02-12 上海叠念信息科技有限公司 Form identification restoration method based on deep learning and image processing
CN112560862B (en) * 2020-12-17 2024-02-13 北京百度网讯科技有限公司 Text recognition method and device and electronic equipment
CN112699871B (en) * 2020-12-23 2023-11-14 平安银行股份有限公司 Method, system, device and computer readable storage medium for identifying field content
CN112733623B (en) * 2020-12-26 2024-08-20 科大讯飞华南人工智能研究院(广州)有限公司 Text element extraction method, related device and readable storage medium
CN112784932B (en) * 2021-03-01 2024-06-07 北京百炼智能科技有限公司 Font identification method, device and storage medium
CN113191345A (en) * 2021-04-28 2021-07-30 北京有竹居网络技术有限公司 Text line direction determining method and related equipment thereof
CN113076961B (en) * 2021-05-12 2023-09-05 北京奇艺世纪科技有限公司 Image feature library updating method, image detection method and device
CN113408270B (en) * 2021-06-10 2023-02-10 广州三七极创网络科技有限公司 Variant text recognition method and device and electronic equipment
CN113298079B (en) * 2021-06-28 2023-10-27 北京奇艺世纪科技有限公司 Image processing method and device, electronic equipment and storage medium
CN113610073A (en) * 2021-06-29 2021-11-05 北京搜狗科技发展有限公司 Method and device for identifying formula in picture and storage medium
CN113642556A (en) * 2021-08-04 2021-11-12 五八有限公司 Image processing method and device, electronic equipment and storage medium
CN113657364B (en) * 2021-08-13 2023-07-25 北京百度网讯科技有限公司 Method, device, equipment and storage medium for identifying text mark
CN115147852B (en) * 2022-03-16 2024-10-11 北京有竹居网络技术有限公司 Ancient book identification method, ancient book identification device, storage medium and equipment
CN114495106A (en) * 2022-04-18 2022-05-13 电子科技大学 MOCR (metal-oxide-semiconductor resistor) deep learning method applied to DFB (distributed feedback) laser chip
CN115640401B (en) * 2022-12-07 2023-04-07 恒生电子股份有限公司 Text content extraction method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104636743A (en) * 2013-11-06 2015-05-20 北京三星通信技术研究有限公司 Character image correction method and device
CN105631448A (en) * 2015-12-28 2016-06-01 小米科技有限责任公司 Image correction method and apparatus
CN107610091A (en) * 2017-07-31 2018-01-19 阿里巴巴集团控股有限公司 Vehicle insurance image processing method, device, server and system
CN107862303A (en) * 2017-11-30 2018-03-30 平安科技(深圳)有限公司 Information identifying method, electronic installation and the readable storage medium storing program for executing of form class diagram picture

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104636743A (en) * 2013-11-06 2015-05-20 北京三星通信技术研究有限公司 Character image correction method and device
CN105631448A (en) * 2015-12-28 2016-06-01 小米科技有限责任公司 Image correction method and apparatus
CN107610091A (en) * 2017-07-31 2018-01-19 阿里巴巴集团控股有限公司 Vehicle insurance image processing method, device, server and system
CN107862303A (en) * 2017-11-30 2018-03-30 平安科技(深圳)有限公司 Information identifying method, electronic installation and the readable storage medium storing program for executing of form class diagram picture

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
CHAO REN等: "A New Method on the Segmentation and Recognition of Chinese Characters for Automatic Chinese Seal Imprint Retrieval", 《2011 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION》 *

Cited By (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111695377B (en) * 2019-03-13 2023-09-29 杭州海康威视数字技术股份有限公司 Text detection method and device and computer equipment
CN111695377A (en) * 2019-03-13 2020-09-22 杭州海康威视数字技术股份有限公司 Text detection method and device and computer equipment
CN111723627A (en) * 2019-03-22 2020-09-29 北京搜狗科技发展有限公司 Image processing method and device and electronic equipment
CN111832371A (en) * 2019-04-23 2020-10-27 珠海金山办公软件有限公司 Text picture correction method and device, electronic equipment and machine-readable storage medium
CN110490190A (en) * 2019-07-04 2019-11-22 贝壳技术有限公司 A kind of structured image character recognition method and system
CN110490190B (en) * 2019-07-04 2021-10-26 贝壳技术有限公司 Structured image character recognition method and system
CN110674811A (en) * 2019-09-04 2020-01-10 广东浪潮大数据研究有限公司 Image recognition method and device
WO2021051527A1 (en) * 2019-09-19 2021-03-25 平安科技(深圳)有限公司 Image segmentation-based text positioning method, apparatus and device, and storage medium
CN112949638A (en) * 2019-11-26 2021-06-11 金毛豆科技发展(北京)有限公司 Certificate image uploading method and device
CN112949638B (en) * 2019-11-26 2024-04-05 金毛豆科技发展(北京)有限公司 Certificate image uploading method and device
CN111126273B (en) * 2019-12-24 2024-04-23 珠海奔图电子有限公司 Image processing method, device, electronic equipment and storage medium
CN111126273A (en) * 2019-12-24 2020-05-08 珠海奔图电子有限公司 Image processing method, image processing apparatus, electronic device, and storage medium
CN113128306A (en) * 2020-01-10 2021-07-16 北京字节跳动网络技术有限公司 Vertical text line recognition method, device, equipment and computer readable storage medium
CN111242083A (en) * 2020-01-21 2020-06-05 腾讯云计算(北京)有限责任公司 Text processing method, device, equipment and medium based on artificial intelligence
CN111242083B (en) * 2020-01-21 2024-01-26 腾讯云计算(北京)有限责任公司 Text processing method, device, equipment and medium based on artificial intelligence
WO2021190171A1 (en) * 2020-03-25 2021-09-30 腾讯科技(深圳)有限公司 Image recognition method and apparatus, terminal, and storage medium
US20220245954A1 (en) * 2020-03-25 2022-08-04 Tencent Technology (Shenzhen) Company Limited Image recognition method, apparatus, terminal, and storage medium
US12014556B2 (en) * 2020-03-25 2024-06-18 Tencent Technology (Shenzhen) Company Limited Image recognition method, apparatus, terminal, and storage medium
CN111444908A (en) * 2020-03-25 2020-07-24 腾讯科技(深圳)有限公司 Image recognition method, device, terminal and storage medium
CN111444908B (en) * 2020-03-25 2024-02-02 腾讯科技(深圳)有限公司 Image recognition method, device, terminal and storage medium
TWI808386B (en) * 2020-03-25 2023-07-11 大陸商騰訊科技(深圳)有限公司 Image recognition method, device, terminal and storage medium
CN111444834B (en) * 2020-03-26 2024-10-01 同盾控股有限公司 Image text line detection method, device, equipment and storage medium
CN111444834A (en) * 2020-03-26 2020-07-24 同盾控股有限公司 Image text line detection method, device, equipment and storage medium
CN111353493A (en) * 2020-03-31 2020-06-30 中国工商银行股份有限公司 Text image direction correction method and device
CN111353493B (en) * 2020-03-31 2023-04-28 中国工商银行股份有限公司 Text image direction correction method and device
CN113554558A (en) * 2020-04-26 2021-10-26 北京金山数字娱乐科技有限公司 Image processing method and device
CN113554558B (en) * 2020-04-26 2024-11-01 北京金山数字娱乐科技有限公司 Image processing method and device
CN111563502A (en) * 2020-05-09 2020-08-21 腾讯科技(深圳)有限公司 Image text recognition method and device, electronic equipment and computer storage medium
CN111563502B (en) * 2020-05-09 2023-12-15 腾讯科技(深圳)有限公司 Image text recognition method and device, electronic equipment and computer storage medium
CN111639566B (en) * 2020-05-19 2024-08-09 浙江大华技术股份有限公司 Method and device for extracting form information
CN111639566A (en) * 2020-05-19 2020-09-08 浙江大华技术股份有限公司 Method and device for extracting form information
CN111753850A (en) * 2020-06-29 2020-10-09 珠海奔图电子有限公司 Document processing method and device, computer equipment and computer readable storage medium
CN111985465A (en) * 2020-08-17 2020-11-24 中移(杭州)信息技术有限公司 Text recognition method, device, equipment and storage medium
CN112132003A (en) * 2020-09-18 2020-12-25 北京搜狗科技发展有限公司 Data processing method and device and recording equipment
CN112132762A (en) * 2020-09-18 2020-12-25 北京搜狗科技发展有限公司 Data processing method and device and recording equipment
CN112115944A (en) * 2020-09-18 2020-12-22 北京搜狗科技发展有限公司 Data processing method and device and recording equipment
CN113033377A (en) * 2021-03-16 2021-06-25 北京有竹居网络技术有限公司 Character position correction method, character position correction device, electronic equipment and storage medium
CN113392756A (en) * 2021-06-11 2021-09-14 北京猿力未来科技有限公司 Method and device for identifying picture book
CN113688927A (en) * 2021-08-31 2021-11-23 中国平安人寿保险股份有限公司 Picture sample generation method and device, computer equipment and storage medium
CN114387432A (en) * 2022-01-13 2022-04-22 平安普惠企业管理有限公司 Character direction detection method and device, electronic equipment and storage medium
CN114155546A (en) * 2022-02-07 2022-03-08 北京世纪好未来教育科技有限公司 Image correction method and device, electronic equipment and storage medium
CN114155546B (en) * 2022-02-07 2022-05-20 北京世纪好未来教育科技有限公司 Image correction method and device, electronic equipment and storage medium
WO2024078304A1 (en) * 2022-10-12 2024-04-18 华为技术有限公司 Document detection and correction method and terminal
CN115983938A (en) * 2022-12-13 2023-04-18 北京京东拓先科技有限公司 Online medicine purchasing management method and device
CN117237957A (en) * 2023-11-16 2023-12-15 新视焰医疗科技(杭州)有限公司 Method and system for detecting direction of document and correcting inclined or malformed document

Also Published As

Publication number Publication date
WO2020010547A1 (en) 2020-01-16
CN108885699B (en) 2020-06-26

Similar Documents

Publication Publication Date Title
CN108885699A (en) Character identifying method, device, storage medium and electronic equipment
CN109325954B (en) Image segmentation method and device and electronic equipment
US11475681B2 (en) Image processing method, apparatus, electronic device and computer readable storage medium
CN109840531B (en) Method and device for training multi-label classification model
JP5775225B2 (en) Text detection using multi-layer connected components with histograms
CN110570433B (en) Image semantic segmentation model construction method and device based on generation countermeasure network
CN105574513A (en) Character detection method and device
US11908160B2 (en) Method and apparatus for context-embedding and region-based object detection
CN112581462A (en) Method and device for detecting appearance defects of industrial products and storage medium
CN113112511B (en) Method and device for correcting test paper, storage medium and electronic equipment
CN110135446B (en) Text detection method and computer storage medium
CN114399644A (en) Target detection method and device based on small sample
CN110348475A (en) It is a kind of based on spatial alternation to resisting sample Enhancement Method and model
CN113128481A (en) Face living body detection method, device, equipment and storage medium
CN115937596A (en) Target detection method, training method and device of model thereof, and storage medium
CN112926461B (en) Neural network training and driving control method and device
CN111753729B (en) False face detection method and device, electronic equipment and storage medium
US11164036B2 (en) Human-assisted machine learning through geometric manipulation and refinement
CN108509826B (en) Road identification method and system for remote sensing image
CN111402185B (en) Image detection method and device
CN112330619B (en) Method, device, equipment and storage medium for detecting target area
CN111832550B (en) Data set manufacturing method and device, electronic equipment and storage medium
JP7251078B2 (en) Image processing device and program
CN111898570A (en) Method for recognizing text in image based on bidirectional feature pyramid network
JP4550768B2 (en) Image detection method and image detection apparatus

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20210301

Address after: 201111 2nd floor, building 2, no.1508, Kunyang Road, Minhang District, Shanghai

Patentee after: Dalu Robot Co.,Ltd.

Address before: 518000 Room 201, building A, No. 1, Qian Wan Road, Qianhai Shenzhen Hong Kong cooperation zone, Shenzhen, Guangdong (Shenzhen Qianhai business secretary Co., Ltd.)

Patentee before: Shenzhen Qianhaida Yunyun Intelligent Technology Co.,Ltd.

TR01 Transfer of patent right
CP03 Change of name, title or address

Address after: 201111 Building 8, No. 207, Zhongqing Road, Minhang District, Shanghai

Patentee after: Dayu robot Co.,Ltd.

Address before: 201111 2nd floor, building 2, no.1508, Kunyang Road, Minhang District, Shanghai

Patentee before: Dalu Robot Co.,Ltd.

CP03 Change of name, title or address