CN103064987A - Bogus transaction information identification method - Google Patents

Bogus transaction information identification method Download PDF

Info

Publication number
CN103064987A
CN103064987A CN2013100376918A CN201310037691A CN103064987A CN 103064987 A CN103064987 A CN 103064987A CN 2013100376918 A CN2013100376918 A CN 2013100376918A CN 201310037691 A CN201310037691 A CN 201310037691A CN 103064987 A CN103064987 A CN 103064987A
Authority
CN
China
Prior art keywords
information
user
data
wash sale
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013100376918A
Other languages
Chinese (zh)
Other versions
CN103064987B (en
Inventor
王永康
张爱华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing 58 Information Technology Co Ltd
Original Assignee
Beijing 58 Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing 58 Information Technology Co Ltd filed Critical Beijing 58 Information Technology Co Ltd
Priority to CN201310037691.8A priority Critical patent/CN103064987B/en
Publication of CN103064987A publication Critical patent/CN103064987A/en
Application granted granted Critical
Publication of CN103064987B publication Critical patent/CN103064987B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a bogus transaction information identification method which comprises the following steps of: S101, acquiring information features and information contents of information published by a user and/or picture information; and S202, performing bogus transaction information identification on the information published by the user according to the information features and the information contents of information published by the user and/or thepicture information. By the method, the amount of bogus transaction information can be greatly reduced, the authenticity of the transaction information is improved, and user experience is improved; and the labor cost can be greatly reduced.

Description

A kind of wash sale information identifying method
Technical field
The present invention relates to Internet technical field, particularly relate to a kind of wash sale information identifying method.
Background technology
Along with the development of internet, online information becomes and more and more spreads unchecked, and is more and more hard to tell whether it is true or false.Website for types such as ecommerce or classified informations, if can provide safety, real merchandise news for the user, become an important and basic content, so how to identify the true and false key of guaranteeing information security that become that the user releases news, this also is the problem that a lot of websites all face.
On identification wash sale information, present method mainly is by artificial audit, add some technological means, for example determine the IP(Internet Protocol of blacklist, the agreement that interconnects between the network) address, determine the information content of issue or form is illegal, price range is illegal etc. will determine the illegal information deletion of information fully.
The shortcoming of Existing policies is: manual examination and verification consume very much manpower, auxiliary technological means can only be deleted the wash sale information of small part, also have a large amount of wash sale information to escape, can delete 100% and be defined as false information, but 85% may be helpless for the information of vacation to having, because can not judgement information be false degree all.
Summary of the invention
The technical problem to be solved in the present invention provides a kind of wash sale information identifying method and puts, and carries out the problem that the upper manpower consumption of wash sale information identification is large, wash sale information discrimination is low in order to solve prior art.
For solving the problems of the technologies described above, on the one hand, the invention provides a kind of wash sale information identifying method, comprising:
Step S101 obtains information characteristics, the information content and/or pictorial information that the user releases news;
Step S201, the information characteristics, the information content and/or the pictorial information that release news according to the user give out information to the user and to carry out the identification of wash sale information.
Further, before obtaining the information characteristics that the user releases news, may further comprise the steps:
Step S1011, the master data that the user gives out information before obtaining;
Step S1012 according to the master data that user before obtaining gives out information, extracts training data, determines positive negative sample;
Step S1013, the data that align in the negative sample are carried out Feature Conversion, obtain the data of setting data form;
Step S1014 according to the data of setting data form, sets up regression model.
Further, step S1013 specifically comprises:
The feature of every data in the positive negative sample is defined as numeric type or enumeration type two classes;
The dimension values of numeric type is constant, is in the numerical value that these numeric type data are disposed in position in the sample in the numeric type data;
The dimension values of enumeration type is calculated first its md5 value, then with the md5 value to the W delivery, obtain the delivery result; In sample, will be in delivery as a result the numerical value of position put 1.
Further, step S1014 specifically comprises:
The data of the setting data form that step S1013 is obtained are converted into sparse matrix;
Sparse matrix (the x that input produces in the model training program 1, x 2, x 3, x 4, x 5..., x p), p is the data volume of the data of setting data form; Obtain parameter (β corresponding to each bar record 0, β 1, β 2, β 3, β 4, β 5..., β p);
Set up regression model, regression model is:
Figure BDA00002800451200021
G (x)=β wherein 0+ β 1x 1+ β 2x 2+ ... + β px p
Further, after setting up regression model, when receiving the user and release news, then step S101 is specially:
Step S1015 obtains the master data that the user gives out information; Comprise the essential characteristic that the extraction user gives out information and obtain first feature; Essential characteristic is with the master data of first feature as excavation.
Further, after obtaining the master data that the user gives out information, step S201 specifically may further comprise the steps:
Step S2011 carries out Feature Conversion to obtaining the master data that the user gives out information, and obtains the accessible data layout of model;
Step S2012, the data that step S2011 is obtained are converted into the form of sparse matrix, carry out spoofing identification by regression model; Wherein, P〉M, Y=1 then, the expression user releases news and is true sale information; Otherwise, P≤M, Y=0 then, the expression user releases news and is wash sale information, and M is predefined threshold value.
Further, before obtaining the information content that the user releases news, may further comprise the steps:
Step S1021, the information content that the user gives out information before obtaining is also examined, will be by examining and not being divided into two classes by the information of examining, as the sample data of classification;
Step S1022 carries out participle to the information content in the sample;
Step S1023 by calculating, extracts Feature Words;
Step S1024 calculates in every class the eigenwert of each Feature Words in every piece of document;
Step S1025, the eigenwert according to obtaining in every class each word in every piece of document obtains model of cognition by training.
Further, step S1023 specifically comprises:
The CHI value asked in each word; Evolution check formula is:
Figure BDA00002800451200031
Wherein, A: the number of documents that under this classification, comprises this word; B: the number of documents that under this classification, does not comprise this word; C: the number of documents that under this classification, does not comprise this word; D: not under this classification, and do not comprise the number of documents of this word; N: expression article sum; T: represent the current word of asking the CHI value; C: the classification of presentation class; x 2: the open check of expression CHI value;
Then get P value of CHI value maximum in all words as Feature Words;
Step S1024 specifically comprises:
Adopt the deformation algorithm computation of characteristic values of TFIDF algorithm or TFIDF, wherein the way of TFIDF is to calculate in every class the number of times of each Feature Words in every piece of document, and the number of files that comprises this word, with the value of TFIDF as eigenwert; Wherein, every piece of document is converted into: category IDs t feature sequence number the form of t eigenwert; The TFIDF formula is: TFIDF=TF * IDF, wherein, and the frequency that TF occurs in this piece document for certain Feature Words, IDF is anti-document frequency, namely total document tree is divided by the number of files that comprises this word.
Further, after obtaining the information content that the user gives out information, step S201 specifically may further comprise the steps:
Step S2021 carries out participle to the information content that the user gives out information;
Step S2022 by calculating, extracts Feature Words;
Step S2023, the eigenwert of each word in the information content that the calculating user gives out information;
Step S2024 according to the model of cognition that obtains, carries out the identification of wash sale information to the information content that the user gives out information.
Further, the pictorial information that releases news according to the user gives out information to the user and to carry out the identification of wash sale information, specifically may further comprise the steps:
Step S2031, the query history picture library judges whether photo current occurs in picture library, if there is, judge further then whether the content of posting is identical, and whether the position is identical, if all different, judge that then it is wash sale information that the user who comprises this picture releases news; Otherwise, judge that then it is true sale information that the user who comprises this picture releases news;
Whether perhaps, judging has watermark on the picture, if having, judges further then whether the watermark on the picture is legal, if illegal, judges that then it is wash sale information that the user who comprises this picture releases news; Otherwise, judge that then it is true sale information that the user who comprises this picture releases news.
Beneficial effect of the present invention is as follows:
The present invention can reduce the falseness amount of Transaction Information greatly, improves the authenticity of Transaction Information, increases the user and experiences, and can greatly reduce human cost simultaneously.
Description of drawings
Fig. 1 is the process flow diagram of a kind of wash sale information identifying method in the embodiment of the invention.
Embodiment
Below in conjunction with accompanying drawing and embodiment, the present invention is further elaborated.Should be appreciated that specific embodiment described herein only in order to explain the present invention, does not limit the present invention.
As shown in Figure 1, the embodiment of the invention relates to a kind of wash sale information identifying method, comprising:
Step S101 obtains information characteristics, the information content and/or pictorial information that the user releases news;
Step S201, the information characteristics, the information content and/or the pictorial information that release news according to the user give out information to the user and to carry out the identification of wash sale information.
Among the step S101, be specifically related to three kinds of situations, the first is to carry out the identification of wash sale information for the information characteristics that the user releases news, and namely carries out the identification of wash sale information based on user characteristics and behavior; The second is to carry out the identification of wash sale information for the information content that the user releases news, and namely carries out the identification of wash sale information based on the model content of text; The third is the wash sale information identification of carrying out for pictorial information.
At first, describe the information characteristics that releases news based on the user and carry out the identification of wash sale information, before obtaining the information characteristics that the user releases news, may further comprise the steps:
Step S1011, the master data that the user gives out information before obtaining.In this step, by the splicing data, the analysis user daily record of posting extracts the essential characteristic that the user gives out information; Wherein, essential characteristic refers to the directly data of extraction acquisition from the user gives out information before, the features such as for example, user's identify label (USER ID), the IP that posts, cookieid, telephone number, temporal information (comprising week, month, date), the duration of posting, pageview, the amount of refreshing, the city of posting, the classification of posting.Then, according to user's essential characteristic, obtain first feature; Wherein, first feature refers on the basis of user's essential characteristic, by the data of adding up or calculating; Such as the number of posting with IP, with IP post the city number, with the user post number, with user post city number, the first features such as number, the city number of posting with cookie of posting with cookie.Essential characteristic is with the master data of first feature as excavation.For example, produce such record R1(123123,192.168.11.11, DFOKIEBNGIDH1232,18311067654 ...).
Step S1012 according to the master data that user before obtaining gives out information, extracts training data.In this step, take the result of step S1011 as the basis, verify out by manual examination and verification to be defined as true or false data, as positive negative sample, True Data is positive sample, and false data is negative sample; For example, R1 is labeled as positive sample or negative sample.The manual examination and verification process can know that altogether information artificially judges according to some, also can carry out demonstration validation by means such as phones.
Step S1013, the data that align in the negative sample are carried out Feature Conversion, obtain the data of setting data form.In this step, the feature of every data in the positive negative sample is defined as numeric type or enumeration type two classes, wherein, numeric type refers to that data itself are exactly numerical value; Enumeration type refers to that data itself are not numerical value, and enumeration type shines upon according to original dimension and value and obtains.Being the data of enumeration type such as USER ID, the IP that posts etc., the duration of posting, is the data of numeric type with user's city number of posting.The dimension values of numeric type is constant; For example, certain characteristic is 20, and the position in sample then puts 20 the 10th position at the 10th.The dimension values of enumeration type is then calculated first its md5(Message Digest Algorithm MD5, Message Digest Algorithm 5) value, then with the md5 value to W(W=300000 for example) delivery, that is: with the md5 value divided by 300000, obtain remainder; The value of enumeration type will drop between the 1-300000 like this.Two features are for example arranged: (telephone number, with the phone number of posting), corresponding value is (18211078765,100), post number for numeric type with phone, and telephone number is enumeration type, so with the phone invariant position of several positions in sample of posting, after telephone number calculates the md5 value, to 300000 deliverys, for example obtain 180834, the vector that this moment, this record produced is (0,100,0 ..., 1), wherein, the 180834th position puts 1 in sample, represents that there is numerical value this position, and numerical value is 1.
Step S1014 according to the data of setting data form, sets up regression model.To the requirement of regression model be the result's that returns codomain between [0,1], perhaps can be mapped in this scope by calculating, below take logistic regression as example.What obtain among the step S1013 is the vector of a rule, for example (0,0,0,0,0,0,0,0,0,12,32,43 ... 1,0,0......1,0,0......), because these vectors may have 300000 dimensions, the expression data volume can quite expend internal memory, thus the vector of a rule is converted into the form of sparse matrix, for example, if upper one is article one, then horizontal ordinate is 1, and the form of corresponding sparse matrix is: 110(is equivalent to ordinate) 12,11132,11243 etc.After each bar all so transforms, the sparse matrix that input produces above being in model training program program, output is parameter corresponding to each bar record.Can simply be interpreted as if a record is (x 1, x 2, x 3, x 4, x 5..., x p), p is the data volume of the data of setting data form; Find the solution by the model training program, produce (β 0, β 1, β 2, β 3, β 4, β 5..., β p) etc. corresponding parameter.Set up regression model this moment, and regression model can be expressed as:
Figure BDA00002800451200071
G (x)=β wherein 0+ β 1x 1+ β 2x 2+ ... + β px p
After setting up regression model, when receiving the user again and release news, then step S101 is specially:
Step S1015 obtains the master data that the user gives out information; Comprise the essential characteristic that the extraction user gives out information and obtain first feature; Essential characteristic is with the master data of first feature as excavation.Particular content is identical with step S1011, and this step is not described in detail.
After obtaining the master data that the user gives out information, step S201 specifically may further comprise the steps:
Step S2011 carries out Feature Conversion to obtaining the master data that the user gives out information, and obtains the data of setting data form.This step is identical with step S1013 method, no longer describes in detail.
Step S2012, the data of the setting data form that step S2011 is obtained are converted into the form of sparse matrix, carry out spoofing identification by regression model.In this step, obtain sparse matrix after, according to the user who the obtains corresponding (x that gives out information 1, x 2, x 3, x 4, x 5..., x p), just can obtain g (x), so just can be in the hope of the result of P (Y=1|x), i.e. the probability of Y=1; Wherein, P〉M, Y=1 then, the expression user releases news and is true sale information; Otherwise, P≤M, Y=0 then, the expression user releases news and is wash sale information; M is predefined threshold value.
Secondly, describe the information content that releases news based on the user and carry out the identification of wash sale information, before obtaining the information content that the user releases news, may further comprise the steps:
Step S1021, the information content that gives out information of user before obtaining, and to foregoing by audit (manual examination and verification or automatically audit), will by audit with ing by the Transaction Information model examined as two classes, as the sample data of classifying; Algorithm that can high by expert's manual tag and part accuracy rate (be higher than threshold value is set) extracts positive and negative sample training collection automatically;
Step S1022 carries out participle to the information content in the sample, can optimize the participle effect by the mode of Custom Dictionaries.Concrete segmenting method can adopt existing segmenting method, for example ICT segmenting method or other segmenting method.
Step S1023 extracts Feature Words.In this step, filter out and stop word, rare words, common word in the step S1022 participle, then with the check of CHI(evolution) etc. method choose the Feature Words large with the class degree of correlation.Concrete choosing method is: the CHI value asked in each word, then get 1000 values of CHI value maximum in all words as Feature Words.Evolution check formula is:
Figure BDA00002800451200081
Wherein, A: the number of documents that under this classification, comprises this word; B: the number of documents that under this classification, does not comprise this word; C: the number of documents that under this classification, does not comprise this word; D: not under this classification, and do not comprise the number of documents of this word; N: expression article sum; T: represent the current word of asking the CHI value; C: the classification of presentation class; x 2: the open check of expression CHI value.
Step S1024 carries out vectorization, obtains in every class the eigenwert of each Feature Words in every piece of document.This step adopts the TFIDF algorithm, calculates in every class the number of times of each Feature Words in every piece of document, and the number of files that comprises this word, with the value of TFIDF as eigenwert.Every piece of document is converted into: category IDs t feature sequence number the form of t eigenwert.The TFIDF formula is: TFIDF=TF * IDF, wherein, and the frequency that TF occurs in this piece document for certain Feature Words, IDF is anti-document frequency, namely total document tree is divided by the number of files that comprises this word.
Step S1025, the eigenwert according to obtaining in every class each word in every piece of document obtains model of cognition by training.In this step, employing SVM(support vector machine support vector machine), the modes such as decision tree, Bayess classification are trained above-mentioned eigenwert, every piece of document has been converted into the form of vector among the step S1024, adopt classification (Waikato Environment for Knowledge Analysis, Waikato intellectual analysis environment) program is trained these vectors, can select different sorting techniques, such as SVM, decision tree, Bayess classification etc. produces a model of cognition.SVM, decision tree, Bayess classification are existing ripe training method, and this step is not described in detail.
After obtaining model of cognition, when receiving the user again and release news, then step S101 is specially:
Step S1026 obtains the information content that the user gives out information, and for example, posts as example with the user, then obtains the particular content of model.
After obtaining the information content that the user gives out information, step S201 specifically may further comprise the steps:
Step S2021 carries out participle to the information content that the user gives out information.
Step S2022 extracts Feature Words.This step is identical with step S1023 method, therefore, is not described in detail.
Step S2023 carries out vectorization, obtains the eigenwert of each word in the information content that the user gives out information.This step is identical with step S1024 method, therefore, is not described in detail.
Step S2024 according to the model of cognition that obtains, carries out the identification of wash sale information to the information content that the user gives out information.In this step, the model of cognition that obtains by modes such as SVM, decision tree, Bayess classifications is existing maturity model, and its recognition methods also is existing mature technology, so this step is not described in detail.
At last, description is carried out the identification of wash sale information based on pictorial information, after obtaining the pictorial information that the user releases news, the pictorial information that releases news according to the user gives out information to the user and to carry out wash sale information identification (step S201) and may further comprise the steps:
Step S2031, the query graph valut judges whether photo current occurs in picture library, if there is, judge further then whether the content of posting is identical, and whether the position is identical, if all different, judge that then it is wash sale information that the user who comprises this picture releases news; Otherwise, judge that then it is true sale information that the user who comprises this picture releases news; Whether perhaps, judging has watermark on the picture, if having, judges further then whether the watermark on the picture is legal, if illegal, judges that then it is wash sale information that the user who comprises this picture releases news; Otherwise, judge that then it is true sale information that the user who comprises this picture releases news.
In addition, above-mentioned three kinds of strategies also can make up, and combine and judge, for example, two kinds of situation combinations, or three kinds of situation combinations; In above-mentioned three kinds of situations, there are any one or two kinds of situations to judge that it is wash sale information that the user releases news, and judges that then it is wash sale information that the user releases news.
As can be seen from the above-described embodiment, the present invention can reduce the falseness amount of Transaction Information greatly, improves the authenticity of Transaction Information, increases the user and experiences, and can greatly reduce human cost simultaneously.
Although be the example purpose, the preferred embodiments of the present invention are disclosed, it also is possible those skilled in the art will recognize various improvement, increase and replacement, therefore, scope of the present invention should be not limited to above-described embodiment.

Claims (10)

1. a wash sale information identifying method is characterized in that, comprising:
Step S101 obtains information characteristics, the information content and/or pictorial information that the user releases news;
Step S201, the information characteristics, the information content and/or the pictorial information that release news according to the user give out information to the user and to carry out the identification of wash sale information.
2. wash sale information identifying method as claimed in claim 1 is characterized in that, before obtaining the information characteristics that the user releases news, may further comprise the steps:
Step S1011, the master data that the user gives out information before obtaining;
Step S1012 according to the master data that user before obtaining gives out information, extracts training data, determines positive negative sample;
Step S1013, the data that align in the negative sample are carried out Feature Conversion, obtain the data of setting data form;
Step S1014 according to the data of setting data form, sets up regression model.
3. wash sale information identifying method as claimed in claim 2 is characterized in that, step S1013 specifically comprises:
The feature of every data in the positive negative sample is defined as numeric type or enumeration type two classes;
The dimension values of numeric type is constant, is in the numerical value that these numeric type data are disposed in position in the sample in the numeric type data;
The dimension values of enumeration type is then calculated first its md5 value, then with the md5 value to the W delivery, obtain the delivery result; In sample, will be in delivery as a result the numerical value of position put 1.
4. wash sale information identifying method as claimed in claim 3 is characterized in that, step S1014 specifically comprises:
The data that step S1013 is obtained are converted into sparse matrix;
Sparse matrix (the x that input produces in model training program program 1, x 2, x 3, x 4, x 5..., x p), p is the data volume of the data of setting data form; Obtain parameter (β corresponding to each bar record 0, β 1, β 2, β 3, β 4, β 5..., β p);
Set up regression model, regression model is: G (x)=β wherein 0+ β 1x 1+ β 2x 2+ ... + β px p
5. wash sale information identifying method as claimed in claim 4 is characterized in that, after setting up regression model, when receiving the user and release news, then step S101 is specially:
Step S1015 obtains the master data that the user gives out information; Comprise the essential characteristic that the extraction user gives out information and obtain first feature; Essential characteristic is with the master data of first feature as excavation.
6. wash sale information identifying method as claimed in claim 5 is characterized in that, after obtaining the master data that the user gives out information, step S201 specifically may further comprise the steps:
Step S2011 carries out Feature Conversion to obtaining the master data that the user gives out information, and obtains the data of setting data form;
Step S2012, the data of the setting data form that step S2011 is obtained are converted into the form of sparse matrix, carry out spoofing identification by regression model; Wherein, P〉M, Y=1 then, the expression user releases news and is true sale information; Otherwise, P≤M, Y=0 then, the expression user releases news and is wash sale information; M is predefined threshold value.
7. such as claim 1 or 6 described wash sale information identifying methods, it is characterized in that, before obtaining the information content that the user releases news, may further comprise the steps:
Step S1021, the information content that the user gives out information before obtaining is also examined, will be by examining and not being divided into two classes by the information of examining, as the sample data of classification;
Step S1022 carries out participle to the information content in the sample;
Step S1023 by calculating, extracts Feature Words;
Step S1024 calculates in every class the eigenwert of each Feature Words in every piece of document;
Step S1025, the eigenwert according to obtaining in every class each word in every piece of document obtains model of cognition by training.
8. wash sale information identifying method as claimed in claim 7 is characterized in that, step S1023 specifically comprises:
The CHI value asked in each word; Evolution check formula is:
Figure FDA00002800451100022
Wherein, A: the number of documents that under this classification, comprises this word; B: the number of documents that under this classification, does not comprise this word; C: the number of documents that under this classification, does not comprise this word; D: not under this classification, and do not comprise the number of documents of this word; N: expression article sum; T: represent the current word of asking the CHI value; C: the classification of presentation class; X2: the open check of expression CHI value;
Then get P value of CHI value maximum in all words as Feature Words;
Step S1024 specifically comprises:
Adopt the TFIDF algorithm, calculate in every class the number of times of each Feature Words in every piece of document, and the number of files that comprises this word, with the value of TFIDF as eigenwert; Wherein, every piece of document is converted into: category IDs t feature sequence number the form of t eigenwert; The TFIDF formula is: TFIDF=TF * IDF, wherein, and the frequency that TF occurs in this piece document for certain Feature Words, IDF is anti-document frequency, namely total document tree is divided by the number of files that comprises this word.
9. wash sale information identifying method as claimed in claim 8 is characterized in that, after obtaining the information content that the user gives out information, step S201 specifically may further comprise the steps:
Step S2021 carries out participle to the information content that the user gives out information;
Step S2022 by calculating, extracts Feature Words;
Step S2023, the eigenwert of each word in the information content that the calculating user gives out information;
Step S2024 according to the model of cognition that obtains, carries out the identification of wash sale information to the information content that the user gives out information.
10. such as claim 1,6 or 9 described wash sale information identifying methods, it is characterized in that, the pictorial information that releases news according to the user gives out information to the user and to carry out the identification of wash sale information, specifically may further comprise the steps:
Step S2031, the query graph valut judges whether photo current occurs in picture library, if there is, judge further then whether the content of posting is identical, and whether the position is identical, if all different, judge that then it is wash sale information that the user who comprises this picture releases news; Otherwise, judge that then it is true sale information that the user who comprises this picture releases news;
Whether perhaps, judging has watermark on the picture, if having, judges further then whether the watermark on the picture is legal, if illegal, judges that then it is wash sale information that the user who comprises this picture releases news; Otherwise, judge that then it is true sale information that the user who comprises this picture releases news.
CN201310037691.8A 2013-01-31 2013-01-31 A kind of wash sale information identifying method Active CN103064987B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310037691.8A CN103064987B (en) 2013-01-31 2013-01-31 A kind of wash sale information identifying method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310037691.8A CN103064987B (en) 2013-01-31 2013-01-31 A kind of wash sale information identifying method

Publications (2)

Publication Number Publication Date
CN103064987A true CN103064987A (en) 2013-04-24
CN103064987B CN103064987B (en) 2016-09-21

Family

ID=48107617

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310037691.8A Active CN103064987B (en) 2013-01-31 2013-01-31 A kind of wash sale information identifying method

Country Status (1)

Country Link
CN (1) CN103064987B (en)

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103279868A (en) * 2013-05-22 2013-09-04 兰亭集势有限公司 Method and device for automatically identifying fraud order form
CN103530400A (en) * 2013-10-23 2014-01-22 合山市科学技术情报研究所 Information recognition device
CN104616183A (en) * 2015-01-27 2015-05-13 深圳市斯度尔科技有限公司 Network transaction identifying system, client and method
CN104657503A (en) * 2015-03-13 2015-05-27 浪潮集团有限公司 Method for preprocessing abnormal values of e-business sales amounts based on statistical discrimination process
CN104731803A (en) * 2013-12-21 2015-06-24 长沙微观信息科技有限公司 Technology for preventing false network information
CN106157045A (en) * 2015-03-26 2016-11-23 阿里巴巴集团控股有限公司 Method based on logistics data identification wash sale, device and server
CN106504137A (en) * 2015-09-03 2017-03-15 中山市八喜电脑网络有限公司 Network false information of real estate prosecution system
CN106611321A (en) * 2015-10-22 2017-05-03 百度在线网络技术(北京)有限公司 An identification method and apparatus for fake handset numbers
CN106920109A (en) * 2017-02-21 2017-07-04 福建师范大学福清分校 Recognition methods, system and e-commerce system for ecommerce wash sale
CN106952190A (en) * 2017-03-22 2017-07-14 国信优易数据有限公司 False source of houses typing Activity recognition and early warning system
WO2017140222A1 (en) * 2016-02-19 2017-08-24 阿里巴巴集团控股有限公司 Modelling method and device for machine learning model
CN107133833A (en) * 2016-02-26 2017-09-05 阿里巴巴集团控股有限公司 abnormal transaction identification method and device
CN107545505A (en) * 2016-06-24 2018-01-05 上海壹账通金融科技有限公司 Insure recognition methods and the system of finance product information
CN108053214A (en) * 2017-12-12 2018-05-18 阿里巴巴集团控股有限公司 A kind of recognition methods of wash sale and device
CN108600113A (en) * 2018-04-12 2018-09-28 北京五八信息技术有限公司 A kind of the preliminary audit survey method, apparatus and storage medium of data to be released
CN108734327A (en) * 2017-04-20 2018-11-02 腾讯科技(深圳)有限公司 A kind of data processing method, device and server
CN109284614A (en) * 2018-08-10 2019-01-29 五八有限公司 Information Authentication method, apparatus, computer equipment and computer readable storage medium
CN109685527A (en) * 2018-12-14 2019-04-26 拉扎斯网络科技(上海)有限公司 Method, device, system and computer storage medium for detecting false transactions of merchants
CN111914645A (en) * 2020-06-30 2020-11-10 五八有限公司 Method and device for identifying false information, electronic equipment and storage medium
CN116720864A (en) * 2023-06-26 2023-09-08 北京智思迪科技有限公司 Online transaction system and method with false transaction monitoring function

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060144933A1 (en) * 2004-12-30 2006-07-06 Do Phuc K Method to detect false purchases with a consumer service device
CN102339445A (en) * 2010-07-23 2012-02-01 阿里巴巴集团控股有限公司 Method and system for evaluating credibility of network trade user

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060144933A1 (en) * 2004-12-30 2006-07-06 Do Phuc K Method to detect false purchases with a consumer service device
CN102339445A (en) * 2010-07-23 2012-02-01 阿里巴巴集团控股有限公司 Method and system for evaluating credibility of network trade user

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
王园: "基于内容检索的垃圾邮件过滤器研究与实现", 《中国优秀硕士学位论文全文数据库信息科技辑》 *
蒋杰: "基于分类技术的电子支付平台作弊账户的识别模型研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 *
郭学敏: "基于语义的广告图像垃圾邮件过滤技术研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 *

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103279868A (en) * 2013-05-22 2013-09-04 兰亭集势有限公司 Method and device for automatically identifying fraud order form
CN103279868B (en) * 2013-05-22 2016-08-17 兰亭集势有限公司 A kind of method and apparatus of automatic identification swindle order
CN103530400A (en) * 2013-10-23 2014-01-22 合山市科学技术情报研究所 Information recognition device
CN104731803A (en) * 2013-12-21 2015-06-24 长沙微观信息科技有限公司 Technology for preventing false network information
CN104616183A (en) * 2015-01-27 2015-05-13 深圳市斯度尔科技有限公司 Network transaction identifying system, client and method
CN104616183B (en) * 2015-01-27 2017-11-07 杨芳 Network trading identification system, client and discrimination method
CN104657503A (en) * 2015-03-13 2015-05-27 浪潮集团有限公司 Method for preprocessing abnormal values of e-business sales amounts based on statistical discrimination process
CN106157045A (en) * 2015-03-26 2016-11-23 阿里巴巴集团控股有限公司 Method based on logistics data identification wash sale, device and server
CN106157045B (en) * 2015-03-26 2021-07-23 创新先进技术有限公司 Method, device and server for identifying false transactions based on logistics data
CN106504137A (en) * 2015-09-03 2017-03-15 中山市八喜电脑网络有限公司 Network false information of real estate prosecution system
CN106611321A (en) * 2015-10-22 2017-05-03 百度在线网络技术(北京)有限公司 An identification method and apparatus for fake handset numbers
TWI789345B (en) * 2016-02-19 2023-01-11 香港商阿里巴巴集團服務有限公司 Modeling method and device for machine learning model
WO2017140222A1 (en) * 2016-02-19 2017-08-24 阿里巴巴集团控股有限公司 Modelling method and device for machine learning model
CN107133833A (en) * 2016-02-26 2017-09-05 阿里巴巴集团控股有限公司 abnormal transaction identification method and device
CN107545505A (en) * 2016-06-24 2018-01-05 上海壹账通金融科技有限公司 Insure recognition methods and the system of finance product information
CN107545505B (en) * 2016-06-24 2020-09-29 深圳壹账通智能科技有限公司 Method and system for identifying insurance financing product information
CN106920109A (en) * 2017-02-21 2017-07-04 福建师范大学福清分校 Recognition methods, system and e-commerce system for ecommerce wash sale
CN106952190A (en) * 2017-03-22 2017-07-14 国信优易数据有限公司 False source of houses typing Activity recognition and early warning system
CN108734327A (en) * 2017-04-20 2018-11-02 腾讯科技(深圳)有限公司 A kind of data processing method, device and server
CN108053214A (en) * 2017-12-12 2018-05-18 阿里巴巴集团控股有限公司 A kind of recognition methods of wash sale and device
CN108600113A (en) * 2018-04-12 2018-09-28 北京五八信息技术有限公司 A kind of the preliminary audit survey method, apparatus and storage medium of data to be released
CN109284614A (en) * 2018-08-10 2019-01-29 五八有限公司 Information Authentication method, apparatus, computer equipment and computer readable storage medium
CN109685527A (en) * 2018-12-14 2019-04-26 拉扎斯网络科技(上海)有限公司 Method, device, system and computer storage medium for detecting false transactions of merchants
CN109685527B (en) * 2018-12-14 2024-03-29 拉扎斯网络科技(上海)有限公司 Method, device, system and computer storage medium for detecting merchant false transaction
CN111914645A (en) * 2020-06-30 2020-11-10 五八有限公司 Method and device for identifying false information, electronic equipment and storage medium
CN116720864A (en) * 2023-06-26 2023-09-08 北京智思迪科技有限公司 Online transaction system and method with false transaction monitoring function

Also Published As

Publication number Publication date
CN103064987B (en) 2016-09-21

Similar Documents

Publication Publication Date Title
CN103064987B (en) A kind of wash sale information identifying method
Girgis et al. Deep learning algorithms for detecting fake news in online text
CA3138730C (en) Public-opinion analysis method and system for providing early warning of enterprise risks
CN107291780B (en) User comment information display method and device
CN109872162B (en) Wind control classification and identification method and system for processing user complaint information
US11593671B2 (en) Systems and methods for semantic analysis based on knowledge graph
CN111553318A (en) Sensitive information extraction method, referee document processing method and device and electronic equipment
CN107545038B (en) Text classification method and equipment
CN110197389A (en) A kind of user identification method and device
US10956522B1 (en) Regular expression generation and screening of textual items
CN109766441B (en) Text classification method, device and system
CN110825880A (en) Case winning rate determining method, device, equipment and computer readable storage medium
CN110362689A (en) A kind of methods of risk assessment, device, storage medium and server
CN112860841A (en) Text emotion analysis method, device and equipment and storage medium
CN106095939B (en) The acquisition methods and device of account authority
CN110609908A (en) Case serial-parallel method and device
WO2018028065A1 (en) Method and device for classifying short message and computer storage medium
CN103177129A (en) Internet real-time information recommendation and prediction system
CN111324370A (en) Method and device for carrying out risk processing on to-be-on-line small program
CN111695357A (en) Text labeling method and related product
Deekshan et al. Detection and summarization of honest reviews using text mining
CN113420789B (en) Method and device for predicting risk account number, storage medium and computer equipment
CN113095723A (en) Coupon recommendation method and device
CN115618120B (en) Public number information pushing method, system, terminal equipment and storage medium
CN112559679B (en) Political new media propagation force detection method, device, equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant