CN103064987A - Bogus transaction information identification method - Google Patents
Bogus transaction information identification method Download PDFInfo
- Publication number
- CN103064987A CN103064987A CN2013100376918A CN201310037691A CN103064987A CN 103064987 A CN103064987 A CN 103064987A CN 2013100376918 A CN2013100376918 A CN 2013100376918A CN 201310037691 A CN201310037691 A CN 201310037691A CN 103064987 A CN103064987 A CN 103064987A
- Authority
- CN
- China
- Prior art keywords
- information
- user
- data
- wash sale
- word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a bogus transaction information identification method which comprises the following steps of: S101, acquiring information features and information contents of information published by a user and/or picture information; and S202, performing bogus transaction information identification on the information published by the user according to the information features and the information contents of information published by the user and/or thepicture information. By the method, the amount of bogus transaction information can be greatly reduced, the authenticity of the transaction information is improved, and user experience is improved; and the labor cost can be greatly reduced.
Description
Technical field
The present invention relates to Internet technical field, particularly relate to a kind of wash sale information identifying method.
Background technology
Along with the development of internet, online information becomes and more and more spreads unchecked, and is more and more hard to tell whether it is true or false.Website for types such as ecommerce or classified informations, if can provide safety, real merchandise news for the user, become an important and basic content, so how to identify the true and false key of guaranteeing information security that become that the user releases news, this also is the problem that a lot of websites all face.
On identification wash sale information, present method mainly is by artificial audit, add some technological means, for example determine the IP(Internet Protocol of blacklist, the agreement that interconnects between the network) address, determine the information content of issue or form is illegal, price range is illegal etc. will determine the illegal information deletion of information fully.
The shortcoming of Existing policies is: manual examination and verification consume very much manpower, auxiliary technological means can only be deleted the wash sale information of small part, also have a large amount of wash sale information to escape, can delete 100% and be defined as false information, but 85% may be helpless for the information of vacation to having, because can not judgement information be false degree all.
Summary of the invention
The technical problem to be solved in the present invention provides a kind of wash sale information identifying method and puts, and carries out the problem that the upper manpower consumption of wash sale information identification is large, wash sale information discrimination is low in order to solve prior art.
For solving the problems of the technologies described above, on the one hand, the invention provides a kind of wash sale information identifying method, comprising:
Step S101 obtains information characteristics, the information content and/or pictorial information that the user releases news;
Step S201, the information characteristics, the information content and/or the pictorial information that release news according to the user give out information to the user and to carry out the identification of wash sale information.
Further, before obtaining the information characteristics that the user releases news, may further comprise the steps:
Step S1011, the master data that the user gives out information before obtaining;
Step S1012 according to the master data that user before obtaining gives out information, extracts training data, determines positive negative sample;
Step S1013, the data that align in the negative sample are carried out Feature Conversion, obtain the data of setting data form;
Step S1014 according to the data of setting data form, sets up regression model.
Further, step S1013 specifically comprises:
The feature of every data in the positive negative sample is defined as numeric type or enumeration type two classes;
The dimension values of numeric type is constant, is in the numerical value that these numeric type data are disposed in position in the sample in the numeric type data;
The dimension values of enumeration type is calculated first its md5 value, then with the md5 value to the W delivery, obtain the delivery result; In sample, will be in delivery as a result the numerical value of position put 1.
Further, step S1014 specifically comprises:
The data of the setting data form that step S1013 is obtained are converted into sparse matrix;
Sparse matrix (the x that input produces in the model training program
1, x
2, x
3, x
4, x
5..., x
p), p is the data volume of the data of setting data form; Obtain parameter (β corresponding to each bar record
0, β
1, β
2, β
3, β
4, β
5..., β
p);
Further, after setting up regression model, when receiving the user and release news, then step S101 is specially:
Step S1015 obtains the master data that the user gives out information; Comprise the essential characteristic that the extraction user gives out information and obtain first feature; Essential characteristic is with the master data of first feature as excavation.
Further, after obtaining the master data that the user gives out information, step S201 specifically may further comprise the steps:
Step S2011 carries out Feature Conversion to obtaining the master data that the user gives out information, and obtains the accessible data layout of model;
Step S2012, the data that step S2011 is obtained are converted into the form of sparse matrix, carry out spoofing identification by regression model; Wherein, P〉M, Y=1 then, the expression user releases news and is true sale information; Otherwise, P≤M, Y=0 then, the expression user releases news and is wash sale information, and M is predefined threshold value.
Further, before obtaining the information content that the user releases news, may further comprise the steps:
Step S1021, the information content that the user gives out information before obtaining is also examined, will be by examining and not being divided into two classes by the information of examining, as the sample data of classification;
Step S1022 carries out participle to the information content in the sample;
Step S1023 by calculating, extracts Feature Words;
Step S1024 calculates in every class the eigenwert of each Feature Words in every piece of document;
Step S1025, the eigenwert according to obtaining in every class each word in every piece of document obtains model of cognition by training.
Further, step S1023 specifically comprises:
The CHI value asked in each word; Evolution check formula is:
Wherein, A: the number of documents that under this classification, comprises this word; B: the number of documents that under this classification, does not comprise this word; C: the number of documents that under this classification, does not comprise this word; D: not under this classification, and do not comprise the number of documents of this word; N: expression article sum; T: represent the current word of asking the CHI value; C: the classification of presentation class; x
2: the open check of expression CHI value;
Then get P value of CHI value maximum in all words as Feature Words;
Step S1024 specifically comprises:
Adopt the deformation algorithm computation of characteristic values of TFIDF algorithm or TFIDF, wherein the way of TFIDF is to calculate in every class the number of times of each Feature Words in every piece of document, and the number of files that comprises this word, with the value of TFIDF as eigenwert; Wherein, every piece of document is converted into: category IDs t feature sequence number the form of t eigenwert; The TFIDF formula is: TFIDF=TF * IDF, wherein, and the frequency that TF occurs in this piece document for certain Feature Words, IDF is anti-document frequency, namely total document tree is divided by the number of files that comprises this word.
Further, after obtaining the information content that the user gives out information, step S201 specifically may further comprise the steps:
Step S2021 carries out participle to the information content that the user gives out information;
Step S2022 by calculating, extracts Feature Words;
Step S2023, the eigenwert of each word in the information content that the calculating user gives out information;
Step S2024 according to the model of cognition that obtains, carries out the identification of wash sale information to the information content that the user gives out information.
Further, the pictorial information that releases news according to the user gives out information to the user and to carry out the identification of wash sale information, specifically may further comprise the steps:
Step S2031, the query history picture library judges whether photo current occurs in picture library, if there is, judge further then whether the content of posting is identical, and whether the position is identical, if all different, judge that then it is wash sale information that the user who comprises this picture releases news; Otherwise, judge that then it is true sale information that the user who comprises this picture releases news;
Whether perhaps, judging has watermark on the picture, if having, judges further then whether the watermark on the picture is legal, if illegal, judges that then it is wash sale information that the user who comprises this picture releases news; Otherwise, judge that then it is true sale information that the user who comprises this picture releases news.
Beneficial effect of the present invention is as follows:
The present invention can reduce the falseness amount of Transaction Information greatly, improves the authenticity of Transaction Information, increases the user and experiences, and can greatly reduce human cost simultaneously.
Description of drawings
Fig. 1 is the process flow diagram of a kind of wash sale information identifying method in the embodiment of the invention.
Embodiment
Below in conjunction with accompanying drawing and embodiment, the present invention is further elaborated.Should be appreciated that specific embodiment described herein only in order to explain the present invention, does not limit the present invention.
As shown in Figure 1, the embodiment of the invention relates to a kind of wash sale information identifying method, comprising:
Step S101 obtains information characteristics, the information content and/or pictorial information that the user releases news;
Step S201, the information characteristics, the information content and/or the pictorial information that release news according to the user give out information to the user and to carry out the identification of wash sale information.
Among the step S101, be specifically related to three kinds of situations, the first is to carry out the identification of wash sale information for the information characteristics that the user releases news, and namely carries out the identification of wash sale information based on user characteristics and behavior; The second is to carry out the identification of wash sale information for the information content that the user releases news, and namely carries out the identification of wash sale information based on the model content of text; The third is the wash sale information identification of carrying out for pictorial information.
At first, describe the information characteristics that releases news based on the user and carry out the identification of wash sale information, before obtaining the information characteristics that the user releases news, may further comprise the steps:
Step S1011, the master data that the user gives out information before obtaining.In this step, by the splicing data, the analysis user daily record of posting extracts the essential characteristic that the user gives out information; Wherein, essential characteristic refers to the directly data of extraction acquisition from the user gives out information before, the features such as for example, user's identify label (USER ID), the IP that posts, cookieid, telephone number, temporal information (comprising week, month, date), the duration of posting, pageview, the amount of refreshing, the city of posting, the classification of posting.Then, according to user's essential characteristic, obtain first feature; Wherein, first feature refers on the basis of user's essential characteristic, by the data of adding up or calculating; Such as the number of posting with IP, with IP post the city number, with the user post number, with user post city number, the first features such as number, the city number of posting with cookie of posting with cookie.Essential characteristic is with the master data of first feature as excavation.For example, produce such record R1(123123,192.168.11.11, DFOKIEBNGIDH1232,18311067654 ...).
Step S1012 according to the master data that user before obtaining gives out information, extracts training data.In this step, take the result of step S1011 as the basis, verify out by manual examination and verification to be defined as true or false data, as positive negative sample, True Data is positive sample, and false data is negative sample; For example, R1 is labeled as positive sample or negative sample.The manual examination and verification process can know that altogether information artificially judges according to some, also can carry out demonstration validation by means such as phones.
Step S1013, the data that align in the negative sample are carried out Feature Conversion, obtain the data of setting data form.In this step, the feature of every data in the positive negative sample is defined as numeric type or enumeration type two classes, wherein, numeric type refers to that data itself are exactly numerical value; Enumeration type refers to that data itself are not numerical value, and enumeration type shines upon according to original dimension and value and obtains.Being the data of enumeration type such as USER ID, the IP that posts etc., the duration of posting, is the data of numeric type with user's city number of posting.The dimension values of numeric type is constant; For example, certain characteristic is 20, and the position in sample then puts 20 the 10th position at the 10th.The dimension values of enumeration type is then calculated first its md5(Message Digest Algorithm MD5, Message Digest Algorithm 5) value, then with the md5 value to W(W=300000 for example) delivery, that is: with the md5 value divided by 300000, obtain remainder; The value of enumeration type will drop between the 1-300000 like this.Two features are for example arranged: (telephone number, with the phone number of posting), corresponding value is (18211078765,100), post number for numeric type with phone, and telephone number is enumeration type, so with the phone invariant position of several positions in sample of posting, after telephone number calculates the md5 value, to 300000 deliverys, for example obtain 180834, the vector that this moment, this record produced is (0,100,0 ..., 1), wherein, the 180834th position puts 1 in sample, represents that there is numerical value this position, and numerical value is 1.
Step S1014 according to the data of setting data form, sets up regression model.To the requirement of regression model be the result's that returns codomain between [0,1], perhaps can be mapped in this scope by calculating, below take logistic regression as example.What obtain among the step S1013 is the vector of a rule, for example (0,0,0,0,0,0,0,0,0,12,32,43 ... 1,0,0......1,0,0......), because these vectors may have 300000 dimensions, the expression data volume can quite expend internal memory, thus the vector of a rule is converted into the form of sparse matrix, for example, if upper one is article one, then horizontal ordinate is 1, and the form of corresponding sparse matrix is: 110(is equivalent to ordinate) 12,11132,11243 etc.After each bar all so transforms, the sparse matrix that input produces above being in model training program program, output is parameter corresponding to each bar record.Can simply be interpreted as if a record is (x
1, x
2, x
3, x
4, x
5..., x
p), p is the data volume of the data of setting data form; Find the solution by the model training program, produce (β
0, β
1, β
2, β
3, β
4, β
5..., β
p) etc. corresponding parameter.Set up regression model this moment, and regression model can be expressed as:
G (x)=β wherein
0+ β
1x
1+ β
2x
2+ ... + β
px
p
After setting up regression model, when receiving the user again and release news, then step S101 is specially:
Step S1015 obtains the master data that the user gives out information; Comprise the essential characteristic that the extraction user gives out information and obtain first feature; Essential characteristic is with the master data of first feature as excavation.Particular content is identical with step S1011, and this step is not described in detail.
After obtaining the master data that the user gives out information, step S201 specifically may further comprise the steps:
Step S2011 carries out Feature Conversion to obtaining the master data that the user gives out information, and obtains the data of setting data form.This step is identical with step S1013 method, no longer describes in detail.
Step S2012, the data of the setting data form that step S2011 is obtained are converted into the form of sparse matrix, carry out spoofing identification by regression model.In this step, obtain sparse matrix after, according to the user who the obtains corresponding (x that gives out information
1, x
2, x
3, x
4, x
5..., x
p), just can obtain g (x), so just can be in the hope of the result of P (Y=1|x), i.e. the probability of Y=1; Wherein, P〉M, Y=1 then, the expression user releases news and is true sale information; Otherwise, P≤M, Y=0 then, the expression user releases news and is wash sale information; M is predefined threshold value.
Secondly, describe the information content that releases news based on the user and carry out the identification of wash sale information, before obtaining the information content that the user releases news, may further comprise the steps:
Step S1021, the information content that gives out information of user before obtaining, and to foregoing by audit (manual examination and verification or automatically audit), will by audit with ing by the Transaction Information model examined as two classes, as the sample data of classifying; Algorithm that can high by expert's manual tag and part accuracy rate (be higher than threshold value is set) extracts positive and negative sample training collection automatically;
Step S1022 carries out participle to the information content in the sample, can optimize the participle effect by the mode of Custom Dictionaries.Concrete segmenting method can adopt existing segmenting method, for example ICT segmenting method or other segmenting method.
Step S1023 extracts Feature Words.In this step, filter out and stop word, rare words, common word in the step S1022 participle, then with the check of CHI(evolution) etc. method choose the Feature Words large with the class degree of correlation.Concrete choosing method is: the CHI value asked in each word, then get 1000 values of CHI value maximum in all words as Feature Words.Evolution check formula is:
Wherein, A: the number of documents that under this classification, comprises this word; B: the number of documents that under this classification, does not comprise this word; C: the number of documents that under this classification, does not comprise this word; D: not under this classification, and do not comprise the number of documents of this word; N: expression article sum; T: represent the current word of asking the CHI value; C: the classification of presentation class; x
2: the open check of expression CHI value.
Step S1024 carries out vectorization, obtains in every class the eigenwert of each Feature Words in every piece of document.This step adopts the TFIDF algorithm, calculates in every class the number of times of each Feature Words in every piece of document, and the number of files that comprises this word, with the value of TFIDF as eigenwert.Every piece of document is converted into: category IDs t feature sequence number the form of t eigenwert.The TFIDF formula is: TFIDF=TF * IDF, wherein, and the frequency that TF occurs in this piece document for certain Feature Words, IDF is anti-document frequency, namely total document tree is divided by the number of files that comprises this word.
Step S1025, the eigenwert according to obtaining in every class each word in every piece of document obtains model of cognition by training.In this step, employing SVM(support vector machine support vector machine), the modes such as decision tree, Bayess classification are trained above-mentioned eigenwert, every piece of document has been converted into the form of vector among the step S1024, adopt classification (Waikato Environment for Knowledge Analysis, Waikato intellectual analysis environment) program is trained these vectors, can select different sorting techniques, such as SVM, decision tree, Bayess classification etc. produces a model of cognition.SVM, decision tree, Bayess classification are existing ripe training method, and this step is not described in detail.
After obtaining model of cognition, when receiving the user again and release news, then step S101 is specially:
Step S1026 obtains the information content that the user gives out information, and for example, posts as example with the user, then obtains the particular content of model.
After obtaining the information content that the user gives out information, step S201 specifically may further comprise the steps:
Step S2021 carries out participle to the information content that the user gives out information.
Step S2022 extracts Feature Words.This step is identical with step S1023 method, therefore, is not described in detail.
Step S2023 carries out vectorization, obtains the eigenwert of each word in the information content that the user gives out information.This step is identical with step S1024 method, therefore, is not described in detail.
Step S2024 according to the model of cognition that obtains, carries out the identification of wash sale information to the information content that the user gives out information.In this step, the model of cognition that obtains by modes such as SVM, decision tree, Bayess classifications is existing maturity model, and its recognition methods also is existing mature technology, so this step is not described in detail.
At last, description is carried out the identification of wash sale information based on pictorial information, after obtaining the pictorial information that the user releases news, the pictorial information that releases news according to the user gives out information to the user and to carry out wash sale information identification (step S201) and may further comprise the steps:
Step S2031, the query graph valut judges whether photo current occurs in picture library, if there is, judge further then whether the content of posting is identical, and whether the position is identical, if all different, judge that then it is wash sale information that the user who comprises this picture releases news; Otherwise, judge that then it is true sale information that the user who comprises this picture releases news; Whether perhaps, judging has watermark on the picture, if having, judges further then whether the watermark on the picture is legal, if illegal, judges that then it is wash sale information that the user who comprises this picture releases news; Otherwise, judge that then it is true sale information that the user who comprises this picture releases news.
In addition, above-mentioned three kinds of strategies also can make up, and combine and judge, for example, two kinds of situation combinations, or three kinds of situation combinations; In above-mentioned three kinds of situations, there are any one or two kinds of situations to judge that it is wash sale information that the user releases news, and judges that then it is wash sale information that the user releases news.
As can be seen from the above-described embodiment, the present invention can reduce the falseness amount of Transaction Information greatly, improves the authenticity of Transaction Information, increases the user and experiences, and can greatly reduce human cost simultaneously.
Although be the example purpose, the preferred embodiments of the present invention are disclosed, it also is possible those skilled in the art will recognize various improvement, increase and replacement, therefore, scope of the present invention should be not limited to above-described embodiment.
Claims (10)
1. a wash sale information identifying method is characterized in that, comprising:
Step S101 obtains information characteristics, the information content and/or pictorial information that the user releases news;
Step S201, the information characteristics, the information content and/or the pictorial information that release news according to the user give out information to the user and to carry out the identification of wash sale information.
2. wash sale information identifying method as claimed in claim 1 is characterized in that, before obtaining the information characteristics that the user releases news, may further comprise the steps:
Step S1011, the master data that the user gives out information before obtaining;
Step S1012 according to the master data that user before obtaining gives out information, extracts training data, determines positive negative sample;
Step S1013, the data that align in the negative sample are carried out Feature Conversion, obtain the data of setting data form;
Step S1014 according to the data of setting data form, sets up regression model.
3. wash sale information identifying method as claimed in claim 2 is characterized in that, step S1013 specifically comprises:
The feature of every data in the positive negative sample is defined as numeric type or enumeration type two classes;
The dimension values of numeric type is constant, is in the numerical value that these numeric type data are disposed in position in the sample in the numeric type data;
The dimension values of enumeration type is then calculated first its md5 value, then with the md5 value to the W delivery, obtain the delivery result; In sample, will be in delivery as a result the numerical value of position put 1.
4. wash sale information identifying method as claimed in claim 3 is characterized in that, step S1014 specifically comprises:
The data that step S1013 is obtained are converted into sparse matrix;
Sparse matrix (the x that input produces in model training program program
1, x
2, x
3, x
4, x
5..., x
p), p is the data volume of the data of setting data form; Obtain parameter (β corresponding to each bar record
0, β
1, β
2, β
3, β
4, β
5..., β
p);
Set up regression model, regression model is:
G (x)=β wherein
0+ β
1x
1+ β
2x
2+ ... + β
px
p
5. wash sale information identifying method as claimed in claim 4 is characterized in that, after setting up regression model, when receiving the user and release news, then step S101 is specially:
Step S1015 obtains the master data that the user gives out information; Comprise the essential characteristic that the extraction user gives out information and obtain first feature; Essential characteristic is with the master data of first feature as excavation.
6. wash sale information identifying method as claimed in claim 5 is characterized in that, after obtaining the master data that the user gives out information, step S201 specifically may further comprise the steps:
Step S2011 carries out Feature Conversion to obtaining the master data that the user gives out information, and obtains the data of setting data form;
Step S2012, the data of the setting data form that step S2011 is obtained are converted into the form of sparse matrix, carry out spoofing identification by regression model; Wherein, P〉M, Y=1 then, the expression user releases news and is true sale information; Otherwise, P≤M, Y=0 then, the expression user releases news and is wash sale information; M is predefined threshold value.
7. such as claim 1 or 6 described wash sale information identifying methods, it is characterized in that, before obtaining the information content that the user releases news, may further comprise the steps:
Step S1021, the information content that the user gives out information before obtaining is also examined, will be by examining and not being divided into two classes by the information of examining, as the sample data of classification;
Step S1022 carries out participle to the information content in the sample;
Step S1023 by calculating, extracts Feature Words;
Step S1024 calculates in every class the eigenwert of each Feature Words in every piece of document;
Step S1025, the eigenwert according to obtaining in every class each word in every piece of document obtains model of cognition by training.
8. wash sale information identifying method as claimed in claim 7 is characterized in that, step S1023 specifically comprises:
The CHI value asked in each word; Evolution check formula is:
Wherein, A: the number of documents that under this classification, comprises this word; B: the number of documents that under this classification, does not comprise this word; C: the number of documents that under this classification, does not comprise this word; D: not under this classification, and do not comprise the number of documents of this word; N: expression article sum; T: represent the current word of asking the CHI value; C: the classification of presentation class; X2: the open check of expression CHI value;
Then get P value of CHI value maximum in all words as Feature Words;
Step S1024 specifically comprises:
Adopt the TFIDF algorithm, calculate in every class the number of times of each Feature Words in every piece of document, and the number of files that comprises this word, with the value of TFIDF as eigenwert; Wherein, every piece of document is converted into: category IDs t feature sequence number the form of t eigenwert; The TFIDF formula is: TFIDF=TF * IDF, wherein, and the frequency that TF occurs in this piece document for certain Feature Words, IDF is anti-document frequency, namely total document tree is divided by the number of files that comprises this word.
9. wash sale information identifying method as claimed in claim 8 is characterized in that, after obtaining the information content that the user gives out information, step S201 specifically may further comprise the steps:
Step S2021 carries out participle to the information content that the user gives out information;
Step S2022 by calculating, extracts Feature Words;
Step S2023, the eigenwert of each word in the information content that the calculating user gives out information;
Step S2024 according to the model of cognition that obtains, carries out the identification of wash sale information to the information content that the user gives out information.
10. such as claim 1,6 or 9 described wash sale information identifying methods, it is characterized in that, the pictorial information that releases news according to the user gives out information to the user and to carry out the identification of wash sale information, specifically may further comprise the steps:
Step S2031, the query graph valut judges whether photo current occurs in picture library, if there is, judge further then whether the content of posting is identical, and whether the position is identical, if all different, judge that then it is wash sale information that the user who comprises this picture releases news; Otherwise, judge that then it is true sale information that the user who comprises this picture releases news;
Whether perhaps, judging has watermark on the picture, if having, judges further then whether the watermark on the picture is legal, if illegal, judges that then it is wash sale information that the user who comprises this picture releases news; Otherwise, judge that then it is true sale information that the user who comprises this picture releases news.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310037691.8A CN103064987B (en) | 2013-01-31 | 2013-01-31 | A kind of wash sale information identifying method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310037691.8A CN103064987B (en) | 2013-01-31 | 2013-01-31 | A kind of wash sale information identifying method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103064987A true CN103064987A (en) | 2013-04-24 |
CN103064987B CN103064987B (en) | 2016-09-21 |
Family
ID=48107617
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310037691.8A Active CN103064987B (en) | 2013-01-31 | 2013-01-31 | A kind of wash sale information identifying method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103064987B (en) |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103279868A (en) * | 2013-05-22 | 2013-09-04 | 兰亭集势有限公司 | Method and device for automatically identifying fraud order form |
CN103530400A (en) * | 2013-10-23 | 2014-01-22 | 合山市科学技术情报研究所 | Information recognition device |
CN104616183A (en) * | 2015-01-27 | 2015-05-13 | 深圳市斯度尔科技有限公司 | Network transaction identifying system, client and method |
CN104657503A (en) * | 2015-03-13 | 2015-05-27 | 浪潮集团有限公司 | Method for preprocessing abnormal values of e-business sales amounts based on statistical discrimination process |
CN104731803A (en) * | 2013-12-21 | 2015-06-24 | 长沙微观信息科技有限公司 | Technology for preventing false network information |
CN106157045A (en) * | 2015-03-26 | 2016-11-23 | 阿里巴巴集团控股有限公司 | Method based on logistics data identification wash sale, device and server |
CN106504137A (en) * | 2015-09-03 | 2017-03-15 | 中山市八喜电脑网络有限公司 | Network false information of real estate prosecution system |
CN106611321A (en) * | 2015-10-22 | 2017-05-03 | 百度在线网络技术(北京)有限公司 | An identification method and apparatus for fake handset numbers |
CN106920109A (en) * | 2017-02-21 | 2017-07-04 | 福建师范大学福清分校 | Recognition methods, system and e-commerce system for ecommerce wash sale |
CN106952190A (en) * | 2017-03-22 | 2017-07-14 | 国信优易数据有限公司 | False source of houses typing Activity recognition and early warning system |
WO2017140222A1 (en) * | 2016-02-19 | 2017-08-24 | 阿里巴巴集团控股有限公司 | Modelling method and device for machine learning model |
CN107133833A (en) * | 2016-02-26 | 2017-09-05 | 阿里巴巴集团控股有限公司 | abnormal transaction identification method and device |
CN107545505A (en) * | 2016-06-24 | 2018-01-05 | 上海壹账通金融科技有限公司 | Insure recognition methods and the system of finance product information |
CN108053214A (en) * | 2017-12-12 | 2018-05-18 | 阿里巴巴集团控股有限公司 | A kind of recognition methods of wash sale and device |
CN108600113A (en) * | 2018-04-12 | 2018-09-28 | 北京五八信息技术有限公司 | A kind of the preliminary audit survey method, apparatus and storage medium of data to be released |
CN108734327A (en) * | 2017-04-20 | 2018-11-02 | 腾讯科技(深圳)有限公司 | A kind of data processing method, device and server |
CN109284614A (en) * | 2018-08-10 | 2019-01-29 | 五八有限公司 | Information Authentication method, apparatus, computer equipment and computer readable storage medium |
CN109685527A (en) * | 2018-12-14 | 2019-04-26 | 拉扎斯网络科技(上海)有限公司 | Method, device, system and computer storage medium for detecting false transactions of merchants |
CN111914645A (en) * | 2020-06-30 | 2020-11-10 | 五八有限公司 | Method and device for identifying false information, electronic equipment and storage medium |
CN116720864A (en) * | 2023-06-26 | 2023-09-08 | 北京智思迪科技有限公司 | Online transaction system and method with false transaction monitoring function |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060144933A1 (en) * | 2004-12-30 | 2006-07-06 | Do Phuc K | Method to detect false purchases with a consumer service device |
CN102339445A (en) * | 2010-07-23 | 2012-02-01 | 阿里巴巴集团控股有限公司 | Method and system for evaluating credibility of network trade user |
-
2013
- 2013-01-31 CN CN201310037691.8A patent/CN103064987B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060144933A1 (en) * | 2004-12-30 | 2006-07-06 | Do Phuc K | Method to detect false purchases with a consumer service device |
CN102339445A (en) * | 2010-07-23 | 2012-02-01 | 阿里巴巴集团控股有限公司 | Method and system for evaluating credibility of network trade user |
Non-Patent Citations (3)
Title |
---|
王园: "基于内容检索的垃圾邮件过滤器研究与实现", 《中国优秀硕士学位论文全文数据库信息科技辑》 * |
蒋杰: "基于分类技术的电子支付平台作弊账户的识别模型研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 * |
郭学敏: "基于语义的广告图像垃圾邮件过滤技术研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 * |
Cited By (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103279868A (en) * | 2013-05-22 | 2013-09-04 | 兰亭集势有限公司 | Method and device for automatically identifying fraud order form |
CN103279868B (en) * | 2013-05-22 | 2016-08-17 | 兰亭集势有限公司 | A kind of method and apparatus of automatic identification swindle order |
CN103530400A (en) * | 2013-10-23 | 2014-01-22 | 合山市科学技术情报研究所 | Information recognition device |
CN104731803A (en) * | 2013-12-21 | 2015-06-24 | 长沙微观信息科技有限公司 | Technology for preventing false network information |
CN104616183A (en) * | 2015-01-27 | 2015-05-13 | 深圳市斯度尔科技有限公司 | Network transaction identifying system, client and method |
CN104616183B (en) * | 2015-01-27 | 2017-11-07 | 杨芳 | Network trading identification system, client and discrimination method |
CN104657503A (en) * | 2015-03-13 | 2015-05-27 | 浪潮集团有限公司 | Method for preprocessing abnormal values of e-business sales amounts based on statistical discrimination process |
CN106157045A (en) * | 2015-03-26 | 2016-11-23 | 阿里巴巴集团控股有限公司 | Method based on logistics data identification wash sale, device and server |
CN106157045B (en) * | 2015-03-26 | 2021-07-23 | 创新先进技术有限公司 | Method, device and server for identifying false transactions based on logistics data |
CN106504137A (en) * | 2015-09-03 | 2017-03-15 | 中山市八喜电脑网络有限公司 | Network false information of real estate prosecution system |
CN106611321A (en) * | 2015-10-22 | 2017-05-03 | 百度在线网络技术(北京)有限公司 | An identification method and apparatus for fake handset numbers |
TWI789345B (en) * | 2016-02-19 | 2023-01-11 | 香港商阿里巴巴集團服務有限公司 | Modeling method and device for machine learning model |
WO2017140222A1 (en) * | 2016-02-19 | 2017-08-24 | 阿里巴巴集团控股有限公司 | Modelling method and device for machine learning model |
CN107133833A (en) * | 2016-02-26 | 2017-09-05 | 阿里巴巴集团控股有限公司 | abnormal transaction identification method and device |
CN107545505A (en) * | 2016-06-24 | 2018-01-05 | 上海壹账通金融科技有限公司 | Insure recognition methods and the system of finance product information |
CN107545505B (en) * | 2016-06-24 | 2020-09-29 | 深圳壹账通智能科技有限公司 | Method and system for identifying insurance financing product information |
CN106920109A (en) * | 2017-02-21 | 2017-07-04 | 福建师范大学福清分校 | Recognition methods, system and e-commerce system for ecommerce wash sale |
CN106952190A (en) * | 2017-03-22 | 2017-07-14 | 国信优易数据有限公司 | False source of houses typing Activity recognition and early warning system |
CN108734327A (en) * | 2017-04-20 | 2018-11-02 | 腾讯科技(深圳)有限公司 | A kind of data processing method, device and server |
CN108053214A (en) * | 2017-12-12 | 2018-05-18 | 阿里巴巴集团控股有限公司 | A kind of recognition methods of wash sale and device |
CN108600113A (en) * | 2018-04-12 | 2018-09-28 | 北京五八信息技术有限公司 | A kind of the preliminary audit survey method, apparatus and storage medium of data to be released |
CN109284614A (en) * | 2018-08-10 | 2019-01-29 | 五八有限公司 | Information Authentication method, apparatus, computer equipment and computer readable storage medium |
CN109685527A (en) * | 2018-12-14 | 2019-04-26 | 拉扎斯网络科技(上海)有限公司 | Method, device, system and computer storage medium for detecting false transactions of merchants |
CN109685527B (en) * | 2018-12-14 | 2024-03-29 | 拉扎斯网络科技(上海)有限公司 | Method, device, system and computer storage medium for detecting merchant false transaction |
CN111914645A (en) * | 2020-06-30 | 2020-11-10 | 五八有限公司 | Method and device for identifying false information, electronic equipment and storage medium |
CN116720864A (en) * | 2023-06-26 | 2023-09-08 | 北京智思迪科技有限公司 | Online transaction system and method with false transaction monitoring function |
Also Published As
Publication number | Publication date |
---|---|
CN103064987B (en) | 2016-09-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103064987B (en) | A kind of wash sale information identifying method | |
Girgis et al. | Deep learning algorithms for detecting fake news in online text | |
CA3138730C (en) | Public-opinion analysis method and system for providing early warning of enterprise risks | |
CN107291780B (en) | User comment information display method and device | |
CN109872162B (en) | Wind control classification and identification method and system for processing user complaint information | |
US11593671B2 (en) | Systems and methods for semantic analysis based on knowledge graph | |
CN111553318A (en) | Sensitive information extraction method, referee document processing method and device and electronic equipment | |
CN107545038B (en) | Text classification method and equipment | |
CN110197389A (en) | A kind of user identification method and device | |
US10956522B1 (en) | Regular expression generation and screening of textual items | |
CN109766441B (en) | Text classification method, device and system | |
CN110825880A (en) | Case winning rate determining method, device, equipment and computer readable storage medium | |
CN110362689A (en) | A kind of methods of risk assessment, device, storage medium and server | |
CN112860841A (en) | Text emotion analysis method, device and equipment and storage medium | |
CN106095939B (en) | The acquisition methods and device of account authority | |
CN110609908A (en) | Case serial-parallel method and device | |
WO2018028065A1 (en) | Method and device for classifying short message and computer storage medium | |
CN103177129A (en) | Internet real-time information recommendation and prediction system | |
CN111324370A (en) | Method and device for carrying out risk processing on to-be-on-line small program | |
CN111695357A (en) | Text labeling method and related product | |
Deekshan et al. | Detection and summarization of honest reviews using text mining | |
CN113420789B (en) | Method and device for predicting risk account number, storage medium and computer equipment | |
CN113095723A (en) | Coupon recommendation method and device | |
CN115618120B (en) | Public number information pushing method, system, terminal equipment and storage medium | |
CN112559679B (en) | Political new media propagation force detection method, device, equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |