SG11201906380PA - Data type recognition, model training and risk recognition methods, apparatuses and devices - Google Patents

Data type recognition, model training and risk recognition methods, apparatuses and devices

Info

Publication number
SG11201906380PA
SG11201906380PA SG11201906380PA SG11201906380PA SG11201906380PA SG 11201906380P A SG11201906380P A SG 11201906380PA SG 11201906380P A SG11201906380P A SG 11201906380PA SG 11201906380P A SG11201906380P A SG 11201906380PA SG 11201906380P A SG11201906380P A SG 11201906380PA
Authority
SG
Singapore
Prior art keywords
model
recognition
data
sample data
data set
Prior art date
Application number
SG11201906380PA
Inventor
Yu Cheng
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Publication of SG11201906380PA publication Critical patent/SG11201906380PA/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/50Monitoring users, programs or devices to maintain the integrity of platforms, e.g. of processors, firmware or operating systems
    • G06F21/55Detecting local intrusion or implementing counter-measures
    • G06F21/552Detecting local intrusion or implementing counter-measures involving long-term monitoring or reporting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0635Risk analysis of enterprise or organisation activities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2221/00Indexing scheme relating to security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F2221/03Indexing scheme relating to G06F21/50, monitoring users, programs or devices to maintain the integrity of platforms
    • G06F2221/034Test or assess a computer or a system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Mathematical Physics (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Computer Security & Cryptography (AREA)
  • Development Economics (AREA)
  • Fuzzy Systems (AREA)
  • Educational Administration (AREA)
  • Computational Linguistics (AREA)
  • Game Theory and Decision Science (AREA)
  • Probability & Statistics with Applications (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Computer Hardware Design (AREA)
  • Medical Informatics (AREA)
  • Computing Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)

Abstract

The present application provides data type recognition and model training methods and apparatuses, and computer devices. The model training method includes acquiring a first sample data set, and using the first sample data set to train an anomaly detection model; and detecting an abnormal sample data set from a second sample data set by means of the 5 anomaly detection model, and using the abnormal sample data set to train a classification model. By means of this embodiment, an amount of scoring events of the classification model can be reduced, and relatively balanced sample data sets can also be provided for training, to obtain the classification model with a higher accuracy. In a particular application, data to be recognized is firstly input to the anomaly detection model, and whether the data 10 to be recognized is first-type data can be quickly distinguished; and other data than the first- type data recognized by the anomaly detection model, is input to the classification model for recognition. The speed of online data recognition is relatively fast.
SG11201906380PA 2017-06-16 2018-06-13 Data type recognition, model training and risk recognition methods, apparatuses and devices SG11201906380PA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710458652.3A CN107391569B (en) 2017-06-16 2017-06-16 Data type identification, model training and risk identification method, device and equipment
PCT/CN2018/091043 WO2018228428A1 (en) 2017-06-16 2018-06-13 Data type identification, model training, and risk identification method and apparatus, and device

Publications (1)

Publication Number Publication Date
SG11201906380PA true SG11201906380PA (en) 2019-08-27

Family

ID=60333026

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11201906380PA SG11201906380PA (en) 2017-06-16 2018-06-13 Data type recognition, model training and risk recognition methods, apparatuses and devices

Country Status (7)

Country Link
US (2) US11113394B2 (en)
CN (1) CN107391569B (en)
MY (1) MY201302A (en)
PH (1) PH12019501621A1 (en)
SG (1) SG11201906380PA (en)
TW (1) TWI664535B (en)
WO (1) WO2018228428A1 (en)

Families Citing this family (57)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9069725B2 (en) 2011-08-19 2015-06-30 Hartford Steam Boiler Inspection & Insurance Company Dynamic outlier bias reduction system and method
KR102503653B1 (en) 2014-04-11 2023-02-24 하트포드 스팀 보일러 인스펙션 앤드 인슈어런스 컴퍼니 Improving Future Reliability Prediction based on System operational and performance Data Modelling
CN107391569B (en) * 2017-06-16 2020-09-15 阿里巴巴集团控股有限公司 Data type identification, model training and risk identification method, device and equipment
US10845079B1 (en) * 2017-06-28 2020-11-24 Alarm.Com Incorporated HVAC analytics
CN107944874B (en) * 2017-12-13 2021-07-20 创新先进技术有限公司 Wind control method, device and system based on transfer learning
CN108173708A (en) * 2017-12-18 2018-06-15 北京天融信网络安全技术有限公司 Anomalous traffic detection method, device and storage medium based on incremental learning
CN108346098B (en) * 2018-01-19 2022-05-31 创新先进技术有限公司 Method and device for mining wind control rule
CN108304287B (en) * 2018-01-22 2021-05-28 腾讯科技(深圳)有限公司 Disk fault detection method and device and related equipment
CN110472646B (en) * 2018-05-09 2023-02-28 富士通株式会社 Data processing apparatus, data processing method, and medium
CN109145030B (en) * 2018-06-26 2022-07-22 创新先进技术有限公司 Abnormal data access detection method and device
CN109034209B (en) * 2018-07-03 2021-07-30 创新先进技术有限公司 Training method and device for active risk real-time recognition model
US10878388B2 (en) * 2018-07-12 2020-12-29 Visionx, Llc Systems and methods for artificial-intelligence-based automated surface inspection
CN109190676B (en) * 2018-08-06 2022-11-08 百度在线网络技术(北京)有限公司 Model training method, device, equipment and storage medium for image recognition
US11636292B2 (en) 2018-09-28 2023-04-25 Hartford Steam Boiler Inspection And Insurance Company Dynamic outlier bias reduction system and method
CN109461001B (en) * 2018-10-22 2021-07-09 创新先进技术有限公司 Method and device for obtaining training sample of first model based on second model
CN110046632B (en) * 2018-11-09 2023-06-02 创新先进技术有限公司 Model training method and device
CN111275507A (en) * 2018-12-04 2020-06-12 北京嘀嘀无限科技发展有限公司 Order abnormity identification and order risk management and control method and system
CN109684118B (en) * 2018-12-10 2022-04-26 深圳前海微众银行股份有限公司 Abnormal data detection method, device, equipment and computer readable storage medium
US10977738B2 (en) * 2018-12-27 2021-04-13 Futurity Group, Inc. Systems, methods, and platforms for automated quality management and identification of errors, omissions and/or deviations in coordinating services and/or payments responsive to requests for coverage under a policy
CN109670267B (en) * 2018-12-29 2023-06-13 北京航天数据股份有限公司 Data processing method and device
CN109859029A (en) * 2019-01-04 2019-06-07 深圳壹账通智能科技有限公司 Abnormal application detection method, device, computer equipment and storage medium
CN109992578B (en) * 2019-01-07 2023-08-08 平安科技(深圳)有限公司 Anti-fraud method and device based on unsupervised learning, computer equipment and storage medium
CN109936561B (en) * 2019-01-08 2022-05-13 平安科技(深圳)有限公司 User request detection method and device, computer equipment and storage medium
CN109905362B (en) * 2019-01-08 2022-05-13 平安科技(深圳)有限公司 User request detection method and device, computer equipment and storage medium
KR20200108523A (en) * 2019-03-05 2020-09-21 주식회사 엘렉시 System and Method for Detection of Anomaly Pattern
CN110084468B (en) * 2019-03-14 2020-09-01 阿里巴巴集团控股有限公司 Risk identification method and device
CN112016579B (en) * 2019-05-30 2024-09-27 阿里巴巴集团控股有限公司 Data processing method, risk identification method, computer device, and storage medium
CN110363534B (en) * 2019-06-28 2023-11-17 创新先进技术有限公司 Method and device for identifying abnormal transaction
CN112308104A (en) * 2019-08-02 2021-02-02 杭州海康威视数字技术股份有限公司 Abnormity identification method and device and computer storage medium
US11615348B2 (en) 2019-09-18 2023-03-28 Hartford Steam Boiler Inspection And Insurance Company Computer-based systems, computing components and computing objects configured to implement dynamic outlier bias reduction in machine learning models
US11328177B2 (en) 2019-09-18 2022-05-10 Hartford Steam Boiler Inspection And Insurance Company Computer-based systems, computing components and computing objects configured to implement dynamic outlier bias reduction in machine learning models
US11288602B2 (en) 2019-09-18 2022-03-29 Hartford Steam Boiler Inspection And Insurance Company Computer-based systems, computing components and computing objects configured to implement dynamic outlier bias reduction in machine learning models
CN110782349A (en) * 2019-10-25 2020-02-11 支付宝(杭州)信息技术有限公司 Model training method and system
CN110826621A (en) * 2019-11-01 2020-02-21 北京芯盾时代科技有限公司 Risk event processing method and device
CN110995681B (en) * 2019-11-25 2022-04-22 北京奇艺世纪科技有限公司 User identification method and device, electronic equipment and storage medium
CN112861895B (en) * 2019-11-27 2023-11-03 北京京东振世信息技术有限公司 Abnormal article detection method and device
CN110941607A (en) * 2019-12-10 2020-03-31 医渡云(北京)技术有限公司 Dirty data identification method, device, equipment and storage medium
CN111126577A (en) * 2020-03-30 2020-05-08 北京精诊医疗科技有限公司 Loss function design method for unbalanced samples
CN111760292A (en) * 2020-07-07 2020-10-13 网易(杭州)网络有限公司 Method and device for detecting sampling data and electronic equipment
CN112016600A (en) * 2020-08-14 2020-12-01 中国石油大学(北京) Pipeline abnormity identification method, device and system
CN111986027A (en) * 2020-08-21 2020-11-24 腾讯科技(上海)有限公司 Abnormal transaction processing method and device based on artificial intelligence
US11687806B2 (en) 2020-11-03 2023-06-27 International Business Machines Corporation Problem solving using selected datasets of internet-of-things system
CN112541537A (en) * 2020-12-09 2021-03-23 北京捷通华声科技股份有限公司 Method and device for generating character line recognition system and electronic equipment
CN112529109A (en) * 2020-12-29 2021-03-19 四川长虹电器股份有限公司 Unsupervised multi-model-based anomaly detection method and system
US20220222570A1 (en) * 2021-01-12 2022-07-14 Optum Technology, Inc. Column classification machine learning models
CN113127858A (en) * 2021-04-19 2021-07-16 中国工商银行股份有限公司 Anomaly detection model training method, anomaly detection method and anomaly detection device
CN113521750B (en) * 2021-07-15 2023-10-24 珠海金山数字网络科技有限公司 Abnormal account detection model training method and abnormal account detection method
US11353840B1 (en) * 2021-08-04 2022-06-07 Watsco Ventures Llc Actionable alerting and diagnostic system for electromechanical devices
US11803778B2 (en) * 2021-08-04 2023-10-31 Watsco Ventures Llc Actionable alerting and diagnostic system for water metering systems
US20230186152A1 (en) * 2021-12-09 2023-06-15 Kinaxis Inc. Iterative data-driven configuration of optimization methods and systems
CN114726749B (en) * 2022-03-02 2023-10-31 阿里巴巴(中国)有限公司 Data anomaly detection model acquisition method, device, equipment and medium
CN114692892B (en) * 2022-03-23 2023-08-29 支付宝(杭州)信息技术有限公司 Method for processing numerical characteristics, model training method and device
CN114978616B (en) * 2022-05-06 2024-01-09 支付宝(杭州)信息技术有限公司 Construction method and device of risk assessment system, and risk assessment method and device
CN115118505B (en) * 2022-06-29 2023-06-09 上海众人智能科技有限公司 Behavior baseline targeting grabbing method based on intrusion data tracing
CN115277205B (en) * 2022-07-28 2024-05-14 中国电信股份有限公司 Model training method and device and port risk identification method
CN115238805B (en) * 2022-07-29 2023-12-15 中国电信股份有限公司 Training method of abnormal data recognition model and related equipment
CN118505380B (en) * 2024-07-18 2024-09-13 南京昱鑫辰信息技术有限公司 Electronic information management method and platform

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9306966B2 (en) * 2001-12-14 2016-04-05 The Trustees Of Columbia University In The City Of New York Methods of unsupervised anomaly detection using a geometric framework
TW200802018A (en) * 2006-06-01 2008-01-01 Academia Sinica Detection device and method thereof
CN101980202A (en) * 2010-11-04 2011-02-23 西安电子科技大学 Semi-supervised classification method of unbalance data
CN102176698A (en) * 2010-12-20 2011-09-07 北京邮电大学 Method for detecting abnormal behaviors of user based on transfer learning
US10599999B2 (en) * 2014-06-02 2020-03-24 Yottamine Analytics, Inc. Digital event profile filters based on cost sensitive support vector machine for fraud detection, risk rating or electronic transaction classification
US9985984B1 (en) * 2014-10-27 2018-05-29 National Technology & Engineering Solutions Of Sandia, Llc Dynamic defense and network randomization for computer systems
CA2977262A1 (en) * 2015-02-23 2016-09-01 Cellanyx Diagnostics, Llc Cell imaging and analysis to differentiate clinically relevant sub-populations of cells
CN104794192B (en) * 2015-04-17 2018-06-08 南京大学 Multistage method for detecting abnormality based on exponential smoothing, integrated study model
CN106156809A (en) * 2015-04-24 2016-11-23 阿里巴巴集团控股有限公司 For updating the method and device of disaggregated model
CN106296195A (en) * 2015-05-29 2017-01-04 阿里巴巴集团控股有限公司 A kind of Risk Identification Method and device
CN106503562A (en) * 2015-09-06 2017-03-15 阿里巴巴集团控股有限公司 A kind of Risk Identification Method and device
CN105279382B (en) * 2015-11-10 2017-12-22 成都数联易康科技有限公司 A kind of medical insurance abnormal data on-line intelligence detection method
CN106779272A (en) * 2015-11-24 2017-05-31 阿里巴巴集团控股有限公司 A kind of Risk Forecast Method and equipment
CN105760889A (en) * 2016-03-01 2016-07-13 中国科学技术大学 Efficient imbalanced data set classification method
CN107391569B (en) * 2017-06-16 2020-09-15 阿里巴巴集团控股有限公司 Data type identification, model training and risk identification method, device and equipment

Also Published As

Publication number Publication date
MY201302A (en) 2024-02-15
TWI664535B (en) 2019-07-01
CN107391569B (en) 2020-09-15
TW201905728A (en) 2019-02-01
US11100220B2 (en) 2021-08-24
CN107391569A (en) 2017-11-24
WO2018228428A1 (en) 2018-12-20
US11113394B2 (en) 2021-09-07
US20190303569A1 (en) 2019-10-03
PH12019501621A1 (en) 2020-01-20
US20200167466A1 (en) 2020-05-28

Similar Documents

Publication Publication Date Title
SG11201906380PA (en) Data type recognition, model training and risk recognition methods, apparatuses and devices
PH12019502321A1 (en) Picture-based vehicle loss assessment method and apparatus, and electronic device
MY193739A (en) Picture-based vehicle loss assessment method and apparatus, and electronic device
PH12018501058A1 (en) Order clustering and malicious information combating method and apparatus
WO2019050966A3 (en) Automated sample workflow gating and data analysis
SG11201900470SA (en) Modeling method and device for evaluation model
MY189945A (en) Statistical analytic method for the determination of the risk posed by file based content
GB201204006D0 (en) Point of interest database maintenance system
MX2017000495A (en) Touch classification.
MX2016001687A (en) Systems and methods for image classification by correlating contextual cues with images.
MX2016014071A (en) Method and apparatus for analyzing media content.
GB2517644A (en) Detecting anomalies in real-time in multiple time series data with automated thresholding
WO2015138497A3 (en) Systems and methods for rapid data analysis
WO2015160415A3 (en) Systems and methods for visual sentiment analysis
EP2781883A3 (en) Method and apparatus for optimizing timing of audio commands based on recognized audio patterns
EE201500014A (en) Method and device for impedance analysis with binary excitation
MX2016004667A (en) Template construction method and apparatus, and information recognition method and apparatus.
GB2563763A (en) Well integrity analysis using sonic measurements over depth interval
WO2012153965A3 (en) Brain-computer interface device and classification method therefor
MX340339B (en) Methods of calibration transfer for a testing instrument.
AU2017408798A1 (en) Method and device of analysis based on model, and computer readable storage medium
WO2015129934A8 (en) Apparatus and method for detecting command and control channels
PH12016500020A1 (en) Apparatus and method for providing connections to contacts based on associations with content
PH12020551136A1 (en) Diagnosis apparatus and diagnosis method
SG10201901587VA (en) Application testing