Data Leakage

Data leakage is a serious problem in machine learning: information that would not be available at prediction time slips into training or evaluation, which usually makes model results look overly optimistic.
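The optimism described above can be reproduced on a toy problem. The following is a minimal sketch, not a method taken from the course material: it assumes a pure-noise dataset, a simple nearest-centroid classifier, and hypothetical helper names (`top_features`, `centroids`, `cv_accuracy`) chosen for illustration only. Because the features carry no signal, an honest estimate should sit near chance; selecting features with the labels of every row before cross-validating is the leak.

```python
import random


def top_features(X, y, rows, k):
    """Return the k features with the largest class-mean gap, using only `rows`."""
    pos = [i for i in rows if y[i] == 1]
    neg = [i for i in rows if y[i] == 0]

    def gap(j):
        mean_pos = sum(X[i][j] for i in pos) / len(pos)
        mean_neg = sum(X[i][j] for i in neg) / len(neg)
        return abs(mean_pos - mean_neg)

    return sorted(range(len(X[0])), key=gap, reverse=True)[:k]


def centroids(X, y, rows, feats):
    """Per-class mean vector over the chosen features, using only `rows`."""
    result = {}
    for label in (0, 1):
        members = [i for i in rows if y[i] == label]
        result[label] = [sum(X[i][j] for i in members) / len(members) for j in feats]
    return result


def cv_accuracy(X, y, k_feats, n_folds, leaky):
    """K-fold accuracy of a nearest-centroid classifier.

    leaky=True  : features are selected once, using the labels of ALL rows.
    leaky=False : features are re-selected inside each fold from training rows only.
    """
    rows = list(range(len(X)))
    global_feats = top_features(X, y, rows, k_feats)  # has seen every test row
    correct = 0
    for fold in range(n_folds):
        test = rows[fold::n_folds]
        train = [i for i in rows if i % n_folds != fold]
        feats = global_feats if leaky else top_features(X, y, train, k_feats)
        cents = centroids(X, y, train, feats)
        for i in test:
            dist = {
                label: sum((X[i][j] - c) ** 2 for j, c in zip(feats, cents[label]))
                for label in (0, 1)
            }
            correct += min(dist, key=dist.get) == y[i]
    return correct / len(X)


# Assumed toy data: 50 samples, 1000 noise features, labels unrelated to X.
rng = random.Random(0)
n_samples, n_features = 50, 1000
X = [[rng.gauss(0.0, 1.0) for _ in range(n_features)] for _ in range(n_samples)]
y = [i % 2 for i in range(n_samples)]

leaky_acc = cv_accuracy(X, y, k_feats=20, n_folds=5, leaky=True)
honest_acc = cv_accuracy(X, y, k_feats=20, n_folds=5, leaky=False)
print(f"leaky selection:  {leaky_acc:.2f}")
print(f"honest selection: {honest_acc:.2f}")
```

The only difference between the two runs is where feature selection happens. Keeping every data-dependent step inside the training portion of each fold is what keeps the estimate honest.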

Examples

Some subtle examples of data leakage:

images/dataleakage.png

University of Michigan: Coursera Data Science in Python

Types of Leakages

Data leakage can be classified into two types.

images/dataleakage2.png

University of Michigan: Coursera Data Science in Python

Detecting Leakages

images/dataleakage3.png

University of Michigan: Coursera Data Science in Python

Minimising Leakages

images/dataleakage4.png

University of Michigan: Coursera Data Science in Python