Netsec_ML

Series of utilities and training models for network security. Built in Scala for Apache Spark MLLib for learning purposes

IDS_LogReg

Info

Logistic Regression prediction model for IDS Logs.

Usage

This was trained with an updated version of the KDD dataset that everyone uses.

Server_Health

Info

Linear Regression prediction model for server health data.

Usage

This is a demo, but if given real data it yields a model that can be worked with. Currently the label is based upon the cpu load average, but this can be modified to other things, as shown below.

// Use memory utilization as the label
data.select(data("memutil")
    .as("label"),$"hour",$"load",$"netutil")

Additionally datasets can be filtered by modifying the select on the dataframe, as shown below

// Train only on data from the abstract host taken at noon
data.filter("hostname = 'abstract' AND hour = 12")
    .select(data("load")
    .as("label"),$"hour",$"memutil",$"netutil")

Utils

Log Generation

util/health-log-gen.py will generate some csv logs to train on, however this data is randomized and won't yield any worthwhile results. util/network-log-gen.py will generate some JSON logs to train on.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
models		models
src		src
util		util
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Netsec_ML

IDS_LogReg

Info

Usage

Server_Health

Info

Usage

Utils

Log Generation

About

Releases

Packages

Languages

khodges42/Netsec_ML

Folders and files

Latest commit

History

Repository files navigation

Netsec_ML

IDS_LogReg

Info

Usage

Server_Health

Info

Usage

Utils

Log Generation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages