Skip to content

Latest commit

 

History

History

docker

Docker support for testing zeek-kafka

These scripts and containers provide support for building and testing Zeek and the zeek-kafka package using a number of Docker containers. The use of these scripts and containers allow an easier, automated workflow for testing new features, fixes, or regressions than before. One of the goals is for this to be extensible, such that new scripts can be introduced and run as well. This will allow, for example, one or more testing scripts to be added to a pull request, and subsequently to a test suite.

Directories

├── containers
│   └── zeek
│   └── kafka
│   └── zookeeper
├── data
├── in_docker_scripts
├── scripts
└── test_output
  • containers: The parent of all of the containers that this project defines. We use several containers, not all of them ours.
    • zeek: The directory for our zeek container, used for building zeek, the librdkafka, and our plugin, as well as running zeek.
    • kafka: The directory for our kafka container.
    • zookeeper: The directory for our zookeeper container.
  • data: The default path for pcap data to be used in tests.
  • in_docker_scripts: This directory is mapped to the zeek docker container as /root/built_in_scripts. These represent the library of scripts we provide to be run in the docker container.
  • scripts: These are the scripts that are run on the host for creating the docker bits, running containers, running or executing commands against containers ( such as executing one of the built_in_scripts ), and cleaning up resources.
  • test_output: Directory where the zeek logs and kafka logs per test/pcap are stored.

Scripts that execute in the docker container

├── build_plugin.sh
├── configure_plugin.sh
├── process_data_file.sh
  • build_plugin.sh: Runs zkg to build and install the provided version of the plugin.
  • configure_plugin.sh: Configures the plugin for the kafka container, and routes all traffic types.
    Parameters
    --kafka-topic                  [OPTIONAL] The kafka topic to configure. Default: zeek"
  • process_data_file.sh: Runs zeek -r on the passed file

Scripts executed on the host to setup and interact with the docker containers

├── analyze_results.sh
├── docker_execute_build_plugin.sh
├── docker_execute_configure_plugin.sh
├── docker_execute_create_topic_in_kafka.sh
├── docker_execute_process_data_file.sh
├── docker_execute_shell.sh
├── docker_run_consume_kafka.sh
├── docker_run_get_offset_kafka.sh
├── download_sample_pcaps.sh
├── print_results.sh
├── split_kakfa_output_by_log.sh
  • analyze_results.sh: Analyzes the results.csv files for any issues

    Parameters
    --test-directory               [REQUIRED] The directory for the tests
  • docker_execute_build_plugin.sh: Executes build_plugin.sh in the zeek container

    Parameters
     --container-name              [OPTIONAL] The Docker container name. Default: zeek-kafka_zeek_1
  • docker_execute_configure_plugin.sh: Executes configure_plugin.sh in the zeek container

    Parameters
    --container-name               [OPTIONAL] The Docker container name. Default: zeek-kafka_zeek_1
  • docker_execute_create_topic_in_kafka.sh: Creates the specified kafka topic in the kafka container

    Parameters
    --container-name               [OPTIONAL] The Docker container name. Default: zeek-kafka_kafka-1_1
    --kafka-topic                  [OPTIONAL] The kafka topic to create. Default: zeek
    --partitions                   [OPTIONAL] The number of kafka partitions to create. Default: 2
  • docker_execute_process_data_file.sh: Executes process_data_file.sh in the zeek container

    Parameters
    --container-name              [OPTIONAL] The Docker container name. Default: zeek-kafka_zeek_1
  • docker_execute_shell.sh: docker execute -i -t bash to get a shell in a given container

    Parameters
    --container-name               [OPTIONAL] The Docker container name. Default: zeek-kafka_zeek_1
  • docker_run_consume_kafka.sh: Runs an instance of the kafka container, with the console consumer kafka-console-consumer.sh --topic $KAFKA_TOPIC --offset $OFFSET --partition $PARTITION --bootstrap-server kafka-1:9092

    Parameters
    --network-name                 [OPTIONAL] The Docker network name. Default: zeek-kafka_default
    --offset                       [OPTIONAL] The kafka offset to read from. Default: 0
    --partition                    [OPTIONAL] The kafka partition to read from. Default: 0
    --kafka-topic                  [OPTIONAL] The kafka topic to consume from. Default: zeek
  • docker_run_get_offset_kafka.sh: Runs an instance of the kafka container and gets the current offset for the specified topic

    Parameters
    --network-name                 [OPTIONAL] The Docker network name. Default: zeek-kafka_default
    --kafka-topic                  [OPTIONAL] The kafka topic to get the offset from. Default: zeek
  • download_sample_pcaps.sh: Downloads the sample pcaps to a specified directory. If they exist, it is a no-op

    The sample pcaps are:

    Parameters
    --data-path                    [REQUIRED] The pcap data path
  • print_results.sh: Prints the results.csv for all the pcaps processed in the given directory to console

    Parameters
    --test-directory               [REQUIRED] The directory for the tests
  • split_kafka_output_by_log.sh: For a pcap result directory, will create a LOG.kafka.log for each LOG.log's entry in the kafka-output.log

    Parameters
    --log-directory                [REQUIRED] The directory with the logs

The example end to end test script

run_end_to_end.sh is provided as an example of a testing script. Specific or extended scripts can be created similar to this script to use the containers. This script does the following:

  1. Runs docker compose
  2. Creates the specified topic with the specified number of partitions
  3. Downloads sample PCAP data
  4. Runs the zeek container in the background
  5. Builds the zeek plugin
  6. Configures the zeek plugin
  7. Runs zeek against all the pcap data, one at a time
  8. Executes a kafka client to read the data from zeek for each pcap file
  9. Stores the output kafka messages and the zeek logs into the test_output directory
    >tree Tue_Jan__8_21_54_10_EST_2019
    Tue_Jan__8_21_54_10_EST_2019
    ├── exercise-traffic_pcap
    │   ├── capture_loss.log
    │   ├── conn.log
    │   ├── dhcp.log
    │   ├── dns.log
    │   ├── files.log
    │   ├── http.log
    │   ├── kafka-output.log
    │   ├── known_certs.log
    │   ├── loaded_scripts.log
    │   ├── notice.log
    │   ├── packet_filter.log
    │   ├── reporter.log
    │   ├── smtp.log
    │   ├── software.log
    │   ├── ssl.log
    │   ├── stats.log
    │   ├── weird.log
    │   └── x509.log
    ├── ftp_pcap
    │   ├── capture_loss.log
    │   ├── conn.log
    │   ├── files.log
    │   ├── ftp.log
    │   ├── kafka-output.log
    │   ├── loaded_scripts.log
    │   ├── packet_filter.log
    │   ├── reporter.log
    │   ├── software.log
    │   └── stats.log
  10. Creates a results.csv for each pcap that has the line counts of the kafka and the zeek output for each log
  11. Prints all the results.csv to the screen

As we can see, the output is a folder named for the test run time, with a sub folder per pcap, containing all the zeek logs and the kafka_output.log.

At this point the containers are up and running in the background.

Other scripts may then be used to do your testing, for example running:

./scripts/docker_execute_shell.sh

NOTE: If the scripts are run repeatedly, and there is no change in zeek or the librdkafka, the line ./run_end_to_end.sh can be replaced by ./run_end_to_end.sh --skip-docker-build, which uses the --skip-docker-build flag to not rebuild the containers, saving the significant time of rebuilding zeek and librdkafka.

NOTE: After you are done, you must call the finish_end_to_end.sh script to cleanup.

run_end_to_end.sh
Parameters
--data-path                     [OPTIONAL] The pcap data path. Default: ./data
--kafka-topic                   [OPTIONAL] The kafka topic to consume from. Default: zeek
--no-pcap                       [OPTIONAL] Do not run pcaps.
--partitions                    [OPTIONAL] The number of kafka partitions to create. Default: 2
--plugin-version                [OPTIONAL] The plugin version. Default: the current branch name
--skip-docker-build             [OPTIONAL] Skip build of zeek docker machine.
--test-output                   [OPTIONAL] The test output path. Default: ./test_output/DATETIME
--zeek-kafka-os                 [OPTIONAL] The OS to run zeek and zeek-kafka in. Default: ubuntu

NOTE: The provided --plugin-version is passed to the zkg install command within the container, which allows you to specify a version tag, branch name, or commit hash. However, that tag, branch, or commit must be available in the currently checked out plugin repository.

finish_end_to_end.sh
Parameters
--zeek-kafka-os     [OPTIONAL] The OS to run zeek and zeek-kafka in. Default: ubuntu

NOTE: Currently, the OS definition for finish_end_to_end.sh does not need to be specified to work, regardless of which zeek-kafka OS was created with run_end_to_end.sh.