can-ids

This repository contains implementations of X-CANIDS by Jeong et al. [3]. Reading the paper before using the repository is recommended, as is reading my master's thesis, in the context of which this code was written.

Structure

The repository is split into three subparts: x-canids, x-canids-bytes, and x-mvbids.

The first one contains the implementation of X-CANIDS for signal-translated CAN datasets. It is specialized to the datasets SynCAN by Hanselmann et al. [2] and ROAD by Verma et al. [1], but it can easily be adapted to other datasets. The second one contains an adaptation to the byte-based datasets of ROAD. The third one contains an adaptation of the method to MVB data. The MVB data used is proprietary and cannot be provided.

All three types of IDS follow the same general pattern of execution:

extract_constant_signals.py -> identifies constant signals
min-max.py -> determine ranges of each monitored signal
merge-min-max.py -> generate common ranges for multiple files by merging the min-max-files
preprocessing_labeled.py -> applies the feature extraction of X-CANIDS and includes labels for testing
preprocessing_unlabeled.py -> applies the feature extraction of X-CANIDS and does not include labels
train_val_test_split.py -> applies a train/validation/test split to the extracted features
train.py -> trains the model
threshold.py -> determines the thresholds for intrusion detection based on validation and training data
evaluate.py -> uses the model and thresholds to run the evaluation on the attack and benign test data
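
The two range steps (min-max.py and merge-min-max.py) can be illustrated with a minimal sketch. This is not the repository's code: the function names `signal_ranges` and `merge_ranges`, the signal names, and the in-memory dictionary layout are assumptions chosen for illustration only.

```python
from typing import Dict, Iterable, Tuple

# Hypothetical layout: signal name -> (observed minimum, observed maximum)
Ranges = Dict[str, Tuple[float, float]]


def signal_ranges(rows: Iterable[Dict[str, float]]) -> Ranges:
    """Return the (min, max) observed for every signal in one file."""
    ranges: Ranges = {}
    for row in rows:
        for name, value in row.items():
            lo, hi = ranges.get(name, (value, value))
            ranges[name] = (min(lo, value), max(hi, value))
    return ranges


def merge_ranges(per_file: Iterable[Ranges]) -> Ranges:
    """Merge per-file ranges into one common range per signal."""
    merged: Ranges = {}
    for ranges in per_file:
        for name, (lo, hi) in ranges.items():
            cur_lo, cur_hi = merged.get(name, (lo, hi))
            merged[name] = (min(cur_lo, lo), max(cur_hi, hi))
    return merged


# Two made-up files with two made-up signals each.
file_a = [{"speed": 10.0, "rpm": 900.0}, {"speed": 42.0, "rpm": 2500.0}]
file_b = [{"speed": 3.0, "rpm": 1200.0}]
common = merge_ranges([signal_ranges(file_a), signal_ranges(file_b)])
print(common)
```

Merging per-file ranges rather than rescanning all files keeps the second step cheap when further training files are added later.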

Extra files:

apply_windowing.py -> can be used to apply windowing on a standalone tfrecord file with s-vectors
explanation.py -> can be used to visualize the loss of the first attack sample classified as positive and can be adjusted for further use
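
As a rough illustration of what windowing over s-vectors means, the sketch below groups consecutive vectors into overlapping windows. It works on plain Python lists instead of a tfrecord file, and the function signature, the `stride` parameter, and the example values are assumptions, not the interface of apply_windowing.py.

```python
from typing import List, Sequence


def apply_windowing(
    s_vectors: Sequence[Sequence[float]], window: int, stride: int = 1
) -> List[List[List[float]]]:
    """Group consecutive s-vectors into windows of length `window`.

    Each window starts `stride` steps after the previous one. If fewer
    than `window` vectors are available, the result is empty.
    """
    if window <= 0 or stride <= 0:
        raise ValueError("window and stride must be positive")
    last_start = len(s_vectors) - window
    return [
        [list(vec) for vec in s_vectors[start:start + window]]
        for start in range(0, last_start + 1, stride)
    ]


# Five time steps with two signals each (made-up values).
s_vectors = [[0.0, 0.1], [0.2, 0.3], [0.4, 0.5], [0.6, 0.7], [0.8, 0.9]]
windows = apply_windowing(s_vectors, window=3)
print(len(windows), "windows of", len(windows[0]), "s-vectors each")
```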

Setup

Each of the subfolders contains a Dockerfile for building a Docker image with TensorFlow and the other required dependencies. It is recommended to run a container based on this image and to mount a folder for providing datasets and exporting the training results. A port mapping can be used to publish TensorBoard. The NVIDIA container runtime is required:

cd can-ids/unsupervised/x-canids
docker build . -t can-ids-unsupervised
docker run -it --publish 6006:6006 --name can-ids-unsupervised-training --gpus all --mount type=bind,src="$(pwd)"/datasets,dst=/ids/Data can-ids-unsupervised bash

Usage

The usage of each script is documented directly in the respective Python script.

Additional notes

A main difference between the evaluation in the literature and my evaluation is that the ROAD dataset does not provide the signal ranges in the DBC file. Thus, the ranges must be determined beforehand from the training datasets.
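
To show why the ranges matter, here is a minimal sketch of min-max scaling with ranges taken from training data. The function name `scale_signal`, the clipping of out-of-range values, and the handling of constant signals are assumptions made for illustration and may differ from the repository's preprocessing.

```python
from typing import Dict, Tuple


def scale_signal(value: float, lo: float, hi: float) -> float:
    """Min-max scale a value to [0, 1] using a range derived from training data."""
    if hi == lo:
        # Assumption: a constant signal carries no information, so map it to 0.0.
        return 0.0
    scaled = (value - lo) / (hi - lo)
    # Assumption: values outside the training range are clipped to [0, 1].
    return min(1.0, max(0.0, scaled))


# Hypothetical range for one signal, as it might come out of the merged min-max step.
train_ranges: Dict[str, Tuple[float, float]] = {"speed": (0.0, 50.0)}
lo, hi = train_ranges["speed"]
in_range = scale_signal(25.0, lo, hi)
out_of_range = scale_signal(80.0, lo, hi)
print(in_range, out_of_range)
```

Because the ranges come from observed data rather than from a specification, test or attack traffic can exceed them, which is why the sketch has to decide what to do with out-of-range values.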

References

[1]: Verma, Miki E., et al. "Addressing the lack of comparability & testing in CAN intrusion detection research: A comprehensive guide to CAN IDS data & introduction of the ROAD dataset." arXiv preprint arXiv:2012.14600 (2020).

[2]: Hanselmann, Markus, et al. "CANet: An unsupervised intrusion detection system for high dimensional CAN bus data." IEEE Access 8 (2020): 58194-58205.

[3]: Jeong, Seonghoon, et al. "X-CANIDS: Signal-Aware Explainable Intrusion Detection System for Controller Area Network-Based In-Vehicle Network." arXiv preprint arXiv:2303.12278 (2023).
