Building on the implementation of DeepLog, we propose TransLog, which introduces a Transformer to improve detection performance.
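The exact architecture is defined in models/model_collects.py. As a rough illustration only, the sketch below shows how a Transformer encoder can replace DeepLog's LSTM for next-log-key prediction; the class name `TransLogSketch` and all hyperparameters here are our assumptions, not the repository's actual code.

```python
import torch
import torch.nn as nn

class TransLogSketch(nn.Module):
    """Minimal sketch of a Transformer encoder over log-key sequences.

    Hypothetical stand-in for the model in models/model_collects.py:
    it predicts the next log key from a window of preceding keys, the
    same objective DeepLog optimizes with an LSTM.
    """

    def __init__(self, num_keys=32, d_model=64, nhead=4, num_layers=2):
        super().__init__()
        self.embed = nn.Embedding(num_keys, d_model)
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=nhead, batch_first=True
        )
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=num_layers)
        self.head = nn.Linear(d_model, num_keys)  # logits over next log key

    def forward(self, x):            # x: (batch, window) of log-key ids
        h = self.encoder(self.embed(x))
        return self.head(h[:, -1])   # predict the key following the window
```

The appeal of self-attention here is that every position in the window attends to every other log key directly, rather than through a recurrent state.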
We used the HDFS dataset in this work. It was generated on a Hadoop cluster with 46 cores across five machines by running MapReduce jobs on more than 200 Amazon EC2 nodes, and was labeled by Hadoop domain experts through manual rules that identify anomalies. The dataset contains 11,175,629 log messages in total, of which 16,838 log blocks (2.93%) are labeled as anomalous. The logs were collected over 38.7 hours and amount to 1.47 GB of uncompressed data.
In our experiments, given the limited computational power of the training platform, we used only the HDFS_2K dataset, a 2,000-line sample drawn from the raw HDFS data.
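Note that HDFS anomaly labels are assigned per block rather than per line, so the raw lines must first be grouped into block-level sessions by their `blk_` identifier. Below is a minimal sketch of that grouping; `group_by_block` and the file name `HDFS_2k.log` are hypothetical names, not ones taken from this repository.

```python
import re
from collections import defaultdict

def group_by_block(log_lines):
    """Group raw HDFS log lines into per-block sessions.

    HDFS anomaly labels apply to whole blocks, so each line is
    bucketed by the first 'blk_' identifier it mentions.
    """
    sessions = defaultdict(list)
    blk_pattern = re.compile(r"(blk_-?\d+)")  # block ids may be negative
    for line in log_lines:
        match = blk_pattern.search(line)
        if match:
            sessions[match.group(1)].append(line.strip())
    return sessions

# Hypothetical usage; 'HDFS_2k.log' is an assumed file name.
with open("HDFS_2k.log") as f:
    sessions = group_by_block(f)
print(f"{len(sessions)} blocks found")
```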
- Create a virtual environment from the provided configuration
conda env create -f py36.yaml
- Activate virtual environment
conda activate py36
- Execute parse_log.py to parse the raw log file (see the parsing sketch after this list)
python parse_log.py
- (This step can be skipped.) Execute train.py to train the model. The model architecture is defined in models/model_collects.py, and you can modify it to train custom models (see the training-loop sketch after this list).
python train.py
- Experiments are conducted in work.ipynb, including training of the comparison networks and TransLog variants, performance evaluation, visualization, etc.
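As referenced in the parsing step above, parse_log.py turns free-text log lines into sequences of log keys (template IDs). The repository's actual parser is not reproduced here; the sketch below conveys the idea with naive regex masking, and every name in it (`to_template`, `lines_to_keys`) is hypothetical. Dedicated log parsers such as Drain recover templates far more robustly than this.

```python
import re

def to_template(line):
    """Mask variable fields (block ids, IPs, numbers) so that lines
    sharing a template collapse to the same string."""
    line = re.sub(r"blk_-?\d+", "<BLK>", line)
    line = re.sub(r"\d+\.\d+\.\d+\.\d+(:\d+)?", "<IP>", line)
    line = re.sub(r"\d+", "<NUM>", line)
    return line

def lines_to_keys(lines):
    """Map each line's template to a stable integer log key."""
    key_of = {}   # template string -> integer key
    keys = []
    for line in lines:
        tpl = to_template(line.strip())
        keys.append(key_of.setdefault(tpl, len(key_of)))
    return keys, key_of
```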
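And for the training step above, here is a minimal sketch of a DeepLog-style training loop, reusing the hypothetical `TransLogSketch` model from the earlier sketch: fixed windows of log keys are trained to predict the key that follows, with cross-entropy loss. `make_windows`, the window size, and all hyperparameters are assumptions, not the settings used in train.py.

```python
import torch
import torch.nn as nn

def make_windows(keys, window=10):
    """Slide a fixed window over a key sequence; each window is paired
    with the key that immediately follows it (DeepLog-style target)."""
    xs, ys = [], []
    for i in range(len(keys) - window):
        xs.append(keys[i:i + window])
        ys.append(keys[i + window])
    return torch.tensor(xs), torch.tensor(ys)

# Hypothetical loop around the TransLogSketch model sketched earlier,
# on random stand-in data rather than real parsed HDFS keys.
model = TransLogSketch(num_keys=32)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

x, y = make_windows(torch.randint(0, 32, (500,)).tolist())
for epoch in range(5):
    opt.zero_grad()
    loss = loss_fn(model(x), y)   # next-key prediction loss
    loss.backward()
    opt.step()
    print(f"epoch {epoch}: loss {loss.item():.3f}")
```

At detection time, DeepLog-style models flag a window as anomalous when the actual next key is not among the model's top-g predicted candidates.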