👨‍💻Classifying Cybersecurity Incidents👨‍💻

📖Project Description

This is a Data Science Project to enhance the efficiency of Security Operation Centers (SOCs) by developing a machine learning model that can accurately predict the triage grade of cybersecurity incidents. Utilizing the comprehensive dataset, the goal is to create a classification model that categorizes incidents as true positive (TP), benign positive (BP), or false positive (FP) based on historical evidence and customer responses. The model should be robust enough to support guided response systems in providing SOC analysts with precise, context-rich recommendations, ultimately improving the overall security posture of enterprise environments.

📁Data Set Overview

There are three hierarchies of data: (1) evidence, (2) alert, and (3) incident. At the bottom level, evidence supports an alert. For example, an alert may be associated with multiple pieces of evidence such as an IP address, email, and user details, each containing specific supporting metadata. Above that, we have alerts that consolidate multiple pieces of evidence to signify a potential security incident. These alerts provide a broader context by aggregating related evidences to present a more comprehensive picture of the potential threat. At the highest level, incidents encompass one or more alerts, representing a cohesive narrative of a security breach or threat scenario.

The Dataset is already divided into 2 parts, a train set containing 70% of the data and a test set with 30% containing 45 features, labels, and unique identifiers across 1M triage-annotated incidents. stratified based on triage grade ground-truth, OrgId, and DetectorId. We ensure that incidents are stratified together within the train and test sets to ensure the relevance of evidence and alert rows.

You can download the datasets from here : Datasets

🚩Approach

Checkout the Approach file

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
NoteBooks		NoteBooks
Resources		Resources
.gitattributes		.gitattributes
.gitignore		.gitignore
Approach.md		Approach.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

👨‍💻Classifying Cybersecurity Incidents👨‍💻

Table of Contents

📖Project Description

📁Data Set Overview

🚩Approach

Developed By - Avijit Jana

About

Releases

Packages

Languages

License

Avijit-Jana/Classifying_Cybersecurity_Incidents

Folders and files

Latest commit

History

Repository files navigation

👨‍💻Classifying Cybersecurity Incidents👨‍💻

Table of Contents

📖Project Description

📁Data Set Overview

🚩Approach

Developed By - Avijit Jana

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages