Basiq ML Challenge

This repository contains the code that performs bank transaction classification over the training set of 100 000 observations. The code demonstrates the approach that consists of the following steps:

preprocessing the data (data_preparation.py),
building classification models (naive_bayes.py and svc.py), and
making predictions over the scorecard data with the best performing classifier (fill_scorecard.py).

The algorithms chosen are Naive Bayes and SVM (with the linear kernel); both known to perform well with text classification problems, but yet simple to train without requiring high computational time and resources (also reasons for choosing them).

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
models		models
.gitignore		.gitignore
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
data_preparation.py		data_preparation.py
feature_engineering_util.py		feature_engineering_util.py
fill_scorecard.py		fill_scorecard.py
naive_bayes.py		naive_bayes.py
result_util.py		result_util.py
svc.py		svc.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Basiq ML Challenge

About

Releases

Packages

Languages

milikicn/basiq-ml-challenge

Folders and files

Latest commit

History

Repository files navigation

Basiq ML Challenge

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages