NFL-AI

Usage

After cloning, install the necessary packages using:

pip install -r requirements.txt

To test the neural network and feature engineering individually, run the 'nn_test.py' and 'test_feature_engineering.py' files.

To test the betting system as a whole, run the 'test_betting_system.py' file.

Introduction

The purpose of this project was to determine whether a machine learning approach to sports-betting would outperform popular alternative strategies. Our project utilizes the predictive capabilities of neural networks to assess point spreads and find potential advantageous betting situations for a particular NFL game.

Data

Historic NFL Data (https://www.kaggle.com/datasets/tobycrabtree/nfl-scores-and-betting-data)

Kaggle dataset comprised of the majority of NFL football games dating back to 1966 with 25 unique features for each game.
Games taking place prior to the 2010-2011 season were taken out.
Irrelevant and/or noisy potential indicators were removed.

Feature Engineering

Game Metrics Calculated:
- Total Points and Point Differential
- Is Home Favorite? (binary flag)
- Spread Performance & Over/Under Performance
Team Metrics Calculated:
- Average Points For/Against
- Spread Cover Rate
- Win Streak
Relative Power Ranking System Initialized

Neural Network

Input Initialization

layers - Specifies the number of neurons in each layer and the configuration of neurons.
activation - Activation function for hidden layers.
output_activation - Activation function for output layer.
learning - Network learning rate.
beta - Momentum coefficient

Component Initialization

params - Initializes weights and biases using He Initialization.
velo - Initializes velocity calculation for momentum optimization.

Caching and Storage

params - Stores weights and biases.
cache - Stores intermediate calculations.
gradient - Stores computed gradients.

Forward Propagation

Inputs 'X' are transposed onto a matrix as:

A = X.T
self.cache['A0'] = A

Input and updated weight matrix's are multiplied and bias is added:

Z = np.dot(W, A) + b

Output is ran through the chosen activation function:

A = self.activation_func(Z)

Cache the new input matrix 'A' and dot product + bias result 'Z':

self.cache['A' + str(layer)] = A
self.cache['Z' + str(layer)] = Z

Cost function is calculated using the Mean Squared Error (MSE) fromula.

Backpropagation

Computes cost gradients with respect to weights and biases

Output Layer

Cost function is computed with respect to Z (pre-activation):

dZ = self.cache['A' + str(L)] - Y

Gradient for weights and biases:

self.gradient['dW' + str(L)] = np.dot(dZ, self.cache['A' + str(L - 1)].T) / m
self.gradient['db' + str(L)] = np.sum(dZ, axis=1, keepdims=True) / m

dW = derivative of cost function with respect to weights
db = derivative of cost function with respect to bias
Gradients are averaged over batch size 'm'

Hidden Layers

Moves gradients back through the hidden layers.
Multiplies the gradient from the following layer with weight matrix.
Multiplies result with the derivative of the activation function.

dZ = np.dot(self.params['W' + str(layer + 1)].T, dZ) * self.activation_derivative(self.cache['Z' + str(layer)])

Computation of the gradients is performed as outlined in the output layer.
'dW' and 'db' are cached.

Training & Optimization

Batch size and cost value storage are initialized.
Data is shuffled at the start of each epoch (training round).
For each batch, forward propagation is performed and cost calculated:

Y_pred = self.forward_feed(X_batch)
cost = self.cost(Y_pred, Y_batch.T)

Back propagation is performed and network is updated:

self.backward_feed(Y_batch.T)
self.update_network()

Cost is logged and the network makes predictions on test data:

Y_pred = self.forward_feed(X_pred)

Transpose to original format:

return Y_pred.T

Betting System

1. Initialize Feature Engineering

processed_data = self.feature_processor.process_initial_features(game_data)

2. Generate Predictions using neural network

prediction_features = self._prepare_prediction_features(features)
raw_predictions = self.neural_network.prediction(prediction_features)

3. Find Value Bets

Calculate the potential advantage:

edge = predicted_spread - market_spread

Validate to an advantage between 4.0 and 10.0 spread differential:

if abs(edge) < 4.0 or abs(edge) > 10.0:
    return False

4. Wager Determination

Dynamically determines bet amount based on the edge and current bankroll:

base_stake = self.bankroll * 0.02
edge_multiplier = min(1 + (edge - 4) * 0.1, 2.5)

5. Evaluation and Execution

P&L (Profit and Loss) is determined for a particular bet

actual_spread = actual_score_home - actual_score_away
won_bet = actual_spread > position['market_spread']  # For home bets

Note: Payout assumes even odds (-110).
Trade is executed if wager amount is within the bounds of the bankroll.

Results

Neural Network

The neural network we engineered succeeded in accurately predicting NFL scores after sufficient training.

Cost (First Training Round): 0.51
Cost (Last Training Round): 0.17
RMSE Score: 2.52

Randomly Selected Test Results:
Predicted: -4.86, Actual: -4.50
Predicted: -2.04, Actual: -3.00
Predicted: -4.63, Actual: -2.00
Predicted: -7.35, Actual: -7.00

Cost Graphs Over Epochs:

From nn_test.py

Betting System

The overall betting system shows a substantially superior betting strategy compared to standard industry methods.

Backtesting Results:
Selected Bets: 181
Winning Bets: 145
Win Rate: 80.11%
Total P&L: $25799.73
Total Stake: $47485.33
ROI: 54.33%
Average Bet Size: $262.35

Sample Betting Opportunities:

Game: Philadelphia Eagles vs San Francisco 49ers
Market Spread: 3.5
Predicted Spread: -4.1
Edge: 7.6 points
Bet Side: away
Recommended Stake: $271.49

Game: Los Angeles Rams vs Tampa Bay Buccaneers
Market Spread: 1.0
Predicted Spread: -3.6
Edge: 4.6 points
Bet Side: away
Recommended Stake: $212.24

Game: New England Patriots vs New Orleans Saints
Market Spread: 3.0
Predicted Spread: -5.2
Edge: 8.2 points
Bet Side: away
Recommended Stake: $284.28

From test_betting_system.py

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
.idea		.idea
Media		Media
data		data
sports		sports
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NFL-AI

Usage

Introduction

Data

Neural Network

Betting System

Results

About

Contributors 4

Languages

License

blakeziegler/NFL-AI

Folders and files

Latest commit

History

Repository files navigation

NFL-AI

Usage

Introduction

Data

Neural Network

Betting System

Results

About

Topics

Resources

License

Stars

Watchers

Forks

Contributors 4

Languages