Caching Strategies in Distributed Systems - A Comparative Study (LRU, LFU, ARC)

Introduction

This project demonstrates the implementation of various caching algorithms using FastAPI, orchestrated with Docker and balanced using Nginx as a load balancer. It is designed to benchmark the performance of different caching strategies under simulated load conditions.

Project Overview

The core of this project is to explore and compare three different caching mechanisms:

Least Recently Used (LRU)
- The LRU algorithm is a cache eviction policy that removes the least recently used items first. It is based on the idea that items that have been accessed recently are more likely to be accessed again in the near future.
Least Frequently Used (LFU)
- The LFU algorithm is a cache eviction policy that removes the least frequently used items first. It is based on the idea that items that have been accessed frequently in the past are more likely to be accessed frequently in the future.
Adaptive Replacement Cache (ARC)
- The ARC algorithm is a hybrid cache eviction policy that combines the LRU and LFU algorithms. It dynamically adjusts the cache size based on the access patterns of the items in the cache.

These algorithms are implemented in a FastAPI environment, with scalability tested via load balancing managed by Nginx. Load testing is conducted using Locust to simulate traffic and measure the performance impact of each caching strategy.

Setup

Prerequisites

Getting Started

Clone the repository:

git clone https://github.com/mariamills/FastAPI-Cache-Comparison.git
cd FastAPI-Cache-Comparison

Build and run the Docker containers:

docker-compose up --build

How to Use

Access the API: The API is accessible via http://localhost:80 after Docker Compose has started the services.
API Endpoints:
- /{cache}/{key}: Get the value of a key from the specified cache (lru, lfu, arc).
  - Example: http://localhost:80/lru/1

Load Testing

Open a new terminal window and navigate to the load_tests directory.
Run the Locust load testing tool:

cd load_tests
locust

Open a web browser and navigate to http://localhost:8089 to access the Locust dashboard.
Configure the number of users and spawn rate to simulate traffic on the API.
Run the load test and observe the performance of the caching strategies.

Metrics Logged

Response Time: The time taken to process a request from the API.
Cache Hit Rate: The percentage of requests that were served from the cache.
Cache Miss Rate: The percentage of requests that were not found in the cache and had to be fetched from the database.
Cache Size: The number of items stored in the cache at any given time.
Cache Hit Count: The number of requests that were served from the cache.
Cache Miss Count: The number of requests that were not found in the cache and had to be fetched from the database.
Total Requests: The total number of requests made to the API.
Total Requests Served: The total number of requests that were successfully served by the API.
Total Requests Failed: The total number of requests that failed to be served by the API.

Tooling and Libraries

FastAPI: For the web framework and API.
Nginx: Used as a reverse proxy and load balancer.
Docker: Containerization and orchestration.
Locust: For load testing and performance measurement.
Python Libraries: collections, json, logging for caching logic and event logging.

Findings

TODO

Conclusion

This project provides a comprehensive setup for testing and comparing caching algorithms in a simulated distributed system environment. By using FastAPI, Docker, Nginx, and Locust, we can evaluate the performance of different caching strategies under varying load conditions. The results of the load tests can help us understand the strengths and weaknesses of each caching algorithm.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.idea		.idea
algorithms		algorithms
analysis		analysis
load_tests		load_tests
logs		logs
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
data.json		data.json
docker-compose.yml		docker-compose.yml
log.py		log.py
main.py		main.py
nginx.conf		nginx.conf
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Caching Strategies in Distributed Systems - A Comparative Study (LRU, LFU, ARC)

Introduction

Project Overview

Setup

Prerequisites

Getting Started

How to Use

Load Testing

Metrics Logged

Tooling and Libraries

Findings

Conclusion

About

Releases

Packages

Languages

mariamills/FastAPI-Cache-Comparison

Folders and files

Latest commit

History

Repository files navigation

Caching Strategies in Distributed Systems - A Comparative Study (LRU, LFU, ARC)

Introduction

Project Overview

Setup

Prerequisites

Getting Started

How to Use

Load Testing

Metrics Logged

Tooling and Libraries

Findings

Conclusion

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages