This repository contains a Jupyter Notebook that analyzes and predicts air quality in Delhi using historical data. The project leverages data visualization and preprocessing techniques to explore patterns in air pollutants and build predictive insights.
- Importing Libraries: Loading necessary Python libraries.
- EDA (Exploratory Data Analysis): Investigating the dataset structure and initial insights.
- Distribution of Dataset: Exploring data distributions.
- Formatting the Date Column: Converting and handling date values for time-series analysis.
- Visualizing PM2.5 Levels Across Weekdays and Seasons: Seasonal and weekly trends in particulate matter (PM2.5) concentrations.
- Observations: Key findings from visualizations and analysis.
- Monthly Distribution of Pollutants: Monthly trends of air pollutants.
numpy
: For numerical computations.pandas
: For data manipulation and analysis.matplotlib.pyplot
: For data visualization.seaborn
: For enhanced statistical visualizations.warnings
: To manage and suppress warnings.
- File:
delhi_aqi.csv
- Description: Historical data on air quality in Delhi, including various pollutants and timestamps.
- Clone the repository:
git clone https://github.com/B1tW1z/DelhiAQI-Analysis.git
- Install the required libraries:
pip install numpy pandas matplotlib seaborn
- Open the Jupyter Notebook:
jupyter notebook delhi-air-quality.ipynb
- Ensure the dataset
delhi_aqi.csv
is in the same directory as the notebook. - Run the cells sequentially to reproduce the analysis.
- Seasonal and weekly trends of PM2.5 levels.
- Monthly variations in air pollutant concentrations.
- Observations that can inform policy decisions and public health measures.
- Implementing machine learning models to predict air quality indices (AQI).
- Extending analysis to include meteorological data for enhanced predictions.