Data provided by :https://www.kaggle.com/makarandrayate/ecommerce-uk-retailer
The Ecommerce_UK_Retailer data analysis by using Python libraries.
- Perform Basic EDA
a. Boxplot – All Numeric Variables
b. Histogram – All Numeric Variables
c. Distribution Plot – All Numeric Variables
d. Aggregation for all numerical Columns
e. Unique Values across all columns
f. Duplicate values across all columns
g. Correlation – Heatmap - All Numeric Variables
h. Regression Plot - All Numeric Variables
i. Bar Plot – Every Categorical Variable vs every Numerical Variable
j. Pair plot - All Numeric Variables
k. Line chart to show the trend of data - All Numeric/Date Variables
l. Plot the skewness - All Numeric Variables
-
Check for missing values in all columns and replace them with the appropriate metric (Mean/Median/Mode)
-
Remove duplicate rows
-
Remove rows which have negative values in Quantity column
-
Add the columns - Month, Day and Hour for the invoice
-
How many orders made by the customers?
-
TOP 5 customers with higher number of orders
-
How much money spent by the customers?
-
TOP 5 customers with highest money spent
-
How many orders per month?
-
How many orders per day?
-
How many orders per hour?
-
How many orders for each country?
-
Orders trend across months
-
How much money spent by each country?