Skip to content

melirossi/tech-sales-report

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

10 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

InProgress LastCommit Size

πŸ–±οΈ π™π™šπ™˜π™ π™¨π™–π™‘π™šπ™¨ π™§π™šπ™₯𝙀𝙧𝙩

For this project, I combined three datasets with information about sellers, articles, and orders of one-month trade log, to answer some questions. My analytical process involved using Pandas for exploratory data analysis, NumPy for analysis of specific columns, and Matplotlib/Seaborn for visualization of results.

First, I scoped and collected the necessary data, followed by data exploration and preparation. Next, I defined the model and pipelines necessary to achieve the desired results. Using these tools, I was able to answer the questions and draw appropriate conclusions.


dashboard dashboard

LinkProyecto

The sources of data used in this project were:

βœ”οΈ articles.db: DB with articles data

βœ”οΈ sellers.xlsx: Excel file with sellers data

βœ”οΈ orders.csv: CSV file with sales records withing the month

The client requested the following information to be answered:

βœ”οΈ What is the best-selling item? (in units)

βœ”οΈ Which item provided us with the highest revenue?

βœ”οΈ Which seller should be awarded the β€œBest Seller of the Month” bonus?

βœ”οΈ Are there significant variations in sales throughout the month?

βœ”οΈ What were the top 5 countries in terms of purchases, and what was the total amount of their purchases?

βœ”οΈ Notebooks or CPUs? Which did the top 5 purchasing countries buy more of?

Development:

βœ”οΈ To begin with, I was tasked with collecting and organizing data from various sources, including CSV and Excel files, as well as a database. To accomplish this, I utilized a range of Python libraries, such as Pandas, SQLite3, and openpyxl. During the exploratory analysis, I assessed the data for various characteristics, such as the number of columns and entries, null values, data types, and unique indexes, and subsequently prepared the data for analysis. To proceed, I merged all the data frames into a single one for ease of use.

βœ”οΈ In the analytical section, I was tasked with answering four questions using both analytical and graphical approaches. Additionally, I had to formulate three new questions and provide responses to them.

βœ”οΈ Finally, I presented the conclusions and recommendations of the project in a clear and concise manner.

Resources used:

Python

I used Python and its libraries Pandas, NumPy, and Matplotlib/Seaborn for data analysis and visualization

I'd love to hear from you!

If you have any question, comments, or suggestions, do not hesitate to contact me (melisa.s.rossi@gmail.com).

Gmail Linkedin