Cover Image

Pi Day Dataset Analysis

Onyx Data DNA March 2022 Challenge

Skills and Tools Used:

Power BI
DAX
Data Analysis
Trend Analysis
Microsoft Excel

GitHub Repo


1. Problem Statement

In celebration of Pi Day (March 14), a fictional bakery has logged their sales from the past 2.5 years and would like their data analysed in order to make better decisions based on the trends of the provided data.

2. Data

The data is from Onyx Data's DNA Challenge website, which can be accessed here.

3. Data Dictionary

The following data dictionary is a combination of that provided by Onyx Data as well as any columns added to the loaded data prior to any pre-processing was done. Those highlighted in red are created in Excel, expanding on existing data columns.

Column Name Description Data Type
Order NO Order number of Pie Bakery orders String
Order Date Date order was placed Date
Month/Yr Month & Year abbreviation of the Date of order Date
Order Day Day of the month the order was placed Integer
Order Month (Text) The Month the order was placed String
Order Year The Year the order was placed Integer
Day of Week Day of the week the order was placed String
Pie Flavor The flavor of the pie(s) ordered String
Quantity Quantity of pies ordered Integer
Cost Total cost of pies ordered Float
Slice Or Whole Pie Slice or Whole Pie ordered String
Pre-Order/In-Store Purchase Denotes if the order was made in store or pre-ordered String
Organic? Organic if the pie filling ingredients were certified organic String

4. Pre-Processing

Although the data provided is generally clean and organised, some pre-processing is still done. Firstly, the Day of the Week is actually incorrect, and as a result is redone in DAX to ensure the correct day is used. The following DAX command is used:

Day of the Week = FORMAT(Data[Order Date], "dddd")

Further, to ensure that the days are displayed in the correct order and not alphabetically, we associate each day to a number corresponding, for example by default Sunday is 0, Monday is 1, up until Saturday which is 6. Personally, I prefer Monday to be 1, and Sunday to be 7. In order to create that association, the numbers generated per day is executed as follows:

DayNum = WEEKDAY(Data[Order Date],2)

From there, the Day of the Week is sorted by DayNum. The dashboard can now be created from there.

5. Dashboard

The dashboard was built in Power BI and can be downloaded here.

A screenshot of the dashboard is seen below:

Dashboard

6. Final Analysis

It can be suggested that going forward Pi Day Pie Palace could use this data to make crucial business decisions. For instance, ensuring a larger supply of ingredients during April and November could allow the business to make more pies, and if the ingredients are organic in nature can increase the number of organic pies sold. Furthermore, marketing can increase sales through specific festive marketing campaigns.

Since Apple and Strawberry Rhubarb are the best selling flavours it can be suggested to ensure a consistent supply of these flavoured ingredients to keep the best sellers in stock and best in selling.

Pi Day Pie Palace is on track to sell more in 2022, and based on the trends from the previous two years, it can be suggested that Strawberry Rhubarb, Pumpkin, and Apple pies will be the biggest contributors to PDPP increasing their sales, especially if they are able to market it correctly.

Since there is a clear preference for Organic pies, finding a supplier to supply on organic ingredients will allow PDPP to migrate towards selling only organic pies.