This project analyzes sales transaction data from a bike retail business spanning 2011-2014. The analysis focuses on customer segmentation, product performance, and operational efficiency to identify revenue drivers and potential business risks.
| Metric | Value |
|---|---|
| Sales Transactions | 60,379 |
| Unique Customers | 18,482 |
| Products Analyzed | 130 |
| Time Period | 2011-2014 (3+ years) |
| Invalid Records Excluded | 19 |
Top 10% of customers generated 40% of total revenue
This indicates significant dependency on a small customer base, presenting both an opportunity (focus on high-value retention) and a risk (vulnerability to customer churn).
Bikes category contributed 96% of revenue with an average order value of $1,061
Strong product-market fit but highlights potential over-reliance on a single category, suggesting need for product diversification.
7-day average shipping time with 100% on-time delivery rate
Demonstrates efficient logistics operations and strong fulfillment capabilities.
Based on the analysis, the following strategic actions are recommended:
-
Customer Retention Program
Implement targeted retention strategies for top 10% high-value customers to protect 40% of revenue -
Customer Acquisition Strategy
Diversify customer base to reduce concentration risk and dependency on small customer segment -
Product Diversification
Expand beyond Bikes category to reduce 96% revenue dependency on single product line -
Leverage Operational Strength
Maintain and market 100% on-time delivery as competitive advantage in customer acquisition
- Multi-table Joins: Connected fact and dimension tables (
fact_sales,dim_customers,dim_products) - Common Table Expressions (CTEs): Structured complex queries for customer revenue aggregation
- Window Functions: Used
RANKandPERCENT_RANKfor customer segmentation analysis - Aggregations:
SUM,AVG,COUNTwithGROUP BYfor performance metrics - Data Quality Handling: Excluded 19 invalid records with missing order dates to ensure analysis accuracy
fact_sales - Transaction-level data
order_number,product_key,customer_keyorder_date,shipping_date,due_datesales_amount,quantity,price
dim_customers - Customer demographics
customer_key,customer_id,customer_numberfirst_name,last_name,countrygender,birthdate,marital_status
dim_products - Product hierarchy
product_key,product_id,product_namecategory,subcategory,product_linecost,maintenance,start_date
For questions or collaboration opportunities, feel free to reach out!
⭐ If you found this project useful, please consider giving it a star!