Customer-Segmentation-Using-RFM-Analysis-in-E-Commerce

Analyze e-commerce transactional data to identify high-value customers, detect trends, and optimize targeted marketing through effective customer segmentation Using Power Bi dashboard and python.

Dataset Overview

Source: [Kaggle - E-commerce Dataset]

Size: (541909, 8)

Timeframe: 01/12/2010 to 09/12/2011 Key Variables: InvoiceNo: Unique transaction ID StockCode: Unique product identifier Description: Product description Quantity: Number of items purchased InvoiceDate: Timestamp of purchase UnitPrice: Price per unit in GBP CustomerID: Unique customer identifier Country: Location of the customer

Data Cleaning and Preparation

1.Handled Missing Values - The percentage of missing values in the CustomerID column is 24.93%. Since the analysis will revolve around investigating customers and clustering them into categories, the missing values in the CustomerIDs were removed.

2.Removed Duplicates - The number of duplicate rows in the dataset is 5525. These rows were removed from the dataset.

3.Removed Cancelled Orders - There are 8872 rows for which the quantity is negative which can be either due to data-entry errors or return orders or cancelled orders. If we look at the InvoiceNo for all these cases, they start with the letter ‘C’ which indicates they are cancelled orders. Thus these rows were removed from the dataset

4.Removing non-product Stock-Codes - There are certain StockCodes which do not belong to any products. All the rows containing such StockCodes were removed.

Preto Principle - Roughly 80% of outcomes stem from 20% of causes

26% -- customers contribute to 80% of the revenue 21% -- products contribute to 80% of the revenue

RFM Analysis and Customer Segmentation

What is RFM? Recency (R): Days since last purchase Frequency (F): Number of purchases Monetary (M): Total spending

Customers are segmented into five equal buckets based on Recency, Frequency, and Monetary values. Each customer is ranked for each metric, assigned a score from 1 to 5, and their scores are summed to derive an overall RFM score for analysis customer segments based on rfm score

Recommendations based on customers

1- At-risk customers 2- High value customers 3- loyal customers 4- Dormant customers

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Ecommerce_Dashboard.pbix		Ecommerce_Dashboard.pbix
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Customer-Segmentation-Using-RFM-Analysis-in-E-Commerce

Dataset Overview

Data Cleaning and Preparation

Preto Principle - Roughly 80% of outcomes stem from 20% of causes

RFM Analysis and Customer Segmentation

Recommendations based on customers

About

Uh oh!

Releases

Packages

codewithsanaa/Customer-Segmentation-Using-RFM-Analysis-in-E-Commerce

Folders and files

Latest commit

History

Repository files navigation

Customer-Segmentation-Using-RFM-Analysis-in-E-Commerce

Dataset Overview

Data Cleaning and Preparation

Preto Principle - Roughly 80% of outcomes stem from 20% of causes

RFM Analysis and Customer Segmentation

Recommendations based on customers

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages