This project covers the entire customer churn prediction process, including data preprocessing, feature selection, model development, evaluation, and actionable insights. It enables businesses to lower churn rates and improve customer satisfaction.
- members :
- Abhinaba Sarkar
- Dipanjan Mahata
- Uttam Soren
- Arnab Pal
- Himanshu Shekhar Mete
data link: HERE !
- cd into streamlit-files dir
pip install streamlitinstall the streamlit firststreamlit run main.pyrun this onestreamlit run deploy.pyavoid running this one
- Introduction
- Data Preprocessing
- Feature Selection
- Model Building
- Model Evaluation
- Business Recommendations
- Conclusion
The primary objective of this project is to predict customer churn for a telecommunications company. Churn prediction is vital for businesses to identify customers who are likely to leave and take proactive measures to retain them.
Data preprocessing is crucial for creating accurate and reliable predictive models.
- Analyzed data distribution, statistics, and identified potential outliers.
- Explored correlations between features to understand relationships.
- Handled missing values through techniques like imputation or removal.
- Detected and managed duplicate records in the dataset.
- Applied standardization or normalization to ensure features are on the same scale.
- Used techniques like Min-Max scaling or Z-score normalization.
- Created new features based on domain knowledge or transformations of existing features.
- Examples: total service usage, average call duration, etc.
Feature selection enhances model efficiency and interpretability.
- Utilized statistical tests like chi-squared or ANOVA to select features with significant impact.
- Considered p-values and domain knowledge.
- Executed RFE with different models to rank and select most relevant features.
- Eliminated features with lower importance.
Multiple machine learning models were trained to predict churn.
- Constructed a logistic regression model.
- Tuned hyperparameters using techniques like GridSearchCV.
- Focused on interpretability and feature importance.
Model evaluation is crucial to gauge predictive performance.
- Utilized metrics such as accuracy, precision, recall, F1-score, and ROC-AUC.
- Employed techniques like cross-validation to assess generalization.
Insights from models drove actionable recommendations.
- Focused on strategies to retain high-risk customers during action phase.
- Designed personalized approaches based on predicted churn probabilities.
- Aligned marketing efforts with identified influential features.
- This project demonstrates the end-to-end process of customer churn prediction, from data preprocessing, feature selection, and model building to evaluation and business-oriented recommendations. The insights gained empower businesses to reduce churn rates and enhance customer satisfaction.
