Project author: anupriya1519

Project description:
Data Analysis and Decision Making Project using R
Primary language: R
Project URL: git://github.com/anupriya1519/House-Mortgage-Dataset-Analysis.git


House-Mortgage-Dataset-Analysis

Part-1 Exploratory Dataset Analysis: Home Mortgage NY

  1. Loading and analyzing the dataset
  2. Visualizing the dataset
  3. Building a classification model to decide which predictors are most important
  4. Assessing the accuracy of the model by plotting the ROC curve
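
As an illustration of step 4, the following is a minimal sketch of how an ROC curve and its area (AUC) can be computed in base R. It runs on simulated labels and scores, not on the project's data, and the helper names `roc_points` and `auc_trapezoid` are hypothetical rather than taken from the repository.

```r
# Simulated labels (1 = positive) and classifier scores; NOT the project's data.
set.seed(42)
n <- 500
label <- rbinom(n, size = 1, prob = 0.3)
score <- plogis(rnorm(n, mean = ifelse(label == 1, 1, -1)))

# ROC points: lower the threshold one observation at a time, from the
# highest score to the lowest, and track the cumulative rates.
# (Assumes no tied scores, which holds for continuous simulated scores.)
roc_points <- function(score, label) {
  ord <- order(score, decreasing = TRUE)
  lab <- label[ord]
  tpr <- c(0, cumsum(lab == 1) / sum(lab == 1))  # true positive rate
  fpr <- c(0, cumsum(lab == 0) / sum(lab == 0))  # false positive rate
  data.frame(fpr = fpr, tpr = tpr)
}

# Area under the ROC curve by the trapezoidal rule.
auc_trapezoid <- function(roc) {
  sum(diff(roc$fpr) * (head(roc$tpr, -1) + tail(roc$tpr, -1)) / 2)
}

roc <- roc_points(score, label)
auc <- auc_trapezoid(roc)

plot(roc$fpr, roc$tpr, type = "l",
     xlab = "False positive rate", ylab = "True positive rate",
     main = "ROC curve (simulated data)")
abline(0, 1, lty = 2)  # reference line for a random classifier
```

An AUC close to 1 indicates good separation of the two classes, while a value near 0.5 is no better than chance.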

Part-2

The main aim of the project was to predict whether a mortgage application would be accepted. The dataset was taken from the AER package in R.

Steps involved:

  1. Data extraction and cleansing
  2. Outlier detection using Q-Q plots and box plots
  3. Skewness detection using density plots
  4. Model fitting using GLM
  5. Model selection using forward and hybrid methods
  6. Choosing the best model using ANOVA
  7. Estimating the model accuracy
  8. Plotting the ROC curve

Conclusion:

Based on the analyses performed, the most significant predictors are:

  1. Payment-to-income ratio
  2. Loan-to-value ratio
  3. Credit history: consumer payments
  4. Public bad credit record
  5. Insurance
  6. Ethnicity
  7. Marital status
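
Steps 4 to 7 of Part-2 can be sketched roughly as follows. This is an illustration under assumptions, not the repository's actual script: it assumes the data set is `HMDA` from the AER package (the README names only the package, but the predictors listed above appear to match that data set's columns), that the response is the `deny` factor, and that accuracy is measured in-sample at a 0.5 cutoff. Note that `deny` records a denial, so acceptance is its complement.

```r
# Assumption: the project's data set is HMDA from the AER package.
data("HMDA", package = "AER")
hmda <- na.omit(HMDA)  # keep complete cases so all models use the same rows

# Step 4: logistic regression with GLM. `deny` is a factor ("no"/"yes"),
# so the model estimates the probability that an application is denied.
null_model <- glm(deny ~ 1, data = hmda, family = binomial)
full_model <- glm(deny ~ ., data = hmda, family = binomial)

# Step 5: forward and hybrid (both directions) selection by AIC,
# starting from the intercept-only model with the full model as upper scope.
forward_model <- step(null_model, scope = formula(full_model),
                      direction = "forward", trace = 0)
hybrid_model <- step(null_model, scope = formula(full_model),
                     direction = "both", trace = 0)

# Step 6: compare the selected model with the full model using a
# likelihood-ratio (chi-squared) test on the analysis-of-deviance table.
comparison <- anova(hybrid_model, full_model, test = "Chisq")
print(comparison)

# Step 7: in-sample accuracy at a 0.5 probability cutoff (an assumption;
# a held-out test set would give a less optimistic estimate).
prob <- predict(hybrid_model, type = "response")
pred <- ifelse(prob > 0.5, "yes", "no")
accuracy <- mean(pred == hmda$deny)
print(accuracy)
```

Because denials are much rarer than approvals in this kind of data, raw accuracy can look high even for a weak model, which is one reason to inspect the ROC curve as well.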