The customer is the credit department of a bank. The task is to determine whether a client's marital status and number of children affect whether the loan is repaid on time. The input data, provided by the bank, consists of statistics on customer solvency.
The results of the study will be taken into account when building a credit scoring model — a special system that evaluates the ability of a potential borrower to repay a loan to a bank.
The study has two goals:
- Preprocessing the data to make it suitable for analysis.
- Analyzing the factors that may affect the ability of bank clients to repay their debt on time.
No information is given about the quality of the data, so the data must be inspected before the main determinants of creditworthiness are studied. We will preprocess the data and look for ways to correct the most critical errors.
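The kind of inspection described above can be sketched as follows. This is a minimal illustration using only the Python standard library. The field names (`children`, `days_employed`, `education`, `total_income`) and all values are hypothetical and are not taken from the bank's dataset. The checks shown (missing values, negative numbers, exact duplicate rows) are assumed examples of "critical errors", not a confirmed list.

```python
from collections import Counter

# Hypothetical sample records; field names and values are illustrative only.
records = [
    {"children": 1, "days_employed": -8437.7,
     "education": "higher education", "total_income": 253875.6},
    {"children": -1, "days_employed": None,
     "education": "HIGHER EDUCATION", "total_income": None},
    {"children": 1, "days_employed": -8437.7,
     "education": "higher education", "total_income": 253875.6},
]


def inspect(rows):
    """Return simple data-quality counts for a list of dict records."""
    missing = Counter()   # field -> number of None values
    negative = Counter()  # field -> number of negative numeric values
    for row in rows:
        for key, value in row.items():
            if value is None:
                missing[key] += 1
            elif isinstance(value, (int, float)) and value < 0:
                negative[key] += 1
    # Exact duplicates: rows whose full set of (field, value) pairs repeats.
    seen = Counter(tuple(sorted(row.items())) for row in rows)
    duplicates = sum(count - 1 for count in seen.values())
    return {
        "missing": dict(missing),
        "negative": dict(negative),
        "duplicates": duplicates,
    }


report = inspect(records)
print(report)
```

A report like this only flags suspicious values. Whether a negative number or a gap is a genuine error, and how to correct it, still has to be decided per field during preprocessing.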
The study will be carried out in three stages:
- Data overview.
- Data preprocessing.
- Analysis of creditworthiness determinants.
The analysis is based on the following client features provided by the bank:
- Number of children
- Number of days employed
- Age
- Education level
- Family status
- Gender
- Employment type
- Binary indicator of owing a debt
- Monthly income
- Loan purpose
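To show how these features could feed the final stage, the sketch below computes the share of clients with a debt within each group of a chosen feature. It assumes, hypothetically, that the debt indicator is stored as `debt` with 1 meaning the client has owed a debt, and that the other fields are named `children` and `family_status`. The records are made up for illustration and say nothing about the real data.

```python
from collections import defaultdict

# Hypothetical records; names and values are assumptions for illustration.
clients = [
    {"children": 0, "family_status": "married", "debt": 0},
    {"children": 0, "family_status": "married", "debt": 1},
    {"children": 2, "family_status": "unmarried", "debt": 1},
    {"children": 2, "family_status": "married", "debt": 0},
    {"children": 0, "family_status": "unmarried", "debt": 0},
]


def debt_rate_by(rows, feature):
    """Return the share of rows with debt == 1 for each value of `feature`."""
    totals = defaultdict(int)  # group -> number of clients
    debts = defaultdict(int)   # group -> number of clients with a debt
    for row in rows:
        totals[row[feature]] += 1
        debts[row[feature]] += row["debt"]
    return {group: debts[group] / totals[group] for group in totals}


by_children = debt_rate_by(clients, "children")
by_family = debt_rate_by(clients, "family_status")
print(by_children)
print(by_family)
```

Comparing the shares across groups is only a descriptive first step. Small groups can produce unstable rates, so group sizes should be reported alongside the shares.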