CS代写 | 留学生计算机辅导 | Assignment代做 | Code代做 | 论文辅导

CS代写 | 留学生计算机辅导 | Assignment代做 | Code代做 | 论文辅导

Foundational Business Analytics - Coursework 2024-2025

2025-01-24

Foundational Business Analytics - Coursework 2024-2025

Key Information

  • Release Date: 14th October 2024 (dataset available on Moodle)
  • Deadline: 5th December 2024, 3:00 pm
  • Submission: Via Moodle coursework submission link on the FBA module web page

1. Problem Definition

A financial institution has been over-issuing loans to unqualified applicants to gain market share. This strategy has increased the number of loan defaults, causing significant financial losses. Traditionally, the company reacts only after a loan defaults, but this passive approach is unsustainable.

The institution plans to launch a proactive loan risk management programme to predict loans likely to default and intervene early to avoid losses. The historical dataset contains demographic, credit history, and repayment details, including whether loans defaulted. Your task as a consultant is to analyze this data and build a model to predict loan defaults. Additionally, you must provide business recommendations based on your analysis.


2. Important Message from the CEO

The CEO’s key directive:

  • Predict loans likely to default to enable early intervention and minimize losses.
  • Focus on avoiding financial losses from defaults, even if this risks minor customer dissatisfaction during investigations.

3. The Available Dataset

You are provided with a unique dataset in CSV format containing 300,000 samples of loan data, accessible via Moodle. The dataset schema includes:

TypeNameDescription
IDIDUnique identifier for the loan listing.
InputLoan_AmountLoan amount in dollars.
InputTermLoan term (in years).
InputInterest_RateLoan interest rate.
InputInstallmentFixed regular payment for loan repayment.
InputGradeLoan grade indicating risk (A to G; higher grades are better).
InputSub_GradeDetailed subcategory within the loan grade.
InputEmployment_DurationBorrower’s employment duration (in years).
InputRealestate_OwnershipBorrower’s real-estate ownership (categorical: rent, mortgage, etc.).
InputAnnual_IncomeBorrower’s annual income.
InputPurposePurpose of the loan (e.g., housing, vehicle, education, etc.).
InputDTIDebt-to-income ratio.
InputFICO_Range_LowLower range of borrower’s FICO score.
InputFICO_Range_HighHigher range of borrower’s FICO score.
OutputYWhether the loan defaulted (Y=0 not default, Y=1 default).

4. Formal Task Specification

  • Objective: Build a classification model to predict loan defaults.
  • Process:
    1. Statistical analysis of input features.
    2. Model selection and training using Python 3 or Orange3.
    3. Evaluate implications and provide business recommendations.
  • Submission Requirements:
    • A maximum 8-page report (excluding the front page).
    • A zip file containing your model implementation with instructions for use.

5. Report Sections

Section A: Summarization (10 marks)

  • Perform statistical analysis of the dataset, examining relationships between features and the target variable (Y).
  • Use visuals like tables, bar charts, or scatter graphs to communicate insights clearly.

Section B: Preparation and Exploration (15 marks)

  • Describe your data cleaning and transformation processes, including handling missing values and outliers.
  • Apply a decision tree to explore feature importance and sub-populations.

Section C: Model Evaluation (25 marks)

  • Select and compare three classification models (from Logistic Regression, Decision Trees, Random Forests, Naive Bayes, and KNN).
  • Detail the models, parameters, and evaluation strategy (e.g., confusion matrices, performance metrics).
  • Provide a thorough comparison against a benchmark predictor.

Section D: Final Assessment (5 marks)

  • Justify your selected "winning" classifier, considering its business implications.

Section E: Model Implementation (5 marks)

  • Train the final model on the entire dataset.
  • Provide clear instructions for using the model on new data.

Section F: Business Recommendations (10 marks)

  • Summarize business recommendations based on your findings.

6. Further Available Marks

  • Report Presentation: Clarity, professionalism, and argument quality (5 marks).
  • Model Implementation: Code quality and usability for making predictions on new data (20 marks).
  • Model Effectiveness: Performance on a hidden test dataset (5 marks).

7. Submission Guidelines

  • Submit a zip file containing:
    1. The final report (maximum 8 pages).
    2. Model code/workflow files.
  • Late submissions incur a 5% penalty per day.
  • Only the first 8 pages of the report will be assessed.

8. Final Notes on Plagiarism

  • Each student’s dataset is unique to ensure individual work.
  • Submissions will be checked for originality; plagiarized work receives zero marks.

9. Additional Tips

  • Focus on robust model evaluation and understanding over achieving perfect predictions.
  • Presentation matters; format your report professionally.
  • Using Python offers opportunities for advanced analysis but is not required to score high marks.
  • Decision trees and visuals should be concise and informative.

联系我们

WeChat:pythonyt001
Email: [email protected]
17921737610209_.pic.jpg

  • 1
class="duration-300 fixed bottom-5 right-5 flex items-center justify-center rounded-full bg-neutral p-3 opacity-0 shadow transition-all hover:opacity-70 hover:shadow-lg lg:bottom-10 lg:right-10">