Student Work

Bulk Analysis of Mortgage Data with Cluster Computing

Angelo, Gordon & Co. is developing a statistical model of loan delinquency status. This project designed and implemented a software package that processes a public data set from Wells Fargo to build a rudimentary model. The data requires significant manipulation to convert it into a useful form. A series of compartmentalized modules was created and combined to form a “Tech Stack”, which runs each step in sequence and ends with an upload to a cloud storage provider. Once the Tech Stack had processed the relevant data, several sample analyses were run to demonstrate what the data can support. The size of the data set made computation on a single computer impractical, so a cluster was used to analyze the data.

  • This report represents the work of one or more WPI undergraduate students submitted to the faculty as evidence of completion of a degree requirement. WPI routinely publishes these reports on its website without editorial or peer review.
Creator
Publisher
Identifier
  • E-project-012318-090334
Advisor
Year
  • 2018
Center
Sponsor
Date created
  • 2018-01-23
Resource type
Major
Rights statement

Permanent link to this page: https://digital.wpi.edu/show/pz50gx870