Authors

Dr. Jim Grayson

Augusta University

Sam Gardner

Eli Lilly

Mia Stephens

JMP

Objective

Build a variety of prediction models (multiple regression, partition tree, and a neural network) to determine the one that performs the best at predicting house prices based upon various characteristics of the house and its location.

Background

The objective of this study is to develop a model to predict the median value of homes in the Boston area. The data were originally collected and assembled in the mid-1970s (Harrison and Rubinfield, 1978), so this example is a bit dated. However, it is typical of a socioeconomic data set that is used to inform economic or public policy decisions, and the data set is well-known throughout the data mining community.

The Task

Our goal is to use the available data build a model that makes accurate predictions about home values in the Boston area. To ensure that the model predicts well for data not used to build the model, we use model validation. We will build different models (e.g., multiple regression, regression tree and neural network) in JMP Pro, compare the performance of these models, and select the best-performing model. 


Use the links below to read the full case study and download the data files