Machine Learning Models to Predict House Prices based on Home Features

Venkat Shiva Pandiri

Project

Machine Learning Models to Predict House Prices based on Home Features

This project carried out a systematic investigation to predict the final price of each home using machine learning techniques. Various machine learning techniques such as multiple linear regression (base model), random forest regression and polynomial regression were applied to the dataset to compare the results. The data describes the sale of individual properties, various features and details of each home in Ames, IW from 2006 to 2010. The dataset comprises of 80 explanatory variables which include 23 nominal, 23 ordinal, 14 discrete, and 20 continuous variables. The programs were implemented using Python, by using core libraries like pandas, scikit–learn, NumPy. Backward elimination algorithm is applied in building optimal model and selection of features over 270 independent variables with approximately 7,91,320 observations. K-fold cross validation technique is used to measure the performance of all the models. A good high R- squared values with low variance are recorded for linear models. In order to select a good prediction model, all the regression models are explored and compared with each other. Results from K fold cross validation indicates high R-squared values for MLR and Random forest, stating a high level of performance when applied on an actual test set. Each model is evaluated with kaggle score checker. My Random forest model achieved the score of 0.14696, which is better compared to my base model Multiple linear regression (kaggle score 0.16854) and Polynomial regression (kaggle score 0.24399).

Date

2017-08-10

Resource Type

Project

Creator

Venkat Shiva Pandiri

Advisor

Zhang, Xiaoyu

Committee Member

Ye, Xin

Campus

San Marcos

College

Science, Technology, Engineering & Math

Department

Computer Science & Information Systems

Publisher

California State University, San Marcos

Degree Level

Masters

Degree Name

M.S.

Degree Program

Computer Science

Subjects

Date Accessioned

2017-08-09

["1 year"]

Handle

http://hdl.handle.net/10211.3/194683

["Submitted by Venkat Shiva Pandiri (pandi001@cougars.csusm.edu) on 2017-08-09T18:54:15Z\nNo. of bitstreams: 1\nPandiriVenkatShiva_Summer2017.pdf: 3853821 bytes, checksum: 7d2070cc2c46383e4a189203fa2b80f6 (MD5)", "Approved for entry into archive by Carmen Mitchell (cmitchell@csusm.edu) on 2017-08-10T17:24:18Z (GMT) No. of bitstreams: 1\nPandiriVenkatShiva_Summer2017.pdf: 3853821 bytes, checksum: 7d2070cc2c46383e4a189203fa2b80f6 (MD5)", "Made available in DSpace on 2017-08-10T17:24:18Z (GMT). No. of bitstreams: 1\nPandiriVenkatShiva_Summer2017.pdf: 3853821 bytes, checksum: 7d2070cc2c46383e4a189203fa2b80f6 (MD5)"]

Language

English

Thumbnail	Title	Date Uploaded	Visibility	Actions
	PandiriVenkatShiva_Summer2017.pdf	2019-11-12	Public	Download

Downloadable Content

Machine Learning Models to Predict House Prices based on Home Features