# ML Project: House Prices Prediction Advanced Regression Techniques | Kaggle Competition

In this project, we are going to predict the price of a house using its 80 features. Basically we are solving the Kaggle Competition.

Follow the “House Prices Prediction: Advanced Regression Techniques End to End Project” step by step to get 3 Bonus.
1. Raw Dataset
2. Ready to use Clean Dataset for ML project
3. Full Project in Jupyter Notebook File

# House Prices: Advanced Regression Techniques¶

## Goal of the Project¶

Predict the price of a house by its features. If you are a buyer or seller of the house but you don’t know the exact price of the house, so supervised machine learning regression algorithms can help you to predict the price of the house just providing features of the target house.

## Import essential libraries¶

In [1]:
# Import libraries
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns


## Load Data Set¶

In [2]:
train = pd.read_csv('train.csv')

print("Shape of train: ", train.shape)
print("Shape of test: ", test.shape)


Shape of train:  (1460, 81)
Shape of test:  (1459, 80)

In [3]:
train.head(10)

Out[3]:
Id MSSubClass MSZoning LotFrontage LotArea Street Alley LotShape LandContour Utilities PoolArea PoolQC Fence MiscFeature MiscVal MoSold YrSold SaleType SaleCondition SalePrice
0 1 60 RL 65.0 8450 Pave NaN Reg Lvl AllPub 0 NaN NaN NaN 0 2 2008 WD Normal 208500
1 2 20 RL 80.0 9600 Pave NaN Reg Lvl AllPub 0 NaN NaN NaN 0 5 2007 WD Normal 181500
2 3 60 RL 68.0 11250 Pave NaN IR1 Lvl AllPub 0 NaN NaN NaN 0 9 2008 WD Normal 223500
3 4 70 RL 60.0 9550 Pave NaN IR1 Lvl AllPub 0 NaN NaN NaN 0 2 2006 WD Abnorml 140000
4 5 60 RL 84.0 14260 Pave NaN IR1 Lvl AllPub 0 NaN NaN NaN 0 12 2008 WD Normal 250000
5 6 50 RL 85.0 14115 Pave NaN IR1 Lvl AllPub 0 NaN MnPrv Shed 700 10 2009 WD Normal 143000
6 7 20 RL 75.0 10084 Pave NaN Reg Lvl AllPub 0 NaN NaN NaN 0 8 2007 WD Normal 307000
7 8 60 RL NaN 10382 Pave NaN IR1 Lvl AllPub 0 NaN NaN Shed 350 11 2009 WD Normal 200000
8 9 50 RM 51.0 6120 Pave NaN Reg Lvl AllPub 0 NaN NaN NaN 0 4 2008 WD Abnorml 129900
9 10 190 RL 50.0 7420 Pave NaN Reg Lvl AllPub 0 NaN NaN NaN 0 1 2008 WD Normal 118000

10 rows × 81 columns

In [4]:
test.head(10)

Out[4]:
Id MSSubClass MSZoning LotFrontage LotArea Street Alley LotShape LandContour Utilities ScreenPorch PoolArea PoolQC Fence MiscFeature MiscVal MoSold YrSold SaleType SaleCondition
0 1461 20 RH 80.0 11622 Pave NaN Reg Lvl AllPub 120 0 NaN MnPrv NaN 0 6 2010 WD Normal
1 1462 20 RL 81.0 14267 Pave NaN IR1 Lvl AllPub 0 0 NaN NaN Gar2 12500 6 2010 WD Normal
2 1463 60 RL 74.0 13830 Pave NaN IR1 Lvl AllPub 0 0 NaN MnPrv NaN 0 3 2010 WD Normal
3 1464 60 RL 78.0 9978 Pave NaN IR1 Lvl AllPub 0 0 NaN NaN NaN 0 6 2010 WD Normal
4 1465 120 RL 43.0 5005 Pave NaN IR1 HLS AllPub 144 0 NaN NaN NaN 0 1 2010 WD Normal
5 1466 60 RL 75.0 10000 Pave NaN IR1 Lvl AllPub 0 0 NaN NaN NaN 0 4 2010 WD Normal
6 1467 20 RL NaN 7980 Pave NaN IR1 Lvl AllPub 0 0 NaN GdPrv Shed 500 3 2010 WD Normal
7 1468 60 RL 63.0 8402 Pave NaN IR1 Lvl AllPub 0 0 NaN NaN NaN 0 5 2010 WD Normal
8 1469 20 RL 85.0 10176 Pave NaN Reg Lvl AllPub 0 0 NaN NaN NaN 0 2 2010 WD Normal
9 1470 20 RL 70.0 8400 Pave NaN Reg Lvl AllPub 0 0 NaN MnPrv NaN 0 4 2010 WD Normal

10 rows × 80 columns

In [5]:
## concat train and test
df = pd.concat((train, test))
temp_df = df
print("Shape of df: ", df.shape)


Shape of df:  (2919, 81)


C:\ProgramData\Anaconda3\lib\site-packages\ipykernel_launcher.py:2: FutureWarning: Sorting because non-concatenation axis is not aligned. A future version
of pandas will change to not sort by default.

To accept the future behavior, pass 'sort=False'.

To retain the current behavior and silence the warning, pass 'sort=True'.


In [6]:
df.head(6)

Out[6]:
1stFlrSF 2ndFlrSF 3SsnPorch Alley BedroomAbvGr BldgType BsmtCond BsmtExposure BsmtFinSF1 BsmtFinSF2 SaleType ScreenPorch Street TotRmsAbvGrd TotalBsmtSF Utilities WoodDeckSF YearBuilt YearRemodAdd YrSold
0 856 854 0 NaN 3 1Fam TA No 706.0 0.0 WD 0 Pave 8 856.0 AllPub 0 2003 2003 2008
1 1262 0 0 NaN 3 1Fam TA Gd 978.0 0.0 WD 0 Pave 6 1262.0 AllPub 298 1976 1976 2007
2 920 866 0 NaN 3 1Fam TA Mn 486.0 0.0 WD 0 Pave 6 920.0 AllPub 0 2001 2002 2008
3 961 756 0 NaN 3 1Fam Gd No 216.0 0.0 WD 0 Pave 7 756.0 AllPub 0 1915 1970 2006
4 1145 1053 0 NaN 4 1Fam TA Av 655.0 0.0 WD 0 Pave 9 1145.0 AllPub 192 2000 2000 2008
5 796 566 320 NaN 1 1Fam TA No 732.0 0.0 WD 0 Pave 5 796.0 AllPub 40 1993 1995 2009

6 rows × 81 columns

In [7]:
df.tail(6)

Out[7]:
1stFlrSF 2ndFlrSF 3SsnPorch Alley BedroomAbvGr BldgType BsmtCond BsmtExposure BsmtFinSF1 BsmtFinSF2 SaleType ScreenPorch Street TotRmsAbvGrd TotalBsmtSF Utilities WoodDeckSF YearBuilt YearRemodAdd YrSold
1453 546 546 0 NaN 3 Twnhs TA No 0.0 0.0 WD 0 Pave 5 546.0 AllPub 0 1970 1970 2006
1454 546 546 0 NaN 3 Twnhs TA No 0.0 0.0 WD 0 Pave 5 546.0 AllPub 0 1970 1970 2006
1455 546 546 0 NaN 3 TwnhsE TA No 252.0 0.0 WD 0 Pave 6 546.0 AllPub 0 1970 1970 2006
1456 1224 0 0 NaN 4 1Fam TA No 1224.0 0.0 WD 0 Pave 7 1224.0 AllPub 474 1960 1996 2006
1457 970 0 0 NaN 3 1Fam TA Av 337.0 0.0 WD 0 Pave 6 912.0 AllPub 80 1992 1992 2006
1458 996 1004 0 NaN 3 1Fam TA Av 758.0 0.0 WD 0 Pave 9 996.0 AllPub 190 1993 1994 2006

6 rows × 81 columns

# Exploratory Data Analysis (EDA)¶

In [8]:
# To show the all columns
pd.set_option("display.max_columns", 2000)
pd.set_option("display.max_rows", 85)

In [9]:
df.head(6)

Out[9]:
1stFlrSF 2ndFlrSF 3SsnPorch Alley BedroomAbvGr BldgType BsmtCond BsmtExposure BsmtFinSF1 BsmtFinSF2 BsmtFinType1 BsmtFinType2 BsmtFullBath BsmtHalfBath BsmtQual BsmtUnfSF CentralAir Condition1 Condition2 Electrical EnclosedPorch ExterCond ExterQual Exterior1st Exterior2nd Fence FireplaceQu Fireplaces Foundation FullBath Functional GarageArea GarageCars GarageCond GarageFinish GarageQual GarageType GarageYrBlt GrLivArea HalfBath Heating HeatingQC HouseStyle Id KitchenAbvGr KitchenQual LandContour LandSlope LotArea LotConfig LotFrontage LotShape LowQualFinSF MSSubClass MSZoning MasVnrArea MasVnrType MiscFeature MiscVal MoSold Neighborhood OpenPorchSF OverallCond OverallQual PavedDrive PoolArea PoolQC RoofMatl RoofStyle SaleCondition SalePrice SaleType ScreenPorch Street TotRmsAbvGrd TotalBsmtSF Utilities WoodDeckSF YearBuilt YearRemodAdd YrSold
0 856 854 0 NaN 3 1Fam TA No 706.0 0.0 GLQ Unf 1.0 0.0 Gd 150.0 Y Norm Norm SBrkr 0 TA Gd VinylSd VinylSd NaN NaN 0 PConc 2 Typ 548.0 2.0 TA RFn TA Attchd 2003.0 1710 1 GasA Ex 2Story 1 1 Gd Lvl Gtl 8450 Inside 65.0 Reg 0 60 RL 196.0 BrkFace NaN 0 2 CollgCr 61 5 7 Y 0 NaN CompShg Gable Normal 208500.0 WD 0 Pave 8 856.0 AllPub 0 2003 2003 2008
1 1262 0 0 NaN 3 1Fam TA Gd 978.0 0.0 ALQ Unf 0.0 1.0 Gd 284.0 Y Feedr Norm SBrkr 0 TA TA MetalSd MetalSd NaN TA 1 CBlock 2 Typ 460.0 2.0 TA RFn TA Attchd 1976.0 1262 0 GasA Ex 1Story 2 1 TA Lvl Gtl 9600 FR2 80.0 Reg 0 20 RL 0.0 None NaN 0 5 Veenker 0 8 6 Y 0 NaN CompShg Gable Normal 181500.0 WD 0 Pave 6 1262.0 AllPub 298 1976 1976 2007
2 920 866 0 NaN 3 1Fam TA Mn 486.0 0.0 GLQ Unf 1.0 0.0 Gd 434.0 Y Norm Norm SBrkr 0 TA Gd VinylSd VinylSd NaN TA 1 PConc 2 Typ 608.0 2.0 TA RFn TA Attchd 2001.0 1786 1 GasA Ex 2Story 3 1 Gd Lvl Gtl 11250 Inside 68.0 IR1 0 60 RL 162.0 BrkFace NaN 0 9 CollgCr 42 5 7 Y 0 NaN CompShg Gable Normal 223500.0 WD 0 Pave 6 920.0 AllPub 0 2001 2002 2008
3 961 756 0 NaN 3 1Fam Gd No 216.0 0.0 ALQ Unf 1.0 0.0 TA 540.0 Y Norm Norm SBrkr 272 TA TA Wd Sdng Wd Shng NaN Gd 1 BrkTil 1 Typ 642.0 3.0 TA Unf TA Detchd 1998.0 1717 0 GasA Gd 2Story 4 1 Gd Lvl Gtl 9550 Corner 60.0 IR1 0 70 RL 0.0 None NaN 0 2 Crawfor 35 5 7 Y 0 NaN CompShg Gable Abnorml 140000.0 WD 0 Pave 7 756.0 AllPub 0 1915 1970 2006
4 1145 1053 0 NaN 4 1Fam TA Av 655.0 0.0 GLQ Unf 1.0 0.0 Gd 490.0 Y Norm Norm SBrkr 0 TA Gd VinylSd VinylSd NaN TA 1 PConc 2 Typ 836.0 3.0 TA RFn TA Attchd 2000.0 2198 1 GasA Ex 2Story 5 1 Gd Lvl Gtl 14260 FR2 84.0 IR1 0 60 RL 350.0 BrkFace NaN 0 12 NoRidge 84 5 8 Y 0 NaN CompShg Gable Normal 250000.0 WD 0 Pave 9 1145.0 AllPub 192 2000 2000 2008
5 796 566 320 NaN 1 1Fam TA No 732.0 0.0 GLQ Unf 1.0 0.0 Gd 64.0 Y Norm Norm SBrkr 0 TA TA VinylSd VinylSd MnPrv NaN 0 Wood 1 Typ 480.0 2.0 TA Unf TA Attchd 1993.0 1362 1 GasA Ex 1.5Fin 6 1 TA Lvl Gtl 14115 Inside 85.0 IR1 0 50 RL 0.0 None Shed 700 10 Mitchel 30 5 5 Y 0 NaN CompShg Gable Normal 143000.0 WD 0 Pave 5 796.0 AllPub 40 1993 1995 2009
In [10]:
df.tail(6)

Out[10]:
1stFlrSF 2ndFlrSF 3SsnPorch Alley BedroomAbvGr BldgType BsmtCond BsmtExposure BsmtFinSF1 BsmtFinSF2 BsmtFinType1 BsmtFinType2 BsmtFullBath BsmtHalfBath BsmtQual BsmtUnfSF CentralAir Condition1 Condition2 Electrical EnclosedPorch ExterCond ExterQual Exterior1st Exterior2nd Fence FireplaceQu Fireplaces Foundation FullBath Functional GarageArea GarageCars GarageCond GarageFinish GarageQual GarageType GarageYrBlt GrLivArea HalfBath Heating HeatingQC HouseStyle Id KitchenAbvGr KitchenQual LandContour LandSlope LotArea LotConfig LotFrontage LotShape LowQualFinSF MSSubClass MSZoning MasVnrArea MasVnrType MiscFeature MiscVal MoSold Neighborhood OpenPorchSF OverallCond OverallQual PavedDrive PoolArea PoolQC RoofMatl RoofStyle SaleCondition SalePrice SaleType ScreenPorch Street TotRmsAbvGrd TotalBsmtSF Utilities WoodDeckSF YearBuilt YearRemodAdd YrSold
1453 546 546 0 NaN 3 Twnhs TA No 0.0 0.0 Unf Unf 0.0 0.0 TA 546.0 Y Norm Norm SBrkr 0 TA TA CemntBd CmentBd GdPrv NaN 0 CBlock 1 Typ 0.0 0.0 NaN NaN NaN NaN NaN 1092 1 GasA TA 2Story 2914 1 TA Lvl Gtl 1526 Inside 21.0 Reg 0 160 RM 0.0 None NaN 0 6 MeadowV 34 5 4 Y 0 NaN CompShg Gable Normal NaN WD 0 Pave 5 546.0 AllPub 0 1970 1970 2006
1454 546 546 0 NaN 3 Twnhs TA No 0.0 0.0 Unf Unf 0.0 0.0 TA 546.0 Y Norm Norm SBrkr 0 TA TA CemntBd CmentBd NaN NaN 0 CBlock 1 Typ 0.0 0.0 NaN NaN NaN NaN NaN 1092 1 GasA Gd 2Story 2915 1 TA Lvl Gtl 1936 Inside 21.0 Reg 0 160 RM 0.0 None NaN 0 6 MeadowV 0 7 4 Y 0 NaN CompShg Gable Normal NaN WD 0 Pave 5 546.0 AllPub 0 1970 1970 2006
1455 546 546 0 NaN 3 TwnhsE TA No 252.0 0.0 Rec Unf 0.0 0.0 TA 294.0 Y Norm Norm SBrkr 0 TA TA CemntBd CmentBd NaN NaN 0 CBlock 1 Typ 286.0 1.0 TA Unf TA CarPort 1970.0 1092 1 GasA TA 2Story 2916 1 TA Lvl Gtl 1894 Inside 21.0 Reg 0 160 RM 0.0 None NaN 0 4 MeadowV 24 5 4 Y 0 NaN CompShg Gable Abnorml NaN WD 0 Pave 6 546.0 AllPub 0 1970 1970 2006
1456 1224 0 0 NaN 4 1Fam TA No 1224.0 0.0 ALQ Unf 1.0 0.0 TA 0.0 Y Norm Norm SBrkr 0 TA TA VinylSd VinylSd NaN TA 1 CBlock 1 Typ 576.0 2.0 TA Unf TA Detchd 1960.0 1224 0 GasA Ex 1Story 2917 1 TA Lvl Gtl 20000 Inside 160.0 Reg 0 20 RL 0.0 None NaN 0 9 Mitchel 0 7 5 Y 0 NaN CompShg Gable Abnorml NaN WD 0 Pave 7 1224.0 AllPub 474 1960 1996 2006
1457 970 0 0 NaN 3 1Fam TA Av 337.0 0.0 GLQ Unf 0.0 1.0 Gd 575.0 Y Norm Norm SBrkr 0 TA TA HdBoard Wd Shng MnPrv NaN 0 PConc 1 Typ 0.0 0.0 NaN NaN NaN NaN NaN 970 0 GasA TA SFoyer 2918 1 TA Lvl Gtl 10441 Inside 62.0 Reg 0 85 RL 0.0 None Shed 700 7 Mitchel 32 5 5 Y 0 NaN CompShg Gable Normal NaN WD 0 Pave 6 912.0 AllPub 80 1992 1992 2006
1458 996 1004 0 NaN 3 1Fam TA Av 758.0 0.0 LwQ Unf 0.0 0.0 Gd 238.0 Y Norm Norm SBrkr 0 TA TA HdBoard HdBoard NaN TA 1 PConc 2 Typ 650.0 3.0 TA Fin TA Attchd 1993.0 2000 1 GasA Ex 2Story 2919 1 TA Lvl Mod 9627 Inside 74.0 Reg 0 60 RL 94.0 BrkFace NaN 0 11 Mitchel 48 5 7 Y 0 NaN CompShg Gable Normal NaN WD 0 Pave 9 996.0 AllPub 190 1993 1994 2006
In [11]:
df.info()


<class 'pandas.core.frame.DataFrame'>
Int64Index: 2919 entries, 0 to 1458
Data columns (total 81 columns):
1stFlrSF         2919 non-null int64
2ndFlrSF         2919 non-null int64
3SsnPorch        2919 non-null int64
Alley            198 non-null object
BedroomAbvGr     2919 non-null int64
BldgType         2919 non-null object
BsmtCond         2837 non-null object
BsmtExposure     2837 non-null object
BsmtFinSF1       2918 non-null float64
BsmtFinSF2       2918 non-null float64
BsmtFinType1     2840 non-null object
BsmtFinType2     2839 non-null object
BsmtFullBath     2917 non-null float64
BsmtHalfBath     2917 non-null float64
BsmtQual         2838 non-null object
BsmtUnfSF        2918 non-null float64
CentralAir       2919 non-null object
Condition1       2919 non-null object
Condition2       2919 non-null object
Electrical       2918 non-null object
EnclosedPorch    2919 non-null int64
ExterCond        2919 non-null object
ExterQual        2919 non-null object
Exterior1st      2918 non-null object
Exterior2nd      2918 non-null object
Fence            571 non-null object
FireplaceQu      1499 non-null object
Fireplaces       2919 non-null int64
Foundation       2919 non-null object
FullBath         2919 non-null int64
Functional       2917 non-null object
GarageArea       2918 non-null float64
GarageCars       2918 non-null float64
GarageCond       2760 non-null object
GarageFinish     2760 non-null object
GarageQual       2760 non-null object
GarageType       2762 non-null object
GarageYrBlt      2760 non-null float64
GrLivArea        2919 non-null int64
HalfBath         2919 non-null int64
Heating          2919 non-null object
HeatingQC        2919 non-null object
HouseStyle       2919 non-null object
Id               2919 non-null int64
KitchenAbvGr     2919 non-null int64
KitchenQual      2918 non-null object
LandContour      2919 non-null object
LandSlope        2919 non-null object
LotArea          2919 non-null int64
LotConfig        2919 non-null object
LotFrontage      2433 non-null float64
LotShape         2919 non-null object
LowQualFinSF     2919 non-null int64
MSSubClass       2919 non-null int64
MSZoning         2915 non-null object
MasVnrArea       2896 non-null float64
MasVnrType       2895 non-null object
MiscFeature      105 non-null object
MiscVal          2919 non-null int64
MoSold           2919 non-null int64
Neighborhood     2919 non-null object
OpenPorchSF      2919 non-null int64
OverallCond      2919 non-null int64
OverallQual      2919 non-null int64
PavedDrive       2919 non-null object
PoolArea         2919 non-null int64
PoolQC           10 non-null object
RoofMatl         2919 non-null object
RoofStyle        2919 non-null object
SaleCondition    2919 non-null object
SalePrice        1460 non-null float64
SaleType         2918 non-null object
ScreenPorch      2919 non-null int64
Street           2919 non-null object
TotRmsAbvGrd     2919 non-null int64
TotalBsmtSF      2918 non-null float64
Utilities        2917 non-null object
WoodDeckSF       2919 non-null int64
YearBuilt        2919 non-null int64
YearRemodAdd     2919 non-null int64
YrSold           2919 non-null int64
dtypes: float64(12), int64(26), object(43)
memory usage: 1.8+ MB

In [12]:
df.describe()

Out[12]:
1stFlrSF 2ndFlrSF 3SsnPorch BedroomAbvGr BsmtFinSF1 BsmtFinSF2 BsmtFullBath BsmtHalfBath BsmtUnfSF EnclosedPorch Fireplaces FullBath GarageArea GarageCars GarageYrBlt GrLivArea HalfBath Id KitchenAbvGr LotArea LotFrontage LowQualFinSF MSSubClass MasVnrArea MiscVal MoSold OpenPorchSF OverallCond OverallQual PoolArea SalePrice ScreenPorch TotRmsAbvGrd TotalBsmtSF WoodDeckSF YearBuilt YearRemodAdd YrSold
count 2919.000000 2919.000000 2919.000000 2919.000000 2918.000000 2918.000000 2917.000000 2917.000000 2918.000000 2919.000000 2919.000000 2919.000000 2918.000000 2918.000000 2760.000000 2919.000000 2919.000000 2919.000000 2919.000000 2919.000000 2433.000000 2919.000000 2919.000000 2896.000000 2919.000000 2919.000000 2919.000000 2919.000000 2919.000000 2919.000000 1460.000000 2919.000000 2919.000000 2918.000000 2919.000000 2919.000000 2919.000000 2919.000000
mean 1159.581706 336.483727 2.602261 2.860226 441.423235 49.582248 0.429894 0.061364 560.772104 23.098321 0.597122 1.568003 472.874572 1.766621 1978.113406 1500.759849 0.380267 1460.000000 1.044536 10168.114080 69.305795 4.694416 57.137718 102.201312 50.825968 6.213087 47.486811 5.564577 6.089072 2.251799 180921.195890 16.062350 6.451524 1051.777587 93.709832 1971.312778 1984.264474 2007.792737
std 392.362079 428.701456 25.188169 0.822693 455.610826 169.205611 0.524736 0.245687 439.543659 64.244246 0.646129 0.552969 215.394815 0.761624 25.574285 506.051045 0.502872 842.787043 0.214462 7886.996359 23.344905 46.396825 42.517628 179.334253 567.402211 2.714762 67.575493 1.113131 1.409947 35.663946 79442.502883 56.184365 1.569379 440.766258 126.526589 30.291442 20.894344 1.314964
min 334.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1895.000000 334.000000 0.000000 1.000000 0.000000 1300.000000 21.000000 0.000000 20.000000 0.000000 0.000000 1.000000 0.000000 1.000000 1.000000 0.000000 34900.000000 0.000000 2.000000 0.000000 0.000000 1872.000000 1950.000000 2006.000000
25% 876.000000 0.000000 0.000000 2.000000 0.000000 0.000000 0.000000 0.000000 220.000000 0.000000 0.000000 1.000000 320.000000 1.000000 1960.000000 1126.000000 0.000000 730.500000 1.000000 7478.000000 59.000000 0.000000 20.000000 0.000000 0.000000 4.000000 0.000000 5.000000 5.000000 0.000000 129975.000000 0.000000 5.000000 793.000000 0.000000 1953.500000 1965.000000 2007.000000
50% 1082.000000 0.000000 0.000000 3.000000 368.500000 0.000000 0.000000 0.000000 467.000000 0.000000 1.000000 2.000000 480.000000 2.000000 1979.000000 1444.000000 0.000000 1460.000000 1.000000 9453.000000 68.000000 0.000000 50.000000 0.000000 0.000000 6.000000 26.000000 5.000000 6.000000 0.000000 163000.000000 0.000000 6.000000 989.500000 0.000000 1973.000000 1993.000000 2008.000000
75% 1387.500000 704.000000 0.000000 3.000000 733.000000 0.000000 1.000000 0.000000 805.500000 0.000000 1.000000 2.000000 576.000000 2.000000 2002.000000 1743.500000 1.000000 2189.500000 1.000000 11570.000000 80.000000 0.000000 70.000000 164.000000 0.000000 8.000000 70.000000 6.000000 7.000000 0.000000 214000.000000 0.000000 7.000000 1302.000000 168.000000 2001.000000 2004.000000 2009.000000
max 5095.000000 2065.000000 508.000000 8.000000 5644.000000 1526.000000 3.000000 2.000000 2336.000000 1012.000000 4.000000 4.000000 1488.000000 5.000000 2207.000000 5642.000000 2.000000 2919.000000 3.000000 215245.000000 313.000000 1064.000000 190.000000 1600.000000 17000.000000 12.000000 742.000000 9.000000 10.000000 800.000000 755000.000000 576.000000 15.000000 6110.000000 1424.000000 2010.000000 2010.000000 2010.000000
In [13]:
df.select_dtypes(include=['int64', 'float64']).columns

Out[13]:
Index(['1stFlrSF', '2ndFlrSF', '3SsnPorch', 'BedroomAbvGr', 'BsmtFinSF1',
'BsmtFinSF2', 'BsmtFullBath', 'BsmtHalfBath', 'BsmtUnfSF',
'EnclosedPorch', 'Fireplaces', 'FullBath', 'GarageArea', 'GarageCars',
'GarageYrBlt', 'GrLivArea', 'HalfBath', 'Id', 'KitchenAbvGr', 'LotArea',
'LotFrontage', 'LowQualFinSF', 'MSSubClass', 'MasVnrArea', 'MiscVal',
'MoSold', 'OpenPorchSF', 'OverallCond', 'OverallQual', 'PoolArea',
'SalePrice', 'ScreenPorch', 'TotRmsAbvGrd', 'TotalBsmtSF', 'WoodDeckSF',
dtype='object')
In [14]:
df.select_dtypes(include=['object']).columns

Out[14]:
Index(['Alley', 'BldgType', 'BsmtCond', 'BsmtExposure', 'BsmtFinType1',
'BsmtFinType2', 'BsmtQual', 'CentralAir', 'Condition1', 'Condition2',
'Electrical', 'ExterCond', 'ExterQual', 'Exterior1st', 'Exterior2nd',
'Fence', 'FireplaceQu', 'Foundation', 'Functional', 'GarageCond',
'GarageFinish', 'GarageQual', 'GarageType', 'Heating', 'HeatingQC',
'HouseStyle', 'KitchenQual', 'LandContour', 'LandSlope', 'LotConfig',
'LotShape', 'MSZoning', 'MasVnrType', 'MiscFeature', 'Neighborhood',
'PavedDrive', 'PoolQC', 'RoofMatl', 'RoofStyle', 'SaleCondition',
'SaleType', 'Street', 'Utilities'],
dtype='object')
In [15]:
# Set index as Id column
df = df.set_index("Id")

In [16]:
df.head(6)

Out[16]:
1stFlrSF 2ndFlrSF 3SsnPorch Alley BedroomAbvGr BldgType BsmtCond BsmtExposure BsmtFinSF1 BsmtFinSF2 BsmtFinType1 BsmtFinType2 BsmtFullBath BsmtHalfBath BsmtQual BsmtUnfSF CentralAir Condition1 Condition2 Electrical EnclosedPorch ExterCond ExterQual Exterior1st Exterior2nd Fence FireplaceQu Fireplaces Foundation FullBath Functional GarageArea GarageCars GarageCond GarageFinish GarageQual GarageType GarageYrBlt GrLivArea HalfBath Heating HeatingQC HouseStyle KitchenAbvGr KitchenQual LandContour LandSlope LotArea LotConfig LotFrontage LotShape LowQualFinSF MSSubClass MSZoning MasVnrArea MasVnrType MiscFeature MiscVal MoSold Neighborhood OpenPorchSF OverallCond OverallQual PavedDrive PoolArea PoolQC RoofMatl RoofStyle SaleCondition SalePrice SaleType ScreenPorch Street TotRmsAbvGrd TotalBsmtSF Utilities WoodDeckSF YearBuilt YearRemodAdd YrSold
Id
1 856 854 0 NaN 3 1Fam TA No 706.0 0.0 GLQ Unf 1.0 0.0 Gd 150.0 Y Norm Norm SBrkr 0 TA Gd VinylSd VinylSd NaN NaN 0 PConc 2 Typ 548.0 2.0 TA RFn TA Attchd 2003.0 1710 1 GasA Ex 2Story 1 Gd Lvl Gtl 8450 Inside 65.0 Reg 0 60 RL 196.0 BrkFace NaN 0 2 CollgCr 61 5 7 Y 0 NaN CompShg Gable Normal 208500.0 WD 0 Pave 8 856.0 AllPub 0 2003 2003 2008
2 1262 0 0 NaN 3 1Fam TA Gd 978.0 0.0 ALQ Unf 0.0 1.0 Gd 284.0 Y Feedr Norm SBrkr 0 TA TA MetalSd MetalSd NaN TA 1 CBlock 2 Typ 460.0 2.0 TA RFn TA Attchd 1976.0 1262 0 GasA Ex 1Story 1 TA Lvl Gtl 9600 FR2 80.0 Reg 0 20 RL 0.0 None NaN 0 5 Veenker 0 8 6 Y 0 NaN CompShg Gable Normal 181500.0 WD 0 Pave 6 1262.0 AllPub 298 1976 1976 2007
3 920 866 0 NaN 3 1Fam TA Mn 486.0 0.0 GLQ Unf 1.0 0.0 Gd 434.0 Y Norm Norm SBrkr 0 TA Gd VinylSd VinylSd NaN TA 1 PConc 2 Typ 608.0 2.0 TA RFn TA Attchd 2001.0 1786 1 GasA Ex 2Story 1 Gd Lvl Gtl 11250 Inside 68.0 IR1 0 60 RL 162.0 BrkFace NaN 0 9 CollgCr 42 5 7 Y 0 NaN CompShg Gable Normal 223500.0 WD 0 Pave 6 920.0 AllPub 0 2001 2002 2008
4 961 756 0 NaN 3 1Fam Gd No 216.0 0.0 ALQ Unf 1.0 0.0 TA 540.0 Y Norm Norm SBrkr 272 TA TA Wd Sdng Wd Shng NaN Gd 1 BrkTil 1 Typ 642.0 3.0 TA Unf TA Detchd 1998.0 1717 0 GasA Gd 2Story 1 Gd Lvl Gtl 9550 Corner 60.0 IR1 0 70 RL 0.0 None NaN 0 2 Crawfor 35 5 7 Y 0 NaN CompShg Gable Abnorml 140000.0 WD 0 Pave 7 756.0 AllPub 0 1915 1970 2006
5 1145 1053 0 NaN 4 1Fam TA Av 655.0 0.0 GLQ Unf 1.0 0.0 Gd 490.0 Y Norm Norm SBrkr 0 TA Gd VinylSd VinylSd NaN TA 1 PConc 2 Typ 836.0 3.0 TA RFn TA Attchd 2000.0 2198 1 GasA Ex 2Story 1 Gd Lvl Gtl 14260 FR2 84.0 IR1 0 60 RL 350.0 BrkFace NaN 0 12 NoRidge 84 5 8 Y 0 NaN CompShg Gable Normal 250000.0 WD 0 Pave 9 1145.0 AllPub 192 2000 2000 2008
6 796 566 320 NaN 1 1Fam TA No 732.0 0.0 GLQ Unf 1.0 0.0 Gd 64.0 Y Norm Norm SBrkr 0 TA TA VinylSd VinylSd MnPrv NaN 0 Wood 1 Typ 480.0 2.0 TA Unf TA Attchd 1993.0 1362 1 GasA Ex 1.5Fin 1 TA Lvl Gtl 14115 Inside 85.0 IR1 0 50 RL 0.0 None Shed 700 10 Mitchel 30 5 5 Y 0 NaN CompShg Gable Normal 143000.0 WD 0 Pave 5 796.0 AllPub 40 1993 1995 2009
In [17]:
# Show the null values using heatmap
plt.figure(figsize=(16,9))
sns.heatmap(df.isnull())

Out[17]:
<matplotlib.axes._subplots.AxesSubplot at 0x1bb7ac45908>

In [18]:
# Get the percentages of null value
null_percent = df.isnull().sum()/df.shape[0]*100
null_percent

Out[18]:
1stFlrSF          0.000000
2ndFlrSF          0.000000
3SsnPorch         0.000000
Alley            93.216855
BedroomAbvGr      0.000000
BldgType          0.000000
BsmtCond          2.809181
BsmtExposure      2.809181
BsmtFinSF1        0.034258
BsmtFinSF2        0.034258
BsmtFinType1      2.706406
BsmtFinType2      2.740665
BsmtFullBath      0.068517
BsmtHalfBath      0.068517
BsmtQual          2.774923
BsmtUnfSF         0.034258
CentralAir        0.000000
Condition1        0.000000
Condition2        0.000000
Electrical        0.034258
EnclosedPorch     0.000000
ExterCond         0.000000
ExterQual         0.000000
Exterior1st       0.034258
Exterior2nd       0.034258
Fence            80.438506
FireplaceQu      48.646797
Fireplaces        0.000000
Foundation        0.000000
FullBath          0.000000
Functional        0.068517
GarageArea        0.034258
GarageCars        0.034258
GarageCond        5.447071
GarageFinish      5.447071
GarageQual        5.447071
GarageType        5.378554
GarageYrBlt       5.447071
GrLivArea         0.000000
HalfBath          0.000000
Heating           0.000000
HeatingQC         0.000000
HouseStyle        0.000000
KitchenAbvGr      0.000000
KitchenQual       0.034258
LandContour       0.000000
LandSlope         0.000000
LotArea           0.000000
LotConfig         0.000000
LotFrontage      16.649538
LotShape          0.000000
LowQualFinSF      0.000000
MSSubClass        0.000000
MSZoning          0.137033
MasVnrArea        0.787941
MasVnrType        0.822199
MiscFeature      96.402878
MiscVal           0.000000
MoSold            0.000000
Neighborhood      0.000000
OpenPorchSF       0.000000
OverallCond       0.000000
OverallQual       0.000000
PavedDrive        0.000000
PoolArea          0.000000
PoolQC           99.657417
RoofMatl          0.000000
RoofStyle         0.000000
SaleCondition     0.000000
SalePrice        49.982871
SaleType          0.034258
ScreenPorch       0.000000
Street            0.000000
TotRmsAbvGrd      0.000000
TotalBsmtSF       0.034258
Utilities         0.068517
WoodDeckSF        0.000000
YearBuilt         0.000000
YrSold            0.000000
dtype: float64
In [19]:
col_for_drop = null_percent[null_percent > 20].keys() # if the null value % 20 or > 20 so need to drop it

In [20]:
# drop columns
df = df.drop(col_for_drop, "columns")
df.shape

Out[20]:
(2919, 74)
In [21]:
# find the unique value count
for i in df.columns:
print(i + "\t" + str(len(df[i].unique())))


1stFlrSF	1083
2ndFlrSF	635
3SsnPorch	31
BedroomAbvGr	8
BldgType	5
BsmtCond	5
BsmtExposure	5
BsmtFinSF1	992
BsmtFinSF2	273
BsmtFinType1	7
BsmtFinType2	7
BsmtFullBath	5
BsmtHalfBath	4
BsmtQual	5
BsmtUnfSF	1136
CentralAir	2
Condition1	9
Condition2	8
Electrical	6
EnclosedPorch	183
ExterCond	5
ExterQual	4
Exterior1st	16
Exterior2nd	17
Fireplaces	5
Foundation	6
FullBath	5
Functional	8
GarageArea	604
GarageCars	7
GarageCond	6
GarageFinish	4
GarageQual	6
GarageType	7
GarageYrBlt	104
GrLivArea	1292
HalfBath	3
Heating	6
HeatingQC	5
HouseStyle	8
KitchenAbvGr	4
KitchenQual	5
LandContour	4
LandSlope	3
LotArea	1951
LotConfig	5
LotFrontage	129
LotShape	4
LowQualFinSF	36
MSSubClass	16
MSZoning	6
MasVnrArea	445
MasVnrType	5
MiscVal	38
MoSold	12
Neighborhood	25
OpenPorchSF	252
OverallCond	9
OverallQual	10
PavedDrive	3
PoolArea	14
RoofMatl	8
RoofStyle	6
SaleCondition	6
SaleType	10
ScreenPorch	121
Street	2
TotRmsAbvGrd	14
TotalBsmtSF	1059
Utilities	3
WoodDeckSF	379
YearBuilt	118
YrSold	5

In [22]:
# find unique values of each column
for i in df.columns:
print("Unique value of:>>> {} ({})\n{}\n".format(i, len(df[i].unique()), df[i].unique()))


Unique value of:>>> 1stFlrSF (1083)
[ 856 1262  920 ... 1778 1650 1960]

Unique value of:>>> 2ndFlrSF (635)
[ 854    0  866  756 1053  566  983  752 1142 1218  668 1320  631  716
676  860 1519  530  808  977 1330  833  765  462  213  548  960  670
1116  876  612 1031  881  790  755  592  939  520  639  656 1414  884
729 1523  728  351  688  941 1032  848  836  475  739 1151  448  896
524 1194  956 1070 1096  467  547  551  880  703  901  720  316 1518
704 1178  754  601 1360  929  445  564  882  920  518  817 1257  741
672 1306  504 1304 1100  730  689  591  888 1020  828  700  842 1286
864  829 1092  709  844 1106  596  807  625  649  698  840  780  568
795  648  975  702 1242 1818 1121  371  804  325  809 1200  871 1274
1347 1332 1177 1080  695  167  915  576  605  862  495  403  838  517
1427  784  711  468 1081  886  793  665  858  874  526  590  406 1157
299  936  438 1098  766 1101 1028 1017 1254  378 1160  682  110  600
678  834  384  512  930  868  224 1103  560  811  878  574  910  620
687  546  902 1000  846 1067  914  660 1538 1015 1237  611  707  527
1288  832  806 1182 1040  439  717  511 1129 1370  636  533  745  584
812  684  595  988  800  677  573 1066  778  661 1440  872  788  843
713  567  651  762  482  738  586  679  644  900  887 1872 1281  472
1312  319  978 1093  473  664 1540 1276  441  348 1060  714  744 1203
783 1097  734  767 1589  742  686 1128 1111 1174  787 1072 1088 1063
545  966  623  432  581  540  769 1051  761  779  514  455 1426  785
521  252  813 1120 1037 1169 1001 1215  928 1140 1243  571 1196 1038
561  979  701  332  368  883 1336 1141  634  912  798  985  826  831
750  456  602  855  336  408  980  998 1168 1208  797  850  898 1054
895  954  772 1230  727  454  370  628  304  582 1122 1134  885  640
580 1112  653  220  240 1362  534  539  650  918  933  712 1796  971
1175  743  523 1216 2065  272  685  776  630  984  875  913  464 1039
1259  940  892  725  924  764  925 1479  192  589  992  903  430  748
587  994  950 1323  732 1357  557 1296  390 1185  873 1611  457  796
908  550  989  932  358 1392  349  691 1349  768  208  622  857  556
1044  708  626  904  510 1104  830  981  870  694 1152  563  823  604
715  532  537  505  424  606  185  498  492  608 1074  662  499  180
942  558  614  328 1788 1075  380  615  645  663 1275  816  839 1325
1012 1295  683 1126 1089 1221  967  841 1209  897  786 1629  782 1369
972 1315  726  322  760  629  496  690  646  917  624  320  588  425
747 1114 1619  718  815  926  444  436 1240  516 1420 1158 1162 1139
1285 1061 1250  919  861  794  825  893 1319  959  792 1345  453  412
182  501  375  680  658  552  396  308  973  363  594  554  428  536
486 1721 1099  735  899 1198  343  673  442  890  943  330  420  770
1342 1377  845 1402 1036  570 1238  923  757 1048 1131 1407 1171 1277
995  528  863 1232  976 1008 1309  228  500  544 1778  616  494  642
659  671  144  525  423 1164  356  245 1042  477 1005 1087  638  400
376  916  927  869  753  450 1133  674  125  531  585  775  851  957
1340  955  990 1384 1862 1371 1405 1358  465  466 1335  814  488 1321
1029 1368 1567 1189 1234 1248  821 1007  476  502  867  297  810  434
583  341 1836  541 1246 1124 1045  827 1150  312  218  493  736  818
610  549  697  360 1004]

Unique value of:>>> 3SsnPorch (31)
[  0 320 407 130 180 168 140 508 238 245 196 144 182 162  23 216  96 153
290 304 224 255 225 360 150 174 120 219 176  86 323]

Unique value of:>>> BedroomAbvGr (8)
[3 4 1 2 0 5 6 8]

Unique value of:>>> BldgType (5)
['1Fam' '2fmCon' 'Duplex' 'TwnhsE' 'Twnhs']

Unique value of:>>> BsmtCond (5)
['TA' 'Gd' nan 'Fa' 'Po']

Unique value of:>>> BsmtExposure (5)
['No' 'Gd' 'Mn' 'Av' nan]

Unique value of:>>> BsmtFinSF1 (992)
[7.060e+02 9.780e+02 4.860e+02 2.160e+02 6.550e+02 7.320e+02 1.369e+03
8.590e+02 0.000e+00 8.510e+02 9.060e+02 9.980e+02 7.370e+02 7.330e+02
5.780e+02 6.460e+02 5.040e+02 8.400e+02 1.880e+02 2.340e+02 1.218e+03
1.277e+03 1.018e+03 1.153e+03 1.213e+03 7.310e+02 6.430e+02 9.670e+02
7.470e+02 2.800e+02 1.790e+02 4.560e+02 1.351e+03 2.400e+01 7.630e+02
1.820e+02 1.040e+02 1.810e+03 3.840e+02 4.900e+02 6.490e+02 6.320e+02
9.410e+02 7.390e+02 9.120e+02 1.013e+03 6.030e+02 1.880e+03 5.650e+02
3.200e+02 4.620e+02 2.280e+02 3.360e+02 4.480e+02 1.201e+03 3.300e+01
5.880e+02 6.000e+02 7.130e+02 1.046e+03 6.480e+02 3.100e+02 1.162e+03
5.200e+02 1.080e+02 5.690e+02 1.200e+03 2.240e+02 7.050e+02 4.440e+02
2.500e+02 9.840e+02 3.500e+01 7.740e+02 4.190e+02 1.700e+02 1.470e+03
9.380e+02 5.700e+02 3.000e+02 1.200e+02 1.160e+02 5.120e+02 5.670e+02
4.450e+02 6.950e+02 4.050e+02 1.005e+03 6.680e+02 8.210e+02 4.320e+02
1.300e+03 5.070e+02 6.790e+02 1.332e+03 2.090e+02 6.800e+02 7.160e+02
1.400e+03 4.160e+02 4.290e+02 2.220e+02 5.700e+01 6.600e+02 1.016e+03
3.700e+02 3.510e+02 3.790e+02 1.288e+03 3.600e+02 6.390e+02 4.950e+02
2.880e+02 1.398e+03 4.770e+02 8.310e+02 1.904e+03 4.360e+02 3.520e+02
6.110e+02 1.086e+03 2.970e+02 6.260e+02 5.600e+02 3.900e+02 5.660e+02
1.126e+03 1.036e+03 1.088e+03 6.410e+02 6.170e+02 6.620e+02 3.120e+02
1.065e+03 7.870e+02 4.680e+02 3.600e+01 8.220e+02 3.780e+02 9.460e+02
3.410e+02 1.600e+01 5.500e+02 5.240e+02 5.600e+01 3.210e+02 8.420e+02
6.890e+02 6.250e+02 3.580e+02 4.020e+02 9.400e+01 1.078e+03 3.290e+02
9.290e+02 6.970e+02 1.573e+03 2.700e+02 9.220e+02 5.030e+02 1.334e+03
3.610e+02 6.720e+02 5.060e+02 7.140e+02 4.030e+02 7.510e+02 2.260e+02
6.200e+02 5.460e+02 3.920e+02 4.210e+02 9.050e+02 9.040e+02 4.300e+02
6.140e+02 4.500e+02 2.100e+02 2.920e+02 7.950e+02 1.285e+03 8.190e+02
4.200e+02 8.410e+02 2.810e+02 8.940e+02 1.464e+03 7.000e+02 2.620e+02
1.274e+03 5.180e+02 1.236e+03 4.250e+02 6.920e+02 9.870e+02 9.700e+02
2.800e+01 2.560e+02 1.619e+03 4.000e+01 8.460e+02 1.124e+03 7.200e+02
8.280e+02 1.249e+03 8.100e+02 2.130e+02 5.850e+02 1.290e+02 4.980e+02
1.270e+03 5.730e+02 1.410e+03 1.082e+03 2.360e+02 3.880e+02 3.340e+02
8.740e+02 9.560e+02 7.730e+02 3.990e+02 1.620e+02 7.120e+02 6.090e+02
3.710e+02 5.400e+02 7.200e+01 6.230e+02 4.280e+02 3.500e+02 2.980e+02
1.445e+03 2.180e+02 9.850e+02 6.310e+02 1.280e+03 2.410e+02 6.900e+02
2.660e+02 7.770e+02 8.120e+02 7.860e+02 1.116e+03 7.890e+02 1.056e+03
5.000e+01 1.128e+03 7.750e+02 1.309e+03 1.246e+03 9.860e+02 6.160e+02
1.518e+03 6.640e+02 3.870e+02 4.710e+02 3.850e+02 3.650e+02 1.767e+03
1.330e+02 6.420e+02 2.470e+02 3.310e+02 7.420e+02 1.606e+03 9.160e+02
1.850e+02 5.440e+02 5.530e+02 3.260e+02 7.780e+02 3.860e+02 4.260e+02
3.680e+02 4.590e+02 1.350e+03 1.196e+03 6.300e+02 9.940e+02 1.680e+02
1.261e+03 1.567e+03 2.990e+02 8.970e+02 6.070e+02 8.360e+02 5.150e+02
3.740e+02 1.231e+03 1.110e+02 3.560e+02 4.000e+02 6.980e+02 1.247e+03
2.570e+02 3.800e+02 2.700e+01 1.410e+02 9.910e+02 6.500e+02 5.210e+02
1.436e+03 2.260e+03 7.190e+02 3.770e+02 1.330e+03 3.480e+02 1.219e+03
7.830e+02 9.690e+02 6.730e+02 1.358e+03 1.260e+03 1.440e+02 5.840e+02
5.540e+02 1.002e+03 6.190e+02 1.800e+02 5.590e+02 3.080e+02 8.660e+02
8.950e+02 6.370e+02 6.040e+02 1.302e+03 1.071e+03 2.900e+02 7.280e+02
2.000e+00 1.441e+03 9.430e+02 2.310e+02 4.140e+02 3.490e+02 4.420e+02
3.280e+02 5.940e+02 8.160e+02 1.460e+03 1.324e+03 1.338e+03 6.850e+02
1.422e+03 1.283e+03 8.100e+01 4.540e+02 9.030e+02 6.050e+02 9.900e+02
2.060e+02 1.500e+02 4.570e+02 4.800e+01 8.710e+02 4.100e+01 6.740e+02
6.240e+02 4.800e+02 1.154e+03 7.380e+02 4.930e+02 1.121e+03 2.820e+02
5.000e+02 1.310e+02 1.696e+03 8.060e+02 1.361e+03 9.200e+02 1.721e+03
1.870e+02 1.138e+03 9.880e+02 1.930e+02 5.510e+02 7.670e+02 1.186e+03
8.920e+02 3.110e+02 8.270e+02 5.430e+02 1.003e+03 1.059e+03 2.390e+02
9.450e+02 2.000e+01 1.455e+03 9.650e+02 9.800e+02 8.630e+02 5.330e+02
1.084e+03 1.173e+03 5.230e+02 1.148e+03 1.910e+02 1.234e+03 3.750e+02
8.080e+02 7.240e+02 1.520e+02 1.180e+03 2.520e+02 8.320e+02 5.750e+02
9.190e+02 4.390e+02 3.810e+02 4.380e+02 5.490e+02 6.120e+02 1.163e+03
4.370e+02 3.940e+02 1.416e+03 4.220e+02 7.620e+02 9.750e+02 1.097e+03
2.510e+02 6.860e+02 6.560e+02 5.680e+02 5.390e+02 8.620e+02 1.970e+02
5.160e+02 6.630e+02 6.080e+02 1.636e+03 7.840e+02 2.490e+02 1.040e+03
4.830e+02 1.960e+02 5.720e+02 3.380e+02 3.300e+02 1.560e+02 1.390e+03
5.130e+02 4.600e+02 6.590e+02 3.640e+02 5.640e+02 3.060e+02 5.050e+02
9.320e+02 7.500e+02 6.400e+01 6.330e+02 1.170e+03 8.990e+02 9.020e+02
1.238e+03 5.280e+02 1.024e+03 1.064e+03 2.850e+02 2.188e+03 4.650e+02
3.220e+02 8.600e+02 5.990e+02 3.540e+02 6.300e+01 2.230e+02 3.010e+02
4.430e+02 4.890e+02 2.840e+02 2.940e+02 8.140e+02 1.650e+02 5.520e+02
8.330e+02 4.640e+02 9.360e+02 7.720e+02 1.440e+03 7.480e+02 9.820e+02
3.980e+02 5.620e+02 4.840e+02 4.170e+02 6.990e+02 6.960e+02 8.960e+02
5.560e+02 1.106e+03 6.510e+02 8.670e+02 8.540e+02 1.646e+03 1.074e+03
5.360e+02 1.172e+03 9.150e+02 5.950e+02 1.237e+03 2.730e+02 6.840e+02
3.240e+02 1.165e+03 1.380e+02 1.513e+03 3.170e+02 1.012e+03 1.022e+03
5.090e+02 9.000e+02 1.085e+03 1.104e+03 2.400e+02 3.830e+02 6.440e+02
3.970e+02 7.400e+02 8.370e+02 2.200e+02 5.860e+02 5.350e+02 4.100e+02
7.500e+01 8.240e+02 5.920e+02 1.039e+03 5.100e+02 4.230e+02 6.610e+02
2.480e+02 7.040e+02 4.120e+02 1.032e+03 2.190e+02 7.080e+02 4.150e+02
1.004e+03 3.530e+02 7.020e+02 3.690e+02 6.220e+02 2.120e+02 6.450e+02
8.520e+02 1.150e+03 1.258e+03 2.750e+02 1.760e+02 2.960e+02 5.380e+02
1.157e+03 4.920e+02 1.198e+03 1.387e+03 5.220e+02 6.580e+02 1.216e+03
1.480e+03 2.096e+03 1.159e+03 4.400e+02 1.456e+03 8.830e+02 5.470e+02
7.880e+02 4.850e+02 3.400e+02 1.220e+03 4.270e+02 3.440e+02 7.560e+02
1.540e+03 6.660e+02 8.030e+02 1.000e+03 8.850e+02 1.386e+03 3.190e+02
5.340e+02 1.250e+02 1.314e+03 6.020e+02 1.920e+02 5.930e+02 8.040e+02
1.053e+03 5.320e+02 1.158e+03 1.014e+03 1.940e+02 1.670e+02 7.760e+02
5.644e+03 6.940e+02 1.572e+03 7.460e+02 1.406e+03 9.250e+02 4.820e+02
1.890e+02 7.650e+02 8.000e+01 1.443e+03 2.590e+02 7.350e+02 7.340e+02
1.447e+03 5.480e+02 3.150e+02 1.282e+03 4.080e+02 3.090e+02 2.030e+02
8.650e+02 2.040e+02 7.900e+02 1.320e+03 7.690e+02 1.070e+03 2.640e+02
7.590e+02 1.373e+03 9.760e+02 7.810e+02 2.500e+01 1.110e+03 4.040e+02
5.800e+02 6.780e+02 9.580e+02 1.336e+03 1.079e+03 4.900e+01 8.300e+02
9.230e+02 7.910e+02 2.630e+02 9.350e+02 1.051e+03 5.140e+02 1.100e+02
1.414e+03 1.260e+02 1.129e+03 1.298e+03 3.760e+02 4.660e+02 2.440e+02
1.137e+03 6.870e+02 1.010e+03 1.500e+03 6.700e+02 9.440e+02 1.188e+03
8.560e+02 3.390e+02 4.810e+02 7.170e+02 5.790e+02 2.740e+02 7.800e+02
2.830e+02 4.740e+02 4.520e+02 2.760e+02 9.600e+02 7.660e+02 1.026e+03
7.300e+01 7.360e+02 1.319e+03 2.670e+02 1.092e+03 9.640e+02 9.540e+02
1.346e+03 1.433e+03 8.700e+02 1.980e+02 1.682e+03 2.380e+02 3.430e+02
7.600e+01 6.150e+02 7.800e+01 4.200e+01 4.690e+02 2.070e+02 4.580e+02
4.760e+02 1.341e+03 8.440e+02 8.470e+02 8.500e+02 1.965e+03 7.410e+02
3.630e+02 2.250e+02 1.333e+03 8.880e+02 6.360e+02 7.260e+02 2.540e+02
4.350e+02 3.890e+02 2.790e+02 1.360e+03 1.232e+03 2.288e+03 1.531e+03
1.230e+03 1.015e+03 1.037e+03 1.142e+03 1.262e+03 1.972e+03 8.810e+02
8.760e+02 2.146e+03 1.557e+03 8.000e+02 6.520e+02 4.940e+02 6.830e+02
9.130e+02 1.294e+03 2.158e+03 6.820e+02 1.430e+03 7.710e+02 5.400e+01
5.200e+01 6.800e+01 8.640e+02 1.400e+02 1.733e+03 6.010e+02 9.620e+02
1.252e+03 1.210e+02 9.550e+02 1.000e+02 1.312e+03 1.720e+02 1.550e+02
9.310e+02 8.720e+02 7.450e+02 6.210e+02 4.330e+02 8.260e+02 1.340e+02
1.690e+02 7.490e+02 1.152e+03 5.270e+02 3.420e+02 1.730e+02 7.000e+01
1.094e+03 8.200e+02 1.021e+03 1.359e+03 7.550e+02 9.500e+02 6.060e+02
1.259e+03 7.100e+02 1.111e+03 1.478e+03 3.320e+02 7.930e+02 2.460e+02
1.540e+02 6.500e+01 1.476e+03 5.500e+01 1.758e+03 1.115e+03 1.640e+03
1.140e+02 7.180e+02 4.960e+02 1.337e+03 1.034e+03 9.830e+02 1.206e+03
8.900e+02 1.023e+03 1.190e+02 2.860e+02 1.728e+03 1.375e+03 1.420e+03
2.257e+03 1.149e+03 1.075e+03 3.720e+02 1.204e+03 1.073e+03 1.087e+03
1.660e+03 1.096e+03 7.290e+02 3.620e+02 5.370e+02 4.720e+02 5.300e+01
7.640e+02 1.900e+02 1.027e+03 1.141e+03 6.810e+02 8.130e+02 1.280e+02
1.044e+03 2.600e+02 5.830e+02 3.200e+01 5.310e+02 1.480e+02 7.440e+02
9.600e+01 5.900e+02 2.000e+02 4.060e+02 1.750e+02 2.010e+02       nan
7.580e+02 2.210e+02 6.340e+02 1.035e+03 7.790e+02 1.271e+03 3.550e+02
2.085e+03 7.700e+02 7.220e+02 1.308e+03 6.880e+02 8.800e+01 1.194e+03
1.538e+03 1.593e+03 1.033e+03 3.660e+02 1.474e+03 1.383e+03 8.930e+02
1.029e+03 1.223e+03 1.011e+03 1.571e+03 3.180e+02 5.010e+02 7.850e+02
6.380e+02 6.470e+02 8.380e+02 1.860e+02 9.260e+02 1.101e+03 1.047e+03
7.970e+02 1.558e+03 1.328e+03 3.140e+02 9.300e+02 7.250e+02 1.151e+03
1.304e+03 1.812e+03 1.684e+03 6.690e+02 1.178e+03 1.030e+03 8.480e+02
9.180e+02 5.740e+02 1.181e+03 1.048e+03 3.350e+02 1.225e+03 7.270e+02
9.680e+02 6.000e+01 9.370e+02 9.010e+02 1.732e+03 1.632e+03 9.730e+02
9.100e+02 3.460e+02 7.920e+02 6.540e+02 1.300e+02 8.730e+02 9.080e+02
4.410e+02 8.500e+01 2.420e+02 9.520e+02 1.098e+03 7.820e+02 1.220e+02
3.160e+02 2.580e+02 5.870e+02 4.910e+02 4.530e+02 5.570e+02 1.080e+03
4.970e+02 5.100e+01 5.020e+02 6.710e+02 1.412e+03 7.090e+02 1.320e+02
4.010e+03 4.670e+02 7.700e+01 1.130e+02 5.770e+02 4.340e+02 1.001e+03
1.392e+03 1.239e+03 9.240e+02 9.490e+02 2.150e+02 1.329e+03 1.112e+03
7.960e+02 8.110e+02 1.090e+03 5.960e+02 1.127e+03 2.050e+02 1.191e+03
9.510e+02 3.820e+02 3.730e+02 1.505e+03 1.290e+03 8.800e+02 1.038e+03
1.182e+03 1.562e+03 1.836e+03 2.780e+02 1.810e+02 1.118e+03 7.600e+02
7.990e+02 9.960e+02 9.390e+02 9.140e+02 2.710e+02 4.880e+02 7.010e+02
4.550e+02 8.090e+02 9.530e+02 2.080e+02 1.430e+02 5.760e+02 3.470e+02
7.940e+02 2.300e+02 2.610e+02 3.930e+02 1.576e+03 1.122e+03 8.530e+02
4.750e+02 6.910e+02 4.240e+02 3.050e+02 5.260e+02 1.564e+03 9.090e+02
1.136e+03 1.243e+03 1.490e+02 1.224e+03 3.370e+02]

Unique value of:>>> BsmtFinSF2 (273)
[   0.   32.  668.  486.   93.  491.  506.  712.  362.   41.  169.  869.
150.  670.   28. 1080.  181.  768.  215.  374.  208.  441.  184.  279.
306.  180.  580.  690.  692.  228.  125. 1063.  620.  175.  820. 1474.
264.  479.  147.  232.  380.  544.  294.  258.  121.  391.  531.  344.
539.  713.  210.  311. 1120.  165.  532.   96.  495.  174. 1127.  139.
202.  645.  123.  551.  219.  606.  612.  480.  182.  132.  336.  468.
287.   35.  499.  723.  119.   40.  117.  239.   80.  472.   64. 1057.
127.  630.  128.  377.  764.  345. 1085.  435.  823.  500.  290.  324.
634.  411.  841. 1061.  466.  396.  354.  149.  193.  273.  465.  400.
682.  557.  230.  106.  791.  240.  547.  469.  177.  108.  600.  492.
211.  168. 1031.  438.  375.  144.   81.  906.  608.  276.  661.   68.
173.  972.  105.  420.  546.  334.  352.  872.  110.  627.  163. 1029.
78.  859.  981.   42.   46.  162.  350.  263. 1073.   12.  159.  474.
453.  684.  387.  688.  252.  590.  284.  622.  113. 1526.  360.  774.
364.  596.  884.   92.  216.  136.  201.  512.  247.  483.  750.   60.
102.   95.   63.  262.  393.  286.  450.   72.  243.  694.  875.  507.
419.  250.  116.  624.   76.  270.  288.  186.  449.   48.  613.  852.
555.  799.  811.  842.  382.  456.  308.   52.  196.  488.  319.   nan
956.  120.  679.  604.  153.  619.    6.  351. 1037.  829.   38.  206.
167.  543.  259.  404.  138.  955.  691.   66.  154.  442.  448.  227.
398.  722.  761.  529.  522.  873.  891.  755.  321.  915.  417.  432.
831.  278. 1020.  530.  904.  156. 1393. 1039.  497.  402.  748.  281.
912.  373.  982.  826.  850. 1164. 1083.  337.  297.]

Unique value of:>>> BsmtFinType1 (7)
['GLQ' 'ALQ' 'Unf' 'Rec' 'BLQ' nan 'LwQ']

Unique value of:>>> BsmtFinType2 (7)
['Unf' 'BLQ' nan 'ALQ' 'Rec' 'LwQ' 'GLQ']

Unique value of:>>> BsmtFullBath (5)
[ 1.  0.  2.  3. nan]

Unique value of:>>> BsmtHalfBath (4)
[ 0.  1.  2. nan]

Unique value of:>>> BsmtQual (5)
['Gd' 'TA' 'Ex' nan 'Fa']

Unique value of:>>> BsmtUnfSF (1136)
[ 150.  284.  434. ...  129.   45. 1503.]

Unique value of:>>> CentralAir (2)
['Y' 'N']

Unique value of:>>> Condition1 (9)
['Norm' 'Feedr' 'PosN' 'Artery' 'RRAe' 'RRNn' 'RRAn' 'PosA' 'RRNe']

Unique value of:>>> Condition2 (8)
['Norm' 'Artery' 'RRNn' 'Feedr' 'PosN' 'PosA' 'RRAn' 'RRAe']

Unique value of:>>> Electrical (6)
['SBrkr' 'FuseF' 'FuseA' 'FuseP' 'Mix' nan]

Unique value of:>>> EnclosedPorch (183)
[   0  272  228  205  176   87  172  102   37  144   64  114  202  128
156   44   77  192  140  180  183   39  184   40  552   30  126   96
60  150  120  112  252   52  224  234  244  268  137   24  108  294
177  218  242   91  160  130  169  105   34  248  236   32   80  115
291  116  158  210   36  200   84  148  136  240   54  100  189  293
164  216  239   67   90   56  129   98  143   70  386  154  185  134
196  264  275  230  254   68  194  318   48   94  138  226  174   19
170  220  214  280  190  330  208  145  259   81   42  123  162  286
168   20  301  198  221  212   50   99  186  113  135  334  246   18
41   35  364   45   86  265  222  209  260  203  432   25  238   51
213  288  211   55   57   78   72  368  165   92   16   66  109  139
219  101  117  204  122  231  121  207  249  290  175   26   88 1012
43  584  133  324  161   75  167   28  104  296  256  225  429  132
23]

Unique value of:>>> ExterCond (5)
['TA' 'Gd' 'Fa' 'Po' 'Ex']

Unique value of:>>> ExterQual (4)
['Gd' 'TA' 'Ex' 'Fa']

Unique value of:>>> Exterior1st (16)
['VinylSd' 'MetalSd' 'Wd Sdng' 'HdBoard' 'BrkFace' 'WdShing' 'CemntBd'
'Plywood' 'AsbShng' 'Stucco' 'BrkComm' 'AsphShn' 'Stone' 'ImStucc'
'CBlock' nan]

Unique value of:>>> Exterior2nd (17)
['VinylSd' 'MetalSd' 'Wd Shng' 'HdBoard' 'Plywood' 'Wd Sdng' 'CmentBd'
'BrkFace' 'Stucco' 'AsbShng' 'Brk Cmn' 'ImStucc' 'AsphShn' 'Stone'
'Other' 'CBlock' nan]

Unique value of:>>> Fireplaces (5)
[0 1 2 3 4]

Unique value of:>>> Foundation (6)
['PConc' 'CBlock' 'BrkTil' 'Wood' 'Slab' 'Stone']

Unique value of:>>> FullBath (5)
[2 1 3 0 4]

Unique value of:>>> Functional (8)
['Typ' 'Min1' 'Maj1' 'Min2' 'Mod' 'Maj2' 'Sev' nan]

Unique value of:>>> GarageArea (604)
[ 548.  460.  608.  642.  836.  480.  636.  484.  468.  205.  384.  736.
352.  840.  576.  516.  294.  853.  280.  534.  572.  270.  890.  772.
319.  240.  250.  271.  447.  556.  691.  672.  498.  246.    0.  440.
308.  504.  300.  670.  826.  386.  388.  528.  894.  565.  641.  288.
645.  852.  558.  220.  667.  360.  427.  490.  379.  297.  283.  509.
405.  758.  461.  400.  462.  420.  432.  506.  684.  472.  366.  476.
410.  740.  648.  273.  546.  325.  792.  450.  180.  430.  594.  390.
540.  264.  530.  435.  453.  750.  487.  624.  471.  318.  766.  660.
470.  720.  577.  380.  434.  866.  495.  564.  312.  625.  680.  678.
726.  532.  216.  303.  789.  511.  616.  521.  451. 1166.  252.  497.
682.  666.  786.  795.  856.  473.  398.  500.  349.  454.  644.  299.
210.  431.  438.  675.  968.  721.  336.  810.  494.  457.  818.  463.
604.  389.  538.  520.  309.  429.  673.  884.  868.  492.  413.  924.
1053.  439.  671.  338.  573.  732.  505.  575.  626.  898.  529.  685.
281.  539.  418.  588.  282.  375.  683.  843.  552.  870.  888.  746.
708.  513. 1025.  656.  872.  292.  441.  189.  880.  676.  301.  474.
706.  617.  445.  200.  592.  566.  514.  296.  244.  610.  834.  639.
501.  846.  560.  596.  600.  373.  947.  350.  396.  864.  304.  784.
696.  569.  628.  550.  493.  578.  198.  422.  228.  526.  525.  908.
499.  508.  694.  874.  164.  402.  515.  286.  603.  900.  583.  889.
858.  502.  392.  403.  527.  765.  367.  426.  615.  871.  570.  406.
590.  612.  650. 1390.  275.  452.  842.  816.  621.  544.  486.  230.
261.  531.  393.  774.  749.  364.  627.  260.  256.  478.  442.  562.
512.  839.  330.  711. 1134.  416.  779.  702.  567.  832.  326.  551.
606.  739.  408.  475.  704.  983.  768.  632.  541.  320.  800.  831.
554.  878.  752.  614.  481.  496.  423.  841.  895.  412.  865.  630.
605.  602.  618.  444.  397.  455.  409.  820. 1020.  598.  857.  595.
433.  776. 1220.  458.  613.  456.  436.  812.  686.  611.  425.  343.
479.  619.  902.  574.  523.  414.  738.  354.  483.  327.  756.  690.
284.  833.  601.  533.  522.  788.  555.  689.  796.  808.  510.  255.
424.  305.  368.  824.  328.  160.  437.  665.  290.  912.  905.  542.
716.  586.  467.  582. 1248. 1043.  254.  712.  719.  862.  928.  782.
466.  714. 1052.  225.  234.  324.  306.  830.  807.  358.  186.  693.
482.  813.  995.  757. 1356.  459.  701.  322.  315.  668.  404.  543.
954.  850.  477.  276.  518. 1014.  753. 1418.  213.  844.  860.  748.
248.  287.  825.  647.  342.  770.  663.  377.  804.  936.  722.  208.
662.  754.  622.  620.  370. 1069.  372.  923.  192.  730.  751.  958.
962.  762.  713.  535.  517.  263.  780.  363.  365.  231.  591.  209.
1017.  580.  399.  741.  253.  581.  345.  896.  932.  640.  927.  700.
886.  949.  649.  394.  658.  815.  623.  972.  984.  692.  845.  559.
465.  524.  561.  549.  907.  162.  357.  207. 1184.  316.  226.  340.
266. 1138.  904. 1231.  195.  313.  215.  307.  295.  351.  885.  920.
698.  557.  489. 1314.  787. 1150. 1003.  944.  428.  687.  938.  783.
851.  545.  469.  464.  267. 1488.  401.  311.  828.  869.  355.  249.
1348.  811.  725.  715.  814.  369.  599.  344.  356.  185.  892.  257.
729. 1110.  724.  585.  488. 1040. 1174.  728.  916.  876.  631.  925.
806.  933. 1092.  859.  744. 1105.  310.  293.  371. 1200.  184.  374.
331.  224.  217.  323.  638.  332.  674.  747.  242.  597.  579. 1154.
nan  100.  571. 1041.  963.  443.  773.  485. 1085.  899.  959.  803.
760.  584.  449.  688.  568.  353.  791. 1008.  378.  258.  848.  317.
646.  265.  609.  272.]

Unique value of:>>> GarageCars (7)
[ 2.  3.  1.  0.  4.  5. nan]

Unique value of:>>> GarageCond (6)
['TA' 'Fa' nan 'Gd' 'Po' 'Ex']

Unique value of:>>> GarageFinish (4)
['RFn' 'Unf' 'Fin' nan]

Unique value of:>>> GarageQual (6)
['TA' 'Fa' 'Gd' nan 'Ex' 'Po']

Unique value of:>>> GarageType (7)
['Attchd' 'Detchd' 'BuiltIn' 'CarPort' nan 'Basment' '2Types']

Unique value of:>>> GarageYrBlt (104)
[2003. 1976. 2001. 1998. 2000. 1993. 2004. 1973. 1931. 1939. 1965. 2005.
1962. 2006. 1960. 1991. 1970. 1967. 1958. 1930. 2002. 1968. 2007. 2008.
1957. 1920. 1966. 1959. 1995. 1954. 1953.   nan 1983. 1977. 1997. 1985.
1963. 1981. 1964. 1999. 1935. 1990. 1945. 1987. 1989. 1915. 1956. 1948.
1974. 2009. 1950. 1961. 1921. 1900. 1979. 1951. 1969. 1936. 1975. 1971.
1923. 1984. 1926. 1955. 1986. 1988. 1916. 1932. 1972. 1918. 1980. 1924.
1996. 1940. 1949. 1994. 1910. 1978. 1982. 1992. 1925. 1941. 2010. 1927.
1947. 1937. 1942. 1938. 1952. 1928. 1922. 1934. 1906. 1914. 1946. 1908.
1929. 1933. 1917. 1896. 1895. 2207. 1943. 1919.]

Unique value of:>>> GrLivArea (1292)
[1710 1262 1786 ... 2315  641 1778]

Unique value of:>>> HalfBath (3)
[1 0 2]

Unique value of:>>> Heating (6)
['GasA' 'GasW' 'Grav' 'Wall' 'OthW' 'Floor']

Unique value of:>>> HeatingQC (5)
['Ex' 'Gd' 'TA' 'Fa' 'Po']

Unique value of:>>> HouseStyle (8)
['2Story' '1Story' '1.5Fin' '1.5Unf' 'SFoyer' 'SLvl' '2.5Unf' '2.5Fin']

Unique value of:>>> KitchenAbvGr (4)
[1 2 3 0]

Unique value of:>>> KitchenQual (5)
['Gd' 'TA' 'Ex' 'Fa' nan]

Unique value of:>>> LandContour (4)
['Lvl' 'Bnk' 'Low' 'HLS']

Unique value of:>>> LandSlope (3)
['Gtl' 'Mod' 'Sev']

Unique value of:>>> LotArea (1951)
[ 8450  9600 11250 ...  1894 20000 10441]

Unique value of:>>> LotConfig (5)
['Inside' 'FR2' 'Corner' 'CulDSac' 'FR3']

Unique value of:>>> LotFrontage (129)
[ 65.  80.  68.  60.  84.  85.  75.  nan  51.  50.  70.  91.  72.  66.
101.  57.  44. 110.  98.  47. 108. 112.  74. 115.  61.  48.  33.  52.
100.  24.  89.  63.  76.  81.  95.  69.  21.  32.  78. 121. 122.  40.
105.  73.  77.  64.  94.  34.  90.  55.  88.  82.  71. 120. 107.  92.
134.  62.  86. 141.  97.  54.  41.  79. 174.  99.  67.  83.  43. 103.
93.  30. 129. 140.  35.  37. 118.  87. 116. 150. 111.  49.  96.  59.
36.  56. 102.  58.  38. 109. 130.  53. 137.  45. 106. 104.  42.  39.
144. 114. 128. 149. 313. 168. 182. 138. 160. 152. 124. 153.  46.  26.
25. 119.  31.  28. 117. 113. 125. 135. 136.  22. 123. 195. 155. 126.
200. 131. 133.]

Unique value of:>>> LotShape (4)
['Reg' 'IR1' 'IR2' 'IR3']

Unique value of:>>> LowQualFinSF (36)
[   0  360  513  234  528  572  144  392  371  390  420  473  156  515
80   53  232  481  120  514  397  479  205  384  362 1064  431  436
259  312  108  697  512  114  140  450]

Unique value of:>>> MSSubClass (16)
[ 60  20  70  50 190  45  90 120  30  85  80 160  75 180  40 150]

Unique value of:>>> MSZoning (6)
['RL' 'RM' 'C (all)' 'FV' 'RH' nan]

Unique value of:>>> MasVnrArea (445)
[1.960e+02 0.000e+00 1.620e+02 3.500e+02 1.860e+02 2.400e+02 2.860e+02
3.060e+02 2.120e+02 1.800e+02 3.800e+02 2.810e+02 6.400e+02 2.000e+02
2.460e+02 1.320e+02 6.500e+02 1.010e+02 4.120e+02 2.720e+02 4.560e+02
1.031e+03 1.780e+02 5.730e+02 3.440e+02 2.870e+02 1.670e+02 1.115e+03
4.000e+01 1.040e+02 5.760e+02 4.430e+02 4.680e+02 6.600e+01 2.200e+01
2.840e+02 7.600e+01 2.030e+02 6.800e+01 1.830e+02 4.800e+01 2.800e+01
3.360e+02 6.000e+02 7.680e+02 4.800e+02 2.200e+02 1.840e+02 1.129e+03
1.160e+02 1.350e+02 2.660e+02 8.500e+01 3.090e+02 1.360e+02 2.880e+02
7.000e+01 3.200e+02 5.000e+01 1.200e+02 4.360e+02 2.520e+02 8.400e+01
6.640e+02 2.260e+02 3.000e+02 6.530e+02 1.120e+02 4.910e+02 2.680e+02
7.480e+02 9.800e+01 2.750e+02 1.380e+02 2.050e+02 2.620e+02 1.280e+02
2.600e+02 1.530e+02 6.400e+01 3.120e+02 1.600e+01 9.220e+02 1.420e+02
2.900e+02 1.270e+02 5.060e+02 2.970e+02       nan 6.040e+02 2.540e+02
3.600e+01 1.020e+02 4.720e+02 4.810e+02 1.080e+02 3.020e+02 1.720e+02
3.990e+02 2.700e+02 4.600e+01 2.100e+02 1.740e+02 3.480e+02 3.150e+02
2.990e+02 3.400e+02 1.660e+02 7.200e+01 3.100e+01 3.400e+01 2.380e+02
1.600e+03 3.650e+02 5.600e+01 1.500e+02 2.780e+02 2.560e+02 2.250e+02
3.700e+02 3.880e+02 1.750e+02 2.960e+02 1.460e+02 1.130e+02 1.760e+02
6.160e+02 3.000e+01 1.060e+02 8.700e+02 3.620e+02 5.300e+02 5.000e+02
5.100e+02 2.470e+02 3.050e+02 2.550e+02 1.250e+02 1.000e+02 4.320e+02
1.260e+02 4.730e+02 7.400e+01 1.450e+02 2.320e+02 3.760e+02 4.200e+01
1.610e+02 1.100e+02 1.800e+01 2.240e+02 2.480e+02 8.000e+01 3.040e+02
2.150e+02 7.720e+02 4.350e+02 3.780e+02 5.620e+02 1.680e+02 8.900e+01
2.850e+02 3.600e+02 9.400e+01 3.330e+02 9.210e+02 7.620e+02 5.940e+02
2.190e+02 1.880e+02 4.790e+02 5.840e+02 1.820e+02 2.500e+02 2.920e+02
2.450e+02 2.070e+02 8.200e+01 9.700e+01 3.350e+02 2.080e+02 4.200e+02
1.700e+02 4.590e+02 2.800e+02 9.900e+01 1.920e+02 2.040e+02 2.330e+02
1.560e+02 4.520e+02 5.130e+02 2.610e+02 1.640e+02 2.590e+02 2.090e+02
2.630e+02 2.160e+02 3.510e+02 6.600e+02 3.810e+02 5.400e+01 5.280e+02
2.580e+02 4.640e+02 5.700e+01 1.470e+02 1.170e+03 2.930e+02 6.300e+02
4.660e+02 1.090e+02 4.100e+01 1.600e+02 2.890e+02 6.510e+02 1.690e+02
9.500e+01 4.420e+02 2.020e+02 3.380e+02 8.940e+02 3.280e+02 6.730e+02
6.030e+02 1.000e+00 3.750e+02 9.000e+01 3.800e+01 1.570e+02 1.100e+01
1.400e+02 1.300e+02 1.480e+02 8.600e+02 4.240e+02 1.047e+03 2.430e+02
8.160e+02 3.870e+02 2.230e+02 1.580e+02 1.370e+02 1.150e+02 1.890e+02
2.740e+02 1.170e+02 6.000e+01 1.220e+02 9.200e+01 4.150e+02 7.600e+02
2.700e+01 7.500e+01 3.610e+02 1.050e+02 3.420e+02 2.980e+02 5.410e+02
2.360e+02 1.440e+02 4.230e+02 4.400e+01 1.510e+02 9.750e+02 4.500e+02
2.300e+02 5.710e+02 2.400e+01 5.300e+01 2.060e+02 1.400e+01 3.240e+02
2.950e+02 3.960e+02 6.700e+01 1.540e+02 4.250e+02 4.500e+01 1.378e+03
3.370e+02 1.490e+02 1.430e+02 5.100e+01 1.710e+02 2.340e+02 6.300e+01
7.660e+02 3.200e+01 8.100e+01 1.630e+02 5.540e+02 2.180e+02 6.320e+02
1.140e+02 5.670e+02 3.590e+02 4.510e+02 6.210e+02 7.880e+02 8.600e+01
7.960e+02 3.910e+02 2.280e+02 8.800e+01 1.650e+02 4.280e+02 4.100e+02
5.640e+02 3.680e+02 3.180e+02 5.790e+02 6.500e+01 7.050e+02 4.080e+02
2.440e+02 1.230e+02 3.660e+02 7.310e+02 4.480e+02 2.940e+02 3.100e+02
2.370e+02 4.260e+02 9.600e+01 4.380e+02 1.940e+02 1.190e+02 2.000e+01
5.040e+02 4.920e+02 6.150e+02 1.095e+03 1.159e+03 2.650e+02 9.100e+01
7.710e+02 4.700e+01 1.770e+02 3.710e+02 4.300e+02 4.400e+02 2.290e+02
7.260e+02 4.180e+02 7.240e+02 3.830e+02 7.300e+02 4.700e+02 3.080e+02
6.340e+02 3.720e+02 1.980e+02 1.210e+02 2.640e+02 1.410e+02 2.830e+02
5.090e+02 2.170e+02 3.000e+00 6.570e+02 1.240e+02 4.440e+02 2.300e+01
2.420e+02 3.640e+02 3.520e+02 4.060e+02 4.020e+02 4.220e+02 3.560e+02
6.800e+02 1.110e+03 2.210e+02 7.140e+02 6.470e+02 1.290e+03 4.950e+02
5.680e+02 1.790e+02 1.050e+03 1.870e+02 5.200e+01 2.760e+02 3.900e+01
1.900e+02 2.510e+02 2.270e+02 1.340e+02 2.220e+02 5.800e+01 6.680e+02
6.740e+02 1.970e+02 7.100e+02 9.450e+02 5.490e+02 2.530e+02 4.000e+02
9.700e+02 5.020e+02 3.940e+02 2.350e+02 5.150e+02 5.260e+02 7.540e+02
3.530e+02 5.250e+02 8.700e+01 2.910e+02 6.900e+01 2.790e+02 3.230e+02
2.140e+02 5.190e+02 1.224e+03 6.520e+02 8.860e+02 9.020e+02 4.340e+02
6.620e+02 7.340e+02 5.500e+02 5.140e+02 3.850e+02 5.180e+02 5.720e+02
3.220e+02 8.770e+02 3.970e+02 7.380e+02 5.010e+02 1.180e+02 6.920e+02
3.320e+02 5.220e+02 3.790e+02 5.320e+02 6.200e+01 1.990e+02 3.550e+02
4.050e+02 3.270e+02 2.570e+02 3.820e+02]

Unique value of:>>> MasVnrType (5)
['BrkFace' 'None' 'Stone' 'BrkCmn' nan]

Unique value of:>>> MiscVal (38)
[    0   700   350   500   400   480   450 15500  1200   800  2000   600
3500  1300    54   620   560  1400  8300  1150  2500 12500  1500   300
80   490   650   900   750  6500  1000  4500  3000 17000  1512   455
460   420]

Unique value of:>>> MoSold (12)
[ 2  5  9 12 10  8 11  4  1  7  3  6]

Unique value of:>>> Neighborhood (25)
['CollgCr' 'Veenker' 'Crawfor' 'NoRidge' 'Mitchel' 'Somerst' 'NWAmes'
'OldTown' 'BrkSide' 'Sawyer' 'NridgHt' 'NAmes' 'SawyerW' 'IDOTRR'
'MeadowV' 'Edwards' 'Timber' 'Gilbert' 'StoneBr' 'ClearCr' 'NPkVill'
'Blmngtn' 'BrDale' 'SWISU' 'Blueste']

Unique value of:>>> OpenPorchSF (252)
[ 61   0  42  35  84  30  57 204   4  21  33 213 112 102 154 159 110  90
56  32  50 258  54  65  38  47  64  52 138 104  82  43 146  75  72  70
49  11  36 151  29  94 101 199  99 234 162  63  68  46  45 122 184 120
20  24 130 205 108  80  66  48  25  96 111 106  40 114   8 136 132  62
228  60 238 260  27  74  16 198  26  83  34  55  22  98 172 119 208 105
140 168  28  39 148  12  51 150 117 250  10  81  44 144 175 195 128  76
17  59 214 121  53 231 134 192 123  78 187  85 133 176 113 137 125 523
100 285  88 406 155  73 182 502 274 158 142 243 235 312 124 267 265  87
288  23 152 341 116 160 174 247 291  18 170 156 166 129 418 240  77 364
188 207  67  69 131 191  41 118 252 189 282 135  95 224 169 319  58  93
244 185 200  92 180 263 304 229 103 211 287 292 241 547  91  86 262 210
141  15 126 236 278 197 273 190 183 165 226 178 177 254 215 222 193 201
173 153 251 230 299 365 139 216  89 372 217 276 164 368 203 127 256 194
324 171 570 484 742 444 266  97  37 246  31 382   6 115 253 245 107 225]

Unique value of:>>> OverallCond (9)
[5 8 6 7 4 2 3 9 1]

Unique value of:>>> OverallQual (10)
[ 7  6  8  5  9  4 10  3  1  2]

Unique value of:>>> PavedDrive (3)
['Y' 'N' 'P']

Unique value of:>>> PoolArea (14)
[  0 512 648 576 555 480 519 738 144 368 444 228 561 800]

Unique value of:>>> RoofMatl (8)
['CompShg' 'WdShngl' 'Metal' 'WdShake' 'Membran' 'Tar&Grv' 'Roll'
'ClyTile']

Unique value of:>>> RoofStyle (6)
['Gable' 'Hip' 'Gambrel' 'Mansard' 'Flat' 'Shed']

Unique value of:>>> SaleCondition (6)
['Normal' 'Abnorml' 'Partial' 'AdjLand' 'Alloca' 'Family']

Unique value of:>>> SaleType (10)
['WD' 'New' 'COD' 'ConLD' 'ConLI' 'CWD' 'ConLw' 'Con' 'Oth' nan]

Unique value of:>>> ScreenPorch (121)
[  0 176 198 291 252  99 184 168 130 142 192 410 224 266 170 154 153 144
128 259 160 271 234 374 185 182  90 396 140 276 180 161 145 200 122  95
120  60 126 189 260 147 385 287 156 100 216 210 197 204 225 152 175 312
222 265 322 190 233  63  53 143 273 288 263  80 163 116 480 178 440 155
220 119 165  40 256 240 148 166 108 490 196 121  92 342 255 111 112 231
110 117 195 115 141 208  94 164  64 576 227 221 171 135 174 217 201 109
150  84 228 138  88 280 123 264 270 162 348 113 104]

Unique value of:>>> Street (2)
['Pave' 'Grvl']

Unique value of:>>> TotRmsAbvGrd (14)
[ 8  6  7  9  5 11  4 10 12  3  2 14 13 15]

Unique value of:>>> TotalBsmtSF (1059)
[ 856. 1262.  920. ...  498.  432. 1381.]

Unique value of:>>> Utilities (3)
['AllPub' 'NoSeWa' nan]

Unique value of:>>> WoodDeckSF (379)
[   0  298  192   40  255  235   90  147  140  160   48  240  171  100
406  222  288   49  203  113  392  145  196  168  112  106  857  115
120   12  576  301  144  300   74  127  232  158  352  182  180  166
224   80  367   53  188  105   24   98  276  200  409  239  400  476
178  574  237  210  441  116  280  104   87  132  238  149  355   60
139  108  351  209  216  248  143  365  370   58  197  263  123  138
333  250  292   95  262   81  289  124  172  110  208  468  256  302
190  340  233  184  201  142  122  155  670  135  495  536  306   64
364  353   66  159  146  296  125   44  215  264   88   89   96  414
519  206  141  260  324  156  220   38  261  126   85  466  270   78
169  320  268   72  349   42   35  326  382  161  179  103  253  148
335  176  390  328  312  185  269  195   57  236  517  304  198  426
28  316  322  307  257  219  416  344  380   68  114  327  165  187
181   92  228  245  503  315  241  303  133  403   36   52  265  207
150  290  486  278   70  418  234   26  342   97  272  121  243  511
154  164  173  384  202   56  321   86  194  421  305  117  550  509
153  394  371   63  252  136  186  170  474  214  199  728  436   55
431  448  361  362  162  229  439  379  356   84  635  325   33  212
314  242  294   30  128   45  177  227  218  309  404  500  668  402
283  183  175  586  295   32  366  736  393  360  157  483  275   23
277  657   51   54  221  226  496  336  450   71  331  375  174   22
287  129  225  319   99  230  231  297  205  462  502  501  266  244
189  131   73  329  279  467  119  308  152   16  411  358  385   20
25  490   76  204  311  102   50  424  339  211  259  134  213  318
428  282  167  407  130  460  286  193  455  284  285   14  521  646
386  405  546  118  291  274 1424  690  330  246  444  354  247  870
432    4  641   94  191   75  631  345  520   27   77  684  453  413
530]

Unique value of:>>> YearBuilt (118)
[2003 1976 2001 1915 2000 1993 2004 1973 1931 1939 1965 2005 1962 2006
1960 1929 1970 1967 1958 1930 2002 1968 2007 1951 1957 1927 1920 1966
1959 1994 1954 1953 1955 1983 1975 1997 1934 1963 1981 1964 1999 1972
1921 1945 1982 1998 1956 1948 1910 1995 1991 2009 1950 1961 1977 1985
1979 1885 1919 1990 1969 1935 1988 1971 1952 1936 1923 1924 1984 1926
1940 1941 1987 1986 2008 1908 1892 1916 1932 1918 1912 1947 1925 1900
1980 1989 1992 1949 1880 1928 1978 1922 1996 2010 1946 1913 1937 1942
1938 1974 1893 1914 1906 1890 1898 1904 1882 1875 1911 1917 1872 1905
1907 1896 1902 1895 1879 1901]

Unique value of:>>> YearRemodAdd (61)
[2003 1976 2002 1970 2000 1995 2005 1973 1950 1965 2006 1962 2007 1960
2001 1967 2004 2008 1997 1959 1990 1955 1983 1980 1966 1963 1987 1964
1972 1996 1998 1989 1953 1956 1968 1981 1992 2009 1982 1961 1993 1999
1985 1979 1977 1969 1958 1991 1971 1952 1975 2010 1984 1986 1994 1988
1954 1957 1951 1978 1974]

Unique value of:>>> YrSold (5)
[2008 2007 2006 2009 2010]


In [23]:
# Describe the target
train["SalePrice"].describe()

Out[23]:
count      1460.000000
mean     180921.195890
std       79442.502883
min       34900.000000
25%      129975.000000
50%      163000.000000
75%      214000.000000
max      755000.000000
Name: SalePrice, dtype: float64
In [24]:
# Plot the distplot of target
plt.figure(figsize=(10,8))
bar = sns.distplot(train["SalePrice"])
bar.legend(["Skewness: {:.2f}".format(train['SalePrice'].skew())])

Out[24]:
<matplotlib.legend.Legend at 0x1bb7d6acbe0>

In [25]:
# correlation heatmap
plt.figure(figsize=(25,25))
ax = sns.heatmap(train.corr(), cmap = "coolwarm", annot=True, linewidth=2)

# to fix the bug "first and last row cut in half of heatmap plot"
bottom, top = ax.get_ylim()
ax.set_ylim(bottom + 0.5, top - 0.5)

Out[25]:
(38.0, 0.0)