<aside> 💡 Recommended book for learning exploration are : Multivariate Data Analysis by Joseph F Hair → Chapter ‘Examining your data’ And the following data exploration is based on Kaggle Dataset House Price Prediction by using advance Regression model
</aside>
Variable name
Type (numerical , categorical)
Segment : (building , location and space)
Building 🏦 a variable that related with physical characteristics of a building
location 🗺️ a variable which gives information about the place
space 🚀 a variable that reports the space properties of the house
Expectation : The variable influence in ‘SalePrice’, for this we can use scale like high, medium and low
Conclusion : Correction of expectation
The “Type” and “Segment” is for future reference , “Expectation” help us in our sixth sense. To fill column , reading description of all variable is must thing to do and asking the following questions:
After this we can filter these variable into one place and look carefully to the variable with ‘High Expectation’ then creating the scatter plots between those variable and target variable “SalePrice”, then filling the conclusion column