-Strategy for solving problems
-Non-predictive data Analysis
-Predictive Analysis
-Linear Regressions
Framwork(cross industry standard process for data mining)
business issue understanding, data understanding, data preparation, analysis modeling, validation, personalization
business issue understanding:what decision needs to be made?
what information is needed to inform that decision?
what type of analysis will provide the information to inform that decision?
how can we predict hourly temperatures?
what data is needed?
what data is available?
what are the important of the data?
data preparation
-gather, cleanse, format, blend & combine, sample
analysis modeling
-predict temperature, predict electricity usage
build predictive model, validate model, repeat process, perform analysis
validation
presentation and visualization