Business Methodology

Numeric: number (regression model)
Classification: category (non-numeric)

ex.
Tricycle Manufacturer

Numeric Model: what type of numeric?
-continuous, time-based, count
continuous models, time series analysis

non-numeric -> binary, non-binary
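
The split above can be sketched as a small lookup. This is only an illustration of the notes, not a standard API: the function name `model_branch` and the string labels are assumptions made for the example.

```python
# Kinds of target variable listed in the notes (labels are illustrative).
NUMERIC_KINDS = {"continuous", "time-based", "count"}
CLASSIFICATION_KINDS = {"binary", "non-binary"}


def model_branch(target_kind):
    """Return which branch of the methodology map a target variable falls under."""
    if target_kind in NUMERIC_KINDS:
        return "numeric (regression) model"
    if target_kind in CLASSIFICATION_KINDS:
        return "classification model"
    raise ValueError(f"unknown target kind: {target_kind!r}")


print(model_branch("count"))
print(model_branch("binary"))
```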

Selecting an Analytical Methodology

methodology map

business problem
-predict outcome, data analysis

Non-predictive Analysis
-Geospatial, Segmentation, Aggregation, Descriptive

Geospatial Analysis: a type of non-predictive data analysis
-location-based data, geographic data
Segmentation Analysis
-grouping together
Aggregation Analysis
-calculating a value across a group or dimension
Descriptive Analysis
-descriptive statistics provides simple summaries of a data sample
-mean, median, mode, standard deviation, interquartile range
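
A minimal sketch of the descriptive statistics listed above, using Python's standard `statistics` module. The sample values are made up for illustration. The interquartile range is taken from `statistics.quantiles` with its default method; other quartile conventions can give slightly different values.

```python
import statistics

# Made-up sample, for illustration only.
sample = [2, 4, 4, 5, 7, 9, 11]

mean = statistics.mean(sample)
median = statistics.median(sample)
mode = statistics.mode(sample)
stdev = statistics.stdev(sample)  # sample standard deviation (n - 1 denominator)

# Quartiles with the module's default method; IQR is Q3 - Q1.
q1, q2, q3 = statistics.quantiles(sample, n=4)
iqr = q3 - q1

print(mean, median, mode, stdev, iqr)
```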

Predictive Business Problems
-do you have data on what you are trying to predict?

Data Rich vs. Data Poor
-data poor: do not have useful data to solve the problem
-run an experiment in a business context, e.g. an A/B test to estimate sales of a new product

The Analytical Problem Solving Framework

-Strategy for solving problems
-Non-predictive data Analysis
-Predictive Analysis
-Linear Regressions

Framework (Cross-Industry Standard Process for Data Mining, CRISP-DM)
business issue understanding, data understanding, data preparation, analysis modeling, validation, presentation and visualization

business issue understanding:what decision needs to be made?
what information is needed to inform that decision?
what type of analysis will provide the information to inform that decision?

how can we predict hourly temperatures?
what data is needed?
what data is available?
what are the important characteristics of the data?

data preparation
-gather, cleanse, format, blend & combine, sample

analysis modeling
-predict temperature, predict electricity usage
build predictive model, validate model, repeat process, perform analysis

validation
presentation and visualization

slope

steepness, comparing staircases
5 feet, 10 feet
slope = 5/5 = 1, 10/5 = 2, 50/15 = 10/3

slope is vertical change / horizontal change = rise / run
positive slope is uphill, negative slope is downhill
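
The staircase slopes above can be checked with a short sketch. The helper name `slope` is chosen here for illustration; `Fraction` keeps results such as 10/3 exact.

```python
from fractions import Fraction


def slope(rise, run):
    """Slope = vertical change / horizontal change, as an exact fraction."""
    if run == 0:
        raise ZeroDivisionError("vertical line: slope is undefined")
    return Fraction(rise, run)


print(slope(5, 5))    # 1
print(slope(10, 5))   # 2
print(slope(50, 15))  # 10/3
print(slope(-4, 2))   # -2, a negative (downhill) slope
```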

y = mx + b
m is always the slope
b is the y-value of the y-intercept

y = 3x - 7
slope: 3
y-intercept: (0, -7)
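
As a quick check of the example, the sketch below builds y = mx + b as a function and reads the intercept and slope back off it. The helper name `line` is an assumption made for the example.

```python
def line(m, b):
    """Return the function y = m*x + b."""
    return lambda x: m * x + b


f = line(3, -7)

y_intercept = f(0)        # value of y where x = 0
unit_rise = f(1) - f(0)   # change in y for a run of 1, i.e. the slope

print(y_intercept)  # -7
print(unit_rise)    # 3
```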

Linear equation

properties of linear equations
x + 3 = 4

5(x+10) - 30 = 5x - 18
5x + 20 = 5x - 18, so 20 = -18, which is false: a "contradiction", no solution.
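
A rough sketch of how the two equations above behave, assuming each has already been simplified to the form a*x + b = c*x + d. The function name `classify` and its return labels are made up for illustration.

```python
def classify(a, b, c, d):
    """Classify the linear equation a*x + b = c*x + d.

    Returns (label, solution); solution is None unless there is exactly one.
    """
    if a != c:
        return "one solution", (d - b) / (a - c)
    if b == d:
        return "identity", None      # true for every x
    return "contradiction", None     # true for no x


# x + 3 = 4  ->  1*x + 3 = 0*x + 4
print(classify(1, 3, 0, 4))

# 5(x+10) - 30 = 5x - 18  ->  5*x + 20 = 5*x - 18
print(classify(5, 20, 5, -18))
```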

pythagorean theorem
4^2 = 16
x^2 = 25, x = 5 (positive root, since x is a side length)
√64 = 8
√1 = 1

right triangle: one 90° angle
a^2 + b^2 = c^2 (c is the hypotenuse)
legs 6 and 8, hypotenuse 10: 6^2 + 8^2 = 36 + 64 = 100 = 10^2
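
The 6, 8, 10 triangle can be verified with a short sketch. `math.hypot` is from the standard library; the helper `missing_leg` is a name chosen here for illustration.

```python
import math


def missing_leg(hypotenuse, leg):
    """Solve a^2 + b^2 = c^2 for the other leg, given c and one leg."""
    return math.sqrt(hypotenuse ** 2 - leg ** 2)


print(math.hypot(6, 8))    # hypotenuse for legs 6 and 8
print(missing_leg(10, 6))  # other leg when c = 10 and one leg = 6
```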

Algebraic expressions

A variable is: A symbol or letter that can be used to represent an unknown value

coffee …$2.00
tea …$3.50
Juice …$1.25

4 coffees and 2 teas:
4c + 2t = 4(2.00) + 2(3.50) = $15.00
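
The same order can be written as a small sketch, with the prices from the menu above acting as the values of the variables. The names `prices` and `order_total` are made up for the example.

```python
# Menu prices from the notes, in dollars.
prices = {"coffee": 2.00, "tea": 3.50, "juice": 1.25}


def order_total(order):
    """Sum quantity * price over every item in the order."""
    return sum(quantity * prices[item] for item, quantity in order.items())


print(order_total({"coffee": 4, "tea": 2}))  # 4c + 2t
```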

3x + 2 cannot be determined without a value for x

evaluate: find a value for an expression by substituting a value in for your variable
when x = -2, 3x + 2 = -4

take care with the order of operations
7 + 3(x-2) when x = 4: 7 + 3(2) = 7 + 6 = 13
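
Both evaluations above can be reproduced by substituting a value for x. Python follows the usual order of operations (parentheses, then multiplication, then addition), so the expressions can be written almost as they appear. The function names are illustrative.

```python
def expr_a(x):
    """3x + 2"""
    return 3 * x + 2


def expr_b(x):
    """7 + 3(x - 2): parentheses first, then multiply, then add."""
    return 7 + 3 * (x - 2)


print(expr_a(-2))  # -4
print(expr_b(4))   # 13
```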

unit conversion

the world's most amazing number: 1

1) multiplication
2) division

3/3 = π/π = -4/-4 = 1

12 eggs = 1 carton
12 eggs / 1 carton = 1

conversion factor: a number that relates an amount of one thing to an amount of another.

cartons -> eggs: 12 eggs = 1 carton
yards -> feet: 3 feet = 1 yard
liters -> ounces
dollar -> Euros

1 mile = 5280 feet
1 gallon = 4 quarts
1 gram = 1000 milligrams
1 pound = 16 ounces

1 mile = 1.609 km
if you run 5 km, you run about 3.1 miles (5 / 1.609 ≈ 3.1)

1 mile = 1760 yards
1 yard = 3 feet
1 m = 1.094 yards
1 km = 1000 m
1 m = 100 cm
1 m = 1000 mm
1 gal = 16 cups
1 gal = 3.785 liters
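
A minimal sketch of using conversion factors: multiplying or dividing by a factor is multiplying by a form of 1, so the amount stays the same and only the unit changes. The constants come from the list above; the function names are made up for illustration.

```python
# Conversion factors taken from the notes.
KM_PER_MILE = 1.609
FEET_PER_MILE = 5280
YARDS_PER_MILE = 1760
FEET_PER_YARD = 3
EGGS_PER_CARTON = 12


def km_to_miles(km):
    """Divide by km-per-mile so the km units cancel."""
    return km / KM_PER_MILE


def miles_to_feet(miles):
    """Multiply by feet-per-mile so the mile units cancel."""
    return miles * FEET_PER_MILE


def cartons_to_eggs(cartons):
    return cartons * EGGS_PER_CARTON


print(round(km_to_miles(5), 1))  # a 5 km run expressed in miles

# Chaining factors: miles -> yards -> feet agrees with the direct factor.
print(1 * YARDS_PER_MILE * FEET_PER_YARD == miles_to_feet(1))
```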