what causes outliers?
-sensor malfunction: ignore
-data entry errors
-freak event: pay attention
Outlier Detection
-train
-remove
-train again
for point in data:
salary = point[0]
bonus = point[1]
matplotlib.pyplot.scatter( salary, bonus )
matplotlib.pyplot.xlabel("salary")
matplotlib.pyplot.ylabel("bonus")
matplotlib.pyplot.show()