what causes outliers?
-sensor malfunction: ignore
-data entry errors
-freak event: pay attention
Outlier Detection
-train
-remove
-train again
for point in data: salary = point[0] bonus = point[1] matplotlib.pyplot.scatter( salary, bonus ) matplotlib.pyplot.xlabel("salary") matplotlib.pyplot.ylabel("bonus") matplotlib.pyplot.show()