• Nie Znaleziono Wyników

INTRODUCTION TO DATA SCIENCE

N/A
N/A
Protected

Academic year: 2021

Share "INTRODUCTION TO DATA SCIENCE"

Copied!
22
0
0

Pełen tekst

(1)

INTRODUCTION TO DATA SCIENCE

1

(2)

Regression for predictions

2

Primer

Advanced

Linear regression

Multiple regression

Accesing performance

Ridge regression

Feature selection and lasso regression

Nearest neighbor and kernel regression

(3)

How much is my house worth

3

Predicting value of the house

How much is worth?

Lets look at the recent sales in the neighborhood.

How much did they sell for?

(4)

Naive: plot recent house sales

4

We take observations that we have and make a

plot of them.

(5)

Predict by prizes of similar houses

5

Is it really reasonably to believe that there is no information there?

We would like to leverage all avaible information.

(6)

Linear regression: a model based relation

6

Use a linear regression model

intercept slope

(7)

Which line?

7

(8)

Defining a cost of a given line

8

(9)

Find „best” line

9

(10)

Predicting your house price

10

Q. What do you think?

Is it good analysis?

A. I am not sure that it has linear trend. Did you tried quadratic function?

(11)

What about quadratic function?

11

Actually that looks pretty good Maybe relation is not linear afterall?

(12)

Or even higher order polynomial?

12

Do you believe this fit?

This function looks crazy.

Minimizes RRS but bad predictions.

Qudratic polynomial was probbaly better

(13)

How to choose model order/complexity

13

We have to work with the data that we have

(14)

Training/test split

14

(15)

Training error

15

Minimize to find

estimated w

(16)

Test error

16

Acces predictions

using estimated w

(17)

Training/test curve

17

(18)

Add more features

18

(19)

Regression ML block

19

(20)

Other applications

20

Stock predictions Tweed popularity

(21)

Other applications

21

Reading your mind

(22)

We discussed how to

22

Cytaty

Powiązane dokumenty

Cetinkaya-Rundel, Duke University Data Analysis and

Case studied are about building, evaluating, deploying inteligence in data analysis. Use pre-specified or develop

Cetinkaya-Rundel, Duke University Data Analysis and

Guestrin, Univ

Case studied are about building, evaluating, deploying inteligence in data analysis. Use pre-specified or develop

WFAiS UJ, Informatyka Stosowana II stopień

WFAiS UJ, Informatyka Stosowana II stopień

WFAiS UJ, Informatyka Stosowana II stopień