• Nie Znaleziono Wyników

INTRODUCTION TO DATA SCIENCE

N/A
N/A
Protected

Academic year: 2021

Share "INTRODUCTION TO DATA SCIENCE"

Copied!
39
0
0

Pełen tekst

(1)

INTRODUCTION TO DATA SCIENCE

WFAiS UJ, Informatyka Stosowana I stopień studiów

1

10/11/2020

This lecture is

based on course by E. Fox and C. Guestrin, Univ of Washington

(2)

Classification

10/11/2020

2

An inteligent restaurant review system

(3)

What is a sentiment of the review

10/11/2020

3

(4)

Topic sentiments

10/11/2020

4

(5)

Inteligent restaurant review system

10/11/2020

5

(6)

Core building block

10/11/2020

6

(7)

Inteligent restaurant review system

10/11/2020

7

(8)

Classifier

10/11/2020

8

(9)

Multiclass classifier

10/11/2020

9

(10)

Spam filtering

10/11/2020

10

(11)

Image classification

10/11/2020

11

(12)

Personalized medical diagnosis

10/11/2020

12

(13)

Reading your mind

10/11/2020

13

(14)

Representing classifiers

10/11/2020

14

(15)

Simple threshold classifier

10/11/2020

15

(16)

Simple threshold classifier

10/11/2020

16

(17)

Problems with threshold classifier

10/11/2020

17

(18)

A (linear) classifier

10/11/2020

18

(19)

Scoring a sentence

10/11/2020

19

(20)

Simple linear classifier

10/11/2020

20

(21)

Suppose only two words had non-zero weight

10/11/2020

21

(22)

Decision boundary example

10/11/2020

22

(23)

Decision boundary

10/11/2020

23

Separates positive & negative predictions

(24)

Training a classifier = Learning the weights

10/11/2020

24

(25)

Classification error & accuracy

10/11/2020

25

(26)

What if you ignore the sentence and just guess?

10/11/2020

26

(27)

Is a classifier with 90% accuracy good?

Depends…

10/11/2020

27

(28)

What is a good accuracy?

10/11/2020

28

(29)

Types of mistakes

10/11/2020

29

True positive

True

negative False negative False

positive

(30)

Cost of mistakes

10/11/2020

30

(31)

Confusion matrix: binary classification

10/11/2020

31

(32)

Confusion matrix: multiclass classification

10/11/2020

32

(33)

How much data does a model need to learn?

10/11/2020

33

(34)

Learning curves

10/11/2020

34

(35)

Learning curves

10/11/2020

35

(36)

More complex models tend to have less bias…

10/11/2020

36

(37)

Classification based on bigrams

10/11/2020

37

(38)

How confident is your prediction?

10/11/2020

38

(39)

We have discussed how to

10/11/2020

39

Cytaty

Powiązane dokumenty

Cetinkaya-Rundel, Duke University Data Analysis and

WFAiS UJ, Informatyka Stosowana I stopień

Case studied are about building, evaluating, deploying inteligence in data analysis. Use pre-specified or develop

Cetinkaya-Rundel, Duke University Data Analysis and

Guestrin, Univ

WFAiS UJ, Informatyka Stosowana I rok studiów, I

Case studied are about building, evaluating, deploying inteligence in data analysis. Use pre-specified or develop

WFAiS UJ, Informatyka Stosowana II stopień