• Nie Znaleziono Wyników

INTRODUCTION TO DATA SCIENCE

N/A
N/A
Protected

Academic year: 2021

Share "INTRODUCTION TO DATA SCIENCE"

Copied!
228
0
0

Pełen tekst

(1)

INTRODUCTION TO DATA SCIENCE

WFAiS UJ, Informatyka Stosowana I stopień studiów

1

19/01/2021

This lecture is based on course by M. Cetinkaya-Rundel, Duke University Data Analysis and Statistical Inference

(2)

Statistical inference

19/01/2021

2

Lets start with small case study:

gender discrimination

(3)

Statistical inference: case study

19/01/2021

3

(4)

Statistical inference: case study

19/01/2021

4

(5)

Statistical inference: case study

19/01/2021

5

(6)

Statistical inference: case study

19/01/2021

6

(7)

Statistical inference: case study

19/01/2021

7

(8)

Statistical inference: case study

19/01/2021

8

(9)

Statistical inference: case study

19/01/2021

9

(10)

Statistical inference: case study

19/01/2021

10

(11)

Statistical inference: case study

19/01/2021

11

0.30

Result of promotion p-value = 0.05

(12)

Statistical inference: case study

19/01/2021

12

(13)

Statistical inference: case study

19/01/2021

13

(14)

Probability and distributions

19/01/2021

14

(15)

Random process

19/01/2021

15

(16)

Probability

19/01/2021

16

(17)

Law of Large Numbers

19/01/2021

17

30 300

(18)

Disjoint (mutually exclusive)

19/01/2021

18

(19)

Union of disjoint events

19/01/2021

19

(20)

Union of ono-disjoint events

19/01/2021

20

(21)

General addition rule

19/01/2021

21

(22)

Sample space

19/01/2021

22

(23)

Probability distributions

19/01/2021

23

(24)

Complementary events

19/01/2021

24

(25)

Disjoint vs complementary

19/01/2021

25

(26)

Independence

19/01/2021

26

(27)

Independence

19/01/2021

27

(28)

Independence

19/01/2021

28

(29)

Independence

19/01/2021

29

(30)

Determining dependence

19/01/2021

30

(31)

Determining dependence

19/01/2021

31

(32)

Example: probability

19/01/2021

32

(33)

Example

19/01/2021

33

(34)

Example

19/01/2021

34

(35)

Example

19/01/2021

35

(36)

Example

19/01/2021

36

(37)

Example

19/01/2021

37

(38)

Example

19/01/2021

38

(39)

Example

19/01/2021

39

(40)

Conditional probability

19/01/2021

40

(41)

Conditional probability

19/01/2021

41

(42)

Marginal probability

19/01/2021

42

(43)

Joint probability

19/01/2021

43

(44)

Conditional probability

19/01/2021

44

(45)

Conditional probability

19/01/2021

45

(46)

Probability trees

19/01/2021

46

(47)

Probability trees

19/01/2021

47

(48)

Probability trees

19/01/2021

48

(49)

Probability trees

19/01/2021

49

(50)

Bayesian inference

19/01/2021

50

(51)

Bayesian inference

19/01/2021

51

(52)

Bayesian inference

19/01/2021

52

(53)

Bayesian inference

19/01/2021

53

(54)

Bayesian inference

19/01/2021

54

(55)

Bayesian inference

19/01/2021

55

(56)

Bayesian inference

19/01/2021

56

(57)

Bayesian inference

19/01/2021

57

(58)

Bayesian inference

19/01/2021

58

(59)

Normal distribution

19/01/2021

59

(60)

Normal distribution

19/01/2021

60

(61)

Foundation for inference

19/01/2021

61

(62)

Sampling distribution

19/01/2021

62

(63)

Sampling distribution

19/01/2021

63

(64)

Central Limit Theorem

19/01/2021

64

(65)

Example

19/01/2021

65

(66)

Example

19/01/2021

66

(67)

Example

19/01/2021

67

(68)

Confidence interval (for a mean)

19/01/2021

68

(69)

Confidence interval

19/01/2021

69

(70)

Confidence interval

19/01/2021

70

(71)

Confidence interval

19/01/2021

71

(72)

Confidence level

19/01/2021

72

(73)

Confidence level

19/01/2021

73

(74)

Confidence level

19/01/2021

74

(75)

Confidence level

19/01/2021

75

(76)

Required sample size

19/01/2021

76

(77)

Examples: Confidence interval

19/01/2021

77

(78)

Examples: Confidence interval

19/01/2021

78

(79)

Examples: Confidence interval

19/01/2021

79

(80)

Hypothesis testing framework

19/01/2021

80

(81)

Example

19/01/2021

81

(82)

Example

19/01/2021

82

(83)

Example

19/01/2021

83

(84)

Example

19/01/2021

84

(85)

Inference for other estimators

19/01/2021

85

(86)

Inference for other estimators

19/01/2021

86

(87)

Inference for other estimators

19/01/2021

87

(88)

Decision errors

19/01/2021

88

(89)

Decision errors

19/01/2021

89

(90)

Decision errors

19/01/2021

90

(91)

Decision errors

19/01/2021

91

(92)

Decision errors

19/01/2021

92

(93)

Decision errors

19/01/2021

93

(94)

Decision errors

19/01/2021

94

(95)

Significance vs confidence level

19/01/2021

95

(96)

Significance vs confidence level

19/01/2021

96

(97)

Significance vs confidence level

19/01/2021

97

(98)

Inference for numerical variables

19/01/2021

98

(99)

Hypothesis testing for paired data

19/01/2021

99

(100)

Hypothesis testing for paired data

19/01/2021

100

(101)

Hypothesis testing for paired data

19/01/2021

101

(102)

Hypothesis testing for paired data

19/01/2021

102

(103)

Hypothesis testing for paired data

19/01/2021

103

(104)

Hypothesis testing for paired data

19/01/2021

104

(105)

Hypothesis testing for paired data

19/01/2021

105

(106)

Hypothesis testing for paired data

19/01/2021

106

(107)

Hypothesis testing for paired data

19/01/2021

107

(108)

Bootstrapping

19/01/2021

108

(109)

Bootstrapping

19/01/2021

109

(110)

Bootstrapping

19/01/2021

110

(111)

Bootstrapping

19/01/2021

111

(112)

Bootstrapping

19/01/2021

112

(113)

Bootstrapping

19/01/2021

113

(114)

Bootstrapping limitations

19/01/2021

114

(115)

Bootstrapping vs sampling distribution

19/01/2021

115

(116)

t distribution

19/01/2021

116

(117)

t distribution

19/01/2021

117

(118)

t distribution

19/01/2021

118

(119)

t distribution

19/01/2021

119

(120)

t distribution

19/01/2021

120

(121)

Inference for a small sample mean

19/01/2021

121

(122)

Inference for a small sample mean

19/01/2021

122

(123)

Inference for a small sample mean

19/01/2021

123

(124)

Inference for comparing two small sample means

19/01/2021

124

(125)

Inference for comparing two small sample means

19/01/2021

125

(126)

Comparing more than two means

19/01/2021

126

(127)

Comparing more than two means

19/01/2021

127

(128)

Comparing more than two means

19/01/2021

128

(129)

Comparing more than two means

19/01/2021

129

(130)

Comparing more than two means

19/01/2021

130

(131)

Comparing more than two means

19/01/2021

131

(132)

Comparing more than two means

19/01/2021

132

(133)

Inference for categorical variables

19/01/2021

133

(134)

Sampling variability & CLT for proportions

19/01/2021

134

(135)

19/01/2021

135

(136)

19/01/2021

136

(137)

What if

19/01/2021

137

(138)

19/01/2021

138

(139)

Hypothesis testing for a proportion

19/01/2021

139

(140)

19/01/2021

140

(141)

Estimating diference between two proportions

19/01/2021

141

(142)

Estimating diference between two proportions

19/01/2021

142

(143)

Hypothesis tests for comparing two proportions

19/01/2021

143

(144)

19/01/2021

144

(145)

19/01/2021

145

(146)

19/01/2021

146

MORE EXAMPLES

(147)

Example: Bayesian inference

19/01/2021

147

(148)

Example

19/01/2021

148

(149)

Example

19/01/2021

149

(150)

Example

19/01/2021

150

(151)

Example

19/01/2021

151

(152)

Examples: Confidence interval

19/01/2021

152

(153)

Examples: Confidence interval

19/01/2021

153

(154)

Example

19/01/2021

154

(155)

Example

19/01/2021

155

(156)

19/01/2021

156

PRACTICE

(157)

Practice

19/01/2021

157

(158)

Practice

19/01/2021

158

(159)

Practice

19/01/2021

159

(160)

Practice

19/01/2021

160

(161)

Practice

19/01/2021

161

(162)

Normal distribution

19/01/2021

162

(163)

Practice

19/01/2021

163

(164)

Practice

19/01/2021

164

(165)

Practice

19/01/2021

165

(166)

Practice

19/01/2021

166

(167)

Practice

19/01/2021

167

(168)

Practice

19/01/2021

168

(169)

Practice

19/01/2021

169

(170)

Practice

19/01/2021

170

(171)

Practice

19/01/2021

171

(172)

Practice

19/01/2021

172

(173)

Practice

19/01/2021

173

(174)

Practice

19/01/2021

174

(175)

Practice

19/01/2021

175

(176)

Practice

19/01/2021

176

(177)

Practice

19/01/2021

177

(178)

Practice

19/01/2021

178

(179)

Practice

19/01/2021

179

(180)

Practice

19/01/2021

180

(181)

Practice

19/01/2021

181

(182)

Practice

19/01/2021

182

(183)

Practice

19/01/2021

183

(184)

Practice

19/01/2021

184

(185)

Practice

19/01/2021

185

(186)

Practice

19/01/2021

186

(187)

Practice

19/01/2021

187

(188)

Practice

19/01/2021

188

(189)

Practice

19/01/2021

189

(190)

Practice

19/01/2021

190

(191)

Practice

19/01/2021

191

(192)

Practice

19/01/2021

192

(193)

Practice

19/01/2021

193

(194)

Practice

19/01/2021

194

(195)

Practice

19/01/2021

195

(196)

Practice

19/01/2021

196

(197)

Practice

19/01/2021

197

(198)

19/01/2021

198

(199)

Practice

19/01/2021

199

(200)

19/01/2021

200

Cytaty

Powiązane dokumenty

Guestrin, Univ

 Personalisation: purhase history, monthly and yearly trends, etc.?. Customers who bought product A also bought

Case studied are about building, evaluating, deploying inteligence in data analysis.. Regression: Predicting

Case studied are about building, evaluating, deploying inteligence in data analysis. Use pre-specified or develop

Cetinkaya-Rundel, Duke University Data Analysis and

Guestrin, Univ

The choice of the regularization parameter a, or equivalently the choice of ~ log L (or ~X2), determines the trade-off between the bias and variance of the estima- tors

Case studied are about building, evaluating, deploying inteligence in data analysis. Use pre-specified or develop