• Nie Znaleziono Wyników

INTRODUCTION TO DATA SCIENCE

N/A
N/A
Protected

Academic year: 2021

Share "INTRODUCTION TO DATA SCIENCE"

Copied!
234
0
0

Pełen tekst

(1)

INTRODUCTION TO DATA SCIENCE

WFAiS UJ, Informatyka Stosowana I stopień studiów

1

21/01/2020

This lecture is based on course by M. Cetinkaya-Rundel, Duke University Data Analysis and Statistical Inference

(2)

Statistical inference

21/01/2020

2

Lets start with small case study:

gender discrimination

(3)

Statistical inference: case study

21/01/2020

3

(4)

Statistical inference: case study

21/01/2020

4

(5)

Statistical inference: case study

21/01/2020

5

(6)

Statistical inference: case study

21/01/2020

6

(7)

Statistical inference: case study

21/01/2020

7

(8)

Statistical inference: case study

21/01/2020

8

(9)

Statistical inference: case study

21/01/2020

9

(10)

Statistical inference: case study

21/01/2020

10

(11)

Statistical inference: case study

21/01/2020

11

0.30

Result of promotion p-value = 0.05

(12)

Statistical inference: case study

21/01/2020

12

(13)

Statistical inference: case study

21/01/2020

13

(14)

Probability and distributions

21/01/2020

14

(15)

Probability and distributions

21/01/2020

15

(16)

Probability and distributions

21/01/2020

16

(17)

Probability and distributions

21/01/2020

17

30 300

(18)

Probability and distributions

21/01/2020

18

(19)

Disjoint (mutually exclusive)

21/01/2020

19

(20)

Union of disjoint events

21/01/2020

20

(21)

Union of ono-disjoint events

21/01/2020

21

(22)

General addition rule

21/01/2020

22

(23)

Sample space

21/01/2020

23

(24)

Probability distributions

21/01/2020

24

(25)

Complementary events

21/01/2020

25

(26)

Disjoint vs complementary

21/01/2020

26

(27)

Independence

21/01/2020

27

(28)

Independence

21/01/2020

28

(29)

Independence

21/01/2020

29

(30)

Independence

21/01/2020

30

(31)

Practice

21/01/2020

31

(32)

Determining dependence

21/01/2020

32

(33)

Determining dependence

21/01/2020

33

(34)

Practice

21/01/2020

34

(35)

Example: probability

21/01/2020

35

(36)

Example

21/01/2020

36

(37)

Example

21/01/2020

37

(38)

Example

21/01/2020

38

(39)

Example

21/01/2020

39

(40)

Example

21/01/2020

40

(41)

Example

21/01/2020

41

(42)

Example

21/01/2020

42

(43)

Conditional probability

21/01/2020

43

(44)

Conditional probability

21/01/2020

44

(45)

Marginal probability

21/01/2020

45

(46)

Joint probability

21/01/2020

46

(47)

Conditional probability

21/01/2020

47

(48)

Conditional probability

21/01/2020

48

(49)

Practice

21/01/2020

49

(50)

Practice

21/01/2020

50

(51)

Practice

21/01/2020

51

(52)

Probability trees

21/01/2020

52

(53)

Probability trees

21/01/2020

53

(54)

Probability trees

21/01/2020

54

(55)

Probability trees

21/01/2020

55

(56)

Bayesian inference

21/01/2020

56

(57)

Bayesian inference

21/01/2020

57

(58)

Bayesian inference

21/01/2020

58

(59)

Bayesian inference

21/01/2020

59

(60)

Bayesian inference

21/01/2020

60

(61)

Bayesian inference

21/01/2020

61

(62)

Bayesian inference

21/01/2020

62

(63)

Bayesian inference

21/01/2020

63

(64)

Bayesian inference

21/01/2020

64

(65)

Example: Bayesian inference

21/01/2020

65

(66)

Example

21/01/2020

66

(67)

Example

21/01/2020

67

(68)

Example

21/01/2020

68

(69)

Example

21/01/2020

69

(70)

Normal distribution

21/01/2020

70

(71)

Normal distribution

21/01/2020

71

(72)

Normal distribution

21/01/2020

72

(73)

Practice

21/01/2020

73

(74)

Practice

21/01/2020

74

(75)

Practice

21/01/2020

75

(76)

Practice

21/01/2020

76

(77)

Practice

21/01/2020

77

(78)

Foundation for inference

21/01/2020

78

(79)

21/01/2020

79

(80)

21/01/2020

80

(81)

21/01/2020

81

(82)

Sampling distribution

21/01/2020

82

(83)

Sampling distribution

21/01/2020

83

(84)

Central Limit Theorem

21/01/2020

84

(85)

Example

21/01/2020

85

(86)

Example

21/01/2020

86

(87)

Example

21/01/2020

87

(88)

Confidence interval (for a mean)

21/01/2020

88

(89)

Confidence interval

21/01/2020

89

(90)

Confidence interval

21/01/2020

90

(91)

Confidence interval

21/01/2020

91

(92)

Confidence level

21/01/2020

92

(93)

Confidence level

21/01/2020

93

(94)

Confidence level

21/01/2020

94

(95)

Confidence level

21/01/2020

95

(96)

Practice

21/01/2020

96

(97)

Required sample size

21/01/2020

97

(98)

Practice

21/01/2020

98

(99)

Practice

21/01/2020

99

(100)

Examples: Confidence interval

21/01/2020

100

(101)

Examples: Confidence interval

21/01/2020

101

(102)

Examples: Confidence interval

21/01/2020

102

(103)

Examples: Confidence interval

21/01/2020

103

(104)

Examples: Confidence interval

21/01/2020

104

(105)

Hypothesis testing framework

21/01/2020

105

(106)

Example

21/01/2020

106

(107)

Example

21/01/2020

107

(108)

Example

21/01/2020

108

(109)

Example

21/01/2020

109

(110)

Example

21/01/2020

110

(111)

Example

21/01/2020

111

(112)

Inference for other estimators

21/01/2020

112

(113)

Inference for other estimators

21/01/2020

113

(114)

Inference for other estimators

21/01/2020

114

(115)

Practice

21/01/2020

115

(116)

Practice

21/01/2020

116

(117)

Practice

21/01/2020

117

(118)

Practice

21/01/2020

118

(119)

Decision errors

21/01/2020

119

(120)

Decision errors

21/01/2020

120

(121)

Decision errors

21/01/2020

121

(122)

Decision errors

21/01/2020

122

(123)

Decision errors

21/01/2020

123

(124)

Decision errors

21/01/2020

124

(125)

Decision errors

21/01/2020

125

(126)

Significance vs confidence level

21/01/2020

126

(127)

Significance vs confidence level

21/01/2020

127

(128)

Significance vs confidence level

21/01/2020

128

(129)

Statistical vs. practical significance

21/01/2020

129

(130)

Statistical vs. practical significance

21/01/2020

130

(131)

Inference for numerical variables

21/01/2020

131

(132)

Hypothesis testing for paired data

21/01/2020

132

(133)

Hypothesis testing for paired data

21/01/2020

133

(134)

Hypothesis testing for paired data

21/01/2020

134

(135)

Hypothesis testing for paired data

21/01/2020

135

(136)

Hypothesis testing for paired data

21/01/2020

136

(137)

Hypothesis testing for paired data

21/01/2020

137

(138)

Hypothesis testing for paired data

21/01/2020

138

(139)

Hypothesis testing for paired data

21/01/2020

139

(140)

Hypothesis testing for paired data

21/01/2020

140

(141)

Practice

21/01/2020

141

(142)

Practice

21/01/2020

142

(143)

Practice

21/01/2020

143

(144)

Bootstrapping

21/01/2020

144

(145)

Bootstrapping

21/01/2020

145

(146)

Bootstrapping

21/01/2020

146

(147)

Bootstrapping

21/01/2020

147

(148)

Bootstrapping

21/01/2020

148

(149)

Bootstrapping

21/01/2020

149

(150)

Practice

21/01/2020

150

(151)

Practice

21/01/2020

151

(152)

Practice

21/01/2020

152

(153)

Bootstrapping limitations

21/01/2020

153

(154)

Bootstrapping vs sampling distribution

21/01/2020

154

(155)

t distribution

21/01/2020

155

(156)

t distribution

21/01/2020

156

(157)

t distribution

21/01/2020

157

(158)

t distribution

21/01/2020

158

(159)

t distribution

21/01/2020

159

(160)

Practice

21/01/2020

160

(161)

Inference for a small sample mean

21/01/2020

161

(162)

Inference for a small sample mean

21/01/2020

162

(163)

Inference for a small sample mean

21/01/2020

163

(164)

Practice

21/01/2020

164

(165)

Practice

21/01/2020

165

(166)

Practice

21/01/2020

166

(167)

Practice

21/01/2020

167

(168)

Practice

21/01/2020

168

(169)

Practice

21/01/2020

169

(170)

Inference for comparing two small sample means

21/01/2020

170

(171)

Inference for comparing two small sample means

21/01/2020

171

(172)

Practice

21/01/2020

172

(173)

Practice

21/01/2020

173

(174)

Practice

21/01/2020

174

(175)

Comparing more than two means

21/01/2020

175

(176)

Comparing more than two means

21/01/2020

176

(177)

Comparing more than two means

21/01/2020

177

(178)

Comparing more than two means

21/01/2020

178

(179)

Comparing more than two means

21/01/2020

179

(180)

Comparing more than two means

21/01/2020

180

(181)

Comparing more than two means

21/01/2020

181

(182)

Comparing more than two means

21/01/2020

182

(183)

Comparing more than two means

21/01/2020

183

(184)

Comparing more than two means

21/01/2020

184

(185)

Comparing more than two means

21/01/2020

185

(186)

ANOVA

21/01/2020

186

(187)

ANOVA

21/01/2020

187

(188)

ANOVA

21/01/2020

188

(189)

ANOVA

21/01/2020

189

(190)

ANOVA

21/01/2020

190

(191)

ANOVA

21/01/2020

191

(192)

ANOVA

21/01/2020

192

(193)

ANOVA

21/01/2020

193

(194)

ANOVA

21/01/2020

194

(195)

21/01/2020

195

(196)

21/01/2020

196

(197)

21/01/2020

197

(198)

21/01/2020

198

(199)

21/01/2020

199

(200)

Conditions for ANOVA

21/01/2020

200

Cytaty

Powiązane dokumenty

Guestrin, Univ

 Personalisation: purhase history, monthly and yearly trends, etc.?. Customers who bought product A also bought

Cetinkaya-Rundel, Duke University Data Analysis and

Case studied are about building, evaluating, deploying inteligence in data analysis.. Regression: Predicting

Case studied are about building, evaluating, deploying inteligence in data analysis. Use pre-specified or develop

Guestrin, Univ

The choice of the regularization parameter a, or equivalently the choice of ~ log L (or ~X2), determines the trade-off between the bias and variance of the estima- tors

Case studied are about building, evaluating, deploying inteligence in data analysis. Use pre-specified or develop