• Nie Znaleziono Wyników

Machine Learning and Multivariate Techniques in HEP data Analyses

N/A
N/A
Protected

Academic year: 2021

Share "Machine Learning and Multivariate Techniques in HEP data Analyses"

Copied!
54
0
0

Pełen tekst

(1)

Machine Learning and Multivariate Techniques in HEP data Analyses

Prof. dr hab. Elżbieta Richter-Wąs

Extracted from slides by:

G. Cowan’s lectures at RH London Univ., H. Voss at SOS 2016, K. Reygers lectures at Heilderbeg Univ.

What is: Machine Learning (ML) & Multivariate Analysis/Technique (MVA)

Basics (classification, regression)

ROC-curve

generative vs predictive models

MVA/ML algorithms

Naïve Basian, KNN,

Linear discriminators, SVM

model fitting – gradient decent and loss function

General comments about MVAs

(2)

HEP Experiments: Simulated Higgs event in CMS

2

(3)

HEP Experiments: Simulated Higgs event in CMS

3

(4)

HEP Experiments: Event Signatures in the Detector

4

(5)

NOnA long baseline oscillation exp.

(nm)/ne (dis-)/appearance

O(100k) background, O(100) nm , O(10) ne per year 5

(6)

Machine Learning ‚elsewhere’

6

(7)

HEP: Everything started Multivariate

7

(8)

Outline

8

What is: Machine Learning (ML) & Multivariate Analysis/Technique (MVA)

Basics (classification, regression)

ROC-curve

generative vs predictive models

MVA/ML algorithms

Naïve Basian, KNN,

Linear discriminators, SVM

model fitting – gradient decent and loss function

General comments about MVAs

(9)

What is Machine Learning

9

(10)

What are Multivariate Techniques

10

Background

Signal

(11)

Machine Learning - Multivariate Techniques

11

(12)

Regression

12

(13)

Regression -> model functional behaviour

13

(14)

Multi-Variate Classification

14

(15)

Classification: Different Approaches

15

(16)

Signal Probability Instead of Hard Decisions

16

(17)

Event Classification

17

(18)

Event Classification

18

efficiency and purity

(19)

Classification <-> Regression

19

(20)

Event Classification

20

(21)

Receiver Operation Characteristic (ROC) curve

21

(22)

Receiver Operation Characteristic (ROC) curve

22

(23)

Receiver Operation Characteristic (ROC) curve

23

(24)

Event Classification -> finding the mapping function y(x)

24

(25)

Machine Learning Categories

25

(26)

Kernel Density Estimator

26

(27)

Kernel Density Estimator

27

(28)

K- Nearest Neighbour

28

(29)

Kernel Density Estimator

29

(30)

„Curse of Dimensionality”

30

(31)

Naive Bayesian Classifier

(Projective Likelihood Classifier)

31

(32)

De-Correlation

32

(33)

De-Correlation via PCA

(Principal Component Analysis)

33

(34)

Decorrelation at Work

34

(35)

Correlation Coefficients

35

(36)

Discriminative Classifiers

36

(37)

Linear Discriminant

37

(38)

Fisher’s Linear Discriminant

38

(39)

Linear Discriminant and non linear correlations

39

(40)

Classifier Training and Loss Function

40

(41)

Classifier Training and Loss Function

41 -

(42)

Logistic Regression

42

(43)

Logistic Regression

43

(44)

Logistic Regression

44

(45)

(Stochastic) Gradient Decent SDG

45

(46)

Overtraining

46

(47)

Regulatisation

47

(48)

Cross Validation

48

(49)

General Advice for (MVA) Analyses

49

(50)

MVA Categories

50

(51)

About Systematic Errors

51

(52)

MVA and Systematic Uncertainties

52

(53)

Classifiers and Their Properties

53

(54)

Summary

54

Cytaty

Powiązane dokumenty

We fitted a cumulative Gaussian distribution to the average speeds in free flow on the middle and the left lane using the fractions of traffic on these lanes. We added 5% to

Since 2010 new era: rapidly increasing areas of

High Statistics Limit of Poisson

Error propagation II: the general

Binned Maximum Likelihood

Bayesian Credible Intervals -

Berger lectures at CERN Summer School 2019... Randomness in High

Berger lectures at CERN Summer School 2019... How to represent