• Nie Znaleziono Wyników

Introduction to Data Science (for physics)

N/A
N/A
Protected

Academic year: 2021

Share "Introduction to Data Science (for physics)"

Copied!
24
0
0

Pełen tekst

(1)

Outline of the course:

1. Statistics and Data Analysis

2. Multivariate Techniques and Machine Learning

3. Physics Modeling, Simulation and Monte Carlo Methods 4. Regression, Classification, Clustering and Retrieval

First three parts will focus on applications in physics (mostly in High Energy Physics)

The last part will discuss more typical „Data Science”

problems and solutions.

Introduction to Data Science (for physics)

Prof. dr hab. Elżbieta Richter-Wąs

Acknowledgement: slides below „borrowed” fron different courses in HEP and Data Science.

(2)

Part 1: Statistics and Data Analysis

2

(3)

Part 1: Statistics and Data Analysis

From N. Berger, CERN Summer School, 2019 3

(4)

Part 1: Statistics and Data Analysis

From N. Berger, CERN Summer School, 2019 4

(5)

Part 1: Statistics and Data Analysis

From N. Berger, CERN Summer School, 2019 5

(6)

Part 1: Statistics and Data Analysis

From N. Berger, CERN Summer School, 2019 6

(7)

Part 1: Statistics and Data Analysis

7

(8)

Part 2: Multivariate Analysis and Machine Learming

8

In HEP everything started multivariate.

Below: inteligent „Multivariate Pattern Recognition”

used to identify particles

Nowdays: let computer help you.

(9)

Classifiers and their properties

9

(10)

Classical Learning

10

(11)

Machine Learning

11

(12)

Part 3: Physics modeling, simulation and Monte Carlo methods

12

What is the model?

(13)

Part 3: Physics modeling, simulation and and Monte Carlo methods

13 Visualised model of the detector used for simulation Detector

(14)

Part 4: Regression, Classification, Clustering

14

• Current view on Machine Learning :

disruptive inteligent applications are used by leading comercial companies

(15)

Part 4: Regresion, Classification, Clustering

15

• Data → inteligence pipeline

New kind of analysis which brings inteligence how to solve a problem

Eg. which product to buy which film to chose

connect people and taxi driver

(16)

Regression

16

Case study: prediction for the house price

(17)

Classification

17

Case study: Score of the restaurant

(18)

Clustering

18

Case study: assigning books to groups by topics

(19)

Recommendation

19

Case study: personalisation of recommending items

(20)

Deploying inteligence module

20

Case studied are about building, evaluating, deploying inteligence in data analysis.

(21)

Regression: Predicting house prices

21

(22)

Classification: Sentiment analysis

22

(23)

Clustering: Finding documents

23

(24)

Getting your ETCs for lectures

24

• I foresee written exam on the theory part.

• List of topical questions will be available before Xmass break.

• You will be asked to answer 5 questions out of 25-30 on the list.

Cytaty

Powiązane dokumenty

 Personalisation: purhase history, monthly and yearly trends, etc.?. Customers who bought product A also bought

Cetinkaya-Rundel, Duke University Data Analysis and

Case studied are about building, evaluating, deploying inteligence in data analysis. Use pre-specified or develop

– Time for you to write your code and (for me) to disscuss with each student her/his progress with assignments.. • COVID-19 times:

Case studied are about building, evaluating, deploying inteligence in data analysis. Use pre-specified or develop

Cetinkaya-Rundel, Duke University Data Analysis and

Case studied are about building, evaluating, deploying inteligence in data analysis. Use pre-specified or develop

Guestrin, Univ