Methodology for improving data analysis accuracy in the ATLAS experiment: a case study of WWZ Production in pp collisions

This project develops methods for improving the accuracy of LHC data analysis using machine learning tools. In particular, it studies how statistical and systematic uncertainties affect the accuracy of the analysis.

This repository replicates the WVZ analysis [1].

You need access to the NR TSU server. Log in with X11 forwarding enabled (replace user_name with your account name):

ssh -XY -p 10023 user_name@92.63.70.26

Create and enter the working directory:

mkdir myProject 
cd myProject

BDT training:

source /share/shared_data/root/bin/thisroot.sh
root -x -b -q compact_teacher.cpp 2>&1 | tee BDT.log

Write BDT weights to ntuple:

root -x -b -q compact_writer.cpp 2>&1 | tee writer2.log
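
Both ROOT commands above pipe their output through tee, so the exit status the shell sees is that of tee, not of root. The sketch below is one possible way to keep the log and still detect a failed command. run_logged is a hypothetical helper name, not part of ROOT or of this repository, and whether root returns a non-zero status for a given macro error is not guaranteed.

```shell
# Hypothetical helper: run a command, mirror stdout and stderr into a log file,
# and return the exit status of the command itself rather than that of tee.
run_logged() {
    log_file=$1
    shift
    status_file=$(mktemp) || return 1
    {
        cmd_status=0
        "$@" 2>&1 || cmd_status=$?
        # Save the status in a file, because this group runs in a subshell.
        echo "$cmd_status" > "$status_file"
    } | tee "$log_file"
    saved_status=$(cat "$status_file")
    rm -f "$status_file"
    return "$saved_status"
}
```

With the ROOT environment already sourced, the two steps could then be chained as `run_logged BDT.log root -x -b -q compact_teacher.cpp && run_logged writer2.log root -x -b -q compact_writer.cpp`, so that the writer only starts if the training command reported success.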

TRExFitter [2]:

Input histogram production:

First, we need to read the ntuples and turn them into histograms for further use within the framework. To do so, we use the n action (here for the 3l2j region, as an example):

trex-fitter n clear_full_old.config "Regions=three_lep_presel_2jets" | tee trex_n.log

Creating the workspace

Once the histograms have been created (or read), the next step is to produce a workspace containing our fit model:

trex-fitter wfs clear_full_old.config "Regions=three_lep_presel_2jets" | tee trex_w.log

The wfs argument combines three actions:

w - create the RooStats XMLs and workspace

f - fit the workspace

s - calculate significance
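
The wfs string above chains several single-letter actions into one call. As a sketch, the same actions can be run one at a time so that each gets its own log and a failure stops the chain. This assumes that trex-fitter accepts each letter on its own (as the n, d, p and r calls in this README do) and that it returns a non-zero exit status on failure, which is not verified here. run_actions and TREX_CMD are hypothetical names introduced for illustration.

```shell
# TREX_CMD is a hypothetical override hook, so the loop can be dry-run
# with e.g. TREX_CMD=echo on a machine without TRExFitter.
TREX_CMD=${TREX_CMD:-trex-fitter}

# Usage: run_actions CONFIG REGION ACTION [ACTION...]
run_actions() {
    config=$1
    region=$2
    shift 2
    for action in "$@"; do
        echo ">>> action ${action} on region ${region}"
        # Write stdout and stderr to a per-action log; stop at the first failure.
        if ! "$TREX_CMD" "$action" "$config" "Regions=${region}" > "trex_${action}.log" 2>&1; then
            echo "action ${action} failed, see trex_${action}.log" >&2
            return 1
        fi
    done
}
```

For the region used in this README, the call would be `run_actions clear_full_old.config three_lep_presel_2jets w f s`. Unlike the tee-based commands above, this sketch sends the output only to the log files.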

Producing the first plots

Next, we visualize the regions we want to fit. Run the d action to produce pre-fit plots:

trex-fitter d compact.config "Regions=three_lep_presel_2jets" | tee trex_d.log

The Plots/ folder contains plots showing data and MC for each region you defined, as well as summary plots.
The Tables/ folder contains various tables in text or .tex format, showing, for example, the yields per sample and per region.

The uncertainty bands drawn in these plots include the effects of all systematic sources specified in the config; the plot for the 3l2j region is one example.

Producing the post-fit plots

Now it is time to see how well our model describes the data after the fit. We use the p action to produce post-fit plots:

trex-fitter p compact.config "Regions=three_lep_presel_2jets" | tee trex_p.log

Ranking plot

To see which nuisance parameters have the largest impact on the uncertainty of our signal strength, we use the r action (see the TRExFitter README [3] for more information). For this tutorial, you can run the ranking for all nuisance parameters at once:

trex-fitter r compact.config "Regions=three_lep_presel_2jets"

For each nuisance parameter, we perform four fits. The specific nuisance parameter is fixed to one of these configurations per fit:

pre-fit value + pre-fit uncertainty

pre-fit value - pre-fit uncertainty

post-fit value + post-fit uncertainty

post-fit value - post-fit uncertainty
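
The four configurations listed above reduce to simple arithmetic on the pre-fit and post-fit values of the nuisance parameter. The sketch below prints the four values at which the parameter would be fixed. np_fix_points is a hypothetical helper, and the numbers in the example call are made up for illustration, not taken from this analysis.

```shell
# Hypothetical helper: print the four values at which a nuisance parameter
# is fixed in the ranking fits.
np_fix_points() {
    # Arguments: pre-fit value, pre-fit uncertainty, post-fit value, post-fit uncertainty.
    awk -v pre="$1" -v dpre="$2" -v post="$3" -v dpost="$4" 'BEGIN {
        printf "prefit_up %.3f\n",    pre + dpre
        printf "prefit_down %.3f\n",  pre - dpre
        printf "postfit_up %.3f\n",   post + dpost
        printf "postfit_down %.3f\n", post - dpost
    }'
}

# Illustrative numbers only: pre-fit 0 +- 1, post-fit 0.12 +- 0.85.
np_fix_points 0 1 0.12 0.85
```

Each printed value corresponds to one of the four fits performed for that nuisance parameter.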

Good luck!

Repository: OlesyaTSU14/WVZ
