• No results found

Unfolding of multivariate tools and statistical analysis for Higgs boson pair production searches in the ATLAS detector at the Large Hadron Collider

N/A
N/A
Protected

Academic year: 2022

Share "Unfolding of multivariate tools and statistical analysis for Higgs boson pair production searches in the ATLAS detector at the Large Hadron Collider"

Copied!
42
0
0

Loading.... (view fulltext now)

Full text

(1)

Uppsala University

Department of Physics and Astronomy

Master’s Degree Project 30hp

Unfolding of multivariate tools and statistical analysis for Higgs boson pair production searches in the ATLAS detector at the

Large Hadron Collider

Student:

Christina Dimitriadi

Supervisors:

Arnaud Ferrari Petar Bokan

Subject Reader:

Richard Brenner

October 8, 2019

(2)

Abstract

Recently, searches for pair production of Higgs bosons in several final states have been carried out by the ATLAS exeperiment at the Large Hadron Collider (LHC). This study focuses on the search for non-resonant di-Higgs production decaying to a final state with two b-jets and two τ -leptons using 36.1 fb−1of data recorded by the ATLAS detector. The analysis for this process has already been performed. Boosted decision trees (BDTs) are used in the analysis to improve the separation of the signal from background processes and several variables that provide good discrimination between signal and background are used as inputs to the BDT. This study aims to unfold the BDT of the analysis and optimize a cut-based analysis so that the gain from using the BDT can be estimated. Two variables, related to the invariant masses and angular distances of the Higgs boson decay products, are defined and the optimal cuts are found to be Xmτ τmbb < 1.8 and X∆Rτ τ∆Rbb < 4.0. Then, the upper limits on the SM HH production cross section are set when fitting mHH

with the cut-based analysis. An expected limit of 0.78 pb, 23 times the SM prediction is obtained when neglecting systematic uncertainties, compared to the limit of 15 times the SM as recomputed when using the BDT. Comparing the two results, the sensitivity is worsened by 50% when not using the BDT.

(3)

Sammanfattning

Partikelfysik ¨ar den gren av fysiken som studerar de element¨ara best˚andsdelarna i materia och interaktionerna mellan dem. Den kallas ofta h¨ogenergifysik, eftersom m˚anga partiklar skapas och uppt¨acks under energirika kollisioner mellan andra partiklar, s˚asom i partikelacceleratorer. Ett teoretiskt ramverk, Standardmodellen, som formulerades p˚a 1960- och 1970-talet, beskriver materiens byggstenar och deras interaktioner med stor framg˚ang. Standardmodellen inneh˚aller sex leptoner och sex kvarkar, samt fyra bosoner som f¨ormedlar den elektromagnetiska, svaga och starka v¨axelverkan mellan leptoner och kvarkar. En ytterligare v¨asentlig komponent i Standardmodellen ¨ar Higgsbosonen, vars existens f¨orutsp˚addes p˚a 1960-talet och observerades av ATLAS- och CMS-experimenten vid CERN:s Large Hadron Collider (LHC) den 4 juli 2012. Denna element¨ara spin-0-partikel med en uppm¨att massa p˚a 125 GeV genererar alla partiklars massa genom Higgs mekanismen.

Som en f¨oljd av Higgs-bosonens uppt¨ackt har flera studier och m¨atningar av dess egenskaper genomf¨orts f¨or att verifiera huruvida dess beteende st¨ammer ¨overens med Standardmodellens f¨oruts¨agelser. En viktig studie

¨

ar att m¨ata Higgsbosonens v¨axelverkan med sig sj¨alv, som h¨arr¨or fr˚an Higgs mekanismen. Detta kan g¨oras genom att unders¨oka parproduktion av Higgsbosonen, H → HH, vilken kan ge insikt i Higgs-potentialens faktiska form. I detta arbete studeras en analys som s¨oker efter parproduktion av Higgsbosoner som onderfaller till tv˚a τ -leptoner och tv˚a b-kvarkar (HH → bbτ τ ), d¨ar b˚ada τ -leptonerna s¨onderfaller till hadroner. Analysen f¨or detta sluttillst˚and har redan utf¨orts av ATLAS-samarbetet med hj¨alp av en multivariat analysteknik (MVA) som kallas Boosted Decision Tree (BDT) och anv¨ands f¨or att separera signalen fr˚an den relativa bakgrunden.

MVA tekniker har blivit allt vanligare i h¨ogenergifysik, eftersom det finns en st¨andigt ¨okande produktion av stora datam¨angder vilket kr¨aver mer avancerade analystekniker f¨or att utvinna s˚a mycket information som m¨ojligt. Dessa involverar flera storheter eller variabler som vanligtvis ¨ar korrelerade. F¨ordelen med att anv¨anda MVA tekniker ¨ar att de tar h¨ansyn till s˚adana korrelationer. Ibland ¨ar dessa resultat emellertid sv˚artolkade, vilket medf¨or att teoretiker har sv˚art att dra slutsatser av dem. D¨arf¨or syftar denna studie till att omarbeta MVA-analysen genom att inf¨ora en snittbaserad analys s˚a att vinsten fr˚an att anv¨anda MVA kan uppskattas.

Metoderna vi anv¨ander oss av ¨ar avfaltning av BDT:n och granskning av inmatningen till BDT-n f¨or att hitta de optimala snitten s˚a att den st¨orsta delen av bakgrunden tas bort, medan en stor del av signalen

¨

overlever. En rad signifikanstester genomf¨ors f¨oljaktligen med de optimala snitten. ¨Ovre gr¨anser s¨atts sedan a HH-produktionstv¨arsnittet genom anpassning av den invarianta massan hos di-Higgs-systemet, mHH, i signal- och kontrollomr˚aden baserat p˚a de optimala snitten. Den f¨orv¨antade ¨ovre gr¨ansen visar sig vara 23.4 anger Standardmodellens f¨oruts¨agelse vid 95% konfidensniv˚a, utan att beakta systematiska os¨akerheter.

Slutsatsen ¨ar att anv¨andningen av den snittbaserade analysen f¨orv¨arrar k¨ansligheten f¨or HH → bbτ τ med 50%.

(4)

Contents

1 Introduction 5

1.1 Motivation . . . . 5

1.2 Particle physics and the Standard Model . . . . 5

1.3 Shortcomings of the Standard Model . . . . 6

1.4 Beyond the Standard Model . . . . 7

2 Higgs physics 8 2.1 The Higgs mechanism . . . . 8

2.2 Higgs boson pair production . . . . 9

2.3 Higgs boson decay modes . . . . 10

3 The ATLAS experiment at the LHC 11 3.1 The Large Hadron Collider at CERN . . . . 11

3.2 ATLAS . . . . 12

3.2.1 Sub-detector systems . . . . 12

3.2.2 Coordinate system and useful variables . . . . 12

3.2.3 Trigger system . . . . 13

4 Decay of Higgs boson pair to di-τ + di-b final state 14 4.1 Data and simulation framework . . . . 14

4.2 Object reconstruction in the ATLAS detector . . . . 15

4.2.1 Tau-leptons . . . . 15

4.2.2 Bottom quarks . . . . 16

4.2.3 Missing transverse momentum . . . . 17

4.3 Event selection . . . . 17

4.3.1 Background processes . . . . 17

5 Multivariate analysis in particle physics 19 5.1 Boosted Decision Trees . . . . 19

5.1.1 Input variables . . . . 20

5.1.2 Existing upper limits . . . . 21

5.2 Purpose . . . . 21

5.3 Cut-based analysis . . . . 22

5.3.1 Unfolding the BDT . . . . 22

5.3.2 Fitting the BDT score distribution . . . . 26

5.3.3 Fitting the mHH distribution . . . . 26

5.3.4 Definition of variables . . . . 27

5.3.5 Optimization of cuts . . . . 28

6 Results and discussion 33

7 Conclusion 35

List of figures 36

List of tables 37

(5)

References 38

(6)

Chapter 1

Introduction

1.1 Motivation

Particle physics seeks to understand the fundamental nature of the Universe and the building blocks of the matter created after the Big Bang. Although the underlying theory has been very successful, it only describes 4% of the known Universe. Hence, scientists and experimental laboratories all over the world are in constant search for answers to questions about dark matter, dark energy, anti-matter and more, in order to push back the frontiers of knowledge. The discovery of a Higgs boson by the Large Hadron Collider (LHC) at CERN in 2012 [1, 2] was a big achievement for the particle physics society. Since then, measurements of its properties and behavior have been carried out. Searches for the production of pairs of Higgs bosons are very interesting and valuable since they can provide insight to the Higgs boson self-coupling, which is promptly linked to the shape of the Higgs potential in the Standard Model.

The aim of this project is to re-interpret the recently published result of the bbτhadτhad final state in the search for Higgs boson pair production by the ATLAS experiment at the LHC, suggest and implement an alternative cut-based analysis and compare the results [3].

1.2 Particle physics and the Standard Model

Particle physics is the field studying the fundamental constituents of matter in the Universe and the forces between them. These are successfully depicted in a theoretical framework, the Standard Model (SM), which was formulated in the 1960s and 1970s. The SM of particle physics is based on a gauge quantum field theory (QFT) describing the unified electroweak (EW) interaction and quantum chromodynamics (QCD), based on a gauge symmetry group of SU (3)C× SU (2)L× U (1)Y.

The SU (3)C group is associated to QCD, describing the strong force. The index C stands for the color charge, which is a property of the strong interactions. There are three colors, red, green and blue (r, g, b) followed by their anti-colors (¯r, ¯g, ¯b). The SU (2)L× U (1)Y group is associated to the electroweak theory, where the SU (2)L group describes the interaction between left-handed (L) particles with the SU (2) gauge fields and U (1)Y is associated to the hypercharge Y , which is the conserved quantity in the unified electroweak interaction and is defined as

Y = 2(Q − I3W) (1.1)

with Q being the electrical charge and I3W = ±1/2 the third component of the weak isospin [4].

The SM contains three generations of fermions (half integer spin) and four force carriers, the gauge bosons (integer spin), mediating the three fundamental forces. The electromagnetic force is carried by the photon (γ), while the W± and Z bosons are the mediators of the weak interaction and the gluon (g) carries the strong interaction. Photons and gluons are both massless and electrically neutral, while the W± and Z bosons are quite heavy compared to other elementary particles and carry an electrical charge of ±e. Gluons, unlike other gauge bosons, carry a color charge.

Fermions are divided into leptons (l) and quarks (q) and they are called matter particles. The lepton family consists of three charged leptons, the electron e, the muon µ and the tau-lepton τ , their anti-particles as well as the corresponding electrically neutral neutrinos νe, νµ and ντ. Quarks, like leptons, are divided in three

(7)

generations, each one consisting of a +2/3 charged quark: up (u), charm (c), top (t), and a −1/3 charged quark: down (d), strange (s), bottom (b). They all have their own anti-particles and they also carry a color charge. Quarks form hadrons, which are color-neutral particles, containing either a quark and an anti-quark (mesons) or three quarks (baryons) [5].

In addition to the matter particles and the force carriers, an essential component of the SM is the Higgs boson, the existence of which was predicted in the 1960s [6, 7, 8]. It was observed by the ATLAS and CMS experiments at CERN’s Large Hadron Collider (LHC), as announced on 4th July 2012 [1, 2]. The Higgs boson is an elementary spin-0 particle with a measured mass of 125 GeV1. A summary of the SM particles is shown in Figure 1.1.

Figure 1.1: The fundamental particles in the Standard Model [9].

1.3 Shortcomings of the Standard Model

The SM is undoubtedly a very successful theoretical model to describe particle physics. However, despite its consistency to nearly all experimental results and its accurate predictions of various phenomena, it cannot account for some observations [10].

One downside of the SM is the fact that it incorporates only three of the four fundamental forces in the Universe, excluding the weakest one of them, gravity. Besides, there is a hierarchy problem which cannot be explained by the SM, and it has to do with either the large spanning of masses in the SM or the fine-tuning needed to stabilise the Higgs boson mass.

There is no interpretation of why the Higgs boson mass is much smaller than other energy scales and in particular the scale of gravity,

mh' 125 GeV  MP lanck= 1

G= 1019GeV . (1.2)

Moreover, the SM cannot account for the matter-antimatter asymmetry happening in the Universe. After the Big Bang equal amounts of matter and antimatter should have been created, yet today the Universe consists mostly of matter. Hence, physicists are studying this imbalance through experiments in order to discover the reasons why it exists.

Another flaw of the SM is the lack of an explanation for the existence of dark matter that astrophysical observations have validated. Only 4% of our Universe is made of baryonic matter, while 21% is dark matter and the remaining 75% is dark energy. Nonetheless, no particle candidates for these phenomena can be provided by the SM.

1In this thesis natural units are chosen, where c = ~ = kB= 1.

(8)

Last, in the SM, neutrinos are massless particles. In 1998 it was discovered that neutrinos do have a very small, non-zero mass, giving rise to neutrino oscillations, which again contradicts the SM [11].

1.4 Beyond the Standard Model

Remaining unanswered questions and phenomena seeking explanation have motivated to extend the SM and develop theories beyond the Standard Model (BSM) in order to completely understand the Universe.

Possible extensions of the SM are based on the principle of Supersymmetry (SUSY). The main point of SUSY is the prediction of a partner for each particle in the SM, which is rather essential since these new particles would stabilise the mass of the Higgs boson. The extra particles predicted by Supersymmetry would make a light Higgs boson possible and would interact through the same forces as SM particles. The SUSY links fermions and bosons since it predicts that each of the SM particles has a partner with a spin that differs by half of a unit. However, SUSY must be broken to allow the supersymmetric partners to acquire different masses than the SM fermions and bosons to which they are associated.

The simplest supersymmetric extension of the SM is the Minimal Supersymmetric Standard Model (MSSM).

Its Higgs sector must be described by the Two Higgs Doublet Models (2HDM). In comparison to the SM, an additional complex doublet and thereby four new Higgs bosons must be introduced [12].

It is noteworthy to mention that there are many more possible theoretical extensions of the SM and SUSY is only one of them.

(9)

Chapter 2

Higgs physics

2.1 The Higgs mechanism

The mechanism responsible for explaining how particles obtain mass was developed by Robert Brout, Fran¸cois Englert, Peter Higgs, Gerald Guralnik, Carl Hagen and Tom Kibble [6, 7, 8]. A more detailed review of the subject can be found in [13]. As mentioned before, the SM is based on a QFT, where all fundamental particles are described as excitations of quantum fields. In QFT each non-interacting field is described by a Lagrangian density of the form

L = Lmass+ Lkin (2.1)

However, having the Lmass term in the Lagrangian density would violate gauge invariance. This problem is solved by a spontaneous electroweak symmetry breaking (EWSB), resulting in the attribution of mass to each particle. The Higgs mechanism postulates the existence of a complex scalar doublet

Φ(x) =+(x) φ0(x)



= 1

2

+1(x) + iφ+2(x) φ01(x) + iφ02(x)



, φi∈ R (2.2)

with a corresponding potential

V (Φ) = µ2ΦΦ + λ(ΦΦ)2 (2.3)

where λ > 0 and µ2< 0. Due to those bounds on the parameters, the Higgs potential, as seen in Figure 2.1, does not have a minimum at Φ = 0 and therefore the vacuum expectation value (vev, v) is shifted to a non-zero value for the ground state,

v = r−µ2

λ . (2.4)

After expanding Φ around v and choosing a unitary gauge, Equation (2.2) can be rewritten as Φ(x) = 1

2

 0

v + H



, (2.5)

where H is an electrically neutral scalar field, later identified with the Higgs boson. By spontaneously breaking the symmetry SU (2)L× U (1)Y to U (1)Q, three massless gauge fields of the SU (2) are absorbed by the W± and Z bosons which in turn get their masses. The fermion masses are also generated through the Yukawa coupling, via the interaction with the ”Higgs field”. The electromagnetic U (1)Q symmetry stays unbroken, hence the photon remains massless.

All in all, this Higgs potential, V (Φ), introduced in the Lagrangian density, adds couplings of the Higgs field to the gauge fields, to fermions and to itself, generating mass terms for all the particles that the Higgs field couples to [5, 15].

(10)

Figure 2.1: The Higgs potential, in the shape of a Mexican hat, with the minimum energy not at Φ = 0 [14].

2.2 Higgs boson pair production

The discovery of a particle consistent with the SM Higgs (H) boson by the ATLAS and CMS experiments at the LHC in 2012 has been followed by numerous studies and measurements of its properties, which aim to verify if its behavior is in agreement with the SM predictions. However no significant deviation has been found up until now.

Measuring the Higgs boson self-interactions that arise from the Higgs mechanism is an important next step of testing the EWSB. After the spontaneous symmetry breaking the Higgs potential takes the following form:

V (Φ) = µ2Φ2+ λΦ4

Φ→1 2(v+H)

−−−−−−−−−→ V = V0+ λv2H2+ λvH3+λ

4H4, (2.6)

where v = µ/

λ, λ = m2H/(2v2) and V0 is a constant term. The second term in the scalar potential is the mass term, 12m2HH2, and the other two are trilinear and quartic Higgs boson self-coupling terms. In order to further test the SM, experimental measurements of the Higgs boson self-coupling are necessary. This can be done by probing the Higgs boson pair production, H → HH.

The dominant production mode for a Higgs boson pair at the LHC is gluon-gluon fusion (ggF). Figure 2.2 shows three Feynman diagrams for Higgs boson pair production at leading order (LO)2. In the first one (a), often called the box diagram, a Higgs boson pair is produced via the Higgs-fermion Yukawa interaction (mostly top-quarks), while in (b), often referred to as the triangle diagram, the Higgs boson pair production is mediated by an off-shell Higgs boson. Due to the fact that the box and triangle diagrams interfere destructively, the production cross section in the SM turns out to be very small, namely σHH = 31.05 fb at 13 TeV, about three orders of magnitude less than for the single-Higgs boson production rate [16]. Enhancements of the Higgs boson pair production cross section may occur in various BSM scenarios, introducing new couplings or deviations of the Yukawa- or self-coupling constants from their SM values. Another BSM scenario corresponding to Figure 2.2(c) shows a resonant Higgs boson pair production through the decay of an intermediate state X, which can be for example a spin-2 graviton or a scalar resonance.

Figure 2.2: Leading-order Feynman diagrams for Higgs boson pair production in the SM via gluon-gluon fusion, (a) through a heavy-quark loop, (b) through the Higgs boson self-coupling. In (c) Higgs boson pairs are produced through an intermediate heavy resonance in a BSM scenario.

This study covers only the non-resonant SM Higgs boson pair production mode.

2The LO implies a diagram corresponding to the lowest order in couplings at which a process can occur.

(11)

2.3 Higgs boson decay modes

In general, Higgs bosons can decay to all massive SM particles. The branching ratios of the various decay channels of the Higgs boson, along with their uncertainties, as a function of the Higgs boson mass are illustrated in Figure 2.3. Next, Table 2.1 shows the SM expected branching ratios at mH= 125 GeV for the most relevant decay channels.

[GeV]

M

H

80 100 120 140 160 180 200

Higgs BR + Total Uncert

10

-4

10

-3

10

-2

10

-1

1

LHC HIGGS XS WG 2013

b b

τ τ

µ µ c c

gg

γ

γ Zγ

WW

ZZ

Figure 2.3: SM Higgs boson decay branching ratios and their total uncertainty as a function of its mass [17].

Table 2.1: The branching ratios and their uncertainty for a SM Higgs boson with mass mH = 125 GeV [18].

Decay Channel Branching Ratio Uncertainty H → b¯b 58.4 % +3.2%−3.3%

H → W+W 21.4% +4.3%−4.2%

H → τ+τ 6.3% +5.7%−5.7%

H → ZZ 2.6% +4.3%−4.1%

H → γγ 0.23% +5.0%−4.9%

The final states of a Higgs boson pair are numerous, however the one of interest for this study is HH → +τ. This decay process has a relatively clean final state compared to the most probable HH → b¯bb¯b and a relatively large branching ratio (7.4%) compared to HH → b¯bγγ, which has an even cleaner signature due to an excellent di-photon mass resolution. A statistical combination of searches for Higgs boson pairs in the above three decay channels can be found in Ref. [19]. The combined observed (expected) limit on the non-resonant Higgs boson pair cross section set by the ATLAS Collaboration is 0.22 pb (0.35 pb) at 95%

confidence level, corresponding to 6.7 (10.4) times the predicted SM cross section.

(12)

Chapter 3

The ATLAS experiment at the LHC

3.1 The Large Hadron Collider at CERN

The Large Hadron Collider (LHC) is the largest and most powerful existing particle accelerator. It consists of a 27 kilometer ring installed in a tunnel built at CERN in Geneva. Protons or ions are accelerated in opposite directions inside this ring and collide in four points at the intersection of the two beams. Their collision at energies in the TeV scale allows both the probe of new physics and precise measurements of known phenomena, enhancing our understanding of the fundamental constituents of matter in the Universe.

The first data-taking period (Run I) was performed between 2010 and 2012, while the collision energy achieved was

s = 7 − 8 TeV. The recorded integrated luminosity3 was 5/fb in 2011 and 23/fb in 2012. A long shut down followed from 2013 to 2015. Then the LHC energy increased to

s = 13 TeV. Run II was recently completed with a recorded integrated luminosity of nearly 140/fb being reached. Right now the LHC has stopped its operation for a second shut down so that beam optics near the interaction point are upgraded allowing for higher luminosities to be achieved.

Two general purpose experiments, ATLAS and CMS, one dedicated to studies of b-quarks, LHCb, and one dedicated to studies of quark-gluon plasma, ALICE, are the four major experiments at the LHC, as seen in the overview of Figure 3.1.

Figure 3.1: Overall view of the LHC experiments [20].

3The integrated luminosity, L, is a measure of the number of collisions that have happened over a period of time, usually expressed in fb−1.

(13)

3.2 ATLAS

The ATLAS (A Toroidal LHC Apparatus) detector is one of the general-purpose detectors at the LHC designed to probe p − p collisions and study the physical processes arising from them by measuring a wide spectrum of signatures. It is 25 m tall and 44 m long, with an overall weight of approximately 7000 tons.

A barrel part and two end-caps make up the detector, which is practically hermetic. In this way, the energy of neutrinos and weakly interacting particles, predicted by BSM theories, can be reconstructed and observed through the missing transverse energy. As can be seen in the layout of ATLAS in Figure 3.2, it consists of different sub-detector systems, each providing some level of particle identification and kinematic measurement.

3.2.1 Sub-detector systems

An inner detector (tracker) is used for measuring the charged particle momenta by reconstructing their trajectories and it is also designed to detect secondary vertices from short-lived particles. Closest to the beam, it is placed inside a 2 T magnetic field, created by a solenoid magnet surrounding it, making the charged particles bend. The identification of electrons/positrons and photons as well as the measurement of their energy is performed by an electromagnetic (EM) calorimeter. In addition, a hadronic calorimeter, installed outside the EM calorimeter, is used for the energy measurement of charged and neutral hadrons.

The calorimeters measure with high precision and granularity the energy, position and shower shape of electrons, photons and hadronic jets. The calorimeter system is able to disentangle the electrons/positrons and photons from showers coming from high-pThadrons, τ -lepton decays and QCD-induced backgrounds up to an energy scale of TeV. Last, surrounding the hadronic calorimeter, there is a muon spectrometer used for muon identification, together with the track left in the inner detector. It is necessary to have this detector system, since muons escape the calorimeters usually undetected. The ATLAS muon spectrometer consists of three superconducting air-core toroids to create a strong magnetic field, necessary in order to bend the muon trajectories and thereby measure their charge and transverse momentum. The ATLAS detector and its predicted performance are reviewed in detail in Ref. [21].

Figure 3.2: Cut-away view of the ATLAS detector [21].

3.2.2 Coordinate system and useful variables

ATLAS follows a cartesian coordinate system with its origin being the nominal collision point. The z-axis defines the beam line direction, while the x − y plane is transverse to the beam with the x-axis pointing towards the center of the LHC ring and the y-axis pointing upwards. The azimuthal angle φ defines the angle around the beam axis, while the polar angle θ is measured from the beam axis. At a proton-proton collider, the longitudinal momentum of the partons in collision is unknown, therefore it is necessary to define kinematic variables which are invariant under longitudinal boosts. Hence, the polar angle θ can be replaced by the pseudorapidity, defined as η = − ln(tan(θ2)). Another useful variable is the distance between

(14)

two variables in the η − φ plane, defined as ∆R = p

∆η2+ ∆φ2. Last, two commonly used variables, also invariant under longitudinal boosts are the transverse energy, ET, and transverse momentum, pT=q

p2x+ p2y. Table 3.1 shows some of the performance goals and characteristics of the ATLAS detector, such as the designed resolution of the EM and hadronic calorimeters. The expected momentum resolution for the tracking system and the muon spectrometer in specific pseudorapidity coverage is also reported.

Table 3.1: Performance of the ATLAS detector. The units of E and pTare GeV [21].

3.2.3 Trigger system

A challenge of the LHC experiments is that the event production rate is too high to keep all events. In fact, according to Ref. [22], ATLAS is designed to observe up to 1.7 billion proton-proton collisions per second, while having a data volume of more than 60 million megabytes per second. At the same time the majority of the events are not interesting, thus the trigger system is used so that the data flow is reduced to a controllable level. The role of the trigger is to make the online selection of particle collisions potentially containing interesting physics. Trigger menus are used, which determine what events the system is supposed to accept.

In Run II, the ATLAS trigger system performs the selection process in two stages. The Level-1 (L1) trigger, based on custom hardware, uses a subset of information from the muon spectrometer and the calorimeter systems and defines so-called Regions-of-Interest (ROI), which in turn are passed to the next level trigger.

The L1 trigger searches for signatures of electrons/photons, high-pT muons, hadronic decays of τ -leptons, jets, as well as events with high missing transverse energy. A maximum of 100,000 events per second can be passed from the L1 trigger on to the High-Level Trigger (HLT).

The HLT is a software based trigger, which improves the analysis of the L1 trigger. It makes use of the full granularity and precision data coming from the calorimeters, the inner tracker and the muon spectrometer or handles data in smaller and isolated regions of the detector. Thus, it is able to trigger on more complex signatures, like secondary vertices. The HLT selects approximately 1000 events per second and assembles them into an event record. Finally, these events are allocated to a data storage system for offline4 analysis.

4Offline analysis refers to the analysis which is performed on permanently stored events, i.e. after the processing by the trigger system.

(15)

Chapter 4

Decay of Higgs boson pair to di-τ + di-b final state

The published analysis of Ref. [3] searching for Higgs boson pair production in a final state with two b-quarks and two τ -leptons considers two decay sub-channels, τlepτhad and τhadτhad. The subscripts, lep = lepton and had = hadron, denote the decay mode of the τ -lepton. The study presented in this report focuses on the search for SM pair production of Higgs bosons in the b¯+τ final state, where both τ -leptons decay in a hadronic mode (τhadτhad).

4.1 Data and simulation framework

This analysis uses proton-proton collision data recorded in 2015 and 2016 by the ATLAS experiment at the LHC with a center-of-mass energy of

s = 13 TeV, corresponding to an integrated luminosity of 36.1 fb−1. The data samples used for the analysis have previously gone through a process chain, illustrated in the left part of Figure 4.1. First, as long as the data taking is running, the ATLAS trigger system decides which events get written to the disk. As it has already been mentioned in subsection 3.2.3, the LHC delivers events with a rate of up to 40 MHz, while ATLAS can afford to record up to 1 kHz, meaning that only one in 40.000 events can be kept. Hence, a clear-out among the events should be carried out by the two-step mechanism of the ATLAS trigger system, so that all interesting events are stored. Then, the output of the trigger, raw data, is organised into streams and used for reconstruction. The output produced by the reconstruction is AOD, which stands for analysis object data. The next step is the production of derived AOD (DAOD) files that are much smaller and contain the information specific to a targeted final state.

Figure 4.1: The ATLAS data (left) and MC samples (right) processing chain [23].

Apart from the data samples, it is necessary to have Monte Carlo (MC) simulation samples for the signal and background processes. Several steps are included in the production of MC simulated samples, such as

(16)

event generation, parton showering, hadronization and detector simulation. These steps are presented in the right side of the sketch in Figure 4.1. The interactions between quarks and gluons in the proton-proton collisions are simulated during the event generation, while the interactions of outgoing particles, arising from the generator, with the detector material are computed during the detector simulation: the deposited energy in each sensitive element of the detector is determined. During the digitisation, the simulated energy deposits are turned into a detector response, looking similar to the raw data from the real detector. After this step, the process is the same as for real data.

The signal and background samples used in this thesis are official ATLAS samples and have been used in the published analysis. The SM HH signal sample is simulated with MadGraph5 [24], while parton showers and hadronization are simulated with Herwig++ [25]. Table 4.1 shows how the different background MC samples are generated. All the plots presented in this thesis are edited with ROOT [26] .

Table 4.1: Event generators for the different background samples.

Background Processes Event Generators Parton Shower Simulation t Powheg-Box [27], MadSpin [28] Pythia 6 [29]

Single-top quark Powheg-Box, MadSpin Pythia 6

Z + jets Sherpa 2.2.1 [30]

W + jets Sherpa 2.2.1

Diboson Sherpa 2.1.1

Quark-induced ZH Pythia 8 [31]

Gluon-induced ZH Powheg Pythia 8

ttH MadGraph5 Pythia 8

4.2 Object reconstruction in the ATLAS detector

4.2.1 Tau-leptons

Tau-leptons are the heaviest of the three charged leptons in the SM with a mass of 1776.86 ± 0.12 MeV and decay almost immediately with a mean life time of 290.3 fs [18]. Therefore, τ -leptons can only be reconstructed by their decay products, since they ”disappear” before reaching any of the detector components. They are the only leptons heavy enough to decay both in lighter leptons and hadrons. As can be seen in the Feynman diagram of Figure 4.2, the τ -lepton decays into a neutrino ντ and an off-shell W boson, where the W boson further decays into either lνl, where l is electron or muon, or two quarks. The former case is referred to as the leptonic decay mode and the latter as hadronic. The leptonic decay mode has a branching fraction of around 35%, while the hadronic decay mode is more common with a branching fraction of around 65%.

Figure 4.2: Feynman diagram of τ -lepton decays by an emission of an off-shell W boson.

(17)

Hadronic τ -leptons

Hadronically decaying τ -leptons, τhad, usually create final states of one or three charged pions π± and some number of π0. Depending on the number of charged tracks left in the inner detector, the hadronically decaying τ -leptons are referred to as 1-prong or 3-prong if they decay in one or three charged pions respectively. The most common 1-prong final state is τ→ ππ0ντ.

The τhadform jets in the calorimeter of the ATLAS detector at the LHC, which then have to be differentiated from quark- and gluon-initiated jets that are dominating at hadron collider experiments. The most essential difference is that τhadhas a more narrow, collimated shape and lower charged track multiplicity (one or three tracks) in the inner detector. QCD-induced jets, in general, have a wider shape with more particles, which among other characteristics, is used to discriminate τ decays from other jets, as shown in Figure 4.3(a,b).

The τhad candidates are reconstructed as jets in the ATLAS detector using the anti-kt algorithm [32] with a radius parameter R = 0.4 and calorimeter energy clusters as inputs. These jets are used as seeds for the reconstruction of visible products of hadronically decaying τ -leptons, τhad−vis, which are required to have one or three tracks. Boosted decision trees (BDTs)5 [33] are trained in order to distinguish τhad−vis candidates from QCD-induced jets.

Another challenge in the reconstruction of τ -leptons from detector objects is the existence of neutrinos, which escape detection. Many methods have been developed for the mass reconstruction of resonances decaying to τ -leptons. In the ATLAS experiment, the most usual method for calculating the invariant mass of particles that decay into two τ -leptons is the Missing Mass Calculator (MMC) [34].

4.2.2 Bottom quarks

Bottom quarks, with a mass of approximately 4.18 GeV [18], form B-hadrons that have sufficiently long lifetime to travel some distance before decaying. This lifetime on the other hand is short enough so that they decay before reaching the inner detector.

b-jets

Due to sufficiently long lifetime of B-hadrons, there is a displacement of their decay (secondary) vertex with respect to the primary vertex (interaction point), which can be reconstructed through the extrapolation of charged particle tracks. The decay products of the B-meson further form jets. Then, the presence of a secondary vertex within these jets indicates a presence of a B hadron and they are referred to as b-jets.

The identification of heavy-flavor jets is also based on some impact parameter information. As can be seen in Figure 4.3(c), the transverse impact parameter, d0, of a track is the closest distance between the track and the primary vertex in the plane perpendicular to the direction of collision. The longitudinal impact parameter, L0, is the aforementioned closest distance in longitudinal direction. Similarly to hadronically decaying τ -leptons, b-jets are reconstructed using the anti-kt algorithm with a radius parameter R = 0.4.

Then, they are identified using the MV2c10 multivariate discriminant, also based on BDTs [35, 36].

(a) (b)

(c)

Figure 4.3: Comparison of τ -jets and b-jets with regular jets [37].

5See Chapter 5.

(18)

4.2.3 Missing transverse momentum

The missing transverse momentum (and its magnitude ETmiss) account for the momentum imbalance in the plane transverse to the beam axis. Its measurement can allow the indirect detection of neutrinos. In this analysis, EmissT is defined as the magnitude of the negative vector sum of all reconstructed, calibrated objects together with a track-based ”soft” term (TST), which is reconstructed from the transverse momentum deposited in the detector, but not associated with any of the aforementioned identified objects [38].

4.3 Event selection

Following the reconstruction of detector objects, an event selection is applied so that the selected final state consists of two τhad−vis candidates of opposite charge satisfying the ”medium” identification (ID) criteria, along with two b-tagged jets and EmissT . A BDT classification is applied on these events in order to separate the signal from background processes.

Only events that pass the single τhadtriggers (STT) and di-τhadtriggers (DTT) are used. The event selection criteria depend on the trigger category. A minimum pT of 40 GeV for DTT events is required for the leading τhad−vis candidate, while for STT events the minimum pT ranges between 100 and 180 GeV6. The sub-leading7 τhad−viscandidate is required to have a pT threshold of 20 GeV for STT and 30 GeV for DTT events. The selection also requires at least two jets, with the leading one having pT> 45 GeV for STT events, while for DTT the threshold is increased to 80 GeV, due to a L1 trigger requirement of the presence of a jet in the event. The minimum pTrequirement for the sub-leading jet in both cases is 20 GeV. In addition, the invariant mass of the di-τ system, mMMCτ τ , which is calculated using the Missing Mass Calculator, is constrained to be higher than 60 GeV. Finally, the signal region (SR) is defined by selecting events that meet the above requirements and have exactly two b-tagged jets. The event selection is summarized in Table 4.2.

Table 4.2: Signal region event selection for the bbτhadτhad decay channel [3].

Preselection (had-had channel)

Single-τhadtrigger (STT) Di-τhad trigger (DTT) exactly 2 opposite-sign τhad’s

pτT1(2) > 100 − 180 (20) GeV pτT1(2) > 40 (30) GeV

≥ 2 central jets

pT> 45, 20 GeV pT> 80, 20 GeV mMMCτ τ > 60 GeV

2 b-jets

4.3.1 Background processes

The dominant background processes in the bbτhadτhad channel are t¯t, Z → τ τ produced in association with heavy flavour jets and multijet events.

Background processes including real τ -lepton decays are derived from simulation, while data-driven methods are used to estimate backgrounds where jets are misidentified as τhad−vis objects. In particular, t¯t events, where one or more reconstructed τhad decays are misidentified quark- and gluon-initiated jets, often referred to as fake-τhad background, are estimated in a semi-data-driven way. A set of probabilities for a jet to be misidentified as a ”medium” τhad−vis (fake rates) is measured in data in a control region enriched in fake τhad−vis candidates with some very loose τ -ID criteria applied. These fake rates are then applied to all

6The pTthreshold varies due to different trigger thresholds corresponding to different data-taking periods.

7The leading τhad−visjet is the one with the highest transverse momentum, while sub-leading refers to that jet with the second highest transverse momentum.

(19)

non-truth-matched τhad−vis candidates with the same identification criteria to estimate the fake ”medium”

τhad−viscontribution from t¯t events in the SR.

A data-driven method is used to estimate the multijet background. It is assumed that the probability for a jet to fake a τhad−vis candidate is the same in the region where two τhad−vis objects have the same electric charge (SS) and the region where they have opposite electric charge (OS). In both regions all non-multijet backgrounds are subtracted from data. A set of fake factors (FFs) are measured in the SS region as the ratio of the number of events where both τhad−vis candidates pass the ”medium” τ -ID criteria to the number of events where at least one of them fails these criteria. These FFs are then applied to all events where at least one τ candidate fails the ”medium” ID in the OS region to obtain the multijet estimate in the SR.

A large background contribution is coming from processes involving the decay mode of the Z-boson in association with heavy-flavor jets, Z → τ τ +(bb, bc, cc). The jet emission is considered to be not well-modelled by the simulation, hence these processes are normalised to data in a control region. The production of jets should not depend on the decay mode of the Z-boson, hence Z → µµ + (bb, bc, cc) jet events are selected from data. A dedicated control region is included in the final fit, so that the normalisation is derived.

(20)

Chapter 5

Multivariate analysis in particle physics

In high energy physics experiments, there is an ever-increasing production of large data, which is followed by the demand for advanced analysis techniques, so that it is possible to extract the maximal amount of information from the data. These typically involve multiple quantities or variables, like transverse momentum, energy deposited in calorimeters, kinematic variables, and other more complicated variables aimed to describe characteristics of a specific final state. Since many variables can be correlated, the benefit from using multivariate techniques is that they take into account such correlations [39]. Multivariate analysis (MVA) methods, based on machine learning8, are widely used in particle physics, mostly for signal and background classification. For this reason, a Toolkit for Multivariate Analysis (TMVA) [33], has been developed and integrated into the analysis framework ROOT. The TMVA includes different algorithms for multivariate classification, for instance Boosted Decision Trees (BDTs).

5.1 Boosted Decision Trees

The multivariate analysis technique of BDTs is widely used by the ATLAS Collaboration. A decision tree is a binary classifier in the shape of a tree, as illustrated in Figure 5.1, consisting of nodes and leaves.

Decision trees (DTs) consist of recursive left/right decisions which determine the classification of data, based on sequential cuts applied on a single variable at each time. At each step in the sequence, the best cut is looked for and used to split the data until in the end a terminal criterion is fulfilled. As shown in Figure 5.1, all decisions begin from an initial, root node, and each leaf gets assigned one of two classes, like for example signal or background, consequent to a binary decision taken at each node. The splitting of data stops when impurity cannot be further reduced. A measure of impurity is the Gini index, which is given by

Gini = (s + b)P (1 − P ) = sb

s + b, (5.1)

with P = s/(s + b) being the signal purity and s, b referring to the signal and background events respectively.

In the end, all events are assigned to one of the classes and are given a score, which usually ranges between -1 and +1. Background-like events get a score near -1, while signal-like events have score close to +1.

One shortcoming of a single decision tree is that its performance drops rapidly, if the dataset has a complex structure. However, a good solution is boosting, where the classification is repeated by creating many single trees, but the misclassified events in one tree are given a larger weight when the next tree is created. In more detail, the training procedure of a BDT includes building multiple decision trees and putting them in a forest to yield a combined score. This lets BDTs have an enhanced separation performance and stability compared to single DTs. For each individual tree, the best cut for each variable xi is found, so that the separation between signal and background is maximum. The process is repeated for each node. Next, misclassified events are checked for one classifier, and a higher weight is assigned to them. The next classifier, in turn, is trained on the reweighted dataset, and the procedure is repeated until all classifiers are trained. The weighted sum of all classifiers gives the final score.

8Machine learning is the study of algorithms that learn from data, and then provide predictions for future data [39].

(21)

Figure 5.1: Illustration of a binary decision tree [33]. Each branch of the decision tree represents a sequence of cuts, which classifies an event as signal (S) or background (B).

5.1.1 Input variables

The TMVA is used for the training and evaluation of various BDTs. Several variables that provide good discrimination between signal and background are used as inputs to the BDT, and are listed in Table 5.1 for the τhadτhad search channel [3]..

Table 5.1: Variables used as inputs to the BDTs for the non-resonant di-Higgs signal in the τhadτhad search channel [3].

Variables - τhadτhad channel mHH

mMMCτ τ mbb

∆R(τ, τ )

∆R(b, b) EmissT φ centrality

Explicitly, in the order they appear in Table 5.1, mHH is the invariant mass of the di-Higgs system, which is reconstructed from the di-τ and di-b systems. Then, mMMCτ τ is the invariant mass of the di-τ system, calculated using the Missing Mass Calculator, and mbb is the invariant mass of the di-b-jet system. The distance evaluated between two τhad−vis decay products and the one between two b-jets are referred to as

∆R(τ, τ ) and ∆R(b, b) respectively. The ETmissφ centrality9 is a measurement of the angular position, φ, of the missing transverse momentum, ETmiss, with respect to the visible decay products of the two τ objects. Its definition is given by:

ETmissφ centrality = A + B

A2+ B2, (5.2)

where A and B are defined as:

A = sin(φEmiss

T − φτ2)

sin(φτ1− φτ2) , B = sin(φτ1− φEmiss

T )

sin(φτ1− φτ2) (5.3)

9In the plots below, it is referred to as MET φ centrality.

(22)

The ETmissφ centrality should be equal to:

2, when the EmissT is exactly between the two τhas−vis objects,

• 1, if the ETmisslines up with either of the τhas−vis objects,

• less than 1, if the ETmisslies outside of the angular region that the two τhas−vis objects form.

The BDT for the τhadτhad channel is trained against the dominant backgrounds, namely t¯t, Z → τ τ and multi-jet events. The training of the BDT has already been performed by the ATLAS Collaboration and the results can be found in Ref. [3]. In Figure 5.2, one can see the BDT score distribution using the SM HH signal in the training for the τhadτhad channel.

BDT score

1 0.8 0.60.4 0.2 0 0.2 0.4 0.6 0.8 1

Events / Bin

1 10 102

103

104

Data

NR HH at exp limit Top-quark

fakes (Multi-jets) τhad

jet

+(bb,bc,cc) τ τ

Z

) t fakes (t τhad

jet Other SM Higgs Uncertainty Pre-fit background ATLAS

13 TeV, 36.1 fb-1

2 b-tags τhad

τhad

BDT score

1 0.8 0.60.4 0.2 0 0.2 0.4 0.6 0.8 1 Data/Pred. 0.5

1 1.5

Figure 5.2: Distribution of the BDT score for non-resonant di-Higgs signal in the τhadτhad channel, taken from the published analysis of Ref. [3]. The signal is scaled to the expected limit. The lower panel shows the ratio of the data to the sum of the backgrounds.

The distribution of the BDT score is shown after a profile-likelihood fit [40] with the background-only hypothesis. The binning used for the fit is determined after fulfilling two conditions. First, the pre-fit relative background statistical uncertainty in each bin of the BDT score should be less than 0.5 times the signal fraction in that bin, and second, the minimum number of expected background events in each bin should be five [41]. Figure 5.2 shows a transformed BDT score after rebinning.

5.1.2 Existing upper limits

Next, upper limits are set on the non-resonant Higgs boson pair-production at 95% confidence level (CL) using the CLs method [42]. The published analysis of Ref. [3] sets upper limits on the cross section for non-resonant HH production times the branching ratio of HH → bbτ τ . These results, which pertain to both sub-channels, i.e. τlepτhadand τhadτhad, and their combination, are presented in Table 5.2. The limits take into account both systematic and statistical uncertainties. The cross section times branching ratio for non-resonant Higgs boson pair-production is constrained to be less than 30.9 fb, corresponding to 12.7 times the SM expectation, at 95% CL for the combination, representing the most stringent limit on non-resonant HH set by an individual channel at the LHC. For the remainder of the thesis, we focus on the limit corresponding to the τhadτhad channel. In that case, still with both systematic and statistical uncertainties, the observed (expected) limit is 40.0 fb (42.4 fb), 16.4 (17.4) times the SM expectation.

5.2 Purpose

The aim of this study is the rescoping of the multivariate analysis by introducing a cut-based analysis, in order to estimate the gain from using the MVA technique. Despite its limitations, a cut-based analysis offers

References

Related documents

According to the empirical examples, one strategy when knowledge (the property) is circulated and used in problematic, misinformed ways (as by patients) is to share

This thesis project presents a research through design process that has aimed to explore how we can design interactive technologies that can support mindfulness practices

I juni 2013 blev der udsendt et spørgeskema (Elbe, 2013) til alle 98 danske kommuner adresseret til de respektive kommunaldirektører. I følgebrevet blev adressaten bedt om at

perioden hos barnmorskan var mycket viktigt för kvinnorna, men många av dem tyckte att besöket var för kort, de ville diskutera de fysiska förändringarna efter förlossningen men

The direct impact of armed conflict on Syrian children education is due to the following factors: direct and deliberate attacks on schools, the military use of schools,

Annat som kan utgöra ett hinder för äldre kvinnor att söka hjälp eller berätta om våldet är att samhället kommer således tendera att behandla den våldsutsatta utifrån

Trots att det även inom CTC talas om orsaker – och även om satsningen beskrivs som en preventiv verksamhet (alltså som en verksamhet syftande till att förebygga problem),

While organizational scholars have for some time focused on the natural world as a context for business activities (see e.g. Bansal and Hoffman, 2012), far fewer have sought