Next: Likelihood fit
Up: Simultaneous Template-Based Top Quark
Previous: Backgrounds
Probability density functions for
-
and
-
at every point in the
grid and for backgrounds are derived using a Kernel Density Estimate (KDE) approach. KDE is a non-parametric method for forming density estimates that can easily be generalized to more than one dimension, making it useful for this analysis, which has two observables per event. The probability for an event with observable (
) is given by the linear sum of contributions from all entries in the MC:
In the above equation,
is the probability to observe
given some MC sample with known mass and JES (or the background). The MC has
entries, with observables
. The kernel function
is a normalized function that adds less probability to a measurement at
as its distance from
increases. The smoothing parameter
(sometimes called the bandwidth) is a number that determines the width of the kernel. Larger values of
smooth out the contribution to the density estimate and give more weight at
farther from
. Smaller values of
provide less bias to the density estimate, but are more sensitive to statistical fluctuations. We use the Epanechnikov kernel, defined as:
so that only events with
contribute to
. We use an adaptive KDE method in which the value of
is replaced by
in that the amount of smoothing around
depends on the value of
. In the peak of the distributions, where statistics are high, we use small values of
to capture as much shape information as possible. In the tails of the distribution, where there are few events and the density estimates are sensitive to statistical fluctuations, a larger value of
is used. The overall scale of
is set by the number of entries in the MC sample (larger smoothing is used when fewer events are available), and by the RMS of the distribution (larger smoothing is used for wider distributions).
We extend KDE to two dimensions by multiplying the two kernels together:
Figures 5 and 6 show the 2d density estimates for Lepton+Jets and Dilepton signal events. Figures 7 and 8 show the 2d estimates for background events. The backgrounds density estimates are derived separately for the individual backgrounds, taking into account the sample size and width, and are then combined with the appropriate weights.
Figure 5:
Full 2d density estimates for input mass of
and
for Lepton+Jets 1-tag events (left) and 2-tag events (right).
|
|
Figure 6:
Full 2d density estimates for input mass of
and
for Dilepton untagged events (left) and tagged events (right).
|
|
Figure 7:
Full 2d density estimates for the combined background for Lepton+Jets 1-tag events (left) and 2-tag events (right).
|
|
Figure 8:
Full 2d density estimates for the combined background for Dilepton untagged events (left) and tagged events (right).
|
|
Next: Likelihood fit
Up: Simultaneous Template-Based Top Quark
Previous: Backgrounds
Hyunsu Lee
2009-02-06