Some details on Subgaussian and Hoeffdings
When I started to read multi armed bandit (a.k.a. sequential treatment allocation) literature, had to deal with concentration bounds, the key idea is essentially bounding the tail probabilities of the random variables. Since I myself sometimes forget many details, I decided to write someting about it... hopefully by this I will learn many things and this helps people who has struggled like me.
This is the first post of the series of posts on bandits that I will try to write. Below I do a short recap of sub-Gaussian random variables. There are different but equivalent ways to define a sub-Gaussian random variable, I will follow following definition
Definition 1 [Sub-Gaussian random variable]: We say a random variable is a sub-Gaussian random variable with sub-Gaussian parameter
Now let's quickly state some important properties of sub-Gaussian random variables,
Some Properties of sub-Gaussian Random Variables
Using Markov's inequality, it is possible to show that for any
If
If
Many random variables are sub-Gaussian, for example,
The idea of the Hoeffding's Bound is to get a tail bound for the average / sum of the sub-Gaussian random variables, i.e., we are looking for
or more simply,
In this case the key step is to first derive the sub-Gaussian parameter of
then applying Chernoff bound is easy and we get,
So the idea is to know
We write the general proposition below,
Proposition [Hoeffding's bound]: Suppose we have a sequence
This is easy to show using the sub-Gaussian rules we have outlined above, here
Usually Hoeffding's bound is written for the bounded random variables, here we just wrote the proposition with general independent sub-Gaussian random variables. One thing to note is, the fast way to get the Hoeffding's bound is to apply sub-Gaussain rules and calculate the
Example 1. Let
Example 2. Let
This is possibly the most well known Hoeffding's bound. We can also let
for
This is also used in many places, where for simplicity we assume the random variables are bounded between