$$ \newcommand{\arginf}{\mathrm{arginf}} \newcommand{\argmin}{\mathrm{argmin}} \newcommand{\argmax}{\mathrm{argmax}} \newcommand{\asconv}[1]{\stackrel{#1-a.s.}{\rightarrow}} \newcommand{\Aset}{\mathsf{A}} \newcommand{\b}[1]{{\mathbf{#1}}} \newcommand{\ball}[1]{\mathsf{B}(#1)} \newcommand{\bbQ}{{\mathbb Q}} \newcommand{\bproof}{\textbf{Proof :}\quad} \newcommand{\bmuf}[2]{b_{#1,#2}} \newcommand{\card}{\mathrm{card}} \newcommand{\chunk}[3]{{#1}_{#2:#3}} \newcommand{\condtrans}[3]{p_{#1}(#2|#3)} \newcommand{\convprob}[1]{\stackrel{#1-\text{prob}}{\rightarrow}} \newcommand{\Cov}{\mathbb{C}\mathrm{ov}} \newcommand{\cro}[1]{\langle #1 \rangle} \newcommand{\CPE}[2]{\PE\lr{#1| #2}} \renewcommand{\det}{\mathrm{det}} \newcommand{\dimlabel}{\mathsf{m}} \newcommand{\dimU}{\mathsf{q}} \newcommand{\dimX}{\mathsf{d}} \newcommand{\dimY}{\mathsf{p}} \newcommand{\dlim}{\Rightarrow} \newcommand{\e}[1]{{\left\lfloor #1 \right\rfloor}} \newcommand{\eproof}{\quad \Box} \newcommand{\eremark}{</WRAP>} \newcommand{\eqdef}{:=} \newcommand{\eqlaw}{\stackrel{\mathcal{L}}{=}} \newcommand{\eqsp}{\;} \newcommand{\Eset}{ {\mathsf E}} \newcommand{\esssup}{\mathrm{essup}} \newcommand{\fr}[1]{{\left\langle #1 \right\rangle}} \newcommand{\falph}{f} \renewcommand{\geq}{\geqslant} \newcommand{\hchi}{\hat \chi} \newcommand{\Hset}{\mathsf{H}} \newcommand{\Id}{\mathrm{Id}} \newcommand{\img}{\text{Im}} \newcommand{\indi}[1]{\mathbf{1}_{#1}} \newcommand{\indiacc}[1]{\mathbf{1}_{\{#1\}}} \newcommand{\indin}[1]{\mathbf{1}\{#1\}} \newcommand{\itemm}{\quad \quad \blacktriangleright \;} \newcommand{\jointtrans}[3]{p_{#1}(#2,#3)} \newcommand{\ker}{\text{Ker}} \newcommand{\klbck}[2]{\mathrm{K}\lr{#1||#2}} \newcommand{\law}{\mathcal{L}} \newcommand{\labelinit}{\pi} \newcommand{\labelkernel}{Q} \renewcommand{\leq}{\leqslant} \newcommand{\lone}{\mathsf{L}_1} \newcommand{\lp}[1]{\mathsf{L}_{{#1}}} \newcommand{\lrav}[1]{\left|#1 \right|} \newcommand{\lr}[1]{\left(#1 \right)} \newcommand{\lrb}[1]{\left[#1 \right]} \newcommand{\lrc}[1]{\left\{#1 \right\}} \newcommand{\lrcb}[1]{\left\{#1 \right\}} \newcommand{\ltwo}[1]{\PE^{1/2}\lrb{\lrcb{#1}^2}} \newcommand{\Ltwo}{\mathrm{L}^2} \newcommand{\mc}[1]{\mathcal{#1}} \newcommand{\mcbb}{\mathcal B} \newcommand{\mcf}{\mathcal{F}} \newcommand{\meas}[1]{\mathrm{M}_{#1}} \newcommand{\norm}[1]{\left\|#1\right\|} \newcommand{\normmat}[1]{{\left\vert\kern-0.25ex\left\vert\kern-0.25ex\left\vert #1 \right\vert\kern-0.25ex\right\vert\kern-0.25ex\right\vert}} \newcommand{\nset}{\mathbb N} \newcommand{\N}{\mathcal{N}} \newcommand{\one}{\mathsf{1}} \newcommand{\PE}{\mathbb E} \newcommand{\pminfty}{_{-\infty}^\infty} \newcommand{\PP}{\mathbb P} \newcommand{\projorth}[1]{\mathsf{P}^\perp_{#1}} \newcommand{\Psif}{\Psi_f} \newcommand{\pscal}[2]{\langle #1,#2\rangle} \newcommand{\pscal}[2]{\langle #1,#2\rangle} \newcommand{\psconv}{\stackrel{\PP-a.s.}{\rightarrow}} \newcommand{\qset}{\mathbb Q} \newcommand{\revcondtrans}[3]{q_{#1}(#2|#3)} \newcommand{\rmd}{\mathrm d} \newcommand{\rme}{\mathrm e} \newcommand{\rmi}{\mathrm i} \newcommand{\Rset}{\mathbb{R}} \newcommand{\rset}{\mathbb{R}} \newcommand{\rti}{\sigma} \newcommand{\section}[1]{==== #1 ====} \newcommand{\seq}[2]{\lrc{#1\eqsp: \eqsp #2}} \newcommand{\set}[2]{\lrc{#1\eqsp: \eqsp #2}} \newcommand{\sg}{\mathrm{sgn}} \newcommand{\supnorm}[1]{\left\|#1\right\|_{\infty}} \newcommand{\thv}{{\theta_\star}} \newcommand{\tmu}{ {\tilde{\mu}}} \newcommand{\Tset}{ {\mathsf{T}}} \newcommand{\Tsigma}{ {\mathcal{T}}} \newcommand{\ttheta}{{\tilde \theta}} \newcommand{\tv}[1]{\left\|#1\right\|_{\mathrm{TV}}} \newcommand{\unif}{\mathrm{Unif}} \newcommand{\weaklim}[1]{\stackrel{\mathcal{L}_{#1}}{\rightsquigarrow}} \newcommand{\Xset}{{\mathsf X}} \newcommand{\Xsigma}{\mathcal X} \newcommand{\Yset}{{\mathsf Y}} \newcommand{\Ysigma}{\mathcal Y} \newcommand{\Var}{\mathbb{V}\mathrm{ar}} \newcommand{\zset}{\mathbb{Z}} \newcommand{\Zset}{\mathsf{Z}} $$

2023/11/14 18:37

$g$-lemma

Let $X$ be a random variable non-negative a.s. and $g \colon \rset_+ \rightarrow \rset_+$ an increasing differentiable function such that $g(0)=0$. Then, \begin{equation*} \PE\lrb{g(X)} = \int_{\rset_+} g'(x) \PP\lr{X \geq x} \rmd x \in \rset_+ \cup \lrc{+\infty}. \end{equation*}

Click to display ⇲

Click to hide ⇱

Write, using that for $x\in \rset_+$, $g'(x) 1_{X\geq x}\geq 0$, \begin{equation*} \int_{\rset_+} g'(x) \underbrace{\PP\lr{X \geq x}}_{\PE\lrb{1_{X\geq x}}} \rmd x = \int_{\rset_+} \PE \lrb{g'(x) 1_{X\geq x}} \rmd x = \PE \lrb{\int_{\rset_+} g'(x) 1_{X\geq x} \rmd x} = \PE \lrb{\int_{0}^X g'(x)\rmd x} = \PE \lrb{g(X) - \underbrace{g(0)}_{=0}}. \end{equation*}

Convexity inequality

Let $p \geq 1$, $n \in \nset$ and $\lr{X_i}_{1\leq i \leq n}$ real-valued random variables. Then, by convexity \begin{equation*} \PE \lrb{\lrav{\sum_{i=1}^n X_i}^p} \leq n^{p-1} \sum_{i=1}^n \PE \lrb{\lrav{X_i}^p}. \end{equation*}

Marcinkiewicz–Zygmund inequality

Let $p \geq 2$, $n \in \nset$ and $\lr{X_i}_{1\leq i \leq n}$ centered independent real-valued random variables in $L^p$. Then, there exists a universal constant $C_p$ depending only on $p$ such that \begin{equation*} \PE \lrb{\lrav{\sum_{i=1}^n X_i}^p} \leq C_p \eqsp n^{p/2-1} \sum_{i=1}^n \PE \lrb{\lrav{X_i}^p}. \end{equation*}

Proof

Set $S_n \eqdef \sum_{i=1}^n X_i$. Let $x > 0$. We first establish an upper-bound for $\PP \lr{\lrav{S_n} \geq x}$.

Let $y > 0$ and define for all $i \in [1;n]$, $Z_i \eqdef X_i 1_{X_i < y}$ and $T_n \eqdef \sum_{i=1}^n Z_i$. Then, \begin{equation} \label{eq:s_n_t_n} \PP \lr{S_n \geq x} \leq \PP \lr{T_n \geq x} + \PP \lr{S_n \neq T_n} \leq \PP \lr{T_n \geq x} + \sum_{i=1}^n \PP \lr{X_i \geq y}. \end{equation} Let $h > 0$. The Chernoff bound and the independence of the $\lr{Z_i}_{1\leq i \leq n}$ by independence of the $\lr{X_i}_{1\leq i \leq n}$ both provide \begin{equation} \label{eq:t_n_only} \PP \lr{T_n \geq x} \leq e^{-hx} \PE\lrb{e^{h T_n}} = e^{-hx} \prod_{i=1}^n \PE\lrb{e^{h Z_i}}. \end{equation} Using the Taylor formula with the exponential function yields that the function defined on $\rset$ by $s \mapsto \frac {e^s-1-s} {s^2} = \frac 1 {s^2} \int_0^s (u-s)e^u \rmd u= \int_0^1 (u-1)e^{su} \rmd u$ is increasing, and together with $Z_i \leq y$ for all $i \in [1;n]$, we deduce \begin{equation*} e^{h Z_i} \leq 1 + h Z_i + Z_i^2 \frac {e^{hy}-1-y} {y^2}. \end{equation*} The fact that $y>0$ implies $Z_i \leq X_i$ and thus $\PE \lrb{Z_i} \leq \PE \lrb{X_i} = 0$. Combining with $\PE \lrb{Z_i^2} = \PE \lrb{X_i^2 1_{X_i < y}} \leq \PE \lrb{X_i^2}$ yields for all $i \in [1;n]$, \begin{equation*} \PE \lrb{e^{h Z_i}} \leq 1 + \PE \lrb{X_i^2} \frac {e^{hy}-1-y} {y^2}. \end{equation*} Together with \eqref{eq:s_n_t_n} and \eqref{eq:t_n_only} this provides \begin{equation} \label{eq:s_n_step_1} \PP \lr{S_n \geq x} \leq \sum_{i=1}^n \PP \lr{X_i \geq y} + \exp \lrb{-hx + B_n \frac {e^{hy}-1-y} {y^2}}, \end{equation} where $B_n \eqdef \sum_{i=1}^n \PE \lrb{X_i^2} < \infty$. Note that $B_n = 0$ implies that the $\lr{X_i}_{1 \leq i \leq n}$ are all equal to zero a.s., a situation where the inequality is trivially true, and we can thus assume $B_n > 0$. The argument of the exponential in \eqref{eq:s_n_step_1} is then minimized in $h$ at $h_{\min} \eqdef \frac 1 y \log \lr{1 + \frac {xy} {B_n}}$, with \begin{equation*} -h_{\min}x + B_n \frac {e^{h_{\min}y}-1-y} {y^2} = - \frac x y \log \lr{1 + \frac {xy} {B_n}} + \frac {B_n} {y^2} \lrb{\frac {xy} {B_n} \underbrace{- \log \lr{1 + \frac {xy} {B_n}}}_{\leq 0}} \leq \frac x y - \frac x y \log \lr{1 + \frac {xy} {B_n}}. \end{equation*}

Click to display ⇲

Click to hide ⇱

The function defined on $\rset_+^*$ by $h \mapsto -hx + B_n \frac {e^{hy}-1-y} {y^2}$ is continuous, diverges to infinity when $h \rightarrow +\infty$, and its derivative $h \mapsto -x + \frac {B_n} y \lr{e^{hy}-1}$ has a unique zero $h_{\min}$ on $\rset_+^*$ defined by $e^{h_{\min}y}-1 = \frac {xy} {B_n}$.

With $y = \frac x r$ where $r > 0$, combining with \eqref{eq:s_n_step_1} yields \begin{equation*} \PP \lr{S_n \geq x} \leq \sum_{i=1}^n \PP \lr{X_i \geq \frac x r} + e^r \lr{1 + \frac {x^2} {r B_n}}^{-r}. \end{equation*} Considering $\lr{-X_i}_{1 \leq i \leq n}$ provides a similar inequality for $-S_n$, and using the fact that $x > 0$ we deduce \begin{equation*} \PP \lr{\lrav{S_n} \geq x} = \PP \lr{S_n \geq x} + \PP \lr{-S_n \geq x} \leq \sum_{i=1}^n \PP \lr{\lrav{X_i} \geq \frac x r}+ 2 e^r \lr{1 + \frac {x^2} {r B_n}}^{-r}. \end{equation*}

Using the $g$-lemma with $g \colon x \mapsto x^p$ we deduce \begin{align*} \PE \lrb{\lrav{S_n}^p} = p \int_{\rset_+} x^{p-1} \PP\lr{\lrav{S_n} \geq x} \rmd x &\leq \sum_{i=1}^n p \int_{\rset_+} x^{p-1} \PP\lr{\lrav{X_i} \geq x} \rmd x + 2p e^r \int_{\rset_+} \frac {x^{p-1}} {\lr{1 + \frac {x^2} {r B_n}}^r} \rmd x \\ &= r^p \sum_{i=1}^n \PE \lrb{\lrav{X_i}^p} + 2p e^r B_n^{p/2} \int_0^{+\infty} \frac {u^{p/2-1}} {\lr{1+\frac u r}^r} \rmd u \quad \in \rset_+ \cup \lrc{+\infty}, \end{align*} with the change of variables $u = \frac {x^2} {B_n}$. The integral is finite iff $r>p/2$, and we can choose $r = p$ to deduce the Rosenthal inequality: \begin{equation} \label{eq:rosenthal} \PE \lrb{\lrav{S_n}^p} \leq c_p \lr{\sum_{i=1}^n \PE \lrb{\lrav{X_i}^p} + \lr{\sum_{i=1}^n \PE \lrb{X_i^2}}^{p/2}}, \end{equation} where $c_p \eqdef \max(p^p, 2p e^p \int_{\rset_+} \frac {u^{p/2-1}} {\lr{1+\frac u p}^p} \rmd u)$ only depends on $p$. Finally, by Jensen inequality as $p \geq 2$, and by convexity, \begin{equation*} \lr{\frac 1 n \sum_{i=1}^n \PE \lrb{X_i^2}}^{p/2} = \PE \lrb{\frac 1 n \sum_{i=1}^n X_i^2}^{p/2} \leq \PE \lrb{\lr{\frac 1 n \sum_{i=1}^n X_i^2}^{p/2}} \leq \PE \lrb{\frac 1 n \sum_{i=1}^n \lrav{X_i}^p}. \end{equation*}

which together with the Rosenthal inequality \eqref{eq:rosenthal} yields the Marcinkiewicz–Zygmund inequality: \begin{equation*} \PE \lrb{\lrav{\sum_{i=1}^n X_i}^p} \leq C_p \eqsp n^{p/2-1} \sum_{i=1}^n \PE \lrb{\lrav{X_i}^p}, \end{equation*} where $C_p \eqdef 2 c_p$.

Generalized Marcinkiewicz–Zygmund inequality

Let $d \in \nset^*$ and $\norm{\cdot}$ a norm on $\rset^d$. Let $n \in \nset^*$ and $\lr{X_i}_{1 \leq i \leq n}$ independent random variables of $L^p(\rset^d)$ with $2 \leq p < \infty$. Then, \begin{equation*} \mathbb{E}\lrb{\norm{\sum_{i=1}^n \lr{X_i-\mathbb{E}\lrb{X_i}} }^p} \leq C_{p, \norm{}} \times n^{p/2-1} \times \sum_{i=1}^n \mathbb{E}\lrb{\norm{X_i}^p} , \end{equation*} where $C_{p, \norm{}}$ is a constant depending only on $p$ and on the choice of the norm $\norm{\cdot}$.

Click to display ⇲

Click to hide ⇱

First, notice that the result only needs to be proved for centered random variables. Indeed, by convexity, for any random variable $X$, \begin{equation*} \mathbb{E}\lrb{\norm{X-\mathbb{E}\lrb{X}}^p} \leq 2^{p-1} \mathbb{E}\lrb{\norm{X}^p + \norm{\mathbb{E}\lrb{X}}^p} \leq 2^p \mathbb{E}\lrb{\norm{X}^p} . \end{equation*} Moreover, by equivalence of norms in finite dimension, the result only needs to be proved for the norm $\norm{\cdot}_p$ on $\rset^d$. Using the Marcinkiewicz–Zygmund inequality in dimension 1 provides \begin{align*} \mathbb{E}\lrb{\norm{\sum_{i=1}^n X_i }_p^p} &= \mathbb{E}\lrb{\sum_{j=1}^d \lrav{ \sum_{i=1}^n X_i(j) }^p} \\ &= \sum_{j=1}^d \mathbb{E}\lrb{\lrav{ \sum_{i=1}^n X_i(j) }^p} \\ &\leq \sum_{j=1}^d C_p \times n^{p/2-1} \times \sum_{i=1}^n \mathbb{E}\lrb{\lrav{X_i(j)}^p} \\ &= C_p \times n^{p/2-1} \times \sum_{i=1}^n \mathbb{E}\lrb{\sum_{j=1}^d \lrav{X_i(j)}^p} \\ &= C_p \times n^{p/2-1} \times \sum_{i=1}^n \mathbb{E}\lrb{\norm{X_i}_p^p} \eqsp. \end{align*}

Welcome to Randal Douc's wiki

Sidebar

Wiki

Wiki

Courses and public working groups

Courses and public working groups

Private Working Groups

Private Working Groups

Personal Notes

Personal Notes

Réponses

Réponses

Miscellanous

Miscellanous

Table of Contents

$g$-lemma

Convexity inequality

Marcinkiewicz–Zygmund inequality

Proof

Generalized Marcinkiewicz–Zygmund inequality

Welcome to Randal Douc's wiki

User Tools

Site Tools

Sidebar

Wiki

Wiki

Courses and public working groups

Courses and public working groups

Private Working Groups

Private Working Groups

Personal Notes

Personal Notes

Réponses

Réponses

Miscellanous

Miscellanous

Table of Contents

$g$-lemma

Convexity inequality

Marcinkiewicz–Zygmund inequality

Proof

Generalized Marcinkiewicz–Zygmund inequality

Page Tools