Wald's equation can also be proved by using renewal reward processes. Let \(N\)
be a stopping time for the sequence of independent and identically distributed
random variables \(X_{i}, i \geqslant 1\)
(a) Let \(N_{1}=N\). Argue that the sequence of random variables \(X_{N_{1}+1},
X_{N_{1}+2}, \ldots\) is independent of \(X_{1}, \ldots, X_{N}\) and has the same
distribution as the original sequence \(X_{i}, i \geqslant 1\)
Now treat \(X_{N_{1}+1}, X_{N_{1}+2}, \ldots\) as a new sequence, and define a
stopping time
\(\mathrm{N}_{2}\) for this sequence that is defined exactly as \(\mathrm{N}_{1}\)
is on the original sequence. (For instance, if \(N_{1}=\min \left(n:
X_{n}>0\right\\}\), then \(\left.N_{2}=\min \left[n: X_{N_{1}+n}>0\right\\}
.\right)\)
Similarly, define a stopping time \(N_{3}\) on the sequence \(X_{N_{1}+N_{2}+1},
X_{N_{1}+N_{2}+2}, \ldots\) that is identically defined on this sequence as
\(N_{1}\) is on the original sequence, and so on.
(b) Is the reward process in which \(X_{i}\) is the reward earned during period
\(i\) a renewal Ireward process? If so, what is the length of the successive
cycles?
(c) Derive an expression for the average reward per unit time.
(d) Use the strong law of large numbers to derive a second expression for the
average reward per unit time.
(e) Conclude Wald's equation.