Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Show that Huffman codes are optimal in the sense that they represent a string of symbols using the fewest bits among all binary prefix codes.

Short Answer

Expert verified

In the solution below it is proved that\({{\bf{H}}_{\bf{k}}}\) is optimal.

Step by step solution

Achieve better grades quicker with Premium

  • Unlimited AI interaction
  • Study offline
  • Say goodbye to ads
  • Export flashcards

Over 22 million students worldwide already upgrade their learning with Vaia!

01

Huffman coding

It could be a procedure for constructing an optimal code for a group of symbols where the frequencies of those symbols are given.

02

Step2:One shall prove the optimality of Huffman's coding algorithm.

One proves this by the mathematical induction method.

For \({\bf{n = 2}}\), that is if there are only 2 symbols Huffman's code use \({\bf{0}}\) for one symbol and \({\bf{1}}\) for the second one. That is, it uses just one bit and then it’s optimal.

Therefore, assume the result is true for some k symbols.

That is, the inductive hypothesis is that Huffman codes are optimal for \({\bf{K}}\)symbols.

03

Now we shall prove this is true for \({\bf{k + 1}}\) symbols

Let us consider one has\({\bf{k + 1}}\) symbols. Since the tree created could be a full tree, the leaves at the bottom-most level are always paired.

Let \({\bf{a}}\) and \({\bf{b}}\)be two symbols of the smallest frequencies, \({{\bf{f}}_{\bf{a}}}\) and\({{\bf{f}}_{\bf{b}}}\). If they do not come in pairs together of some binary prefix code tree, then one can interchange the symbols on some leaves with \({\bf{a}}\) and \({\bf{b}}\) so that now they’re siblings at the underside most level.

Since moving a more frequently occurring symbol closer to the root won’t affect the efficiency, one can consider the code is a minimum efficient because the old one. Therefore,one can conclude that a and b are siblings in every most efficient tree.

Now it consider the symbols \({\bf{a}}\) and \({\bf{b}}\) as one new symbol, say \({\bf{c}}\), having a frequency equal to the frequencies of \({\bf{a}}\)and\({\bf{b}}\). Two symbols got replaced with one new symbol therefore the number of symbols we are left with is \({\bf{k}}\).

Now obtain the optimal binary prefix code (by inductive hypothesis it is optimal)on \({\bf{k}}\) symbols via the Huffman algorithm and we will call it as \({{\bf{H}}_{\bf{k}}}\). Note that this is equivalent to applying the Huffman algorithm to \({\bf{k + 1}}\) symbols and obtaining the code \({{\bf{H}}_{{\bf{k + 1}}}}\).

04

One has to show this \({{\bf{H}}_{{\bf{k + 1}}}}\) is optimal for \({\bf{k + 1}}\) symbols

The average numbers of bits required to encode a symbol in \({{\bf{H}}_{\bf{k}}}\) and in \({{\bf{H}}_{{\bf{k + 1}}}}\) are the same except for the symbols \({\bf{a, b, and c}}\). This difference is \({{\bf{f}}_{\bf{a}}}{\bf{ + }}{{\bf{f}}_{\bf{b}}}\) One extra bit would be needed for \({\bf{a}}\) and \({\bf{b}}\)and all other cord words are the same. Now we assume \({{\bf{H}}_{{\bf{k + 1}}}}\) 1 is not optimal and another code \({\bf{H}}{{\bf{'}}_{{\bf{k + 1}}}}\), is a better code. That is, it encodes with a lesser average amount of bits per symbol. But one can assume that \({\bf{a}}\) and \({\bf{b}}\) are siblings at the bottom-most level in \({\bf{H}}{{\bf{'}}_{{\bf{k + 1}}}}\) tree also. Consider the code \({\bf{H}}{{\bf{'}}_{\bf{k}}}\) for \({\bf{k}}\) symbols obtained by replacing \({\bf{a}}\) and \({\bf{b}}\) with their parent. It has got an average number of bits equal to the average for \({\bf{H}}{{\bf{'}}_{{\bf{k + 1}}}}\) minus \({{\bf{f}}_{\bf{a}}}{\bf{ + }}{{\bf{f}}_{\bf{b}}}\).

This contradicts the inductive hypothesis that\({{\bf{H}}_{\bf{k}}}\)was optimal.

Hence\({{\bf{H}}_{\bf{k}}}\) is optimal.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Study anywhere. Anytime. Across all devices.

Sign-up for free