Chapter 8: Q21E (page 281)
Sequencing by hybridization. One experimental procedure for identifying a new DNA sequence repeatedly probes it to determine which (substrings of length ) it contains. Based on these, the full sequence must then be reconstructed.Let’s now formulate this as a combinatorial problem. For any string x (the DNA sequence), let denote the multiset of all of its localid="1658905204605" . In particular, localid="1658904556515" contains exactly elements.The reconstruction problem is now easy to state: given a multiset of strings, find a string x such that is exactly this multiset.
Show that the reconstruction problem reduces to RUDRATA PATH. (Hint: Construct a directed graph with one node for each localid="1658904858295" , and with an edge from a to b if the last characters of match the first localid="1658905395287" characters of b.)
But in fact, there is much better news. Show that the same problem also reduces to EULER PATH. (Hint: This time, use one directed edge for each .)