Chapter 21: Problem 13
Open reading frames in \(E.\ coli\). In this problem, we will search the \(E.\ coli\) genome for open reading frames (ORFs). The actual genome sequence of \(E.\ coli\) is available on the book's website.
(a) Write a program that scans the DNA sequence and records the distance between start and stop codons in each of the three reading frames on the forward strand. You may skip the calculation for the reverse strand. You can find an example of this code implemented in Matlab on the book's website.
(b) Plot the distribution of ORF lengths \(L\) and compare it with that expected for random DNA, calculated in Problem 4.7.
(c) Estimate a cut-off value \(L_{\text{cut}}\) above which the ORFs are statistically significant, that is, the number of observed ORFs with \(L > L_{\text{cut}}\) is much greater than expected by chance.
(Problem courtesy of Sharad Ramanathan.)
Short Answer
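The result of Problem 4.7 is not reproduced on this page, so what follows is only a sketch of the random-DNA expectation needed for parts (b) and (c). It assumes that all four bases are equally likely and independent, so that each codon is a stop codon with probability \(3/64\). Here \(L\) counts codons from the start codon up to, but not including, the stop codon, and \(N_{\text{tot}}\) (a symbol introduced here for illustration) is the total number of ORFs found.

\[
P(\text{length} > L) = \left(\frac{61}{64}\right)^{L},
\qquad
N_{\text{random}}(>L) \approx N_{\text{tot}} \left(\frac{61}{64}\right)^{L}
\]

\[
N_{\text{random}}(>L_{\text{cut}}) \approx 1
\quad\Longrightarrow\quad
L_{\text{cut}} \approx \frac{\ln N_{\text{tot}}}{\ln(64/61)}
\]

Under these assumptions the random-DNA distribution decays exponentially with a characteristic length of \(1/\ln(64/61) \approx 21\) codons, so observed ORFs far beyond \(L_{\text{cut}}\) are unlikely to arise by chance. The real \(E.\ coli\) base composition is not exactly uniform, so the cut-off is an estimate rather than a sharp threshold.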
Step by step solution
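Part (a) can be sketched in Python as below. This is a minimal illustration, not the book's Matlab code. It assumes ATG as the only start codon, TAA, TAG and TGA as stop codons, and measures length in codons from the start codon up to (not including) the stop codon. The function names are hypothetical, and the genome string is assumed to have been loaded already from the file on the book's website.

```python
from collections import Counter

START_CODON = "ATG"                      # assumption: ATG is the only start codon
STOP_CODONS = {"TAA", "TAG", "TGA"}

def orf_lengths(seq, frame):
    """ORF lengths (in codons) for one forward reading frame (0, 1 or 2).

    Length counts the start codon but not the stop codon. After a start
    codon is found, later ATGs are ignored until the next in-frame stop.
    """
    seq = seq.upper()
    lengths = []
    start = None                         # index of the currently open start codon
    for i in range(frame, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if start is None:
            if codon == START_CODON:
                start = i
        elif codon in STOP_CODONS:
            lengths.append((i - start) // 3)
            start = None
    return lengths

def all_forward_orf_lengths(seq):
    """Pool the ORF lengths from the three forward reading frames."""
    return [n for frame in range(3) for n in orf_lengths(seq, frame)]

def length_histogram(lengths):
    """Map each ORF length to the number of ORFs of that length (part b)."""
    return Counter(lengths)

def expected_fraction_longer_than(L, p_stop=3 / 64):
    """Random-DNA fraction of ORFs longer than L codons (uniform bases assumed)."""
    return (1 - p_stop) ** L
```

For parts (b) and (c), one option is to plot `length_histogram(all_forward_orf_lengths(genome))` on a logarithmic vertical axis and overlay the total ORF count multiplied by `expected_fraction_longer_than(L)`. The length at which the observed counts clearly rise above the random curve gives an estimate of \(L_{\text{cut}}\).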
Key Concepts
These are the key concepts you need to understand to accurately answer the question.