Chapter 1: Problem 10
The probabilities of identical sequences of amino acids. You are comparing protein amino acid sequences for homology. You have a 20-letter alphabet (20 different amino acids). Each sequence is a string \(n\) letters in length. You have one test sequence and \(s\) different data base sequences. You may find any one of the 20 different amino acids at any position in the sequence, independent of what you find at any other position. Let \(p\) represent the probability that there will be a 'match' at a given position in the two sequences. (a) In terms of \(s, p\), and \(n\), how many of the \(s\) sequences will be perfect matches (identical residues at every position)? (b) How many of the \(s\) comparisons (of the test sequence against each database sequence) will have exactly one mismatch at any position in the sequences?
Short Answer
Step by step solution
Key Concepts
These are the key concepts you need to understand to accurately answer the question.