Traditional algorithms for hidden Markov model decoding seek to maximizeeither the probability of a state path or the number of positions of a sequenceassigned to the correct state. These algorithms provide only a single answer andin practice do not produce good results. The most mathematically sound of thesealgorithms is the Viterbi algorithm, which returns the state path that has thehighest probability of generating a given sequence. Here, we explore an extension tothis algorithm that allows us to find the k paths of highest probabilities. The naiveimplementation of k best Viterbi paths is highly space-inefficient, so we adapt recentwork on the Viterbi algorithm for a single path to this domain. Out algorithm usesmuch less memory than the naive approach. We then investigate the usefulnessof the k best Viterbi paths on the example of transmembrane protein topologyprediction. For membrane proteins, even simple path combination algorithms givegood explanations, and if we look at the paths we are combining, we can give asense of confidence in the explanation as well. For proteins with two topologies,the k best paths can give insight into both correct explanations of a sequence, afeature lacking from traditional algorithms in this domain.
【 预 览 】
附件列表
Files
Size
Format
View
The k-best paths in Hidden Markov Models. Algorithms and Applications to TransmembraneProtein Topology Recognition.