The mutual information function is used to describe the autocorrelation of amino acids in proteins. We find two interesting phenomena: (1) for any given large protein, the mutual information function I(k) is almost constant, where k is the gap length; (2) for any two proteins with similar sequences, the mutual information values are nearly the same. As a consequence, the mutual information of a protein may be used as a characteristic for sequence comparison.
Shi Feng, Huang Jing, Li Yuan-xiang, Zhou Huai-bei
School of Mathematics and Statistics, Wuhan University, Wuhan 430072, Hubei, China
Advanced Research Center for Science & Technology, Wuhan University, Wuhan 430072, Hubei, China
State Key Laboratory of Software Engineering, Wuhan University, Wuhan 430072, Hubei, China
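As an illustration of the mutual information function I(k) mentioned in the abstract above, the following is a minimal sketch, not the authors' implementation. It assumes that the gap k means the distance between positions i and i+k, that probabilities are estimated from observed pair frequencies, and that logarithms are taken base 2; the abstract specifies none of these. The function name `mutual_information` is hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information(seq, k):
    """Estimate I(k) for residues at positions i and i+k of seq.

    Assumption: I(k) = sum over (a, b) of p_ab * log2(p_ab / (p_a * p_b)),
    where p_ab is the observed frequency of the pair (a, b) at distance k,
    and p_a, p_b are the marginal frequencies of the same set of pairs.
    """
    if k < 1 or k >= len(seq):
        raise ValueError("k must satisfy 1 <= k < len(seq)")

    # All residue pairs separated by distance k.
    pairs = [(seq[i], seq[i + k]) for i in range(len(seq) - k)]
    n = len(pairs)

    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)   # marginal of the first residue
    right = Counter(b for _, b in pairs)  # marginal of the second residue

    mi = 0.0
    for (a, b), count in joint.items():
        # (count/n) / ((left/n) * (right/n)) simplifies to count*n / (left*right).
        mi += (count / n) * log2(count * n / (left[a] * right[b]))
    return mi


def mutual_information_profile(seq, max_k):
    """Return [I(1), ..., I(max_k)] so the dependence on k can be inspected."""
    return [mutual_information(seq, k) for k in range(1, max_k + 1)]
```

Under these assumptions, a profile that stays roughly flat over k would correspond to observation (1), and comparing the profiles of two proteins would correspond to observation (2). For short sequences the frequency estimates are noisy, so the sketch is only meaningful for long proteins.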
A new approach based on the concept of the diversity increment is applied to reconstruct a phylogeny. The phylogeny of the Eutherian orders is reconstructed from concatenated H-stranded amino acid sequences, and the result is consistent with the commonly accepted phylogeny of the Eutherians.
Shi Feng, Li Na-na, Li Yuan-xiang, Zhou Huai-bei
School of Mathematics and Statistics, Wuhan University, Wuhan 430072, Hubei, China
Advanced Research Center for Science and Technology, Wuhan University, Wuhan 430072, Hubei, China
State Key Laboratory of Software Engineering, Wuhan University, Wuhan 430072, Hubei, China
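The abstract above does not define the diversity increment. The sketch below assumes a definition commonly used in the literature: the diversity of a set of counts n_1, ..., n_s with total N is D = N log N - sum(n_i log n_i), and the increment between two sources X and Y is D(X+Y) - D(X) - D(Y). Applying it to amino acid composition counts, using base-2 logarithms, and the function names are all assumptions made for illustration; the authors' actual features and tree-building procedure are not stated here.

```python
from collections import Counter
from math import log2


def diversity(counts):
    """Diversity D = N*log2(N) - sum(n_i*log2(n_i)) of a Counter of counts."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return total * log2(total) - sum(
        c * log2(c) for c in counts.values() if c > 0
    )


def diversity_increment(seq_x, seq_y):
    """Increment of diversity between two sequences (assumed definition).

    Uses single-residue composition counts as the features; it is 0 when
    the two compositions are proportional and grows as they diverge.
    """
    cx, cy = Counter(seq_x), Counter(seq_y)
    return diversity(cx + cy) - diversity(cx) - diversity(cy)
```

Under this assumed definition, the pairwise increments between concatenated sequences could serve as a dissimilarity measure between taxa, which a standard distance-based method could then turn into a tree.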