BMC Bioinformatics | |
Phylogenetic analysis of Harmonin homology domains | |
Julien Guglielmini1  Nicolas Wolff2  Baptiste Colcombet-Cazenave3  Olivier Spérandio4  Karen Druart4  Crystel Bonnet5  Christine Petit5  | |
[1] Hub de Bioinformatique et Biostatistique – Département Biologie Computationnelle, USR 3756 CNRS, Institut Pasteur, Paris, France;Unité Récepteurs-Canaux, Institut Pasteur, 75015, Paris, France;Unité Récepteurs-Canaux, Institut Pasteur, 75015, Paris, France;Collège Doctoral, Sorbonne Université, 75005, Paris, France;Unité de Bio-Informatique Structurale, Institut Pasteur, 75015, Paris, France;Unité de Génétique et Physiologie de l’Audition, Institut Pasteur, 75015, Paris, France;INSERM, Institut de l’Audition, Institut Pasteur, 75012, Paris, France; | |
关键词: Harmonin homology domains; Profile HMM; Screening; Phylogeny; Usher syndrome; Sequence analysis; | |
DOI : 10.1186/s12859-021-04116-5 | |
来源: Springer | |
【 摘 要 】
BackgroundHarmonin Homogy Domains (HHD) are recently identified orphan domains of about 70 residues folded in a compact five alpha-helix bundle that proved to be versatile in terms of function, allowing for direct binding to a partner as well as regulating the affinity and specificity of adjacent domains for their own targets. Adding their small size and rather simple fold, HHDs appear as convenient modules to regulate protein–protein interactions in various biological contexts. Surprisingly, only nine HHDs have been detected in six proteins, mainly expressed in sensory neurons.ResultsHere, we built a profile Hidden Markov Model to screen the entire UniProtKB for new HHD-containing proteins. Every hit was manually annotated, using a clustering approach, confirming that only a few proteins contain HHDs. We report the phylogenetic coverage of each protein and build a phylogenetic tree to trace the evolution of HHDs. We suggest that a HHD ancestor is shared with Paired Amphipathic Helices (PAH) domains, a four-helix bundle partially sharing fold and functional properties. We characterized amino-acid sequences of the various HHDs using pairwise BLASTP scoring coupled with community clustering and manually assessed sequence features among each individual family. These sequence features were analyzed using reported structures as well as homology models to highlight structural motifs underlying HHDs fold. We show that functional divergence is carried out by subtle differences in sequences that automatized approaches failed to detect.ConclusionsWe provide the first HHD databases, including sequences and conservation, phylogenic trees and a list of HHD variants found in the auditory system, which are available for the community. This case study highlights surprising phylogenetic properties found in orphan domains and will assist further studies of HHDs. We unveil the implication of HHDs in their various binding interfaces using conservation across families and a new protein–protein surface predictor. Finally, we discussed the functional consequences of three identified pathogenic HHD variants involved in Hoyeraal-Hreidarsson syndrome and of three newly reported pathogenic variants identified in patients suffering from Usher Syndrome.
【 授权许可】
CC BY
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202107032294570ZK.pdf | 6388KB | download |