Genome-based identification and analysis of collagen-related structural motifs in bacterial and viral proteins.
Collagens are extended trimeric proteins composed of the repetitive sequence glycine-X-Y. A (c) under bar ollagen- related (S) under bar tructural (m) under bar otif (CSM) containing glycine-X-Y repeats is also found in numerous proteins often referred to as collagen-like proteins. Little is known about CSMs in bacteria and viruses, but the occurrence of such motifs has recently been demonstrated.
