Home  |  Contact

UniProtKB/Swiss-Prot P02452: Variant p.Pro1460His

Collagen alpha-1(I) chain
Gene: COL1A1
Chromosomal location: 17q21.3-q22
Variant information

Variant position:  1460
The position of the amino-acid change on the UniProtKB canonical protein sequence.

Type of variant:  Polymorphism
The variants are classified into three categories: Disease, Polymorphism and Unclassified.
  • Disease: Variants implicated in disease according to literature reports.
  • Polymorphism: Variants not reported to be implicated in disease.
  • Unclassified: Variants with uncertain implication in disease according to literature reports. Evidence against or in favor of a pathogenic role is limited and/or conflicting.

Residue change:  From Proline (P) to Histidine (H) at position 1460 (P1460H, p.Pro1460His).
Indicates the amino acid change of the variant. The one-letter and three-letter codes for amino acids used in UniProtKB/Swiss-Prot are those adopted by the commission on Biochemical Nomenclature of the IUPAC-IUB.

Physico-chemical properties:  Change from medium size and hydrophobic (P) to medium size and polar (H)
The physico-chemical property of the reference and variant residues and the change implicated.

BLOSUM score:  -2
The score within a Blosum matrix for the corresponding wild-type to variant amino acid change. The log-odds score measures the logarithm for the ratio of the likelihood of two amino acids appearing by chance. The Blosum62 substitution matrix is used. This substitution matrix contains scores for all possible exchanges of one amino acid with another:
  • Lowest score: -4 (low probability of substitution).
  • Highest score: 11 (high probability of substitution).
More information can be found on the following page

Other resources:  
Links to websites of interest for the variant.



Sequence information

Variant position:  1460
The position of the amino-acid change on the UniProtKB canonical protein sequence.

Protein sequence length:  1464
The length of the canonical sequence.

Location on the sequence:   IDVAPLDVGAPDQEFGFDVG  P VCFL
The residue change on the sequence. Unless the variant is located at the beginning or at the end of the protein sequence, both residues upstream (20) and downstream (20) of the variant will be shown.

Residue conservation: 
The multiple alignment of the region surrounding the variant against various orthologous sequences.

Human                         IDVAPLDVGAPDQEFGFDVGPVCFL

                              IDVAPLDVGAPDQEFGMDIGPVCFL

Mouse                         IDVAPLDIGAPDQEFGLDIGPACFV

Rat                           IDVAPLDIGAPDQEFGMDIGPACFV

Bovine                        IDVAPLDVGAPDQEFGFDVGPACFL

Chicken                       IDLAPMDVGAPDQEFGIDIGPVCFL

Sequence annotation in neighborhood:  
The regions or sites of interest surrounding the variant. In general the features listed are posttranslational modifications, binding sites, enzyme active sites, local secondary structure or other characteristics reported in the cited references. The "Sequence annotation in neighborhood" lines have a fixed format:
  • Type: the type of sequence feature.
  • Positions: endpoints of the sequence feature.
  • Description: contains additional information about the feature.

TypePositionsDescription
Propeptide 1219 – 1464 C-terminal propeptide
Domain 1229 – 1464 Fibrillar collagen NC1
Disulfide bond 1299 – 1462
Beta strand 1453 – 1463


Literature citations

The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC).
The MGC Project Team;
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]; VARIANTS ALA-1075; ARG-1438 AND HIS-1460;

Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. They are not in any way intended to be used as a substitute for professional medical advice, diagnostic, treatment or care.