UniProtKB/Swiss-Prot O14497: Variant p.Gly2087Arg

AT-rich interactive domain-containing protein 1A
Gene: ARID1A
Chromosomal location: 1p35.3
Variant information

Variant position:  2087
The position of the amino-acid change on the UniProtKB canonical protein sequence.

Type of variant:  Unclassified
The variants are classified into three categories: Disease, Polymorphism and Unclassified.
  • Disease: Variants have been found in patients and disease-association is reported in literature. However, this classification is not a definitive assessment of variant pathogenicity.
  • Polymorphism: No disease-association has been reported.
  • Unclassified: Variants have been found in patients but disease-association remains unclear.

Residue change:  From Glycine (G) to Arginine (R) at position 2087 (G2087R, p.Gly2087Arg).
Indicates the amino acid change of the variant. The one-letter and three-letter codes for amino acids used in UniProtKB/Swiss-Prot are those adopted by the commission on Biochemical Nomenclature of the IUPAC-IUB.

Physico-chemical properties:  Change from glycine (G) to large size and basic (R)
The physico-chemical property of the reference and variant residues and the change implicated.

BLOSUM score:  -2
The score within a Blosum matrix for the corresponding wild-type to variant amino acid change. The log-odds score measures the logarithm for the ratio of the likelihood of two amino acids appearing by chance. The Blosum62 substitution matrix is used. This substitution matrix contains scores for all possible exchanges of one amino acid with another:
  • Lowest score: -4 (low probability of substitution).
  • Highest score: 11 (high probability of substitution).
More information can be found on the following page

Variant description:  Found in a breast cancer sample; somatic mutation.
Any additional useful information about the variant.



Sequence information

Variant position:  2087
The position of the amino-acid change on the UniProtKB canonical protein sequence.

Protein sequence length:  2285
The length of the canonical sequence.

Location on the sequence:   ISGQLDLSPYPESICLPVLD  G LLHWAVCPSAEAQDPFSTLG
The residue change on the sequence. Unless the variant is located at the beginning or at the end of the protein sequence, both residues upstream (20) and downstream (20) of the variant will be shown.

Residue conservation: 
The multiple alignment of the region surrounding the variant against various orthologous sequences.

Human                         ISGQLDLSPYPESICLPVLDGLLHWAVCPSAEAQDPFSTLG

Mouse                         ISGQLDLSPYPESICLPVLDGLLHWAVCPSAEAQDPFSTLG

Sequence annotation in neighborhood:  
The regions or sites of interest surrounding the variant. In general the features listed are posttranslational modifications, binding sites, enzyme active sites, local secondary structure or other characteristics reported in the cited references. The "Sequence annotation in neighborhood" lines have a fixed format:
  • Type: the type of sequence feature.
  • Positions: endpoints of the sequence feature.
  • Description: contains additional information about the feature.

TypePositionsDescription
Chain 2 – 2285 AT-rich interactive domain-containing protein 1A
Motif 2085 – 2089 LXXLL


Literature citations

Somatic mutations in the chromatin remodeling gene ARID1A occur in several tumor types.
Jones S.; Li M.; Parsons D.W.; Zhang X.; Wesseling J.; Kristel P.; Schmidt M.K.; Markowitz S.; Yan H.; Bigner D.; Hruban R.H.; Eshleman J.R.; Iacobuzio-Donahue C.A.; Goggins M.; Maitra A.; Malek S.N.; Powell S.; Vogelstein B.; Kinzler K.W.; Velculescu V.E.; Papadopoulos N.;
Hum. Mutat. 33:100-103(2012)
Cited for: VARIANTS TRP-1658; PHE-1907 AND ARG-2087;

Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. They are not in any way intended to be used as a substitute for professional medical advice, diagnostic, treatment or care.