Amino acid dipeptide frequency for Gorilla anellovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
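As a rough illustration of how such a table could be produced, the sketch below counts dipeptides in a set of protein sequences and reports them in per mille, next to the 2.5‰ uniform expectation. It is not the Proteome-pI implementation: the function names, the one-letter alphabet, the use of overlapping residue pairs counted within each protein, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"           # X stands for Xaa (non-standard or unknown)
EXPECTED_UNIFORM = 1000.0 / 400        # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: dipeptides are overlapping residue pairs counted within each
    protein; pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2 residues)")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def representation(freq_per_mille):
    """Classify a frequency relative to the uniform expectation."""
    if freq_per_mille > EXPECTED_UNIFORM:
        return "over"
    if freq_per_mille < EXPECTED_UNIFORM:
        return "under"
    return "expected"


# Toy input, not real Gorilla anellovirus sequences.
toy_proteins = ["MKKLAAB", "AAKK"]
freqs = dipeptide_per_mille(toy_proteins)
print(round(freqs["AA"], 3), representation(freqs["AA"]))
print(freqs["WW"], representation(freqs["WW"]))
```

Multiplying any returned value by 10^-3 gives the plain fraction, so the 441 values sum to 1000‰.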

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.846AlaAla: 3.846 ± 2.244
0.0AlaCys: 0.0 ± 0.0
2.564AlaAsp: 2.564 ± 2.842
5.128AlaGlu: 5.128 ± 1.646
2.564AlaPhe: 2.564 ± 1.196
1.282AlaGly: 1.282 ± 3.44
5.128AlaHis: 5.128 ± 2.391
2.564AlaIle: 2.564 ± 2.842
0.0AlaLys: 0.0 ± 0.0
3.846AlaLeu: 3.846 ± 2.244
0.0AlaMet: 0.0 ± 0.0
1.282AlaAsn: 1.282 ± 0.598
0.0AlaPro: 0.0 ± 0.0
1.282AlaGln: 1.282 ± 0.598
1.282AlaArg: 1.282 ± 3.44
5.128AlaSer: 5.128 ± 2.391
2.564AlaThr: 2.564 ± 2.842
1.282AlaVal: 1.282 ± 3.44
1.282AlaTrp: 1.282 ± 3.44
2.564AlaTyr: 2.564 ± 1.196
0.0AlaXaa: 0.0 ± 0.0
Cys
1.282CysAla: 1.282 ± 0.598
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
2.564CysGly: 2.564 ± 2.842
0.0CysHis: 0.0 ± 0.0
1.282CysIle: 1.282 ± 0.598
1.282CysLys: 1.282 ± 0.598
1.282CysLeu: 1.282 ± 0.598
0.0CysMet: 0.0 ± 0.0
1.282CysAsn: 1.282 ± 0.598
1.282CysPro: 1.282 ± 3.44
0.0CysGln: 0.0 ± 0.0
2.564CysArg: 2.564 ± 1.196
3.846CysSer: 3.846 ± 2.244
1.282CysThr: 1.282 ± 0.598
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.846AspAla: 3.846 ± 6.281
0.0AspCys: 0.0 ± 0.0
1.282AspAsp: 1.282 ± 3.44
0.0AspGlu: 0.0 ± 0.0
2.564AspPhe: 2.564 ± 1.196
1.282AspGly: 1.282 ± 0.598
1.282AspHis: 1.282 ± 3.44
6.41AspIle: 6.41 ± 2.989
2.564AspLys: 2.564 ± 1.196
3.846AspLeu: 3.846 ± 2.244
0.0AspMet: 0.0 ± 0.0
2.564AspAsn: 2.564 ± 1.196
2.564AspPro: 2.564 ± 1.196
2.564AspGln: 2.564 ± 1.196
1.282AspArg: 1.282 ± 3.44
1.282AspSer: 1.282 ± 3.44
3.846AspThr: 3.846 ± 6.281
2.564AspVal: 2.564 ± 1.196
1.282AspTrp: 1.282 ± 0.598
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
1.282GluAla: 1.282 ± 0.598
2.564GluCys: 2.564 ± 2.842
3.846GluAsp: 3.846 ± 2.244
11.538GluGlu: 11.538 ± 14.807
2.564GluPhe: 2.564 ± 2.842
5.128GluGly: 5.128 ± 1.646
0.0GluHis: 0.0 ± 0.0
1.282GluIle: 1.282 ± 3.44
1.282GluLys: 1.282 ± 3.44
2.564GluLeu: 2.564 ± 2.842
0.0GluMet: 0.0 ± 0.0
2.564GluAsn: 2.564 ± 1.196
3.846GluPro: 3.846 ± 1.794
5.128GluGln: 5.128 ± 2.391
1.282GluArg: 1.282 ± 3.44
3.846GluSer: 3.846 ± 1.794
7.692GluThr: 7.692 ± 3.587
0.0GluVal: 0.0 ± 0.0
0.0GluTrp: 0.0 ± 0.0
1.282GluTyr: 1.282 ± 0.598
0.0GluXaa: 0.0 ± 0.0
Phe
5.128PheAla: 5.128 ± 1.646
0.0PheCys: 0.0 ± 0.0
0.0PheAsp: 0.0 ± 0.0
1.282PheGlu: 1.282 ± 0.598
3.846PhePhe: 3.846 ± 1.794
1.282PheGly: 1.282 ± 0.598
0.0PheHis: 0.0 ± 0.0
2.564PheIle: 2.564 ± 1.196
7.692PheLys: 7.692 ± 3.587
6.41PheLeu: 6.41 ± 9.123
1.282PheMet: 1.282 ± 0.598
2.564PheAsn: 2.564 ± 1.196
0.0PhePro: 0.0 ± 0.0
3.846PheGln: 3.846 ± 1.794
0.0PheArg: 0.0 ± 0.0
3.846PheSer: 3.846 ± 1.794
0.0PheThr: 0.0 ± 0.0
0.0PheVal: 0.0 ± 0.0
3.846PheTrp: 3.846 ± 1.794
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.282GlyAla: 1.282 ± 0.598
1.282GlyCys: 1.282 ± 3.44
3.846GlyAsp: 3.846 ± 6.281
2.564GlyGlu: 2.564 ± 2.842
1.282GlyPhe: 1.282 ± 0.598
6.41GlyGly: 6.41 ± 2.989
1.282GlyHis: 1.282 ± 0.598
2.564GlyIle: 2.564 ± 2.842
0.0GlyLys: 0.0 ± 0.0
5.128GlyLeu: 5.128 ± 1.646
0.0GlyMet: 0.0 ± 1.775
10.256GlyAsn: 10.256 ± 4.783
1.282GlyPro: 1.282 ± 0.598
1.282GlyGln: 1.282 ± 0.598
1.282GlyArg: 1.282 ± 3.44
2.564GlySer: 2.564 ± 1.196
6.41GlyThr: 6.41 ± 2.989
0.0GlyVal: 0.0 ± 0.0
1.282GlyTrp: 1.282 ± 0.598
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
2.564HisAla: 2.564 ± 1.196
1.282HisCys: 1.282 ± 0.598
1.282HisAsp: 1.282 ± 3.44
1.282HisGlu: 1.282 ± 3.44
1.282HisPhe: 1.282 ± 3.44
1.282HisGly: 1.282 ± 0.598
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
2.564HisLys: 2.564 ± 1.196
0.0HisLeu: 0.0 ± 0.0
1.282HisMet: 1.282 ± 0.598
1.282HisAsn: 1.282 ± 0.598
2.564HisPro: 2.564 ± 1.196
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
2.564HisSer: 2.564 ± 1.196
2.564HisThr: 2.564 ± 6.879
0.0HisVal: 0.0 ± 0.0
1.282HisTrp: 1.282 ± 0.598
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.282IleAla: 1.282 ± 0.598
0.0IleCys: 0.0 ± 0.0
1.282IleAsp: 1.282 ± 0.598
5.128IleGlu: 5.128 ± 2.391
6.41IlePhe: 6.41 ± 2.989
0.0IleGly: 0.0 ± 0.0
3.846IleHis: 3.846 ± 6.281
3.846IleIle: 3.846 ± 1.794
5.128IleLys: 5.128 ± 2.391
6.41IleLeu: 6.41 ± 2.989
0.0IleMet: 0.0 ± 0.0
5.128IleAsn: 5.128 ± 1.646
5.128IlePro: 5.128 ± 5.684
1.282IleGln: 1.282 ± 0.598
5.128IleArg: 5.128 ± 2.391
7.692IleSer: 7.692 ± 0.45
6.41IleThr: 6.41 ± 5.086
3.846IleVal: 3.846 ± 1.794
0.0IleTrp: 0.0 ± 0.0
2.564IleTyr: 2.564 ± 1.196
0.0IleXaa: 0.0 ± 0.0
Lys
3.846LysAla: 3.846 ± 2.244
2.564LysCys: 2.564 ± 1.196
2.564LysAsp: 2.564 ± 1.196
3.846LysGlu: 3.846 ± 1.794
1.282LysPhe: 1.282 ± 0.598
3.846LysGly: 3.846 ± 1.794
0.0LysHis: 0.0 ± 0.0
3.846LysIle: 3.846 ± 1.794
7.692LysLys: 7.692 ± 3.587
5.128LysLeu: 5.128 ± 2.391
1.282LysMet: 1.282 ± 0.598
2.564LysAsn: 2.564 ± 1.196
3.846LysPro: 3.846 ± 1.794
3.846LysGln: 3.846 ± 1.794
5.128LysArg: 5.128 ± 2.391
2.564LysSer: 2.564 ± 1.196
3.846LysThr: 3.846 ± 1.794
0.0LysVal: 0.0 ± 0.0
2.564LysTrp: 2.564 ± 1.196
2.564LysTyr: 2.564 ± 2.842
0.0LysXaa: 0.0 ± 0.0
Leu
1.282LeuAla: 1.282 ± 0.598
1.282LeuCys: 1.282 ± 0.598
3.846LeuAsp: 3.846 ± 2.244
2.564LeuGlu: 2.564 ± 2.842
6.41LeuPhe: 6.41 ± 1.048
3.846LeuGly: 3.846 ± 1.794
2.564LeuHis: 2.564 ± 1.196
3.846LeuIle: 3.846 ± 1.794
5.128LeuLys: 5.128 ± 2.391
10.256LeuLeu: 10.256 ± 0.745
1.282LeuMet: 1.282 ± 0.547
3.846LeuAsn: 3.846 ± 1.794
5.128LeuPro: 5.128 ± 5.684
1.282LeuGln: 1.282 ± 3.44
6.41LeuArg: 6.41 ± 1.048
5.128LeuSer: 5.128 ± 2.391
8.974LeuThr: 8.974 ± 4.185
2.564LeuVal: 2.564 ± 1.196
0.0LeuTrp: 0.0 ± 0.0
6.41LeuTyr: 6.41 ± 2.989
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
2.564MetCys: 2.564 ± 1.196
0.0MetAsp: 0.0 ± 0.0
1.282MetGlu: 1.282 ± 0.598
0.0MetPhe: 0.0 ± 0.0
1.282MetGly: 1.282 ± 0.598
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
0.0MetLeu: 0.0 ± 0.0
1.282MetMet: 1.282 ± 0.598
0.0MetAsn: 0.0 ± 0.0
2.564MetPro: 2.564 ± 1.196
2.564MetGln: 2.564 ± 1.196
0.0MetArg: 0.0 ± 0.0
1.282MetSer: 1.282 ± 3.44
1.282MetThr: 1.282 ± 0.598
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.282AsnAla: 1.282 ± 0.598
0.0AsnCys: 0.0 ± 0.0
1.282AsnAsp: 1.282 ± 0.598
2.564AsnGlu: 2.564 ± 1.196
2.564AsnPhe: 2.564 ± 1.196
2.564AsnGly: 2.564 ± 1.196
1.282AsnHis: 1.282 ± 0.598
5.128AsnIle: 5.128 ± 2.391
6.41AsnLys: 6.41 ± 2.989
5.128AsnLeu: 5.128 ± 2.391
0.0AsnMet: 0.0 ± 0.0
2.564AsnAsn: 2.564 ± 1.196
7.692AsnPro: 7.692 ± 3.587
2.564AsnGln: 2.564 ± 1.196
1.282AsnArg: 1.282 ± 0.598
2.564AsnSer: 2.564 ± 2.842
3.846AsnThr: 3.846 ± 1.794
1.282AsnVal: 1.282 ± 0.598
0.0AsnTrp: 0.0 ± 0.0
7.692AsnTyr: 7.692 ± 3.587
0.0AsnXaa: 0.0 ± 0.0
Pro
6.41ProAla: 6.41 ± 1.048
0.0ProCys: 0.0 ± 0.0
2.564ProAsp: 2.564 ± 1.196
2.564ProGlu: 2.564 ± 2.842
1.282ProPhe: 1.282 ± 0.598
2.564ProGly: 2.564 ± 1.196
0.0ProHis: 0.0 ± 0.0
3.846ProIle: 3.846 ± 2.244
3.846ProLys: 3.846 ± 2.244
8.974ProLeu: 8.974 ± 4.185
1.282ProMet: 1.282 ± 0.598
0.0ProAsn: 0.0 ± 0.0
8.974ProPro: 8.974 ± 3.89
0.0ProGln: 0.0 ± 0.0
6.41ProArg: 6.41 ± 2.989
7.692ProSer: 7.692 ± 3.587
2.564ProThr: 2.564 ± 2.842
1.282ProVal: 1.282 ± 3.44
3.846ProTrp: 3.846 ± 1.794
6.41ProTyr: 6.41 ± 2.989
0.0ProXaa: 0.0 ± 0.0
Gln
1.282GlnAla: 1.282 ± 0.598
0.0GlnCys: 0.0 ± 0.0
1.282GlnAsp: 1.282 ± 0.598
2.564GlnGlu: 2.564 ± 1.196
2.564GlnPhe: 2.564 ± 1.196
2.564GlnGly: 2.564 ± 2.842
1.282GlnHis: 1.282 ± 0.598
6.41GlnIle: 6.41 ± 2.989
5.128GlnLys: 5.128 ± 2.391
5.128GlnLeu: 5.128 ± 2.391
2.564GlnMet: 2.564 ± 1.196
2.564GlnAsn: 2.564 ± 1.196
6.41GlnPro: 6.41 ± 2.989
11.538GlnGln: 11.538 ± 5.381
1.282GlnArg: 1.282 ± 0.598
0.0GlnSer: 0.0 ± 0.0
0.0GlnThr: 0.0 ± 0.0
0.0GlnVal: 0.0 ± 0.0
2.564GlnTrp: 2.564 ± 2.842
1.282GlnTyr: 1.282 ± 0.598
0.0GlnXaa: 0.0 ± 0.0
Arg
2.564ArgAla: 2.564 ± 2.842
1.282ArgCys: 1.282 ± 0.598
0.0ArgAsp: 0.0 ± 0.0
1.282ArgGlu: 1.282 ± 3.44
1.282ArgPhe: 1.282 ± 0.598
3.846ArgGly: 3.846 ± 6.281
1.282ArgHis: 1.282 ± 0.598
2.564ArgIle: 2.564 ± 1.196
3.846ArgLys: 3.846 ± 1.794
5.128ArgLeu: 5.128 ± 2.391
0.0ArgMet: 0.0 ± 0.0
3.846ArgAsn: 3.846 ± 1.794
3.846ArgPro: 3.846 ± 1.794
2.564ArgGln: 2.564 ± 1.196
24.359ArgArg: 24.359 ± 11.359
0.0ArgSer: 0.0 ± 0.0
1.282ArgThr: 1.282 ± 0.598
1.282ArgVal: 1.282 ± 0.598
2.564ArgTrp: 2.564 ± 2.842
5.128ArgTyr: 5.128 ± 2.391
0.0ArgXaa: 0.0 ± 0.0
Ser
1.282SerAla: 1.282 ± 3.44
2.564SerCys: 2.564 ± 1.196
6.41SerAsp: 6.41 ± 5.086
1.282SerGlu: 1.282 ± 0.598
2.564SerPhe: 2.564 ± 2.842
5.128SerGly: 5.128 ± 1.646
2.564SerHis: 2.564 ± 1.196
3.846SerIle: 3.846 ± 2.244
1.282SerLys: 1.282 ± 0.598
5.128SerLeu: 5.128 ± 2.391
1.282SerMet: 1.282 ± 0.598
5.128SerAsn: 5.128 ± 2.391
1.282SerPro: 1.282 ± 0.598
5.128SerGln: 5.128 ± 2.391
1.282SerArg: 1.282 ± 0.598
1.282SerSer: 1.282 ± 3.44
3.846SerThr: 3.846 ± 1.794
3.846SerVal: 3.846 ± 2.244
1.282SerTrp: 1.282 ± 0.598
5.128SerTyr: 5.128 ± 2.391
0.0SerXaa: 0.0 ± 0.0
Thr
2.564ThrAla: 2.564 ± 2.842
0.0ThrCys: 0.0 ± 0.0
6.41ThrAsp: 6.41 ± 2.989
10.256ThrGlu: 10.256 ± 3.292
0.0ThrPhe: 0.0 ± 0.0
3.846ThrGly: 3.846 ± 2.244
2.564ThrHis: 2.564 ± 6.879
5.128ThrIle: 5.128 ± 2.391
3.846ThrLys: 3.846 ± 1.794
6.41ThrLeu: 6.41 ± 2.989
1.282ThrMet: 1.282 ± 0.598
3.846ThrAsn: 3.846 ± 1.794
5.128ThrPro: 5.128 ± 1.646
3.846ThrGln: 3.846 ± 1.794
0.0ThrArg: 0.0 ± 0.0
5.128ThrSer: 5.128 ± 1.646
12.821ThrThr: 12.821 ± 6.134
2.564ThrVal: 2.564 ± 1.196
2.564ThrTrp: 2.564 ± 2.842
2.564ThrTyr: 2.564 ± 1.196
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.0ValCys: 0.0 ± 0.0
0.0ValAsp: 0.0 ± 0.0
0.0ValGlu: 0.0 ± 0.0
1.282ValPhe: 1.282 ± 0.598
1.282ValGly: 1.282 ± 0.598
0.0ValHis: 0.0 ± 0.0
3.846ValIle: 3.846 ± 2.244
3.846ValLys: 3.846 ± 1.794
0.0ValLeu: 0.0 ± 0.0
0.0ValMet: 0.0 ± 0.0
0.0ValAsn: 0.0 ± 0.0
3.846ValPro: 3.846 ± 1.794
0.0ValGln: 0.0 ± 0.0
2.564ValArg: 2.564 ± 1.196
3.846ValSer: 3.846 ± 2.244
2.564ValThr: 2.564 ± 1.196
0.0ValVal: 0.0 ± 0.0
0.0ValTrp: 0.0 ± 0.0
1.282ValTyr: 1.282 ± 3.44
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
1.282TrpCys: 1.282 ± 3.44
2.564TrpAsp: 2.564 ± 1.196
1.282TrpGlu: 1.282 ± 0.598
1.282TrpPhe: 1.282 ± 0.598
1.282TrpGly: 1.282 ± 0.598
0.0TrpHis: 0.0 ± 0.0
3.846TrpIle: 3.846 ± 6.281
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.282TrpAsn: 1.282 ± 0.598
1.282TrpPro: 1.282 ± 0.598
3.846TrpGln: 3.846 ± 2.244
1.282TrpArg: 1.282 ± 0.598
0.0TrpSer: 0.0 ± 0.0
2.564TrpThr: 2.564 ± 1.196
1.282TrpVal: 1.282 ± 0.598
0.0TrpTrp: 0.0 ± 0.0
1.282TrpTyr: 1.282 ± 0.598
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.564TyrAla: 2.564 ± 2.842
1.282TyrCys: 1.282 ± 0.598
0.0TyrAsp: 0.0 ± 0.0
1.282TyrGlu: 1.282 ± 0.598
2.564TyrPhe: 2.564 ± 1.196
0.0TyrGly: 0.0 ± 0.0
0.0TyrHis: 0.0 ± 0.0
7.692TyrIle: 7.692 ± 3.587
1.282TyrLys: 1.282 ± 0.598
1.282TyrLeu: 1.282 ± 0.598
0.0TyrMet: 0.0 ± 0.0
5.128TyrAsn: 5.128 ± 2.391
3.846TyrPro: 3.846 ± 1.794
3.846TyrGln: 3.846 ± 1.794
5.128TyrArg: 5.128 ± 2.391
1.282TyrSer: 1.282 ± 0.598
6.41TyrThr: 6.41 ± 1.048
2.564TyrVal: 2.564 ± 1.196
0.0TyrTrp: 0.0 ± 0.0
3.846TyrTyr: 3.846 ± 1.794
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (781 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
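A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The page states only that 100 bootstrap rounds were run at the protein level; the use of the sample standard deviation, the function names, the fixed seed, and the toy sequences are assumptions of this example.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins.

    Assumption: the error is the sample standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real Gorilla anellovirus sequences.
toy_proteins = ["AAKK", "KKKK"]
value = pair_per_mille(toy_proteins, "KK")
error = bootstrap_error(toy_proteins, "KK")
print(f"KK: {value:.3f} ± {error:.3f}")
```

With only two proteins, as in this proteome, each replicate can contain only one of three distinct protein combinations, so the resulting error estimates are coarse.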

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski