Amino acid dipeptide frequency for Gyrovirus 10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
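The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names, the one-letter alphabet constant, and the choices to count overlapping pairs within each protein (never across protein boundaries) and to compare against the uniform 1/400 expectation are all assumptions.

```python
from collections import Counter
from itertools import product

# Assumption: sequences use one-letter codes for the 20 standard residues.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the expected value (hypothetical helper)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

For example, `classify(dipeptide_per_mille(["AAAC"])["AA"])` labels AlaAla as overrepresented in that toy input, which corresponds to a red cell in the table.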

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
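As a small illustration of the unit conversion, the following sketch turns a table value in per mille into a fraction or a percentage. The helper names are hypothetical; the ArgArg value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value / 10.0


arg_arg = 14.925  # ArgArg frequency from the table, in per mille
fraction = per_mille_to_fraction(arg_arg)  # about 0.014925
percent = per_mille_to_percent(arg_arg)    # about 1.4925 %
```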

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.357 ± 1.383
AlaCys: 0.0 ± 0.0
AlaAsp: 0.0 ± 0.0
AlaGlu: 1.357 ± 0.798
AlaPhe: 5.427 ± 1.961
AlaGly: 4.071 ± 1.742
AlaHis: 2.714 ± 2.766
AlaIle: 2.714 ± 1.596
AlaLys: 1.357 ± 0.798
AlaLeu: 2.714 ± 0.98
AlaMet: 1.357 ± 1.996
AlaAsn: 2.714 ± 2.766
AlaPro: 1.357 ± 1.383
AlaGln: 2.714 ± 1.596
AlaArg: 6.784 ± 3.694
AlaSer: 8.141 ± 3.145
AlaThr: 9.498 ± 6.289
AlaVal: 4.071 ± 2.394
AlaTrp: 0.0 ± 0.0
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.357 ± 0.798
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.357 ± 0.798
CysLys: 2.714 ± 2.474
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.357 ± 1.383
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.357 ± 0.798
CysSer: 4.071 ± 2.261
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 1.357 ± 0.798
CysTyr: 1.357 ± 0.798
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.071 ± 4.149
AspCys: 2.714 ± 0.98
AspAsp: 5.427 ± 5.532
AspGlu: 2.714 ± 0.98
AspPhe: 0.0 ± 0.0
AspGly: 2.714 ± 0.98
AspHis: 0.0 ± 0.0
AspIle: 1.357 ± 1.383
AspLys: 4.071 ± 2.261
AspLeu: 5.427 ± 3.617
AspMet: 0.0 ± 0.0
AspAsn: 1.357 ± 0.798
AspPro: 2.714 ± 1.596
AspGln: 4.071 ± 2.394
AspArg: 1.357 ± 0.798
AspSer: 4.071 ± 2.261
AspThr: 1.357 ± 1.383
AspVal: 4.071 ± 1.742
AspTrp: 1.357 ± 1.383
AspTyr: 2.714 ± 1.596
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.357 ± 0.798
GluCys: 1.357 ± 0.798
GluAsp: 2.714 ± 2.766
GluGlu: 1.357 ± 1.383
GluPhe: 1.357 ± 0.798
GluGly: 1.357 ± 0.798
GluHis: 0.0 ± 0.0
GluIle: 0.0 ± 0.0
GluLys: 1.357 ± 0.798
GluLeu: 1.357 ± 2.228
GluMet: 0.0 ± 0.0
GluAsn: 1.357 ± 2.228
GluPro: 0.0 ± 0.0
GluGln: 4.071 ± 1.799
GluArg: 2.714 ± 2.766
GluSer: 4.071 ± 3.332
GluThr: 4.071 ± 1.799
GluVal: 0.0 ± 0.0
GluTrp: 0.0 ± 0.0
GluTyr: 1.357 ± 0.798
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.714 ± 0.98
PheCys: 1.357 ± 1.383
PheAsp: 2.714 ± 2.766
PheGlu: 0.0 ± 0.0
PhePhe: 0.0 ± 0.0
PheGly: 1.357 ± 0.798
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 1.357 ± 0.798
PheLeu: 4.071 ± 2.394
PheMet: 2.714 ± 1.596
PheAsn: 0.0 ± 0.0
PhePro: 8.141 ± 4.788
PheGln: 1.357 ± 0.798
PheArg: 12.212 ± 5.474
PheSer: 1.357 ± 0.798
PheThr: 1.357 ± 0.798
PheVal: 0.0 ± 0.0
PheTrp: 1.357 ± 0.798
PheTyr: 1.357 ± 0.798
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.714 ± 1.596
GlyCys: 0.0 ± 0.0
GlyAsp: 1.357 ± 0.798
GlyGlu: 4.071 ± 2.261
GlyPhe: 1.357 ± 0.798
GlyGly: 2.714 ± 1.596
GlyHis: 0.0 ± 0.0
GlyIle: 5.427 ± 1.104
GlyLys: 2.714 ± 1.596
GlyLeu: 1.357 ± 0.798
GlyMet: 0.0 ± 0.0
GlyAsn: 5.427 ± 1.697
GlyPro: 2.714 ± 0.98
GlyGln: 6.784 ± 2.57
GlyArg: 2.714 ± 1.861
GlySer: 8.141 ± 4.788
GlyThr: 10.855 ± 4.746
GlyVal: 1.357 ± 2.228
GlyTrp: 2.714 ± 0.98
GlyTyr: 2.714 ± 1.861
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.357 ± 0.798
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 1.357 ± 0.798
HisHis: 1.357 ± 0.798
HisIle: 1.357 ± 0.798
HisLys: 2.714 ± 0.98
HisLeu: 1.357 ± 2.228
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.357 ± 0.798
HisGln: 2.714 ± 2.474
HisArg: 2.714 ± 1.861
HisSer: 2.714 ± 2.474
HisThr: 1.357 ± 0.798
HisVal: 0.0 ± 0.0
HisTrp: 2.714 ± 2.766
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.427 ± 1.697
IleCys: 1.357 ± 1.383
IleAsp: 4.071 ± 1.133
IleGlu: 1.357 ± 2.228
IlePhe: 0.0 ± 0.0
IleGly: 6.784 ± 3.329
IleHis: 2.714 ± 1.861
IleIle: 2.714 ± 0.98
IleLys: 1.357 ± 1.383
IleLeu: 4.071 ± 1.742
IleMet: 1.357 ± 0.798
IleAsn: 4.071 ± 2.261
IlePro: 1.357 ± 0.798
IleGln: 0.0 ± 0.0
IleArg: 2.714 ± 2.474
IleSer: 1.357 ± 0.798
IleThr: 5.427 ± 1.697
IleVal: 1.357 ± 0.798
IleTrp: 0.0 ± 0.0
IleTyr: 1.357 ± 0.798
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 0.0 ± 0.0
LysAsp: 2.714 ± 1.596
LysGlu: 0.0 ± 0.0
LysPhe: 5.427 ± 1.697
LysGly: 1.357 ± 0.798
LysHis: 2.714 ± 0.98
LysIle: 5.427 ± 2.535
LysLys: 4.071 ± 2.261
LysLeu: 8.141 ± 1.187
LysMet: 0.0 ± 0.0
LysAsn: 1.357 ± 0.798
LysPro: 0.0 ± 0.0
LysGln: 1.357 ± 0.798
LysArg: 4.071 ± 1.133
LysSer: 2.714 ± 2.474
LysThr: 6.784 ± 3.199
LysVal: 1.357 ± 0.798
LysTrp: 0.0 ± 0.0
LysTyr: 2.714 ± 1.596
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.427 ± 1.961
LeuCys: 0.0 ± 0.0
LeuAsp: 1.357 ± 1.383
LeuGlu: 0.0 ± 0.0
LeuPhe: 1.357 ± 0.798
LeuGly: 5.427 ± 3.192
LeuHis: 0.0 ± 0.0
LeuIle: 2.714 ± 1.861
LeuLys: 2.714 ± 1.596
LeuLeu: 5.427 ± 6.238
LeuMet: 2.714 ± 1.407
LeuAsn: 4.071 ± 1.133
LeuPro: 8.141 ± 3.193
LeuGln: 5.427 ± 3.878
LeuArg: 1.357 ± 1.383
LeuSer: 5.427 ± 2.07
LeuThr: 8.141 ± 3.483
LeuVal: 1.357 ± 0.798
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.357 ± 0.798
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.357 ± 1.383
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 2.714 ± 1.861
MetPhe: 2.714 ± 1.596
MetGly: 1.357 ± 0.798
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 1.357 ± 0.798
MetPro: 1.357 ± 2.228
MetGln: 0.0 ± 0.0
MetArg: 1.357 ± 0.798
MetSer: 2.714 ± 1.861
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 1.357 ± 1.383
MetTyr: 1.357 ± 0.798
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.714 ± 2.766
AsnCys: 1.357 ± 2.228
AsnAsp: 0.0 ± 0.0
AsnGlu: 0.0 ± 0.0
AsnPhe: 2.714 ± 0.98
AsnGly: 1.357 ± 1.383
AsnHis: 2.714 ± 0.98
AsnIle: 2.714 ± 1.596
AsnLys: 2.714 ± 0.98
AsnLeu: 1.357 ± 0.798
AsnMet: 0.0 ± 0.0
AsnAsn: 0.0 ± 0.0
AsnPro: 2.714 ± 1.596
AsnGln: 2.714 ± 1.596
AsnArg: 2.714 ± 2.766
AsnSer: 2.714 ± 0.98
AsnThr: 4.071 ± 1.133
AsnVal: 8.141 ± 0.946
AsnTrp: 2.714 ± 1.596
AsnTyr: 4.071 ± 2.261
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.071 ± 2.261
ProCys: 0.0 ± 0.0
ProAsp: 8.141 ± 0.946
ProGlu: 2.714 ± 0.98
ProPhe: 0.0 ± 0.0
ProGly: 8.141 ± 2.899
ProHis: 2.714 ± 4.457
ProIle: 5.427 ± 3.617
ProLys: 2.714 ± 1.596
ProLeu: 6.784 ± 3.573
ProMet: 0.0 ± 0.0
ProAsn: 2.714 ± 1.596
ProPro: 6.784 ± 1.963
ProGln: 6.784 ± 3.99
ProArg: 2.714 ± 1.861
ProSer: 8.141 ± 2.266
ProThr: 1.357 ± 0.798
ProVal: 2.714 ± 1.861
ProTrp: 0.0 ± 0.0
ProTyr: 4.071 ± 2.394
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.071 ± 1.133
GlnCys: 1.357 ± 0.798
GlnAsp: 4.071 ± 2.394
GlnGlu: 2.714 ± 1.861
GlnPhe: 2.714 ± 1.596
GlnGly: 5.427 ± 3.192
GlnHis: 2.714 ± 1.861
GlnIle: 1.357 ± 2.228
GlnLys: 1.357 ± 0.798
GlnLeu: 0.0 ± 0.0
GlnMet: 2.714 ± 1.861
GlnAsn: 5.427 ± 1.697
GlnPro: 4.071 ± 2.394
GlnGln: 1.357 ± 0.798
GlnArg: 2.714 ± 1.596
GlnSer: 2.714 ± 1.861
GlnThr: 9.498 ± 2.805
GlnVal: 0.0 ± 0.0
GlnTrp: 1.357 ± 0.798
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.784 ± 3.199
ArgCys: 0.0 ± 0.0
ArgAsp: 1.357 ± 1.383
ArgGlu: 1.357 ± 1.383
ArgPhe: 5.427 ± 3.192
ArgGly: 5.427 ± 3.723
ArgHis: 2.714 ± 1.596
ArgIle: 1.357 ± 0.798
ArgLys: 8.141 ± 0.946
ArgLeu: 2.714 ± 1.861
ArgMet: 1.357 ± 1.383
ArgAsn: 2.714 ± 2.766
ArgPro: 6.784 ± 2.57
ArgGln: 5.427 ± 3.878
ArgArg: 14.925 ± 3.058
ArgSer: 5.427 ± 3.723
ArgThr: 2.714 ± 0.98
ArgVal: 2.714 ± 1.596
ArgTrp: 4.071 ± 2.394
ArgTyr: 1.357 ± 0.798
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.784 ± 3.329
SerCys: 1.357 ± 1.383
SerAsp: 4.071 ± 1.133
SerGlu: 4.071 ± 4.028
SerPhe: 5.427 ± 1.697
SerGly: 5.427 ± 1.697
SerHis: 1.357 ± 0.798
SerIle: 6.784 ± 0.823
SerLys: 1.357 ± 1.383
SerLeu: 8.141 ± 3.193
SerMet: 0.0 ± 0.0
SerAsn: 2.714 ± 0.98
SerPro: 9.498 ± 4.255
SerGln: 5.427 ± 3.192
SerArg: 2.714 ± 1.596
SerSer: 9.498 ± 6.675
SerThr: 5.427 ± 2.535
SerVal: 2.714 ± 1.861
SerTrp: 1.357 ± 0.798
SerTyr: 2.714 ± 1.596
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.714 ± 2.474
ThrCys: 1.357 ± 0.798
ThrAsp: 8.141 ± 4.521
ThrGlu: 1.357 ± 1.383
ThrPhe: 4.071 ± 2.394
ThrGly: 4.071 ± 1.799
ThrHis: 0.0 ± 0.0
ThrIle: 5.427 ± 2.535
ThrLys: 6.784 ± 2.398
ThrLeu: 6.784 ± 2.398
ThrMet: 0.0 ± 0.0
ThrAsn: 5.427 ± 1.104
ThrPro: 10.855 ± 6.967
ThrGln: 4.071 ± 1.133
ThrArg: 2.714 ± 1.861
ThrSer: 8.141 ± 1.187
ThrThr: 6.784 ± 3.329
ThrVal: 4.071 ± 4.501
ThrTrp: 4.071 ± 3.332
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.714 ± 1.596
ValCys: 0.0 ± 0.0
ValAsp: 1.357 ± 1.383
ValGlu: 1.357 ± 0.798
ValPhe: 1.357 ± 0.798
ValGly: 4.071 ± 1.133
ValHis: 0.0 ± 0.0
ValIle: 2.714 ± 1.596
ValLys: 1.357 ± 2.228
ValLeu: 0.0 ± 0.0
ValMet: 1.357 ± 0.798
ValAsn: 2.714 ± 0.98
ValPro: 4.071 ± 6.685
ValGln: 1.357 ± 0.798
ValArg: 5.427 ± 6.238
ValSer: 0.0 ± 0.0
ValThr: 4.071 ± 2.261
ValVal: 1.357 ± 0.798
ValTrp: 2.714 ± 1.596
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 4.071 ± 2.394
TrpCys: 0.0 ± 0.0
TrpAsp: 2.714 ± 2.766
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.357 ± 0.798
TrpGly: 0.0 ± 0.0
TrpHis: 1.357 ± 1.383
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.357 ± 1.383
TrpMet: 1.357 ± 1.061
TrpAsn: 1.357 ± 0.798
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 5.427 ± 3.192
TrpSer: 4.071 ± 1.133
TrpThr: 1.357 ± 2.228
TrpVal: 0.0 ± 0.0
TrpTrp: 1.357 ± 0.798
TrpTyr: 2.714 ± 0.98
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.357 ± 2.228
TyrCys: 1.357 ± 0.798
TyrAsp: 1.357 ± 0.798
TyrGlu: 1.357 ± 0.798
TyrPhe: 2.714 ± 1.596
TyrGly: 2.714 ± 1.596
TyrHis: 0.0 ± 0.0
TyrIle: 0.0 ± 0.0
TyrLys: 1.357 ± 0.798
TyrLeu: 1.357 ± 0.798
TyrMet: 1.357 ± 0.798
TyrAsn: 1.357 ± 1.383
TyrPro: 4.071 ± 2.394
TyrGln: 0.0 ± 0.0
TyrArg: 4.071 ± 1.133
TyrSer: 1.357 ± 0.798
TyrThr: 2.714 ± 1.596
TyrVal: 2.714 ± 0.98
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.357 ± 0.798
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (738 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
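A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the sample standard deviation over the rounds is reported as the error. The function names, the fixed seed, and the use of the standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)
```

With only 3 proteins, as here, each resample can differ strongly from the original set, which is consistent with errors in the table that are often as large as the frequencies themselves.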

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski