Amino acid dipeptide frequency for Glis glis polyomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
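As an illustration of the idea, the following Python sketch counts overlapping dipeptides in a list of sequences and reports them in per mille. It is not the database's own code: the function name `dipeptide_frequencies`, the toy sequences, the use of one-letter residue codes, and the choice to count pairs within each protein and pool them are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return pooled dipeptide frequencies (per mille) for a list of sequences.

    Assumption: overlapping pairs are counted within each protein and then
    pooled over all proteins; the source does not state its exact procedure.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2: residues (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences in one-letter code, not taken from the proteome.
toy = ["MKAAL", "AAGK"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The table on this page uses three-letter codes (for example AlaAla for the pair written `AA` here), so a mapping between the two notations would be needed to compare results directly.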

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
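The arithmetic behind the expected value can be sketched in a few lines of Python. The helper names `expected_uniform_permille` and `classify` are hypothetical, and the sketch assumes the 400-dipeptide baseline quoted in the text; whether the page's colouring uses exactly this threshold is not stated.

```python
def expected_uniform_permille(alphabet_size=20):
    """Uniform expectation per dipeptide, in per mille (1000 / alphabet_size^2)."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


print(expected_uniform_permille(20))            # 20 standard amino acids
print(round(expected_uniform_permille(21), 3))  # with Xaa as a 21st symbol
print(classify(7.778))  # AlaAla value from the table
print(classify(1.111))  # AlaCys value from the table
```

With 20 amino acids the baseline is 2.5‰, which matches the 0.25% given above; including Xaa lowers it to roughly 2.27‰.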

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.778 ± 3.399
AlaCys: 1.111 ± 0.631
AlaAsp: 1.111 ± 0.537
AlaGlu: 3.889 ± 2.167
AlaPhe: 0.556 ± 0.389
AlaGly: 4.444 ± 1.641
AlaHis: 1.667 ± 0.652
AlaIle: 3.333 ± 1.362
AlaLys: 0.556 ± 0.637
AlaLeu: 8.889 ± 3.718
AlaMet: 2.222 ± 0.809
AlaAsn: 3.333 ± 1.048
AlaPro: 3.889 ± 1.898
AlaGln: 3.333 ± 2.238
AlaArg: 5.556 ± 2.832
AlaSer: 5.0 ± 1.42
AlaThr: 7.778 ± 1.307
AlaVal: 6.667 ± 2.494
AlaTrp: 1.111 ± 0.779
AlaTyr: 1.667 ± 0.616
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.556 ± 0.389
CysGlu: 0.0 ± 0.0
CysPhe: 1.667 ± 1.293
CysGly: 1.667 ± 1.069
CysHis: 0.556 ± 0.637
CysIle: 0.0 ± 0.0
CysLys: 3.889 ± 0.778
CysLeu: 2.222 ± 1.133
CysMet: 0.556 ± 0.637
CysAsn: 2.222 ± 0.674
CysPro: 1.667 ± 0.971
CysGln: 0.556 ± 0.389
CysArg: 0.0 ± 0.0
CysSer: 0.556 ± 0.389
CysThr: 1.667 ± 0.616
CysVal: 0.556 ± 0.599
CysTrp: 2.778 ± 1.094
CysTyr: 1.111 ± 0.631
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.333 ± 0.966
AspCys: 0.556 ± 0.599
AspAsp: 1.667 ± 0.723
AspGlu: 2.778 ± 0.912
AspPhe: 1.667 ± 0.652
AspGly: 2.222 ± 1.075
AspHis: 0.556 ± 0.389
AspIle: 6.111 ± 1.955
AspLys: 4.444 ± 1.227
AspLeu: 2.222 ± 1.137
AspMet: 3.333 ± 1.598
AspAsn: 3.333 ± 1.458
AspPro: 4.444 ± 1.191
AspGln: 1.111 ± 0.537
AspArg: 1.667 ± 1.168
AspSer: 2.778 ± 0.85
AspThr: 7.778 ± 5.457
AspVal: 1.111 ± 0.779
AspTrp: 0.556 ± 0.389
AspTyr: 1.667 ± 0.652
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.222 ± 3.431
GluCys: 1.111 ± 0.78
GluAsp: 6.667 ± 3.666
GluGlu: 6.111 ± 1.897
GluPhe: 1.667 ± 1.168
GluGly: 4.444 ± 1.703
GluHis: 0.556 ± 0.599
GluIle: 2.778 ± 1.58
GluLys: 2.222 ± 1.137
GluLeu: 7.222 ± 1.187
GluMet: 0.556 ± 0.389
GluAsn: 3.333 ± 0.738
GluPro: 2.222 ± 1.076
GluGln: 3.889 ± 1.348
GluArg: 2.778 ± 1.324
GluSer: 2.778 ± 1.482
GluThr: 5.0 ± 1.926
GluVal: 4.444 ± 2.05
GluTrp: 0.556 ± 0.637
GluTyr: 3.333 ± 1.091
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.889 ± 1.257
PheCys: 1.667 ± 1.168
PheAsp: 2.778 ± 1.063
PheGlu: 2.222 ± 0.674
PhePhe: 0.0 ± 0.0
PheGly: 3.889 ± 1.038
PheHis: 1.667 ± 0.723
PheIle: 1.667 ± 1.168
PheLys: 0.0 ± 0.0
PheLeu: 2.222 ± 1.029
PheMet: 1.111 ± 0.614
PheAsn: 1.111 ± 0.537
PhePro: 1.667 ± 0.652
PheGln: 1.111 ± 0.904
PheArg: 2.778 ± 0.912
PheSer: 2.778 ± 1.459
PheThr: 3.333 ± 0.713
PheVal: 2.222 ± 1.453
PheTrp: 1.667 ± 0.756
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.222 ± 2.998
GlyCys: 0.0 ± 0.0
GlyAsp: 3.889 ± 0.778
GlyGlu: 3.333 ± 1.856
GlyPhe: 1.111 ± 0.537
GlyGly: 6.667 ± 1.09
GlyHis: 0.0 ± 0.0
GlyIle: 6.111 ± 1.515
GlyLys: 1.667 ± 0.723
GlyLeu: 10.0 ± 3.048
GlyMet: 1.667 ± 1.168
GlyAsn: 2.222 ± 1.009
GlyPro: 6.667 ± 2.656
GlyGln: 3.333 ± 1.43
GlyArg: 1.667 ± 1.069
GlySer: 3.333 ± 1.421
GlyThr: 2.778 ± 0.996
GlyVal: 3.889 ± 1.477
GlyTrp: 0.556 ± 0.599
GlyTyr: 1.667 ± 0.501
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.778 ± 1.063
HisCys: 0.556 ± 0.599
HisAsp: 0.556 ± 0.389
HisGlu: 1.111 ± 0.726
HisPhe: 1.111 ± 0.779
HisGly: 0.556 ± 0.599
HisHis: 1.111 ± 0.779
HisIle: 1.111 ± 0.779
HisLys: 1.111 ± 0.631
HisLeu: 0.556 ± 0.389
HisMet: 0.0 ± 0.0
HisAsn: 1.111 ± 0.779
HisPro: 1.111 ± 0.631
HisGln: 0.0 ± 0.0
HisArg: 1.667 ± 0.833
HisSer: 3.333 ± 1.23
HisThr: 0.556 ± 0.637
HisVal: 0.556 ± 0.389
HisTrp: 0.556 ± 0.599
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.222 ± 0.879
IleCys: 0.556 ± 0.389
IleAsp: 3.333 ± 0.713
IleGlu: 3.889 ± 1.243
IlePhe: 1.667 ± 0.997
IleGly: 0.0 ± 0.0
IleHis: 0.556 ± 0.389
IleIle: 0.556 ± 0.389
IleLys: 0.556 ± 0.389
IleLeu: 8.333 ± 2.316
IleMet: 1.111 ± 0.779
IleAsn: 3.889 ± 1.556
IlePro: 2.778 ± 0.854
IleGln: 1.111 ± 0.683
IleArg: 3.333 ± 0.975
IleSer: 1.111 ± 0.78
IleThr: 2.778 ± 2.238
IleVal: 5.0 ± 1.125
IleTrp: 1.111 ± 0.779
IleTyr: 1.667 ± 0.756
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.667 ± 0.723
LysCys: 2.222 ± 1.263
LysAsp: 0.556 ± 0.599
LysGlu: 2.222 ± 1.137
LysPhe: 1.111 ± 0.537
LysGly: 3.889 ± 1.435
LysHis: 2.778 ± 0.85
LysIle: 1.667 ± 1.069
LysLys: 3.333 ± 1.666
LysLeu: 4.444 ± 1.66
LysMet: 1.111 ± 0.631
LysAsn: 3.889 ± 0.778
LysPro: 1.111 ± 0.73
LysGln: 0.556 ± 0.389
LysArg: 8.333 ± 1.005
LysSer: 2.222 ± 0.791
LysThr: 6.667 ± 1.904
LysVal: 1.667 ± 0.723
LysTrp: 0.0 ± 0.0
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.889 ± 1.97
LeuCys: 1.111 ± 0.537
LeuAsp: 6.111 ± 1.865
LeuGlu: 5.0 ± 1.725
LeuPhe: 6.111 ± 0.8
LeuGly: 7.222 ± 2.486
LeuHis: 2.778 ± 1.378
LeuIle: 4.444 ± 1.156
LeuLys: 2.778 ± 1.947
LeuLeu: 8.333 ± 2.835
LeuMet: 6.667 ± 1.926
LeuAsn: 6.111 ± 0.729
LeuPro: 8.333 ± 2.373
LeuGln: 6.111 ± 2.852
LeuArg: 5.556 ± 1.146
LeuSer: 2.778 ± 0.908
LeuThr: 3.333 ± 2.028
LeuVal: 3.889 ± 1.443
LeuTrp: 2.222 ± 0.826
LeuTyr: 3.333 ± 1.034
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.333 ± 0.409
MetCys: 0.556 ± 0.637
MetAsp: 2.778 ± 1.426
MetGlu: 2.778 ± 0.59
MetPhe: 0.0 ± 0.0
MetGly: 2.222 ± 0.582
MetHis: 1.111 ± 0.779
MetIle: 1.111 ± 0.537
MetLys: 2.222 ± 1.263
MetLeu: 1.111 ± 0.73
MetMet: 0.0 ± 0.0
MetAsn: 1.667 ± 0.833
MetPro: 2.222 ± 1.029
MetGln: 1.111 ± 0.631
MetArg: 1.667 ± 0.616
MetSer: 0.0 ± 0.0
MetThr: 1.111 ± 0.537
MetVal: 0.0 ± 0.0
MetTrp: 0.556 ± 0.599
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.778 ± 1.07
AsnCys: 2.222 ± 0.912
AsnAsp: 0.556 ± 0.599
AsnGlu: 3.889 ± 1.243
AsnPhe: 2.778 ± 0.912
AsnGly: 2.778 ± 1.12
AsnHis: 0.0 ± 0.0
AsnIle: 3.333 ± 1.01
AsnLys: 3.333 ± 1.445
AsnLeu: 6.111 ± 1.886
AsnMet: 0.556 ± 0.536
AsnAsn: 2.222 ± 1.029
AsnPro: 3.333 ± 1.512
AsnGln: 2.222 ± 1.155
AsnArg: 3.333 ± 1.034
AsnSer: 2.778 ± 1.58
AsnThr: 2.778 ± 1.063
AsnVal: 3.333 ± 1.043
AsnTrp: 1.111 ± 0.726
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.778 ± 1.309
ProCys: 1.667 ± 0.616
ProAsp: 4.444 ± 1.1
ProGlu: 4.444 ± 2.245
ProPhe: 2.222 ± 1.075
ProGly: 7.222 ± 2.635
ProHis: 0.0 ± 0.0
ProIle: 2.222 ± 1.648
ProLys: 4.444 ± 1.191
ProLeu: 3.333 ± 1.048
ProMet: 0.556 ± 0.599
ProAsn: 1.111 ± 0.726
ProPro: 3.333 ± 1.612
ProGln: 5.0 ± 1.308
ProArg: 2.778 ± 1.389
ProSer: 5.0 ± 2.365
ProThr: 3.889 ± 1.264
ProVal: 2.222 ± 0.497
ProTrp: 0.0 ± 0.0
ProTyr: 2.778 ± 0.59
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.889 ± 1.477
GlnCys: 1.667 ± 0.723
GlnAsp: 2.778 ± 0.476
GlnGlu: 2.222 ± 0.827
GlnPhe: 1.111 ± 0.779
GlnGly: 3.333 ± 1.43
GlnHis: 0.0 ± 0.0
GlnIle: 3.889 ± 1.477
GlnLys: 3.333 ± 1.005
GlnLeu: 2.222 ± 1.133
GlnMet: 0.0 ± 0.0
GlnAsn: 1.111 ± 0.78
GlnPro: 2.222 ± 2.396
GlnGln: 2.222 ± 1.043
GlnArg: 5.0 ± 2.753
GlnSer: 2.222 ± 1.097
GlnThr: 3.333 ± 0.729
GlnVal: 1.667 ± 1.069
GlnTrp: 1.667 ± 0.652
GlnTyr: 0.556 ± 0.782
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.111 ± 2.134
ArgCys: 0.556 ± 0.637
ArgAsp: 3.333 ± 1.043
ArgGlu: 7.778 ± 2.789
ArgPhe: 2.778 ± 1.063
ArgGly: 3.333 ± 1.074
ArgHis: 1.667 ± 0.833
ArgIle: 2.222 ± 1.243
ArgLys: 5.0 ± 1.485
ArgLeu: 2.778 ± 1.629
ArgMet: 2.778 ± 0.912
ArgAsn: 2.222 ± 1.076
ArgPro: 1.667 ± 1.168
ArgGln: 1.667 ± 0.652
ArgArg: 5.556 ± 1.801
ArgSer: 3.333 ± 1.392
ArgThr: 0.0 ± 0.0
ArgVal: 2.778 ± 1.063
ArgTrp: 0.556 ± 0.599
ArgTyr: 5.556 ± 1.966
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.0 ± 1.285
SerCys: 2.222 ± 0.912
SerAsp: 5.0 ± 2.212
SerGlu: 2.222 ± 0.548
SerPhe: 1.111 ± 0.537
SerGly: 1.667 ± 0.652
SerHis: 0.0 ± 0.0
SerIle: 2.778 ± 0.651
SerLys: 3.333 ± 1.123
SerLeu: 6.667 ± 1.14
SerMet: 0.0 ± 0.0
SerAsn: 3.333 ± 1.034
SerPro: 2.222 ± 0.879
SerGln: 2.778 ± 1.378
SerArg: 2.778 ± 1.378
SerSer: 5.0 ± 1.676
SerThr: 2.222 ± 0.674
SerVal: 3.333 ± 1.048
SerTrp: 1.111 ± 0.726
SerTyr: 2.222 ± 1.263
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.778 ± 1.947
ThrCys: 2.222 ± 1.039
ThrAsp: 1.111 ± 0.537
ThrGlu: 6.111 ± 3.119
ThrPhe: 5.0 ± 1.375
ThrGly: 5.556 ± 1.804
ThrHis: 1.111 ± 0.779
ThrIle: 1.667 ± 0.833
ThrLys: 2.778 ± 0.887
ThrLeu: 6.667 ± 1.129
ThrMet: 1.111 ± 0.538
ThrAsn: 3.889 ± 2.04
ThrPro: 4.444 ± 1.868
ThrGln: 3.889 ± 0.778
ThrArg: 1.667 ± 0.652
ThrSer: 3.333 ± 1.584
ThrThr: 5.0 ± 0.937
ThrVal: 2.778 ± 1.583
ThrTrp: 2.778 ± 1.323
ThrTyr: 1.111 ± 0.779
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.444 ± 1.848
ValCys: 0.556 ± 0.389
ValAsp: 1.667 ± 0.774
ValGlu: 6.111 ± 2.248
ValPhe: 1.111 ± 0.726
ValGly: 2.778 ± 2.307
ValHis: 1.111 ± 0.537
ValIle: 0.0 ± 0.0
ValLys: 2.222 ± 1.648
ValLeu: 7.222 ± 2.198
ValMet: 0.0 ± 0.0
ValAsn: 1.111 ± 0.779
ValPro: 3.889 ± 2.005
ValGln: 1.667 ± 0.723
ValArg: 3.889 ± 1.171
ValSer: 5.556 ± 0.952
ValThr: 2.222 ± 0.883
ValVal: 6.111 ± 1.429
ValTrp: 1.111 ± 1.124
ValTyr: 1.111 ± 0.537
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.111 ± 0.631
TrpAsp: 1.111 ± 0.631
TrpGlu: 1.111 ± 0.537
TrpPhe: 2.222 ± 0.826
TrpGly: 3.333 ± 1.609
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.556 ± 0.389
TrpLeu: 1.667 ± 0.971
TrpMet: 0.556 ± 0.389
TrpAsn: 0.556 ± 0.389
TrpPro: 0.556 ± 0.599
TrpGln: 1.667 ± 0.756
TrpArg: 1.111 ± 0.78
TrpSer: 0.556 ± 0.599
TrpThr: 1.667 ± 0.652
TrpVal: 1.667 ± 0.756
TrpTrp: 0.556 ± 0.389
TrpTyr: 0.556 ± 0.389
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.778 ± 1.856
TyrCys: 0.556 ± 0.637
TyrAsp: 3.333 ± 1.957
TyrGlu: 1.667 ± 0.652
TyrPhe: 2.778 ± 0.887
TyrGly: 0.556 ± 0.389
TyrHis: 2.222 ± 1.263
TyrIle: 0.556 ± 0.599
TyrLys: 1.667 ± 0.833
TyrLeu: 3.889 ± 1.586
TyrMet: 1.111 ± 0.998
TyrAsn: 2.222 ± 0.791
TyrPro: 1.667 ± 1.797
TyrGln: 1.111 ± 0.78
TyrArg: 0.556 ± 0.389
TyrSer: 0.556 ± 0.389
TyrThr: 1.111 ± 0.779
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.222 ± 0.826
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1801 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
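A protein-level bootstrap can be sketched in Python as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting values is reported as the error. This is only one plausible reading of the note. The function names, the toy sequences in one-letter code, the fixed seed, and the use of the standard deviation as the error measure are assumptions, not details confirmed by the source.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Pooled frequency of one dipeptide, in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, same number as the input.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the proteome.
toy = ["MKAAL", "AAGK", "GKAAW"]
print(dipeptide_permille(toy, "AA"), bootstrap_error(toy, "AA"))
print(dipeptide_permille(toy, "WW"), bootstrap_error(toy, "WW"))
```

Under this scheme a dipeptide that occurs in none of the proteins gives 0 in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.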

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski