Amino acid dipeptide frequency for Guama virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
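The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used to build the table: the function names, the one-letter alphabet and the toy sequences are assumptions, and so is the choice to count overlapping pairs within each protein only (the page does not state how protein boundaries are handled).

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (assumed alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(sequences, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Overlapping pairs are counted inside each sequence only (assumption);
    any residue outside `alphabet` is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in alphabet else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    keys = ["".join(p) for p in product(alphabet + "X", repeat=2)]
    return {k: (1000.0 * counts[k] / total if total else 0.0) for k in keys}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not the Guama virus proteome.
toy = ["MKLLA", "AAKL"]
freqs = dipeptide_permille(toy)
print(len(freqs), classify(freqs["KL"]))
```

With real data, the sequences would come from the proteome's FASTA file, and "over" and "under" would correspond to the red and blue cells.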

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
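As a small worked example of the unit conversion, the snippet below turns a per mille value from the table into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical; the AspLeu value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0


asp_leu = 7.643          # AspLeu frequency from the table, in per mille
uniform = 2.5            # uniform expectation for 400 dipeptides, in per mille

print(permille_to_fraction(asp_leu))   # fraction of all dipeptides
print(permille_to_percent(asp_leu))    # same value in percent
print(asp_leu / uniform)               # fold difference versus uniform
```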

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.567 ± 4.034
AlaCys: 1.529 ± 0.687
AlaAsp: 2.038 ± 0.466
AlaGlu: 4.331 ± 1.733
AlaPhe: 1.529 ± 0.596
AlaGly: 1.783 ± 0.806
AlaHis: 1.274 ± 0.446
AlaIle: 3.312 ± 0.313
AlaLys: 3.822 ± 1.03
AlaLeu: 5.096 ± 0.614
AlaMet: 0.255 ± 0.232
AlaAsn: 4.586 ± 0.513
AlaPro: 1.019 ± 0.264
AlaGln: 2.293 ± 1.298
AlaArg: 3.822 ± 1.908
AlaSer: 2.803 ± 1.181
AlaThr: 2.293 ± 0.402
AlaVal: 1.783 ± 0.567
AlaTrp: 0.255 ± 0.155
AlaTyr: 2.293 ± 1.281
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.019 ± 0.264
CysCys: 0.255 ± 0.155
CysAsp: 1.529 ± 0.396
CysGlu: 1.019 ± 0.927
CysPhe: 2.038 ± 1.142
CysGly: 2.803 ± 1.485
CysHis: 1.274 ± 0.801
CysIle: 2.548 ± 0.933
CysLys: 2.038 ± 1.142
CysLeu: 2.548 ± 0.933
CysMet: 0.764 ± 0.344
CysAsn: 2.803 ± 0.851
CysPro: 1.529 ± 0.396
CysGln: 1.019 ± 0.264
CysArg: 0.764 ± 0.695
CysSer: 2.038 ± 0.809
CysThr: 1.529 ± 1.032
CysVal: 1.529 ± 1.032
CysTrp: 0.255 ± 0.155
CysTyr: 1.019 ± 0.264
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.038 ± 0.466
AspCys: 1.019 ± 0.264
AspAsp: 2.293 ± 0.722
AspGlu: 3.567 ± 1.498
AspPhe: 4.841 ± 1.875
AspGly: 1.019 ± 0.3
AspHis: 1.019 ± 0.738
AspIle: 3.567 ± 0.761
AspLys: 4.076 ± 0.851
AspLeu: 7.643 ± 1.389
AspMet: 2.803 ± 1.112
AspAsn: 2.803 ± 0.593
AspPro: 2.038 ± 0.903
AspGln: 1.529 ± 0.396
AspArg: 3.057 ± 1.536
AspSer: 2.038 ± 1.142
AspThr: 3.312 ± 1.011
AspVal: 2.803 ± 0.441
AspTrp: 0.51 ± 0.311
AspTyr: 1.529 ± 0.343
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.076 ± 1.323
GluCys: 1.274 ± 0.466
GluAsp: 2.548 ± 0.989
GluGlu: 5.35 ± 1.072
GluPhe: 6.624 ± 2.211
GluGly: 1.529 ± 0.343
GluHis: 2.293 ± 0.522
GluIle: 6.879 ± 2.23
GluLys: 3.822 ± 1.112
GluLeu: 7.134 ± 0.838
GluMet: 3.312 ± 0.955
GluAsn: 2.293 ± 0.402
GluPro: 2.038 ± 0.523
GluGln: 2.803 ± 0.593
GluArg: 3.057 ± 1.609
GluSer: 4.586 ± 1.254
GluThr: 3.822 ± 1.112
GluVal: 1.783 ± 0.914
GluTrp: 0.255 ± 0.155
GluTyr: 2.548 ± 0.891
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.293 ± 0.505
PheCys: 2.803 ± 0.593
PheAsp: 2.803 ± 0.593
PheGlu: 2.803 ± 1.286
PhePhe: 3.567 ± 1.177
PheGly: 3.312 ± 1.481
PheHis: 1.019 ± 0.3
PheIle: 2.548 ± 1.44
PheLys: 5.096 ± 1.262
PheLeu: 5.096 ± 4.194
PheMet: 1.783 ± 0.749
PheAsn: 2.293 ± 0.743
PhePro: 1.019 ± 0.786
PheGln: 2.038 ± 0.425
PheArg: 3.057 ± 0.263
PheSer: 5.86 ± 1.431
PheThr: 2.293 ± 0.515
PheVal: 2.803 ± 0.632
PheTrp: 0.764 ± 0.466
PheTyr: 2.548 ± 1.257
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.529 ± 0.343
GlyCys: 2.803 ± 1.833
GlyAsp: 3.822 ± 1.216
GlyGlu: 3.057 ± 0.899
GlyPhe: 0.255 ± 0.232
GlyGly: 0.51 ± 0.311
GlyHis: 0.764 ± 0.172
GlyIle: 5.096 ± 1.817
GlyLys: 3.057 ± 1.375
GlyLeu: 3.567 ± 0.374
GlyMet: 1.274 ± 0.466
GlyAsn: 3.567 ± 0.926
GlyPro: 1.529 ± 0.396
GlyGln: 1.274 ± 0.777
GlyArg: 2.038 ± 0.466
GlySer: 1.783 ± 1.622
GlyThr: 3.567 ± 1.917
GlyVal: 2.038 ± 0.528
GlyTrp: 0.764 ± 0.172
GlyTyr: 2.038 ± 0.661
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.019 ± 0.961
HisCys: 0.255 ± 0.232
HisAsp: 0.764 ± 0.723
HisGlu: 0.764 ± 0.344
HisPhe: 1.529 ± 0.933
HisGly: 2.038 ± 0.523
HisHis: 0.764 ± 0.723
HisIle: 1.274 ± 0.264
HisLys: 2.293 ± 0.743
HisLeu: 1.529 ± 0.906
HisMet: 0.764 ± 0.723
HisAsn: 1.529 ± 0.596
HisPro: 0.764 ± 0.344
HisGln: 0.764 ± 0.172
HisArg: 0.764 ± 0.723
HisSer: 1.274 ± 0.264
HisThr: 1.529 ± 0.687
HisVal: 1.274 ± 0.264
HisTrp: 0.51 ± 0.311
HisTyr: 1.019 ± 0.264
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.586 ± 0.155
IleCys: 2.293 ± 1.031
IleAsp: 3.312 ± 0.649
IleGlu: 6.115 ± 1.283
IlePhe: 3.567 ± 0.924
IleGly: 4.586 ± 1.354
IleHis: 2.038 ± 1.244
IleIle: 5.605 ± 1.185
IleLys: 5.605 ± 0.296
IleLeu: 6.879 ± 1.442
IleMet: 2.038 ± 0.809
IleAsn: 3.567 ± 0.926
IlePro: 3.567 ± 1.134
IleGln: 3.822 ± 0.95
IleArg: 4.841 ± 1.359
IleSer: 4.841 ± 0.383
IleThr: 5.096 ± 0.695
IleVal: 3.312 ± 1.011
IleTrp: 0.51 ± 0.311
IleTyr: 2.038 ± 1.356
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.567 ± 1.134
LysCys: 2.038 ± 0.809
LysAsp: 6.369 ± 1.484
LysGlu: 5.35 ± 1.112
LysPhe: 3.567 ± 1.404
LysGly: 4.841 ± 1.142
LysHis: 2.548 ± 1.272
LysIle: 6.115 ± 1.393
LysLys: 6.115 ± 1.031
LysLeu: 7.643 ± 0.17
LysMet: 2.293 ± 0.722
LysAsn: 2.803 ± 1.188
LysPro: 3.312 ± 0.982
LysGln: 2.803 ± 1.734
LysArg: 1.274 ± 0.777
LysSer: 3.312 ± 0.761
LysThr: 5.86 ± 1.213
LysVal: 3.312 ± 0.761
LysTrp: 0.51 ± 0.311
LysTyr: 2.803 ± 0.593
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.841 ± 1.939
LeuCys: 2.803 ± 1.485
LeuAsp: 5.35 ± 1.641
LeuGlu: 7.643 ± 1.834
LeuPhe: 4.076 ± 0.851
LeuGly: 2.803 ± 0.841
LeuHis: 2.038 ± 0.523
LeuIle: 5.86 ± 0.87
LeuLys: 7.643 ± 1.917
LeuLeu: 6.879 ± 1.725
LeuMet: 2.803 ± 0.653
LeuAsn: 6.879 ± 0.702
LeuPro: 3.057 ± 0.263
LeuGln: 3.057 ± 0.642
LeuArg: 2.803 ± 0.441
LeuSer: 6.879 ± 0.34
LeuThr: 4.841 ± 1.653
LeuVal: 5.86 ± 1.415
LeuTrp: 0.255 ± 0.155
LeuTyr: 3.312 ± 0.801
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.274 ± 0.446
MetCys: 1.019 ± 0.571
MetAsp: 2.548 ± 0.631
MetGlu: 2.038 ± 0.779
MetPhe: 1.274 ± 0.641
MetGly: 1.783 ± 0.593
MetHis: 0.255 ± 0.155
MetIle: 2.803 ± 0.298
MetLys: 2.548 ± 0.396
MetLeu: 3.312 ± 0.687
MetMet: 1.274 ± 0.264
MetAsn: 1.783 ± 0.914
MetPro: 2.038 ± 0.466
MetGln: 0.255 ± 0.155
MetArg: 1.019 ± 1.56
MetSer: 3.822 ± 0.917
MetThr: 2.038 ± 0.661
MetVal: 1.529 ± 0.343
MetTrp: 0.0 ± 0.0
MetTyr: 0.255 ± 0.155
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.822 ± 1.03
AsnCys: 1.019 ± 0.571
AsnAsp: 3.822 ± 0.917
AsnGlu: 4.076 ± 0.429
AsnPhe: 2.293 ± 1.369
AsnGly: 0.51 ± 0.464
AsnHis: 2.038 ± 0.523
AsnIle: 4.841 ± 0.765
AsnLys: 4.586 ± 1.03
AsnLeu: 4.586 ± 0.95
AsnMet: 2.293 ± 1.298
AsnAsn: 3.567 ± 1.404
AsnPro: 3.822 ± 0.219
AsnGln: 2.038 ± 0.425
AsnArg: 2.803 ± 0.593
AsnSer: 2.803 ± 0.851
AsnThr: 3.822 ± 0.548
AsnVal: 2.548 ± 0.392
AsnTrp: 0.255 ± 0.155
AsnTyr: 2.293 ± 0.743
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.038 ± 1.572
ProCys: 0.764 ± 0.172
ProAsp: 1.019 ± 0.3
ProGlu: 3.312 ± 0.687
ProPhe: 2.038 ± 0.528
ProGly: 2.038 ± 0.466
ProHis: 1.019 ± 0.571
ProIle: 3.822 ± 0.95
ProLys: 2.803 ± 0.632
ProLeu: 2.038 ± 0.661
ProMet: 0.51 ± 0.132
ProAsn: 1.783 ± 0.519
ProPro: 0.51 ± 0.311
ProGln: 1.019 ± 0.622
ProArg: 0.764 ± 0.344
ProSer: 3.567 ± 0.418
ProThr: 2.293 ± 0.522
ProVal: 2.548 ± 0.528
ProTrp: 0.764 ± 0.767
ProTyr: 0.764 ± 0.466
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.529 ± 0.933
GlnCys: 1.274 ± 0.466
GlnAsp: 1.274 ± 0.264
GlnGlu: 2.038 ± 0.6
GlnPhe: 1.274 ± 0.446
GlnGly: 1.783 ± 0.567
GlnHis: 0.51 ± 0.132
GlnIle: 3.057 ± 0.533
GlnLys: 3.567 ± 2.226
GlnLeu: 2.803 ± 0.632
GlnMet: 1.019 ± 0.666
GlnAsn: 1.274 ± 0.636
GlnPro: 0.51 ± 0.132
GlnGln: 1.529 ± 1.446
GlnArg: 1.783 ± 1.088
GlnSer: 3.822 ± 0.891
GlnThr: 2.038 ± 0.6
GlnVal: 2.038 ± 0.425
GlnTrp: 0.255 ± 0.822
GlnTyr: 1.529 ± 0.688
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.529 ± 1.446
ArgCys: 2.038 ± 0.809
ArgAsp: 3.822 ± 1.986
ArgGlu: 2.803 ± 1.041
ArgPhe: 2.548 ± 0.396
ArgGly: 0.255 ± 0.232
ArgHis: 0.51 ± 0.311
ArgIle: 3.567 ± 0.779
ArgLys: 2.293 ± 0.722
ArgLeu: 3.822 ± 1.651
ArgMet: 2.038 ± 0.601
ArgAsn: 4.076 ± 3.614
ArgPro: 1.019 ± 0.738
ArgGln: 1.274 ± 0.833
ArgArg: 2.548 ± 0.528
ArgSer: 3.312 ± 0.649
ArgThr: 1.529 ± 0.396
ArgVal: 3.057 ± 1.159
ArgTrp: 0.255 ± 0.232
ArgTyr: 1.783 ± 0.749
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.076 ± 1.082
SerCys: 2.548 ± 1.602
SerAsp: 4.841 ± 0.128
SerGlu: 3.312 ± 0.53
SerPhe: 3.567 ± 0.733
SerGly: 3.822 ± 0.95
SerHis: 1.019 ± 0.622
SerIle: 5.35 ± 1.142
SerLys: 6.115 ± 1.18
SerLeu: 6.879 ± 0.904
SerMet: 3.057 ± 0.899
SerAsn: 2.293 ± 1.372
SerPro: 2.293 ± 0.743
SerGln: 2.293 ± 0.522
SerArg: 3.057 ± 0.899
SerSer: 4.076 ± 1.057
SerThr: 4.586 ± 1.487
SerVal: 5.096 ± 1.056
SerTrp: 1.019 ± 0.3
SerTyr: 2.038 ± 1.142
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.567 ± 0.779
ThrCys: 1.529 ± 0.687
ThrAsp: 2.803 ± 0.509
ThrGlu: 3.822 ± 0.958
ThrPhe: 3.567 ± 0.374
ThrGly: 3.312 ± 1.275
ThrHis: 0.255 ± 0.232
ThrIle: 3.822 ± 0.891
ThrLys: 3.312 ± 0.801
ThrLeu: 4.331 ± 3.482
ThrMet: 1.274 ± 0.466
ThrAsn: 3.567 ± 0.924
ThrPro: 2.038 ± 0.903
ThrGln: 1.783 ± 0.463
ThrArg: 3.057 ± 1.242
ThrSer: 4.841 ± 1.152
ThrThr: 3.057 ± 1.375
ThrVal: 3.312 ± 0.313
ThrTrp: 1.019 ± 0.738
ThrTyr: 4.076 ± 0.884
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.293 ± 0.522
ValCys: 1.783 ± 1.263
ValAsp: 1.274 ± 0.264
ValGlu: 2.803 ± 0.298
ValPhe: 4.841 ± 0.779
ValGly: 2.038 ± 0.528
ValHis: 0.51 ± 0.311
ValIle: 2.803 ± 0.761
ValLys: 4.076 ± 1.199
ValLeu: 4.586 ± 1.443
ValMet: 1.274 ± 0.264
ValAsn: 2.803 ± 1.474
ValPro: 2.038 ± 0.466
ValGln: 2.038 ± 1.356
ValArg: 2.038 ± 0.466
ValSer: 5.86 ± 0.406
ValThr: 2.038 ± 0.528
ValVal: 1.783 ± 0.593
ValTrp: 0.51 ± 0.464
ValTyr: 2.548 ± 0.933
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.255 ± 0.155
TrpCys: 0.0 ± 0.0
TrpAsp: 0.51 ± 0.78
TrpGlu: 0.255 ± 0.155
TrpPhe: 0.51 ± 0.132
TrpGly: 1.529 ± 0.396
TrpHis: 0.0 ± 0.0
TrpIle: 0.255 ± 0.155
TrpLys: 0.0 ± 0.0
TrpLeu: 1.019 ± 0.264
TrpMet: 0.255 ± 0.822
TrpAsn: 0.764 ± 0.466
TrpPro: 0.0 ± 0.0
TrpGln: 0.764 ± 0.466
TrpArg: 0.0 ± 0.0
TrpSer: 1.019 ± 0.622
TrpThr: 0.255 ± 0.822
TrpVal: 0.51 ± 0.311
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.764 ± 0.172
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.764 ± 0.723
TyrCys: 1.274 ± 0.466
TyrAsp: 0.764 ± 0.767
TyrGlu: 3.567 ± 0.924
TyrPhe: 2.293 ± 0.515
TyrGly: 2.038 ± 0.967
TyrHis: 1.274 ± 0.466
TyrIle: 5.096 ± 1.499
TyrLys: 3.567 ± 1.495
TyrLeu: 2.548 ± 1.283
TyrMet: 1.529 ± 0.343
TyrAsn: 2.803 ± 0.761
TyrPro: 1.274 ± 0.466
TyrGln: 0.51 ± 0.311
TyrArg: 1.529 ± 0.688
TyrSer: 2.803 ± 0.593
TyrThr: 2.548 ± 0.891
TyrVal: 1.019 ± 0.927
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.019 ± 0.264
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3926 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
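One way such a protein-level bootstrap could look is sketched below. The page does not describe the exact procedure, so the details are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome, not the Guama virus sequences.
toy_proteins = ["MKLLAKL", "AAKLGG", "KLKLKL"]
print(pair_permille(toy_proteins, "KL"), bootstrap_error(toy_proteins, "KL"))
```

With only 3 proteins, the resamples differ mostly in which protein dominates, which may explain why some errors in the table are larger than the values themselves.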

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski