Amino acid dipepetide frequency for Gamboa virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.599AlaAla: 2.599 ± 1.625
0.945AlaCys: 0.945 ± 0.695
2.363AlaAsp: 2.363 ± 0.61
3.781AlaGlu: 3.781 ± 1.061
1.89AlaPhe: 1.89 ± 0.851
1.89AlaGly: 1.89 ± 0.488
0.473AlaHis: 0.473 ± 0.314
5.671AlaIle: 5.671 ± 1.098
2.836AlaLys: 2.836 ± 0.822
5.671AlaLeu: 5.671 ± 1.852
2.599AlaMet: 2.599 ± 0.474
2.127AlaAsn: 2.127 ± 0.848
0.945AlaPro: 0.945 ± 0.695
1.654AlaGln: 1.654 ± 0.384
2.363AlaArg: 2.363 ± 1.608
4.017AlaSer: 4.017 ± 1.806
2.599AlaThr: 2.599 ± 1.372
1.89AlaVal: 1.89 ± 0.52
0.236AlaTrp: 0.236 ± 0.157
2.127AlaTyr: 2.127 ± 0.907
0.0AlaXaa: 0.0 ± 0.0
Cys
1.418CysAla: 1.418 ± 0.378
0.236CysCys: 0.236 ± 0.208
0.473CysAsp: 0.473 ± 0.415
1.181CysGlu: 1.181 ± 0.709
1.654CysPhe: 1.654 ± 0.658
1.654CysGly: 1.654 ± 1.454
1.181CysHis: 1.181 ± 0.412
2.127CysIle: 2.127 ± 0.798
2.599CysLys: 2.599 ± 1.625
3.781CysLeu: 3.781 ± 1.69
1.181CysMet: 1.181 ± 0.624
2.127CysAsn: 2.127 ± 0.498
1.181CysPro: 1.181 ± 0.412
1.89CysGln: 1.89 ± 0.488
0.473CysArg: 0.473 ± 1.484
1.181CysSer: 1.181 ± 0.277
2.363CysThr: 2.363 ± 1.555
1.418CysVal: 1.418 ± 1.246
0.0CysTrp: 0.0 ± 0.0
1.181CysTyr: 1.181 ± 0.412
0.0CysXaa: 0.0 ± 0.0
Asp
2.363AspAla: 2.363 ± 1.107
1.654AspCys: 1.654 ± 0.384
2.599AspAsp: 2.599 ± 0.992
4.49AspGlu: 4.49 ± 1.482
4.253AspPhe: 4.253 ± 1.341
1.89AspGly: 1.89 ± 0.339
1.654AspHis: 1.654 ± 0.42
5.671AspIle: 5.671 ± 1.809
2.363AspLys: 2.363 ± 0.554
5.198AspLeu: 5.198 ± 0.774
2.127AspMet: 2.127 ± 0.539
3.072AspAsn: 3.072 ± 1.068
2.127AspPro: 2.127 ± 0.988
2.363AspGln: 2.363 ± 0.373
2.363AspArg: 2.363 ± 1.568
3.072AspSer: 3.072 ± 0.804
3.072AspThr: 3.072 ± 0.733
2.599AspVal: 2.599 ± 0.615
0.236AspTrp: 0.236 ± 0.208
1.654AspTyr: 1.654 ± 0.384
0.0AspXaa: 0.0 ± 0.0
Glu
3.544GluAla: 3.544 ± 0.328
1.654GluCys: 1.654 ± 0.805
4.49GluAsp: 4.49 ± 1.042
4.253GluGlu: 4.253 ± 0.698
3.072GluPhe: 3.072 ± 0.796
2.127GluGly: 2.127 ± 0.498
2.836GluHis: 2.836 ± 0.732
6.144GluIle: 6.144 ± 1.765
4.017GluLys: 4.017 ± 1.005
5.435GluLeu: 5.435 ± 1.019
2.127GluMet: 2.127 ± 0.848
3.544GluAsn: 3.544 ± 1.157
2.599GluPro: 2.599 ± 0.875
2.836GluGln: 2.836 ± 0.719
2.836GluArg: 2.836 ± 1.249
3.072GluSer: 3.072 ± 0.62
1.181GluThr: 1.181 ± 0.714
1.89GluVal: 1.89 ± 0.339
0.945GluTrp: 0.945 ± 0.503
3.544GluTyr: 3.544 ± 0.708
0.0GluXaa: 0.0 ± 0.0
Phe
2.363PheAla: 2.363 ± 0.846
1.418PheCys: 1.418 ± 0.378
2.836PheAsp: 2.836 ± 0.309
2.599PheGlu: 2.599 ± 0.273
3.072PhePhe: 3.072 ± 1.153
2.127PheGly: 2.127 ± 0.394
0.945PheHis: 0.945 ± 0.244
3.072PheIle: 3.072 ± 0.276
4.017PheLys: 4.017 ± 1.124
5.671PheLeu: 5.671 ± 0.534
0.473PheMet: 0.473 ± 0.136
3.072PheAsn: 3.072 ± 1.123
1.181PhePro: 1.181 ± 1.004
0.945PheGln: 0.945 ± 0.642
2.127PheArg: 2.127 ± 1.093
4.253PheSer: 4.253 ± 1.341
3.781PheThr: 3.781 ± 0.365
2.127PheVal: 2.127 ± 0.32
0.236PheTrp: 0.236 ± 0.157
1.418PheTyr: 1.418 ± 0.378
0.0PheXaa: 0.0 ± 0.0
Gly
1.89GlyAla: 1.89 ± 1.155
2.363GlyCys: 2.363 ± 0.61
3.308GlyAsp: 3.308 ± 0.325
2.836GlyGlu: 2.836 ± 0.6
0.945GlyPhe: 0.945 ± 0.504
0.945GlyGly: 0.945 ± 0.244
0.236GlyHis: 0.236 ± 0.157
3.544GlyIle: 3.544 ± 0.747
2.363GlyLys: 2.363 ± 0.373
4.017GlyLeu: 4.017 ± 1.124
0.0GlyMet: 0.0 ± 0.0
2.363GlyAsn: 2.363 ± 0.61
0.945GlyPro: 0.945 ± 0.504
1.89GlyGln: 1.89 ± 0.712
3.072GlyArg: 3.072 ± 0.595
4.017GlySer: 4.017 ± 1.278
2.127GlyThr: 2.127 ± 1.08
1.89GlyVal: 1.89 ± 0.816
1.181GlyTrp: 1.181 ± 0.412
2.127GlyTyr: 2.127 ± 0.394
0.0GlyXaa: 0.0 ± 0.0
His
2.127HisAla: 2.127 ± 0.993
0.945HisCys: 0.945 ± 0.244
2.127HisAsp: 2.127 ± 0.32
1.181HisGlu: 1.181 ± 0.412
0.945HisPhe: 0.945 ± 0.504
1.654HisGly: 1.654 ± 1.092
0.236HisHis: 0.236 ± 0.157
1.89HisIle: 1.89 ± 0.458
3.544HisLys: 3.544 ± 0.522
1.654HisLeu: 1.654 ± 0.509
0.473HisMet: 0.473 ± 0.415
0.709HisAsn: 0.709 ± 0.302
1.181HisPro: 1.181 ± 0.474
0.0HisGln: 0.0 ± 0.0
1.418HisArg: 1.418 ± 0.411
1.654HisSer: 1.654 ± 0.509
1.89HisThr: 1.89 ± 0.458
1.181HisVal: 1.181 ± 0.802
0.236HisTrp: 0.236 ± 0.157
0.709HisTyr: 0.709 ± 0.554
0.0HisXaa: 0.0 ± 0.0
Ile
3.781IleAla: 3.781 ± 0.917
2.363IleCys: 2.363 ± 0.823
4.962IleAsp: 4.962 ± 1.152
5.671IleGlu: 5.671 ± 0.756
4.726IlePhe: 4.726 ± 1.112
3.308IleGly: 3.308 ± 1.272
2.363IleHis: 2.363 ± 0.823
5.671IleIle: 5.671 ± 0.986
8.979IleLys: 8.979 ± 1.181
8.507IleLeu: 8.507 ± 1.518
1.654IleMet: 1.654 ± 0.782
5.198IleAsn: 5.198 ± 1.104
3.072IlePro: 3.072 ± 0.595
3.544IleGln: 3.544 ± 0.816
3.308IleArg: 3.308 ± 0.697
6.144IleSer: 6.144 ± 1.222
4.726IleThr: 4.726 ± 1.125
2.127IleVal: 2.127 ± 0.498
0.473IleTrp: 0.473 ± 0.314
2.127IleTyr: 2.127 ± 0.671
0.0IleXaa: 0.0 ± 0.0
Lys
2.599LysAla: 2.599 ± 1.118
2.599LysCys: 2.599 ± 1.034
3.544LysAsp: 3.544 ± 0.522
5.671LysGlu: 5.671 ± 1.712
1.89LysPhe: 1.89 ± 0.458
2.599LysGly: 2.599 ± 0.644
1.418LysHis: 1.418 ± 0.378
6.616LysIle: 6.616 ± 1.536
4.017LysLys: 4.017 ± 1.486
8.27LysLeu: 8.27 ± 1.308
1.418LysMet: 1.418 ± 0.386
4.017LysAsn: 4.017 ± 0.411
3.072LysPro: 3.072 ± 1.123
2.599LysGln: 2.599 ± 0.273
1.418LysArg: 1.418 ± 0.627
4.726LysSer: 4.726 ± 0.527
6.616LysThr: 6.616 ± 1.903
5.198LysVal: 5.198 ± 1.777
1.418LysTrp: 1.418 ± 0.378
3.544LysTyr: 3.544 ± 1.003
0.0LysXaa: 0.0 ± 0.0
Leu
5.671LeuAla: 5.671 ± 1.959
2.363LeuCys: 2.363 ± 1.419
4.726LeuAsp: 4.726 ± 0.867
6.144LeuGlu: 6.144 ± 1.192
3.781LeuPhe: 3.781 ± 0.855
3.781LeuGly: 3.781 ± 1.563
3.544LeuHis: 3.544 ± 1.364
5.198LeuIle: 5.198 ± 1.367
7.798LeuLys: 7.798 ± 1.933
9.452LeuLeu: 9.452 ± 2.776
2.363LeuMet: 2.363 ± 0.76
5.907LeuAsn: 5.907 ± 1.66
4.962LeuPro: 4.962 ± 0.788
2.599LeuGln: 2.599 ± 1.448
4.726LeuArg: 4.726 ± 1.22
7.561LeuSer: 7.561 ± 0.816
6.853LeuThr: 6.853 ± 0.998
4.726LeuVal: 4.726 ± 0.991
0.236LeuTrp: 0.236 ± 0.157
3.544LeuTyr: 3.544 ± 0.712
0.0LeuXaa: 0.0 ± 0.0
Met
0.709MetAla: 0.709 ± 0.623
0.945MetCys: 0.945 ± 0.325
1.418MetAsp: 1.418 ± 0.941
1.418MetGlu: 1.418 ± 0.888
1.181MetPhe: 1.181 ± 0.476
1.181MetGly: 1.181 ± 0.498
0.473MetHis: 0.473 ± 0.707
3.308MetIle: 3.308 ± 2.53
1.654MetLys: 1.654 ± 0.509
1.89MetLeu: 1.89 ± 0.458
0.945MetMet: 0.945 ± 0.627
2.127MetAsn: 2.127 ± 0.394
0.473MetPro: 0.473 ± 0.122
0.709MetGln: 0.709 ± 0.47
1.654MetArg: 1.654 ± 0.4
3.072MetSer: 3.072 ± 1.02
0.236MetThr: 0.236 ± 0.208
2.599MetVal: 2.599 ± 1.101
0.236MetTrp: 0.236 ± 0.208
1.181MetTyr: 1.181 ± 0.277
0.0MetXaa: 0.0 ± 0.0
Asn
2.127AsnAla: 2.127 ± 0.509
1.181AsnCys: 1.181 ± 1.038
2.836AsnAsp: 2.836 ± 0.976
2.127AsnGlu: 2.127 ± 0.798
3.072AsnPhe: 3.072 ± 0.435
1.418AsnGly: 1.418 ± 0.366
1.89AsnHis: 1.89 ± 0.448
3.781AsnIle: 3.781 ± 0.855
2.836AsnLys: 2.836 ± 0.57
5.907AsnLeu: 5.907 ± 1.028
1.181AsnMet: 1.181 ± 0.277
1.89AsnAsn: 1.89 ± 0.448
3.781AsnPro: 3.781 ± 0.855
2.599AsnGln: 2.599 ± 0.644
2.836AsnArg: 2.836 ± 0.976
4.726AsnSer: 4.726 ± 0.527
2.836AsnThr: 2.836 ± 0.722
3.308AsnVal: 3.308 ± 0.607
1.181AsnTrp: 1.181 ± 0.498
2.599AsnTyr: 2.599 ± 0.833
0.0AsnXaa: 0.0 ± 0.0
Pro
2.363ProAla: 2.363 ± 1.355
0.709ProCys: 0.709 ± 1.471
2.363ProAsp: 2.363 ± 1.049
3.781ProGlu: 3.781 ± 0.855
1.181ProPhe: 1.181 ± 0.412
2.836ProGly: 2.836 ± 0.895
0.236ProHis: 0.236 ± 0.208
4.962ProIle: 4.962 ± 0.617
2.127ProLys: 2.127 ± 0.644
1.654ProLeu: 1.654 ± 1.163
0.709ProMet: 0.709 ± 0.47
1.181ProAsn: 1.181 ± 0.412
0.0ProPro: 0.0 ± 0.0
0.0ProGln: 0.0 ± 0.0
1.654ProArg: 1.654 ± 1.289
2.363ProSer: 2.363 ± 0.695
1.654ProThr: 1.654 ± 0.658
2.836ProVal: 2.836 ± 0.57
0.709ProTrp: 0.709 ± 0.706
1.181ProTyr: 1.181 ± 0.474
0.0ProXaa: 0.0 ± 0.0
Gln
1.654GlnAla: 1.654 ± 0.578
0.709GlnCys: 0.709 ± 0.67
2.363GlnAsp: 2.363 ± 0.301
1.181GlnGlu: 1.181 ± 0.498
1.89GlnPhe: 1.89 ± 0.651
0.945GlnGly: 0.945 ± 0.325
1.181GlnHis: 1.181 ± 0.476
3.072GlnIle: 3.072 ± 0.883
1.89GlnLys: 1.89 ± 0.651
2.127GlnLeu: 2.127 ± 0.957
0.473GlnMet: 0.473 ± 0.519
2.363GlnAsn: 2.363 ± 0.91
0.709GlnPro: 0.709 ± 0.67
1.418GlnGln: 1.418 ± 0.658
3.308GlnArg: 3.308 ± 1.311
1.654GlnSer: 1.654 ± 0.578
4.49GlnThr: 4.49 ± 0.742
1.418GlnVal: 1.418 ± 0.366
0.473GlnTrp: 0.473 ± 0.122
0.709GlnTyr: 0.709 ± 0.189
0.0GlnXaa: 0.0 ± 0.0
Arg
0.709ArgAla: 0.709 ± 0.554
2.127ArgCys: 2.127 ± 0.907
2.599ArgAsp: 2.599 ± 1.101
3.072ArgGlu: 3.072 ± 0.883
1.89ArgPhe: 1.89 ± 0.458
0.945ArgGly: 0.945 ± 0.831
0.709ArgHis: 0.709 ± 0.189
4.49ArgIle: 4.49 ± 1.463
4.962ArgLys: 4.962 ± 1.324
5.435ArgLeu: 5.435 ± 3.371
1.89ArgMet: 1.89 ± 0.845
3.781ArgAsn: 3.781 ± 1.635
0.945ArgPro: 0.945 ± 0.628
1.89ArgGln: 1.89 ± 1.161
4.017ArgArg: 4.017 ± 2.924
4.017ArgSer: 4.017 ± 1.083
3.072ArgThr: 3.072 ± 2.617
1.89ArgVal: 1.89 ± 0.339
0.236ArgTrp: 0.236 ± 0.208
1.654ArgTyr: 1.654 ± 0.62
0.0ArgXaa: 0.0 ± 0.0
Ser
3.072SerAla: 3.072 ± 0.733
2.836SerCys: 2.836 ± 0.992
4.253SerAsp: 4.253 ± 2.158
3.544SerGlu: 3.544 ± 1.103
3.308SerPhe: 3.308 ± 0.854
3.544SerGly: 3.544 ± 0.522
1.418SerHis: 1.418 ± 0.627
4.726SerIle: 4.726 ± 1.112
4.962SerLys: 4.962 ± 1.152
6.853SerLeu: 6.853 ± 1.829
2.836SerMet: 2.836 ± 0.636
2.836SerAsn: 2.836 ± 1.209
1.89SerPro: 1.89 ± 0.851
2.363SerGln: 2.363 ± 0.798
4.253SerArg: 4.253 ± 0.941
5.907SerSer: 5.907 ± 3.218
6.853SerThr: 6.853 ± 2.232
4.49SerVal: 4.49 ± 0.51
0.709SerTrp: 0.709 ± 0.7
2.363SerTyr: 2.363 ± 1.107
0.0SerXaa: 0.0 ± 0.0
Thr
5.671ThrAla: 5.671 ± 3.266
0.709ThrCys: 0.709 ± 0.623
2.127ThrAsp: 2.127 ± 0.498
4.253ThrGlu: 4.253 ± 0.927
3.544ThrPhe: 3.544 ± 1.342
5.435ThrGly: 5.435 ± 2.112
1.654ThrHis: 1.654 ± 0.922
3.781ThrIle: 3.781 ± 0.834
4.49ThrLys: 4.49 ± 0.711
4.962ThrLeu: 4.962 ± 2.143
1.654ThrMet: 1.654 ± 1.349
1.89ThrAsn: 1.89 ± 0.937
2.363ThrPro: 2.363 ± 0.91
1.418ThrGln: 1.418 ± 0.411
2.599ThrArg: 2.599 ± 1.168
3.781ThrSer: 3.781 ± 0.496
3.781ThrThr: 3.781 ± 0.976
6.616ThrVal: 6.616 ± 2.235
0.709ThrTrp: 0.709 ± 0.302
2.599ThrTyr: 2.599 ± 0.273
0.0ThrXaa: 0.0 ± 0.0
Val
1.654ValAla: 1.654 ± 0.527
1.89ValCys: 1.89 ± 0.712
2.599ValAsp: 2.599 ± 1.013
2.836ValGlu: 2.836 ± 0.822
3.308ValPhe: 3.308 ± 0.743
2.127ValGly: 2.127 ± 0.394
1.89ValHis: 1.89 ± 0.488
4.017ValIle: 4.017 ± 0.609
3.544ValLys: 3.544 ± 1.428
4.962ValLeu: 4.962 ± 0.485
0.945ValMet: 0.945 ± 0.325
3.308ValAsn: 3.308 ± 0.325
1.89ValPro: 1.89 ± 0.488
1.654ValGln: 1.654 ± 0.98
2.836ValArg: 2.836 ± 0.57
4.017ValSer: 4.017 ± 0.623
3.308ValThr: 3.308 ± 1.241
2.127ValVal: 2.127 ± 1.812
0.236ValTrp: 0.236 ± 0.742
2.836ValTyr: 2.836 ± 0.658
0.0ValXaa: 0.0 ± 0.0
Trp
0.473TrpAla: 0.473 ± 0.122
0.473TrpCys: 0.473 ± 0.122
0.473TrpAsp: 0.473 ± 0.314
0.236TrpGlu: 0.236 ± 0.157
0.709TrpPhe: 0.709 ± 0.302
0.473TrpGly: 0.473 ± 0.743
0.236TrpHis: 0.236 ± 0.208
1.181TrpIle: 1.181 ± 0.709
0.236TrpLys: 0.236 ± 0.646
0.945TrpLeu: 0.945 ± 0.325
0.236TrpMet: 0.236 ± 0.646
0.473TrpAsn: 0.473 ± 0.314
0.236TrpPro: 0.236 ± 0.208
0.709TrpGln: 0.709 ± 1.113
0.945TrpArg: 0.945 ± 0.244
1.181TrpSer: 1.181 ± 0.784
0.473TrpThr: 0.473 ± 0.415
0.236TrpVal: 0.236 ± 0.157
0.0TrpTrp: 0.0 ± 0.0
0.236TrpTyr: 0.236 ± 0.157
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.89TyrAla: 1.89 ± 0.712
1.181TyrCys: 1.181 ± 0.668
2.363TyrAsp: 2.363 ± 0.995
1.89TyrGlu: 1.89 ± 0.937
1.418TyrPhe: 1.418 ± 0.627
1.181TyrGly: 1.181 ± 0.277
1.181TyrHis: 1.181 ± 0.709
3.781TyrIle: 3.781 ± 1.711
4.253TyrLys: 4.253 ± 0.996
3.544TyrLeu: 3.544 ± 0.835
1.89TyrMet: 1.89 ± 0.937
2.127TyrAsn: 2.127 ± 0.394
0.945TyrPro: 0.945 ± 0.244
0.945TyrGln: 0.945 ± 0.244
2.127TyrArg: 2.127 ± 0.394
2.599TyrSer: 2.599 ± 0.763
2.127TyrThr: 2.127 ± 0.798
1.418TyrVal: 1.418 ± 0.411
0.473TyrTrp: 0.473 ± 0.122
0.709TyrTyr: 0.709 ± 0.623
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4233 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski