Amino acid dipepetide frequency for Kamese virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.779AlaAla: 2.779 ± 1.147
0.926AlaCys: 0.926 ± 0.429
3.242AlaAsp: 3.242 ± 0.699
1.158AlaGlu: 1.158 ± 0.607
2.316AlaPhe: 2.316 ± 0.994
2.084AlaGly: 2.084 ± 1.563
1.158AlaHis: 1.158 ± 0.623
3.474AlaIle: 3.474 ± 0.607
3.011AlaLys: 3.011 ± 0.924
4.4AlaLeu: 4.4 ± 1.096
0.695AlaMet: 0.695 ± 0.521
2.316AlaAsn: 2.316 ± 0.778
1.621AlaPro: 1.621 ± 0.993
1.39AlaGln: 1.39 ± 0.651
2.547AlaArg: 2.547 ± 0.531
3.705AlaSer: 3.705 ± 1.256
1.621AlaThr: 1.621 ± 0.602
3.242AlaVal: 3.242 ± 0.736
1.853AlaTrp: 1.853 ± 0.582
2.316AlaTyr: 2.316 ± 0.999
0.0AlaXaa: 0.0 ± 0.0
Cys
0.926CysAla: 0.926 ± 0.284
0.232CysCys: 0.232 ± 0.15
0.232CysAsp: 0.232 ± 0.373
0.926CysGlu: 0.926 ± 0.47
1.158CysPhe: 1.158 ± 0.658
0.926CysGly: 0.926 ± 0.51
0.926CysHis: 0.926 ± 0.519
1.158CysIle: 1.158 ± 0.393
1.158CysLys: 1.158 ± 0.973
1.39CysLeu: 1.39 ± 0.619
0.232CysMet: 0.232 ± 0.266
0.695CysAsn: 0.695 ± 0.479
0.926CysPro: 0.926 ± 0.441
0.463CysGln: 0.463 ± 0.299
0.926CysArg: 0.926 ± 0.398
1.39CysSer: 1.39 ± 0.747
0.695CysThr: 0.695 ± 0.359
1.853CysVal: 1.853 ± 0.476
0.232CysTrp: 0.232 ± 0.15
1.621CysTyr: 1.621 ± 0.578
0.0CysXaa: 0.0 ± 0.0
Asp
2.779AspAla: 2.779 ± 1.633
1.158AspCys: 1.158 ± 0.335
3.474AspAsp: 3.474 ± 1.738
3.011AspGlu: 3.011 ± 0.644
0.926AspPhe: 0.926 ± 0.445
4.169AspGly: 4.169 ± 1.077
1.158AspHis: 1.158 ± 0.406
3.937AspIle: 3.937 ± 1.322
3.937AspLys: 3.937 ± 1.43
6.716AspLeu: 6.716 ± 1.725
1.39AspMet: 1.39 ± 0.65
1.853AspAsn: 1.853 ± 0.675
3.474AspPro: 3.474 ± 0.938
2.547AspGln: 2.547 ± 1.239
1.853AspArg: 1.853 ± 0.925
2.779AspSer: 2.779 ± 0.63
1.158AspThr: 1.158 ± 0.333
3.937AspVal: 3.937 ± 0.984
0.926AspTrp: 0.926 ± 0.456
1.853AspTyr: 1.853 ± 0.685
0.0AspXaa: 0.0 ± 0.0
Glu
1.621GluAla: 1.621 ± 0.969
0.463GluCys: 0.463 ± 0.465
5.095GluAsp: 5.095 ± 1.189
4.632GluGlu: 4.632 ± 1.478
3.011GluPhe: 3.011 ± 1.53
3.242GluGly: 3.242 ± 1.024
0.463GluHis: 0.463 ± 0.442
3.705GluIle: 3.705 ± 0.723
4.4GluLys: 4.4 ± 1.604
6.948GluLeu: 6.948 ± 1.338
1.853GluMet: 1.853 ± 0.796
2.316GluAsn: 2.316 ± 0.625
1.621GluPro: 1.621 ± 0.503
2.316GluGln: 2.316 ± 1.082
1.853GluArg: 1.853 ± 0.863
3.937GluSer: 3.937 ± 1.174
3.242GluThr: 3.242 ± 0.611
2.547GluVal: 2.547 ± 0.87
2.547GluTrp: 2.547 ± 0.863
2.779GluTyr: 2.779 ± 1.299
0.0GluXaa: 0.0 ± 0.0
Phe
1.621PheAla: 1.621 ± 0.589
0.463PheCys: 0.463 ± 0.252
2.084PheAsp: 2.084 ± 0.861
2.084PheGlu: 2.084 ± 0.562
2.547PhePhe: 2.547 ± 0.515
2.084PheGly: 2.084 ± 0.472
1.39PheHis: 1.39 ± 0.747
1.621PheIle: 1.621 ± 1.066
3.474PheLys: 3.474 ± 1.118
4.632PheLeu: 4.632 ± 1.14
0.695PheMet: 0.695 ± 0.481
2.779PheAsn: 2.779 ± 0.643
2.779PhePro: 2.779 ± 0.732
2.547PheGln: 2.547 ± 0.594
2.316PheArg: 2.316 ± 0.924
3.705PheSer: 3.705 ± 0.928
1.853PheThr: 1.853 ± 1.199
2.316PheVal: 2.316 ± 0.627
0.463PheTrp: 0.463 ± 0.299
0.926PheTyr: 0.926 ± 0.362
0.0PheXaa: 0.0 ± 0.0
Gly
1.853GlyAla: 1.853 ± 0.799
0.463GlyCys: 0.463 ± 0.235
2.547GlyAsp: 2.547 ± 0.726
2.779GlyGlu: 2.779 ± 1.22
2.084GlyPhe: 2.084 ± 0.733
3.474GlyGly: 3.474 ± 1.262
0.926GlyHis: 0.926 ± 0.598
4.4GlyIle: 4.4 ± 0.943
2.779GlyLys: 2.779 ± 0.8
7.411GlyLeu: 7.411 ± 1.913
1.39GlyMet: 1.39 ± 0.958
1.853GlyAsn: 1.853 ± 0.601
1.853GlyPro: 1.853 ± 0.612
2.316GlyGln: 2.316 ± 0.734
2.547GlyArg: 2.547 ± 0.828
4.632GlySer: 4.632 ± 1.098
3.937GlyThr: 3.937 ± 1.273
3.242GlyVal: 3.242 ± 1.344
0.463GlyTrp: 0.463 ± 0.235
2.084GlyTyr: 2.084 ± 0.507
0.0GlyXaa: 0.0 ± 0.0
His
0.695HisAla: 0.695 ± 0.294
0.695HisCys: 0.695 ± 0.479
2.084HisAsp: 2.084 ± 1.06
1.621HisGlu: 1.621 ± 0.517
1.158HisPhe: 1.158 ± 0.526
1.158HisGly: 1.158 ± 0.584
0.232HisHis: 0.232 ± 0.15
2.084HisIle: 2.084 ± 0.552
0.926HisLys: 0.926 ± 0.398
1.621HisLeu: 1.621 ± 0.416
1.158HisMet: 1.158 ± 0.577
1.621HisAsn: 1.621 ± 0.539
2.084HisPro: 2.084 ± 0.921
0.926HisGln: 0.926 ± 0.598
1.621HisArg: 1.621 ± 0.567
1.853HisSer: 1.853 ± 0.545
0.926HisThr: 0.926 ± 0.466
1.158HisVal: 1.158 ± 0.517
0.926HisTrp: 0.926 ± 0.323
0.463HisTyr: 0.463 ± 0.299
0.0HisXaa: 0.0 ± 0.0
Ile
2.316IleAla: 2.316 ± 0.75
1.158IleCys: 1.158 ± 0.748
3.474IleAsp: 3.474 ± 0.547
4.4IleGlu: 4.4 ± 0.862
1.853IlePhe: 1.853 ± 0.41
5.558IleGly: 5.558 ± 0.781
1.853IleHis: 1.853 ± 0.504
5.095IleIle: 5.095 ± 1.43
3.937IleLys: 3.937 ± 0.982
5.79IleLeu: 5.79 ± 1.339
0.926IleMet: 0.926 ± 0.453
5.327IleAsn: 5.327 ± 1.041
2.084IlePro: 2.084 ± 0.751
3.242IleGln: 3.242 ± 1.112
6.021IleArg: 6.021 ± 0.808
7.179IleSer: 7.179 ± 1.141
3.705IleThr: 3.705 ± 1.223
4.169IleVal: 4.169 ± 0.778
0.695IleTrp: 0.695 ± 0.479
4.169IleTyr: 4.169 ± 0.599
0.0IleXaa: 0.0 ± 0.0
Lys
3.011LysAla: 3.011 ± 0.924
0.926LysCys: 0.926 ± 0.511
1.853LysAsp: 1.853 ± 0.45
2.779LysGlu: 2.779 ± 0.609
1.853LysPhe: 1.853 ± 0.684
3.242LysGly: 3.242 ± 1.104
1.621LysHis: 1.621 ± 0.405
6.021LysIle: 6.021 ± 1.686
4.632LysLys: 4.632 ± 1.681
5.095LysLeu: 5.095 ± 1.808
1.621LysMet: 1.621 ± 0.703
2.547LysAsn: 2.547 ± 0.67
2.084LysPro: 2.084 ± 0.688
1.158LysGln: 1.158 ± 0.507
4.632LysArg: 4.632 ± 1.69
5.558LysSer: 5.558 ± 1.349
2.547LysThr: 2.547 ± 0.941
3.474LysVal: 3.474 ± 1.073
0.926LysTrp: 0.926 ± 0.403
1.39LysTyr: 1.39 ± 0.4
0.0LysXaa: 0.0 ± 0.0
Leu
5.095LeuAla: 5.095 ± 0.94
2.547LeuCys: 2.547 ± 0.594
5.558LeuAsp: 5.558 ± 1.08
7.874LeuGlu: 7.874 ± 1.569
5.327LeuPhe: 5.327 ± 1.16
3.705LeuGly: 3.705 ± 1.345
2.779LeuHis: 2.779 ± 0.845
7.179LeuIle: 7.179 ± 2.263
6.253LeuLys: 6.253 ± 1.246
9.727LeuLeu: 9.727 ± 1.463
3.011LeuMet: 3.011 ± 0.389
7.411LeuAsn: 7.411 ± 1.459
4.632LeuPro: 4.632 ± 1.466
4.169LeuGln: 4.169 ± 1.303
5.558LeuArg: 5.558 ± 1.482
6.484LeuSer: 6.484 ± 1.324
5.558LeuThr: 5.558 ± 1.021
4.169LeuVal: 4.169 ± 0.872
1.621LeuTrp: 1.621 ± 1.065
4.169LeuTyr: 4.169 ± 0.744
0.0LeuXaa: 0.0 ± 0.0
Met
1.853MetAla: 1.853 ± 0.961
0.463MetCys: 0.463 ± 0.323
1.853MetAsp: 1.853 ± 0.793
2.779MetGlu: 2.779 ± 0.812
2.084MetPhe: 2.084 ± 0.471
1.853MetGly: 1.853 ± 0.639
0.463MetHis: 0.463 ± 0.252
1.621MetIle: 1.621 ± 0.743
1.158MetLys: 1.158 ± 0.374
2.316MetLeu: 2.316 ± 0.722
0.695MetMet: 0.695 ± 0.294
1.853MetAsn: 1.853 ± 0.68
0.232MetPro: 0.232 ± 0.15
0.0MetGln: 0.0 ± 0.0
1.853MetArg: 1.853 ± 1.135
1.853MetSer: 1.853 ± 0.913
0.926MetThr: 0.926 ± 0.387
1.39MetVal: 1.39 ± 0.63
0.232MetTrp: 0.232 ± 0.48
0.926MetTyr: 0.926 ± 0.519
0.0MetXaa: 0.0 ± 0.0
Asn
3.011AsnAla: 3.011 ± 1.101
1.621AsnCys: 1.621 ± 0.481
2.084AsnAsp: 2.084 ± 0.513
2.084AsnGlu: 2.084 ± 1.095
2.316AsnPhe: 2.316 ± 0.801
2.547AsnGly: 2.547 ± 0.987
2.547AsnHis: 2.547 ± 0.762
2.779AsnIle: 2.779 ± 1.025
1.621AsnLys: 1.621 ± 0.508
7.642AsnLeu: 7.642 ± 1.839
1.853AsnMet: 1.853 ± 0.435
3.705AsnAsn: 3.705 ± 1.105
3.937AsnPro: 3.937 ± 0.502
2.547AsnGln: 2.547 ± 1.189
2.547AsnArg: 2.547 ± 0.679
4.169AsnSer: 4.169 ± 1.34
3.705AsnThr: 3.705 ± 1.088
3.242AsnVal: 3.242 ± 1.058
1.621AsnTrp: 1.621 ± 0.551
3.011AsnTyr: 3.011 ± 0.866
0.0AsnXaa: 0.0 ± 0.0
Pro
1.39ProAla: 1.39 ± 0.379
0.0ProCys: 0.0 ± 0.0
3.937ProAsp: 3.937 ± 1.431
2.316ProGlu: 2.316 ± 0.904
1.158ProPhe: 1.158 ± 0.845
2.316ProGly: 2.316 ± 1.442
1.158ProHis: 1.158 ± 0.467
3.242ProIle: 3.242 ± 0.573
3.705ProLys: 3.705 ± 1.609
4.169ProLeu: 4.169 ± 0.777
0.0ProMet: 0.0 ± 0.0
2.547ProAsn: 2.547 ± 0.556
2.316ProPro: 2.316 ± 1.253
1.853ProGln: 1.853 ± 0.725
3.011ProArg: 3.011 ± 0.809
3.705ProSer: 3.705 ± 0.627
3.011ProThr: 3.011 ± 1.141
2.779ProVal: 2.779 ± 0.479
1.158ProTrp: 1.158 ± 0.451
1.621ProTyr: 1.621 ± 0.521
0.0ProXaa: 0.0 ± 0.0
Gln
1.39GlnAla: 1.39 ± 0.651
0.695GlnCys: 0.695 ± 0.479
1.621GlnAsp: 1.621 ± 1.194
2.779GlnGlu: 2.779 ± 0.688
1.39GlnPhe: 1.39 ± 0.431
2.084GlnGly: 2.084 ± 0.599
0.232GlnHis: 0.232 ± 0.291
2.084GlnIle: 2.084 ± 0.539
1.853GlnLys: 1.853 ± 0.441
3.242GlnLeu: 3.242 ± 0.575
0.926GlnMet: 0.926 ± 0.675
2.547GlnAsn: 2.547 ± 0.613
0.463GlnPro: 0.463 ± 0.321
0.695GlnGln: 0.695 ± 0.639
2.084GlnArg: 2.084 ± 0.9
2.779GlnSer: 2.779 ± 0.913
3.242GlnThr: 3.242 ± 0.57
2.316GlnVal: 2.316 ± 0.799
1.158GlnTrp: 1.158 ± 0.424
1.158GlnTyr: 1.158 ± 0.372
0.0GlnXaa: 0.0 ± 0.0
Arg
2.084ArgAla: 2.084 ± 0.472
0.695ArgCys: 0.695 ± 0.409
3.011ArgAsp: 3.011 ± 0.697
4.863ArgGlu: 4.863 ± 0.827
3.474ArgPhe: 3.474 ± 1.382
3.242ArgGly: 3.242 ± 0.672
1.39ArgHis: 1.39 ± 0.377
3.937ArgIle: 3.937 ± 1.154
1.853ArgLys: 1.853 ± 1.381
4.863ArgLeu: 4.863 ± 0.717
0.926ArgMet: 0.926 ± 0.284
3.474ArgAsn: 3.474 ± 1.073
1.621ArgPro: 1.621 ± 0.321
1.158ArgGln: 1.158 ± 0.57
3.011ArgArg: 3.011 ± 1.083
5.327ArgSer: 5.327 ± 1.031
3.705ArgThr: 3.705 ± 0.968
3.937ArgVal: 3.937 ± 0.828
0.463ArgTrp: 0.463 ± 0.252
1.853ArgTyr: 1.853 ± 0.791
0.0ArgXaa: 0.0 ± 0.0
Ser
5.327SerAla: 5.327 ± 1.459
0.232SerCys: 0.232 ± 0.15
3.011SerAsp: 3.011 ± 1.281
3.705SerGlu: 3.705 ± 0.935
2.316SerPhe: 2.316 ± 1.071
4.863SerGly: 4.863 ± 0.897
2.779SerHis: 2.779 ± 1.281
6.948SerIle: 6.948 ± 0.926
4.169SerLys: 4.169 ± 0.804
8.8SerLeu: 8.8 ± 1.538
1.39SerMet: 1.39 ± 0.377
3.011SerAsn: 3.011 ± 1.034
3.242SerPro: 3.242 ± 0.34
2.084SerGln: 2.084 ± 0.885
4.4SerArg: 4.4 ± 0.947
6.484SerSer: 6.484 ± 1.361
4.169SerThr: 4.169 ± 1.401
4.863SerVal: 4.863 ± 0.764
2.547SerTrp: 2.547 ± 0.629
3.474SerTyr: 3.474 ± 0.91
0.0SerXaa: 0.0 ± 0.0
Thr
1.158ThrAla: 1.158 ± 0.882
1.39ThrCys: 1.39 ± 0.502
1.39ThrAsp: 1.39 ± 0.564
2.547ThrGlu: 2.547 ± 0.926
1.853ThrPhe: 1.853 ± 0.939
2.547ThrGly: 2.547 ± 1.137
1.158ThrHis: 1.158 ± 0.582
4.863ThrIle: 4.863 ± 1.002
2.547ThrLys: 2.547 ± 0.783
2.084ThrLeu: 2.084 ± 0.82
3.011ThrMet: 3.011 ± 0.951
5.095ThrAsn: 5.095 ± 0.98
3.474ThrPro: 3.474 ± 2.228
2.084ThrGln: 2.084 ± 0.917
2.547ThrArg: 2.547 ± 0.746
4.863ThrSer: 4.863 ± 1.341
3.705ThrThr: 3.705 ± 0.938
3.011ThrVal: 3.011 ± 1.066
2.779ThrTrp: 2.779 ± 0.687
2.547ThrTyr: 2.547 ± 1.341
0.0ThrXaa: 0.0 ± 0.0
Val
3.474ValAla: 3.474 ± 1.208
2.547ValCys: 2.547 ± 0.328
3.011ValAsp: 3.011 ± 0.743
2.316ValGlu: 2.316 ± 1.131
3.242ValPhe: 3.242 ± 1.361
0.695ValGly: 0.695 ± 0.479
1.39ValHis: 1.39 ± 0.897
6.021ValIle: 6.021 ± 1.242
2.779ValLys: 2.779 ± 0.981
6.948ValLeu: 6.948 ± 0.863
1.621ValMet: 1.621 ± 0.641
3.705ValAsn: 3.705 ± 1.169
3.705ValPro: 3.705 ± 1.044
1.621ValGln: 1.621 ± 0.724
2.547ValArg: 2.547 ± 0.728
3.705ValSer: 3.705 ± 0.648
2.547ValThr: 2.547 ± 0.708
1.39ValVal: 1.39 ± 0.564
0.926ValTrp: 0.926 ± 0.453
1.158ValTyr: 1.158 ± 0.497
0.0ValXaa: 0.0 ± 0.0
Trp
1.39TrpAla: 1.39 ± 0.525
0.463TrpCys: 0.463 ± 0.394
1.158TrpAsp: 1.158 ± 0.57
1.853TrpGlu: 1.853 ± 0.68
1.158TrpPhe: 1.158 ± 0.507
2.084TrpGly: 2.084 ± 0.836
0.463TrpHis: 0.463 ± 0.299
1.853TrpIle: 1.853 ± 0.771
0.926TrpLys: 0.926 ± 0.403
2.316TrpLeu: 2.316 ± 1.12
1.158TrpMet: 1.158 ± 0.798
0.926TrpAsn: 0.926 ± 0.403
0.463TrpPro: 0.463 ± 0.321
0.463TrpGln: 0.463 ± 0.235
0.695TrpArg: 0.695 ± 0.437
1.158TrpSer: 1.158 ± 0.547
1.158TrpThr: 1.158 ± 0.374
0.926TrpVal: 0.926 ± 0.47
0.695TrpTrp: 0.695 ± 0.597
1.39TrpTyr: 1.39 ± 0.783
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.084TyrAla: 2.084 ± 0.823
0.926TyrCys: 0.926 ± 0.848
2.316TyrAsp: 2.316 ± 0.576
2.084TyrGlu: 2.084 ± 0.644
1.158TyrPhe: 1.158 ± 0.706
1.158TyrGly: 1.158 ± 0.467
1.158TyrHis: 1.158 ± 0.645
0.926TyrIle: 0.926 ± 0.408
1.853TyrLys: 1.853 ± 0.553
6.716TyrLeu: 6.716 ± 1.102
1.621TyrMet: 1.621 ± 0.743
3.011TyrAsn: 3.011 ± 0.916
2.779TyrPro: 2.779 ± 1.29
1.158TyrGln: 1.158 ± 0.351
2.316TyrArg: 2.316 ± 1.02
2.547TyrSer: 2.547 ± 1.143
3.242TyrThr: 3.242 ± 1.598
1.39TyrVal: 1.39 ± 0.664
0.463TyrTrp: 0.463 ± 0.252
1.853TyrTyr: 1.853 ± 0.796
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (4319 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski