Amino acid dipepetide frequency for Midway nyavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.694AlaAla: 5.694 ± 0.676
1.356AlaCys: 1.356 ± 0.496
2.711AlaAsp: 2.711 ± 0.675
5.423AlaGlu: 5.423 ± 2.219
0.813AlaPhe: 0.813 ± 0.387
4.067AlaGly: 4.067 ± 1.002
1.898AlaHis: 1.898 ± 0.761
2.983AlaIle: 2.983 ± 0.732
2.711AlaLys: 2.711 ± 1.586
9.219AlaLeu: 9.219 ± 1.996
2.169AlaMet: 2.169 ± 0.543
1.898AlaAsn: 1.898 ± 0.373
3.796AlaPro: 3.796 ± 1.184
3.254AlaGln: 3.254 ± 0.729
6.236AlaArg: 6.236 ± 2.428
4.61AlaSer: 4.61 ± 1.379
4.61AlaThr: 4.61 ± 1.41
3.796AlaVal: 3.796 ± 1.333
1.356AlaTrp: 1.356 ± 0.552
2.44AlaTyr: 2.44 ± 0.863
0.0AlaXaa: 0.0 ± 0.0
Cys
0.813CysAla: 0.813 ± 0.716
0.542CysCys: 0.542 ± 0.294
0.271CysAsp: 0.271 ± 0.333
1.356CysGlu: 1.356 ± 0.616
0.813CysPhe: 0.813 ± 0.799
0.813CysGly: 0.813 ± 0.342
0.813CysHis: 0.813 ± 0.499
0.271CysIle: 0.271 ± 0.543
1.627CysLys: 1.627 ± 0.881
3.254CysLeu: 3.254 ± 1.375
0.271CysMet: 0.271 ± 0.431
1.085CysAsn: 1.085 ± 0.587
1.356CysPro: 1.356 ± 0.38
1.085CysGln: 1.085 ± 0.451
1.356CysArg: 1.356 ± 0.496
1.085CysSer: 1.085 ± 0.451
2.169CysThr: 2.169 ± 0.765
0.813CysVal: 0.813 ± 0.342
0.0CysTrp: 0.0 ± 0.0
1.085CysTyr: 1.085 ± 0.451
0.0CysXaa: 0.0 ± 0.0
Asp
1.898AspAla: 1.898 ± 0.686
1.085AspCys: 1.085 ± 0.587
1.898AspAsp: 1.898 ± 1.12
4.881AspGlu: 4.881 ± 2.967
1.085AspPhe: 1.085 ± 0.527
1.085AspGly: 1.085 ± 0.379
1.356AspHis: 1.356 ± 0.832
1.627AspIle: 1.627 ± 0.258
4.338AspLys: 4.338 ± 3.311
6.508AspLeu: 6.508 ± 1.096
0.271AspMet: 0.271 ± 0.514
0.542AspAsn: 0.542 ± 0.294
3.796AspPro: 3.796 ± 1.572
1.898AspGln: 1.898 ± 0.975
4.61AspArg: 4.61 ± 0.948
5.152AspSer: 5.152 ± 2.499
1.898AspThr: 1.898 ± 0.783
0.813AspVal: 0.813 ± 0.51
0.542AspTrp: 0.542 ± 0.333
2.983AspTyr: 2.983 ± 0.568
0.0AspXaa: 0.0 ± 0.0
Glu
8.406GluAla: 8.406 ± 3.727
1.356GluCys: 1.356 ± 0.832
5.152GluAsp: 5.152 ± 2.637
10.033GluGlu: 10.033 ± 5.266
1.085GluPhe: 1.085 ± 0.482
3.796GluGly: 3.796 ± 1.466
2.44GluHis: 2.44 ± 1.034
4.61GluIle: 4.61 ± 1.389
5.965GluLys: 5.965 ± 3.578
8.948GluLeu: 8.948 ± 1.34
1.627GluMet: 1.627 ± 1.27
1.627GluAsn: 1.627 ± 0.695
2.44GluPro: 2.44 ± 1.046
3.525GluGln: 3.525 ± 1.39
7.321GluArg: 7.321 ± 4.879
3.796GluSer: 3.796 ± 1.504
4.61GluThr: 4.61 ± 1.039
3.254GluVal: 3.254 ± 1.245
1.085GluTrp: 1.085 ± 0.587
2.44GluTyr: 2.44 ± 0.756
0.0GluXaa: 0.0 ± 0.0
Phe
1.627PheAla: 1.627 ± 0.581
0.813PheCys: 0.813 ± 0.499
0.813PheAsp: 0.813 ± 0.418
1.627PheGlu: 1.627 ± 0.589
1.085PhePhe: 1.085 ± 0.451
2.169PheGly: 2.169 ± 0.809
0.542PheHis: 0.542 ± 0.508
2.169PheIle: 2.169 ± 0.534
2.169PheLys: 2.169 ± 0.965
4.61PheLeu: 4.61 ± 1.699
0.542PheMet: 0.542 ± 0.279
0.813PheAsn: 0.813 ± 0.342
1.085PhePro: 1.085 ± 0.382
0.0PheGln: 0.0 ± 0.0
0.813PheArg: 0.813 ± 0.499
2.44PheSer: 2.44 ± 0.93
0.813PheThr: 0.813 ± 0.342
1.085PheVal: 1.085 ± 0.488
0.542PheTrp: 0.542 ± 0.294
0.271PheTyr: 0.271 ± 0.431
0.0PheXaa: 0.0 ± 0.0
Gly
2.983GlyAla: 2.983 ± 1.97
1.085GlyCys: 1.085 ± 0.587
2.169GlyAsp: 2.169 ± 0.401
2.983GlyGlu: 2.983 ± 1.249
0.813GlyPhe: 0.813 ± 0.499
5.423GlyGly: 5.423 ± 0.811
1.627GlyHis: 1.627 ± 0.911
2.44GlyIle: 2.44 ± 1.763
4.067GlyLys: 4.067 ± 1.276
9.219GlyLeu: 9.219 ± 1.703
1.898GlyMet: 1.898 ± 0.705
0.813GlyAsn: 0.813 ± 0.342
4.61GlyPro: 4.61 ± 0.664
3.254GlyGln: 3.254 ± 1.171
2.44GlyArg: 2.44 ± 0.865
5.152GlySer: 5.152 ± 0.922
3.254GlyThr: 3.254 ± 0.896
1.898GlyVal: 1.898 ± 0.908
1.085GlyTrp: 1.085 ± 0.562
1.356GlyTyr: 1.356 ± 0.38
0.0GlyXaa: 0.0 ± 0.0
His
1.898HisAla: 1.898 ± 0.783
1.085HisCys: 1.085 ± 0.451
1.085HisAsp: 1.085 ± 0.725
2.44HisGlu: 2.44 ± 0.693
0.542HisPhe: 0.542 ± 0.333
1.085HisGly: 1.085 ± 0.915
1.085HisHis: 1.085 ± 0.504
0.271HisIle: 0.271 ± 0.166
1.085HisLys: 1.085 ± 0.451
3.796HisLeu: 3.796 ± 2.027
0.271HisMet: 0.271 ± 0.333
1.627HisAsn: 1.627 ± 0.761
1.898HisPro: 1.898 ± 0.878
0.542HisGln: 0.542 ± 0.313
1.898HisArg: 1.898 ± 0.523
1.898HisSer: 1.898 ± 0.746
0.542HisThr: 0.542 ± 0.294
0.542HisVal: 0.542 ± 0.333
0.813HisTrp: 0.813 ± 0.499
0.542HisTyr: 0.542 ± 0.294
0.0HisXaa: 0.0 ± 0.0
Ile
2.44IleAla: 2.44 ± 0.771
0.813IleCys: 0.813 ± 0.499
1.627IleAsp: 1.627 ± 0.942
2.983IleGlu: 2.983 ± 0.507
1.627IlePhe: 1.627 ± 0.574
3.525IleGly: 3.525 ± 0.896
1.356IleHis: 1.356 ± 0.653
1.085IleIle: 1.085 ± 0.451
3.254IleLys: 3.254 ± 0.866
3.796IleLeu: 3.796 ± 0.841
0.271IleMet: 0.271 ± 0.333
2.169IleAsn: 2.169 ± 0.627
1.627IlePro: 1.627 ± 0.79
1.085IleGln: 1.085 ± 0.713
3.254IleArg: 3.254 ± 0.891
4.067IleSer: 4.067 ± 1.356
3.254IleThr: 3.254 ± 1.122
1.898IleVal: 1.898 ± 0.447
0.542IleTrp: 0.542 ± 0.665
1.627IleTyr: 1.627 ± 0.736
0.0IleXaa: 0.0 ± 0.0
Lys
3.254LysAla: 3.254 ± 1.265
0.813LysCys: 0.813 ± 0.406
4.338LysAsp: 4.338 ± 2.49
8.948LysGlu: 8.948 ± 5.076
0.542LysPhe: 0.542 ± 0.853
4.338LysGly: 4.338 ± 1.009
1.356LysHis: 1.356 ± 0.626
2.711LysIle: 2.711 ± 0.999
5.965LysLys: 5.965 ± 1.947
4.067LysLeu: 4.067 ± 1.409
1.085LysMet: 1.085 ± 0.665
2.169LysAsn: 2.169 ± 0.987
1.898LysPro: 1.898 ± 0.681
1.898LysGln: 1.898 ± 0.789
5.152LysArg: 5.152 ± 1.558
2.983LysSer: 2.983 ± 0.67
4.067LysThr: 4.067 ± 0.832
3.254LysVal: 3.254 ± 1.302
0.813LysTrp: 0.813 ± 0.342
1.356LysTyr: 1.356 ± 0.587
0.0LysXaa: 0.0 ± 0.0
Leu
10.575LeuAla: 10.575 ± 2.055
1.627LeuCys: 1.627 ± 0.835
4.881LeuAsp: 4.881 ± 1.175
9.219LeuGlu: 9.219 ± 1.967
4.067LeuPhe: 4.067 ± 1.118
6.236LeuGly: 6.236 ± 1.791
2.711LeuHis: 2.711 ± 0.499
5.694LeuIle: 5.694 ± 1.725
7.863LeuLys: 7.863 ± 2.194
14.642LeuLeu: 14.642 ± 4.06
2.711LeuMet: 2.711 ± 0.479
2.711LeuAsn: 2.711 ± 1.231
5.694LeuPro: 5.694 ± 1.867
5.423LeuGln: 5.423 ± 0.667
6.779LeuArg: 6.779 ± 1.322
11.659LeuSer: 11.659 ± 2.978
5.965LeuThr: 5.965 ± 1.367
5.152LeuVal: 5.152 ± 2.006
2.44LeuTrp: 2.44 ± 1.034
4.067LeuTyr: 4.067 ± 0.847
0.0LeuXaa: 0.0 ± 0.0
Met
0.813MetAla: 0.813 ± 0.499
0.0MetCys: 0.0 ± 0.0
0.813MetAsp: 0.813 ± 0.418
2.169MetGlu: 2.169 ± 1.295
0.271MetPhe: 0.271 ± 0.333
0.813MetGly: 0.813 ± 0.334
0.542MetHis: 0.542 ± 0.294
1.085MetIle: 1.085 ± 0.425
0.0MetLys: 0.0 ± 0.0
2.169MetLeu: 2.169 ± 1.294
0.542MetMet: 0.542 ± 0.333
0.542MetAsn: 0.542 ± 0.609
1.085MetPro: 1.085 ± 0.665
2.169MetGln: 2.169 ± 0.534
0.813MetArg: 0.813 ± 0.836
1.356MetSer: 1.356 ± 0.582
1.898MetThr: 1.898 ± 0.628
0.813MetVal: 0.813 ± 0.499
0.542MetTrp: 0.542 ± 0.333
0.542MetTyr: 0.542 ± 0.333
0.0MetXaa: 0.0 ± 0.0
Asn
1.085AsnAla: 1.085 ± 0.379
0.813AsnCys: 0.813 ± 0.342
1.356AsnAsp: 1.356 ± 0.576
0.271AsnGlu: 0.271 ± 0.374
0.813AsnPhe: 0.813 ± 0.342
1.085AsnGly: 1.085 ± 0.451
0.542AsnHis: 0.542 ± 0.294
1.898AsnIle: 1.898 ± 0.593
0.542AsnLys: 0.542 ± 0.388
3.796AsnLeu: 3.796 ± 1.222
0.542AsnMet: 0.542 ± 0.294
1.627AsnAsn: 1.627 ± 1.272
2.711AsnPro: 2.711 ± 0.666
2.983AsnGln: 2.983 ± 0.998
1.898AsnArg: 1.898 ± 0.905
1.898AsnSer: 1.898 ± 0.731
1.898AsnThr: 1.898 ± 0.902
1.085AsnVal: 1.085 ± 0.562
0.542AsnTrp: 0.542 ± 0.294
1.356AsnTyr: 1.356 ± 0.576
0.0AsnXaa: 0.0 ± 0.0
Pro
3.254ProAla: 3.254 ± 1.163
2.169ProCys: 2.169 ± 0.626
1.898ProAsp: 1.898 ± 0.866
4.338ProGlu: 4.338 ± 1.946
2.983ProPhe: 2.983 ± 1.724
2.44ProGly: 2.44 ± 0.628
1.356ProHis: 1.356 ± 0.832
1.627ProIle: 1.627 ± 0.736
2.983ProLys: 2.983 ± 1.019
7.592ProLeu: 7.592 ± 1.113
1.085ProMet: 1.085 ± 0.562
2.169ProAsn: 2.169 ± 0.54
3.254ProPro: 3.254 ± 1.162
1.356ProGln: 1.356 ± 0.622
1.627ProArg: 1.627 ± 0.684
3.254ProSer: 3.254 ± 1.09
3.796ProThr: 3.796 ± 0.568
3.796ProVal: 3.796 ± 0.493
0.271ProTrp: 0.271 ± 0.166
2.169ProTyr: 2.169 ± 1.121
0.0ProXaa: 0.0 ± 0.0
Gln
4.338GlnAla: 4.338 ± 0.66
0.813GlnCys: 0.813 ± 0.598
3.525GlnAsp: 3.525 ± 1.15
5.423GlnGlu: 5.423 ± 1.05
1.085GlnPhe: 1.085 ± 0.504
4.61GlnGly: 4.61 ± 2.522
0.271GlnHis: 0.271 ± 0.374
1.898GlnIle: 1.898 ± 0.705
1.898GlnLys: 1.898 ± 0.868
5.152GlnLeu: 5.152 ± 1.058
1.085GlnMet: 1.085 ± 0.587
1.627GlnAsn: 1.627 ± 0.902
1.356GlnPro: 1.356 ± 0.33
1.898GlnGln: 1.898 ± 0.468
2.169GlnArg: 2.169 ± 0.62
3.254GlnSer: 3.254 ± 1.0
2.44GlnThr: 2.44 ± 0.521
2.711GlnVal: 2.711 ± 2.462
1.085GlnTrp: 1.085 ± 0.352
0.271GlnTyr: 0.271 ± 0.427
0.0GlnXaa: 0.0 ± 0.0
Arg
5.152ArgAla: 5.152 ± 1.67
1.356ArgCys: 1.356 ± 0.616
5.152ArgAsp: 5.152 ± 2.177
6.236ArgGlu: 6.236 ± 5.133
2.169ArgPhe: 2.169 ± 0.975
5.152ArgGly: 5.152 ± 1.728
2.711ArgHis: 2.711 ± 0.778
1.085ArgIle: 1.085 ± 0.527
4.61ArgLys: 4.61 ± 1.971
6.236ArgLeu: 6.236 ± 1.906
0.542ArgMet: 0.542 ± 0.333
0.813ArgAsn: 0.813 ± 0.499
4.338ArgPro: 4.338 ± 0.79
2.983ArgGln: 2.983 ± 1.097
6.508ArgArg: 6.508 ± 2.737
5.152ArgSer: 5.152 ± 1.324
3.525ArgThr: 3.525 ± 0.893
4.067ArgVal: 4.067 ± 0.73
0.813ArgTrp: 0.813 ± 0.598
0.542ArgTyr: 0.542 ± 0.61
0.0ArgXaa: 0.0 ± 0.0
Ser
4.881SerAla: 4.881 ± 1.791
1.085SerCys: 1.085 ± 0.647
5.423SerAsp: 5.423 ± 2.626
5.965SerGlu: 5.965 ± 1.605
0.813SerPhe: 0.813 ± 0.499
4.338SerGly: 4.338 ± 1.687
1.085SerHis: 1.085 ± 0.451
2.983SerIle: 2.983 ± 0.793
3.796SerLys: 3.796 ± 0.781
10.304SerLeu: 10.304 ± 2.873
1.085SerMet: 1.085 ± 0.451
2.169SerAsn: 2.169 ± 0.888
4.881SerPro: 4.881 ± 1.428
4.067SerGln: 4.067 ± 1.029
5.965SerArg: 5.965 ± 1.334
7.321SerSer: 7.321 ± 1.927
4.338SerThr: 4.338 ± 1.484
3.525SerVal: 3.525 ± 0.604
1.356SerTrp: 1.356 ± 0.38
1.356SerTyr: 1.356 ± 0.958
0.0SerXaa: 0.0 ± 0.0
Thr
5.152ThrAla: 5.152 ± 1.36
1.356ThrCys: 1.356 ± 0.891
1.627ThrAsp: 1.627 ± 0.684
3.525ThrGlu: 3.525 ± 0.863
1.627ThrPhe: 1.627 ± 0.998
2.983ThrGly: 2.983 ± 1.124
0.813ThrHis: 0.813 ± 0.499
2.44ThrIle: 2.44 ± 0.521
2.169ThrLys: 2.169 ± 0.79
8.677ThrLeu: 8.677 ± 2.512
1.356ThrMet: 1.356 ± 0.624
2.169ThrAsn: 2.169 ± 0.97
2.44ThrPro: 2.44 ± 0.618
1.898ThrGln: 1.898 ± 0.704
3.254ThrArg: 3.254 ± 0.974
5.965ThrSer: 5.965 ± 1.895
4.067ThrThr: 4.067 ± 1.762
3.254ThrVal: 3.254 ± 1.121
1.898ThrTrp: 1.898 ± 0.789
1.356ThrTyr: 1.356 ± 0.773
0.0ThrXaa: 0.0 ± 0.0
Val
4.61ValAla: 4.61 ± 1.112
1.356ValCys: 1.356 ± 0.969
1.898ValAsp: 1.898 ± 0.572
2.44ValGlu: 2.44 ± 1.251
2.169ValPhe: 2.169 ± 0.963
2.44ValGly: 2.44 ± 0.864
1.085ValHis: 1.085 ± 0.425
2.711ValIle: 2.711 ± 1.109
3.525ValLys: 3.525 ± 1.34
4.067ValLeu: 4.067 ± 0.91
0.813ValMet: 0.813 ± 0.342
0.813ValAsn: 0.813 ± 0.799
2.983ValPro: 2.983 ± 1.041
3.254ValGln: 3.254 ± 1.149
3.254ValArg: 3.254 ± 1.302
2.169ValSer: 2.169 ± 0.849
1.085ValThr: 1.085 ± 0.504
2.44ValVal: 2.44 ± 1.06
0.271ValTrp: 0.271 ± 0.543
2.711ValTyr: 2.711 ± 1.164
0.0ValXaa: 0.0 ± 0.0
Trp
0.813TrpAla: 0.813 ± 0.406
1.085TrpCys: 1.085 ± 0.665
1.085TrpAsp: 1.085 ± 0.517
1.085TrpGlu: 1.085 ± 0.382
0.542TrpPhe: 0.542 ± 0.333
0.542TrpGly: 0.542 ± 0.333
0.542TrpHis: 0.542 ± 0.333
0.813TrpIle: 0.813 ± 0.499
1.356TrpLys: 1.356 ± 0.653
1.356TrpLeu: 1.356 ± 0.891
0.271TrpMet: 0.271 ± 0.166
0.542TrpAsn: 0.542 ± 0.294
1.085TrpPro: 1.085 ± 0.451
0.813TrpGln: 0.813 ± 0.799
1.085TrpArg: 1.085 ± 0.451
1.085TrpSer: 1.085 ± 0.504
1.627TrpThr: 1.627 ± 0.881
0.542TrpVal: 0.542 ± 0.665
0.0TrpTrp: 0.0 ± 0.0
0.271TrpTyr: 0.271 ± 0.543
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.356TyrAla: 1.356 ± 0.358
0.271TyrCys: 0.271 ± 0.166
0.813TyrAsp: 0.813 ± 0.342
2.169TyrGlu: 2.169 ± 0.484
1.085TyrPhe: 1.085 ± 0.382
1.627TyrGly: 1.627 ± 0.79
0.813TyrHis: 0.813 ± 0.334
1.898TyrIle: 1.898 ± 0.783
1.085TyrLys: 1.085 ± 0.791
2.711TyrLeu: 2.711 ± 1.003
0.271TyrMet: 0.271 ± 0.333
0.813TyrAsn: 0.813 ± 0.342
1.085TyrPro: 1.085 ± 0.504
3.525TyrGln: 3.525 ± 0.607
2.711TyrArg: 2.711 ± 0.662
2.44TyrSer: 2.44 ± 0.863
2.169TyrThr: 2.169 ± 0.385
1.356TyrVal: 1.356 ± 0.587
0.542TyrTrp: 0.542 ± 0.333
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3689 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski