Amino acid dipepetide frequency for Andrena haemorrhoa nege-like virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.545AlaAla: 3.545 ± 1.784
1.063AlaCys: 1.063 ± 0.718
2.481AlaAsp: 2.481 ± 0.963
1.418AlaGlu: 1.418 ± 0.714
2.481AlaPhe: 2.481 ± 2.349
2.127AlaGly: 2.127 ± 0.386
0.709AlaHis: 0.709 ± 0.357
3.19AlaIle: 3.19 ± 1.38
1.063AlaLys: 1.063 ± 0.535
4.963AlaLeu: 4.963 ± 2.498
0.709AlaMet: 0.709 ± 0.357
2.836AlaAsn: 2.836 ± 0.97
0.709AlaPro: 0.709 ± 0.357
0.354AlaGln: 0.354 ± 0.178
0.354AlaArg: 0.354 ± 0.178
2.481AlaSer: 2.481 ± 0.541
1.772AlaThr: 1.772 ± 0.892
3.545AlaVal: 3.545 ± 1.197
0.0AlaTrp: 0.0 ± 0.0
2.481AlaTyr: 2.481 ± 0.477
0.0AlaXaa: 0.0 ± 0.0
Cys
1.772CysAla: 1.772 ± 0.703
0.354CysCys: 0.354 ± 0.178
0.709CysAsp: 0.709 ± 0.357
0.354CysGlu: 0.354 ± 0.178
1.063CysPhe: 1.063 ± 0.535
0.354CysGly: 0.354 ± 0.178
0.709CysHis: 0.709 ± 0.687
1.418CysIle: 1.418 ± 0.688
1.772CysLys: 1.772 ± 0.892
1.772CysLeu: 1.772 ± 0.703
0.354CysMet: 0.354 ± 0.844
1.418CysAsn: 1.418 ± 0.688
0.354CysPro: 0.354 ± 0.178
0.354CysGln: 0.354 ± 0.178
1.418CysArg: 1.418 ± 0.428
1.063CysSer: 1.063 ± 0.535
3.545CysThr: 3.545 ± 1.784
1.418CysVal: 1.418 ± 0.428
0.0CysTrp: 0.0 ± 0.0
0.709CysTyr: 0.709 ± 0.788
0.0CysXaa: 0.0 ± 0.0
Asp
3.899AspAla: 3.899 ± 1.387
1.063AspCys: 1.063 ± 0.535
2.481AspAsp: 2.481 ± 0.963
2.836AspGlu: 2.836 ± 0.856
1.063AspPhe: 1.063 ± 0.535
1.418AspGly: 1.418 ± 0.428
1.063AspHis: 1.063 ± 0.535
5.672AspIle: 5.672 ± 1.948
3.19AspLys: 3.19 ± 0.333
6.026AspLeu: 6.026 ± 0.666
2.127AspMet: 2.127 ± 1.07
3.899AspAsn: 3.899 ± 0.731
2.481AspPro: 2.481 ± 1.374
1.418AspGln: 1.418 ± 1.375
2.127AspArg: 2.127 ± 1.088
3.19AspSer: 3.19 ± 1.606
3.19AspThr: 3.19 ± 0.777
5.317AspVal: 5.317 ± 1.773
0.0AspTrp: 0.0 ± 0.0
3.899AspTyr: 3.899 ± 1.65
0.0AspXaa: 0.0 ± 0.0
Glu
0.354GluAla: 0.354 ± 0.178
0.709GluCys: 0.709 ± 0.357
3.19GluAsp: 3.19 ± 0.777
2.127GluGlu: 2.127 ± 1.07
4.963GluPhe: 4.963 ± 1.137
1.418GluGly: 1.418 ± 0.428
1.063GluHis: 1.063 ± 0.544
4.963GluIle: 4.963 ± 1.599
5.672GluLys: 5.672 ± 1.948
5.672GluLeu: 5.672 ± 1.948
2.127GluMet: 2.127 ± 1.07
3.19GluAsn: 3.19 ± 0.758
1.063GluPro: 1.063 ± 0.544
1.063GluGln: 1.063 ± 0.544
1.772GluArg: 1.772 ± 0.703
3.899GluSer: 3.899 ± 1.962
3.899GluThr: 3.899 ± 1.085
3.545GluVal: 3.545 ± 0.733
0.0GluTrp: 0.0 ± 0.0
2.481GluTyr: 2.481 ± 1.374
0.0GluXaa: 0.0 ± 0.0
Phe
2.481PheAla: 2.481 ± 0.854
1.772PheCys: 1.772 ± 0.892
3.899PheAsp: 3.899 ± 1.401
3.545PheGlu: 3.545 ± 0.919
3.19PhePhe: 3.19 ± 2.6
1.063PheGly: 1.063 ± 0.544
2.127PheHis: 2.127 ± 1.07
4.608PheIle: 4.608 ± 3.238
6.026PheLys: 6.026 ± 1.303
7.799PheLeu: 7.799 ± 3.945
0.709PheMet: 0.709 ± 0.319
4.963PheAsn: 4.963 ± 1.599
2.127PhePro: 2.127 ± 0.692
1.418PheGln: 1.418 ± 0.714
2.481PheArg: 2.481 ± 1.551
6.026PheSer: 6.026 ± 1.626
4.254PheThr: 4.254 ± 0.483
4.608PheVal: 4.608 ± 2.996
0.354PheTrp: 0.354 ± 0.178
3.545PheTyr: 3.545 ± 1.824
0.0PheXaa: 0.0 ± 0.0
Gly
1.063GlyAla: 1.063 ± 0.535
1.063GlyCys: 1.063 ± 0.535
2.836GlyAsp: 2.836 ± 0.608
0.709GlyGlu: 0.709 ± 0.357
1.063GlyPhe: 1.063 ± 0.535
0.709GlyGly: 0.709 ± 0.687
0.354GlyHis: 0.354 ± 0.178
0.709GlyIle: 0.709 ± 0.357
1.063GlyLys: 1.063 ± 0.535
2.127GlyLeu: 2.127 ± 0.386
0.709GlyMet: 0.709 ± 0.357
1.418GlyAsn: 1.418 ± 1.375
0.0GlyPro: 0.0 ± 0.0
0.709GlyGln: 0.709 ± 0.687
1.418GlyArg: 1.418 ± 0.688
1.772GlySer: 1.772 ± 0.892
1.772GlyThr: 1.772 ± 0.703
2.127GlyVal: 2.127 ± 0.386
0.354GlyTrp: 0.354 ± 0.844
1.418GlyTyr: 1.418 ± 0.428
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.354HisCys: 0.354 ± 0.178
0.354HisAsp: 0.354 ± 0.889
1.418HisGlu: 1.418 ± 0.714
0.709HisPhe: 0.709 ± 0.357
0.709HisGly: 0.709 ± 0.357
0.0HisHis: 0.0 ± 0.0
1.418HisIle: 1.418 ± 1.021
0.709HisLys: 0.709 ± 0.357
2.127HisLeu: 2.127 ± 1.088
0.354HisMet: 0.354 ± 0.178
2.481HisAsn: 2.481 ± 0.963
0.709HisPro: 0.709 ± 0.357
0.0HisGln: 0.0 ± 0.0
0.354HisArg: 0.354 ± 0.178
4.254HisSer: 4.254 ± 1.558
2.127HisThr: 2.127 ± 1.07
0.354HisVal: 0.354 ± 0.178
0.0HisTrp: 0.0 ± 0.0
1.063HisTyr: 1.063 ± 0.544
0.0HisXaa: 0.0 ± 0.0
Ile
3.899IleAla: 3.899 ± 1.962
1.772IleCys: 1.772 ± 0.703
5.317IleAsp: 5.317 ± 2.676
5.317IleGlu: 5.317 ± 1.814
4.254IlePhe: 4.254 ± 3.49
1.772IleGly: 1.772 ± 0.703
1.772IleHis: 1.772 ± 0.703
3.545IleIle: 3.545 ± 0.919
5.672IleLys: 5.672 ± 1.091
6.735IleLeu: 6.735 ± 1.713
2.127IleMet: 2.127 ± 1.088
4.608IleAsn: 4.608 ± 1.426
5.317IlePro: 5.317 ± 2.968
3.19IleGln: 3.19 ± 0.758
2.127IleArg: 2.127 ± 0.762
4.963IleSer: 4.963 ± 0.877
4.608IleThr: 4.608 ± 1.426
4.608IleVal: 4.608 ± 0.306
0.0IleTrp: 0.0 ± 0.0
4.254IleTyr: 4.254 ± 1.385
0.0IleXaa: 0.0 ± 0.0
Lys
1.772LysAla: 1.772 ± 0.367
1.063LysCys: 1.063 ± 0.535
1.772LysAsp: 1.772 ± 1.227
1.772LysGlu: 1.772 ± 0.892
6.381LysPhe: 6.381 ± 1.13
1.772LysGly: 1.772 ± 0.703
1.772LysHis: 1.772 ± 0.892
6.735LysIle: 6.735 ± 1.675
3.899LysLys: 3.899 ± 0.731
9.926LysLeu: 9.926 ± 3.329
2.127LysMet: 2.127 ± 0.914
4.608LysAsn: 4.608 ± 0.715
1.418LysPro: 1.418 ± 0.688
3.19LysGln: 3.19 ± 0.333
2.481LysArg: 2.481 ± 0.477
6.735LysSer: 6.735 ± 0.648
2.481LysThr: 2.481 ± 0.963
2.127LysVal: 2.127 ± 1.07
0.354LysTrp: 0.354 ± 0.844
5.317LysTyr: 5.317 ± 1.1
0.0LysXaa: 0.0 ± 0.0
Leu
3.545LeuAla: 3.545 ± 1.784
2.481LeuCys: 2.481 ± 0.477
5.317LeuAsp: 5.317 ± 3.681
8.508LeuGlu: 8.508 ± 1.544
4.608LeuPhe: 4.608 ± 1.017
2.836LeuGly: 2.836 ± 0.608
0.354LeuHis: 0.354 ± 0.844
8.153LeuIle: 8.153 ± 1.753
7.09LeuLys: 7.09 ± 2.653
7.799LeuLeu: 7.799 ± 2.15
1.063LeuMet: 1.063 ± 0.535
10.635LeuAsn: 10.635 ± 3.684
5.317LeuPro: 5.317 ± 1.653
2.481LeuGln: 2.481 ± 1.249
5.672LeuArg: 5.672 ± 0.815
9.217LeuSer: 9.217 ± 4.106
4.963LeuThr: 4.963 ± 2.728
7.09LeuVal: 7.09 ± 0.679
0.354LeuTrp: 0.354 ± 0.178
4.963LeuTyr: 4.963 ± 1.082
0.0LeuXaa: 0.0 ± 0.0
Met
0.354MetAla: 0.354 ± 0.178
0.354MetCys: 0.354 ± 0.844
1.772MetAsp: 1.772 ± 0.367
2.481MetGlu: 2.481 ± 1.249
1.418MetPhe: 1.418 ± 0.714
0.709MetGly: 0.709 ± 0.357
0.0MetHis: 0.0 ± 0.0
1.418MetIle: 1.418 ± 0.714
1.418MetLys: 1.418 ± 0.714
1.063MetLeu: 1.063 ± 0.535
0.709MetMet: 0.709 ± 0.357
0.354MetAsn: 0.354 ± 0.178
1.063MetPro: 1.063 ± 0.535
1.063MetGln: 1.063 ± 0.718
0.354MetArg: 0.354 ± 0.178
3.545MetSer: 3.545 ± 1.824
2.127MetThr: 2.127 ± 0.386
0.709MetVal: 0.709 ± 0.357
0.0MetTrp: 0.0 ± 0.0
1.418MetTyr: 1.418 ± 0.714
0.0MetXaa: 0.0 ± 0.0
Asn
2.127AsnAla: 2.127 ± 1.07
0.709AsnCys: 0.709 ± 0.357
4.254AsnAsp: 4.254 ± 2.141
3.899AsnGlu: 3.899 ± 1.962
5.672AsnPhe: 5.672 ± 1.091
2.127AsnGly: 2.127 ± 0.386
1.418AsnHis: 1.418 ± 1.375
7.09AsnIle: 7.09 ± 2.653
5.672AsnLys: 5.672 ± 1.818
7.799AsnLeu: 7.799 ± 0.542
1.418AsnMet: 1.418 ± 0.714
4.254AsnAsn: 4.254 ± 2.141
3.899AsnPro: 3.899 ± 1.962
1.063AsnGln: 1.063 ± 1.529
2.836AsnArg: 2.836 ± 0.97
5.672AsnSer: 5.672 ± 2.608
4.608AsnThr: 4.608 ± 1.426
6.026AsnVal: 6.026 ± 1.389
0.709AsnTrp: 0.709 ± 0.788
2.127AsnTyr: 2.127 ± 0.762
0.0AsnXaa: 0.0 ± 0.0
Pro
1.418ProAla: 1.418 ± 0.688
0.709ProCys: 0.709 ± 0.357
3.545ProAsp: 3.545 ± 1.504
2.836ProGlu: 2.836 ± 1.196
2.481ProPhe: 2.481 ± 0.854
0.709ProGly: 0.709 ± 0.357
0.0ProHis: 0.0 ± 0.0
1.772ProIle: 1.772 ± 1.227
2.836ProLys: 2.836 ± 1.376
3.19ProLeu: 3.19 ± 0.333
0.354ProMet: 0.354 ± 0.844
1.772ProAsn: 1.772 ± 0.367
2.127ProPro: 2.127 ± 1.553
1.418ProGln: 1.418 ± 0.714
0.709ProArg: 0.709 ± 0.357
2.481ProSer: 2.481 ± 0.541
2.836ProThr: 2.836 ± 1.42
3.19ProVal: 3.19 ± 1.874
0.0ProTrp: 0.0 ± 0.0
3.19ProTyr: 3.19 ± 0.777
0.0ProXaa: 0.0 ± 0.0
Gln
0.354GlnAla: 0.354 ± 0.889
1.063GlnCys: 1.063 ± 0.544
1.418GlnAsp: 1.418 ± 0.428
0.709GlnGlu: 0.709 ± 0.357
1.063GlnPhe: 1.063 ± 0.718
1.418GlnGly: 1.418 ± 0.428
0.354GlnHis: 0.354 ± 0.178
3.19GlnIle: 3.19 ± 0.758
1.418GlnLys: 1.418 ± 0.714
3.545GlnLeu: 3.545 ± 1.504
1.063GlnMet: 1.063 ± 0.535
3.19GlnAsn: 3.19 ± 1.606
0.354GlnPro: 0.354 ± 0.889
1.063GlnGln: 1.063 ± 1.529
1.772GlnArg: 1.772 ± 0.703
0.709GlnSer: 0.709 ± 0.357
1.418GlnThr: 1.418 ± 0.714
1.418GlnVal: 1.418 ± 1.375
0.709GlnTrp: 0.709 ± 0.687
2.127GlnTyr: 2.127 ± 0.386
0.0GlnXaa: 0.0 ± 0.0
Arg
1.063ArgAla: 1.063 ± 0.535
0.709ArgCys: 0.709 ± 0.357
2.127ArgAsp: 2.127 ± 0.386
2.127ArgGlu: 2.127 ± 1.07
2.481ArgPhe: 2.481 ± 0.477
1.063ArgGly: 1.063 ± 0.535
0.354ArgHis: 0.354 ± 0.178
3.545ArgIle: 3.545 ± 1.407
1.772ArgLys: 1.772 ± 0.703
2.481ArgLeu: 2.481 ± 0.541
0.709ArgMet: 0.709 ± 0.357
4.254ArgAsn: 4.254 ± 2.063
1.063ArgPro: 1.063 ± 0.544
1.418ArgGln: 1.418 ± 0.428
1.772ArgArg: 1.772 ± 0.703
1.772ArgSer: 1.772 ± 0.892
3.545ArgThr: 3.545 ± 2.679
3.899ArgVal: 3.899 ± 2.312
0.0ArgTrp: 0.0 ± 0.0
2.481ArgTyr: 2.481 ± 0.541
0.0ArgXaa: 0.0 ± 0.0
Ser
3.545SerAla: 3.545 ± 2.19
0.709SerCys: 0.709 ± 0.357
5.672SerAsp: 5.672 ± 0.235
4.254SerGlu: 4.254 ± 1.254
6.381SerPhe: 6.381 ± 2.077
1.063SerGly: 1.063 ± 0.535
2.836SerHis: 2.836 ± 1.376
3.899SerIle: 3.899 ± 1.401
6.026SerLys: 6.026 ± 2.124
9.571SerLeu: 9.571 ± 1.512
1.063SerMet: 1.063 ± 0.535
6.735SerAsn: 6.735 ± 2.344
3.545SerPro: 3.545 ± 1.504
2.836SerGln: 2.836 ± 0.97
1.063SerArg: 1.063 ± 0.535
6.381SerSer: 6.381 ± 2.563
3.545SerThr: 3.545 ± 0.339
7.444SerVal: 7.444 ± 2.391
1.063SerTrp: 1.063 ± 0.544
4.254SerTyr: 4.254 ± 0.561
0.0SerXaa: 0.0 ± 0.0
Thr
2.481ThrAla: 2.481 ± 0.477
1.772ThrCys: 1.772 ± 0.892
2.836ThrAsp: 2.836 ± 0.856
4.608ThrGlu: 4.608 ± 0.849
6.381ThrPhe: 6.381 ± 2.485
0.709ThrGly: 0.709 ± 0.357
0.709ThrHis: 0.709 ± 0.357
5.317ThrIle: 5.317 ± 1.981
2.836ThrLys: 2.836 ± 0.412
4.254ThrLeu: 4.254 ± 1.478
1.772ThrMet: 1.772 ± 0.461
2.836ThrAsn: 2.836 ± 1.427
2.481ThrPro: 2.481 ± 0.963
1.063ThrGln: 1.063 ± 0.544
2.836ThrArg: 2.836 ± 0.97
6.381ThrSer: 6.381 ± 1.281
5.317ThrThr: 5.317 ± 3.366
2.481ThrVal: 2.481 ± 2.282
0.0ThrTrp: 0.0 ± 0.0
4.963ThrTyr: 4.963 ± 0.953
0.0ThrXaa: 0.0 ± 0.0
Val
2.127ValAla: 2.127 ± 0.692
1.418ValCys: 1.418 ± 1.987
2.836ValAsp: 2.836 ± 1.427
3.19ValGlu: 3.19 ± 1.606
4.608ValPhe: 4.608 ± 2.047
0.709ValGly: 0.709 ± 0.687
2.836ValHis: 2.836 ± 0.608
4.608ValIle: 4.608 ± 0.306
6.026ValLys: 6.026 ± 2.214
8.862ValLeu: 8.862 ± 3.413
1.063ValMet: 1.063 ± 0.718
4.608ValAsn: 4.608 ± 2.319
2.127ValPro: 2.127 ± 0.692
1.772ValGln: 1.772 ± 0.703
4.254ValArg: 4.254 ± 1.385
6.381ValSer: 6.381 ± 1.563
3.899ValThr: 3.899 ± 1.65
4.608ValVal: 4.608 ± 1.898
0.0ValTrp: 0.0 ± 0.0
2.481ValTyr: 2.481 ± 1.551
0.0ValXaa: 0.0 ± 0.0
Trp
0.354TrpAla: 0.354 ± 0.844
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.354TrpPhe: 0.354 ± 0.178
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.063TrpIle: 1.063 ± 1.192
0.0TrpLys: 0.0 ± 0.0
0.709TrpLeu: 0.709 ± 0.357
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.063TrpArg: 1.063 ± 0.544
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.354TrpVal: 0.354 ± 0.844
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.127TyrAla: 2.127 ± 0.386
1.772TyrCys: 1.772 ± 0.703
3.545TyrAsp: 3.545 ± 0.839
1.063TyrGlu: 1.063 ± 2.088
6.381TyrPhe: 6.381 ± 0.535
0.709TyrGly: 0.709 ± 0.357
1.063TyrHis: 1.063 ± 0.535
3.545TyrIle: 3.545 ± 0.839
3.899TyrLys: 3.899 ± 1.085
6.026TyrLeu: 6.026 ± 2.214
1.418TyrMet: 1.418 ± 0.428
5.317TyrAsn: 5.317 ± 0.945
1.418TyrPro: 1.418 ± 0.714
2.481TyrGln: 2.481 ± 0.477
1.772TyrArg: 1.772 ± 0.367
4.963TyrSer: 4.963 ± 1.082
2.127TyrThr: 2.127 ± 0.692
3.545TyrVal: 3.545 ± 1.824
0.0TyrTrp: 0.0 ± 0.0
5.672TyrTyr: 5.672 ± 2.825
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2822 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski