Amino acid dipeptide frequency for Rattail cactus necrosis-associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be expected at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
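The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names are hypothetical, sequences are assumed to be given in one-letter code, any unrecognised letter is mapped to Xaa, and pairs are counted within each protein only (how the database treats protein boundaries is not stated here).

```python
from collections import Counter

# Three-letter codes for the 20 standard amino acids; anything else becomes Xaa.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}

# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25 % or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def three_letter(residue):
    """Map a one-letter residue to its three-letter code (Xaa if unknown)."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs are counted inside each sequence, never across
    the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for first, second in zip(seq, seq[1:]):
            counts[three_letter(first) + three_letter(second)] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

With this sketch, a value such as 5.469 would be labelled "over" and 0.912 "under", which mirrors the red and blue marking described above.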

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
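As a small worked example of the unit conversion, the following sketch turns a per mille value from the table into a fraction or a percentage. The helper names are hypothetical and chosen only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


# AlaAla is listed as 5.469 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(5.469)
ala_ala_percent = per_mille_to_percent(5.469)
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰, so AlaAla at 5.469‰ is slightly more than twice as frequent as expected.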

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.469AlaAla: 5.469 ± 0.813
1.215AlaCys: 1.215 ± 0.282
3.95AlaAsp: 3.95 ± 0.274
3.646AlaGlu: 3.646 ± 0.447
3.342AlaPhe: 3.342 ± 0.499
4.254AlaGly: 4.254 ± 0.692
1.823AlaHis: 1.823 ± 0.424
2.431AlaIle: 2.431 ± 0.634
5.469AlaLys: 5.469 ± 0.528
6.381AlaLeu: 6.381 ± 0.259
2.127AlaMet: 2.127 ± 0.338
1.519AlaAsn: 1.519 ± 0.345
2.735AlaPro: 2.735 ± 0.597
4.254AlaGln: 4.254 ± 0.692
0.912AlaArg: 0.912 ± 0.374
5.469AlaSer: 5.469 ± 0.889
1.823AlaThr: 1.823 ± 0.904
7.293AlaVal: 7.293 ± 0.784
0.0AlaTrp: 0.0 ± 0.0
2.127AlaTyr: 2.127 ± 0.374
0.304AlaXaa: 0.304 ± 0.205
Cys
2.127CysAla: 2.127 ± 0.467
0.608CysCys: 0.608 ± 0.141
2.127CysAsp: 2.127 ± 0.467
0.0CysGlu: 0.0 ± 0.0
2.431CysPhe: 2.431 ± 0.164
1.519CysGly: 1.519 ± 0.349
1.215CysHis: 1.215 ± 0.282
0.304CysIle: 0.304 ± 0.446
1.215CysLys: 1.215 ± 0.282
0.608CysLeu: 0.608 ± 0.141
0.0CysMet: 0.0 ± 0.0
1.519CysAsn: 1.519 ± 0.269
0.912CysPro: 0.912 ± 0.34
0.304CysGln: 0.304 ± 0.446
1.519CysArg: 1.519 ± 0.629
2.735CysSer: 2.735 ± 0.264
1.823CysThr: 1.823 ± 0.178
2.127CysVal: 2.127 ± 0.275
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.381AspAla: 6.381 ± 0.589
1.519AspCys: 1.519 ± 0.349
5.469AspAsp: 5.469 ± 0.499
2.431AspGlu: 2.431 ± 0.565
3.95AspPhe: 3.95 ± 0.274
7.596AspGly: 7.596 ± 0.898
0.304AspHis: 0.304 ± 0.205
4.558AspIle: 4.558 ± 0.834
4.254AspLys: 4.254 ± 0.549
4.862AspLeu: 4.862 ± 1.256
1.519AspMet: 1.519 ± 0.371
1.823AspAsn: 1.823 ± 0.68
1.215AspPro: 1.215 ± 0.615
2.127AspGln: 2.127 ± 0.584
3.342AspArg: 3.342 ± 0.652
2.127AspSer: 2.127 ± 0.528
3.342AspThr: 3.342 ± 0.632
7.596AspVal: 7.596 ± 0.866
0.608AspTrp: 0.608 ± 0.411
2.127AspTyr: 2.127 ± 0.467
0.0AspXaa: 0.0 ± 0.0
Glu
4.558GluAla: 4.558 ± 0.756
0.0GluCys: 0.0 ± 0.0
0.608GluAsp: 0.608 ± 0.38
1.823GluGlu: 1.823 ± 0.502
3.342GluPhe: 3.342 ± 0.731
2.431GluGly: 2.431 ± 0.596
2.431GluHis: 2.431 ± 0.565
2.735GluIle: 2.735 ± 0.597
1.519GluLys: 1.519 ± 0.345
4.254GluLeu: 4.254 ± 0.445
1.215GluMet: 1.215 ± 0.282
3.039GluAsn: 3.039 ± 0.456
1.519GluPro: 1.519 ± 0.269
1.519GluGln: 1.519 ± 0.345
5.469GluArg: 5.469 ± 0.626
3.342GluSer: 3.342 ± 1.095
1.215GluThr: 1.215 ± 0.282
2.127GluVal: 2.127 ± 0.843
0.608GluTrp: 0.608 ± 0.141
0.912GluTyr: 0.912 ± 0.247
0.0GluXaa: 0.0 ± 0.0
Phe
0.608PheAla: 0.608 ± 0.141
1.823PheCys: 1.823 ± 0.493
5.166PheAsp: 5.166 ± 1.144
3.342PheGlu: 3.342 ± 0.652
2.431PhePhe: 2.431 ± 0.452
2.431PheGly: 2.431 ± 0.364
2.127PheHis: 2.127 ± 0.467
2.431PheIle: 2.431 ± 0.863
3.342PheLys: 3.342 ± 1.359
5.773PheLeu: 5.773 ± 1.019
0.0PheMet: 0.0 ± 0.0
1.519PheAsn: 1.519 ± 0.349
4.254PhePro: 4.254 ± 0.473
2.735PheGln: 2.735 ± 0.3
2.735PheArg: 2.735 ± 0.597
4.558PheSer: 4.558 ± 1.153
3.646PheThr: 3.646 ± 0.37
1.519PheVal: 1.519 ± 0.514
1.519PheTrp: 1.519 ± 0.269
2.431PheTyr: 2.431 ± 0.583
0.0PheXaa: 0.0 ± 0.0
Gly
4.558GlyAla: 4.558 ± 0.674
1.519GlyCys: 1.519 ± 0.345
3.039GlyAsp: 3.039 ± 0.916
1.519GlyGlu: 1.519 ± 0.371
1.519GlyPhe: 1.519 ± 0.349
3.95GlyGly: 3.95 ± 0.655
0.608GlyHis: 0.608 ± 0.401
2.431GlyIle: 2.431 ± 0.164
6.381GlyLys: 6.381 ± 1.078
4.558GlyLeu: 4.558 ± 0.507
0.0GlyMet: 0.0 ± 0.0
1.823GlyAsn: 1.823 ± 0.493
2.431GlyPro: 2.431 ± 0.785
0.608GlyGln: 0.608 ± 0.141
3.342GlyArg: 3.342 ± 1.356
4.254GlySer: 4.254 ± 0.524
5.773GlyThr: 5.773 ± 0.879
3.646GlyVal: 3.646 ± 0.847
0.912GlyTrp: 0.912 ± 0.374
2.127GlyTyr: 2.127 ± 0.275
0.0GlyXaa: 0.0 ± 0.0
His
2.431HisAla: 2.431 ± 0.565
0.912HisCys: 0.912 ± 0.247
1.215HisAsp: 1.215 ± 0.282
0.608HisGlu: 0.608 ± 0.141
0.608HisPhe: 0.608 ± 0.141
1.519HisGly: 1.519 ± 0.345
0.304HisHis: 0.304 ± 0.205
1.215HisIle: 1.215 ± 0.431
0.0HisLys: 0.0 ± 0.0
3.646HisLeu: 3.646 ± 0.829
0.608HisMet: 0.608 ± 0.141
2.431HisAsn: 2.431 ± 0.565
1.215HisPro: 1.215 ± 0.282
0.304HisGln: 0.304 ± 0.446
2.431HisArg: 2.431 ± 0.164
2.127HisSer: 2.127 ± 0.275
1.519HisThr: 1.519 ± 0.269
2.431HisVal: 2.431 ± 0.565
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.127IleAla: 2.127 ± 0.843
1.215IleCys: 1.215 ± 0.431
4.254IleAsp: 4.254 ± 0.692
1.215IleGlu: 1.215 ± 0.431
1.823IlePhe: 1.823 ± 0.502
3.039IleGly: 3.039 ± 0.249
0.304IleHis: 0.304 ± 0.205
2.431IleIle: 2.431 ± 0.583
3.039IleLys: 3.039 ± 0.549
1.823IleLeu: 1.823 ± 0.675
0.608IleMet: 0.608 ± 0.141
4.254IleAsn: 4.254 ± 0.661
3.95IlePro: 3.95 ± 0.867
1.823IleGln: 1.823 ± 0.31
0.912IleArg: 0.912 ± 0.374
2.127IleSer: 2.127 ± 0.584
2.431IleThr: 2.431 ± 0.364
2.127IleVal: 2.127 ± 0.467
1.215IleTrp: 1.215 ± 0.317
1.823IleTyr: 1.823 ± 0.528
0.0IleXaa: 0.0 ± 0.0
Lys
3.342LysAla: 3.342 ± 0.827
0.608LysCys: 0.608 ± 0.401
4.862LysAsp: 4.862 ± 0.689
1.823LysGlu: 1.823 ± 0.493
3.646LysPhe: 3.646 ± 0.356
2.735LysGly: 2.735 ± 0.3
0.608LysHis: 0.608 ± 0.141
3.039LysIle: 3.039 ± 0.691
3.039LysLys: 3.039 ± 0.249
5.166LysLeu: 5.166 ± 0.648
1.823LysMet: 1.823 ± 0.424
2.735LysAsn: 2.735 ± 0.324
0.912LysPro: 0.912 ± 0.34
1.823LysGln: 1.823 ± 0.493
3.039LysArg: 3.039 ± 0.444
2.735LysSer: 2.735 ± 0.729
4.254LysThr: 4.254 ± 0.502
7.9LysVal: 7.9 ± 0.655
0.912LysTrp: 0.912 ± 0.34
1.519LysTyr: 1.519 ± 0.7
0.0LysXaa: 0.0 ± 0.0
Leu
3.039LeuAla: 3.039 ± 0.229
2.431LeuCys: 2.431 ± 1.56
3.95LeuAsp: 3.95 ± 1.49
4.558LeuGlu: 4.558 ± 0.377
3.342LeuPhe: 3.342 ± 0.639
3.039LeuGly: 3.039 ± 0.881
3.646LeuHis: 3.646 ± 0.816
4.254LeuIle: 4.254 ± 0.979
4.862LeuLys: 4.862 ± 0.329
7.9LeuLeu: 7.9 ± 0.446
1.823LeuMet: 1.823 ± 0.464
3.646LeuAsn: 3.646 ± 0.153
5.166LeuPro: 5.166 ± 1.046
4.558LeuGln: 4.558 ± 0.715
7.596LeuArg: 7.596 ± 0.778
10.331LeuSer: 10.331 ± 1.738
3.95LeuThr: 3.95 ± 0.556
8.508LeuVal: 8.508 ± 1.218
2.127LeuTrp: 2.127 ± 0.467
1.519LeuTyr: 1.519 ± 0.269
0.0LeuXaa: 0.0 ± 0.0
Met
1.519MetAla: 1.519 ± 0.345
1.519MetCys: 1.519 ± 0.345
1.215MetAsp: 1.215 ± 0.282
0.608MetGlu: 0.608 ± 0.141
0.304MetPhe: 0.304 ± 0.205
0.912MetGly: 0.912 ± 0.34
0.0MetHis: 0.0 ± 0.0
1.215MetIle: 1.215 ± 0.282
0.0MetLys: 0.0 ± 0.0
3.039MetLeu: 3.039 ± 0.706
0.0MetMet: 0.0 ± 0.0
0.608MetAsn: 0.608 ± 0.141
0.608MetPro: 0.608 ± 0.728
0.304MetGln: 0.304 ± 0.427
0.608MetArg: 0.608 ± 0.141
3.646MetSer: 3.646 ± 0.829
1.519MetThr: 1.519 ± 0.269
0.304MetVal: 0.304 ± 0.427
0.0MetTrp: 0.0 ± 0.0
0.608MetTyr: 0.608 ± 0.141
0.0MetXaa: 0.0 ± 0.0
Asn
2.127AsnAla: 2.127 ± 0.719
0.0AsnCys: 0.0 ± 0.0
2.127AsnAsp: 2.127 ± 0.275
3.342AsnGlu: 3.342 ± 0.25
4.558AsnPhe: 4.558 ± 0.539
1.215AsnGly: 1.215 ± 0.282
1.215AsnHis: 1.215 ± 0.282
0.304AsnIle: 0.304 ± 0.427
2.735AsnLys: 2.735 ± 0.324
6.381AsnLeu: 6.381 ± 1.104
1.215AsnMet: 1.215 ± 0.276
3.342AsnAsn: 3.342 ± 0.731
4.254AsnPro: 4.254 ± 0.524
0.0AsnGln: 0.0 ± 0.0
0.304AsnArg: 0.304 ± 0.205
3.039AsnSer: 3.039 ± 0.443
2.127AsnThr: 2.127 ± 0.467
5.773AsnVal: 5.773 ± 1.642
0.0AsnTrp: 0.0 ± 0.0
1.823AsnTyr: 1.823 ± 0.424
0.0AsnXaa: 0.0 ± 0.0
Pro
4.254ProAla: 4.254 ± 0.717
0.304ProCys: 0.304 ± 0.446
2.735ProAsp: 2.735 ± 0.264
3.646ProGlu: 3.646 ± 0.37
2.431ProPhe: 2.431 ± 0.596
4.254ProGly: 4.254 ± 0.502
3.342ProHis: 3.342 ± 0.322
1.215ProIle: 1.215 ± 0.317
1.519ProLys: 1.519 ± 0.381
3.342ProLeu: 3.342 ± 0.849
0.0ProMet: 0.0 ± 0.0
1.215ProAsn: 1.215 ± 0.276
3.039ProPro: 3.039 ± 1.263
2.127ProGln: 2.127 ± 0.922
1.823ProArg: 1.823 ± 0.178
3.646ProSer: 3.646 ± 0.613
1.823ProThr: 1.823 ± 0.31
3.95ProVal: 3.95 ± 0.867
0.0ProTrp: 0.0 ± 0.0
1.519ProTyr: 1.519 ± 1.013
0.0ProXaa: 0.0 ± 0.0
Gln
2.431GlnAla: 2.431 ± 0.731
2.127GlnCys: 2.127 ± 0.467
0.0GlnAsp: 0.0 ± 0.0
1.215GlnGlu: 1.215 ± 0.62
1.823GlnPhe: 1.823 ± 0.528
2.127GlnGly: 2.127 ± 0.467
0.608GlnHis: 0.608 ± 0.141
1.519GlnIle: 1.519 ± 0.269
1.823GlnLys: 1.823 ± 0.424
4.862GlnLeu: 4.862 ± 0.26
1.215GlnMet: 1.215 ± 0.282
3.039GlnAsn: 3.039 ± 0.249
1.215GlnPro: 1.215 ± 0.317
2.431GlnGln: 2.431 ± 0.364
1.823GlnArg: 1.823 ± 0.904
1.215GlnSer: 1.215 ± 0.76
2.127GlnThr: 2.127 ± 0.719
3.646GlnVal: 3.646 ± 0.511
0.912GlnTrp: 0.912 ± 0.247
0.304GlnTyr: 0.304 ± 0.205
0.0GlnXaa: 0.0 ± 0.0
Arg
3.95ArgAla: 3.95 ± 0.274
1.215ArgCys: 1.215 ± 0.282
4.254ArgAsp: 4.254 ± 0.933
1.215ArgGlu: 1.215 ± 0.282
1.823ArgPhe: 1.823 ± 0.479
2.127ArgGly: 2.127 ± 0.262
1.215ArgHis: 1.215 ± 0.282
1.519ArgIle: 1.519 ± 0.345
1.823ArgLys: 1.823 ± 0.493
6.685ArgLeu: 6.685 ± 0.871
0.0ArgMet: 0.0 ± 0.0
3.039ArgAsn: 3.039 ± 0.667
2.127ArgPro: 2.127 ± 0.683
1.823ArgGln: 1.823 ± 0.493
4.862ArgArg: 4.862 ± 0.643
5.773ArgSer: 5.773 ± 0.198
3.95ArgThr: 3.95 ± 1.298
3.95ArgVal: 3.95 ± 0.656
0.912ArgTrp: 0.912 ± 0.34
2.127ArgTyr: 2.127 ± 0.584
0.0ArgXaa: 0.0 ± 0.0
Ser
3.646SerAla: 3.646 ± 1.387
2.127SerCys: 2.127 ± 0.262
5.773SerAsp: 5.773 ± 1.771
5.469SerGlu: 5.469 ± 0.644
4.254SerPhe: 4.254 ± 0.186
3.039SerGly: 3.039 ± 0.443
0.304SerHis: 0.304 ± 0.446
2.127SerIle: 2.127 ± 0.584
5.773SerLys: 5.773 ± 0.248
6.685SerLeu: 6.685 ± 1.322
2.127SerMet: 2.127 ± 0.584
2.431SerAsn: 2.431 ± 0.553
0.912SerPro: 0.912 ± 0.34
3.039SerGln: 3.039 ± 0.249
3.646SerArg: 3.646 ± 1.055
3.95SerSer: 3.95 ± 1.251
4.558SerThr: 4.558 ± 0.568
11.243SerVal: 11.243 ± 1.132
0.912SerTrp: 0.912 ± 0.374
2.431SerTyr: 2.431 ± 0.364
0.0SerXaa: 0.0 ± 0.0
Thr
3.342ThrAla: 3.342 ± 0.25
1.823ThrCys: 1.823 ± 0.178
3.95ThrAsp: 3.95 ± 0.416
1.823ThrGlu: 1.823 ± 0.178
3.646ThrPhe: 3.646 ± 0.37
2.735ThrGly: 2.735 ± 0.264
1.215ThrHis: 1.215 ± 0.431
2.431ThrIle: 2.431 ± 0.583
3.039ThrLys: 3.039 ± 0.997
4.558ThrLeu: 4.558 ± 0.834
0.304ThrMet: 0.304 ± 0.746
1.823ThrAsn: 1.823 ± 0.528
2.431ThrPro: 2.431 ± 0.164
3.039ThrGln: 3.039 ± 1.533
3.342ThrArg: 3.342 ± 0.499
3.646ThrSer: 3.646 ± 0.447
2.735ThrThr: 2.735 ± 0.483
8.204ThrVal: 8.204 ± 0.279
0.608ThrTrp: 0.608 ± 0.141
1.823ThrTyr: 1.823 ± 0.31
0.0ThrXaa: 0.0 ± 0.0
Val
7.293ValAla: 7.293 ± 0.83
2.127ValCys: 2.127 ± 0.467
10.027ValAsp: 10.027 ± 0.57
3.646ValGlu: 3.646 ± 0.37
5.773ValPhe: 5.773 ± 0.593
3.95ValGly: 3.95 ± 0.371
3.342ValHis: 3.342 ± 0.731
3.95ValIle: 3.95 ± 0.74
2.735ValLys: 2.735 ± 1.468
5.469ValLeu: 5.469 ± 1.191
2.127ValMet: 2.127 ± 0.262
5.773ValAsn: 5.773 ± 0.941
3.039ValPro: 3.039 ± 1.456
1.215ValGln: 1.215 ± 0.62
4.862ValArg: 4.862 ± 0.638
7.596ValSer: 7.596 ± 1.361
6.077ValThr: 6.077 ± 0.555
11.851ValVal: 11.851 ± 0.468
1.823ValTrp: 1.823 ± 0.493
4.558ValTyr: 4.558 ± 0.568
0.0ValXaa: 0.0 ± 0.0
Trp
0.912TrpAla: 0.912 ± 0.247
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.304TrpGlu: 0.304 ± 0.205
2.431TrpPhe: 2.431 ± 0.364
1.215TrpGly: 1.215 ± 0.282
0.304TrpHis: 0.304 ± 0.446
0.912TrpIle: 0.912 ± 0.374
1.823TrpLys: 1.823 ± 0.178
1.215TrpLeu: 1.215 ± 0.282
0.608TrpMet: 0.608 ± 0.141
0.304TrpAsn: 0.304 ± 0.205
0.608TrpPro: 0.608 ± 0.141
0.912TrpGln: 0.912 ± 0.374
0.304TrpArg: 0.304 ± 0.205
0.608TrpSer: 0.608 ± 0.141
0.304TrpThr: 0.304 ± 0.205
0.608TrpVal: 0.608 ± 0.728
0.608TrpTrp: 0.608 ± 0.141
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.431TyrAla: 2.431 ± 0.164
0.0TyrCys: 0.0 ± 0.0
2.431TyrAsp: 2.431 ± 0.452
2.431TyrGlu: 2.431 ± 0.164
0.912TyrPhe: 0.912 ± 0.247
0.912TyrGly: 0.912 ± 0.642
0.608TyrHis: 0.608 ± 0.141
1.519TyrIle: 1.519 ± 0.514
2.127TyrLys: 2.127 ± 0.467
3.039TyrLeu: 3.039 ± 0.997
0.912TyrMet: 0.912 ± 0.279
0.0TyrAsn: 0.0 ± 0.0
3.646TyrPro: 3.646 ± 0.479
1.215TyrGln: 1.215 ± 0.282
1.519TyrArg: 1.519 ± 0.269
1.519TyrSer: 1.519 ± 0.371
1.823TyrThr: 1.823 ± 0.493
2.431TyrVal: 2.431 ± 0.596
0.304TyrTrp: 0.304 ± 0.427
0.608TyrTyr: 0.608 ± 0.141
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.304XaaGln: 0.304 ± 0.205
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3292 amino acids)
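For readers who want to work with the values listed above, a cell line such as "5.469AlaAla: 5.469 ± 0.813" can be split into its parts with a regular expression. This is a minimal sketch that assumes the line layout shown on this page (displayed value, dipeptide name, value, error); the name parse_cell is hypothetical.

```python
import re

# Layout assumed from this page: <value><Aaa><Bbb>: <value> ± <error>
CELL = re.compile(
    r"^(?P<shown>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<mean>\d+(?:\.\d+)?) ± (?P<err>\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, mean, error) in per mille, or None.

    Row labels such as "Ala" do not match the pattern and give None.
    """
    match = CELL.match(line.strip())
    if match is None:
        return None
    return (
        match.group("first"),
        match.group("second"),
        float(match.group("mean")),
        float(match.group("err")),
    )
```

Applied line by line, this yields one record per dipeptide and skips the row labels, which makes it easy to sort the table or to filter for the most over-represented pairs.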

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
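The note does not spell out the procedure, so the following is only one plausible reading of a protein-level bootstrap: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the standard deviation of the replicates is reported as the error. The function names, the fixed seed, and the use of the standard deviation are assumptions made for this sketch.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per mille frequency of a one-letter dipeptide such as "AA"."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) proteins with replacement and
    recomputes the frequency; the spread of the replicates is returned
    as their sample standard deviation (an assumed choice).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)
```

With only 4 proteins in this proteome, each replicate is drawn from a very small pool, so error estimates obtained this way should be read with some caution.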

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021. doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski