Amino acid dipepetide frequency for Cimodo virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.09AlaAla: 5.09 ± 0.765
0.891AlaCys: 0.891 ± 0.276
3.691AlaAsp: 3.691 ± 0.397
4.072AlaGlu: 4.072 ± 0.412
3.181AlaPhe: 3.181 ± 0.513
3.691AlaGly: 3.691 ± 0.656
2.163AlaHis: 2.163 ± 0.425
5.345AlaIle: 5.345 ± 0.832
2.927AlaLys: 2.927 ± 0.752
7.763AlaLeu: 7.763 ± 0.824
1.909AlaMet: 1.909 ± 0.574
4.709AlaAsn: 4.709 ± 0.885
3.691AlaPro: 3.691 ± 0.481
4.072AlaGln: 4.072 ± 0.392
4.963AlaArg: 4.963 ± 0.759
5.09AlaSer: 5.09 ± 0.664
5.09AlaThr: 5.09 ± 0.806
4.709AlaVal: 4.709 ± 0.638
0.636AlaTrp: 0.636 ± 0.237
2.8AlaTyr: 2.8 ± 0.668
0.0AlaXaa: 0.0 ± 0.0
Cys
0.509CysAla: 0.509 ± 0.259
0.255CysCys: 0.255 ± 0.153
0.509CysAsp: 0.509 ± 0.21
0.636CysGlu: 0.636 ± 0.213
0.382CysPhe: 0.382 ± 0.181
0.636CysGly: 0.636 ± 0.292
0.127CysHis: 0.127 ± 0.156
0.636CysIle: 0.636 ± 0.252
0.0CysLys: 0.0 ± 0.0
0.891CysLeu: 0.891 ± 0.293
0.255CysMet: 0.255 ± 0.153
0.255CysAsn: 0.255 ± 0.146
0.509CysPro: 0.509 ± 0.304
0.0CysGln: 0.0 ± 0.0
0.891CysArg: 0.891 ± 0.258
0.764CysSer: 0.764 ± 0.416
0.764CysThr: 0.764 ± 0.273
1.018CysVal: 1.018 ± 0.327
0.509CysTrp: 0.509 ± 0.263
0.255CysTyr: 0.255 ± 0.214
0.0CysXaa: 0.0 ± 0.0
Asp
4.709AspAla: 4.709 ± 0.588
0.636AspCys: 0.636 ± 0.295
3.054AspAsp: 3.054 ± 0.592
3.818AspGlu: 3.818 ± 0.563
3.436AspPhe: 3.436 ± 0.757
3.309AspGly: 3.309 ± 0.664
1.018AspHis: 1.018 ± 0.528
3.691AspIle: 3.691 ± 0.824
2.163AspLys: 2.163 ± 0.211
4.327AspLeu: 4.327 ± 0.809
1.654AspMet: 1.654 ± 0.434
2.036AspAsn: 2.036 ± 0.54
4.072AspPro: 4.072 ± 0.838
2.672AspGln: 2.672 ± 0.382
3.309AspArg: 3.309 ± 0.401
4.072AspSer: 4.072 ± 0.47
2.927AspThr: 2.927 ± 0.471
5.345AspVal: 5.345 ± 0.752
1.145AspTrp: 1.145 ± 0.424
2.672AspTyr: 2.672 ± 0.48
0.0AspXaa: 0.0 ± 0.0
Glu
3.691GluAla: 3.691 ± 0.484
0.382GluCys: 0.382 ± 0.203
3.691GluAsp: 3.691 ± 0.618
2.672GluGlu: 2.672 ± 0.688
2.291GluPhe: 2.291 ± 0.449
2.036GluGly: 2.036 ± 0.561
1.909GluHis: 1.909 ± 0.493
3.818GluIle: 3.818 ± 0.72
2.545GluLys: 2.545 ± 0.575
5.472GluLeu: 5.472 ± 0.599
1.4GluMet: 1.4 ± 0.351
2.8GluAsn: 2.8 ± 0.562
2.291GluPro: 2.291 ± 0.338
3.563GluGln: 3.563 ± 0.569
2.8GluArg: 2.8 ± 0.454
3.181GluSer: 3.181 ± 0.586
2.8GluThr: 2.8 ± 0.486
3.436GluVal: 3.436 ± 0.438
0.636GluTrp: 0.636 ± 0.208
2.418GluTyr: 2.418 ± 0.492
0.0GluXaa: 0.0 ± 0.0
Phe
3.181PheAla: 3.181 ± 0.622
0.382PheCys: 0.382 ± 0.181
2.8PheAsp: 2.8 ± 0.594
1.909PheGlu: 1.909 ± 0.368
2.672PhePhe: 2.672 ± 0.431
3.563PheGly: 3.563 ± 0.582
1.145PheHis: 1.145 ± 0.397
2.418PheIle: 2.418 ± 0.592
1.018PheLys: 1.018 ± 0.238
2.927PheLeu: 2.927 ± 0.541
0.891PheMet: 0.891 ± 0.232
2.545PheAsn: 2.545 ± 0.406
1.909PhePro: 1.909 ± 0.468
2.291PheGln: 2.291 ± 0.637
2.418PheArg: 2.418 ± 0.633
3.054PheSer: 3.054 ± 0.582
3.436PheThr: 3.436 ± 0.573
2.672PheVal: 2.672 ± 0.498
0.764PheTrp: 0.764 ± 0.204
2.8PheTyr: 2.8 ± 0.447
0.0PheXaa: 0.0 ± 0.0
Gly
4.581GlyAla: 4.581 ± 0.963
0.255GlyCys: 0.255 ± 0.155
3.181GlyAsp: 3.181 ± 0.539
2.418GlyGlu: 2.418 ± 0.557
1.909GlyPhe: 1.909 ± 0.338
2.036GlyGly: 2.036 ± 0.384
1.145GlyHis: 1.145 ± 0.351
3.181GlyIle: 3.181 ± 0.557
2.545GlyLys: 2.545 ± 0.631
4.072GlyLeu: 4.072 ± 0.53
1.4GlyMet: 1.4 ± 0.336
2.545GlyAsn: 2.545 ± 0.463
1.782GlyPro: 1.782 ± 0.507
2.163GlyGln: 2.163 ± 0.344
2.672GlyArg: 2.672 ± 0.487
3.436GlySer: 3.436 ± 0.744
3.436GlyThr: 3.436 ± 0.645
3.436GlyVal: 3.436 ± 0.631
0.509GlyTrp: 0.509 ± 0.31
2.291GlyTyr: 2.291 ± 0.544
0.0GlyXaa: 0.0 ± 0.0
His
1.654HisAla: 1.654 ± 0.347
0.0HisCys: 0.0 ± 0.0
1.527HisAsp: 1.527 ± 0.354
0.509HisGlu: 0.509 ± 0.344
1.273HisPhe: 1.273 ± 0.412
1.654HisGly: 1.654 ± 0.614
0.382HisHis: 0.382 ± 0.197
1.909HisIle: 1.909 ± 0.436
1.018HisLys: 1.018 ± 0.404
2.163HisLeu: 2.163 ± 0.64
0.382HisMet: 0.382 ± 0.227
0.891HisAsn: 0.891 ± 0.269
1.527HisPro: 1.527 ± 0.547
0.764HisGln: 0.764 ± 0.28
1.654HisArg: 1.654 ± 0.296
1.145HisSer: 1.145 ± 0.473
1.654HisThr: 1.654 ± 0.593
1.654HisVal: 1.654 ± 0.589
0.127HisTrp: 0.127 ± 0.102
0.509HisTyr: 0.509 ± 0.236
0.0HisXaa: 0.0 ± 0.0
Ile
5.345IleAla: 5.345 ± 0.706
0.382IleCys: 0.382 ± 0.188
4.836IleAsp: 4.836 ± 0.977
2.927IleGlu: 2.927 ± 0.618
2.163IlePhe: 2.163 ± 0.429
4.2IleGly: 4.2 ± 0.72
1.145IleHis: 1.145 ± 0.445
3.054IleIle: 3.054 ± 0.502
2.927IleLys: 2.927 ± 0.694
6.236IleLeu: 6.236 ± 0.57
1.4IleMet: 1.4 ± 0.319
3.436IleAsn: 3.436 ± 0.611
3.945IlePro: 3.945 ± 0.876
2.036IleGln: 2.036 ± 0.34
2.672IleArg: 2.672 ± 0.596
5.727IleSer: 5.727 ± 1.1
5.218IleThr: 5.218 ± 0.941
4.2IleVal: 4.2 ± 0.692
0.255IleTrp: 0.255 ± 0.217
2.291IleTyr: 2.291 ± 0.411
0.0IleXaa: 0.0 ± 0.0
Lys
2.036LysAla: 2.036 ± 0.458
0.636LysCys: 0.636 ± 0.189
2.418LysAsp: 2.418 ± 0.527
2.545LysGlu: 2.545 ± 0.553
1.654LysPhe: 1.654 ± 0.381
1.909LysGly: 1.909 ± 0.71
1.527LysHis: 1.527 ± 0.481
2.927LysIle: 2.927 ± 0.968
1.654LysLys: 1.654 ± 0.479
5.218LysLeu: 5.218 ± 0.617
1.018LysMet: 1.018 ± 0.386
1.4LysAsn: 1.4 ± 0.526
2.163LysPro: 2.163 ± 0.587
1.782LysGln: 1.782 ± 0.329
3.691LysArg: 3.691 ± 0.856
3.054LysSer: 3.054 ± 0.528
2.163LysThr: 2.163 ± 0.483
2.672LysVal: 2.672 ± 0.733
0.636LysTrp: 0.636 ± 0.378
2.036LysTyr: 2.036 ± 0.309
0.0LysXaa: 0.0 ± 0.0
Leu
7.508LeuAla: 7.508 ± 0.884
1.4LeuCys: 1.4 ± 0.472
4.963LeuAsp: 4.963 ± 0.947
3.945LeuGlu: 3.945 ± 0.373
2.8LeuPhe: 2.8 ± 0.395
3.691LeuGly: 3.691 ± 0.393
2.163LeuHis: 2.163 ± 0.581
5.09LeuIle: 5.09 ± 0.937
4.709LeuLys: 4.709 ± 0.555
8.908LeuLeu: 8.908 ± 1.083
2.672LeuMet: 2.672 ± 0.34
4.581LeuAsn: 4.581 ± 0.736
6.108LeuPro: 6.108 ± 0.484
4.2LeuGln: 4.2 ± 0.899
5.09LeuArg: 5.09 ± 0.717
7.254LeuSer: 7.254 ± 0.981
4.963LeuThr: 4.963 ± 0.963
4.836LeuVal: 4.836 ± 0.744
0.891LeuTrp: 0.891 ± 0.35
3.563LeuTyr: 3.563 ± 0.579
0.0LeuXaa: 0.0 ± 0.0
Met
0.891MetAla: 0.891 ± 0.342
0.382MetCys: 0.382 ± 0.147
1.018MetAsp: 1.018 ± 0.215
1.527MetGlu: 1.527 ± 0.376
1.273MetPhe: 1.273 ± 0.317
1.273MetGly: 1.273 ± 0.283
0.636MetHis: 0.636 ± 0.265
1.4MetIle: 1.4 ± 0.322
1.145MetLys: 1.145 ± 0.28
2.036MetLeu: 2.036 ± 0.35
0.509MetMet: 0.509 ± 0.231
1.145MetAsn: 1.145 ± 0.305
1.145MetPro: 1.145 ± 0.426
0.891MetGln: 0.891 ± 0.279
1.654MetArg: 1.654 ± 0.433
1.273MetSer: 1.273 ± 0.415
2.291MetThr: 2.291 ± 0.482
1.273MetVal: 1.273 ± 0.288
0.127MetTrp: 0.127 ± 0.102
0.382MetTyr: 0.382 ± 0.235
0.0MetXaa: 0.0 ± 0.0
Asn
3.945AsnAla: 3.945 ± 0.796
0.255AsnCys: 0.255 ± 0.17
2.036AsnAsp: 2.036 ± 0.439
2.418AsnGlu: 2.418 ± 0.306
1.782AsnPhe: 1.782 ± 0.517
2.163AsnGly: 2.163 ± 0.626
0.891AsnHis: 0.891 ± 0.261
3.054AsnIle: 3.054 ± 0.403
2.036AsnLys: 2.036 ± 0.337
4.709AsnLeu: 4.709 ± 0.771
0.509AsnMet: 0.509 ± 0.247
2.545AsnAsn: 2.545 ± 0.534
3.181AsnPro: 3.181 ± 0.539
2.672AsnGln: 2.672 ± 0.307
2.8AsnArg: 2.8 ± 0.643
2.8AsnSer: 2.8 ± 0.519
1.782AsnThr: 1.782 ± 0.409
4.072AsnVal: 4.072 ± 0.508
0.636AsnTrp: 0.636 ± 0.23
2.672AsnTyr: 2.672 ± 0.634
0.0AsnXaa: 0.0 ± 0.0
Pro
4.581ProAla: 4.581 ± 0.964
1.018ProCys: 1.018 ± 0.559
3.181ProAsp: 3.181 ± 0.607
3.436ProGlu: 3.436 ± 0.615
3.309ProPhe: 3.309 ± 0.526
2.291ProGly: 2.291 ± 0.521
1.018ProHis: 1.018 ± 0.395
4.2ProIle: 4.2 ± 0.327
3.691ProLys: 3.691 ± 0.671
3.054ProLeu: 3.054 ± 0.67
0.891ProMet: 0.891 ± 0.276
2.036ProAsn: 2.036 ± 0.673
3.054ProPro: 3.054 ± 0.465
2.545ProGln: 2.545 ± 0.569
2.418ProArg: 2.418 ± 0.516
5.345ProSer: 5.345 ± 1.308
3.818ProThr: 3.818 ± 0.687
3.563ProVal: 3.563 ± 0.917
0.255ProTrp: 0.255 ± 0.143
1.654ProTyr: 1.654 ± 0.319
0.0ProXaa: 0.0 ± 0.0
Gln
3.818GlnAla: 3.818 ± 0.435
0.382GlnCys: 0.382 ± 0.278
3.309GlnAsp: 3.309 ± 0.576
1.782GlnGlu: 1.782 ± 0.399
2.545GlnPhe: 2.545 ± 0.478
1.654GlnGly: 1.654 ± 0.319
1.018GlnHis: 1.018 ± 0.489
3.945GlnIle: 3.945 ± 0.834
2.291GlnLys: 2.291 ± 0.705
4.454GlnLeu: 4.454 ± 0.659
1.145GlnMet: 1.145 ± 0.337
2.163GlnAsn: 2.163 ± 0.618
2.291GlnPro: 2.291 ± 0.537
2.545GlnGln: 2.545 ± 0.635
2.163GlnArg: 2.163 ± 0.608
3.181GlnSer: 3.181 ± 0.731
2.418GlnThr: 2.418 ± 0.6
2.163GlnVal: 2.163 ± 0.343
0.636GlnTrp: 0.636 ± 0.243
2.418GlnTyr: 2.418 ± 0.625
0.0GlnXaa: 0.0 ± 0.0
Arg
4.454ArgAla: 4.454 ± 0.645
0.509ArgCys: 0.509 ± 0.271
4.454ArgAsp: 4.454 ± 0.53
3.945ArgGlu: 3.945 ± 0.712
2.163ArgPhe: 2.163 ± 0.42
2.8ArgGly: 2.8 ± 0.757
1.018ArgHis: 1.018 ± 0.38
3.309ArgIle: 3.309 ± 0.36
2.163ArgLys: 2.163 ± 0.443
4.836ArgLeu: 4.836 ± 0.955
1.145ArgMet: 1.145 ± 0.359
1.909ArgAsn: 1.909 ± 0.349
2.163ArgPro: 2.163 ± 0.48
1.909ArgGln: 1.909 ± 0.528
4.454ArgArg: 4.454 ± 0.584
5.472ArgSer: 5.472 ± 0.871
3.181ArgThr: 3.181 ± 0.734
4.709ArgVal: 4.709 ± 1.074
1.145ArgTrp: 1.145 ± 0.499
2.672ArgTyr: 2.672 ± 0.569
0.0ArgXaa: 0.0 ± 0.0
Ser
6.108SerAla: 6.108 ± 0.749
0.382SerCys: 0.382 ± 0.232
4.581SerAsp: 4.581 ± 0.51
4.836SerGlu: 4.836 ± 0.716
3.563SerPhe: 3.563 ± 0.823
3.691SerGly: 3.691 ± 0.522
1.4SerHis: 1.4 ± 0.348
5.218SerIle: 5.218 ± 0.828
2.927SerLys: 2.927 ± 0.745
6.236SerLeu: 6.236 ± 0.995
1.527SerMet: 1.527 ± 0.513
2.672SerAsn: 2.672 ± 0.498
5.218SerPro: 5.218 ± 0.89
3.181SerGln: 3.181 ± 0.659
5.09SerArg: 5.09 ± 0.858
5.599SerSer: 5.599 ± 0.775
5.09SerThr: 5.09 ± 1.353
5.09SerVal: 5.09 ± 0.875
0.636SerTrp: 0.636 ± 0.252
3.309SerTyr: 3.309 ± 0.505
0.0SerXaa: 0.0 ± 0.0
Thr
3.309ThrAla: 3.309 ± 0.405
0.127ThrCys: 0.127 ± 0.109
3.436ThrAsp: 3.436 ± 0.519
2.672ThrGlu: 2.672 ± 0.66
3.309ThrPhe: 3.309 ± 0.612
2.927ThrGly: 2.927 ± 0.87
1.4ThrHis: 1.4 ± 0.3
4.836ThrIle: 4.836 ± 0.525
3.563ThrLys: 3.563 ± 0.647
5.599ThrLeu: 5.599 ± 0.399
0.891ThrMet: 0.891 ± 0.251
2.927ThrAsn: 2.927 ± 0.348
4.454ThrPro: 4.454 ± 0.495
2.291ThrGln: 2.291 ± 0.334
2.8ThrArg: 2.8 ± 0.314
5.727ThrSer: 5.727 ± 0.798
3.818ThrThr: 3.818 ± 0.707
3.945ThrVal: 3.945 ± 0.87
0.636ThrTrp: 0.636 ± 0.241
3.691ThrTyr: 3.691 ± 0.546
0.0ThrXaa: 0.0 ± 0.0
Val
5.981ValAla: 5.981 ± 0.901
0.509ValCys: 0.509 ± 0.185
3.945ValAsp: 3.945 ± 0.669
4.709ValGlu: 4.709 ± 0.864
2.672ValPhe: 2.672 ± 0.327
2.8ValGly: 2.8 ± 0.698
0.891ValHis: 0.891 ± 0.332
3.563ValIle: 3.563 ± 0.668
1.909ValLys: 1.909 ± 0.519
5.218ValLeu: 5.218 ± 0.541
1.527ValMet: 1.527 ± 0.426
3.309ValAsn: 3.309 ± 0.759
3.563ValPro: 3.563 ± 0.358
3.945ValGln: 3.945 ± 0.718
4.2ValArg: 4.2 ± 0.694
5.599ValSer: 5.599 ± 0.891
4.072ValThr: 4.072 ± 0.667
3.945ValVal: 3.945 ± 0.588
0.127ValTrp: 0.127 ± 0.116
2.418ValTyr: 2.418 ± 0.252
0.0ValXaa: 0.0 ± 0.0
Trp
1.018TrpAla: 1.018 ± 0.229
0.127TrpCys: 0.127 ± 0.132
0.891TrpAsp: 0.891 ± 0.389
0.509TrpGlu: 0.509 ± 0.198
0.255TrpPhe: 0.255 ± 0.142
0.382TrpGly: 0.382 ± 0.181
0.127TrpHis: 0.127 ± 0.109
0.636TrpIle: 0.636 ± 0.333
0.891TrpLys: 0.891 ± 0.265
1.4TrpLeu: 1.4 ± 0.308
0.0TrpMet: 0.0 ± 0.0
0.891TrpAsn: 0.891 ± 0.349
0.255TrpPro: 0.255 ± 0.147
0.891TrpGln: 0.891 ± 0.309
0.509TrpArg: 0.509 ± 0.263
0.636TrpSer: 0.636 ± 0.318
1.018TrpThr: 1.018 ± 0.337
0.0TrpVal: 0.0 ± 0.0
0.127TrpTrp: 0.127 ± 0.112
0.509TrpTyr: 0.509 ± 0.218
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.072TyrAla: 4.072 ± 0.613
0.891TyrCys: 0.891 ± 0.292
2.545TyrAsp: 2.545 ± 0.497
2.545TyrGlu: 2.545 ± 0.731
2.163TyrPhe: 2.163 ± 0.324
2.291TyrGly: 2.291 ± 0.499
1.273TyrHis: 1.273 ± 0.31
2.163TyrIle: 2.163 ± 0.555
1.018TyrLys: 1.018 ± 0.248
3.818TyrLeu: 3.818 ± 0.564
1.018TyrMet: 1.018 ± 0.435
2.291TyrAsn: 2.291 ± 0.427
2.036TyrPro: 2.036 ± 0.789
1.909TyrGln: 1.909 ± 0.39
2.163TyrArg: 2.163 ± 0.547
3.945TyrSer: 3.945 ± 0.603
2.418TyrThr: 2.418 ± 0.633
2.163TyrVal: 2.163 ± 0.517
0.636TyrTrp: 0.636 ± 0.292
1.4TyrTyr: 1.4 ± 0.42
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (7859 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski