Amino acid dipeptide frequency for Alteromonas phage vB_AcoS-R7M

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue, in the table.
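The calculation described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the Proteome-pI implementation: the names `dipeptide_frequencies` and `classify`, the toy sequences, the use of one-letter codes (the table uses three-letter codes), and the choice to count overlapping pairs only within a protein (never across protein boundaries) are all hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)
UNIFORM = 1 / len(STANDARD) ** 2    # 1/400 = 0.0025, i.e. 0.25%


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} for all 441 ordered pairs.

    Overlapping pairs are counted within each sequence; any letter that is
    not a standard amino acid is mapped to 'X'.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(fraction, expected=UNIFORM):
    """Label a dipeptide relative to the uniform expectation."""
    if fraction > expected:
        return "over"
    if fraction < expected:
        return "under"
    return "as expected"


freqs = dipeptide_frequencies(["MKAAL", "AAGX"])  # toy sequences, not real data
print(len(freqs))                                 # 441 = 21 * 21 possible dipeptides
print(freqs["AA"], classify(freqs["AA"]))
```

Comparing against 1/400 mirrors the uniform baseline mentioned in the text; a baseline built from the observed single amino acid frequencies would be a different (and stricter) notion of "expected".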

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
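As a worked example of this unit conversion, the sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 1/400 (2.5‰). The helper names are hypothetical; the AlaAla value of 6.64‰ is taken from the table.

```python
UNIFORM_PER_MILLE = 1000 / 400  # uniform expectation: 2.5 per mille (0.25%)


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PER_MILLE


ala_ala = 6.64  # AlaAla frequency in per mille, from the table
print(per_mille_to_fraction(ala_ala))  # roughly 0.00664
print(per_mille_to_percent(ala_ala))   # roughly 0.664 (%)
print(fold_vs_uniform(ala_ala))        # roughly 2.66 times the uniform expectation
```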

For more information see sequence space article on Wikipedia.

Residue order (group headings: first residue; entries within each group: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.64 ± 1.067
AlaCys: 0.732 ± 0.207
AlaAsp: 4.615 ± 0.538
AlaGlu: 5.74 ± 0.942
AlaPhe: 2.42 ± 0.466
AlaGly: 6.528 ± 0.698
AlaHis: 1.351 ± 0.315
AlaIle: 4.615 ± 0.582
AlaLys: 5.008 ± 0.997
AlaLeu: 6.753 ± 0.899
AlaMet: 2.138 ± 0.424
AlaAsn: 4.052 ± 0.478
AlaPro: 3.545 ± 0.438
AlaGln: 2.926 ± 0.439
AlaArg: 3.658 ± 0.678
AlaSer: 5.459 ± 0.658
AlaThr: 4.671 ± 0.547
AlaVal: 5.571 ± 0.454
AlaTrp: 1.069 ± 0.26
AlaTyr: 2.476 ± 0.415
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.506 ± 0.154
CysCys: 0.0 ± 0.0
CysAsp: 0.957 ± 0.245
CysGlu: 1.069 ± 0.266
CysPhe: 0.506 ± 0.141
CysGly: 0.675 ± 0.274
CysHis: 0.338 ± 0.16
CysIle: 0.675 ± 0.21
CysLys: 0.619 ± 0.203
CysLeu: 0.619 ± 0.194
CysMet: 0.225 ± 0.121
CysAsn: 0.675 ± 0.171
CysPro: 0.45 ± 0.183
CysGln: 0.506 ± 0.165
CysArg: 0.619 ± 0.167
CysSer: 0.563 ± 0.179
CysThr: 0.619 ± 0.223
CysVal: 0.619 ± 0.176
CysTrp: 0.338 ± 0.128
CysTyr: 0.506 ± 0.184
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.177 ± 0.498
AspCys: 0.732 ± 0.217
AspAsp: 3.489 ± 0.511
AspGlu: 4.502 ± 0.401
AspPhe: 1.97 ± 0.373
AspGly: 5.121 ± 0.654
AspHis: 1.125 ± 0.304
AspIle: 3.995 ± 0.415
AspLys: 3.039 ± 0.366
AspLeu: 5.29 ± 0.441
AspMet: 1.519 ± 0.248
AspAsn: 2.87 ± 0.404
AspPro: 2.814 ± 0.282
AspGln: 1.519 ± 0.267
AspArg: 2.476 ± 0.385
AspSer: 3.714 ± 0.413
AspThr: 3.376 ± 0.595
AspVal: 3.939 ± 0.537
AspTrp: 0.957 ± 0.227
AspTyr: 1.97 ± 0.335
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.697 ± 0.95
GluCys: 1.013 ± 0.234
GluAsp: 3.545 ± 0.333
GluGlu: 5.065 ± 0.8
GluPhe: 3.039 ± 0.499
GluGly: 5.346 ± 0.464
GluHis: 1.238 ± 0.278
GluIle: 3.883 ± 0.475
GluLys: 3.32 ± 0.589
GluLeu: 7.766 ± 0.693
GluMet: 1.688 ± 0.334
GluAsn: 2.983 ± 0.355
GluPro: 1.632 ± 0.342
GluGln: 2.87 ± 0.452
GluArg: 4.615 ± 0.603
GluSer: 3.827 ± 0.492
GluThr: 4.615 ± 0.557
GluVal: 5.121 ± 0.648
GluTrp: 1.576 ± 0.252
GluTyr: 2.42 ± 0.388
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.645 ± 0.471
PheCys: 0.169 ± 0.109
PheAsp: 3.602 ± 0.513
PheGlu: 2.589 ± 0.35
PhePhe: 0.9 ± 0.172
PheGly: 2.757 ± 0.332
PheHis: 0.506 ± 0.152
PheIle: 2.42 ± 0.39
PheLys: 2.757 ± 0.378
PheLeu: 2.195 ± 0.316
PheMet: 0.9 ± 0.217
PheAsn: 2.026 ± 0.354
PhePro: 1.745 ± 0.263
PheGln: 1.294 ± 0.288
PheArg: 2.026 ± 0.467
PheSer: 1.632 ± 0.274
PheThr: 2.251 ± 0.273
PheVal: 2.926 ± 0.375
PheTrp: 0.506 ± 0.175
PheTyr: 1.125 ± 0.202
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.346 ± 0.641
GlyCys: 1.013 ± 0.284
GlyAsp: 3.827 ± 0.506
GlyGlu: 5.402 ± 0.516
GlyPhe: 2.701 ± 0.354
GlyGly: 5.346 ± 0.715
GlyHis: 1.069 ± 0.279
GlyIle: 4.615 ± 0.536
GlyLys: 4.727 ± 0.489
GlyLeu: 4.952 ± 0.445
GlyMet: 1.745 ± 0.348
GlyAsn: 3.658 ± 0.508
GlyPro: 2.532 ± 0.412
GlyGln: 2.195 ± 0.462
GlyArg: 3.658 ± 0.468
GlySer: 5.402 ± 0.619
GlyThr: 4.896 ± 0.608
GlyVal: 4.952 ± 0.532
GlyTrp: 0.844 ± 0.288
GlyTyr: 3.939 ± 0.704
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.351 ± 0.33
HisCys: 0.225 ± 0.162
HisAsp: 0.619 ± 0.161
HisGlu: 1.182 ± 0.301
HisPhe: 0.844 ± 0.271
HisGly: 1.182 ± 0.28
HisHis: 0.506 ± 0.209
HisIle: 0.788 ± 0.178
HisLys: 1.688 ± 0.321
HisLeu: 1.857 ± 0.318
HisMet: 0.619 ± 0.236
HisAsn: 1.294 ± 0.285
HisPro: 0.506 ± 0.162
HisGln: 0.394 ± 0.13
HisArg: 0.675 ± 0.19
HisSer: 1.463 ± 0.29
HisThr: 0.732 ± 0.242
HisVal: 1.238 ± 0.268
HisTrp: 0.506 ± 0.193
HisTyr: 0.563 ± 0.217
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.389 ± 0.552
IleCys: 0.619 ± 0.142
IleAsp: 3.883 ± 0.424
IleGlu: 4.558 ± 0.499
IlePhe: 1.519 ± 0.379
IleGly: 3.545 ± 0.411
IleHis: 1.238 ± 0.245
IleIle: 2.532 ± 0.42
IleLys: 3.545 ± 0.403
IleLeu: 3.489 ± 0.49
IleMet: 1.576 ± 0.287
IleAsn: 3.77 ± 0.357
IlePro: 3.208 ± 0.463
IleGln: 2.42 ± 0.348
IleArg: 2.814 ± 0.379
IleSer: 3.658 ± 0.457
IleThr: 3.095 ± 0.385
IleVal: 3.995 ± 0.582
IleTrp: 0.675 ± 0.173
IleTyr: 1.857 ± 0.322
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.515 ± 1.068
LysCys: 0.563 ± 0.25
LysAsp: 3.208 ± 0.42
LysGlu: 3.264 ± 0.466
LysPhe: 2.138 ± 0.342
LysGly: 3.489 ± 0.477
LysHis: 1.913 ± 0.393
LysIle: 3.827 ± 0.496
LysLys: 2.42 ± 0.377
LysLeu: 5.177 ± 0.611
LysMet: 1.913 ± 0.321
LysAsn: 2.082 ± 0.306
LysPro: 2.645 ± 0.464
LysGln: 2.757 ± 0.473
LysArg: 3.376 ± 0.569
LysSer: 2.983 ± 0.45
LysThr: 3.264 ± 0.397
LysVal: 3.714 ± 0.459
LysTrp: 0.619 ± 0.191
LysTyr: 2.307 ± 0.441
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.853 ± 0.544
LeuCys: 0.844 ± 0.209
LeuAsp: 5.346 ± 0.593
LeuGlu: 5.909 ± 0.624
LeuPhe: 2.532 ± 0.427
LeuGly: 4.558 ± 0.508
LeuHis: 1.801 ± 0.302
LeuIle: 4.727 ± 0.613
LeuLys: 5.065 ± 0.501
LeuLeu: 5.515 ± 0.661
LeuMet: 2.364 ± 0.354
LeuAsn: 4.389 ± 0.465
LeuPro: 3.714 ± 0.397
LeuGln: 3.658 ± 0.56
LeuArg: 4.502 ± 0.615
LeuSer: 5.571 ± 0.722
LeuThr: 5.627 ± 0.486
LeuVal: 4.783 ± 0.678
LeuTrp: 0.844 ± 0.238
LeuTyr: 2.87 ± 0.431
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.857 ± 0.284
MetCys: 0.394 ± 0.165
MetAsp: 1.351 ± 0.282
MetGlu: 2.757 ± 0.4
MetPhe: 1.069 ± 0.27
MetGly: 1.857 ± 0.335
MetHis: 0.675 ± 0.192
MetIle: 0.338 ± 0.143
MetLys: 1.632 ± 0.312
MetLeu: 1.97 ± 0.341
MetMet: 0.113 ± 0.079
MetAsn: 0.675 ± 0.183
MetPro: 1.351 ± 0.318
MetGln: 1.069 ± 0.26
MetArg: 1.351 ± 0.274
MetSer: 1.913 ± 0.307
MetThr: 0.957 ± 0.287
MetVal: 2.251 ± 0.386
MetTrp: 0.338 ± 0.117
MetTyr: 0.788 ± 0.198
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.77 ± 0.519
AsnCys: 0.506 ± 0.204
AsnAsp: 3.827 ± 0.486
AsnGlu: 3.433 ± 0.351
AsnPhe: 1.745 ± 0.351
AsnGly: 5.402 ± 0.581
AsnHis: 0.619 ± 0.17
AsnIle: 2.757 ± 0.335
AsnLys: 2.589 ± 0.344
AsnLeu: 4.277 ± 0.543
AsnMet: 0.844 ± 0.246
AsnAsn: 2.532 ± 0.404
AsnPro: 2.082 ± 0.403
AsnGln: 1.801 ± 0.333
AsnArg: 2.251 ± 0.375
AsnSer: 2.701 ± 0.536
AsnThr: 3.095 ± 0.432
AsnVal: 2.983 ± 0.387
AsnTrp: 1.013 ± 0.228
AsnTyr: 1.745 ± 0.274
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.545 ± 0.43
ProCys: 0.45 ± 0.161
ProAsp: 2.757 ± 0.381
ProGlu: 3.376 ± 0.402
ProPhe: 2.138 ± 0.294
ProGly: 2.645 ± 0.407
ProHis: 0.394 ± 0.184
ProIle: 2.589 ± 0.395
ProLys: 1.913 ± 0.443
ProLeu: 3.151 ± 0.437
ProMet: 1.182 ± 0.281
ProAsn: 2.251 ± 0.303
ProPro: 1.801 ± 0.371
ProGln: 1.857 ± 0.382
ProArg: 1.688 ± 0.313
ProSer: 3.039 ± 0.444
ProThr: 3.264 ± 0.396
ProVal: 2.532 ± 0.407
ProTrp: 0.619 ± 0.195
ProTyr: 1.97 ± 0.396
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.376 ± 0.498
GlnCys: 0.506 ± 0.178
GlnAsp: 1.463 ± 0.251
GlnGlu: 2.026 ± 0.283
GlnPhe: 1.745 ± 0.395
GlnGly: 2.814 ± 0.38
GlnHis: 0.732 ± 0.286
GlnIle: 2.195 ± 0.33
GlnLys: 1.238 ± 0.247
GlnLeu: 3.883 ± 0.515
GlnMet: 1.238 ± 0.276
GlnAsn: 1.801 ± 0.424
GlnPro: 2.307 ± 0.689
GlnGln: 3.208 ± 1.364
GlnArg: 2.138 ± 0.426
GlnSer: 2.364 ± 0.299
GlnThr: 2.251 ± 0.41
GlnVal: 2.138 ± 0.298
GlnTrp: 0.732 ± 0.249
GlnTyr: 1.463 ± 0.277
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.77 ± 0.485
ArgCys: 0.563 ± 0.146
ArgAsp: 2.476 ± 0.286
ArgGlu: 3.827 ± 0.763
ArgPhe: 1.745 ± 0.267
ArgGly: 3.264 ± 0.4
ArgHis: 0.619 ± 0.184
ArgIle: 2.757 ± 0.443
ArgLys: 3.77 ± 0.552
ArgLeu: 4.615 ± 0.544
ArgMet: 1.125 ± 0.29
ArgAsn: 2.251 ± 0.336
ArgPro: 2.138 ± 0.341
ArgGln: 2.307 ± 0.339
ArgArg: 2.701 ± 0.493
ArgSer: 3.039 ± 0.434
ArgThr: 3.095 ± 0.403
ArgVal: 3.376 ± 0.462
ArgTrp: 0.675 ± 0.205
ArgTyr: 2.251 ± 0.34
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.627 ± 0.634
SerCys: 0.844 ± 0.186
SerAsp: 3.77 ± 0.408
SerGlu: 3.939 ± 0.54
SerPhe: 2.814 ± 0.425
SerGly: 5.684 ± 0.576
SerHis: 1.238 ± 0.279
SerIle: 3.208 ± 0.416
SerLys: 3.939 ± 0.587
SerLeu: 4.221 ± 0.516
SerMet: 1.407 ± 0.304
SerAsn: 3.264 ± 0.469
SerPro: 2.195 ± 0.44
SerGln: 2.026 ± 0.336
SerArg: 2.926 ± 0.413
SerSer: 4.615 ± 0.669
SerThr: 4.052 ± 0.66
SerVal: 4.446 ± 0.569
SerTrp: 0.563 ± 0.169
SerTyr: 3.095 ± 0.317
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.571 ± 0.696
ThrCys: 0.45 ± 0.159
ThrAsp: 4.052 ± 0.632
ThrGlu: 4.783 ± 0.588
ThrPhe: 2.476 ± 0.465
ThrGly: 4.389 ± 0.47
ThrHis: 1.407 ± 0.277
ThrIle: 4.221 ± 0.581
ThrLys: 2.983 ± 0.369
ThrLeu: 5.29 ± 0.776
ThrMet: 1.182 ± 0.257
ThrAsn: 2.364 ± 0.308
ThrPro: 2.87 ± 0.494
ThrGln: 2.251 ± 0.458
ThrArg: 2.926 ± 0.311
ThrSer: 3.883 ± 0.496
ThrThr: 4.783 ± 0.647
ThrVal: 4.277 ± 0.596
ThrTrp: 1.013 ± 0.414
ThrTyr: 2.138 ± 0.477
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.389 ± 0.545
ValCys: 0.732 ± 0.193
ValAsp: 4.052 ± 0.601
ValGlu: 5.571 ± 0.587
ValPhe: 2.307 ± 0.434
ValGly: 4.164 ± 0.425
ValHis: 0.9 ± 0.268
ValIle: 2.814 ± 0.307
ValLys: 4.727 ± 0.542
ValLeu: 4.615 ± 0.53
ValMet: 1.688 ± 0.371
ValAsn: 3.77 ± 0.422
ValPro: 3.376 ± 0.454
ValGln: 2.364 ± 0.314
ValArg: 3.545 ± 0.404
ValSer: 4.277 ± 0.62
ValThr: 5.065 ± 0.763
ValVal: 4.84 ± 0.552
ValTrp: 1.238 ± 0.261
ValTyr: 2.589 ± 0.317
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.182 ± 0.211
TrpCys: 0.338 ± 0.126
TrpAsp: 0.732 ± 0.193
TrpGlu: 0.957 ± 0.223
TrpPhe: 0.563 ± 0.222
TrpGly: 0.619 ± 0.184
TrpHis: 0.225 ± 0.113
TrpIle: 0.844 ± 0.329
TrpLys: 0.619 ± 0.218
TrpLeu: 1.745 ± 0.33
TrpMet: 0.281 ± 0.132
TrpAsn: 0.788 ± 0.214
TrpPro: 0.563 ± 0.127
TrpGln: 0.45 ± 0.208
TrpArg: 0.788 ± 0.239
TrpSer: 0.957 ± 0.291
TrpThr: 0.9 ± 0.256
TrpVal: 1.069 ± 0.221
TrpTrp: 0.394 ± 0.169
TrpTyr: 0.732 ± 0.223
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.757 ± 0.378
TyrCys: 0.394 ± 0.165
TyrAsp: 2.026 ± 0.314
TyrGlu: 2.364 ± 0.315
TyrPhe: 1.801 ± 0.309
TyrGly: 3.433 ± 0.48
TyrHis: 0.338 ± 0.141
TyrIle: 2.589 ± 0.353
TyrLys: 1.576 ± 0.285
TyrLeu: 3.095 ± 0.361
TyrMet: 0.9 ± 0.24
TyrAsn: 2.476 ± 0.343
TyrPro: 1.688 ± 0.353
TyrGln: 1.745 ± 0.328
TyrArg: 1.632 ± 0.313
TyrSer: 2.814 ± 0.459
TyrThr: 2.701 ± 0.447
TyrVal: 2.251 ± 0.331
TyrTrp: 0.225 ± 0.123
TyrTyr: 1.463 ± 0.345
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (17771 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
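The idea of bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread across resamples is reported as the error. This is only an illustration of the general technique, not the Proteome-pI code; the function names, the fixed seed, the one-letter codes, and the use of the sample standard deviation as the error measure are assumptions.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of `dipeptide` (two one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide  # True counts as 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample, dipeptide))
    return statistics.stdev(estimates)


toy = ["MKAAL", "AAGAA", "GLKV", "AAAA"]  # toy sequences, not real data
print(dipeptide_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.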

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski