Amino acid dipeptide frequency for Aeromonas phage Ahp2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
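As a minimal sketch of how such a frequency could be computed, the Python snippet below counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_per_mille`, the use of one-letter codes, and the choice to count pairs only within each protein (never across protein boundaries) are assumptions made for illustration; this page does not state how the database itself handles these details.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: sequences use one-letter amino acid codes and pairs are
    counted with overlap inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Every position except the last one starts one dipeptide.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the phage proteome).
freqs = dipeptide_per_mille(["MAAK", "AAG"])
print(freqs["AA"])
```

In the toy input there are five dipeptides in total (MA, AA, AK, AA, AG), so AA accounts for two fifths of them.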

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, each one would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
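The expectation above can be checked with a few lines of Python. This is only a sketch: the page does not say exactly which rule drives the red and blue marking, so the plain comparison against the uniform expectation used here (the helper `classify` and its labels) is an assumption. The three example values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / 400 = 0.25% = 2.5 per mille.
expected_per_mille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values in per mille, copied from the table.
examples = {"AlaAla": 12.432, "CysCys": 0.139, "AspPro": 2.5}
labels = {pair: classify(value) for pair, value in examples.items()}
print(expected_per_mille, labels)
```

A uniform baseline ignores amino acid composition: a dipeptide made of two abundant residues (such as AlaAla) will exceed 2.5‰ even without any pairing preference.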

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
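For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are hypothetical and the snippet is only a small illustration of the arithmetic.

```python
import math


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# 12.432 per mille is the AlaAla value from the table.
fraction = per_mille_to_fraction(12.432)
percent = per_mille_to_percent(12.432)
print(fraction, percent)
assert math.isclose(percent, fraction * 100.0)
```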

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.432 ± 1.373
AlaCys: 1.181 ± 0.354
AlaAsp: 5.279 ± 0.715
AlaGlu: 6.181 ± 0.731
AlaPhe: 2.778 ± 0.465
AlaGly: 8.612 ± 1.011
AlaHis: 2.223 ± 0.484
AlaIle: 6.112 ± 0.6
AlaLys: 6.807 ± 1.004
AlaLeu: 9.515 ± 1.047
AlaMet: 3.681 ± 0.521
AlaAsn: 2.223 ± 0.301
AlaPro: 3.889 ± 0.43
AlaGln: 4.167 ± 0.596
AlaArg: 4.167 ± 0.45
AlaSer: 5.487 ± 0.808
AlaThr: 6.32 ± 0.717
AlaVal: 7.57 ± 0.758
AlaTrp: 0.972 ± 0.222
AlaTyr: 2.709 ± 0.403
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.042 ± 0.292
CysCys: 0.139 ± 0.103
CysAsp: 0.764 ± 0.185
CysGlu: 0.833 ± 0.266
CysPhe: 0.278 ± 0.138
CysGly: 1.875 ± 0.423
CysHis: 0.417 ± 0.158
CysIle: 0.764 ± 0.218
CysLys: 0.764 ± 0.229
CysLeu: 1.736 ± 0.359
CysMet: 0.139 ± 0.089
CysAsn: 0.417 ± 0.182
CysPro: 0.903 ± 0.247
CysGln: 0.625 ± 0.207
CysArg: 0.486 ± 0.16
CysSer: 0.695 ± 0.238
CysThr: 0.833 ± 0.197
CysVal: 0.556 ± 0.214
CysTrp: 0.417 ± 0.2
CysTyr: 0.139 ± 0.108
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.904 ± 0.53
AspCys: 0.833 ± 0.252
AspAsp: 2.709 ± 0.336
AspGlu: 2.848 ± 0.436
AspPhe: 1.528 ± 0.276
AspGly: 5.07 ± 0.585
AspHis: 0.903 ± 0.247
AspIle: 3.056 ± 0.417
AspLys: 3.195 ± 0.539
AspLeu: 4.931 ± 0.57
AspMet: 1.806 ± 0.372
AspAsn: 1.945 ± 0.446
AspPro: 2.5 ± 0.387
AspGln: 1.945 ± 0.347
AspArg: 2.987 ± 0.565
AspSer: 2.639 ± 0.343
AspThr: 3.056 ± 0.407
AspVal: 3.542 ± 0.4
AspTrp: 1.459 ± 0.35
AspTyr: 1.042 ± 0.25
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.084 ± 0.581
GluCys: 0.695 ± 0.255
GluAsp: 2.223 ± 0.445
GluGlu: 4.584 ± 0.643
GluPhe: 1.875 ± 0.359
GluGly: 4.792 ± 0.582
GluHis: 1.181 ± 0.245
GluIle: 3.612 ± 0.637
GluLys: 4.167 ± 0.456
GluLeu: 6.737 ± 0.711
GluMet: 2.153 ± 0.492
GluAsn: 1.389 ± 0.231
GluPro: 1.667 ± 0.399
GluGln: 3.334 ± 0.548
GluArg: 4.723 ± 0.541
GluSer: 3.612 ± 0.451
GluThr: 4.306 ± 0.504
GluVal: 4.931 ± 0.551
GluTrp: 1.32 ± 0.349
GluTyr: 1.597 ± 0.312
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.014 ± 0.383
PheCys: 0.556 ± 0.179
PheAsp: 2.153 ± 0.474
PheGlu: 2.084 ± 0.375
PhePhe: 0.625 ± 0.177
PheGly: 2.431 ± 0.43
PheHis: 0.903 ± 0.22
PheIle: 1.806 ± 0.348
PheLys: 1.597 ± 0.375
PheLeu: 2.848 ± 0.474
PheMet: 0.903 ± 0.227
PheAsn: 1.181 ± 0.322
PhePro: 1.806 ± 0.365
PheGln: 1.25 ± 0.258
PheArg: 1.667 ± 0.335
PheSer: 1.32 ± 0.316
PheThr: 1.945 ± 0.388
PheVal: 1.597 ± 0.274
PheTrp: 0.833 ± 0.262
PheTyr: 0.625 ± 0.249
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.293 ± 0.973
GlyCys: 0.972 ± 0.301
GlyAsp: 3.959 ± 0.595
GlyGlu: 5.487 ± 0.571
GlyPhe: 3.403 ± 0.506
GlyGly: 6.598 ± 0.903
GlyHis: 2.153 ± 0.421
GlyIle: 5.14 ± 0.602
GlyLys: 4.723 ± 0.529
GlyLeu: 6.459 ± 0.74
GlyMet: 2.709 ± 0.367
GlyAsn: 2.639 ± 0.568
GlyPro: 2.431 ± 0.379
GlyGln: 3.264 ± 0.475
GlyArg: 4.792 ± 0.535
GlySer: 4.098 ± 0.667
GlyThr: 4.237 ± 0.559
GlyVal: 5.626 ± 0.64
GlyTrp: 1.389 ± 0.255
GlyTyr: 1.945 ± 0.314
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.32 ± 0.3
HisCys: 0.417 ± 0.162
HisAsp: 1.25 ± 0.32
HisGlu: 1.389 ± 0.327
HisPhe: 0.903 ± 0.255
HisGly: 2.084 ± 0.42
HisHis: 1.111 ± 0.266
HisIle: 1.736 ± 0.344
HisLys: 1.042 ± 0.246
HisLeu: 2.223 ± 0.446
HisMet: 0.278 ± 0.129
HisAsn: 0.903 ± 0.252
HisPro: 1.389 ± 0.297
HisGln: 1.528 ± 0.34
HisArg: 1.597 ± 0.311
HisSer: 1.181 ± 0.265
HisThr: 1.459 ± 0.38
HisVal: 1.25 ± 0.369
HisTrp: 0.695 ± 0.2
HisTyr: 0.833 ± 0.269
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.417 ± 0.554
IleCys: 0.556 ± 0.171
IleAsp: 3.264 ± 0.471
IleGlu: 3.889 ± 0.495
IlePhe: 0.486 ± 0.186
IleGly: 3.473 ± 0.433
IleHis: 1.806 ± 0.378
IleIle: 2.5 ± 0.448
IleLys: 3.195 ± 0.48
IleLeu: 3.542 ± 0.5
IleMet: 1.459 ± 0.265
IleAsn: 1.875 ± 0.359
IlePro: 2.987 ± 0.465
IleGln: 1.528 ± 0.261
IleArg: 3.473 ± 0.576
IleSer: 3.82 ± 0.524
IleThr: 3.959 ± 0.448
IleVal: 3.542 ± 0.552
IleTrp: 1.111 ± 0.31
IleTyr: 0.903 ± 0.273
IleXaa: 0.0 ± 0.0
Lys
LysAla: 9.098 ± 0.961
LysCys: 0.764 ± 0.205
LysAsp: 2.848 ± 0.435
LysGlu: 3.959 ± 0.606
LysPhe: 1.667 ± 0.37
LysGly: 5.001 ± 0.605
LysHis: 1.042 ± 0.3
LysIle: 1.667 ± 0.334
LysLys: 3.334 ± 0.545
LysLeu: 5.765 ± 0.527
LysMet: 2.361 ± 0.491
LysAsn: 1.528 ± 0.385
LysPro: 2.848 ± 0.328
LysGln: 1.806 ± 0.369
LysArg: 3.195 ± 0.463
LysSer: 2.5 ± 0.424
LysThr: 3.334 ± 0.442
LysVal: 5.765 ± 0.616
LysTrp: 1.459 ± 0.265
LysTyr: 1.25 ± 0.27
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.196 ± 1.01
LeuCys: 1.181 ± 0.27
LeuAsp: 5.209 ± 0.564
LeuGlu: 7.223 ± 0.693
LeuPhe: 1.875 ± 0.297
LeuGly: 5.765 ± 0.74
LeuHis: 2.153 ± 0.321
LeuIle: 4.028 ± 0.511
LeuLys: 5.695 ± 0.647
LeuLeu: 6.598 ± 0.658
LeuMet: 2.778 ± 0.461
LeuAsn: 3.195 ± 0.389
LeuPro: 3.681 ± 0.509
LeuGln: 2.361 ± 0.395
LeuArg: 6.181 ± 0.67
LeuSer: 4.445 ± 0.512
LeuThr: 6.112 ± 0.607
LeuVal: 4.792 ± 0.631
LeuTrp: 1.806 ± 0.311
LeuTyr: 1.945 ± 0.271
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.306 ± 0.807
MetCys: 0.417 ± 0.185
MetAsp: 1.597 ± 0.348
MetGlu: 2.014 ± 0.411
MetPhe: 0.833 ± 0.237
MetGly: 1.945 ± 0.308
MetHis: 0.625 ± 0.201
MetIle: 1.667 ± 0.309
MetLys: 2.848 ± 0.452
MetLeu: 1.25 ± 0.306
MetMet: 1.32 ± 0.314
MetAsn: 1.111 ± 0.286
MetPro: 1.25 ± 0.317
MetGln: 0.486 ± 0.188
MetArg: 1.806 ± 0.371
MetSer: 2.709 ± 0.431
MetThr: 2.639 ± 0.406
MetVal: 2.709 ± 0.419
MetTrp: 0.486 ± 0.186
MetTyr: 0.903 ± 0.225
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.056 ± 0.372
AsnCys: 0.417 ± 0.141
AsnAsp: 1.389 ± 0.296
AsnGlu: 2.431 ± 0.354
AsnPhe: 0.764 ± 0.199
AsnGly: 3.542 ± 0.464
AsnHis: 1.181 ± 0.375
AsnIle: 1.389 ± 0.268
AsnLys: 1.736 ± 0.438
AsnLeu: 2.5 ± 0.419
AsnMet: 0.625 ± 0.209
AsnAsn: 0.903 ± 0.27
AsnPro: 1.806 ± 0.331
AsnGln: 1.806 ± 0.285
AsnArg: 2.153 ± 0.352
AsnSer: 1.875 ± 0.343
AsnThr: 1.32 ± 0.284
AsnVal: 2.014 ± 0.459
AsnTrp: 0.695 ± 0.184
AsnTyr: 0.764 ± 0.246
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.681 ± 0.414
ProCys: 0.625 ± 0.223
ProAsp: 2.709 ± 0.475
ProGlu: 3.473 ± 0.413
ProPhe: 1.667 ± 0.344
ProGly: 3.264 ± 0.586
ProHis: 0.764 ± 0.227
ProIle: 1.945 ± 0.311
ProLys: 3.125 ± 0.431
ProLeu: 3.473 ± 0.444
ProMet: 1.181 ± 0.267
ProAsn: 0.972 ± 0.299
ProPro: 1.528 ± 0.471
ProGln: 1.32 ± 0.297
ProArg: 1.875 ± 0.329
ProSer: 3.403 ± 0.487
ProThr: 1.528 ± 0.352
ProVal: 3.889 ± 0.54
ProTrp: 0.625 ± 0.212
ProTyr: 0.764 ± 0.244
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.028 ± 0.549
GlnCys: 0.417 ± 0.189
GlnAsp: 1.32 ± 0.327
GlnGlu: 3.334 ± 0.373
GlnPhe: 1.111 ± 0.276
GlnGly: 2.709 ± 0.391
GlnHis: 0.903 ± 0.32
GlnIle: 1.667 ± 0.398
GlnLys: 2.084 ± 0.428
GlnLeu: 3.959 ± 0.438
GlnMet: 1.597 ± 0.302
GlnAsn: 0.903 ± 0.174
GlnPro: 1.25 ± 0.305
GlnGln: 1.875 ± 0.393
GlnArg: 2.084 ± 0.362
GlnSer: 2.709 ± 0.419
GlnThr: 1.528 ± 0.369
GlnVal: 3.195 ± 0.513
GlnTrp: 1.042 ± 0.227
GlnTyr: 0.972 ± 0.257
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.556 ± 0.681
ArgCys: 0.764 ± 0.239
ArgAsp: 3.056 ± 0.414
ArgGlu: 3.751 ± 0.573
ArgPhe: 2.639 ± 0.42
ArgGly: 4.098 ± 0.503
ArgHis: 1.806 ± 0.325
ArgIle: 2.5 ± 0.351
ArgLys: 3.264 ± 0.429
ArgLeu: 5.556 ± 0.554
ArgMet: 2.153 ± 0.378
ArgAsn: 3.056 ± 0.443
ArgPro: 2.848 ± 0.521
ArgGln: 2.431 ± 0.452
ArgArg: 4.167 ± 0.825
ArgSer: 2.848 ± 0.5
ArgThr: 2.987 ± 0.435
ArgVal: 3.959 ± 0.524
ArgTrp: 0.903 ± 0.231
ArgTyr: 1.528 ± 0.253
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.515 ± 0.674
SerCys: 0.903 ± 0.233
SerAsp: 3.751 ± 0.669
SerGlu: 3.195 ± 0.507
SerPhe: 1.806 ± 0.389
SerGly: 4.376 ± 0.663
SerHis: 1.111 ± 0.196
SerIle: 3.195 ± 0.463
SerLys: 2.848 ± 0.38
SerLeu: 5.904 ± 0.625
SerMet: 2.153 ± 0.41
SerAsn: 2.5 ± 0.38
SerPro: 2.639 ± 0.362
SerGln: 2.153 ± 0.372
SerArg: 3.264 ± 0.433
SerSer: 2.917 ± 0.404
SerThr: 2.778 ± 0.455
SerVal: 2.709 ± 0.473
SerTrp: 0.695 ± 0.209
SerTyr: 1.597 ± 0.274
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.251 ± 0.654
ThrCys: 0.903 ± 0.288
ThrAsp: 3.334 ± 0.543
ThrGlu: 2.848 ± 0.373
ThrPhe: 2.431 ± 0.437
ThrGly: 5.556 ± 0.636
ThrHis: 1.528 ± 0.328
ThrIle: 3.473 ± 0.514
ThrLys: 4.098 ± 0.588
ThrLeu: 5.001 ± 0.726
ThrMet: 1.667 ± 0.342
ThrAsn: 1.389 ± 0.298
ThrPro: 2.848 ± 0.469
ThrGln: 1.597 ± 0.272
ThrArg: 4.167 ± 0.563
ThrSer: 2.5 ± 0.409
ThrThr: 2.987 ± 0.5
ThrVal: 3.889 ± 0.541
ThrTrp: 0.625 ± 0.222
ThrTyr: 1.389 ± 0.285
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.668 ± 1.119
ValCys: 1.181 ± 0.28
ValAsp: 4.098 ± 0.594
ValGlu: 4.306 ± 0.593
ValPhe: 1.945 ± 0.34
ValGly: 5.07 ± 0.547
ValHis: 1.042 ± 0.26
ValIle: 4.931 ± 0.652
ValLys: 4.862 ± 0.585
ValLeu: 4.028 ± 0.539
ValMet: 2.848 ± 0.464
ValAsn: 2.917 ± 0.572
ValPro: 2.361 ± 0.431
ValGln: 2.917 ± 0.427
ValArg: 3.681 ± 0.459
ValSer: 3.959 ± 0.453
ValThr: 5.348 ± 0.672
ValVal: 6.181 ± 0.782
ValTrp: 1.042 ± 0.264
ValTyr: 1.042 ± 0.277
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.806 ± 0.418
TrpCys: 0.417 ± 0.17
TrpAsp: 1.667 ± 0.337
TrpGlu: 0.972 ± 0.273
TrpPhe: 1.042 ± 0.24
TrpGly: 1.25 ± 0.353
TrpHis: 0.625 ± 0.228
TrpIle: 0.833 ± 0.218
TrpLys: 0.903 ± 0.277
TrpLeu: 1.459 ± 0.295
TrpMet: 0.833 ± 0.227
TrpAsn: 0.417 ± 0.145
TrpPro: 0.208 ± 0.127
TrpGln: 0.972 ± 0.242
TrpArg: 0.972 ± 0.261
TrpSer: 1.25 ± 0.253
TrpThr: 0.417 ± 0.149
TrpVal: 1.459 ± 0.245
TrpTrp: 0.347 ± 0.169
TrpTyr: 0.417 ± 0.156
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.639 ± 0.457
TyrCys: 0.486 ± 0.18
TyrAsp: 1.32 ± 0.334
TyrGlu: 1.181 ± 0.224
TyrPhe: 0.486 ± 0.184
TyrGly: 1.806 ± 0.342
TyrHis: 1.111 ± 0.288
TyrIle: 0.764 ± 0.215
TyrLys: 0.903 ± 0.231
TyrLeu: 1.736 ± 0.332
TyrMet: 0.208 ± 0.139
TyrAsn: 0.903 ± 0.219
TyrPro: 0.972 ± 0.266
TyrGln: 1.32 ± 0.32
TyrArg: 2.431 ± 0.388
TyrSer: 1.042 ± 0.24
TyrThr: 1.389 ± 0.355
TyrVal: 1.25 ± 0.253
TyrTrp: 0.347 ± 0.154
TyrTyr: 0.625 ± 0.212
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (14399 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
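A protein-level bootstrap of this kind can be sketched in Python as follows. Whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error are all assumptions for illustration; the page does not specify which estimator the database reports.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille (pairs counted within proteins)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy one-letter sequences (hypothetical, not from the phage proteome).
toy = ["MAAK", "AAG", "MKLV", "GAAL"]
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")
print(err_aa, err_ww)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome. A dipeptide that never occurs, such as the Xaa pairs in the table, gets an error of zero.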

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski