Amino acid dipepetide frequency for Erwinia phage phiEaP-8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.889AlaAla: 8.889 ± 0.733
0.501AlaCys: 0.501 ± 0.115
5.509AlaAsp: 5.509 ± 0.607
5.384AlaGlu: 5.384 ± 0.559
2.921AlaPhe: 2.921 ± 0.329
6.844AlaGly: 6.844 ± 0.56
1.21AlaHis: 1.21 ± 0.234
5.759AlaIle: 5.759 ± 0.454
5.175AlaLys: 5.175 ± 0.587
7.387AlaLeu: 7.387 ± 0.667
2.88AlaMet: 2.88 ± 0.301
5.175AlaAsn: 5.175 ± 0.686
3.798AlaPro: 3.798 ± 0.332
5.467AlaGln: 5.467 ± 0.919
3.38AlaArg: 3.38 ± 0.398
4.758AlaSer: 4.758 ± 0.491
5.634AlaThr: 5.634 ± 0.593
6.844AlaVal: 6.844 ± 0.542
1.294AlaTrp: 1.294 ± 0.244
3.38AlaTyr: 3.38 ± 0.316
0.0AlaXaa: 0.0 ± 0.0
Cys
0.668CysAla: 0.668 ± 0.211
0.125CysCys: 0.125 ± 0.079
0.459CysAsp: 0.459 ± 0.155
0.668CysGlu: 0.668 ± 0.186
0.292CysPhe: 0.292 ± 0.129
0.459CysGly: 0.459 ± 0.171
0.25CysHis: 0.25 ± 0.11
0.334CysIle: 0.334 ± 0.121
0.376CysLys: 0.376 ± 0.141
0.793CysLeu: 0.793 ± 0.224
0.209CysMet: 0.209 ± 0.09
0.417CysAsn: 0.417 ± 0.2
0.209CysPro: 0.209 ± 0.095
0.417CysGln: 0.417 ± 0.151
0.376CysArg: 0.376 ± 0.147
0.376CysSer: 0.376 ± 0.15
0.292CysThr: 0.292 ± 0.109
0.584CysVal: 0.584 ± 0.223
0.125CysTrp: 0.125 ± 0.064
0.125CysTyr: 0.125 ± 0.068
0.0CysXaa: 0.0 ± 0.0
Asp
6.427AspAla: 6.427 ± 0.749
0.584AspCys: 0.584 ± 0.167
3.255AspAsp: 3.255 ± 0.45
4.215AspGlu: 4.215 ± 0.5
1.628AspPhe: 1.628 ± 0.261
4.716AspGly: 4.716 ± 0.49
0.96AspHis: 0.96 ± 0.217
3.005AspIle: 3.005 ± 0.365
3.297AspLys: 3.297 ± 0.473
5.801AspLeu: 5.801 ± 0.442
1.711AspMet: 1.711 ± 0.222
3.047AspAsn: 3.047 ± 0.307
3.214AspPro: 3.214 ± 0.405
2.17AspGln: 2.17 ± 0.265
3.047AspArg: 3.047 ± 0.358
2.546AspSer: 2.546 ± 0.321
4.007AspThr: 4.007 ± 0.437
3.214AspVal: 3.214 ± 0.398
0.96AspTrp: 0.96 ± 0.272
2.087AspTyr: 2.087 ± 0.343
0.0AspXaa: 0.0 ± 0.0
Glu
5.718GluAla: 5.718 ± 0.539
0.417GluCys: 0.417 ± 0.137
3.214GluAsp: 3.214 ± 0.391
4.549GluGlu: 4.549 ± 0.648
2.88GluPhe: 2.88 ± 0.37
3.756GluGly: 3.756 ± 0.425
1.544GluHis: 1.544 ± 0.283
3.422GluIle: 3.422 ± 0.49
3.673GluLys: 3.673 ± 0.416
4.591GluLeu: 4.591 ± 0.555
2.295GluMet: 2.295 ± 0.279
2.963GluAsn: 2.963 ± 0.334
3.047GluPro: 3.047 ± 0.363
3.172GluGln: 3.172 ± 0.421
1.962GluArg: 1.962 ± 0.323
3.673GluSer: 3.673 ± 0.323
3.547GluThr: 3.547 ± 0.313
3.798GluVal: 3.798 ± 0.62
0.709GluTrp: 0.709 ± 0.186
2.254GluTyr: 2.254 ± 0.286
0.0GluXaa: 0.0 ± 0.0
Phe
2.88PheAla: 2.88 ± 0.413
0.543PheCys: 0.543 ± 0.178
1.711PheAsp: 1.711 ± 0.306
1.711PheGlu: 1.711 ± 0.26
0.876PhePhe: 0.876 ± 0.217
2.754PheGly: 2.754 ± 0.325
0.709PheHis: 0.709 ± 0.166
1.92PheIle: 1.92 ± 0.291
1.878PheLys: 1.878 ± 0.228
2.629PheLeu: 2.629 ± 0.338
1.461PheMet: 1.461 ± 0.228
2.671PheAsn: 2.671 ± 0.335
1.294PhePro: 1.294 ± 0.165
1.669PheGln: 1.669 ± 0.253
1.962PheArg: 1.962 ± 0.348
2.504PheSer: 2.504 ± 0.381
2.754PheThr: 2.754 ± 0.342
1.628PheVal: 1.628 ± 0.326
0.501PheTrp: 0.501 ± 0.174
1.002PheTyr: 1.002 ± 0.245
0.0PheXaa: 0.0 ± 0.0
Gly
5.3GlyAla: 5.3 ± 0.611
0.334GlyCys: 0.334 ± 0.128
4.007GlyAsp: 4.007 ± 0.516
4.215GlyGlu: 4.215 ± 0.393
2.838GlyPhe: 2.838 ± 0.377
4.466GlyGly: 4.466 ± 0.634
0.918GlyHis: 0.918 ± 0.222
4.048GlyIle: 4.048 ± 0.334
4.716GlyLys: 4.716 ± 0.58
4.966GlyLeu: 4.966 ± 0.524
2.003GlyMet: 2.003 ± 0.266
4.591GlyAsn: 4.591 ± 0.47
1.336GlyPro: 1.336 ± 0.218
2.421GlyGln: 2.421 ± 0.305
3.214GlyArg: 3.214 ± 0.381
5.801GlySer: 5.801 ± 0.822
4.758GlyThr: 4.758 ± 0.693
5.3GlyVal: 5.3 ± 0.606
1.002GlyTrp: 1.002 ± 0.25
2.838GlyTyr: 2.838 ± 0.361
0.0GlyXaa: 0.0 ± 0.0
His
1.502HisAla: 1.502 ± 0.274
0.167HisCys: 0.167 ± 0.087
1.169HisAsp: 1.169 ± 0.223
1.21HisGlu: 1.21 ± 0.284
0.459HisPhe: 0.459 ± 0.118
1.085HisGly: 1.085 ± 0.215
0.668HisHis: 0.668 ± 0.181
1.002HisIle: 1.002 ± 0.215
1.043HisLys: 1.043 ± 0.237
1.669HisLeu: 1.669 ± 0.314
0.459HisMet: 0.459 ± 0.107
0.668HisAsn: 0.668 ± 0.124
0.417HisPro: 0.417 ± 0.112
0.96HisGln: 0.96 ± 0.223
0.835HisArg: 0.835 ± 0.198
0.918HisSer: 0.918 ± 0.255
0.751HisThr: 0.751 ± 0.184
0.918HisVal: 0.918 ± 0.251
0.501HisTrp: 0.501 ± 0.12
0.96HisTyr: 0.96 ± 0.278
0.0HisXaa: 0.0 ± 0.0
Ile
4.925IleAla: 4.925 ± 0.388
0.584IleCys: 0.584 ± 0.199
4.507IleAsp: 4.507 ± 0.424
4.34IleGlu: 4.34 ± 0.49
1.711IlePhe: 1.711 ± 0.281
3.297IleGly: 3.297 ± 0.323
0.96IleHis: 0.96 ± 0.204
2.671IleIle: 2.671 ± 0.381
2.963IleLys: 2.963 ± 0.452
3.589IleLeu: 3.589 ± 0.364
1.419IleMet: 1.419 ± 0.229
3.339IleAsn: 3.339 ± 0.33
3.422IlePro: 3.422 ± 0.357
2.087IleGln: 2.087 ± 0.298
2.88IleArg: 2.88 ± 0.323
2.796IleSer: 2.796 ± 0.282
3.673IleThr: 3.673 ± 0.447
3.005IleVal: 3.005 ± 0.507
0.668IleTrp: 0.668 ± 0.168
1.878IleTyr: 1.878 ± 0.333
0.0IleXaa: 0.0 ± 0.0
Lys
6.093LysAla: 6.093 ± 0.588
0.167LysCys: 0.167 ± 0.09
3.13LysAsp: 3.13 ± 0.316
3.589LysGlu: 3.589 ± 0.452
2.128LysPhe: 2.128 ± 0.333
2.88LysGly: 2.88 ± 0.378
1.002LysHis: 1.002 ± 0.261
3.13LysIle: 3.13 ± 0.326
3.088LysLys: 3.088 ± 0.555
4.883LysLeu: 4.883 ± 0.584
2.087LysMet: 2.087 ± 0.32
2.963LysAsn: 2.963 ± 0.398
2.754LysPro: 2.754 ± 0.53
3.464LysGln: 3.464 ± 0.354
2.713LysArg: 2.713 ± 0.46
3.506LysSer: 3.506 ± 0.431
3.297LysThr: 3.297 ± 0.386
4.966LysVal: 4.966 ± 0.474
0.793LysTrp: 0.793 ± 0.193
1.586LysTyr: 1.586 ± 0.265
0.0LysXaa: 0.0 ± 0.0
Leu
7.22LeuAla: 7.22 ± 0.684
0.835LeuCys: 0.835 ± 0.245
5.509LeuAsp: 5.509 ± 0.43
4.299LeuGlu: 4.299 ± 0.456
3.088LeuPhe: 3.088 ± 0.472
4.841LeuGly: 4.841 ± 0.475
1.252LeuHis: 1.252 ± 0.269
3.714LeuIle: 3.714 ± 0.408
3.923LeuLys: 3.923 ± 0.379
5.885LeuLeu: 5.885 ± 0.561
2.88LeuMet: 2.88 ± 0.393
5.133LeuAsn: 5.133 ± 0.53
3.464LeuPro: 3.464 ± 0.423
3.589LeuGln: 3.589 ± 0.47
4.173LeuArg: 4.173 ± 0.538
6.761LeuSer: 6.761 ± 0.629
5.676LeuThr: 5.676 ± 0.423
4.841LeuVal: 4.841 ± 0.412
0.751LeuTrp: 0.751 ± 0.25
2.17LeuTyr: 2.17 ± 0.291
0.0LeuXaa: 0.0 ± 0.0
Met
2.713MetAla: 2.713 ± 0.382
0.25MetCys: 0.25 ± 0.123
1.419MetAsp: 1.419 ± 0.241
1.711MetGlu: 1.711 ± 0.196
1.043MetPhe: 1.043 ± 0.208
1.711MetGly: 1.711 ± 0.199
0.584MetHis: 0.584 ± 0.151
1.836MetIle: 1.836 ± 0.245
1.795MetLys: 1.795 ± 0.322
2.838MetLeu: 2.838 ± 0.344
0.626MetMet: 0.626 ± 0.165
1.836MetAsn: 1.836 ± 0.227
1.336MetPro: 1.336 ± 0.194
1.502MetGln: 1.502 ± 0.261
1.336MetArg: 1.336 ± 0.192
2.17MetSer: 2.17 ± 0.449
2.128MetThr: 2.128 ± 0.29
1.795MetVal: 1.795 ± 0.239
0.292MetTrp: 0.292 ± 0.125
0.918MetTyr: 0.918 ± 0.225
0.0MetXaa: 0.0 ± 0.0
Asn
5.926AsnAla: 5.926 ± 0.697
0.626AsnCys: 0.626 ± 0.179
3.214AsnAsp: 3.214 ± 0.381
2.796AsnGlu: 2.796 ± 0.418
2.17AsnPhe: 2.17 ± 0.301
4.215AsnGly: 4.215 ± 0.581
1.085AsnHis: 1.085 ± 0.159
3.38AsnIle: 3.38 ± 0.377
3.339AsnLys: 3.339 ± 0.359
4.758AsnLeu: 4.758 ± 0.489
1.544AsnMet: 1.544 ± 0.244
2.796AsnAsn: 2.796 ± 0.337
3.088AsnPro: 3.088 ± 0.461
2.963AsnGln: 2.963 ± 0.411
2.588AsnArg: 2.588 ± 0.324
3.881AsnSer: 3.881 ± 0.358
3.047AsnThr: 3.047 ± 0.362
3.631AsnVal: 3.631 ± 0.46
0.709AsnTrp: 0.709 ± 0.219
1.753AsnTyr: 1.753 ± 0.286
0.0AsnXaa: 0.0 ± 0.0
Pro
3.965ProAla: 3.965 ± 0.498
0.25ProCys: 0.25 ± 0.112
2.588ProAsp: 2.588 ± 0.441
3.631ProGlu: 3.631 ± 0.486
1.628ProPhe: 1.628 ± 0.358
2.462ProGly: 2.462 ± 0.345
0.459ProHis: 0.459 ± 0.141
2.087ProIle: 2.087 ± 0.261
1.836ProLys: 1.836 ± 0.319
3.297ProLeu: 3.297 ± 0.413
0.96ProMet: 0.96 ± 0.165
2.087ProAsn: 2.087 ± 0.346
1.043ProPro: 1.043 ± 0.297
1.878ProGln: 1.878 ± 0.333
1.169ProArg: 1.169 ± 0.237
2.838ProSer: 2.838 ± 0.418
3.047ProThr: 3.047 ± 0.402
4.466ProVal: 4.466 ± 0.436
0.459ProTrp: 0.459 ± 0.155
1.169ProTyr: 1.169 ± 0.23
0.0ProXaa: 0.0 ± 0.0
Gln
5.425GlnAla: 5.425 ± 0.736
0.209GlnCys: 0.209 ± 0.087
2.254GlnAsp: 2.254 ± 0.305
2.88GlnGlu: 2.88 ± 0.465
1.461GlnPhe: 1.461 ± 0.18
2.462GlnGly: 2.462 ± 0.344
0.793GlnHis: 0.793 ± 0.143
2.838GlnIle: 2.838 ± 0.369
3.088GlnLys: 3.088 ± 0.391
4.34GlnLeu: 4.34 ± 0.471
1.753GlnMet: 1.753 ± 0.313
3.422GlnAsn: 3.422 ± 0.712
1.586GlnPro: 1.586 ± 0.232
3.172GlnGln: 3.172 ± 0.608
2.504GlnArg: 2.504 ± 0.285
3.047GlnSer: 3.047 ± 0.353
3.172GlnThr: 3.172 ± 0.371
2.295GlnVal: 2.295 ± 0.317
0.543GlnTrp: 0.543 ± 0.154
1.419GlnTyr: 1.419 ± 0.248
0.0GlnXaa: 0.0 ± 0.0
Arg
3.005ArgAla: 3.005 ± 0.469
0.417ArgCys: 0.417 ± 0.139
2.754ArgAsp: 2.754 ± 0.381
2.963ArgGlu: 2.963 ± 0.335
1.377ArgPhe: 1.377 ± 0.347
2.629ArgGly: 2.629 ± 0.299
0.751ArgHis: 0.751 ± 0.192
2.337ArgIle: 2.337 ± 0.272
2.379ArgLys: 2.379 ± 0.348
3.547ArgLeu: 3.547 ± 0.409
1.544ArgMet: 1.544 ± 0.247
2.379ArgAsn: 2.379 ± 0.398
1.294ArgPro: 1.294 ± 0.236
2.337ArgGln: 2.337 ± 0.327
2.087ArgArg: 2.087 ± 0.35
2.671ArgSer: 2.671 ± 0.431
3.214ArgThr: 3.214 ± 0.382
3.255ArgVal: 3.255 ± 0.373
0.584ArgTrp: 0.584 ± 0.165
1.836ArgTyr: 1.836 ± 0.344
0.0ArgXaa: 0.0 ± 0.0
Ser
5.926SerAla: 5.926 ± 0.682
0.083SerCys: 0.083 ± 0.056
3.798SerAsp: 3.798 ± 0.447
3.297SerGlu: 3.297 ± 0.455
1.753SerPhe: 1.753 ± 0.288
5.718SerGly: 5.718 ± 0.65
1.085SerHis: 1.085 ± 0.205
3.088SerIle: 3.088 ± 0.273
3.631SerLys: 3.631 ± 0.328
5.759SerLeu: 5.759 ± 0.477
2.045SerMet: 2.045 ± 0.293
3.714SerAsn: 3.714 ± 0.366
1.836SerPro: 1.836 ± 0.335
2.087SerGln: 2.087 ± 0.284
2.838SerArg: 2.838 ± 0.37
3.756SerSer: 3.756 ± 0.421
3.965SerThr: 3.965 ± 0.519
5.592SerVal: 5.592 ± 0.522
0.751SerTrp: 0.751 ± 0.239
2.671SerTyr: 2.671 ± 0.478
0.0SerXaa: 0.0 ± 0.0
Thr
5.843ThrAla: 5.843 ± 0.541
0.417ThrCys: 0.417 ± 0.157
4.507ThrAsp: 4.507 ± 0.42
3.84ThrGlu: 3.84 ± 0.386
2.254ThrPhe: 2.254 ± 0.347
6.052ThrGly: 6.052 ± 0.572
0.668ThrHis: 0.668 ± 0.176
3.589ThrIle: 3.589 ± 0.46
3.923ThrLys: 3.923 ± 0.44
4.966ThrLeu: 4.966 ± 0.428
1.252ThrMet: 1.252 ± 0.283
3.422ThrAsn: 3.422 ± 0.452
3.38ThrPro: 3.38 ± 0.346
2.88ThrGln: 2.88 ± 0.429
1.586ThrArg: 1.586 ± 0.236
3.923ThrSer: 3.923 ± 0.546
4.215ThrThr: 4.215 ± 0.597
5.676ThrVal: 5.676 ± 0.47
1.002ThrTrp: 1.002 ± 0.238
1.711ThrTyr: 1.711 ± 0.328
0.0ThrXaa: 0.0 ± 0.0
Val
6.302ValAla: 6.302 ± 0.746
0.417ValCys: 0.417 ± 0.157
4.466ValAsp: 4.466 ± 0.422
4.048ValGlu: 4.048 ± 0.473
2.462ValPhe: 2.462 ± 0.336
4.841ValGly: 4.841 ± 0.562
1.294ValHis: 1.294 ± 0.194
4.132ValIle: 4.132 ± 0.467
5.467ValLys: 5.467 ± 0.414
4.716ValLeu: 4.716 ± 0.486
1.669ValMet: 1.669 ± 0.247
4.299ValAsn: 4.299 ± 0.516
3.047ValPro: 3.047 ± 0.434
3.589ValGln: 3.589 ± 0.53
3.047ValArg: 3.047 ± 0.326
4.424ValSer: 4.424 ± 0.475
4.883ValThr: 4.883 ± 0.542
4.132ValVal: 4.132 ± 0.604
0.668ValTrp: 0.668 ± 0.179
2.128ValTyr: 2.128 ± 0.394
0.0ValXaa: 0.0 ± 0.0
Trp
1.377TrpAla: 1.377 ± 0.214
0.167TrpCys: 0.167 ± 0.085
0.709TrpAsp: 0.709 ± 0.192
0.459TrpGlu: 0.459 ± 0.143
0.709TrpPhe: 0.709 ± 0.196
0.918TrpGly: 0.918 ± 0.303
0.25TrpHis: 0.25 ± 0.154
0.668TrpIle: 0.668 ± 0.246
0.751TrpLys: 0.751 ± 0.21
1.21TrpLeu: 1.21 ± 0.239
0.042TrpMet: 0.042 ± 0.038
1.002TrpAsn: 1.002 ± 0.184
0.292TrpPro: 0.292 ± 0.093
0.793TrpGln: 0.793 ± 0.144
0.417TrpArg: 0.417 ± 0.117
0.876TrpSer: 0.876 ± 0.192
0.751TrpThr: 0.751 ± 0.208
1.043TrpVal: 1.043 ± 0.241
0.167TrpTrp: 0.167 ± 0.084
0.25TrpTyr: 0.25 ± 0.094
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.087TyrAla: 2.087 ± 0.246
0.459TyrCys: 0.459 ± 0.144
1.711TyrAsp: 1.711 ± 0.229
1.377TyrGlu: 1.377 ± 0.204
1.377TyrPhe: 1.377 ± 0.276
2.963TyrGly: 2.963 ± 0.371
0.918TyrHis: 0.918 ± 0.227
1.753TyrIle: 1.753 ± 0.331
1.92TyrLys: 1.92 ± 0.309
2.379TyrLeu: 2.379 ± 0.344
0.96TyrMet: 0.96 ± 0.165
1.586TyrAsn: 1.586 ± 0.228
1.461TyrPro: 1.461 ± 0.24
2.045TyrGln: 2.045 ± 0.354
1.252TyrArg: 1.252 ± 0.199
2.17TyrSer: 2.17 ± 0.288
2.295TyrThr: 2.295 ± 0.316
3.047TyrVal: 3.047 ± 0.374
0.376TyrTrp: 0.376 ± 0.131
0.543TyrTyr: 0.543 ± 0.156
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (23962 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski