Amino acid dipepetide frequency for Erwinia phage Ea9-2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.441AlaAla: 9.441 ± 0.758
0.59AlaCys: 0.59 ± 0.186
4.594AlaAsp: 4.594 ± 0.467
6.533AlaGlu: 6.533 ± 0.677
2.782AlaPhe: 2.782 ± 0.311
6.533AlaGly: 6.533 ± 0.688
1.433AlaHis: 1.433 ± 0.252
4.678AlaIle: 4.678 ± 0.484
6.49AlaLys: 6.49 ± 0.564
7.755AlaLeu: 7.755 ± 0.784
2.782AlaMet: 2.782 ± 0.359
4.046AlaAsn: 4.046 ± 0.397
4.215AlaPro: 4.215 ± 0.443
4.51AlaGln: 4.51 ± 0.694
2.992AlaArg: 2.992 ± 0.423
4.383AlaSer: 4.383 ± 0.434
6.406AlaThr: 6.406 ± 0.579
7.291AlaVal: 7.291 ± 0.678
0.969AlaTrp: 0.969 ± 0.211
3.54AlaTyr: 3.54 ± 0.354
0.0AlaXaa: 0.0 ± 0.0
Cys
0.632CysAla: 0.632 ± 0.23
0.084CysCys: 0.084 ± 0.063
0.59CysAsp: 0.59 ± 0.199
0.801CysGlu: 0.801 ± 0.22
0.337CysPhe: 0.337 ± 0.128
0.632CysGly: 0.632 ± 0.208
0.379CysHis: 0.379 ± 0.165
0.632CysIle: 0.632 ± 0.187
0.464CysLys: 0.464 ± 0.181
0.632CysLeu: 0.632 ± 0.199
0.295CysMet: 0.295 ± 0.133
0.548CysAsn: 0.548 ± 0.204
0.295CysPro: 0.295 ± 0.133
0.337CysGln: 0.337 ± 0.144
0.379CysArg: 0.379 ± 0.151
0.464CysSer: 0.464 ± 0.155
0.506CysThr: 0.506 ± 0.157
0.506CysVal: 0.506 ± 0.153
0.042CysTrp: 0.042 ± 0.041
0.211CysTyr: 0.211 ± 0.111
0.0CysXaa: 0.0 ± 0.0
Asp
5.943AspAla: 5.943 ± 0.704
0.59AspCys: 0.59 ± 0.192
3.456AspAsp: 3.456 ± 0.417
4.341AspGlu: 4.341 ± 0.482
1.812AspPhe: 1.812 ± 0.281
4.004AspGly: 4.004 ± 0.381
1.138AspHis: 1.138 ± 0.205
3.54AspIle: 3.54 ± 0.452
3.077AspLys: 3.077 ± 0.401
5.816AspLeu: 5.816 ± 0.502
1.77AspMet: 1.77 ± 0.288
3.287AspAsn: 3.287 ± 0.354
3.077AspPro: 3.077 ± 0.306
2.613AspGln: 2.613 ± 0.39
2.908AspArg: 2.908 ± 0.36
3.372AspSer: 3.372 ± 0.384
3.33AspThr: 3.33 ± 0.389
3.877AspVal: 3.877 ± 0.378
0.674AspTrp: 0.674 ± 0.179
2.318AspTyr: 2.318 ± 0.321
0.0AspXaa: 0.0 ± 0.0
Glu
5.268GluAla: 5.268 ± 0.514
0.59GluCys: 0.59 ± 0.202
3.667GluAsp: 3.667 ± 0.499
5.437GluGlu: 5.437 ± 0.81
2.613GluPhe: 2.613 ± 0.348
3.287GluGly: 3.287 ± 0.278
1.686GluHis: 1.686 ± 0.293
3.456GluIle: 3.456 ± 0.383
3.456GluLys: 3.456 ± 0.398
5.521GluLeu: 5.521 ± 0.571
1.77GluMet: 1.77 ± 0.251
2.866GluAsn: 2.866 ± 0.32
3.119GluPro: 3.119 ± 0.563
2.36GluGln: 2.36 ± 0.318
2.739GluArg: 2.739 ± 0.364
3.414GluSer: 3.414 ± 0.473
3.498GluThr: 3.498 ± 0.405
4.383GluVal: 4.383 ± 0.593
0.843GluTrp: 0.843 ± 0.214
2.065GluTyr: 2.065 ± 0.229
0.0GluXaa: 0.0 ± 0.0
Phe
2.234PheAla: 2.234 ± 0.257
0.421PheCys: 0.421 ± 0.142
2.444PheAsp: 2.444 ± 0.33
1.391PheGlu: 1.391 ± 0.248
0.843PhePhe: 0.843 ± 0.155
2.782PheGly: 2.782 ± 0.31
1.307PheHis: 1.307 ± 0.298
2.487PheIle: 2.487 ± 0.291
2.065PheLys: 2.065 ± 0.332
2.529PheLeu: 2.529 ± 0.31
1.559PheMet: 1.559 ± 0.264
2.444PheAsn: 2.444 ± 0.401
1.18PhePro: 1.18 ± 0.209
0.927PheGln: 0.927 ± 0.209
1.475PheArg: 1.475 ± 0.25
1.981PheSer: 1.981 ± 0.236
2.992PheThr: 2.992 ± 0.457
2.276PheVal: 2.276 ± 0.264
0.379PheTrp: 0.379 ± 0.137
1.054PheTyr: 1.054 ± 0.163
0.0PheXaa: 0.0 ± 0.0
Gly
5.605GlyAla: 5.605 ± 0.631
0.379GlyCys: 0.379 ± 0.134
3.33GlyAsp: 3.33 ± 0.342
3.962GlyGlu: 3.962 ± 0.347
2.782GlyPhe: 2.782 ± 0.325
4.046GlyGly: 4.046 ± 0.521
0.969GlyHis: 0.969 ± 0.238
4.341GlyIle: 4.341 ± 0.391
4.51GlyLys: 4.51 ± 0.471
5.31GlyLeu: 5.31 ± 0.48
2.318GlyMet: 2.318 ± 0.293
3.793GlyAsn: 3.793 ± 0.478
1.18GlyPro: 1.18 ± 0.187
2.655GlyGln: 2.655 ± 0.436
3.33GlyArg: 3.33 ± 0.45
4.847GlySer: 4.847 ± 0.432
4.847GlyThr: 4.847 ± 0.447
4.847GlyVal: 4.847 ± 0.516
0.759GlyTrp: 0.759 ± 0.194
2.571GlyTyr: 2.571 ± 0.365
0.0GlyXaa: 0.0 ± 0.0
His
1.307HisAla: 1.307 ± 0.233
0.421HisCys: 0.421 ± 0.145
1.18HisAsp: 1.18 ± 0.249
1.349HisGlu: 1.349 ± 0.256
0.632HisPhe: 0.632 ± 0.185
1.517HisGly: 1.517 ± 0.316
0.421HisHis: 0.421 ± 0.154
1.307HisIle: 1.307 ± 0.269
1.264HisLys: 1.264 ± 0.247
1.939HisLeu: 1.939 ± 0.322
0.759HisMet: 0.759 ± 0.167
0.632HisAsn: 0.632 ± 0.17
0.969HisPro: 0.969 ± 0.19
0.801HisGln: 0.801 ± 0.163
0.927HisArg: 0.927 ± 0.256
1.264HisSer: 1.264 ± 0.216
0.59HisThr: 0.59 ± 0.152
1.18HisVal: 1.18 ± 0.252
0.379HisTrp: 0.379 ± 0.129
0.716HisTyr: 0.716 ± 0.175
0.0HisXaa: 0.0 ± 0.0
Ile
5.479IleAla: 5.479 ± 0.499
0.506IleCys: 0.506 ± 0.186
4.299IleAsp: 4.299 ± 0.442
3.456IleGlu: 3.456 ± 0.572
1.391IlePhe: 1.391 ± 0.3
2.992IleGly: 2.992 ± 0.312
1.18IleHis: 1.18 ± 0.209
2.992IleIle: 2.992 ± 0.406
3.582IleLys: 3.582 ± 0.442
3.962IleLeu: 3.962 ± 0.555
1.475IleMet: 1.475 ± 0.21
2.992IleAsn: 2.992 ± 0.34
2.824IlePro: 2.824 ± 0.39
2.402IleGln: 2.402 ± 0.331
2.866IleArg: 2.866 ± 0.331
2.697IleSer: 2.697 ± 0.296
3.498IleThr: 3.498 ± 0.351
3.203IleVal: 3.203 ± 0.348
0.421IleTrp: 0.421 ± 0.138
1.981IleTyr: 1.981 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
6.448LysAla: 6.448 ± 0.582
0.295LysCys: 0.295 ± 0.123
3.245LysAsp: 3.245 ± 0.435
3.962LysGlu: 3.962 ± 0.433
1.897LysPhe: 1.897 ± 0.282
3.33LysGly: 3.33 ± 0.505
1.18LysHis: 1.18 ± 0.225
2.908LysIle: 2.908 ± 0.394
3.456LysLys: 3.456 ± 0.622
4.973LysLeu: 4.973 ± 0.544
1.981LysMet: 1.981 ± 0.227
3.161LysAsn: 3.161 ± 0.365
2.95LysPro: 2.95 ± 0.445
2.655LysGln: 2.655 ± 0.375
2.95LysArg: 2.95 ± 0.396
3.877LysSer: 3.877 ± 0.424
3.92LysThr: 3.92 ± 0.432
4.425LysVal: 4.425 ± 0.406
0.59LysTrp: 0.59 ± 0.177
1.559LysTyr: 1.559 ± 0.275
0.0LysXaa: 0.0 ± 0.0
Leu
6.996LeuAla: 6.996 ± 0.609
1.012LeuCys: 1.012 ± 0.244
5.605LeuAsp: 5.605 ± 0.526
4.172LeuGlu: 4.172 ± 0.402
3.203LeuPhe: 3.203 ± 0.368
5.816LeuGly: 5.816 ± 0.435
1.602LeuHis: 1.602 ± 0.252
4.215LeuIle: 4.215 ± 0.421
4.215LeuLys: 4.215 ± 0.391
6.617LeuLeu: 6.617 ± 0.52
2.739LeuMet: 2.739 ± 0.301
5.015LeuAsn: 5.015 ± 0.44
3.835LeuPro: 3.835 ± 0.418
3.625LeuGln: 3.625 ± 0.522
3.877LeuArg: 3.877 ± 0.426
6.238LeuSer: 6.238 ± 0.569
5.437LeuThr: 5.437 ± 0.432
5.732LeuVal: 5.732 ± 0.411
0.716LeuTrp: 0.716 ± 0.181
2.655LeuTyr: 2.655 ± 0.396
0.0LeuXaa: 0.0 ± 0.0
Met
2.318MetAla: 2.318 ± 0.363
0.126MetCys: 0.126 ± 0.087
2.234MetAsp: 2.234 ± 0.29
1.854MetGlu: 1.854 ± 0.332
0.885MetPhe: 0.885 ± 0.17
1.517MetGly: 1.517 ± 0.22
0.421MetHis: 0.421 ± 0.113
1.559MetIle: 1.559 ± 0.229
1.897MetLys: 1.897 ± 0.273
2.529MetLeu: 2.529 ± 0.34
0.716MetMet: 0.716 ± 0.172
1.728MetAsn: 1.728 ± 0.245
1.391MetPro: 1.391 ± 0.214
1.391MetGln: 1.391 ± 0.273
2.065MetArg: 2.065 ± 0.28
2.697MetSer: 2.697 ± 0.36
2.487MetThr: 2.487 ± 0.403
1.939MetVal: 1.939 ± 0.282
0.126MetTrp: 0.126 ± 0.075
0.632MetTyr: 0.632 ± 0.155
0.0MetXaa: 0.0 ± 0.0
Asn
4.973AsnAla: 4.973 ± 0.608
0.632AsnCys: 0.632 ± 0.175
3.161AsnAsp: 3.161 ± 0.424
2.95AsnGlu: 2.95 ± 0.337
2.023AsnPhe: 2.023 ± 0.277
4.046AsnGly: 4.046 ± 0.369
1.012AsnHis: 1.012 ± 0.198
2.908AsnIle: 2.908 ± 0.326
3.456AsnLys: 3.456 ± 0.407
4.13AsnLeu: 4.13 ± 0.355
1.475AsnMet: 1.475 ± 0.284
3.035AsnAsn: 3.035 ± 0.425
2.908AsnPro: 2.908 ± 0.439
2.444AsnGln: 2.444 ± 0.339
2.571AsnArg: 2.571 ± 0.329
3.498AsnSer: 3.498 ± 0.365
3.667AsnThr: 3.667 ± 0.492
3.203AsnVal: 3.203 ± 0.302
0.674AsnTrp: 0.674 ± 0.18
1.559AsnTyr: 1.559 ± 0.21
0.0AsnXaa: 0.0 ± 0.0
Pro
4.13ProAla: 4.13 ± 0.461
0.295ProCys: 0.295 ± 0.119
2.655ProAsp: 2.655 ± 0.326
3.582ProGlu: 3.582 ± 0.508
1.981ProPhe: 1.981 ± 0.323
1.981ProGly: 1.981 ± 0.321
0.506ProHis: 0.506 ± 0.133
1.728ProIle: 1.728 ± 0.216
2.065ProLys: 2.065 ± 0.301
3.582ProLeu: 3.582 ± 0.341
1.222ProMet: 1.222 ± 0.188
2.065ProAsn: 2.065 ± 0.312
0.801ProPro: 0.801 ± 0.258
1.602ProGln: 1.602 ± 0.277
1.391ProArg: 1.391 ± 0.247
2.487ProSer: 2.487 ± 0.338
3.035ProThr: 3.035 ± 0.388
4.763ProVal: 4.763 ± 0.586
0.674ProTrp: 0.674 ± 0.15
0.927ProTyr: 0.927 ± 0.257
0.0ProXaa: 0.0 ± 0.0
Gln
4.467GlnAla: 4.467 ± 0.643
0.253GlnCys: 0.253 ± 0.125
2.318GlnAsp: 2.318 ± 0.385
2.613GlnGlu: 2.613 ± 0.279
1.391GlnPhe: 1.391 ± 0.194
2.192GlnGly: 2.192 ± 0.408
1.054GlnHis: 1.054 ± 0.187
2.149GlnIle: 2.149 ± 0.266
2.655GlnLys: 2.655 ± 0.334
3.793GlnLeu: 3.793 ± 0.424
1.812GlnMet: 1.812 ± 0.286
2.824GlnAsn: 2.824 ± 0.651
1.433GlnPro: 1.433 ± 0.276
3.33GlnGln: 3.33 ± 0.741
1.77GlnArg: 1.77 ± 0.25
2.234GlnSer: 2.234 ± 0.311
2.655GlnThr: 2.655 ± 0.383
3.161GlnVal: 3.161 ± 0.315
0.506GlnTrp: 0.506 ± 0.154
1.854GlnTyr: 1.854 ± 0.313
0.0GlnXaa: 0.0 ± 0.0
Arg
3.793ArgAla: 3.793 ± 0.457
0.506ArgCys: 0.506 ± 0.217
3.119ArgAsp: 3.119 ± 0.377
2.234ArgGlu: 2.234 ± 0.434
1.475ArgPhe: 1.475 ± 0.266
2.613ArgGly: 2.613 ± 0.323
0.843ArgHis: 0.843 ± 0.214
2.866ArgIle: 2.866 ± 0.407
2.992ArgLys: 2.992 ± 0.443
4.299ArgLeu: 4.299 ± 0.427
1.812ArgMet: 1.812 ± 0.241
2.402ArgAsn: 2.402 ± 0.361
1.433ArgPro: 1.433 ± 0.277
1.602ArgGln: 1.602 ± 0.278
2.065ArgArg: 2.065 ± 0.343
2.824ArgSer: 2.824 ± 0.339
2.487ArgThr: 2.487 ± 0.239
2.992ArgVal: 2.992 ± 0.288
0.506ArgTrp: 0.506 ± 0.164
1.981ArgTyr: 1.981 ± 0.384
0.0ArgXaa: 0.0 ± 0.0
Ser
5.058SerAla: 5.058 ± 0.621
0.337SerCys: 0.337 ± 0.149
3.498SerAsp: 3.498 ± 0.447
2.655SerGlu: 2.655 ± 0.425
1.812SerPhe: 1.812 ± 0.302
5.395SerGly: 5.395 ± 0.452
0.632SerHis: 0.632 ± 0.175
3.33SerIle: 3.33 ± 0.452
3.161SerLys: 3.161 ± 0.412
5.563SerLeu: 5.563 ± 0.578
1.602SerMet: 1.602 ± 0.259
3.414SerAsn: 3.414 ± 0.364
1.897SerPro: 1.897 ± 0.38
2.782SerGln: 2.782 ± 0.407
3.077SerArg: 3.077 ± 0.396
3.962SerSer: 3.962 ± 0.535
4.13SerThr: 4.13 ± 0.51
5.226SerVal: 5.226 ± 0.637
0.885SerTrp: 0.885 ± 0.216
2.318SerTyr: 2.318 ± 0.268
0.0SerXaa: 0.0 ± 0.0
Thr
6.153ThrAla: 6.153 ± 0.545
0.464ThrCys: 0.464 ± 0.168
4.678ThrAsp: 4.678 ± 0.347
3.625ThrGlu: 3.625 ± 0.487
2.529ThrPhe: 2.529 ± 0.241
6.195ThrGly: 6.195 ± 0.532
1.475ThrHis: 1.475 ± 0.292
2.95ThrIle: 2.95 ± 0.469
3.877ThrLys: 3.877 ± 0.343
5.437ThrLeu: 5.437 ± 0.449
1.307ThrMet: 1.307 ± 0.201
3.582ThrAsn: 3.582 ± 0.329
3.077ThrPro: 3.077 ± 0.301
2.992ThrGln: 2.992 ± 0.487
2.276ThrArg: 2.276 ± 0.342
3.203ThrSer: 3.203 ± 0.353
4.088ThrThr: 4.088 ± 0.397
5.605ThrVal: 5.605 ± 0.549
0.885ThrTrp: 0.885 ± 0.152
1.264ThrTyr: 1.264 ± 0.222
0.0ThrXaa: 0.0 ± 0.0
Val
7.038ValAla: 7.038 ± 0.735
0.843ValCys: 0.843 ± 0.189
4.257ValAsp: 4.257 ± 0.355
4.467ValGlu: 4.467 ± 0.522
2.276ValPhe: 2.276 ± 0.426
4.257ValGly: 4.257 ± 0.433
1.433ValHis: 1.433 ± 0.294
3.793ValIle: 3.793 ± 0.48
4.425ValLys: 4.425 ± 0.449
5.1ValLeu: 5.1 ± 0.485
2.023ValMet: 2.023 ± 0.288
4.215ValAsn: 4.215 ± 0.504
3.625ValPro: 3.625 ± 0.537
3.372ValGln: 3.372 ± 0.347
2.95ValArg: 2.95 ± 0.332
5.058ValSer: 5.058 ± 0.552
5.774ValThr: 5.774 ± 0.563
4.552ValVal: 4.552 ± 0.589
0.801ValTrp: 0.801 ± 0.28
2.107ValTyr: 2.107 ± 0.352
0.0ValXaa: 0.0 ± 0.0
Trp
1.096TrpAla: 1.096 ± 0.236
0.042TrpCys: 0.042 ± 0.041
0.716TrpAsp: 0.716 ± 0.215
0.801TrpGlu: 0.801 ± 0.24
0.927TrpPhe: 0.927 ± 0.217
0.801TrpGly: 0.801 ± 0.162
0.211TrpHis: 0.211 ± 0.095
0.548TrpIle: 0.548 ± 0.144
0.632TrpLys: 0.632 ± 0.21
1.18TrpLeu: 1.18 ± 0.238
0.295TrpMet: 0.295 ± 0.12
0.59TrpAsn: 0.59 ± 0.157
0.169TrpPro: 0.169 ± 0.103
0.464TrpGln: 0.464 ± 0.155
0.421TrpArg: 0.421 ± 0.115
0.548TrpSer: 0.548 ± 0.21
0.548TrpThr: 0.548 ± 0.128
0.801TrpVal: 0.801 ± 0.185
0.084TrpTrp: 0.084 ± 0.058
0.421TrpTyr: 0.421 ± 0.132
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.582TyrAla: 3.582 ± 0.394
0.379TyrCys: 0.379 ± 0.137
1.897TyrAsp: 1.897 ± 0.266
1.854TyrGlu: 1.854 ± 0.377
1.18TyrPhe: 1.18 ± 0.221
2.444TyrGly: 2.444 ± 0.352
0.674TyrHis: 0.674 ± 0.189
1.812TyrIle: 1.812 ± 0.286
2.023TyrLys: 2.023 ± 0.317
2.655TyrLeu: 2.655 ± 0.288
0.759TyrMet: 0.759 ± 0.17
1.686TyrAsn: 1.686 ± 0.226
1.096TyrPro: 1.096 ± 0.264
1.602TyrGln: 1.602 ± 0.331
1.812TyrArg: 1.812 ± 0.37
1.728TyrSer: 1.728 ± 0.206
1.77TyrThr: 1.77 ± 0.25
2.402TyrVal: 2.402 ± 0.371
0.379TyrTrp: 0.379 ± 0.109
0.885TyrTyr: 0.885 ± 0.165
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 94 proteins (23728 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski