Amino acid dipeptide frequency for Escherichia phage Lambda_ev017

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
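As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping dipeptides in a list of protein sequences and reports them in per mille. It is a minimal sketch, not the database's own pipeline: the function name `dipeptide_permille`, the toy sequences, the use of one-letter codes, and the normalization by the total number of dipeptides counted are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return overlapping dipeptide frequencies in per mille.

    Hypothetical helper: normalizes by the total number of dipeptides
    counted, which may differ from the normalization used by the source.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (one-letter codes), not the real proteome.
toy = ["MKAAL", "AAG"]
freqs = dipeptide_permille(toy)

# Under a uniform model each of the 400 dipeptides is expected at 2.5 per mille.
expected_permille = 1000.0 / 400
```

With real data, the same function would be applied to all protein sequences of the proteome, and each value compared with `expected_permille`.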

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
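The unit conversion and the comparison with the uniform expectation can be sketched as follows. This is only an illustration: the helper names are hypothetical, and the simple greater-than/less-than rule is an assumption, since the exact criterion used for the red and blue marking is not stated here.

```python
# 1/400 expressed in per mille: 2.5 per mille, i.e. 0.25 %.
EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Two values taken from the table: AlaAla and CysCys.
ala_ala = 12.479
cys_cys = 0.207
ala_ala_fraction = permille_to_fraction(ala_ala)
labels = (classify(ala_ala), classify(cys_cys))
```

Here AlaAla (12.479 ‰) lies well above 2.5 ‰ and CysCys (0.207 ‰) well below it.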

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.479 ± 1.883
AlaCys: 1.034 ± 0.374
AlaAsp: 4.895 ± 0.625
AlaGlu: 8.205 ± 0.898
AlaPhe: 3.172 ± 0.45
AlaGly: 7.033 ± 0.812
AlaHis: 1.517 ± 0.287
AlaIle: 6.067 ± 0.527
AlaLys: 4.275 ± 0.569
AlaLeu: 7.998 ± 0.78
AlaMet: 3.309 ± 0.573
AlaAsn: 3.172 ± 0.364
AlaPro: 2.413 ± 0.441
AlaGln: 4.688 ± 0.876
AlaArg: 5.654 ± 0.679
AlaSer: 5.723 ± 0.725
AlaThr: 5.654 ± 1.021
AlaVal: 6.619 ± 0.683
AlaTrp: 1.655 ± 0.324
AlaTyr: 2.551 ± 0.359
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.31 ± 0.339
CysCys: 0.207 ± 0.154
CysAsp: 1.034 ± 0.229
CysGlu: 0.621 ± 0.238
CysPhe: 0.138 ± 0.104
CysGly: 0.896 ± 0.296
CysHis: 0.414 ± 0.15
CysIle: 0.689 ± 0.188
CysLys: 0.483 ± 0.178
CysLeu: 1.103 ± 0.249
CysMet: 0.414 ± 0.163
CysAsn: 0.758 ± 0.19
CysPro: 0.276 ± 0.119
CysGln: 0.483 ± 0.163
CysArg: 1.172 ± 0.308
CysSer: 1.241 ± 0.315
CysThr: 0.689 ± 0.219
CysVal: 0.896 ± 0.197
CysTrp: 0.207 ± 0.107
CysTyr: 0.621 ± 0.178
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.929 ± 0.656
AspCys: 0.621 ± 0.206
AspAsp: 4.413 ± 0.615
AspGlu: 3.861 ± 0.57
AspPhe: 1.931 ± 0.302
AspGly: 5.792 ± 0.719
AspHis: 0.483 ± 0.181
AspIle: 4.275 ± 0.541
AspLys: 3.034 ± 0.487
AspLeu: 4.206 ± 0.631
AspMet: 1.931 ± 0.353
AspAsn: 2.068 ± 0.39
AspPro: 1.931 ± 0.43
AspGln: 1.724 ± 0.379
AspArg: 3.103 ± 0.433
AspSer: 3.172 ± 0.442
AspThr: 2.689 ± 0.417
AspVal: 4.482 ± 0.676
AspTrp: 1.172 ± 0.322
AspTyr: 1.931 ± 0.319
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.723 ± 0.928
GluCys: 0.896 ± 0.282
GluAsp: 3.034 ± 0.508
GluGlu: 3.654 ± 0.595
GluPhe: 1.999 ± 0.582
GluGly: 3.792 ± 0.411
GluHis: 1.103 ± 0.247
GluIle: 3.93 ± 0.528
GluLys: 3.861 ± 0.528
GluLeu: 5.516 ± 0.565
GluMet: 1.655 ± 0.369
GluAsn: 2.482 ± 0.418
GluPro: 2.551 ± 0.341
GluGln: 3.792 ± 0.622
GluArg: 4.413 ± 0.637
GluSer: 4.206 ± 0.597
GluThr: 4.068 ± 0.475
GluVal: 3.378 ± 0.498
GluTrp: 1.31 ± 0.331
GluTyr: 2.068 ± 0.34
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.137 ± 0.482
PheCys: 0.689 ± 0.166
PheAsp: 2.965 ± 0.419
PheGlu: 2.344 ± 0.372
PhePhe: 1.241 ± 0.388
PheGly: 3.309 ± 0.594
PheHis: 0.896 ± 0.217
PheIle: 1.448 ± 0.337
PheLys: 1.999 ± 0.359
PheLeu: 2.344 ± 0.462
PheMet: 0.965 ± 0.25
PheAsn: 1.31 ± 0.275
PhePro: 1.448 ± 0.294
PheGln: 0.827 ± 0.256
PheArg: 2.689 ± 0.438
PheSer: 3.309 ± 0.463
PheThr: 2.758 ± 0.432
PheVal: 2.344 ± 0.303
PheTrp: 0.552 ± 0.156
PheTyr: 0.827 ± 0.264
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.205 ± 0.769
GlyCys: 0.896 ± 0.234
GlyAsp: 4.826 ± 0.432
GlyGlu: 3.723 ± 0.463
GlyPhe: 2.689 ± 0.44
GlyGly: 5.033 ± 0.844
GlyHis: 1.379 ± 0.342
GlyIle: 4.344 ± 0.57
GlyLys: 4.826 ± 0.486
GlyLeu: 5.998 ± 0.82
GlyMet: 3.103 ± 0.489
GlyAsn: 3.103 ± 0.434
GlyPro: 1.103 ± 0.26
GlyGln: 3.24 ± 0.572
GlyArg: 3.792 ± 0.434
GlySer: 4.137 ± 0.53
GlyThr: 3.172 ± 0.574
GlyVal: 6.136 ± 0.485
GlyTrp: 1.586 ± 0.267
GlyTyr: 2.206 ± 0.392
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.655 ± 0.299
HisCys: 0.207 ± 0.103
HisAsp: 0.827 ± 0.243
HisGlu: 0.758 ± 0.217
HisPhe: 0.758 ± 0.213
HisGly: 1.241 ± 0.213
HisHis: 0.276 ± 0.12
HisIle: 1.31 ± 0.318
HisLys: 1.034 ± 0.249
HisLeu: 1.724 ± 0.361
HisMet: 0.345 ± 0.144
HisAsn: 0.827 ± 0.259
HisPro: 0.827 ± 0.219
HisGln: 0.483 ± 0.167
HisArg: 1.034 ± 0.188
HisSer: 0.621 ± 0.232
HisThr: 1.034 ± 0.28
HisVal: 0.965 ± 0.249
HisTrp: 0.345 ± 0.174
HisTyr: 0.965 ± 0.269
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.929 ± 0.643
IleCys: 0.827 ± 0.207
IleAsp: 3.172 ± 0.483
IleGlu: 3.103 ± 0.473
IlePhe: 1.379 ± 0.307
IleGly: 3.378 ± 0.428
IleHis: 0.758 ± 0.238
IleIle: 3.309 ± 0.628
IleLys: 2.827 ± 0.522
IleLeu: 3.378 ± 0.496
IleMet: 1.103 ± 0.289
IleAsn: 3.24 ± 0.444
IlePro: 2.275 ± 0.387
IleGln: 2.068 ± 0.331
IleArg: 3.309 ± 0.395
IleSer: 4.206 ± 0.435
IleThr: 4.964 ± 0.752
IleVal: 3.172 ± 0.504
IleTrp: 0.483 ± 0.165
IleTyr: 1.31 ± 0.385
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.86 ± 0.767
LysCys: 0.621 ± 0.257
LysAsp: 3.034 ± 0.507
LysGlu: 3.447 ± 0.474
LysPhe: 1.379 ± 0.282
LysGly: 3.861 ± 0.564
LysHis: 1.379 ± 0.358
LysIle: 1.931 ± 0.408
LysLys: 3.24 ± 0.437
LysLeu: 3.999 ± 0.487
LysMet: 1.517 ± 0.274
LysAsn: 2.413 ± 0.397
LysPro: 2.068 ± 0.38
LysGln: 2.413 ± 0.446
LysArg: 3.447 ± 0.397
LysSer: 2.758 ± 0.493
LysThr: 3.585 ± 0.548
LysVal: 3.034 ± 0.497
LysTrp: 1.379 ± 0.273
LysTyr: 2.206 ± 0.285
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.618 ± 0.924
LeuCys: 1.034 ± 0.246
LeuAsp: 4.688 ± 0.483
LeuGlu: 3.93 ± 0.532
LeuPhe: 2.827 ± 0.495
LeuGly: 4.757 ± 0.619
LeuHis: 1.379 ± 0.305
LeuIle: 4.275 ± 0.584
LeuLys: 5.447 ± 0.607
LeuLeu: 6.964 ± 0.998
LeuMet: 1.931 ± 0.417
LeuAsn: 3.309 ± 0.4
LeuPro: 3.792 ± 0.508
LeuGln: 3.309 ± 0.522
LeuArg: 4.619 ± 0.587
LeuSer: 5.929 ± 0.719
LeuThr: 5.654 ± 0.736
LeuVal: 5.033 ± 0.545
LeuTrp: 1.448 ± 0.353
LeuTyr: 2.275 ± 0.348
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.447 ± 0.595
MetCys: 0.207 ± 0.112
MetAsp: 1.379 ± 0.329
MetGlu: 1.103 ± 0.332
MetPhe: 1.448 ± 0.342
MetGly: 1.586 ± 0.319
MetHis: 0.483 ± 0.225
MetIle: 1.172 ± 0.24
MetLys: 1.793 ± 0.352
MetLeu: 3.447 ± 0.448
MetMet: 0.758 ± 0.212
MetAsn: 0.896 ± 0.193
MetPro: 1.31 ± 0.398
MetGln: 1.241 ± 0.264
MetArg: 1.793 ± 0.295
MetSer: 1.655 ± 0.317
MetThr: 2.344 ± 0.4
MetVal: 2.137 ± 0.406
MetTrp: 0.483 ± 0.176
MetTyr: 0.483 ± 0.165
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.999 ± 0.62
AsnCys: 0.345 ± 0.167
AsnAsp: 2.275 ± 0.376
AsnGlu: 2.344 ± 0.391
AsnPhe: 1.448 ± 0.325
AsnGly: 3.93 ± 0.519
AsnHis: 0.965 ± 0.244
AsnIle: 2.206 ± 0.359
AsnLys: 2.413 ± 0.453
AsnLeu: 2.413 ± 0.387
AsnMet: 1.448 ± 0.285
AsnAsn: 2.413 ± 0.488
AsnPro: 1.793 ± 0.357
AsnGln: 1.448 ± 0.306
AsnArg: 2.551 ± 0.544
AsnSer: 2.137 ± 0.382
AsnThr: 1.793 ± 0.431
AsnVal: 2.344 ± 0.412
AsnTrp: 0.552 ± 0.192
AsnTyr: 1.103 ± 0.268
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.861 ± 0.57
ProCys: 0.621 ± 0.184
ProAsp: 3.585 ± 0.676
ProGlu: 3.172 ± 0.487
ProPhe: 1.379 ± 0.302
ProGly: 2.758 ± 0.403
ProHis: 0.552 ± 0.149
ProIle: 1.517 ± 0.337
ProLys: 1.586 ± 0.404
ProLeu: 2.551 ± 0.435
ProMet: 0.689 ± 0.192
ProAsn: 1.034 ± 0.246
ProPro: 1.586 ± 0.385
ProGln: 1.379 ± 0.279
ProArg: 1.586 ± 0.408
ProSer: 2.62 ± 0.482
ProThr: 2.137 ± 0.425
ProVal: 4.137 ± 0.564
ProTrp: 0.689 ± 0.225
ProTyr: 0.689 ± 0.24
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.482 ± 0.566
GlnCys: 0.827 ± 0.237
GlnAsp: 1.586 ± 0.303
GlnGlu: 2.413 ± 0.419
GlnPhe: 1.517 ± 0.327
GlnGly: 2.413 ± 0.373
GlnHis: 0.689 ± 0.258
GlnIle: 2.413 ± 0.457
GlnLys: 2.344 ± 0.398
GlnLeu: 3.516 ± 0.449
GlnMet: 1.517 ± 0.327
GlnAsn: 1.793 ± 0.3
GlnPro: 1.586 ± 0.3
GlnGln: 3.792 ± 0.642
GlnArg: 3.24 ± 0.472
GlnSer: 2.827 ± 0.41
GlnThr: 2.413 ± 0.418
GlnVal: 3.516 ± 0.478
GlnTrp: 0.621 ± 0.191
GlnTyr: 1.241 ± 0.288
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.619 ± 0.491
ArgCys: 0.758 ± 0.226
ArgAsp: 3.447 ± 0.605
ArgGlu: 5.378 ± 0.791
ArgPhe: 2.689 ± 0.455
ArgGly: 3.172 ± 0.407
ArgHis: 1.241 ± 0.25
ArgIle: 4.413 ± 0.589
ArgLys: 2.758 ± 0.431
ArgLeu: 5.792 ± 0.692
ArgMet: 2.206 ± 0.304
ArgAsn: 2.344 ± 0.529
ArgPro: 2.344 ± 0.368
ArgGln: 3.447 ± 0.431
ArgArg: 4.619 ± 0.853
ArgSer: 1.862 ± 0.326
ArgThr: 2.344 ± 0.408
ArgVal: 3.378 ± 0.528
ArgTrp: 1.31 ± 0.299
ArgTyr: 1.999 ± 0.363
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.136 ± 0.785
SerCys: 0.827 ± 0.225
SerAsp: 3.585 ± 0.491
SerGlu: 4.275 ± 0.531
SerPhe: 2.344 ± 0.337
SerGly: 6.205 ± 0.785
SerHis: 1.034 ± 0.292
SerIle: 2.551 ± 0.415
SerLys: 2.827 ± 0.497
SerLeu: 4.757 ± 0.601
SerMet: 1.724 ± 0.3
SerAsn: 2.137 ± 0.385
SerPro: 2.62 ± 0.398
SerGln: 2.965 ± 0.39
SerArg: 3.723 ± 0.428
SerSer: 3.999 ± 0.486
SerThr: 3.24 ± 0.463
SerVal: 4.757 ± 0.636
SerTrp: 0.758 ± 0.241
SerTyr: 1.999 ± 0.404
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.723 ± 0.762
ThrCys: 0.965 ± 0.263
ThrAsp: 3.034 ± 0.433
ThrGlu: 4.619 ± 0.486
ThrPhe: 2.827 ± 0.398
ThrGly: 4.688 ± 0.681
ThrHis: 0.896 ± 0.224
ThrIle: 2.62 ± 0.394
ThrLys: 2.62 ± 0.385
ThrLeu: 5.516 ± 0.664
ThrMet: 1.172 ± 0.339
ThrAsn: 1.586 ± 0.444
ThrPro: 3.172 ± 0.733
ThrGln: 2.482 ± 0.378
ThrArg: 2.689 ± 0.399
ThrSer: 3.447 ± 0.46
ThrThr: 3.654 ± 0.566
ThrVal: 4.895 ± 0.781
ThrTrp: 0.896 ± 0.259
ThrTyr: 2.344 ± 0.436
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.86 ± 0.787
ValCys: 0.965 ± 0.246
ValAsp: 4.275 ± 0.424
ValGlu: 3.792 ± 0.575
ValPhe: 3.172 ± 0.491
ValGly: 4.275 ± 0.55
ValHis: 0.689 ± 0.207
ValIle: 2.896 ± 0.444
ValLys: 4.413 ± 0.61
ValLeu: 5.792 ± 0.545
ValMet: 2.068 ± 0.344
ValAsn: 3.654 ± 0.463
ValPro: 2.965 ± 0.491
ValGln: 2.413 ± 0.556
ValArg: 3.034 ± 0.492
ValSer: 4.964 ± 0.599
ValThr: 5.171 ± 0.735
ValVal: 5.516 ± 0.584
ValTrp: 1.31 ± 0.282
ValTyr: 2.068 ± 0.441
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.517 ± 0.264
TrpCys: 0.414 ± 0.14
TrpAsp: 1.172 ± 0.331
TrpGlu: 0.896 ± 0.253
TrpPhe: 0.483 ± 0.157
TrpGly: 0.827 ± 0.239
TrpHis: 0.414 ± 0.154
TrpIle: 0.965 ± 0.242
TrpLys: 0.965 ± 0.282
TrpLeu: 1.862 ± 0.465
TrpMet: 0.621 ± 0.233
TrpAsn: 0.552 ± 0.183
TrpPro: 0.758 ± 0.198
TrpGln: 0.621 ± 0.197
TrpArg: 1.172 ± 0.322
TrpSer: 1.448 ± 0.295
TrpThr: 1.034 ± 0.284
TrpVal: 0.965 ± 0.246
TrpTrp: 0.207 ± 0.139
TrpTyr: 0.689 ± 0.2
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.62 ± 0.363
TyrCys: 0.689 ± 0.213
TyrAsp: 1.517 ± 0.253
TyrGlu: 1.793 ± 0.427
TyrPhe: 1.586 ± 0.29
TyrGly: 2.551 ± 0.403
TyrHis: 0.689 ± 0.274
TyrIle: 1.862 ± 0.39
TyrLys: 0.896 ± 0.262
TyrLeu: 2.344 ± 0.428
TyrMet: 0.621 ± 0.252
TyrAsn: 0.827 ± 0.193
TyrPro: 1.31 ± 0.363
TyrGln: 1.931 ± 0.382
TyrArg: 2.482 ± 0.413
TyrSer: 2.206 ± 0.43
TyrThr: 1.448 ± 0.292
TyrVal: 1.586 ± 0.319
TyrTrp: 0.621 ± 0.209
TyrTyr: 1.103 ± 0.308
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (14505 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
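A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled estimates is taken as the error. This is a minimal sketch under stated assumptions, not the database's implementation: the function names, the toy sequences, the fixed seed, the normalization by total dipeptide count, and the choice of the standard deviation as the reported error are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille (assumed normalization)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_sd(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (one-letter codes), not the real proteome.
toy = ["MKAAL", "AAG", "GAAK", "MKLV"]
point_estimate = pair_permille(toy, "AA")
error = bootstrap_sd(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides are never formed across protein boundaries.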

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski