Amino acid dipeptide frequency for Escherichia phage NJ01

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
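As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs within each protein (never across protein boundaries) and that any non-standard letter is pooled as Xaa; the function name `dipeptide_per_mille` and the toy sequences are hypothetical and are not taken from Proteome-pI. The overlapping-pair assumption is at least consistent with the numbers on this page: 109 proteins with 17977 amino acids give 17977 - 109 = 17868 pairs, and a single occurrence then corresponds to about 0.056‰, the smallest non-zero value in the table.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs are taken inside one protein only (assumption, see lead-in).
        for first, second in zip(seq, seq[1:]):
            # Assumption: anything outside the 20 standard letters is pooled as X (Xaa).
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real phage proteins.
toy_proteins = ["MKAAL", "AAGX"]
freqs = dipeptide_per_mille(toy_proteins)

# Uniform expectation for the 400 standard dipeptides, in per mille.
uniform_per_mille = 1000.0 / len(AMINO_ACIDS) ** 2
```

With real data, `proteins` would be the list of protein sequences of the proteome, and `freqs` could then be compared against `uniform_per_mille`.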

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
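The unit conversion and the comparison with the random expectation can be sketched as follows. This assumes that "expected" means the uniform value of 1/400 (2.5‰) described above; the helper names `to_fraction` and `classify` are hypothetical, and the actual colouring rule used on the page may differ.

```python
PER_MILLE = 1e-3
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille for 400 equally likely dipeptides


def to_fraction(value_per_mille):
    """Convert a per mille value from the table into a plain fraction."""
    return value_per_mille * PER_MILLE


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value_per_mille > expected:
        return "over"   # would be marked red under this assumption
    if value_per_mille < expected:
        return "under"  # would be marked blue under this assumption
    return "as expected"


ala_ala = 7.399  # AlaAla value from the table, in per mille
cys_cys = 0.223  # CysCys value from the table, in per mille

ala_ala_fraction = to_fraction(ala_ala)  # about 0.007399, i.e. roughly 0.74%
```

Under this assumption AlaAla is classified as overrepresented and CysCys as underrepresented.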

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency in ‰ ± the estimated error.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 7.399 ± 1.174 | 0.668 ± 0.244 | 5.674 ± 0.697 | 4.339 ± 0.751 | 2.893 ± 0.384 | 4.395 ± 0.625 | 1.391 ± 0.32 | 4.673 ± 0.588 | 5.007 ± 0.936 | 6.62 ± 0.66 | 2.336 ± 0.349 | 4.45 ± 0.558 | 3.282 ± 0.531 | 4.061 ± 0.917 | 4.061 ± 0.693 | 4.283 ± 0.658 | 4.784 ± 0.636 | 4.784 ± 0.583 | 1.001 ± 0.24 | 2.503 ± 0.351 | 0.0 ± 0.0 |
| Cys | 0.723 ± 0.248 | 0.223 ± 0.169 | 0.556 ± 0.224 | 0.723 ± 0.216 | 0.389 ± 0.138 | 1.001 ± 0.307 | 0.445 ± 0.166 | 0.501 ± 0.177 | 1.224 ± 0.324 | 0.89 ± 0.248 | 0.223 ± 0.114 | 0.334 ± 0.169 | 0.445 ± 0.144 | 0.167 ± 0.109 | 0.501 ± 0.18 | 1.057 ± 0.275 | 0.779 ± 0.235 | 0.946 ± 0.248 | 0.056 ± 0.064 | 0.445 ± 0.133 | 0.0 ± 0.0 |
| Asp | 6.175 ± 0.681 | 0.445 ± 0.163 | 3.06 ± 0.54 | 4.617 ± 0.602 | 1.78 ± 0.261 | 4.506 ± 0.587 | 1.001 ± 0.319 | 4.005 ± 0.437 | 3.227 ± 0.46 | 5.118 ± 0.545 | 2.392 ± 0.396 | 4.117 ± 0.481 | 1.78 ± 0.308 | 1.78 ± 0.305 | 2.837 ± 0.425 | 3.783 ± 0.386 | 3.06 ± 0.399 | 4.673 ± 0.546 | 0.723 ± 0.21 | 2.837 ± 0.446 | 0.0 ± 0.0 |
| Glu | 6.119 ± 0.843 | 0.89 ± 0.246 | 5.007 ± 0.747 | 7.287 ± 1.175 | 2.336 ± 0.385 | 4.061 ± 0.489 | 1.446 ± 0.301 | 3.616 ± 0.377 | 4.117 ± 0.616 | 4.339 ± 0.552 | 2.67 ± 0.387 | 3.505 ± 0.522 | 2.058 ± 0.384 | 3.004 ± 0.506 | 3.282 ± 0.431 | 4.45 ± 0.419 | 3.004 ± 0.409 | 4.673 ± 0.477 | 1.001 ± 0.221 | 2.281 ± 0.304 | 0.0 ± 0.0 |
| Phe | 2.781 ± 0.559 | 0.668 ± 0.22 | 2.781 ± 0.496 | 1.947 ± 0.307 | 1.057 ± 0.259 | 2.225 ± 0.312 | 0.612 ± 0.198 | 1.891 ± 0.409 | 2.503 ± 0.381 | 2.448 ± 0.536 | 0.834 ± 0.225 | 2.503 ± 0.376 | 1.391 ± 0.303 | 1.613 ± 0.286 | 2.17 ± 0.416 | 2.67 ± 0.459 | 1.836 ± 0.297 | 2.726 ± 0.405 | 0.445 ± 0.152 | 1.669 ± 0.297 | 0.0 ± 0.0 |
| Gly | 5.285 ± 0.912 | 1.113 ± 0.274 | 3.727 ± 0.562 | 3.727 ± 0.477 | 2.503 ± 0.35 | 6.064 ± 0.996 | 1.001 ± 0.237 | 4.061 ± 0.397 | 4.005 ± 0.531 | 4.951 ± 0.528 | 2.003 ± 0.379 | 3.616 ± 0.576 | 0.89 ± 0.249 | 2.893 ± 0.498 | 3.56 ± 0.508 | 5.785 ± 0.682 | 3.449 ± 0.511 | 5.062 ± 0.583 | 1.113 ± 0.28 | 2.559 ± 0.396 | 0.0 ± 0.0 |
| His | 1.057 ± 0.214 | 0.167 ± 0.099 | 1.113 ± 0.309 | 1.224 ± 0.286 | 0.779 ± 0.186 | 1.001 ± 0.235 | 0.278 ± 0.114 | 1.502 ± 0.323 | 1.391 ± 0.297 | 1.279 ± 0.265 | 0.278 ± 0.135 | 1.113 ± 0.22 | 0.834 ± 0.22 | 0.556 ± 0.147 | 1.057 ± 0.318 | 1.391 ± 0.334 | 1.001 ± 0.228 | 1.113 ± 0.254 | 0.389 ± 0.139 | 0.779 ± 0.227 | 0.0 ± 0.0 |
| Ile | 3.505 ± 0.418 | 0.946 ± 0.311 | 3.894 ± 0.43 | 4.061 ± 0.462 | 1.891 ± 0.275 | 3.505 ± 0.444 | 0.946 ± 0.225 | 2.448 ± 0.47 | 4.562 ± 0.438 | 3.449 ± 0.456 | 1.168 ± 0.286 | 3.727 ± 0.453 | 3.56 ± 0.57 | 1.891 ± 0.332 | 3.505 ± 0.475 | 3.783 ± 0.394 | 4.951 ± 0.548 | 2.948 ± 0.439 | 0.779 ± 0.257 | 2.114 ± 0.384 | 0.0 ± 0.0 |
| Lys | 5.952 ± 0.945 | 0.723 ± 0.206 | 4.172 ± 0.525 | 5.674 ± 0.629 | 2.225 ± 0.377 | 3.727 ± 0.468 | 1.391 ± 0.276 | 3.282 ± 0.44 | 4.617 ± 0.58 | 4.45 ± 0.542 | 2.225 ± 0.403 | 2.726 ± 0.404 | 1.613 ± 0.298 | 2.559 ± 0.463 | 2.615 ± 0.5 | 3.616 ± 0.505 | 3.727 ± 0.473 | 4.117 ± 0.541 | 1.279 ± 0.294 | 2.336 ± 0.492 | 0.0 ± 0.0 |
| Leu | 5.396 ± 0.555 | 0.723 ± 0.259 | 4.061 ± 0.518 | 6.008 ± 0.601 | 2.726 ± 0.417 | 5.507 ± 0.833 | 0.946 ± 0.195 | 4.005 ± 0.501 | 4.339 ± 0.625 | 4.784 ± 0.667 | 2.781 ± 0.331 | 4.005 ± 0.638 | 3.338 ± 0.313 | 3.115 ± 0.464 | 4.506 ± 0.471 | 4.673 ± 0.605 | 4.061 ± 0.546 | 5.007 ± 0.581 | 0.556 ± 0.19 | 2.336 ± 0.308 | 0.0 ± 0.0 |
| Met | 2.225 ± 0.357 | 0.334 ± 0.156 | 1.725 ± 0.286 | 1.891 ± 0.328 | 1.168 ± 0.263 | 0.946 ± 0.255 | 0.334 ± 0.166 | 1.891 ± 0.405 | 2.503 ± 0.504 | 2.058 ± 0.405 | 1.001 ± 0.229 | 1.669 ± 0.275 | 1.113 ± 0.208 | 1.001 ± 0.217 | 1.279 ± 0.299 | 2.781 ± 0.369 | 2.225 ± 0.369 | 1.613 ± 0.345 | 0.389 ± 0.162 | 0.501 ± 0.175 | 0.0 ± 0.0 |
| Asn | 4.84 ± 0.662 | 0.334 ± 0.149 | 2.559 ± 0.39 | 2.114 ± 0.286 | 2.837 ± 0.391 | 4.784 ± 0.7 | 0.946 ± 0.251 | 3.56 ± 0.504 | 3.171 ± 0.365 | 5.062 ± 0.642 | 1.446 ± 0.303 | 3.56 ± 0.525 | 2.503 ± 0.37 | 2.837 ± 0.482 | 3.004 ± 0.396 | 4.005 ± 0.51 | 3.393 ± 0.453 | 3.338 ± 0.459 | 0.668 ± 0.235 | 1.78 ± 0.278 | 0.0 ± 0.0 |
| Pro | 2.67 ± 0.327 | 0.389 ± 0.153 | 2.726 ± 0.399 | 3.004 ± 0.445 | 1.335 ± 0.286 | 1.669 ± 0.364 | 0.668 ± 0.199 | 2.392 ± 0.288 | 2.281 ± 0.432 | 2.837 ± 0.591 | 0.946 ± 0.255 | 1.947 ± 0.367 | 1.391 ± 0.318 | 1.168 ± 0.237 | 1.224 ± 0.272 | 2.67 ± 0.399 | 2.003 ± 0.334 | 3.449 ± 0.431 | 0.445 ± 0.128 | 0.946 ± 0.208 | 0.0 ± 0.0 |
| Gln | 3.783 ± 0.66 | 0.946 ± 0.256 | 2.225 ± 0.441 | 2.67 ± 0.45 | 1.113 ± 0.231 | 1.947 ± 0.486 | 0.445 ± 0.146 | 2.336 ± 0.43 | 2.726 ± 0.429 | 3.004 ± 0.579 | 1.057 ± 0.228 | 2.503 ± 0.562 | 1.279 ± 0.303 | 3.282 ± 1.135 | 2.781 ± 0.513 | 2.17 ± 0.325 | 2.225 ± 0.302 | 2.559 ± 0.384 | 0.501 ± 0.182 | 1.725 ± 0.298 | 0.0 ± 0.0 |
| Arg | 3.95 ± 0.665 | 0.556 ± 0.195 | 3.227 ± 0.491 | 3.393 ± 0.583 | 2.336 ± 0.31 | 2.893 ± 0.433 | 1.001 ± 0.259 | 2.948 ± 0.413 | 3.227 ± 0.511 | 3.672 ± 0.373 | 1.335 ± 0.26 | 2.67 ± 0.485 | 1.558 ± 0.278 | 2.003 ± 0.386 | 3.393 ± 0.47 | 3.894 ± 0.423 | 3.115 ± 0.388 | 4.339 ± 0.489 | 0.501 ± 0.17 | 1.891 ± 0.373 | 0.0 ± 0.0 |
| Ser | 3.894 ± 0.536 | 0.668 ± 0.215 | 3.727 ± 0.403 | 4.005 ± 0.414 | 2.781 ± 0.526 | 6.342 ± 0.878 | 1.502 ± 0.339 | 3.56 ± 0.557 | 4.228 ± 0.579 | 5.619 ± 0.502 | 1.725 ± 0.276 | 4.617 ± 0.69 | 2.948 ± 0.413 | 2.559 ± 0.53 | 3.227 ± 0.404 | 4.729 ± 0.48 | 4.395 ± 0.562 | 4.228 ± 0.569 | 0.834 ± 0.241 | 2.781 ± 0.365 | 0.0 ± 0.0 |
| Thr | 4.172 ± 0.685 | 0.445 ± 0.174 | 3.338 ± 0.35 | 4.172 ± 0.525 | 2.336 ± 0.489 | 5.34 ± 0.8 | 1.279 ± 0.28 | 3.727 ± 0.548 | 3.004 ± 0.359 | 5.229 ± 0.561 | 1.224 ± 0.295 | 2.781 ± 0.465 | 2.726 ± 0.372 | 1.836 ± 0.351 | 2.559 ± 0.407 | 3.393 ± 0.482 | 3.616 ± 0.607 | 5.062 ± 0.625 | 1.001 ± 0.293 | 2.281 ± 0.48 | 0.0 ± 0.0 |
| Val | 5.229 ± 0.571 | 0.89 ± 0.22 | 4.951 ± 0.427 | 5.174 ± 0.502 | 2.726 ± 0.418 | 4.283 ± 0.49 | 1.613 ± 0.246 | 3.894 ± 0.554 | 4.339 ± 0.408 | 3.838 ± 0.48 | 1.335 ± 0.254 | 4.005 ± 0.542 | 1.947 ± 0.353 | 2.503 ± 0.491 | 3.338 ± 0.41 | 5.062 ± 0.825 | 5.118 ± 0.788 | 6.008 ± 0.776 | 1.558 ± 0.278 | 2.392 ± 0.477 | 0.0 ± 0.0 |
| Trp | 0.779 ± 0.246 | 0.056 ± 0.071 | 1.168 ± 0.226 | 0.834 ± 0.193 | 0.445 ± 0.152 | 0.946 ± 0.251 | 0.334 ± 0.186 | 0.834 ± 0.25 | 0.668 ± 0.191 | 1.168 ± 0.259 | 0.389 ± 0.141 | 0.89 ± 0.237 | 0.167 ± 0.096 | 0.389 ± 0.141 | 0.779 ± 0.187 | 1.446 ± 0.353 | 1.168 ± 0.216 | 0.946 ± 0.209 | 0.111 ± 0.103 | 0.445 ± 0.145 | 0.0 ± 0.0 |
| Tyr | 2.67 ± 0.377 | 0.445 ± 0.174 | 2.225 ± 0.445 | 2.336 ± 0.385 | 1.113 ± 0.255 | 2.225 ± 0.338 | 0.723 ± 0.188 | 2.503 ± 0.421 | 1.891 ± 0.324 | 2.058 ± 0.314 | 1.224 ± 0.237 | 2.003 ± 0.297 | 1.335 ± 0.3 | 2.058 ± 0.266 | 2.225 ± 0.456 | 2.726 ± 0.34 | 1.613 ± 0.285 | 2.615 ± 0.385 | 0.612 ± 0.187 | 1.335 ± 0.308 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 109 proteins (17977 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
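A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the reported error is taken to be the standard deviation across replicates. The function names, the fixed seed and the toy sequences are hypothetical; the exact procedure used by Proteome-pI is not described on this page.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [dipeptide_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[dipeptide] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy_proteins = ["MKAAL", "AAGAA", "MKKL"]
mean_aa, err_aa = bootstrap_error(toy_proteins, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins produce a larger spread between replicates.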

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski