Amino acid dipeptide frequency for Escherichia phage K1ind1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
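
As an illustration of what such a calculation might look like, here is a minimal Python sketch. It is not the code used by Proteome-pI: the function name dipeptide_permille is hypothetical, and it assumes that dipeptides are counted with an overlapping window inside each protein, that no pair is formed across protein boundaries, and that counts are normalised by the total number of dipeptides counted. The database may normalise differently.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not taken from the source): overlapping window,
    no pairs across protein boundaries, normalisation by the total
    number of dipeptides counted.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


if __name__ == "__main__":
    # Toy input in one-letter code; real input would be the proteome.
    print(dipeptide_permille(["AAC", "AC"]))
```

With the toy input, the pairs counted are AA, AC and AC, so AC makes up two thirds of all dipeptides.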

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
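
The uniform expectation and the over/under comparison can be sketched as follows. This is only an illustration: the function names are hypothetical, and the simple comparison against the uniform expectation is an assumption, since the exact colouring rule used for the table is not stated here.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation per dipeptide, in per mille.

    20 residue types give 400 dipeptides, 21 (with Xaa) give 441.
    """
    return 1000.0 / (n_residue_types ** 2)


def classify(value_permille, n_residue_types=20):
    """Label a table value relative to the uniform expectation.

    Assumption: a plain greater-than/less-than comparison; the
    reported error is ignored.
    """
    expected = expected_permille(n_residue_types)
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    print(expected_permille())   # uniform expectation for 400 dipeptides
    print(classify(11.391))      # AlaAla value from the table
    print(classify(0.159))       # CysCys value from the table
```

For the values in the table, AlaAla (11.391‰) lies above the 2.5‰ expectation and CysCys (0.159‰) lies below it.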

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.391AlaAla: 11.391 ± 1.905
1.274AlaCys: 1.274 ± 0.335
5.656AlaAsp: 5.656 ± 0.631
6.452AlaGlu: 6.452 ± 0.915
4.142AlaPhe: 4.142 ± 0.596
7.806AlaGly: 7.806 ± 0.642
1.673AlaHis: 1.673 ± 0.359
5.018AlaIle: 5.018 ± 0.878
5.417AlaLys: 5.417 ± 0.776
9.16AlaLeu: 9.16 ± 0.935
1.912AlaMet: 1.912 ± 0.421
3.744AlaAsn: 3.744 ± 0.616
4.222AlaPro: 4.222 ± 0.718
3.027AlaGln: 3.027 ± 0.744
4.461AlaArg: 4.461 ± 0.608
5.895AlaSer: 5.895 ± 0.823
6.293AlaThr: 6.293 ± 0.895
7.647AlaVal: 7.647 ± 0.809
1.274AlaTrp: 1.274 ± 0.28
3.346AlaTyr: 3.346 ± 0.435
0.0AlaXaa: 0.0 ± 0.0
Cys
0.876CysAla: 0.876 ± 0.271
0.159CysCys: 0.159 ± 0.12
1.195CysAsp: 1.195 ± 0.296
0.956CysGlu: 0.956 ± 0.391
0.239CysPhe: 0.239 ± 0.117
1.036CysGly: 1.036 ± 0.363
0.398CysHis: 0.398 ± 0.149
0.637CysIle: 0.637 ± 0.219
0.717CysLys: 0.717 ± 0.257
0.558CysLeu: 0.558 ± 0.256
0.0CysMet: 0.0 ± 0.0
0.478CysAsn: 0.478 ± 0.254
0.319CysPro: 0.319 ± 0.17
0.159CysGln: 0.159 ± 0.117
0.797CysArg: 0.797 ± 0.282
0.876CysSer: 0.876 ± 0.318
0.637CysThr: 0.637 ± 0.194
0.319CysVal: 0.319 ± 0.155
0.159CysTrp: 0.159 ± 0.123
0.398CysTyr: 0.398 ± 0.168
0.0CysXaa: 0.0 ± 0.0
Asp
6.691AspAla: 6.691 ± 0.806
0.797AspCys: 0.797 ± 0.224
4.939AspAsp: 4.939 ± 0.693
4.222AspGlu: 4.222 ± 0.566
2.39AspPhe: 2.39 ± 0.419
6.293AspGly: 6.293 ± 0.908
0.717AspHis: 0.717 ± 0.26
4.142AspIle: 4.142 ± 0.497
3.107AspLys: 3.107 ± 0.476
4.939AspLeu: 4.939 ± 0.72
2.071AspMet: 2.071 ± 0.298
2.23AspAsn: 2.23 ± 0.494
1.593AspPro: 1.593 ± 0.428
0.717AspGln: 0.717 ± 0.22
2.629AspArg: 2.629 ± 0.588
3.505AspSer: 3.505 ± 0.622
3.744AspThr: 3.744 ± 0.531
3.903AspVal: 3.903 ± 0.614
0.797AspTrp: 0.797 ± 0.235
2.549AspTyr: 2.549 ± 0.307
0.0AspXaa: 0.0 ± 0.0
Glu
5.735GluAla: 5.735 ± 0.725
0.478GluCys: 0.478 ± 0.314
3.505GluAsp: 3.505 ± 0.511
4.7GluGlu: 4.7 ± 0.931
3.266GluPhe: 3.266 ± 0.51
3.903GluGly: 3.903 ± 0.667
0.797GluHis: 0.797 ± 0.259
3.107GluIle: 3.107 ± 0.592
4.381GluLys: 4.381 ± 0.671
5.974GluLeu: 5.974 ± 0.558
1.832GluMet: 1.832 ± 0.369
2.071GluAsn: 2.071 ± 0.503
1.832GluPro: 1.832 ± 0.416
3.107GluGln: 3.107 ± 0.917
3.505GluArg: 3.505 ± 0.643
2.868GluSer: 2.868 ± 0.483
3.664GluThr: 3.664 ± 0.631
4.939GluVal: 4.939 ± 0.487
1.274GluTrp: 1.274 ± 0.365
2.39GluTyr: 2.39 ± 0.36
0.0GluXaa: 0.0 ± 0.0
Phe
2.629PheAla: 2.629 ± 0.465
0.478PheCys: 0.478 ± 0.222
3.983PheAsp: 3.983 ± 0.606
2.708PheGlu: 2.708 ± 0.587
1.115PhePhe: 1.115 ± 0.283
3.346PheGly: 3.346 ± 0.484
0.637PheHis: 0.637 ± 0.238
3.107PheIle: 3.107 ± 0.473
1.912PheLys: 1.912 ± 0.414
2.071PheLeu: 2.071 ± 0.483
0.717PheMet: 0.717 ± 0.224
1.434PheAsn: 1.434 ± 0.339
1.115PhePro: 1.115 ± 0.333
1.593PheGln: 1.593 ± 0.354
2.151PheArg: 2.151 ± 0.325
3.107PheSer: 3.107 ± 0.574
2.868PheThr: 2.868 ± 0.388
2.071PheVal: 2.071 ± 0.413
0.558PheTrp: 0.558 ± 0.228
1.036PheTyr: 1.036 ± 0.259
0.0PheXaa: 0.0 ± 0.0
Gly
7.249GlyAla: 7.249 ± 0.806
1.115GlyCys: 1.115 ± 0.251
4.939GlyAsp: 4.939 ± 0.732
4.779GlyGlu: 4.779 ± 0.615
3.266GlyPhe: 3.266 ± 0.557
5.656GlyGly: 5.656 ± 0.73
1.673GlyHis: 1.673 ± 0.447
2.708GlyIle: 2.708 ± 0.444
5.337GlyLys: 5.337 ± 0.79
5.576GlyLeu: 5.576 ± 0.746
2.151GlyMet: 2.151 ± 0.521
3.346GlyAsn: 3.346 ± 0.62
2.31GlyPro: 2.31 ± 0.359
3.346GlyGln: 3.346 ± 0.63
3.425GlyArg: 3.425 ± 0.513
6.532GlySer: 6.532 ± 0.892
4.7GlyThr: 4.7 ± 0.557
5.735GlyVal: 5.735 ± 0.642
1.593GlyTrp: 1.593 ± 0.321
3.266GlyTyr: 3.266 ± 0.564
0.0GlyXaa: 0.0 ± 0.0
His
0.876HisAla: 0.876 ± 0.262
0.398HisCys: 0.398 ± 0.168
1.115HisAsp: 1.115 ± 0.233
0.956HisGlu: 0.956 ± 0.297
0.717HisPhe: 0.717 ± 0.233
1.115HisGly: 1.115 ± 0.308
0.956HisHis: 0.956 ± 0.318
1.115HisIle: 1.115 ± 0.279
1.274HisLys: 1.274 ± 0.415
1.513HisLeu: 1.513 ± 0.367
0.478HisMet: 0.478 ± 0.157
0.797HisAsn: 0.797 ± 0.232
0.717HisPro: 0.717 ± 0.226
0.717HisGln: 0.717 ± 0.233
1.354HisArg: 1.354 ± 0.373
0.797HisSer: 0.797 ± 0.231
0.717HisThr: 0.717 ± 0.242
1.274HisVal: 1.274 ± 0.272
0.0HisTrp: 0.0 ± 0.0
0.398HisTyr: 0.398 ± 0.204
0.0HisXaa: 0.0 ± 0.0
Ile
4.779IleAla: 4.779 ± 0.674
0.478IleCys: 0.478 ± 0.177
3.983IleAsp: 3.983 ± 0.586
3.027IleGlu: 3.027 ± 0.46
1.274IlePhe: 1.274 ± 0.391
2.947IleGly: 2.947 ± 0.437
0.717IleHis: 0.717 ± 0.207
2.868IleIle: 2.868 ± 0.353
3.107IleLys: 3.107 ± 0.587
3.266IleLeu: 3.266 ± 0.609
0.956IleMet: 0.956 ± 0.302
2.947IleAsn: 2.947 ± 0.592
3.027IlePro: 3.027 ± 0.396
1.912IleGln: 1.912 ± 0.498
2.151IleArg: 2.151 ± 0.387
3.903IleSer: 3.903 ± 0.549
4.54IleThr: 4.54 ± 0.63
3.585IleVal: 3.585 ± 0.583
0.956IleTrp: 0.956 ± 0.251
1.752IleTyr: 1.752 ± 0.483
0.0IleXaa: 0.0 ± 0.0
Lys
6.452LysAla: 6.452 ± 1.206
0.398LysCys: 0.398 ± 0.186
3.823LysAsp: 3.823 ± 0.665
3.983LysGlu: 3.983 ± 0.784
2.151LysPhe: 2.151 ± 0.337
3.425LysGly: 3.425 ± 0.534
1.274LysHis: 1.274 ± 0.261
1.991LysIle: 1.991 ± 0.421
2.947LysLys: 2.947 ± 0.571
4.142LysLeu: 4.142 ± 0.615
2.947LysMet: 2.947 ± 0.533
1.434LysAsn: 1.434 ± 0.348
2.071LysPro: 2.071 ± 0.389
2.071LysGln: 2.071 ± 0.547
3.505LysArg: 3.505 ± 0.592
3.027LysSer: 3.027 ± 0.539
3.823LysThr: 3.823 ± 0.465
3.107LysVal: 3.107 ± 0.536
0.398LysTrp: 0.398 ± 0.164
2.071LysTyr: 2.071 ± 0.421
0.0LysXaa: 0.0 ± 0.0
Leu
9.399LeuAla: 9.399 ± 0.787
0.797LeuCys: 0.797 ± 0.263
4.142LeuAsp: 4.142 ± 0.581
4.7LeuGlu: 4.7 ± 0.661
2.469LeuPhe: 2.469 ± 0.56
4.939LeuGly: 4.939 ± 0.449
1.593LeuHis: 1.593 ± 0.356
4.54LeuIle: 4.54 ± 0.375
4.062LeuLys: 4.062 ± 0.558
5.656LeuLeu: 5.656 ± 0.715
1.513LeuMet: 1.513 ± 0.303
4.222LeuAsn: 4.222 ± 0.707
3.664LeuPro: 3.664 ± 0.618
3.027LeuGln: 3.027 ± 0.525
4.859LeuArg: 4.859 ± 0.714
4.54LeuSer: 4.54 ± 0.651
5.895LeuThr: 5.895 ± 0.77
4.859LeuVal: 4.859 ± 0.698
1.036LeuTrp: 1.036 ± 0.333
2.151LeuTyr: 2.151 ± 0.401
0.0LeuXaa: 0.0 ± 0.0
Met
2.868MetAla: 2.868 ± 0.43
0.558MetCys: 0.558 ± 0.151
0.797MetAsp: 0.797 ± 0.282
1.115MetGlu: 1.115 ± 0.279
0.558MetPhe: 0.558 ± 0.225
1.991MetGly: 1.991 ± 0.419
0.319MetHis: 0.319 ± 0.199
1.274MetIle: 1.274 ± 0.359
1.354MetLys: 1.354 ± 0.435
1.752MetLeu: 1.752 ± 0.36
0.478MetMet: 0.478 ± 0.202
0.637MetAsn: 0.637 ± 0.234
1.354MetPro: 1.354 ± 0.399
0.558MetGln: 0.558 ± 0.235
1.354MetArg: 1.354 ± 0.274
2.071MetSer: 2.071 ± 0.304
1.752MetThr: 1.752 ± 0.399
2.31MetVal: 2.31 ± 0.469
0.159MetTrp: 0.159 ± 0.103
0.478MetTyr: 0.478 ± 0.181
0.0MetXaa: 0.0 ± 0.0
Asn
4.062AsnAla: 4.062 ± 0.489
0.876AsnCys: 0.876 ± 0.499
1.912AsnAsp: 1.912 ± 0.375
2.549AsnGlu: 2.549 ± 0.444
1.036AsnPhe: 1.036 ± 0.26
4.939AsnGly: 4.939 ± 0.777
0.558AsnHis: 0.558 ± 0.196
1.912AsnIle: 1.912 ± 0.473
1.434AsnLys: 1.434 ± 0.278
2.788AsnLeu: 2.788 ± 0.394
0.876AsnMet: 0.876 ± 0.317
2.151AsnAsn: 2.151 ± 0.332
2.39AsnPro: 2.39 ± 0.399
1.434AsnGln: 1.434 ± 0.393
2.071AsnArg: 2.071 ± 0.475
2.708AsnSer: 2.708 ± 0.39
2.39AsnThr: 2.39 ± 0.48
3.505AsnVal: 3.505 ± 0.471
0.956AsnTrp: 0.956 ± 0.265
1.354AsnTyr: 1.354 ± 0.373
0.0AsnXaa: 0.0 ± 0.0
Pro
3.346ProAla: 3.346 ± 0.422
0.319ProCys: 0.319 ± 0.168
2.708ProAsp: 2.708 ± 0.461
3.903ProGlu: 3.903 ± 0.548
1.832ProPhe: 1.832 ± 0.404
3.107ProGly: 3.107 ± 0.531
0.558ProHis: 0.558 ± 0.223
1.673ProIle: 1.673 ± 0.331
1.832ProLys: 1.832 ± 0.422
3.027ProLeu: 3.027 ± 0.547
0.717ProMet: 0.717 ± 0.272
1.434ProAsn: 1.434 ± 0.42
1.434ProPro: 1.434 ± 0.423
1.274ProGln: 1.274 ± 0.283
1.673ProArg: 1.673 ± 0.433
2.629ProSer: 2.629 ± 0.483
2.151ProThr: 2.151 ± 0.485
4.142ProVal: 4.142 ± 0.612
0.239ProTrp: 0.239 ± 0.14
1.036ProTyr: 1.036 ± 0.286
0.0ProXaa: 0.0 ± 0.0
Gln
4.461GlnAla: 4.461 ± 0.722
0.478GlnCys: 0.478 ± 0.234
1.912GlnAsp: 1.912 ± 0.414
2.151GlnGlu: 2.151 ± 0.437
1.274GlnPhe: 1.274 ± 0.243
1.991GlnGly: 1.991 ± 0.425
0.637GlnHis: 0.637 ± 0.256
1.912GlnIle: 1.912 ± 0.337
1.991GlnLys: 1.991 ± 0.563
3.744GlnLeu: 3.744 ± 0.604
0.797GlnMet: 0.797 ± 0.23
1.752GlnAsn: 1.752 ± 0.416
1.593GlnPro: 1.593 ± 0.32
2.23GlnGln: 2.23 ± 0.743
2.469GlnArg: 2.469 ± 0.383
1.673GlnSer: 1.673 ± 0.39
2.071GlnThr: 2.071 ± 0.477
2.469GlnVal: 2.469 ± 0.517
0.797GlnTrp: 0.797 ± 0.263
1.354GlnTyr: 1.354 ± 0.315
0.0GlnXaa: 0.0 ± 0.0
Arg
4.142ArgAla: 4.142 ± 0.598
0.558ArgCys: 0.558 ± 0.202
2.071ArgAsp: 2.071 ± 0.399
3.744ArgGlu: 3.744 ± 0.534
2.23ArgPhe: 2.23 ± 0.339
3.585ArgGly: 3.585 ± 0.57
1.274ArgHis: 1.274 ± 0.254
3.186ArgIle: 3.186 ± 0.471
3.823ArgLys: 3.823 ± 0.649
4.381ArgLeu: 4.381 ± 0.654
1.354ArgMet: 1.354 ± 0.307
2.708ArgAsn: 2.708 ± 0.37
1.673ArgPro: 1.673 ± 0.393
2.947ArgGln: 2.947 ± 0.523
4.461ArgArg: 4.461 ± 0.708
3.266ArgSer: 3.266 ± 0.316
3.107ArgThr: 3.107 ± 0.578
3.744ArgVal: 3.744 ± 0.508
0.478ArgTrp: 0.478 ± 0.213
0.956ArgTyr: 0.956 ± 0.312
0.0ArgXaa: 0.0 ± 0.0
Ser
5.895SerAla: 5.895 ± 0.609
0.319SerCys: 0.319 ± 0.118
3.744SerAsp: 3.744 ± 0.506
2.788SerGlu: 2.788 ± 0.449
3.027SerPhe: 3.027 ± 0.597
8.205SerGly: 8.205 ± 1.125
0.956SerHis: 0.956 ± 0.254
3.266SerIle: 3.266 ± 0.814
3.186SerLys: 3.186 ± 0.532
4.779SerLeu: 4.779 ± 0.634
1.274SerMet: 1.274 ± 0.345
3.107SerAsn: 3.107 ± 0.706
2.151SerPro: 2.151 ± 0.421
2.151SerGln: 2.151 ± 0.502
3.425SerArg: 3.425 ± 0.442
4.859SerSer: 4.859 ± 1.131
4.301SerThr: 4.301 ± 0.543
4.939SerVal: 4.939 ± 0.685
0.478SerTrp: 0.478 ± 0.174
1.912SerTyr: 1.912 ± 0.378
0.0SerXaa: 0.0 ± 0.0
Thr
7.886ThrAla: 7.886 ± 1.098
0.558ThrCys: 0.558 ± 0.196
3.903ThrAsp: 3.903 ± 0.534
3.983ThrGlu: 3.983 ± 0.543
3.186ThrPhe: 3.186 ± 0.578
6.213ThrGly: 6.213 ± 0.901
0.956ThrHis: 0.956 ± 0.293
3.107ThrIle: 3.107 ± 0.453
3.027ThrLys: 3.027 ± 0.453
5.417ThrLeu: 5.417 ± 0.58
1.434ThrMet: 1.434 ± 0.271
1.752ThrAsn: 1.752 ± 0.425
3.505ThrPro: 3.505 ± 0.618
1.991ThrGln: 1.991 ± 0.414
2.469ThrArg: 2.469 ± 0.348
3.425ThrSer: 3.425 ± 0.565
4.7ThrThr: 4.7 ± 0.79
4.7ThrVal: 4.7 ± 0.647
0.956ThrTrp: 0.956 ± 0.317
2.549ThrTyr: 2.549 ± 0.397
0.0ThrXaa: 0.0 ± 0.0
Val
7.01ValAla: 7.01 ± 0.627
0.478ValCys: 0.478 ± 0.199
4.461ValAsp: 4.461 ± 0.575
4.54ValGlu: 4.54 ± 0.841
1.912ValPhe: 1.912 ± 0.345
4.461ValGly: 4.461 ± 0.479
0.797ValHis: 0.797 ± 0.267
4.62ValIle: 4.62 ± 0.688
3.823ValLys: 3.823 ± 0.546
5.098ValLeu: 5.098 ± 0.614
1.115ValMet: 1.115 ± 0.233
3.425ValAsn: 3.425 ± 0.571
2.23ValPro: 2.23 ± 0.427
3.107ValGln: 3.107 ± 0.493
3.983ValArg: 3.983 ± 0.52
5.656ValSer: 5.656 ± 0.811
5.656ValThr: 5.656 ± 0.836
5.576ValVal: 5.576 ± 0.744
0.876ValTrp: 0.876 ± 0.286
2.39ValTyr: 2.39 ± 0.54
0.0ValXaa: 0.0 ± 0.0
Trp
1.115TrpAla: 1.115 ± 0.407
0.08TrpCys: 0.08 ± 0.078
0.956TrpAsp: 0.956 ± 0.217
0.398TrpGlu: 0.398 ± 0.227
0.717TrpPhe: 0.717 ± 0.341
0.797TrpGly: 0.797 ± 0.258
0.319TrpHis: 0.319 ± 0.221
0.239TrpIle: 0.239 ± 0.153
0.717TrpLys: 0.717 ± 0.21
1.832TrpLeu: 1.832 ± 0.339
0.478TrpMet: 0.478 ± 0.19
0.717TrpAsn: 0.717 ± 0.297
0.558TrpPro: 0.558 ± 0.213
0.637TrpGln: 0.637 ± 0.254
0.797TrpArg: 0.797 ± 0.299
1.115TrpSer: 1.115 ± 0.322
0.558TrpThr: 0.558 ± 0.165
0.637TrpVal: 0.637 ± 0.209
0.159TrpTrp: 0.159 ± 0.108
0.478TrpTyr: 0.478 ± 0.196
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.947TyrAla: 2.947 ± 0.452
0.319TyrCys: 0.319 ± 0.168
2.23TyrAsp: 2.23 ± 0.663
1.752TyrGlu: 1.752 ± 0.328
1.832TyrPhe: 1.832 ± 0.428
2.947TyrGly: 2.947 ± 0.39
0.478TyrHis: 0.478 ± 0.217
1.752TyrIle: 1.752 ± 0.367
1.593TyrLys: 1.593 ± 0.382
2.39TyrLeu: 2.39 ± 0.458
0.558TyrMet: 0.558 ± 0.193
1.513TyrAsn: 1.513 ± 0.352
1.274TyrPro: 1.274 ± 0.509
1.752TyrGln: 1.752 ± 0.353
2.151TyrArg: 2.151 ± 0.465
2.31TyrSer: 2.31 ± 0.426
2.151TyrThr: 2.151 ± 0.473
1.752TyrVal: 1.752 ± 0.433
0.159TyrTrp: 0.159 ± 0.11
1.115TyrTyr: 1.115 ± 0.257
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (12555 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
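
A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the database's implementation: the function names are hypothetical, and it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the sample standard deviation over the replicates.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Per-mille dipeptide frequencies (overlapping window, assumed)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error of one dipeptide frequency.

    Assumption: the error is the standard deviation of the frequency
    over replicates, each built by drawing len(proteins) proteins
    with replacement.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MAAK", "MKLA", "AAAG", "MLLK"]  # toy sequences, not real data
    print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.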

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski