Amino acid dipepetide frequency for Pseudomonas phage phi297

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.627AlaAla: 16.627 ± 2.207
1.541AlaCys: 1.541 ± 0.303
6.42AlaAsp: 6.42 ± 0.795
8.41AlaGlu: 8.41 ± 0.865
2.568AlaPhe: 2.568 ± 0.493
9.18AlaGly: 9.18 ± 0.761
2.76AlaHis: 2.76 ± 0.472
6.163AlaIle: 6.163 ± 0.576
4.365AlaLys: 4.365 ± 0.608
11.427AlaLeu: 11.427 ± 1.122
3.531AlaMet: 3.531 ± 0.51
3.595AlaAsn: 3.595 ± 0.571
5.778AlaPro: 5.778 ± 0.748
6.035AlaGln: 6.035 ± 0.78
8.346AlaArg: 8.346 ± 0.826
6.741AlaSer: 6.741 ± 0.733
6.163AlaThr: 6.163 ± 1.054
6.227AlaVal: 6.227 ± 0.768
2.568AlaTrp: 2.568 ± 0.403
2.889AlaTyr: 2.889 ± 0.446
0.0AlaXaa: 0.0 ± 0.0
Cys
1.156CysAla: 1.156 ± 0.235
0.128CysCys: 0.128 ± 0.073
0.578CysAsp: 0.578 ± 0.233
0.321CysGlu: 0.321 ± 0.117
0.257CysPhe: 0.257 ± 0.124
1.091CysGly: 1.091 ± 0.233
0.257CysHis: 0.257 ± 0.108
0.449CysIle: 0.449 ± 0.208
0.385CysLys: 0.385 ± 0.151
0.963CysLeu: 0.963 ± 0.222
0.321CysMet: 0.321 ± 0.141
0.385CysAsn: 0.385 ± 0.146
0.963CysPro: 0.963 ± 0.264
0.642CysGln: 0.642 ± 0.187
0.706CysArg: 0.706 ± 0.181
1.091CysSer: 1.091 ± 0.233
0.514CysThr: 0.514 ± 0.175
0.193CysVal: 0.193 ± 0.103
0.128CysTrp: 0.128 ± 0.084
0.321CysTyr: 0.321 ± 0.126
0.0CysXaa: 0.0 ± 0.0
Asp
6.612AspAla: 6.612 ± 0.512
0.449AspCys: 0.449 ± 0.196
2.953AspAsp: 2.953 ± 0.444
3.338AspGlu: 3.338 ± 0.424
1.862AspPhe: 1.862 ± 0.331
4.815AspGly: 4.815 ± 0.578
1.284AspHis: 1.284 ± 0.274
2.953AspIle: 2.953 ± 0.393
1.99AspLys: 1.99 ± 0.362
6.548AspLeu: 6.548 ± 0.587
1.541AspMet: 1.541 ± 0.301
1.541AspAsn: 1.541 ± 0.353
2.183AspPro: 2.183 ± 0.359
3.338AspGln: 3.338 ± 0.488
4.43AspArg: 4.43 ± 0.51
2.568AspSer: 2.568 ± 0.374
2.825AspThr: 2.825 ± 0.464
4.237AspVal: 4.237 ± 0.566
1.284AspTrp: 1.284 ± 0.302
1.22AspTyr: 1.22 ± 0.279
0.0AspXaa: 0.0 ± 0.0
Glu
7.318GluAla: 7.318 ± 0.989
0.385GluCys: 0.385 ± 0.159
2.825GluAsp: 2.825 ± 0.358
3.467GluGlu: 3.467 ± 0.516
2.439GluPhe: 2.439 ± 0.387
4.301GluGly: 4.301 ± 0.582
1.798GluHis: 1.798 ± 0.382
3.723GluIle: 3.723 ± 0.493
2.889GluLys: 2.889 ± 0.4
5.328GluLeu: 5.328 ± 0.556
1.541GluMet: 1.541 ± 0.277
1.926GluAsn: 1.926 ± 0.447
3.531GluPro: 3.531 ± 0.495
3.081GluGln: 3.081 ± 0.475
5.007GluArg: 5.007 ± 0.681
2.825GluSer: 2.825 ± 0.498
2.054GluThr: 2.054 ± 0.312
4.301GluVal: 4.301 ± 0.439
1.091GluTrp: 1.091 ± 0.222
1.99GluTyr: 1.99 ± 0.291
0.0GluXaa: 0.0 ± 0.0
Phe
3.146PheAla: 3.146 ± 0.425
0.064PheCys: 0.064 ± 0.061
2.183PheAsp: 2.183 ± 0.302
2.054PheGlu: 2.054 ± 0.371
0.642PhePhe: 0.642 ± 0.206
2.183PheGly: 2.183 ± 0.338
0.514PheHis: 0.514 ± 0.225
1.541PheIle: 1.541 ± 0.268
0.963PheLys: 0.963 ± 0.239
2.632PheLeu: 2.632 ± 0.504
0.642PheMet: 0.642 ± 0.231
0.77PheAsn: 0.77 ± 0.193
1.348PhePro: 1.348 ± 0.242
1.091PheGln: 1.091 ± 0.285
1.926PheArg: 1.926 ± 0.357
2.119PheSer: 2.119 ± 0.37
2.119PheThr: 2.119 ± 0.341
1.99PheVal: 1.99 ± 0.395
0.514PheTrp: 0.514 ± 0.144
0.385PheTyr: 0.385 ± 0.148
0.0PheXaa: 0.0 ± 0.0
Gly
9.052GlyAla: 9.052 ± 0.876
0.899GlyCys: 0.899 ± 0.238
4.622GlyAsp: 4.622 ± 0.679
4.879GlyGlu: 4.879 ± 0.547
2.439GlyPhe: 2.439 ± 0.441
6.356GlyGly: 6.356 ± 0.747
1.348GlyHis: 1.348 ± 0.257
3.402GlyIle: 3.402 ± 0.414
2.889GlyLys: 2.889 ± 0.432
7.318GlyLeu: 7.318 ± 0.605
2.439GlyMet: 2.439 ± 0.44
2.76GlyAsn: 2.76 ± 0.45
2.825GlyPro: 2.825 ± 0.485
3.595GlyGln: 3.595 ± 0.329
5.457GlyArg: 5.457 ± 0.609
4.237GlySer: 4.237 ± 0.588
4.237GlyThr: 4.237 ± 0.451
5.842GlyVal: 5.842 ± 0.559
1.412GlyTrp: 1.412 ± 0.306
3.017GlyTyr: 3.017 ± 0.43
0.0GlyXaa: 0.0 ± 0.0
His
2.76HisAla: 2.76 ± 0.395
0.257HisCys: 0.257 ± 0.134
1.22HisAsp: 1.22 ± 0.257
1.284HisGlu: 1.284 ± 0.289
0.642HisPhe: 0.642 ± 0.194
1.798HisGly: 1.798 ± 0.307
0.321HisHis: 0.321 ± 0.127
0.835HisIle: 0.835 ± 0.235
0.578HisLys: 0.578 ± 0.178
1.605HisLeu: 1.605 ± 0.344
0.449HisMet: 0.449 ± 0.145
0.642HisAsn: 0.642 ± 0.152
1.284HisPro: 1.284 ± 0.332
1.027HisGln: 1.027 ± 0.261
1.477HisArg: 1.477 ± 0.309
1.091HisSer: 1.091 ± 0.242
1.027HisThr: 1.027 ± 0.298
1.091HisVal: 1.091 ± 0.258
0.321HisTrp: 0.321 ± 0.128
0.514HisTyr: 0.514 ± 0.172
0.0HisXaa: 0.0 ± 0.0
Ile
5.521IleAla: 5.521 ± 0.447
0.193IleCys: 0.193 ± 0.127
4.365IleAsp: 4.365 ± 0.548
3.338IleGlu: 3.338 ± 0.407
1.156IlePhe: 1.156 ± 0.249
3.467IleGly: 3.467 ± 0.521
1.22IleHis: 1.22 ± 0.287
2.183IleIle: 2.183 ± 0.319
2.119IleLys: 2.119 ± 0.372
3.852IleLeu: 3.852 ± 0.717
0.706IleMet: 0.706 ± 0.198
1.477IleAsn: 1.477 ± 0.299
2.889IlePro: 2.889 ± 0.579
1.412IleGln: 1.412 ± 0.272
2.183IleArg: 2.183 ± 0.433
2.889IleSer: 2.889 ± 0.49
3.21IleThr: 3.21 ± 0.495
3.21IleVal: 3.21 ± 0.449
0.899IleTrp: 0.899 ± 0.219
1.733IleTyr: 1.733 ± 0.355
0.0IleXaa: 0.0 ± 0.0
Lys
4.044LysAla: 4.044 ± 0.549
0.385LysCys: 0.385 ± 0.147
1.926LysAsp: 1.926 ± 0.353
2.247LysGlu: 2.247 ± 0.468
0.899LysPhe: 0.899 ± 0.231
3.274LysGly: 3.274 ± 0.453
0.77LysHis: 0.77 ± 0.202
1.477LysIle: 1.477 ± 0.354
1.477LysLys: 1.477 ± 0.384
3.659LysLeu: 3.659 ± 0.505
0.642LysMet: 0.642 ± 0.182
0.963LysAsn: 0.963 ± 0.22
2.054LysPro: 2.054 ± 0.399
1.412LysGln: 1.412 ± 0.278
2.696LysArg: 2.696 ± 0.336
1.926LysSer: 1.926 ± 0.331
2.632LysThr: 2.632 ± 0.33
3.402LysVal: 3.402 ± 0.403
0.578LysTrp: 0.578 ± 0.184
1.091LysTyr: 1.091 ± 0.286
0.0LysXaa: 0.0 ± 0.0
Leu
10.272LeuAla: 10.272 ± 0.961
1.284LeuCys: 1.284 ± 0.293
6.227LeuAsp: 6.227 ± 0.621
5.2LeuGlu: 5.2 ± 0.528
2.311LeuPhe: 2.311 ± 0.483
6.227LeuGly: 6.227 ± 0.662
1.284LeuHis: 1.284 ± 0.269
3.916LeuIle: 3.916 ± 0.549
3.595LeuLys: 3.595 ± 0.527
7.768LeuLeu: 7.768 ± 0.724
1.605LeuMet: 1.605 ± 0.298
3.274LeuAsn: 3.274 ± 0.4
4.815LeuPro: 4.815 ± 0.6
3.916LeuGln: 3.916 ± 0.43
6.42LeuArg: 6.42 ± 0.78
4.558LeuSer: 4.558 ± 0.532
5.521LeuThr: 5.521 ± 0.529
5.714LeuVal: 5.714 ± 0.601
0.642LeuTrp: 0.642 ± 0.204
2.632LeuTyr: 2.632 ± 0.361
0.0LeuXaa: 0.0 ± 0.0
Met
3.402MetAla: 3.402 ± 0.475
0.193MetCys: 0.193 ± 0.109
0.963MetAsp: 0.963 ± 0.209
1.091MetGlu: 1.091 ± 0.274
1.091MetPhe: 1.091 ± 0.268
1.284MetGly: 1.284 ± 0.31
0.706MetHis: 0.706 ± 0.322
1.027MetIle: 1.027 ± 0.23
1.541MetLys: 1.541 ± 0.319
1.862MetLeu: 1.862 ± 0.302
0.321MetMet: 0.321 ± 0.12
0.899MetAsn: 0.899 ± 0.229
1.412MetPro: 1.412 ± 0.272
0.963MetGln: 0.963 ± 0.248
2.119MetArg: 2.119 ± 0.292
1.605MetSer: 1.605 ± 0.331
1.798MetThr: 1.798 ± 0.361
1.22MetVal: 1.22 ± 0.342
0.321MetTrp: 0.321 ± 0.12
0.257MetTyr: 0.257 ± 0.14
0.0MetXaa: 0.0 ± 0.0
Asn
3.659AsnAla: 3.659 ± 0.508
0.449AsnCys: 0.449 ± 0.156
1.669AsnAsp: 1.669 ± 0.352
1.605AsnGlu: 1.605 ± 0.278
1.027AsnPhe: 1.027 ± 0.19
3.467AsnGly: 3.467 ± 0.518
0.514AsnHis: 0.514 ± 0.149
1.348AsnIle: 1.348 ± 0.332
1.22AsnLys: 1.22 ± 0.229
3.21AsnLeu: 3.21 ± 0.481
0.578AsnMet: 0.578 ± 0.21
0.835AsnAsn: 0.835 ± 0.251
1.541AsnPro: 1.541 ± 0.331
1.284AsnGln: 1.284 ± 0.277
2.247AsnArg: 2.247 ± 0.414
2.568AsnSer: 2.568 ± 0.488
1.348AsnThr: 1.348 ± 0.235
1.99AsnVal: 1.99 ± 0.358
0.385AsnTrp: 0.385 ± 0.16
1.156AsnTyr: 1.156 ± 0.205
0.0AsnXaa: 0.0 ± 0.0
Pro
6.805ProAla: 6.805 ± 0.621
0.578ProCys: 0.578 ± 0.191
2.953ProAsp: 2.953 ± 0.415
3.659ProGlu: 3.659 ± 0.496
1.733ProPhe: 1.733 ± 0.308
5.328ProGly: 5.328 ± 0.652
0.899ProHis: 0.899 ± 0.224
1.862ProIle: 1.862 ± 0.294
1.477ProLys: 1.477 ± 0.276
3.916ProLeu: 3.916 ± 0.477
1.348ProMet: 1.348 ± 0.358
1.926ProAsn: 1.926 ± 0.319
1.862ProPro: 1.862 ± 0.34
1.733ProGln: 1.733 ± 0.291
2.632ProArg: 2.632 ± 0.446
3.274ProSer: 3.274 ± 0.423
2.889ProThr: 2.889 ± 0.465
3.467ProVal: 3.467 ± 0.449
0.642ProTrp: 0.642 ± 0.212
1.284ProTyr: 1.284 ± 0.268
0.0ProXaa: 0.0 ± 0.0
Gln
6.933GlnAla: 6.933 ± 1.247
0.706GlnCys: 0.706 ± 0.176
2.119GlnAsp: 2.119 ± 0.313
2.375GlnGlu: 2.375 ± 0.384
0.899GlnPhe: 0.899 ± 0.237
3.402GlnGly: 3.402 ± 0.46
1.284GlnHis: 1.284 ± 0.319
2.632GlnIle: 2.632 ± 0.385
1.412GlnLys: 1.412 ± 0.317
3.402GlnLeu: 3.402 ± 0.449
0.963GlnMet: 0.963 ± 0.241
0.835GlnAsn: 0.835 ± 0.217
1.926GlnPro: 1.926 ± 0.397
2.825GlnGln: 2.825 ± 0.502
3.852GlnArg: 3.852 ± 0.569
1.926GlnSer: 1.926 ± 0.293
2.504GlnThr: 2.504 ± 0.392
3.081GlnVal: 3.081 ± 0.54
0.706GlnTrp: 0.706 ± 0.219
1.284GlnTyr: 1.284 ± 0.299
0.0GlnXaa: 0.0 ± 0.0
Arg
8.41ArgAla: 8.41 ± 0.898
0.899ArgCys: 0.899 ± 0.216
3.852ArgAsp: 3.852 ± 0.399
5.136ArgGlu: 5.136 ± 0.701
2.183ArgPhe: 2.183 ± 0.254
4.751ArgGly: 4.751 ± 0.651
1.862ArgHis: 1.862 ± 0.321
4.173ArgIle: 4.173 ± 0.462
2.504ArgLys: 2.504 ± 0.435
6.227ArgLeu: 6.227 ± 0.616
1.99ArgMet: 1.99 ± 0.449
2.504ArgAsn: 2.504 ± 0.447
3.531ArgPro: 3.531 ± 0.618
3.338ArgGln: 3.338 ± 0.521
6.099ArgArg: 6.099 ± 0.767
3.788ArgSer: 3.788 ± 0.437
3.531ArgThr: 3.531 ± 0.455
3.852ArgVal: 3.852 ± 0.511
1.091ArgTrp: 1.091 ± 0.241
2.375ArgTyr: 2.375 ± 0.447
0.0ArgXaa: 0.0 ± 0.0
Ser
6.933SerAla: 6.933 ± 0.806
0.449SerCys: 0.449 ± 0.15
3.595SerAsp: 3.595 ± 0.414
3.98SerGlu: 3.98 ± 0.551
2.119SerPhe: 2.119 ± 0.432
4.622SerGly: 4.622 ± 0.62
0.835SerHis: 0.835 ± 0.202
2.439SerIle: 2.439 ± 0.402
1.798SerLys: 1.798 ± 0.288
5.007SerLeu: 5.007 ± 0.61
1.412SerMet: 1.412 ± 0.246
1.99SerAsn: 1.99 ± 0.35
3.21SerPro: 3.21 ± 0.513
2.504SerGln: 2.504 ± 0.407
3.338SerArg: 3.338 ± 0.391
4.365SerSer: 4.365 ± 0.583
3.017SerThr: 3.017 ± 0.354
4.365SerVal: 4.365 ± 0.656
1.156SerTrp: 1.156 ± 0.275
1.412SerTyr: 1.412 ± 0.294
0.0SerXaa: 0.0 ± 0.0
Thr
7.704ThrAla: 7.704 ± 0.819
0.642ThrCys: 0.642 ± 0.193
3.081ThrAsp: 3.081 ± 0.388
2.568ThrGlu: 2.568 ± 0.345
1.733ThrPhe: 1.733 ± 0.307
5.264ThrGly: 5.264 ± 0.546
0.385ThrHis: 0.385 ± 0.176
2.953ThrIle: 2.953 ± 0.509
1.926ThrLys: 1.926 ± 0.419
3.659ThrLeu: 3.659 ± 0.484
1.477ThrMet: 1.477 ± 0.336
1.862ThrAsn: 1.862 ± 0.43
2.825ThrPro: 2.825 ± 0.343
2.119ThrGln: 2.119 ± 0.284
3.146ThrArg: 3.146 ± 0.502
2.889ThrSer: 2.889 ± 0.633
2.825ThrThr: 2.825 ± 0.44
5.136ThrVal: 5.136 ± 0.736
0.77ThrTrp: 0.77 ± 0.18
2.183ThrTyr: 2.183 ± 0.376
0.0ThrXaa: 0.0 ± 0.0
Val
7.318ValAla: 7.318 ± 0.662
0.642ValCys: 0.642 ± 0.165
3.659ValAsp: 3.659 ± 0.492
4.622ValGlu: 4.622 ± 0.495
1.605ValPhe: 1.605 ± 0.266
4.815ValGly: 4.815 ± 0.608
1.091ValHis: 1.091 ± 0.309
3.21ValIle: 3.21 ± 0.406
2.889ValLys: 2.889 ± 0.378
4.494ValLeu: 4.494 ± 0.6
1.669ValMet: 1.669 ± 0.359
2.504ValAsn: 2.504 ± 0.416
3.852ValPro: 3.852 ± 0.49
2.76ValGln: 2.76 ± 0.421
4.943ValArg: 4.943 ± 0.842
5.136ValSer: 5.136 ± 0.687
4.173ValThr: 4.173 ± 0.623
4.109ValVal: 4.109 ± 0.56
0.77ValTrp: 0.77 ± 0.23
1.541ValTyr: 1.541 ± 0.43
0.0ValXaa: 0.0 ± 0.0
Trp
1.412TrpAla: 1.412 ± 0.275
0.128TrpCys: 0.128 ± 0.088
1.027TrpAsp: 1.027 ± 0.263
0.77TrpGlu: 0.77 ± 0.198
0.321TrpPhe: 0.321 ± 0.139
0.642TrpGly: 0.642 ± 0.191
0.514TrpHis: 0.514 ± 0.194
0.642TrpIle: 0.642 ± 0.214
0.963TrpLys: 0.963 ± 0.31
1.605TrpLeu: 1.605 ± 0.286
0.257TrpMet: 0.257 ± 0.121
0.706TrpAsn: 0.706 ± 0.178
1.156TrpPro: 1.156 ± 0.346
0.578TrpGln: 0.578 ± 0.207
1.605TrpArg: 1.605 ± 0.325
0.963TrpSer: 0.963 ± 0.264
1.027TrpThr: 1.027 ± 0.259
1.156TrpVal: 1.156 ± 0.337
0.193TrpTrp: 0.193 ± 0.102
0.257TrpTyr: 0.257 ± 0.119
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.247TyrAla: 2.247 ± 0.335
0.578TyrCys: 0.578 ± 0.212
1.733TyrAsp: 1.733 ± 0.348
1.541TyrGlu: 1.541 ± 0.272
0.899TyrPhe: 0.899 ± 0.247
2.76TyrGly: 2.76 ± 0.53
0.578TyrHis: 0.578 ± 0.249
1.091TyrIle: 1.091 ± 0.276
0.578TyrLys: 0.578 ± 0.175
2.696TyrLeu: 2.696 ± 0.426
0.578TyrMet: 0.578 ± 0.167
0.706TyrAsn: 0.706 ± 0.237
1.284TyrPro: 1.284 ± 0.377
1.412TyrGln: 1.412 ± 0.253
3.467TyrArg: 3.467 ± 0.428
1.99TyrSer: 1.99 ± 0.457
1.733TyrThr: 1.733 ± 0.313
1.22TyrVal: 1.22 ± 0.29
0.449TyrTrp: 0.449 ± 0.187
0.642TyrTyr: 0.642 ± 0.173
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (15578 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski