Amino acid dipeptide frequency for Streptococcus phage P9903

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
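A minimal sketch of how such a dipeptide frequency could be computed is given here. It is an illustration, not the database's own code. It assumes one-letter amino acid codes, overlapping adjacent pairs that never span two proteins, and normalization by the total number of pairs counted. The function name dipeptide_per_mille and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

ALPHABET = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the standard alphabet is mapped to X (Xaa).
        cleaned = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    # Report all 21 x 21 = 441 combinations, including those never observed.
    keys = ("".join(p) for p in product(ALPHABET + "X", repeat=2))
    return {k: (1000.0 * counts[k] / total if total else 0.0) for k in keys}


toy = ["MKKLA", "AKKZ"]  # hypothetical sequences, Z stands in for an unknown residue
freqs = dipeptide_per_mille(toy)
print(len(freqs), round(freqs["KK"], 3))
```

Reporting all 441 keys, even those with a zero count, mirrors the table below, where the Xaa row and column are present although no such dipeptide occurs.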

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
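The uniform expectation can be checked with a few lines of Python. This is only a sketch: the classify helper is hypothetical, and treating 2.5‰ as the exact cut-off between the red and blue marking is an assumption based on the description above. The three example values are taken from the table below.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"      # assumed to correspond to the red marking
    if value_per_mille < expected:
        return "under"     # assumed to correspond to the blue marking
    return "expected"


# Values in per mille, copied from the table.
examples = {"LysLys": 7.572, "AlaAla": 2.737, "CysCys": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```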

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.737 ± 0.822
AlaCys: 0.182 ± 0.134
AlaAsp: 4.47 ± 0.772
AlaGlu: 4.379 ± 0.755
AlaPhe: 2.737 ± 0.741
AlaGly: 3.467 ± 0.587
AlaHis: 0.821 ± 0.306
AlaIle: 4.288 ± 0.87
AlaLys: 6.386 ± 1.038
AlaLeu: 6.751 ± 0.57
AlaMet: 0.912 ± 0.259
AlaAsn: 4.653 ± 0.832
AlaPro: 1.46 ± 0.321
AlaGln: 3.102 ± 0.44
AlaArg: 2.463 ± 0.475
AlaSer: 4.744 ± 0.701
AlaThr: 4.105 ± 0.714
AlaVal: 4.653 ± 0.599
AlaTrp: 0.821 ± 0.181
AlaTyr: 2.098 ± 0.498
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.182 ± 0.11
CysCys: 0.0 ± 0.0
CysAsp: 0.73 ± 0.285
CysGlu: 0.547 ± 0.259
CysPhe: 0.456 ± 0.231
CysGly: 0.091 ± 0.095
CysHis: 0.091 ± 0.082
CysIle: 0.182 ± 0.137
CysLys: 0.182 ± 0.173
CysLeu: 0.639 ± 0.27
CysMet: 0.091 ± 0.088
CysAsn: 0.274 ± 0.16
CysPro: 0.091 ± 0.088
CysGln: 0.091 ± 0.111
CysArg: 0.182 ± 0.222
CysSer: 0.456 ± 0.269
CysThr: 0.365 ± 0.182
CysVal: 0.365 ± 0.142
CysTrp: 0.274 ± 0.138
CysTyr: 0.274 ± 0.138
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.832 ± 0.607
AspCys: 0.547 ± 0.24
AspAsp: 4.288 ± 0.679
AspGlu: 4.379 ± 0.67
AspPhe: 3.923 ± 0.517
AspGly: 5.291 ± 0.77
AspHis: 0.912 ± 0.34
AspIle: 5.474 ± 0.701
AspLys: 4.927 ± 0.811
AspLeu: 4.379 ± 0.768
AspMet: 2.463 ± 0.419
AspAsn: 5.018 ± 1.141
AspPro: 2.098 ± 0.487
AspGln: 1.004 ± 0.177
AspArg: 3.102 ± 0.529
AspSer: 3.102 ± 0.546
AspThr: 4.014 ± 0.646
AspVal: 3.011 ± 0.539
AspTrp: 0.639 ± 0.243
AspTyr: 2.372 ± 0.382
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.653 ± 0.546
GluCys: 0.274 ± 0.134
GluAsp: 4.197 ± 0.689
GluGlu: 5.109 ± 1.081
GluPhe: 2.098 ± 0.408
GluGly: 2.19 ± 0.437
GluHis: 1.277 ± 0.361
GluIle: 6.021 ± 0.714
GluLys: 4.562 ± 0.991
GluLeu: 7.664 ± 1.165
GluMet: 2.463 ± 0.528
GluAsn: 4.288 ± 0.764
GluPro: 1.825 ± 0.406
GluGln: 3.741 ± 0.68
GluArg: 3.284 ± 0.68
GluSer: 3.102 ± 0.433
GluThr: 3.193 ± 0.536
GluVal: 5.2 ± 0.839
GluTrp: 1.277 ± 0.295
GluTyr: 3.649 ± 0.717
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.828 ± 0.45
PheCys: 0.091 ± 0.102
PheAsp: 3.923 ± 0.504
PheGlu: 3.011 ± 0.701
PhePhe: 1.551 ± 0.33
PheGly: 3.376 ± 0.813
PheHis: 0.365 ± 0.148
PheIle: 3.284 ± 0.617
PheLys: 4.379 ± 0.57
PheLeu: 3.102 ± 0.407
PheMet: 0.547 ± 0.251
PheAsn: 2.919 ± 0.49
PhePro: 0.73 ± 0.297
PheGln: 1.004 ± 0.25
PheArg: 1.916 ± 0.383
PheSer: 2.646 ± 0.459
PheThr: 2.737 ± 0.468
PheVal: 2.646 ± 0.355
PheTrp: 0.547 ± 0.231
PheTyr: 2.19 ± 0.503
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.919 ± 0.605
GlyCys: 0.547 ± 0.211
GlyAsp: 4.105 ± 0.601
GlyGlu: 3.923 ± 0.626
GlyPhe: 3.193 ± 0.458
GlyGly: 4.288 ± 0.697
GlyHis: 0.639 ± 0.241
GlyIle: 4.927 ± 0.769
GlyLys: 5.383 ± 0.834
GlyLeu: 5.839 ± 0.833
GlyMet: 1.46 ± 0.326
GlyAsn: 3.832 ± 0.683
GlyPro: 1.277 ± 0.332
GlyGln: 3.011 ± 0.498
GlyArg: 2.828 ± 0.481
GlySer: 4.197 ± 0.526
GlyThr: 4.197 ± 0.596
GlyVal: 3.741 ± 0.684
GlyTrp: 1.004 ± 0.353
GlyTyr: 3.011 ± 0.435
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.456 ± 0.231
HisCys: 0.0 ± 0.0
HisAsp: 0.73 ± 0.256
HisGlu: 0.456 ± 0.18
HisPhe: 0.73 ± 0.286
HisGly: 1.004 ± 0.29
HisHis: 0.274 ± 0.172
HisIle: 0.821 ± 0.291
HisLys: 0.912 ± 0.265
HisLeu: 1.46 ± 0.318
HisMet: 0.547 ± 0.248
HisAsn: 1.186 ± 0.338
HisPro: 0.547 ± 0.177
HisGln: 0.821 ± 0.371
HisArg: 0.639 ± 0.219
HisSer: 0.73 ± 0.225
HisThr: 0.73 ± 0.211
HisVal: 1.368 ± 0.261
HisTrp: 0.0 ± 0.0
HisTyr: 0.821 ± 0.319
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.47 ± 1.053
IleCys: 0.456 ± 0.185
IleAsp: 4.835 ± 0.586
IleGlu: 4.835 ± 0.788
IlePhe: 1.733 ± 0.355
IleGly: 4.562 ± 0.569
IleHis: 0.821 ± 0.254
IleIle: 2.919 ± 0.501
IleLys: 7.207 ± 0.69
IleLeu: 4.562 ± 0.877
IleMet: 1.825 ± 0.42
IleAsn: 4.379 ± 0.647
IlePro: 3.011 ± 0.522
IleGln: 3.102 ± 0.542
IleArg: 2.372 ± 0.437
IleSer: 3.923 ± 0.449
IleThr: 3.649 ± 0.606
IleVal: 3.467 ± 0.598
IleTrp: 0.73 ± 0.232
IleTyr: 2.007 ± 0.386
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.478 ± 0.72
LysCys: 0.365 ± 0.186
LysAsp: 5.018 ± 0.679
LysGlu: 7.025 ± 0.876
LysPhe: 3.649 ± 0.873
LysGly: 5.565 ± 0.615
LysHis: 1.46 ± 0.389
LysIle: 5.656 ± 0.656
LysLys: 7.572 ± 1.333
LysLeu: 6.934 ± 0.84
LysMet: 2.372 ± 0.481
LysAsn: 4.653 ± 0.593
LysPro: 3.193 ± 0.589
LysGln: 4.562 ± 0.533
LysArg: 3.741 ± 0.572
LysSer: 4.014 ± 0.487
LysThr: 4.835 ± 0.708
LysVal: 4.379 ± 0.66
LysTrp: 1.186 ± 0.292
LysTyr: 3.376 ± 0.532
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.207 ± 0.641
LeuCys: 0.639 ± 0.302
LeuAsp: 5.2 ± 0.742
LeuGlu: 7.846 ± 1.044
LeuPhe: 3.284 ± 0.398
LeuGly: 5.565 ± 0.899
LeuHis: 0.821 ± 0.355
LeuIle: 3.467 ± 0.543
LeuLys: 7.39 ± 0.696
LeuLeu: 5.565 ± 0.638
LeuMet: 2.281 ± 0.509
LeuAsn: 5.109 ± 0.554
LeuPro: 2.372 ± 0.342
LeuGln: 2.281 ± 0.424
LeuArg: 3.741 ± 0.744
LeuSer: 5.383 ± 0.634
LeuThr: 5.474 ± 0.818
LeuVal: 4.47 ± 0.698
LeuTrp: 0.547 ± 0.202
LeuTyr: 2.646 ± 0.52
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.46 ± 0.339
MetCys: 0.0 ± 0.0
MetAsp: 0.912 ± 0.238
MetGlu: 1.551 ± 0.445
MetPhe: 1.46 ± 0.464
MetGly: 1.004 ± 0.337
MetHis: 0.365 ± 0.207
MetIle: 1.642 ± 0.332
MetLys: 3.011 ± 0.51
MetLeu: 2.281 ± 0.325
MetMet: 0.639 ± 0.237
MetAsn: 1.277 ± 0.341
MetPro: 0.821 ± 0.267
MetGln: 1.004 ± 0.349
MetArg: 0.639 ± 0.183
MetSer: 1.642 ± 0.368
MetThr: 1.551 ± 0.332
MetVal: 2.098 ± 0.407
MetTrp: 0.091 ± 0.074
MetTyr: 1.277 ± 0.374
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.2 ± 0.98
AsnCys: 0.365 ± 0.186
AsnAsp: 3.376 ± 0.482
AsnGlu: 3.832 ± 0.889
AsnPhe: 3.193 ± 0.404
AsnGly: 7.025 ± 1.234
AsnHis: 0.821 ± 0.226
AsnIle: 4.379 ± 0.658
AsnLys: 5.018 ± 0.675
AsnLeu: 5.383 ± 0.627
AsnMet: 0.912 ± 0.258
AsnAsn: 4.288 ± 0.78
AsnPro: 2.737 ± 0.488
AsnGln: 3.193 ± 0.592
AsnArg: 2.463 ± 0.567
AsnSer: 4.379 ± 0.791
AsnThr: 3.011 ± 0.451
AsnVal: 3.467 ± 0.509
AsnTrp: 1.186 ± 0.302
AsnTyr: 2.19 ± 0.417
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.551 ± 0.32
ProCys: 0.274 ± 0.193
ProAsp: 1.186 ± 0.372
ProGlu: 1.825 ± 0.421
ProPhe: 1.277 ± 0.282
ProGly: 1.004 ± 0.301
ProHis: 0.547 ± 0.197
ProIle: 1.825 ± 0.359
ProLys: 3.649 ± 0.516
ProLeu: 2.281 ± 0.428
ProMet: 0.091 ± 0.111
ProAsn: 2.828 ± 0.423
ProPro: 0.456 ± 0.229
ProGln: 1.825 ± 0.274
ProArg: 0.912 ± 0.406
ProSer: 2.372 ± 0.349
ProThr: 1.551 ± 0.371
ProVal: 1.277 ± 0.321
ProTrp: 0.456 ± 0.15
ProTyr: 1.46 ± 0.515
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.923 ± 0.485
GlnCys: 0.182 ± 0.111
GlnAsp: 2.372 ± 0.41
GlnGlu: 2.919 ± 0.642
GlnPhe: 1.733 ± 0.462
GlnGly: 3.011 ± 0.599
GlnHis: 0.639 ± 0.227
GlnIle: 1.916 ± 0.37
GlnLys: 3.467 ± 0.482
GlnLeu: 3.832 ± 0.459
GlnMet: 1.916 ± 0.285
GlnAsn: 2.737 ± 0.477
GlnPro: 0.547 ± 0.226
GlnGln: 3.558 ± 0.746
GlnArg: 1.551 ± 0.329
GlnSer: 2.646 ± 0.4
GlnThr: 2.828 ± 0.431
GlnVal: 1.916 ± 0.369
GlnTrp: 0.821 ± 0.241
GlnTyr: 1.733 ± 0.324
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.555 ± 0.494
ArgCys: 0.091 ± 0.111
ArgAsp: 2.372 ± 0.518
ArgGlu: 2.646 ± 0.638
ArgPhe: 2.007 ± 0.439
ArgGly: 2.098 ± 0.507
ArgHis: 0.73 ± 0.254
ArgIle: 2.919 ± 0.524
ArgLys: 3.102 ± 0.635
ArgLeu: 3.558 ± 0.517
ArgMet: 1.277 ± 0.329
ArgAsn: 2.372 ± 0.321
ArgPro: 0.912 ± 0.242
ArgGln: 2.555 ± 0.485
ArgArg: 1.186 ± 0.263
ArgSer: 1.642 ± 0.375
ArgThr: 2.463 ± 0.596
ArgVal: 2.919 ± 0.488
ArgTrp: 0.912 ± 0.231
ArgTyr: 2.372 ± 0.489
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.649 ± 0.514
SerCys: 0.365 ± 0.205
SerAsp: 4.379 ± 0.392
SerGlu: 3.649 ± 0.475
SerPhe: 2.463 ± 0.532
SerGly: 4.105 ± 0.533
SerHis: 0.73 ± 0.291
SerIle: 4.197 ± 0.626
SerLys: 5.383 ± 0.831
SerLeu: 4.014 ± 0.706
SerMet: 1.642 ± 0.339
SerAsn: 4.47 ± 0.637
SerPro: 2.007 ± 0.293
SerGln: 2.372 ± 0.4
SerArg: 3.011 ± 0.587
SerSer: 3.284 ± 0.524
SerThr: 3.832 ± 0.59
SerVal: 4.653 ± 0.552
SerTrp: 0.456 ± 0.211
SerTyr: 2.281 ± 0.411
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.379 ± 0.674
ThrCys: 0.274 ± 0.162
ThrAsp: 3.649 ± 0.65
ThrGlu: 3.832 ± 0.426
ThrPhe: 2.737 ± 0.469
ThrGly: 3.558 ± 0.503
ThrHis: 0.821 ± 0.272
ThrIle: 4.197 ± 0.557
ThrLys: 4.835 ± 0.718
ThrLeu: 6.113 ± 0.668
ThrMet: 0.73 ± 0.209
ThrAsn: 4.653 ± 0.674
ThrPro: 1.825 ± 0.451
ThrGln: 2.19 ± 0.405
ThrArg: 2.098 ± 0.446
ThrSer: 3.832 ± 0.46
ThrThr: 3.011 ± 0.599
ThrVal: 3.102 ± 0.521
ThrTrp: 0.73 ± 0.266
ThrTyr: 2.919 ± 0.5
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.741 ± 0.618
ValCys: 0.365 ± 0.156
ValAsp: 4.927 ± 0.589
ValGlu: 4.927 ± 0.757
ValPhe: 2.372 ± 0.463
ValGly: 4.014 ± 0.538
ValHis: 1.004 ± 0.305
ValIle: 3.284 ± 0.409
ValLys: 5.018 ± 0.624
ValLeu: 2.372 ± 0.411
ValMet: 1.186 ± 0.267
ValAsn: 4.105 ± 0.623
ValPro: 1.551 ± 0.36
ValGln: 2.19 ± 0.606
ValArg: 2.007 ± 0.486
ValSer: 4.562 ± 0.655
ValThr: 5.109 ± 0.815
ValVal: 3.649 ± 0.584
ValTrp: 1.277 ± 0.356
ValTyr: 2.19 ± 0.433
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.365 ± 0.184
TrpCys: 0.0 ± 0.0
TrpAsp: 1.004 ± 0.406
TrpGlu: 0.639 ± 0.23
TrpPhe: 0.639 ± 0.217
TrpGly: 1.004 ± 0.383
TrpHis: 0.365 ± 0.162
TrpIle: 0.73 ± 0.227
TrpLys: 0.547 ± 0.293
TrpLeu: 1.277 ± 0.376
TrpMet: 0.091 ± 0.093
TrpAsn: 1.004 ± 0.333
TrpPro: 0.091 ± 0.095
TrpGln: 0.639 ± 0.183
TrpArg: 0.821 ± 0.234
TrpSer: 1.551 ± 0.454
TrpThr: 0.821 ± 0.211
TrpVal: 1.277 ± 0.246
TrpTrp: 0.365 ± 0.171
TrpTyr: 0.274 ± 0.173
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.646 ± 0.445
TyrCys: 0.547 ± 0.281
TyrAsp: 3.102 ± 0.487
TyrGlu: 2.828 ± 0.543
TyrPhe: 2.555 ± 0.455
TyrGly: 2.098 ± 0.422
TyrHis: 0.73 ± 0.235
TyrIle: 2.737 ± 0.53
TyrLys: 3.102 ± 0.513
TyrLeu: 3.284 ± 0.53
TyrMet: 1.186 ± 0.421
TyrAsn: 2.372 ± 0.452
TyrPro: 1.095 ± 0.288
TyrGln: 2.007 ± 0.338
TyrArg: 1.733 ± 0.324
TyrSer: 2.737 ± 0.531
TyrThr: 2.007 ± 0.427
TyrVal: 2.19 ± 0.376
TyrTrp: 0.182 ± 0.102
TyrTyr: 2.19 ± 0.516
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (10962 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
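Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the statistic is recomputed for each resampled set, and the spread of those estimates is reported as the error. This is an illustrative sketch under stated assumptions, not the database's own procedure. The use of the sample standard deviation, the fixed seed, one-letter codes, and the function names pair_per_mille and bootstrap_error are all assumptions, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (overlapping adjacent pairs) in per mille."""
    hits = total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            hits += (a + b == pair)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MKKLAKK", "MAAAKL", "MKLKKA", "MSTKKV"]  # hypothetical sequences
point = pair_per_mille(toy, "KK")
error = bootstrap_error(toy, "KK")
print(f"KK: {point:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides within a protein stay together in every replicate.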

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski