Amino acid dipepetide frequency for Streptococcus phage P7574

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.582AlaAla: 3.582 ± 0.999
0.184AlaCys: 0.184 ± 0.122
3.95AlaAsp: 3.95 ± 0.576
4.593AlaGlu: 4.593 ± 0.55
2.204AlaPhe: 2.204 ± 0.573
3.95AlaGly: 3.95 ± 0.753
0.919AlaHis: 0.919 ± 0.263
4.684AlaIle: 4.684 ± 0.676
5.511AlaLys: 5.511 ± 1.015
6.062AlaLeu: 6.062 ± 0.686
1.653AlaMet: 1.653 ± 0.301
4.501AlaAsn: 4.501 ± 0.914
2.113AlaPro: 2.113 ± 0.52
2.113AlaGln: 2.113 ± 0.496
2.847AlaArg: 2.847 ± 0.398
5.236AlaSer: 5.236 ± 0.633
3.674AlaThr: 3.674 ± 0.868
3.49AlaVal: 3.49 ± 0.646
1.01AlaTrp: 1.01 ± 0.27
2.48AlaTyr: 2.48 ± 0.364
0.0AlaXaa: 0.0 ± 0.0
Cys
0.092CysAla: 0.092 ± 0.075
0.0CysCys: 0.0 ± 0.0
0.827CysAsp: 0.827 ± 0.286
0.551CysGlu: 0.551 ± 0.248
0.184CysPhe: 0.184 ± 0.201
0.367CysGly: 0.367 ± 0.252
0.184CysHis: 0.184 ± 0.15
0.092CysIle: 0.092 ± 0.095
0.459CysLys: 0.459 ± 0.223
0.643CysLeu: 0.643 ± 0.292
0.092CysMet: 0.092 ± 0.101
0.276CysAsn: 0.276 ± 0.139
0.184CysPro: 0.184 ± 0.152
0.092CysGln: 0.092 ± 0.116
0.551CysArg: 0.551 ± 0.324
0.551CysSer: 0.551 ± 0.266
0.367CysThr: 0.367 ± 0.195
0.367CysVal: 0.367 ± 0.155
0.184CysTrp: 0.184 ± 0.133
0.092CysTyr: 0.092 ± 0.105
0.0CysXaa: 0.0 ± 0.0
Asp
3.582AspAla: 3.582 ± 0.49
0.367AspCys: 0.367 ± 0.221
4.96AspAsp: 4.96 ± 0.843
3.766AspGlu: 3.766 ± 0.62
4.133AspPhe: 4.133 ± 0.506
6.705AspGly: 6.705 ± 1.272
0.827AspHis: 0.827 ± 0.33
4.042AspIle: 4.042 ± 0.58
5.144AspLys: 5.144 ± 0.886
4.776AspLeu: 4.776 ± 0.804
1.837AspMet: 1.837 ± 0.389
4.317AspAsn: 4.317 ± 0.683
1.837AspPro: 1.837 ± 0.412
1.286AspGln: 1.286 ± 0.332
2.939AspArg: 2.939 ± 0.491
4.042AspSer: 4.042 ± 0.623
3.674AspThr: 3.674 ± 0.562
3.307AspVal: 3.307 ± 0.543
0.643AspTrp: 0.643 ± 0.203
1.653AspTyr: 1.653 ± 0.375
0.0AspXaa: 0.0 ± 0.0
Glu
4.409GluAla: 4.409 ± 0.527
0.276GluCys: 0.276 ± 0.12
3.399GluAsp: 3.399 ± 0.569
3.766GluGlu: 3.766 ± 0.845
1.929GluPhe: 1.929 ± 0.542
3.031GluGly: 3.031 ± 0.481
1.47GluHis: 1.47 ± 0.335
5.144GluIle: 5.144 ± 0.624
4.409GluLys: 4.409 ± 0.903
5.511GluLeu: 5.511 ± 0.716
2.204GluMet: 2.204 ± 0.447
3.95GluAsn: 3.95 ± 0.663
1.561GluPro: 1.561 ± 0.378
3.399GluGln: 3.399 ± 0.594
3.307GluArg: 3.307 ± 0.661
3.399GluSer: 3.399 ± 0.486
3.766GluThr: 3.766 ± 0.666
4.868GluVal: 4.868 ± 0.934
1.745GluTrp: 1.745 ± 0.368
3.399GluTyr: 3.399 ± 0.461
0.0GluXaa: 0.0 ± 0.0
Phe
3.582PheAla: 3.582 ± 0.686
0.184PheCys: 0.184 ± 0.11
3.123PheAsp: 3.123 ± 0.551
2.113PheGlu: 2.113 ± 0.461
1.837PhePhe: 1.837 ± 0.409
3.399PheGly: 3.399 ± 0.549
0.276PheHis: 0.276 ± 0.152
2.664PheIle: 2.664 ± 0.44
4.225PheLys: 4.225 ± 0.595
3.49PheLeu: 3.49 ± 0.615
0.919PheMet: 0.919 ± 0.318
2.939PheAsn: 2.939 ± 0.646
0.735PhePro: 0.735 ± 0.214
1.653PheGln: 1.653 ± 0.356
1.745PheArg: 1.745 ± 0.29
3.307PheSer: 3.307 ± 0.349
1.929PheThr: 1.929 ± 0.486
2.572PheVal: 2.572 ± 0.612
0.827PheTrp: 0.827 ± 0.292
1.837PheTyr: 1.837 ± 0.366
0.0PheXaa: 0.0 ± 0.0
Gly
3.123GlyAla: 3.123 ± 0.559
0.459GlyCys: 0.459 ± 0.195
3.766GlyAsp: 3.766 ± 0.662
3.399GlyGlu: 3.399 ± 0.59
3.307GlyPhe: 3.307 ± 0.548
4.225GlyGly: 4.225 ± 0.842
1.286GlyHis: 1.286 ± 0.38
5.236GlyIle: 5.236 ± 0.894
5.879GlyLys: 5.879 ± 0.908
6.522GlyLeu: 6.522 ± 0.599
1.653GlyMet: 1.653 ± 0.342
3.123GlyAsn: 3.123 ± 0.404
1.102GlyPro: 1.102 ± 0.291
3.215GlyGln: 3.215 ± 0.567
2.756GlyArg: 2.756 ± 0.526
3.766GlySer: 3.766 ± 0.723
4.684GlyThr: 4.684 ± 0.736
3.399GlyVal: 3.399 ± 0.484
1.286GlyTrp: 1.286 ± 0.44
3.674GlyTyr: 3.674 ± 0.601
0.0GlyXaa: 0.0 ± 0.0
His
0.551HisAla: 0.551 ± 0.197
0.092HisCys: 0.092 ± 0.086
1.286HisAsp: 1.286 ± 0.316
0.735HisGlu: 0.735 ± 0.272
0.367HisPhe: 0.367 ± 0.183
0.643HisGly: 0.643 ± 0.248
0.276HisHis: 0.276 ± 0.164
1.01HisIle: 1.01 ± 0.32
1.194HisLys: 1.194 ± 0.297
1.102HisLeu: 1.102 ± 0.322
0.276HisMet: 0.276 ± 0.161
0.827HisAsn: 0.827 ± 0.274
0.459HisPro: 0.459 ± 0.175
0.551HisGln: 0.551 ± 0.229
0.919HisArg: 0.919 ± 0.301
0.827HisSer: 0.827 ± 0.248
0.827HisThr: 0.827 ± 0.209
1.47HisVal: 1.47 ± 0.24
0.092HisTrp: 0.092 ± 0.075
1.01HisTyr: 1.01 ± 0.365
0.0HisXaa: 0.0 ± 0.0
Ile
4.409IleAla: 4.409 ± 0.767
0.459IleCys: 0.459 ± 0.245
4.96IleAsp: 4.96 ± 0.602
4.776IleGlu: 4.776 ± 0.706
1.194IlePhe: 1.194 ± 0.363
4.593IleGly: 4.593 ± 0.625
0.919IleHis: 0.919 ± 0.252
3.399IleIle: 3.399 ± 0.464
6.522IleLys: 6.522 ± 0.688
4.501IleLeu: 4.501 ± 0.659
1.653IleMet: 1.653 ± 0.395
4.501IleAsn: 4.501 ± 0.619
2.572IlePro: 2.572 ± 0.545
2.756IleGln: 2.756 ± 0.546
2.847IleArg: 2.847 ± 0.469
4.868IleSer: 4.868 ± 0.498
3.399IleThr: 3.399 ± 0.442
3.582IleVal: 3.582 ± 0.626
1.01IleTrp: 1.01 ± 0.256
1.653IleTyr: 1.653 ± 0.416
0.0IleXaa: 0.0 ± 0.0
Lys
5.236LysAla: 5.236 ± 0.592
0.367LysCys: 0.367 ± 0.203
4.96LysAsp: 4.96 ± 0.785
7.256LysGlu: 7.256 ± 0.824
3.49LysPhe: 3.49 ± 0.712
5.236LysGly: 5.236 ± 0.709
1.378LysHis: 1.378 ± 0.373
5.879LysIle: 5.879 ± 0.722
6.705LysLys: 6.705 ± 1.03
7.44LysLeu: 7.44 ± 1.042
1.929LysMet: 1.929 ± 0.414
4.776LysAsn: 4.776 ± 0.647
2.847LysPro: 2.847 ± 0.439
3.582LysGln: 3.582 ± 0.446
3.399LysArg: 3.399 ± 0.496
5.144LysSer: 5.144 ± 0.535
5.879LysThr: 5.879 ± 0.794
4.776LysVal: 4.776 ± 0.695
0.551LysTrp: 0.551 ± 0.239
3.307LysTyr: 3.307 ± 0.637
0.0LysXaa: 0.0 ± 0.0
Leu
6.797LeuAla: 6.797 ± 0.625
0.827LeuCys: 0.827 ± 0.31
4.868LeuAsp: 4.868 ± 0.723
5.879LeuGlu: 5.879 ± 0.905
3.582LeuPhe: 3.582 ± 0.517
5.97LeuGly: 5.97 ± 0.683
1.194LeuHis: 1.194 ± 0.339
3.858LeuIle: 3.858 ± 0.55
7.073LeuLys: 7.073 ± 0.771
5.879LeuLeu: 5.879 ± 0.913
2.756LeuMet: 2.756 ± 0.445
5.511LeuAsn: 5.511 ± 0.757
2.388LeuPro: 2.388 ± 0.405
2.572LeuGln: 2.572 ± 0.385
3.49LeuArg: 3.49 ± 0.695
6.338LeuSer: 6.338 ± 0.972
5.236LeuThr: 5.236 ± 0.759
4.225LeuVal: 4.225 ± 0.644
0.827LeuTrp: 0.827 ± 0.223
2.48LeuTyr: 2.48 ± 0.542
0.0LeuXaa: 0.0 ± 0.0
Met
2.021MetAla: 2.021 ± 0.41
0.0MetCys: 0.0 ± 0.0
1.102MetAsp: 1.102 ± 0.29
1.286MetGlu: 1.286 ± 0.413
1.286MetPhe: 1.286 ± 0.236
0.827MetGly: 0.827 ± 0.225
0.184MetHis: 0.184 ± 0.127
2.021MetIle: 2.021 ± 0.349
3.123MetLys: 3.123 ± 0.516
2.388MetLeu: 2.388 ± 0.362
0.551MetMet: 0.551 ± 0.258
1.102MetAsn: 1.102 ± 0.25
0.919MetPro: 0.919 ± 0.261
0.919MetGln: 0.919 ± 0.21
0.827MetArg: 0.827 ± 0.205
2.021MetSer: 2.021 ± 0.457
2.113MetThr: 2.113 ± 0.454
1.561MetVal: 1.561 ± 0.37
0.184MetTrp: 0.184 ± 0.123
0.827MetTyr: 0.827 ± 0.203
0.0MetXaa: 0.0 ± 0.0
Asn
5.144AsnAla: 5.144 ± 1.15
0.459AsnCys: 0.459 ± 0.204
3.123AsnAsp: 3.123 ± 0.519
3.95AsnGlu: 3.95 ± 0.822
2.572AsnPhe: 2.572 ± 0.51
5.787AsnGly: 5.787 ± 1.157
1.194AsnHis: 1.194 ± 0.271
4.042AsnIle: 4.042 ± 0.467
4.317AsnLys: 4.317 ± 0.551
4.593AsnLeu: 4.593 ± 0.529
1.01AsnMet: 1.01 ± 0.313
3.858AsnAsn: 3.858 ± 0.58
2.572AsnPro: 2.572 ± 0.675
2.939AsnGln: 2.939 ± 0.49
1.745AsnArg: 1.745 ± 0.403
3.674AsnSer: 3.674 ± 0.65
3.858AsnThr: 3.858 ± 0.646
3.399AsnVal: 3.399 ± 0.541
1.01AsnTrp: 1.01 ± 0.277
2.48AsnTyr: 2.48 ± 0.462
0.0AsnXaa: 0.0 ± 0.0
Pro
1.745ProAla: 1.745 ± 0.315
0.184ProCys: 0.184 ± 0.169
1.378ProAsp: 1.378 ± 0.352
1.745ProGlu: 1.745 ± 0.368
1.47ProPhe: 1.47 ± 0.33
1.01ProGly: 1.01 ± 0.259
0.459ProHis: 0.459 ± 0.181
1.745ProIle: 1.745 ± 0.338
3.123ProLys: 3.123 ± 0.498
2.388ProLeu: 2.388 ± 0.393
0.276ProMet: 0.276 ± 0.163
2.204ProAsn: 2.204 ± 0.443
0.367ProPro: 0.367 ± 0.238
1.47ProGln: 1.47 ± 0.334
1.286ProArg: 1.286 ± 0.368
2.204ProSer: 2.204 ± 0.433
2.572ProThr: 2.572 ± 0.438
1.194ProVal: 1.194 ± 0.292
0.459ProTrp: 0.459 ± 0.162
1.102ProTyr: 1.102 ± 0.27
0.0ProXaa: 0.0 ± 0.0
Gln
2.572GlnAla: 2.572 ± 0.632
0.092GlnCys: 0.092 ± 0.087
1.745GlnAsp: 1.745 ± 0.31
3.49GlnGlu: 3.49 ± 0.543
1.745GlnPhe: 1.745 ± 0.445
2.296GlnGly: 2.296 ± 0.677
0.367GlnHis: 0.367 ± 0.154
2.021GlnIle: 2.021 ± 0.356
3.399GlnLys: 3.399 ± 0.536
3.582GlnLeu: 3.582 ± 0.528
1.745GlnMet: 1.745 ± 0.366
2.847GlnAsn: 2.847 ± 0.348
0.735GlnPro: 0.735 ± 0.243
3.123GlnGln: 3.123 ± 0.631
1.837GlnArg: 1.837 ± 0.384
2.48GlnSer: 2.48 ± 0.385
3.215GlnThr: 3.215 ± 0.51
2.204GlnVal: 2.204 ± 0.469
0.367GlnTrp: 0.367 ± 0.176
2.572GlnTyr: 2.572 ± 0.459
0.0GlnXaa: 0.0 ± 0.0
Arg
2.572ArgAla: 2.572 ± 0.524
0.092ArgCys: 0.092 ± 0.091
3.49ArgAsp: 3.49 ± 0.642
2.939ArgGlu: 2.939 ± 0.579
1.929ArgPhe: 1.929 ± 0.387
1.837ArgGly: 1.837 ± 0.31
0.459ArgHis: 0.459 ± 0.174
3.399ArgIle: 3.399 ± 0.675
2.939ArgLys: 2.939 ± 0.558
3.307ArgLeu: 3.307 ± 0.604
1.01ArgMet: 1.01 ± 0.344
2.48ArgAsn: 2.48 ± 0.495
1.194ArgPro: 1.194 ± 0.306
2.847ArgGln: 2.847 ± 0.405
1.653ArgArg: 1.653 ± 0.416
1.47ArgSer: 1.47 ± 0.302
2.572ArgThr: 2.572 ± 0.596
2.572ArgVal: 2.572 ± 0.451
1.194ArgTrp: 1.194 ± 0.3
2.48ArgTyr: 2.48 ± 0.568
0.0ArgXaa: 0.0 ± 0.0
Ser
3.674SerAla: 3.674 ± 0.484
0.735SerCys: 0.735 ± 0.301
4.501SerAsp: 4.501 ± 0.524
3.582SerGlu: 3.582 ± 0.428
3.399SerPhe: 3.399 ± 0.554
4.684SerGly: 4.684 ± 0.573
0.459SerHis: 0.459 ± 0.164
4.501SerIle: 4.501 ± 0.612
6.43SerLys: 6.43 ± 1.007
4.042SerLeu: 4.042 ± 0.537
1.929SerMet: 1.929 ± 0.328
4.409SerAsn: 4.409 ± 0.793
2.204SerPro: 2.204 ± 0.452
3.031SerGln: 3.031 ± 0.54
3.031SerArg: 3.031 ± 0.673
5.419SerSer: 5.419 ± 1.247
4.409SerThr: 4.409 ± 0.491
5.511SerVal: 5.511 ± 0.676
0.643SerTrp: 0.643 ± 0.235
2.296SerTyr: 2.296 ± 0.534
0.0SerXaa: 0.0 ± 0.0
Thr
4.776ThrAla: 4.776 ± 0.532
0.276ThrCys: 0.276 ± 0.166
4.042ThrAsp: 4.042 ± 0.549
4.317ThrGlu: 4.317 ± 0.51
3.215ThrPhe: 3.215 ± 0.572
4.317ThrGly: 4.317 ± 0.543
0.551ThrHis: 0.551 ± 0.229
4.409ThrIle: 4.409 ± 0.96
4.409ThrLys: 4.409 ± 0.618
6.062ThrLeu: 6.062 ± 0.802
0.827ThrMet: 0.827 ± 0.246
3.858ThrAsn: 3.858 ± 0.52
1.286ThrPro: 1.286 ± 0.398
2.296ThrGln: 2.296 ± 0.44
2.113ThrArg: 2.113 ± 0.414
3.49ThrSer: 3.49 ± 0.616
3.031ThrThr: 3.031 ± 0.638
3.766ThrVal: 3.766 ± 0.588
1.102ThrTrp: 1.102 ± 0.306
3.123ThrTyr: 3.123 ± 0.68
0.0ThrXaa: 0.0 ± 0.0
Val
3.674ValAla: 3.674 ± 0.615
0.367ValCys: 0.367 ± 0.184
4.501ValAsp: 4.501 ± 0.671
3.858ValGlu: 3.858 ± 0.582
2.572ValPhe: 2.572 ± 0.473
4.225ValGly: 4.225 ± 0.564
0.459ValHis: 0.459 ± 0.177
3.123ValIle: 3.123 ± 0.491
5.327ValLys: 5.327 ± 0.723
4.501ValLeu: 4.501 ± 0.666
1.378ValMet: 1.378 ± 0.268
3.215ValAsn: 3.215 ± 0.61
2.021ValPro: 2.021 ± 0.407
1.929ValGln: 1.929 ± 0.384
2.572ValArg: 2.572 ± 0.502
5.327ValSer: 5.327 ± 0.836
4.042ValThr: 4.042 ± 0.754
4.96ValVal: 4.96 ± 0.696
0.919ValTrp: 0.919 ± 0.259
1.837ValTyr: 1.837 ± 0.434
0.0ValXaa: 0.0 ± 0.0
Trp
0.643TrpAla: 0.643 ± 0.227
0.0TrpCys: 0.0 ± 0.0
1.102TrpAsp: 1.102 ± 0.487
0.643TrpGlu: 0.643 ± 0.194
0.551TrpPhe: 0.551 ± 0.221
0.551TrpGly: 0.551 ± 0.225
0.276TrpHis: 0.276 ± 0.142
0.735TrpIle: 0.735 ± 0.259
0.551TrpLys: 0.551 ± 0.189
1.653TrpLeu: 1.653 ± 0.273
0.092TrpMet: 0.092 ± 0.077
1.47TrpAsn: 1.47 ± 0.374
0.092TrpPro: 0.092 ± 0.09
0.643TrpGln: 0.643 ± 0.24
0.735TrpArg: 0.735 ± 0.247
2.113TrpSer: 2.113 ± 0.613
0.735TrpThr: 0.735 ± 0.23
1.102TrpVal: 1.102 ± 0.241
0.276TrpTrp: 0.276 ± 0.179
0.551TrpTyr: 0.551 ± 0.249
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.572TyrAla: 2.572 ± 0.387
0.735TyrCys: 0.735 ± 0.299
2.756TyrAsp: 2.756 ± 0.483
2.388TyrGlu: 2.388 ± 0.401
2.388TyrPhe: 2.388 ± 0.439
2.204TyrGly: 2.204 ± 0.466
1.378TyrHis: 1.378 ± 0.373
2.572TyrIle: 2.572 ± 0.535
3.399TyrLys: 3.399 ± 0.638
3.307TyrLeu: 3.307 ± 0.531
1.378TyrMet: 1.378 ± 0.38
1.561TyrAsn: 1.561 ± 0.39
1.194TyrPro: 1.194 ± 0.374
2.021TyrGln: 2.021 ± 0.307
1.837TyrArg: 1.837 ± 0.342
3.215TyrSer: 3.215 ± 0.685
1.286TyrThr: 1.286 ± 0.259
2.48TyrVal: 2.48 ± 0.497
0.184TyrTrp: 0.184 ± 0.105
1.561TyrTyr: 1.561 ± 0.637
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (10888 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski