Amino acid dipeptide frequency for Streptococcus phage IPP10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
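
The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the toy sequences, and the decisions to count overlapping pairs within each protein and to normalise by the total number of pairs are all assumptions.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 400 standard pairs.

    Assumption: overlapping pairs are counted inside each protein only,
    and the denominator is the total number of pairs seen.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every adjacent pair; pairs never span two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


EXPECTED = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely

toy_proteins = ["MKKLLEEK", "MALKKE"]  # made-up sequences, not from the proteome
freqs = dipeptide_per_mille(toy_proteins)

# Dipeptides above the uniform expectation (these would be marked in red).
over = sorted(dp for dp, f in freqs.items() if f > EXPECTED)
# Dipeptides below it (these would be marked in blue).
under = sorted(dp for dp, f in freqs.items() if f < EXPECTED)
```

With sequences this short, every dipeptide that occurs at all ends up above 2.5‰; on a real proteome the split between over- and underrepresented pairs is far more informative.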

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
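
As a small worked example of the unit conversion, the snippet below takes the AlaAla value from the table (2.782‰) and expresses it as a fraction, as a percentage, and relative to the uniform expectation of 1/400. The last step, turning the fraction into an approximate count, assumes that the denominator is the 12942 residues reported below the table; the page does not state the denominator, so treat that line as a guess.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 2.782  # AlaAla frequency in per mille, taken from the table

fraction = per_mille_to_fraction(ala_ala)  # about 0.002782
percent = fraction * 100                   # about 0.2782 %
ratio = fraction / (1 / 400)               # about 1.11 times the uniform expectation

# Assumption: frequencies are normalised by the total residue count (12942).
approx_count = fraction * 12942            # roughly 36 occurrences
```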

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.782 ± 0.72
AlaCys: 0.309 ± 0.165
AlaAsp: 5.409 ± 0.639
AlaGlu: 6.182 ± 0.747
AlaPhe: 2.627 ± 0.629
AlaGly: 4.946 ± 1.139
AlaHis: 0.85 ± 0.23
AlaIle: 4.173 ± 0.678
AlaLys: 6.027 ± 0.766
AlaLeu: 6.182 ± 0.972
AlaMet: 2.318 ± 0.416
AlaAsn: 4.636 ± 0.895
AlaPro: 1.855 ± 0.376
AlaGln: 2.241 ± 0.458
AlaArg: 2.705 ± 0.498
AlaSer: 3.014 ± 0.817
AlaThr: 4.482 ± 0.694
AlaVal: 5.1 ± 0.547
AlaTrp: 0.927 ± 0.258
AlaTyr: 1.7 ± 0.368
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.232 ± 0.123
CysCys: 0.155 ± 0.12
CysAsp: 0.541 ± 0.249
CysGlu: 0.386 ± 0.181
CysPhe: 0.309 ± 0.157
CysGly: 0.0 ± 0.0
CysHis: 0.077 ± 0.088
CysIle: 0.618 ± 0.293
CysLys: 0.541 ± 0.188
CysLeu: 0.309 ± 0.145
CysMet: 0.0 ± 0.0
CysAsn: 0.077 ± 0.083
CysPro: 0.309 ± 0.155
CysGln: 0.232 ± 0.125
CysArg: 0.309 ± 0.148
CysSer: 0.232 ± 0.144
CysThr: 0.077 ± 0.094
CysVal: 0.232 ± 0.167
CysTrp: 0.232 ± 0.105
CysTyr: 0.386 ± 0.141
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.632 ± 0.737
AspCys: 0.541 ± 0.229
AspAsp: 3.245 ± 0.82
AspGlu: 4.018 ± 0.961
AspPhe: 3.091 ± 0.519
AspGly: 4.559 ± 0.625
AspHis: 0.541 ± 0.277
AspIle: 5.409 ± 0.503
AspLys: 5.332 ± 0.661
AspLeu: 5.486 ± 0.855
AspMet: 1.623 ± 0.34
AspAsn: 2.705 ± 0.487
AspPro: 1.468 ± 0.414
AspGln: 1.855 ± 0.334
AspArg: 2.859 ± 0.48
AspSer: 3.4 ± 0.541
AspThr: 3.786 ± 0.493
AspVal: 3.168 ± 0.507
AspTrp: 1.7 ± 0.323
AspTyr: 3.168 ± 0.561
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.105 ± 0.978
GluCys: 0.232 ± 0.129
GluAsp: 4.018 ± 0.497
GluGlu: 6.259 ± 1.013
GluPhe: 3.786 ± 0.588
GluGly: 3.555 ± 0.573
GluHis: 1.159 ± 0.308
GluIle: 5.564 ± 0.447
GluLys: 7.65 ± 1.399
GluLeu: 8.732 ± 1.128
GluMet: 2.241 ± 0.609
GluAsn: 4.25 ± 0.525
GluPro: 1.391 ± 0.401
GluGln: 3.014 ± 0.565
GluArg: 4.25 ± 0.61
GluSer: 5.1 ± 0.719
GluThr: 4.25 ± 0.636
GluVal: 4.868 ± 0.597
GluTrp: 0.927 ± 0.254
GluTyr: 2.859 ± 0.548
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.318 ± 0.51
PheCys: 0.232 ± 0.145
PheAsp: 4.25 ± 0.6
PheGlu: 4.096 ± 0.503
PhePhe: 1.932 ± 0.458
PheGly: 2.473 ± 0.664
PheHis: 0.232 ± 0.149
PheIle: 2.009 ± 0.406
PheLys: 3.632 ± 0.592
PheLeu: 2.705 ± 0.402
PheMet: 1.159 ± 0.405
PheAsn: 3.091 ± 0.717
PhePro: 0.773 ± 0.282
PheGln: 1.468 ± 0.338
PheArg: 1.623 ± 0.269
PheSer: 2.705 ± 0.569
PheThr: 2.627 ± 0.508
PheVal: 1.545 ± 0.395
PheTrp: 0.695 ± 0.244
PheTyr: 2.009 ± 0.431
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.627 ± 0.376
GlyCys: 0.155 ± 0.09
GlyAsp: 3.4 ± 0.619
GlyGlu: 4.482 ± 0.588
GlyPhe: 2.473 ± 0.518
GlyGly: 4.327 ± 1.109
GlyHis: 0.773 ± 0.211
GlyIle: 3.864 ± 0.699
GlyLys: 5.177 ± 0.529
GlyLeu: 5.718 ± 0.93
GlyMet: 1.545 ± 0.332
GlyAsn: 3.864 ± 0.558
GlyPro: 1.005 ± 0.272
GlyGln: 3.323 ± 0.459
GlyArg: 3.555 ± 0.557
GlySer: 3.632 ± 0.916
GlyThr: 3.168 ± 0.583
GlyVal: 4.173 ± 0.494
GlyTrp: 1.082 ± 0.549
GlyTyr: 2.782 ± 0.442
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.618 ± 0.302
HisCys: 0.0 ± 0.0
HisAsp: 0.695 ± 0.268
HisGlu: 1.314 ± 0.332
HisPhe: 0.773 ± 0.263
HisGly: 0.773 ± 0.321
HisHis: 0.309 ± 0.17
HisIle: 0.618 ± 0.279
HisLys: 0.773 ± 0.275
HisLeu: 1.236 ± 0.348
HisMet: 0.232 ± 0.149
HisAsn: 1.005 ± 0.229
HisPro: 0.773 ± 0.211
HisGln: 0.541 ± 0.236
HisArg: 0.773 ± 0.26
HisSer: 1.391 ± 0.379
HisThr: 0.927 ± 0.301
HisVal: 0.85 ± 0.295
HisTrp: 0.155 ± 0.121
HisTyr: 0.464 ± 0.186
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.409 ± 0.776
IleCys: 0.695 ± 0.196
IleAsp: 4.018 ± 0.618
IleGlu: 6.414 ± 0.867
IlePhe: 2.318 ± 0.53
IleGly: 3.941 ± 0.804
IleHis: 0.232 ± 0.145
IleIle: 2.859 ± 0.493
IleLys: 6.491 ± 0.769
IleLeu: 4.25 ± 0.757
IleMet: 1.159 ± 0.375
IleAsn: 3.091 ± 0.464
IlePro: 1.777 ± 0.308
IleGln: 2.627 ± 0.282
IleArg: 2.705 ± 0.689
IleSer: 5.796 ± 0.94
IleThr: 4.173 ± 0.467
IleVal: 3.091 ± 0.445
IleTrp: 0.464 ± 0.152
IleTyr: 2.395 ± 0.656
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.946 ± 0.701
LysCys: 0.309 ± 0.157
LysAsp: 5.95 ± 0.581
LysGlu: 7.186 ± 1.207
LysPhe: 3.168 ± 0.634
LysGly: 4.636 ± 0.608
LysHis: 1.7 ± 0.313
LysIle: 6.027 ± 0.752
LysLys: 8.036 ± 1.287
LysLeu: 7.496 ± 0.66
LysMet: 3.245 ± 0.428
LysAsn: 4.636 ± 0.557
LysPro: 2.859 ± 0.696
LysGln: 3.4 ± 0.609
LysArg: 3.941 ± 0.427
LysSer: 5.1 ± 0.604
LysThr: 5.641 ± 0.605
LysVal: 5.796 ± 0.592
LysTrp: 1.159 ± 0.356
LysTyr: 3.477 ± 0.463
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.723 ± 0.816
LeuCys: 0.618 ± 0.32
LeuAsp: 5.796 ± 0.749
LeuGlu: 7.418 ± 0.919
LeuPhe: 2.627 ± 0.429
LeuGly: 5.023 ± 0.952
LeuHis: 0.927 ± 0.259
LeuIle: 3.786 ± 0.569
LeuLys: 7.727 ± 0.733
LeuLeu: 7.109 ± 0.888
LeuMet: 2.241 ± 0.396
LeuAsn: 3.323 ± 0.723
LeuPro: 3.014 ± 0.57
LeuGln: 3.091 ± 0.651
LeuArg: 4.018 ± 0.547
LeuSer: 5.641 ± 0.974
LeuThr: 5.641 ± 0.962
LeuVal: 4.636 ± 0.568
LeuTrp: 0.695 ± 0.177
LeuTyr: 2.395 ± 0.327
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.932 ± 0.444
MetCys: 0.0 ± 0.0
MetAsp: 1.391 ± 0.236
MetGlu: 2.241 ± 0.517
MetPhe: 1.005 ± 0.243
MetGly: 1.082 ± 0.406
MetHis: 0.309 ± 0.147
MetIle: 1.855 ± 0.449
MetLys: 2.318 ± 0.526
MetLeu: 1.777 ± 0.39
MetMet: 0.386 ± 0.191
MetAsn: 1.468 ± 0.407
MetPro: 1.236 ± 0.357
MetGln: 0.927 ± 0.342
MetArg: 1.391 ± 0.335
MetSer: 1.391 ± 0.343
MetThr: 1.468 ± 0.397
MetVal: 1.468 ± 0.328
MetTrp: 0.232 ± 0.125
MetTyr: 0.927 ± 0.253
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.714 ± 0.692
AsnCys: 0.232 ± 0.112
AsnAsp: 2.705 ± 0.465
AsnGlu: 2.627 ± 0.455
AsnPhe: 2.395 ± 0.495
AsnGly: 4.096 ± 0.591
AsnHis: 1.005 ± 0.349
AsnIle: 2.859 ± 0.427
AsnLys: 4.482 ± 0.623
AsnLeu: 4.868 ± 0.655
AsnMet: 1.236 ± 0.332
AsnAsn: 2.705 ± 0.577
AsnPro: 1.855 ± 0.357
AsnGln: 2.936 ± 0.53
AsnArg: 2.627 ± 0.566
AsnSer: 3.632 ± 0.806
AsnThr: 3.091 ± 0.624
AsnVal: 3.245 ± 0.51
AsnTrp: 1.005 ± 0.2
AsnTyr: 2.086 ± 0.39
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.318 ± 0.476
ProCys: 0.155 ± 0.151
ProAsp: 1.623 ± 0.363
ProGlu: 3.245 ± 0.414
ProPhe: 0.695 ± 0.307
ProGly: 1.314 ± 0.397
ProHis: 0.464 ± 0.155
ProIle: 1.932 ± 0.459
ProLys: 3.091 ± 0.45
ProLeu: 1.391 ± 0.409
ProMet: 0.541 ± 0.208
ProAsn: 1.855 ± 0.39
ProPro: 0.541 ± 0.198
ProGln: 0.773 ± 0.392
ProArg: 1.236 ± 0.313
ProSer: 1.932 ± 0.555
ProThr: 0.695 ± 0.258
ProVal: 2.086 ± 0.473
ProTrp: 0.386 ± 0.155
ProTyr: 1.623 ± 0.487
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.632 ± 0.553
GlnCys: 0.155 ± 0.128
GlnAsp: 1.7 ± 0.331
GlnGlu: 3.709 ± 0.675
GlnPhe: 1.623 ± 0.316
GlnGly: 1.855 ± 0.373
GlnHis: 0.618 ± 0.269
GlnIle: 2.936 ± 0.517
GlnLys: 3.632 ± 0.452
GlnLeu: 2.936 ± 0.491
GlnMet: 0.85 ± 0.186
GlnAsn: 1.855 ± 0.376
GlnPro: 1.159 ± 0.37
GlnGln: 1.391 ± 0.37
GlnArg: 1.777 ± 0.452
GlnSer: 2.241 ± 0.425
GlnThr: 2.859 ± 0.5
GlnVal: 3.555 ± 0.52
GlnTrp: 0.464 ± 0.159
GlnTyr: 0.85 ± 0.301
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.014 ± 0.508
ArgCys: 0.309 ± 0.147
ArgAsp: 2.318 ± 0.447
ArgGlu: 2.782 ± 0.421
ArgPhe: 2.009 ± 0.463
ArgGly: 1.623 ± 0.364
ArgHis: 0.695 ± 0.259
ArgIle: 3.245 ± 0.645
ArgLys: 3.709 ± 0.715
ArgLeu: 4.714 ± 0.711
ArgMet: 2.086 ± 0.392
ArgAsn: 2.705 ± 0.561
ArgPro: 1.005 ± 0.236
ArgGln: 2.55 ± 0.561
ArgArg: 2.473 ± 0.625
ArgSer: 2.782 ± 0.451
ArgThr: 3.091 ± 0.615
ArgVal: 2.705 ± 0.378
ArgTrp: 0.541 ± 0.221
ArgTyr: 1.7 ± 0.413
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.327 ± 1.061
SerCys: 0.232 ± 0.129
SerAsp: 3.477 ± 0.63
SerGlu: 4.714 ± 0.635
SerPhe: 2.395 ± 0.475
SerGly: 5.255 ± 0.858
SerHis: 1.545 ± 0.434
SerIle: 4.405 ± 0.625
SerLys: 4.868 ± 0.75
SerLeu: 5.486 ± 0.75
SerMet: 1.236 ± 0.348
SerAsn: 3.323 ± 0.545
SerPro: 1.314 ± 0.275
SerGln: 2.009 ± 0.413
SerArg: 3.091 ± 0.64
SerSer: 3.864 ± 0.695
SerThr: 4.018 ± 0.539
SerVal: 3.555 ± 0.89
SerTrp: 1.082 ± 0.418
SerTyr: 2.705 ± 0.459
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.023 ± 1.079
ThrCys: 0.155 ± 0.121
ThrAsp: 3.941 ± 0.593
ThrGlu: 4.636 ± 0.613
ThrPhe: 3.477 ± 0.801
ThrGly: 4.327 ± 0.827
ThrHis: 0.773 ± 0.277
ThrIle: 4.714 ± 0.552
ThrLys: 4.946 ± 0.734
ThrLeu: 4.018 ± 0.556
ThrMet: 0.773 ± 0.269
ThrAsn: 3.632 ± 0.465
ThrPro: 1.468 ± 0.516
ThrGln: 2.782 ± 0.733
ThrArg: 1.7 ± 0.343
ThrSer: 3.864 ± 0.576
ThrThr: 4.482 ± 0.72
ThrVal: 4.25 ± 0.858
ThrTrp: 0.773 ± 0.258
ThrTyr: 2.627 ± 0.501
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.1 ± 0.645
ValCys: 0.309 ± 0.168
ValAsp: 4.096 ± 0.706
ValGlu: 5.486 ± 0.7
ValPhe: 1.855 ± 0.352
ValGly: 4.405 ± 0.595
ValHis: 1.005 ± 0.322
ValIle: 4.018 ± 0.711
ValLys: 4.714 ± 0.622
ValLeu: 4.482 ± 0.788
ValMet: 1.005 ± 0.3
ValAsn: 3.786 ± 0.536
ValPro: 2.009 ± 0.333
ValGln: 1.777 ± 0.463
ValArg: 2.473 ± 0.42
ValSer: 4.096 ± 0.57
ValThr: 4.636 ± 0.713
ValVal: 4.25 ± 0.738
ValTrp: 0.464 ± 0.162
ValTyr: 2.164 ± 0.459
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.005 ± 0.299
TrpCys: 0.155 ± 0.111
TrpAsp: 0.85 ± 0.322
TrpGlu: 0.85 ± 0.28
TrpPhe: 1.082 ± 0.43
TrpGly: 0.773 ± 0.198
TrpHis: 0.0 ± 0.0
TrpIle: 0.541 ± 0.229
TrpLys: 1.7 ± 0.364
TrpLeu: 0.695 ± 0.275
TrpMet: 0.386 ± 0.181
TrpAsn: 0.927 ± 0.284
TrpPro: 0.155 ± 0.102
TrpGln: 0.85 ± 0.303
TrpArg: 0.386 ± 0.203
TrpSer: 0.386 ± 0.145
TrpThr: 0.618 ± 0.176
TrpVal: 1.082 ± 0.254
TrpTrp: 0.155 ± 0.099
TrpTyr: 0.85 ± 0.551
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.009 ± 0.368
TyrCys: 0.309 ± 0.129
TyrAsp: 2.164 ± 0.346
TyrGlu: 2.241 ± 0.405
TyrPhe: 1.932 ± 0.506
TyrGly: 2.164 ± 0.404
TyrHis: 1.005 ± 0.271
TyrIle: 2.473 ± 0.481
TyrLys: 3.864 ± 0.708
TyrLeu: 2.859 ± 0.608
TyrMet: 0.464 ± 0.28
TyrAsn: 1.468 ± 0.402
TyrPro: 1.932 ± 0.479
TyrGln: 2.009 ± 0.403
TyrArg: 2.164 ± 0.491
TyrSer: 2.705 ± 0.595
TyrThr: 2.627 ± 0.555
TyrVal: 2.473 ± 0.513
TyrTrp: 0.309 ± 0.139
TyrTyr: 1.545 ± 0.606
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (12942 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
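
To illustrate what bootstrapping at the protein level can look like, here is a minimal sketch. It assumes that "×100" means 100 resamples, that whole proteins are drawn with replacement, and that the reported error is the standard deviation across resamples; none of these details are spelled out on this page, and the function names and toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            if a + b == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


toy_proteins = ["MKKLLEEK", "MALKKE", "MKVLAAGE"]  # made-up sequences
kk_error = bootstrap_error(toy_proteins, "KK")
ww_error = bootstrap_error(toy_proteins, "WW")  # WW never occurs in the toy set
```

A dipeptide that never occurs has a frequency of zero in every resample, so its estimated error is also zero, which is consistent with the "0.0 ± 0.0" entries in the table.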

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski