Amino acid dipepetide frequency for Gordonia phage BiPauneto

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.827AlaAla: 13.827 ± 1.255
0.852AlaCys: 0.852 ± 0.209
7.15AlaAsp: 7.15 ± 0.845
7.245AlaGlu: 7.245 ± 0.768
3.078AlaPhe: 3.078 ± 0.397
7.434AlaGly: 7.434 ± 0.733
2.368AlaHis: 2.368 ± 0.478
5.351AlaIle: 5.351 ± 0.577
5.019AlaLys: 5.019 ± 0.51
8.808AlaLeu: 8.808 ± 0.863
2.462AlaMet: 2.462 ± 0.317
3.031AlaAsn: 3.031 ± 0.394
4.309AlaPro: 4.309 ± 0.533
4.025AlaGln: 4.025 ± 0.489
6.582AlaArg: 6.582 ± 0.646
5.351AlaSer: 5.351 ± 0.678
6.156AlaThr: 6.156 ± 0.578
7.15AlaVal: 7.15 ± 0.538
1.847AlaTrp: 1.847 ± 0.315
2.604AlaTyr: 2.604 ± 0.368
0.0AlaXaa: 0.0 ± 0.0
Cys
0.758CysAla: 0.758 ± 0.194
0.095CysCys: 0.095 ± 0.067
0.805CysAsp: 0.805 ± 0.206
0.758CysGlu: 0.758 ± 0.159
0.095CysPhe: 0.095 ± 0.072
0.71CysGly: 0.71 ± 0.203
0.331CysHis: 0.331 ± 0.118
0.237CysIle: 0.237 ± 0.105
0.237CysLys: 0.237 ± 0.109
0.947CysLeu: 0.947 ± 0.221
0.0CysMet: 0.0 ± 0.0
0.189CysAsn: 0.189 ± 0.081
0.616CysPro: 0.616 ± 0.22
0.379CysGln: 0.379 ± 0.14
0.474CysArg: 0.474 ± 0.176
0.568CysSer: 0.568 ± 0.17
0.426CysThr: 0.426 ± 0.147
0.9CysVal: 0.9 ± 0.201
0.095CysTrp: 0.095 ± 0.066
0.189CysTyr: 0.189 ± 0.121
0.0CysXaa: 0.0 ± 0.0
Asp
7.482AspAla: 7.482 ± 0.638
0.568AspCys: 0.568 ± 0.169
7.008AspAsp: 7.008 ± 1.391
4.356AspGlu: 4.356 ± 0.612
2.036AspPhe: 2.036 ± 0.26
6.961AspGly: 6.961 ± 0.626
1.657AspHis: 1.657 ± 0.321
3.788AspIle: 3.788 ± 0.41
1.847AspLys: 1.847 ± 0.315
5.446AspLeu: 5.446 ± 0.535
1.61AspMet: 1.61 ± 0.246
1.894AspAsn: 1.894 ± 0.364
4.688AspPro: 4.688 ± 0.515
1.847AspGln: 1.847 ± 0.267
4.925AspArg: 4.925 ± 0.628
3.22AspSer: 3.22 ± 0.584
3.504AspThr: 3.504 ± 0.422
5.114AspVal: 5.114 ± 0.451
1.326AspTrp: 1.326 ± 0.261
1.799AspTyr: 1.799 ± 0.427
0.0AspXaa: 0.0 ± 0.0
Glu
5.919GluAla: 5.919 ± 0.512
0.568GluCys: 0.568 ± 0.174
4.262GluAsp: 4.262 ± 0.532
4.546GluGlu: 4.546 ± 0.561
2.32GluPhe: 2.32 ± 0.353
3.788GluGly: 3.788 ± 0.463
0.9GluHis: 0.9 ± 0.154
4.499GluIle: 4.499 ± 0.423
2.652GluLys: 2.652 ± 0.384
5.824GluLeu: 5.824 ± 0.528
1.989GluMet: 1.989 ± 0.332
2.084GluAsn: 2.084 ± 0.288
3.173GluPro: 3.173 ± 0.51
2.226GluGln: 2.226 ± 0.356
4.83GluArg: 4.83 ± 0.59
3.741GluSer: 3.741 ± 0.419
3.504GluThr: 3.504 ± 0.429
4.783GluVal: 4.783 ± 0.422
0.947GluTrp: 0.947 ± 0.207
1.705GluTyr: 1.705 ± 0.396
0.0GluXaa: 0.0 ± 0.0
Phe
2.604PheAla: 2.604 ± 0.322
0.379PheCys: 0.379 ± 0.141
2.699PheAsp: 2.699 ± 0.464
2.226PheGlu: 2.226 ± 0.353
0.663PhePhe: 0.663 ± 0.177
2.794PheGly: 2.794 ± 0.355
0.9PheHis: 0.9 ± 0.204
1.184PheIle: 1.184 ± 0.219
0.9PheLys: 0.9 ± 0.219
2.084PheLeu: 2.084 ± 0.38
0.521PheMet: 0.521 ± 0.137
0.71PheAsn: 0.71 ± 0.167
1.184PhePro: 1.184 ± 0.209
0.805PheGln: 0.805 ± 0.199
2.51PheArg: 2.51 ± 0.374
1.941PheSer: 1.941 ± 0.365
1.941PheThr: 1.941 ± 0.256
2.604PheVal: 2.604 ± 0.364
0.758PheTrp: 0.758 ± 0.204
1.042PheTyr: 1.042 ± 0.205
0.0PheXaa: 0.0 ± 0.0
Gly
7.624GlyAla: 7.624 ± 0.725
0.758GlyCys: 0.758 ± 0.218
6.298GlyAsp: 6.298 ± 0.493
6.061GlyGlu: 6.061 ± 0.502
2.746GlyPhe: 2.746 ± 0.345
7.482GlyGly: 7.482 ± 1.429
1.326GlyHis: 1.326 ± 0.233
4.735GlyIle: 4.735 ± 0.558
3.836GlyLys: 3.836 ± 0.415
5.73GlyLeu: 5.73 ± 0.648
1.847GlyMet: 1.847 ± 0.364
2.652GlyAsn: 2.652 ± 0.332
3.599GlyPro: 3.599 ± 0.496
3.031GlyGln: 3.031 ± 0.394
5.824GlyArg: 5.824 ± 0.525
5.73GlySer: 5.73 ± 0.715
5.351GlyThr: 5.351 ± 0.712
5.493GlyVal: 5.493 ± 0.539
1.799GlyTrp: 1.799 ± 0.291
2.794GlyTyr: 2.794 ± 0.401
0.0GlyXaa: 0.0 ± 0.0
His
2.084HisAla: 2.084 ± 0.297
0.189HisCys: 0.189 ± 0.09
1.468HisAsp: 1.468 ± 0.276
0.994HisGlu: 0.994 ± 0.215
0.805HisPhe: 0.805 ± 0.171
1.61HisGly: 1.61 ± 0.295
0.331HisHis: 0.331 ± 0.133
0.852HisIle: 0.852 ± 0.177
0.663HisLys: 0.663 ± 0.18
1.279HisLeu: 1.279 ± 0.268
0.521HisMet: 0.521 ± 0.176
0.189HisAsn: 0.189 ± 0.091
1.61HisPro: 1.61 ± 0.349
0.284HisGln: 0.284 ± 0.116
1.515HisArg: 1.515 ± 0.263
0.805HisSer: 0.805 ± 0.178
1.421HisThr: 1.421 ± 0.339
1.847HisVal: 1.847 ± 0.305
0.521HisTrp: 0.521 ± 0.133
0.663HisTyr: 0.663 ± 0.199
0.0HisXaa: 0.0 ± 0.0
Ile
5.682IleAla: 5.682 ± 0.625
0.237IleCys: 0.237 ± 0.117
3.551IleAsp: 3.551 ± 0.437
4.404IleGlu: 4.404 ± 0.506
0.994IlePhe: 0.994 ± 0.202
4.735IleGly: 4.735 ± 0.644
1.042IleHis: 1.042 ± 0.225
1.847IleIle: 1.847 ± 0.272
1.799IleLys: 1.799 ± 0.316
3.409IleLeu: 3.409 ± 0.436
0.852IleMet: 0.852 ± 0.199
2.036IleAsn: 2.036 ± 0.313
2.889IlePro: 2.889 ± 0.391
1.61IleGln: 1.61 ± 0.308
2.889IleArg: 2.889 ± 0.38
2.794IleSer: 2.794 ± 0.386
2.889IleThr: 2.889 ± 0.365
2.557IleVal: 2.557 ± 0.345
0.994IleTrp: 0.994 ± 0.208
1.421IleTyr: 1.421 ± 0.277
0.0IleXaa: 0.0 ± 0.0
Lys
4.404LysAla: 4.404 ± 0.49
0.284LysCys: 0.284 ± 0.095
3.078LysAsp: 3.078 ± 0.392
1.894LysGlu: 1.894 ± 0.311
1.184LysPhe: 1.184 ± 0.215
3.267LysGly: 3.267 ± 0.365
0.71LysHis: 0.71 ± 0.177
2.131LysIle: 2.131 ± 0.327
2.273LysLys: 2.273 ± 0.355
3.457LysLeu: 3.457 ± 0.471
0.947LysMet: 0.947 ± 0.208
1.563LysAsn: 1.563 ± 0.326
2.226LysPro: 2.226 ± 0.291
1.279LysGln: 1.279 ± 0.193
3.267LysArg: 3.267 ± 0.467
1.61LysSer: 1.61 ± 0.258
2.746LysThr: 2.746 ± 0.347
3.646LysVal: 3.646 ± 0.357
0.852LysTrp: 0.852 ± 0.22
1.373LysTyr: 1.373 ± 0.269
0.0LysXaa: 0.0 ± 0.0
Leu
9.044LeuAla: 9.044 ± 0.726
0.994LeuCys: 0.994 ± 0.277
5.304LeuAsp: 5.304 ± 0.509
3.646LeuGlu: 3.646 ± 0.368
2.368LeuPhe: 2.368 ± 0.373
6.109LeuGly: 6.109 ± 0.708
1.421LeuHis: 1.421 ± 0.264
2.841LeuIle: 2.841 ± 0.337
3.836LeuLys: 3.836 ± 0.43
5.304LeuLeu: 5.304 ± 0.516
1.421LeuMet: 1.421 ± 0.217
2.226LeuAsn: 2.226 ± 0.294
4.214LeuPro: 4.214 ± 0.494
1.894LeuGln: 1.894 ± 0.327
5.824LeuArg: 5.824 ± 0.507
4.593LeuSer: 4.593 ± 0.485
5.256LeuThr: 5.256 ± 0.546
5.304LeuVal: 5.304 ± 0.498
1.231LeuTrp: 1.231 ± 0.229
1.752LeuTyr: 1.752 ± 0.298
0.0LeuXaa: 0.0 ± 0.0
Met
2.462MetAla: 2.462 ± 0.419
0.0MetCys: 0.0 ± 0.0
1.136MetAsp: 1.136 ± 0.234
1.136MetGlu: 1.136 ± 0.22
0.805MetPhe: 0.805 ± 0.218
0.947MetGly: 0.947 ± 0.196
0.474MetHis: 0.474 ± 0.18
1.279MetIle: 1.279 ± 0.269
1.089MetLys: 1.089 ± 0.18
1.847MetLeu: 1.847 ± 0.323
0.568MetMet: 0.568 ± 0.147
0.994MetAsn: 0.994 ± 0.207
1.705MetPro: 1.705 ± 0.272
0.474MetGln: 0.474 ± 0.121
1.941MetArg: 1.941 ± 0.276
1.657MetSer: 1.657 ± 0.281
1.894MetThr: 1.894 ± 0.308
0.994MetVal: 0.994 ± 0.174
0.379MetTrp: 0.379 ± 0.125
0.521MetTyr: 0.521 ± 0.125
0.0MetXaa: 0.0 ± 0.0
Asn
2.746AsnAla: 2.746 ± 0.304
0.284AsnCys: 0.284 ± 0.106
1.989AsnAsp: 1.989 ± 0.323
2.084AsnGlu: 2.084 ± 0.259
1.136AsnPhe: 1.136 ± 0.198
3.551AsnGly: 3.551 ± 0.505
0.616AsnHis: 0.616 ± 0.168
1.468AsnIle: 1.468 ± 0.24
1.279AsnLys: 1.279 ± 0.215
2.32AsnLeu: 2.32 ± 0.314
0.805AsnMet: 0.805 ± 0.18
1.042AsnAsn: 1.042 ± 0.243
2.462AsnPro: 2.462 ± 0.41
0.663AsnGln: 0.663 ± 0.148
2.178AsnArg: 2.178 ± 0.343
1.61AsnSer: 1.61 ± 0.321
2.557AsnThr: 2.557 ± 0.324
2.273AsnVal: 2.273 ± 0.365
0.758AsnTrp: 0.758 ± 0.178
0.994AsnTyr: 0.994 ± 0.232
0.0AsnXaa: 0.0 ± 0.0
Pro
5.54ProAla: 5.54 ± 0.575
0.331ProCys: 0.331 ± 0.131
4.546ProAsp: 4.546 ± 0.512
3.409ProGlu: 3.409 ± 0.46
2.036ProPhe: 2.036 ± 0.321
4.546ProGly: 4.546 ± 0.609
0.758ProHis: 0.758 ± 0.151
3.031ProIle: 3.031 ± 0.4
2.746ProLys: 2.746 ± 0.351
3.788ProLeu: 3.788 ± 0.398
1.231ProMet: 1.231 ± 0.291
2.084ProAsn: 2.084 ± 0.332
2.794ProPro: 2.794 ± 0.472
1.799ProGln: 1.799 ± 0.25
3.741ProArg: 3.741 ± 0.449
3.267ProSer: 3.267 ± 0.4
3.694ProThr: 3.694 ± 0.356
4.262ProVal: 4.262 ± 0.489
0.616ProTrp: 0.616 ± 0.155
0.9ProTyr: 0.9 ± 0.2
0.0ProXaa: 0.0 ± 0.0
Gln
2.983GlnAla: 2.983 ± 0.414
0.189GlnCys: 0.189 ± 0.093
1.136GlnAsp: 1.136 ± 0.221
1.657GlnGlu: 1.657 ± 0.319
0.994GlnPhe: 0.994 ± 0.215
2.036GlnGly: 2.036 ± 0.28
0.521GlnHis: 0.521 ± 0.167
1.563GlnIle: 1.563 ± 0.289
1.563GlnLys: 1.563 ± 0.227
2.983GlnLeu: 2.983 ± 0.463
1.184GlnMet: 1.184 ± 0.217
1.468GlnAsn: 1.468 ± 0.269
1.468GlnPro: 1.468 ± 0.239
1.279GlnGln: 1.279 ± 0.296
3.078GlnArg: 3.078 ± 0.38
1.326GlnSer: 1.326 ± 0.2
1.989GlnThr: 1.989 ± 0.323
2.273GlnVal: 2.273 ± 0.4
0.758GlnTrp: 0.758 ± 0.2
0.852GlnTyr: 0.852 ± 0.177
0.0GlnXaa: 0.0 ± 0.0
Arg
6.771ArgAla: 6.771 ± 0.532
1.326ArgCys: 1.326 ± 0.287
4.877ArgAsp: 4.877 ± 0.663
5.351ArgGlu: 5.351 ± 0.58
2.604ArgPhe: 2.604 ± 0.447
6.014ArgGly: 6.014 ± 0.607
1.941ArgHis: 1.941 ± 0.341
3.173ArgIle: 3.173 ± 0.375
3.741ArgLys: 3.741 ± 0.4
4.641ArgLeu: 4.641 ± 0.497
1.894ArgMet: 1.894 ± 0.337
2.368ArgAsn: 2.368 ± 0.33
3.22ArgPro: 3.22 ± 0.339
2.51ArgGln: 2.51 ± 0.422
7.15ArgArg: 7.15 ± 0.681
3.22ArgSer: 3.22 ± 0.439
3.93ArgThr: 3.93 ± 0.482
5.398ArgVal: 5.398 ± 0.587
1.515ArgTrp: 1.515 ± 0.296
1.752ArgTyr: 1.752 ± 0.302
0.0ArgXaa: 0.0 ± 0.0
Ser
6.44SerAla: 6.44 ± 0.594
0.284SerCys: 0.284 ± 0.111
3.504SerAsp: 3.504 ± 0.588
3.599SerGlu: 3.599 ± 0.372
1.279SerPhe: 1.279 ± 0.224
5.966SerGly: 5.966 ± 0.505
1.136SerHis: 1.136 ± 0.228
2.794SerIle: 2.794 ± 0.339
2.131SerLys: 2.131 ± 0.396
3.409SerLeu: 3.409 ± 0.323
1.563SerMet: 1.563 ± 0.307
1.941SerAsn: 1.941 ± 0.304
3.078SerPro: 3.078 ± 0.393
1.894SerGln: 1.894 ± 0.319
3.504SerArg: 3.504 ± 0.498
3.409SerSer: 3.409 ± 0.609
3.457SerThr: 3.457 ± 0.369
3.457SerVal: 3.457 ± 0.368
1.136SerTrp: 1.136 ± 0.214
1.279SerTyr: 1.279 ± 0.204
0.0SerXaa: 0.0 ± 0.0
Thr
6.535ThrAla: 6.535 ± 0.669
0.474ThrCys: 0.474 ± 0.164
3.694ThrAsp: 3.694 ± 0.402
2.746ThrGlu: 2.746 ± 0.349
2.32ThrPhe: 2.32 ± 0.292
6.866ThrGly: 6.866 ± 0.562
1.136ThrHis: 1.136 ± 0.233
2.794ThrIle: 2.794 ± 0.351
2.604ThrLys: 2.604 ± 0.347
4.546ThrLeu: 4.546 ± 0.475
0.9ThrMet: 0.9 ± 0.213
2.084ThrAsn: 2.084 ± 0.354
5.019ThrPro: 5.019 ± 0.506
1.657ThrGln: 1.657 ± 0.371
4.451ThrArg: 4.451 ± 0.547
3.362ThrSer: 3.362 ± 0.43
4.309ThrThr: 4.309 ± 0.443
4.972ThrVal: 4.972 ± 0.487
1.515ThrTrp: 1.515 ± 0.282
1.705ThrTyr: 1.705 ± 0.302
0.0ThrXaa: 0.0 ± 0.0
Val
7.861ValAla: 7.861 ± 0.737
0.474ValCys: 0.474 ± 0.167
5.54ValAsp: 5.54 ± 0.544
5.588ValGlu: 5.588 ± 0.571
1.705ValPhe: 1.705 ± 0.239
5.682ValGly: 5.682 ± 0.499
0.947ValHis: 0.947 ± 0.21
3.551ValIle: 3.551 ± 0.404
2.652ValLys: 2.652 ± 0.319
4.877ValLeu: 4.877 ± 0.409
0.947ValMet: 0.947 ± 0.209
2.604ValAsn: 2.604 ± 0.347
3.741ValPro: 3.741 ± 0.392
2.415ValGln: 2.415 ± 0.322
5.351ValArg: 5.351 ± 0.47
4.356ValSer: 4.356 ± 0.425
5.493ValThr: 5.493 ± 0.44
5.304ValVal: 5.304 ± 0.545
1.279ValTrp: 1.279 ± 0.268
1.421ValTyr: 1.421 ± 0.303
0.0ValXaa: 0.0 ± 0.0
Trp
1.752TrpAla: 1.752 ± 0.272
0.237TrpCys: 0.237 ± 0.101
1.373TrpAsp: 1.373 ± 0.266
1.136TrpGlu: 1.136 ± 0.245
0.568TrpPhe: 0.568 ± 0.168
1.326TrpGly: 1.326 ± 0.226
0.663TrpHis: 0.663 ± 0.177
0.9TrpIle: 0.9 ± 0.207
0.663TrpLys: 0.663 ± 0.168
1.61TrpLeu: 1.61 ± 0.312
0.331TrpMet: 0.331 ± 0.109
0.805TrpAsn: 0.805 ± 0.2
0.994TrpPro: 0.994 ± 0.202
0.758TrpGln: 0.758 ± 0.213
1.089TrpArg: 1.089 ± 0.3
1.326TrpSer: 1.326 ± 0.234
1.705TrpThr: 1.705 ± 0.275
1.61TrpVal: 1.61 ± 0.241
0.758TrpTrp: 0.758 ± 0.2
0.237TrpTyr: 0.237 ± 0.117
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.178TyrAla: 2.178 ± 0.308
0.284TyrCys: 0.284 ± 0.104
1.657TyrAsp: 1.657 ± 0.302
1.421TyrGlu: 1.421 ± 0.25
0.71TyrPhe: 0.71 ± 0.183
2.889TyrGly: 2.889 ± 0.342
0.474TyrHis: 0.474 ± 0.145
0.616TyrIle: 0.616 ± 0.179
0.663TyrLys: 0.663 ± 0.142
2.273TyrLeu: 2.273 ± 0.398
0.663TyrMet: 0.663 ± 0.15
0.852TyrAsn: 0.852 ± 0.18
2.131TyrPro: 2.131 ± 0.375
0.474TyrGln: 0.474 ± 0.139
2.273TyrArg: 2.273 ± 0.307
1.373TyrSer: 1.373 ± 0.229
1.468TyrThr: 1.468 ± 0.283
1.847TyrVal: 1.847 ± 0.349
0.758TyrTrp: 0.758 ± 0.221
0.474TyrTyr: 0.474 ± 0.158
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 95 proteins (21119 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski