Amino acid dipepetide frequency for Enterococcus phage Nonaheksakonda

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.156AlaAla: 0.156 ± 0.093
0.311AlaCys: 0.311 ± 0.143
3.424AlaAsp: 3.424 ± 0.608
4.28AlaGlu: 4.28 ± 0.644
2.023AlaPhe: 2.023 ± 0.434
3.035AlaGly: 3.035 ± 0.524
0.934AlaHis: 0.934 ± 0.312
5.369AlaIle: 5.369 ± 0.8
5.68AlaLys: 5.68 ± 0.581
4.435AlaLeu: 4.435 ± 0.692
2.568AlaMet: 2.568 ± 0.547
3.502AlaAsn: 3.502 ± 0.544
1.868AlaPro: 1.868 ± 0.421
1.868AlaGln: 1.868 ± 0.526
1.868AlaArg: 1.868 ± 0.365
2.879AlaSer: 2.879 ± 0.506
4.28AlaThr: 4.28 ± 0.786
3.969AlaVal: 3.969 ± 0.609
0.467AlaTrp: 0.467 ± 0.204
3.268AlaTyr: 3.268 ± 0.422
0.0AlaXaa: 0.0 ± 0.0
Cys
0.545CysAla: 0.545 ± 0.222
0.0CysCys: 0.0 ± 0.0
0.467CysAsp: 0.467 ± 0.194
0.7CysGlu: 0.7 ± 0.26
0.156CysPhe: 0.156 ± 0.126
0.7CysGly: 0.7 ± 0.268
0.233CysHis: 0.233 ± 0.132
0.545CysIle: 0.545 ± 0.236
0.856CysLys: 0.856 ± 0.246
0.389CysLeu: 0.389 ± 0.187
0.156CysMet: 0.156 ± 0.108
0.623CysAsn: 0.623 ± 0.197
0.0CysPro: 0.0 ± 0.0
0.156CysGln: 0.156 ± 0.129
0.156CysArg: 0.156 ± 0.115
0.623CysSer: 0.623 ± 0.205
0.778CysThr: 0.778 ± 0.273
0.311CysVal: 0.311 ± 0.143
0.156CysTrp: 0.156 ± 0.106
0.233CysTyr: 0.233 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
3.346AspAla: 3.346 ± 0.566
0.467AspCys: 0.467 ± 0.202
3.19AspAsp: 3.19 ± 0.583
5.914AspGlu: 5.914 ± 0.68
2.957AspPhe: 2.957 ± 0.438
5.525AspGly: 5.525 ± 0.492
0.545AspHis: 0.545 ± 0.257
4.358AspIle: 4.358 ± 0.541
5.914AspLys: 5.914 ± 0.935
5.836AspLeu: 5.836 ± 0.676
1.79AspMet: 1.79 ± 0.298
4.202AspAsn: 4.202 ± 0.47
1.945AspPro: 1.945 ± 0.373
1.089AspGln: 1.089 ± 0.293
2.879AspArg: 2.879 ± 0.459
2.879AspSer: 2.879 ± 0.423
2.957AspThr: 2.957 ± 0.433
4.358AspVal: 4.358 ± 0.666
1.012AspTrp: 1.012 ± 0.243
2.646AspTyr: 2.646 ± 0.433
0.0AspXaa: 0.0 ± 0.0
Glu
4.435GluAla: 4.435 ± 0.604
0.7GluCys: 0.7 ± 0.259
4.825GluAsp: 4.825 ± 0.668
7.003GluGlu: 7.003 ± 1.055
3.735GluPhe: 3.735 ± 0.612
2.801GluGly: 2.801 ± 0.502
1.323GluHis: 1.323 ± 0.332
4.902GluIle: 4.902 ± 0.655
6.147GluLys: 6.147 ± 0.705
9.727GluLeu: 9.727 ± 1.111
3.19GluMet: 3.19 ± 0.565
4.435GluAsn: 4.435 ± 0.605
2.568GluPro: 2.568 ± 0.527
3.113GluGln: 3.113 ± 0.572
4.591GluArg: 4.591 ± 0.744
3.579GluSer: 3.579 ± 0.579
4.28GluThr: 4.28 ± 0.527
6.147GluVal: 6.147 ± 0.976
1.089GluTrp: 1.089 ± 0.303
4.358GluTyr: 4.358 ± 0.645
0.0GluXaa: 0.0 ± 0.0
Phe
1.401PheAla: 1.401 ± 0.339
0.233PheCys: 0.233 ± 0.131
2.334PheAsp: 2.334 ± 0.506
3.035PheGlu: 3.035 ± 0.533
1.167PhePhe: 1.167 ± 0.358
2.957PheGly: 2.957 ± 0.6
0.233PheHis: 0.233 ± 0.129
3.424PheIle: 3.424 ± 0.538
4.046PheLys: 4.046 ± 0.658
2.412PheLeu: 2.412 ± 0.486
0.7PheMet: 0.7 ± 0.238
2.957PheAsn: 2.957 ± 0.454
0.623PhePro: 0.623 ± 0.192
1.556PheGln: 1.556 ± 0.403
1.478PheArg: 1.478 ± 0.334
2.646PheSer: 2.646 ± 0.472
3.657PheThr: 3.657 ± 0.561
2.334PheVal: 2.334 ± 0.367
0.467PheTrp: 0.467 ± 0.177
1.245PheTyr: 1.245 ± 0.314
0.0PheXaa: 0.0 ± 0.0
Gly
3.969GlyAla: 3.969 ± 1.011
0.467GlyCys: 0.467 ± 0.226
3.502GlyAsp: 3.502 ± 0.662
3.424GlyGlu: 3.424 ± 0.392
3.113GlyPhe: 3.113 ± 0.432
3.813GlyGly: 3.813 ± 0.643
0.934GlyHis: 0.934 ± 0.283
5.291GlyIle: 5.291 ± 0.955
4.98GlyLys: 4.98 ± 0.708
5.68GlyLeu: 5.68 ± 0.892
1.79GlyMet: 1.79 ± 0.348
3.813GlyAsn: 3.813 ± 0.498
0.778GlyPro: 0.778 ± 0.31
2.334GlyGln: 2.334 ± 0.316
2.257GlyArg: 2.257 ± 0.475
3.113GlySer: 3.113 ± 0.437
4.28GlyThr: 4.28 ± 0.72
4.669GlyVal: 4.669 ± 0.609
1.089GlyTrp: 1.089 ± 0.279
2.257GlyTyr: 2.257 ± 0.429
0.0GlyXaa: 0.0 ± 0.0
His
0.934HisAla: 0.934 ± 0.243
0.311HisCys: 0.311 ± 0.143
0.856HisAsp: 0.856 ± 0.298
1.323HisGlu: 1.323 ± 0.318
0.856HisPhe: 0.856 ± 0.306
1.089HisGly: 1.089 ± 0.28
0.233HisHis: 0.233 ± 0.131
0.856HisIle: 0.856 ± 0.242
1.712HisLys: 1.712 ± 0.39
0.934HisLeu: 0.934 ± 0.329
0.389HisMet: 0.389 ± 0.169
1.245HisAsn: 1.245 ± 0.338
0.389HisPro: 0.389 ± 0.156
0.389HisGln: 0.389 ± 0.166
0.623HisArg: 0.623 ± 0.187
0.623HisSer: 0.623 ± 0.226
0.856HisThr: 0.856 ± 0.347
0.311HisVal: 0.311 ± 0.14
0.156HisTrp: 0.156 ± 0.107
1.245HisTyr: 1.245 ± 0.374
0.0HisXaa: 0.0 ± 0.0
Ile
3.502IleAla: 3.502 ± 0.649
0.623IleCys: 0.623 ± 0.213
5.603IleAsp: 5.603 ± 0.691
6.147IleGlu: 6.147 ± 0.854
2.101IlePhe: 2.101 ± 0.437
4.202IleGly: 4.202 ± 0.636
0.856IleHis: 0.856 ± 0.191
4.98IleIle: 4.98 ± 0.656
6.147IleLys: 6.147 ± 0.798
5.914IleLeu: 5.914 ± 0.772
1.478IleMet: 1.478 ± 0.492
3.969IleAsn: 3.969 ± 0.58
2.801IlePro: 2.801 ± 0.389
3.113IleGln: 3.113 ± 0.395
1.634IleArg: 1.634 ± 0.347
4.591IleSer: 4.591 ± 0.65
3.813IleThr: 3.813 ± 0.462
4.046IleVal: 4.046 ± 0.796
0.856IleTrp: 0.856 ± 0.238
2.179IleTyr: 2.179 ± 0.414
0.0IleXaa: 0.0 ± 0.0
Lys
5.914LysAla: 5.914 ± 0.698
0.545LysCys: 0.545 ± 0.206
4.825LysAsp: 4.825 ± 0.744
9.182LysGlu: 9.182 ± 1.313
3.424LysPhe: 3.424 ± 0.359
5.291LysGly: 5.291 ± 0.65
1.401LysHis: 1.401 ± 0.359
4.202LysIle: 4.202 ± 0.667
7.626LysLys: 7.626 ± 0.829
6.692LysLeu: 6.692 ± 0.781
3.657LysMet: 3.657 ± 0.573
5.214LysAsn: 5.214 ± 0.494
2.724LysPro: 2.724 ± 0.604
3.735LysGln: 3.735 ± 0.591
4.28LysArg: 4.28 ± 0.489
3.891LysSer: 3.891 ± 0.485
4.98LysThr: 4.98 ± 0.675
5.836LysVal: 5.836 ± 0.59
1.167LysTrp: 1.167 ± 0.268
3.268LysTyr: 3.268 ± 0.515
0.0LysXaa: 0.0 ± 0.0
Leu
5.525LeuAla: 5.525 ± 0.769
0.7LeuCys: 0.7 ± 0.29
7.315LeuAsp: 7.315 ± 0.754
8.56LeuGlu: 8.56 ± 1.043
2.646LeuPhe: 2.646 ± 0.42
5.136LeuGly: 5.136 ± 0.894
0.7LeuHis: 0.7 ± 0.221
4.28LeuIle: 4.28 ± 0.605
7.937LeuLys: 7.937 ± 1.046
6.692LeuLeu: 6.692 ± 0.989
2.49LeuMet: 2.49 ± 0.475
6.614LeuAsn: 6.614 ± 0.664
2.879LeuPro: 2.879 ± 0.505
3.346LeuGln: 3.346 ± 0.679
3.035LeuArg: 3.035 ± 0.588
3.735LeuSer: 3.735 ± 0.582
4.747LeuThr: 4.747 ± 0.551
6.147LeuVal: 6.147 ± 0.67
1.012LeuTrp: 1.012 ± 0.236
2.724LeuTyr: 2.724 ± 0.463
0.0LeuXaa: 0.0 ± 0.0
Met
1.945MetAla: 1.945 ± 0.442
0.233MetCys: 0.233 ± 0.143
1.945MetAsp: 1.945 ± 0.422
2.724MetGlu: 2.724 ± 0.515
1.089MetPhe: 1.089 ± 0.283
1.634MetGly: 1.634 ± 0.49
0.467MetHis: 0.467 ± 0.173
2.257MetIle: 2.257 ± 0.672
2.801MetLys: 2.801 ± 0.574
3.19MetLeu: 3.19 ± 0.627
0.856MetMet: 0.856 ± 0.252
1.945MetAsn: 1.945 ± 0.391
0.623MetPro: 0.623 ± 0.232
1.012MetGln: 1.012 ± 0.33
1.245MetArg: 1.245 ± 0.352
1.868MetSer: 1.868 ± 0.348
1.634MetThr: 1.634 ± 0.472
1.012MetVal: 1.012 ± 0.295
0.545MetTrp: 0.545 ± 0.238
1.556MetTyr: 1.556 ± 0.457
0.0MetXaa: 0.0 ± 0.0
Asn
4.435AsnAla: 4.435 ± 0.815
0.311AsnCys: 0.311 ± 0.15
4.046AsnAsp: 4.046 ± 0.492
5.214AsnGlu: 5.214 ± 0.637
1.556AsnPhe: 1.556 ± 0.437
6.225AsnGly: 6.225 ± 0.868
0.778AsnHis: 0.778 ± 0.251
3.891AsnIle: 3.891 ± 0.583
5.369AsnLys: 5.369 ± 0.679
5.058AsnLeu: 5.058 ± 0.55
2.646AsnMet: 2.646 ± 0.528
4.046AsnAsn: 4.046 ± 0.601
2.023AsnPro: 2.023 ± 0.363
1.634AsnGln: 1.634 ± 0.271
0.934AsnArg: 0.934 ± 0.226
2.568AsnSer: 2.568 ± 0.459
4.669AsnThr: 4.669 ± 0.75
3.735AsnVal: 3.735 ± 0.557
0.778AsnTrp: 0.778 ± 0.284
2.957AsnTyr: 2.957 ± 0.517
0.0AsnXaa: 0.0 ± 0.0
Pro
2.101ProAla: 2.101 ± 0.42
0.156ProCys: 0.156 ± 0.113
1.79ProAsp: 1.79 ± 0.41
3.113ProGlu: 3.113 ± 0.479
1.089ProPhe: 1.089 ± 0.24
0.0ProGly: 0.0 ± 0.0
0.311ProHis: 0.311 ± 0.18
1.634ProIle: 1.634 ± 0.351
3.19ProLys: 3.19 ± 0.58
2.957ProLeu: 2.957 ± 0.52
1.012ProMet: 1.012 ± 0.245
1.401ProAsn: 1.401 ± 0.385
0.545ProPro: 0.545 ± 0.213
1.634ProGln: 1.634 ± 0.337
0.389ProArg: 0.389 ± 0.14
1.634ProSer: 1.634 ± 0.338
1.401ProThr: 1.401 ± 0.261
1.712ProVal: 1.712 ± 0.346
0.233ProTrp: 0.233 ± 0.133
1.868ProTyr: 1.868 ± 0.489
0.0ProXaa: 0.0 ± 0.0
Gln
2.568GlnAla: 2.568 ± 0.678
0.545GlnCys: 0.545 ± 0.281
1.712GlnAsp: 1.712 ± 0.353
2.879GlnGlu: 2.879 ± 0.457
1.401GlnPhe: 1.401 ± 0.323
1.945GlnGly: 1.945 ± 0.319
0.778GlnHis: 0.778 ± 0.239
2.568GlnIle: 2.568 ± 0.479
2.023GlnLys: 2.023 ± 0.401
3.19GlnLeu: 3.19 ± 0.391
0.7GlnMet: 0.7 ± 0.273
2.023GlnAsn: 2.023 ± 0.369
1.323GlnPro: 1.323 ± 0.351
1.945GlnGln: 1.945 ± 0.436
2.257GlnArg: 2.257 ± 0.417
2.101GlnSer: 2.101 ± 0.365
2.101GlnThr: 2.101 ± 0.335
2.334GlnVal: 2.334 ± 0.431
0.467GlnTrp: 0.467 ± 0.271
2.257GlnTyr: 2.257 ± 0.447
0.0GlnXaa: 0.0 ± 0.0
Arg
1.634ArgAla: 1.634 ± 0.367
0.389ArgCys: 0.389 ± 0.145
2.412ArgAsp: 2.412 ± 0.401
2.257ArgGlu: 2.257 ± 0.426
2.023ArgPhe: 2.023 ± 0.279
1.79ArgGly: 1.79 ± 0.465
0.934ArgHis: 0.934 ± 0.29
2.801ArgIle: 2.801 ± 0.428
2.957ArgLys: 2.957 ± 0.671
3.579ArgLeu: 3.579 ± 0.602
1.167ArgMet: 1.167 ± 0.314
2.646ArgAsn: 2.646 ± 0.482
1.167ArgPro: 1.167 ± 0.212
1.478ArgGln: 1.478 ± 0.303
1.245ArgArg: 1.245 ± 0.306
1.79ArgSer: 1.79 ± 0.45
1.323ArgThr: 1.323 ± 0.364
2.49ArgVal: 2.49 ± 0.479
0.545ArgTrp: 0.545 ± 0.191
2.179ArgTyr: 2.179 ± 0.451
0.0ArgXaa: 0.0 ± 0.0
Ser
3.113SerAla: 3.113 ± 0.483
0.156SerCys: 0.156 ± 0.119
3.424SerAsp: 3.424 ± 0.49
3.813SerGlu: 3.813 ± 0.547
2.334SerPhe: 2.334 ± 0.375
4.513SerGly: 4.513 ± 0.791
1.634SerHis: 1.634 ± 0.359
3.735SerIle: 3.735 ± 0.72
4.825SerLys: 4.825 ± 0.625
3.579SerLeu: 3.579 ± 0.586
1.245SerMet: 1.245 ± 0.282
2.801SerAsn: 2.801 ± 0.502
0.856SerPro: 0.856 ± 0.265
1.945SerGln: 1.945 ± 0.521
1.478SerArg: 1.478 ± 0.398
2.568SerSer: 2.568 ± 0.592
3.268SerThr: 3.268 ± 0.723
3.346SerVal: 3.346 ± 0.44
0.778SerTrp: 0.778 ± 0.238
2.334SerTyr: 2.334 ± 0.598
0.0SerXaa: 0.0 ± 0.0
Thr
3.268ThrAla: 3.268 ± 0.503
0.233ThrCys: 0.233 ± 0.131
3.502ThrAsp: 3.502 ± 0.535
4.202ThrGlu: 4.202 ± 0.744
2.257ThrPhe: 2.257 ± 0.44
3.735ThrGly: 3.735 ± 0.617
0.856ThrHis: 0.856 ± 0.255
4.669ThrIle: 4.669 ± 0.652
5.447ThrLys: 5.447 ± 0.626
4.825ThrLeu: 4.825 ± 0.684
1.556ThrMet: 1.556 ± 0.347
3.19ThrAsn: 3.19 ± 0.469
2.101ThrPro: 2.101 ± 0.474
3.19ThrGln: 3.19 ± 0.463
1.945ThrArg: 1.945 ± 0.319
2.49ThrSer: 2.49 ± 0.485
4.046ThrThr: 4.046 ± 0.773
4.747ThrVal: 4.747 ± 0.619
0.856ThrTrp: 0.856 ± 0.317
2.334ThrTyr: 2.334 ± 0.64
0.0ThrXaa: 0.0 ± 0.0
Val
4.669ValAla: 4.669 ± 0.513
0.623ValCys: 0.623 ± 0.254
4.046ValAsp: 4.046 ± 0.531
4.513ValGlu: 4.513 ± 0.598
2.49ValPhe: 2.49 ± 0.422
3.969ValGly: 3.969 ± 0.565
0.934ValHis: 0.934 ± 0.306
4.202ValIle: 4.202 ± 0.587
4.902ValLys: 4.902 ± 0.57
6.07ValLeu: 6.07 ± 0.679
1.634ValMet: 1.634 ± 0.279
4.591ValAsn: 4.591 ± 0.777
1.634ValPro: 1.634 ± 0.337
2.179ValGln: 2.179 ± 0.633
2.724ValArg: 2.724 ± 0.424
5.291ValSer: 5.291 ± 0.618
3.346ValThr: 3.346 ± 0.604
5.369ValVal: 5.369 ± 0.761
1.012ValTrp: 1.012 ± 0.304
2.568ValTyr: 2.568 ± 0.547
0.0ValXaa: 0.0 ± 0.0
Trp
0.233TrpAla: 0.233 ± 0.129
0.156TrpCys: 0.156 ± 0.11
0.778TrpAsp: 0.778 ± 0.49
1.167TrpGlu: 1.167 ± 0.303
0.7TrpPhe: 0.7 ± 0.227
0.934TrpGly: 0.934 ± 0.349
0.467TrpHis: 0.467 ± 0.224
0.545TrpIle: 0.545 ± 0.198
1.556TrpLys: 1.556 ± 0.319
1.556TrpLeu: 1.556 ± 0.428
0.156TrpMet: 0.156 ± 0.092
0.467TrpAsn: 0.467 ± 0.168
0.0TrpPro: 0.0 ± 0.0
0.389TrpGln: 0.389 ± 0.181
0.778TrpArg: 0.778 ± 0.244
0.778TrpSer: 0.778 ± 0.237
0.7TrpThr: 0.7 ± 0.185
1.401TrpVal: 1.401 ± 0.272
0.311TrpTrp: 0.311 ± 0.131
0.389TrpTyr: 0.389 ± 0.168
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.49TyrAla: 2.49 ± 0.361
0.623TyrCys: 0.623 ± 0.204
3.813TyrAsp: 3.813 ± 0.704
3.579TyrGlu: 3.579 ± 0.458
1.79TyrPhe: 1.79 ± 0.384
2.257TyrGly: 2.257 ± 0.395
0.778TyrHis: 0.778 ± 0.29
4.046TyrIle: 4.046 ± 0.58
3.735TyrLys: 3.735 ± 0.594
3.502TyrLeu: 3.502 ± 0.6
1.323TyrMet: 1.323 ± 0.304
3.19TyrAsn: 3.19 ± 0.601
1.323TyrPro: 1.323 ± 0.29
1.012TyrGln: 1.012 ± 0.366
1.012TyrArg: 1.012 ± 0.31
2.101TyrSer: 2.101 ± 0.44
2.334TyrThr: 2.334 ± 0.544
2.49TyrVal: 2.49 ± 0.464
0.467TyrTrp: 0.467 ± 0.171
2.023TyrTyr: 2.023 ± 0.478
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (12852 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski