Amino acid dipeptide frequency for Microbacterium phage IAmGroot

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
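As a rough illustration of how such a frequency can be computed, the following Python sketch counts overlapping pairs of adjacent residues and reports them in per mille. It is not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the use of one-letter codes (the table uses three-letter codes), the mapping of non-standard residues to X, and the choice that pairs never span two proteins are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping windows of length 2; pairs do not span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: "MAAL" gives MA, AA, AL and "AAK" gives AA, AK.
freqs = dipeptide_per_mille(["MAAL", "AAK"])
```

In this toy input, AA accounts for two of the five counted pairs, so its frequency is 400‰.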

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the table marks dipeptides that are more frequent than expected in red and underrepresented ones in blue.
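The comparison against the uniform expectation can be sketched in a few lines of Python. The sample values are copied from the table on this page; the function name `classify` and the exact threshold (1/400 rather than 1/441) are assumptions for illustration, since the page does not state the precise rule behind the red and blue marking.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille (= 2.5).
EXPECTED_PER_MILLE = 1000.0 / 400

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"      # would be marked red
    if freq_per_mille < expected:
        return "under"     # would be marked blue
    return "expected"

# A few per-mille values taken from the table for this proteome.
sample = {"AlaAla": 20.931, "AlaLeu": 14.049, "CysCys": 0.071, "TrpTrp": 0.071}
labels = {pair: classify(value) for pair, value in sample.items()}
```

Under this assumption AlaAla and AlaLeu are overrepresented, while CysCys and TrpTrp are underrepresented.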

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
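The unit conversion is simple arithmetic; a minimal Python sketch is shown here. The helper names are hypothetical, and the AlaAla value is taken from the table on this page.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 20.931                                # AlaAla, in per mille
fraction = per_mille_to_fraction(ala_ala)       # about 0.020931
percent = per_mille_to_percent(ala_ala)         # about 2.0931 %
uniform_percent = per_mille_to_percent(2.5)     # the 0.25% uniform expectation
```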

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.931 ± 2.222
AlaCys: 0.284 ± 0.163
AlaAsp: 8.018 ± 0.619
AlaGlu: 9.933 ± 0.978
AlaPhe: 3.902 ± 0.605
AlaGly: 7.947 ± 1.092
AlaHis: 2.767 ± 0.407
AlaIle: 7.095 ± 0.829
AlaLys: 6.173 ± 0.998
AlaLeu: 14.049 ± 1.418
AlaMet: 4.186 ± 0.629
AlaAsn: 3.264 ± 0.523
AlaPro: 6.74 ± 0.717
AlaGln: 4.683 ± 0.63
AlaArg: 9.791 ± 0.933
AlaSer: 7.663 ± 0.899
AlaThr: 8.656 ± 1.093
AlaVal: 8.443 ± 0.801
AlaTrp: 2.058 ± 0.362
AlaTyr: 2.98 ± 0.384
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.284 ± 0.123
CysCys: 0.071 ± 0.061
CysAsp: 0.497 ± 0.183
CysGlu: 0.213 ± 0.145
CysPhe: 0.284 ± 0.141
CysGly: 0.78 ± 0.254
CysHis: 0.142 ± 0.114
CysIle: 0.071 ± 0.062
CysLys: 0.284 ± 0.15
CysLeu: 0.355 ± 0.186
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.355 ± 0.232
CysGln: 0.355 ± 0.137
CysArg: 0.426 ± 0.23
CysSer: 0.213 ± 0.121
CysThr: 0.213 ± 0.135
CysVal: 0.0 ± 0.0
CysTrp: 0.071 ± 0.088
CysTyr: 0.142 ± 0.097
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.372 ± 0.807
AspCys: 0.284 ± 0.17
AspAsp: 4.257 ± 0.591
AspGlu: 4.967 ± 0.624
AspPhe: 1.916 ± 0.334
AspGly: 6.74 ± 0.661
AspHis: 1.277 ± 0.292
AspIle: 2.483 ± 0.44
AspLys: 1.49 ± 0.309
AspLeu: 6.244 ± 0.779
AspMet: 1.206 ± 0.315
AspAsn: 1.135 ± 0.318
AspPro: 4.399 ± 0.566
AspGln: 1.348 ± 0.3
AspArg: 4.115 ± 0.578
AspSer: 2.98 ± 0.644
AspThr: 2.98 ± 0.415
AspVal: 5.038 ± 0.458
AspTrp: 0.639 ± 0.186
AspTyr: 0.639 ± 0.18
AspXaa: 0.0 ± 0.0
Glu
GluAla: 10.501 ± 1.003
GluCys: 0.497 ± 0.204
GluAsp: 3.406 ± 0.554
GluGlu: 3.122 ± 0.5
GluPhe: 0.922 ± 0.224
GluGly: 5.25 ± 0.598
GluHis: 1.135 ± 0.291
GluIle: 1.277 ± 0.279
GluLys: 1.277 ± 0.438
GluLeu: 7.379 ± 0.722
GluMet: 0.78 ± 0.228
GluAsn: 0.993 ± 0.293
GluPro: 2.483 ± 0.526
GluGln: 2.2 ± 0.389
GluArg: 5.747 ± 0.541
GluSer: 2.838 ± 0.43
GluThr: 4.186 ± 0.554
GluVal: 5.25 ± 0.655
GluTrp: 1.277 ± 0.268
GluTyr: 1.632 ± 0.32
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.69 ± 0.584
PheCys: 0.213 ± 0.119
PheAsp: 1.987 ± 0.35
PheGlu: 1.49 ± 0.371
PhePhe: 0.78 ± 0.196
PheGly: 2.2 ± 0.437
PheHis: 0.497 ± 0.19
PheIle: 0.851 ± 0.218
PheLys: 0.568 ± 0.217
PheLeu: 2.2 ± 0.396
PheMet: 0.78 ± 0.193
PheAsn: 0.213 ± 0.113
PhePro: 1.703 ± 0.362
PheGln: 1.277 ± 0.271
PheArg: 2.341 ± 0.485
PheSer: 2.058 ± 0.5
PheThr: 2.909 ± 0.427
PheVal: 2.058 ± 0.309
PheTrp: 0.497 ± 0.188
PheTyr: 1.064 ± 0.281
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.74 ± 0.82
GlyCys: 0.497 ± 0.185
GlyAsp: 4.754 ± 0.66
GlyGlu: 3.902 ± 0.461
GlyPhe: 3.69 ± 0.558
GlyGly: 5.818 ± 0.491
GlyHis: 1.774 ± 0.382
GlyIle: 4.115 ± 0.659
GlyLys: 3.69 ± 0.498
GlyLeu: 8.089 ± 0.726
GlyMet: 0.71 ± 0.295
GlyAsn: 1.561 ± 0.283
GlyPro: 4.328 ± 0.725
GlyGln: 2.483 ± 0.38
GlyArg: 4.044 ± 0.556
GlySer: 4.541 ± 0.511
GlyThr: 4.896 ± 0.71
GlyVal: 6.599 ± 0.595
GlyTrp: 2.058 ± 0.291
GlyTyr: 2.625 ± 0.403
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.129 ± 0.315
HisCys: 0.0 ± 0.0
HisAsp: 1.348 ± 0.273
HisGlu: 2.058 ± 0.43
HisPhe: 0.568 ± 0.239
HisGly: 1.703 ± 0.335
HisHis: 0.71 ± 0.214
HisIle: 0.71 ± 0.218
HisLys: 0.568 ± 0.314
HisLeu: 1.987 ± 0.34
HisMet: 0.355 ± 0.137
HisAsn: 0.497 ± 0.22
HisPro: 1.348 ± 0.31
HisGln: 0.355 ± 0.132
HisArg: 0.78 ± 0.281
HisSer: 1.135 ± 0.248
HisThr: 1.135 ± 0.221
HisVal: 0.993 ± 0.267
HisTrp: 0.355 ± 0.15
HisTyr: 0.497 ± 0.155
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.457 ± 0.677
IleCys: 0.142 ± 0.092
IleAsp: 3.548 ± 0.559
IleGlu: 4.115 ± 0.499
IlePhe: 1.206 ± 0.326
IleGly: 3.122 ± 0.605
IleHis: 0.851 ± 0.206
IleIle: 1.277 ± 0.441
IleLys: 1.703 ± 0.377
IleLeu: 3.193 ± 0.521
IleMet: 0.568 ± 0.188
IleAsn: 1.064 ± 0.272
IlePro: 2.27 ± 0.434
IleGln: 0.922 ± 0.234
IleArg: 3.335 ± 0.519
IleSer: 1.277 ± 0.419
IleThr: 3.051 ± 0.597
IleVal: 2.554 ± 0.465
IleTrp: 0.355 ± 0.153
IleTyr: 0.568 ± 0.199
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.747 ± 1.238
LysCys: 0.213 ± 0.128
LysAsp: 1.49 ± 0.3
LysGlu: 1.987 ± 0.35
LysPhe: 0.497 ± 0.201
LysGly: 3.264 ± 0.556
LysHis: 0.568 ± 0.189
LysIle: 0.568 ± 0.179
LysLys: 1.49 ± 0.367
LysLeu: 3.406 ± 0.477
LysMet: 0.851 ± 0.25
LysAsn: 0.639 ± 0.189
LysPro: 2.767 ± 0.401
LysGln: 0.426 ± 0.171
LysArg: 3.051 ± 0.503
LysSer: 1.845 ± 0.366
LysThr: 1.845 ± 0.382
LysVal: 2.98 ± 0.477
LysTrp: 0.355 ± 0.154
LysTyr: 0.71 ± 0.232
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.623 ± 1.108
LeuCys: 0.71 ± 0.241
LeuAsp: 6.244 ± 0.622
LeuGlu: 3.76 ± 0.565
LeuPhe: 2.341 ± 0.523
LeuGly: 6.244 ± 0.777
LeuHis: 0.993 ± 0.292
LeuIle: 4.47 ± 0.823
LeuLys: 3.548 ± 0.545
LeuLeu: 7.947 ± 0.895
LeuMet: 1.064 ± 0.343
LeuAsn: 2.412 ± 0.519
LeuPro: 6.457 ± 0.99
LeuGln: 4.186 ± 0.643
LeuArg: 7.45 ± 0.758
LeuSer: 6.031 ± 0.614
LeuThr: 6.74 ± 0.562
LeuVal: 6.953 ± 0.855
LeuTrp: 1.419 ± 0.271
LeuTyr: 1.774 ± 0.367
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.98 ± 0.439
MetCys: 0.071 ± 0.067
MetAsp: 0.639 ± 0.181
MetGlu: 1.348 ± 0.338
MetPhe: 0.355 ± 0.186
MetGly: 0.993 ± 0.235
MetHis: 0.284 ± 0.161
MetIle: 0.142 ± 0.09
MetLys: 0.071 ± 0.062
MetLeu: 1.348 ± 0.371
MetMet: 0.071 ± 0.069
MetAsn: 0.639 ± 0.233
MetPro: 0.851 ± 0.296
MetGln: 0.568 ± 0.195
MetArg: 1.348 ± 0.331
MetSer: 1.632 ± 0.273
MetThr: 2.696 ± 0.424
MetVal: 0.78 ± 0.172
MetTrp: 0.0 ± 0.0
MetTyr: 0.071 ± 0.064
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.122 ± 0.551
AsnCys: 0.0 ± 0.0
AsnAsp: 0.993 ± 0.302
AsnGlu: 0.922 ± 0.308
AsnPhe: 0.497 ± 0.209
AsnGly: 2.767 ± 0.446
AsnHis: 0.284 ± 0.141
AsnIle: 0.851 ± 0.266
AsnLys: 0.426 ± 0.177
AsnLeu: 2.058 ± 0.349
AsnMet: 0.426 ± 0.171
AsnAsn: 0.568 ± 0.255
AsnPro: 1.49 ± 0.316
AsnGln: 0.284 ± 0.139
AsnArg: 1.632 ± 0.368
AsnSer: 0.639 ± 0.221
AsnThr: 1.419 ± 0.332
AsnVal: 1.845 ± 0.427
AsnTrp: 0.355 ± 0.177
AsnTyr: 0.568 ± 0.189
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.018 ± 0.872
ProCys: 0.284 ± 0.141
ProAsp: 4.044 ± 0.504
ProGlu: 4.683 ± 0.493
ProPhe: 1.845 ± 0.426
ProGly: 4.683 ± 0.577
ProHis: 0.851 ± 0.246
ProIle: 2.341 ± 0.363
ProLys: 1.561 ± 0.303
ProLeu: 5.392 ± 0.792
ProMet: 0.922 ± 0.221
ProAsn: 1.277 ± 0.265
ProPro: 3.122 ± 0.528
ProGln: 1.774 ± 0.291
ProArg: 3.619 ± 0.619
ProSer: 4.612 ± 0.901
ProThr: 3.335 ± 0.525
ProVal: 3.76 ± 0.531
ProTrp: 0.497 ± 0.171
ProTyr: 1.135 ± 0.268
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.754 ± 0.689
GlnCys: 0.213 ± 0.13
GlnAsp: 1.206 ± 0.297
GlnGlu: 1.561 ± 0.283
GlnPhe: 0.355 ± 0.137
GlnGly: 2.341 ± 0.494
GlnHis: 0.639 ± 0.202
GlnIle: 0.993 ± 0.218
GlnLys: 1.135 ± 0.278
GlnLeu: 3.122 ± 0.596
GlnMet: 0.497 ± 0.156
GlnAsn: 0.426 ± 0.178
GlnPro: 1.419 ± 0.321
GlnGln: 0.851 ± 0.322
GlnArg: 3.051 ± 0.439
GlnSer: 1.277 ± 0.251
GlnThr: 1.845 ± 0.292
GlnVal: 1.916 ± 0.315
GlnTrp: 0.213 ± 0.134
GlnTyr: 0.993 ± 0.237
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.508 ± 1.037
ArgCys: 0.213 ± 0.117
ArgAsp: 5.392 ± 0.724
ArgGlu: 4.967 ± 0.623
ArgPhe: 2.27 ± 0.446
ArgGly: 3.831 ± 0.649
ArgHis: 1.277 ± 0.324
ArgIle: 3.477 ± 0.495
ArgLys: 3.69 ± 0.52
ArgLeu: 6.953 ± 0.842
ArgMet: 1.703 ± 0.362
ArgAsn: 1.348 ± 0.267
ArgPro: 3.902 ± 0.603
ArgGln: 2.129 ± 0.4
ArgArg: 5.96 ± 0.817
ArgSer: 4.399 ± 0.595
ArgThr: 4.115 ± 0.657
ArgVal: 4.328 ± 0.553
ArgTrp: 1.206 ± 0.308
ArgTyr: 1.987 ± 0.379
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.011 ± 1.009
SerCys: 0.355 ± 0.139
SerAsp: 3.548 ± 0.534
SerGlu: 2.767 ± 0.476
SerPhe: 2.058 ± 0.376
SerGly: 5.605 ± 0.67
SerHis: 1.703 ± 0.358
SerIle: 2.767 ± 0.405
SerLys: 1.774 ± 0.362
SerLeu: 4.967 ± 0.66
SerMet: 0.993 ± 0.377
SerAsn: 0.78 ± 0.28
SerPro: 3.477 ± 0.813
SerGln: 1.064 ± 0.25
SerArg: 4.328 ± 0.536
SerSer: 3.264 ± 0.384
SerThr: 5.747 ± 0.655
SerVal: 2.341 ± 0.386
SerTrp: 1.135 ± 0.309
SerTyr: 1.703 ± 0.328
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.869 ± 0.892
ThrCys: 0.213 ± 0.13
ThrAsp: 3.76 ± 0.473
ThrGlu: 3.477 ± 0.596
ThrPhe: 2.767 ± 0.538
ThrGly: 5.676 ± 0.882
ThrHis: 0.78 ± 0.207
ThrIle: 2.909 ± 0.485
ThrLys: 1.703 ± 0.336
ThrLeu: 5.96 ± 0.733
ThrMet: 0.639 ± 0.217
ThrAsn: 1.064 ± 0.262
ThrPro: 5.038 ± 0.786
ThrGln: 0.78 ± 0.232
ThrArg: 4.825 ± 0.663
ThrSer: 6.102 ± 0.933
ThrThr: 6.74 ± 0.895
ThrVal: 6.528 ± 0.758
ThrTrp: 0.639 ± 0.24
ThrTyr: 2.341 ± 0.433
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.933 ± 0.807
ValCys: 0.497 ± 0.235
ValAsp: 4.683 ± 0.635
ValGlu: 4.115 ± 0.432
ValPhe: 2.27 ± 0.471
ValGly: 3.548 ± 0.494
ValHis: 1.703 ± 0.375
ValIle: 3.477 ± 0.404
ValLys: 2.696 ± 0.529
ValLeu: 7.166 ± 0.876
ValMet: 0.78 ± 0.215
ValAsn: 1.632 ± 0.285
ValPro: 3.548 ± 0.464
ValGln: 2.341 ± 0.504
ValArg: 4.399 ± 0.589
ValSer: 4.47 ± 0.698
ValThr: 4.967 ± 0.471
ValVal: 4.683 ± 0.626
ValTrp: 0.922 ± 0.225
ValTyr: 1.561 ± 0.29
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.632 ± 0.386
TrpCys: 0.0 ± 0.0
TrpAsp: 0.78 ± 0.213
TrpGlu: 1.49 ± 0.407
TrpPhe: 0.0 ± 0.0
TrpGly: 1.561 ± 0.287
TrpHis: 0.639 ± 0.214
TrpIle: 1.206 ± 0.281
TrpLys: 0.142 ± 0.104
TrpLeu: 0.993 ± 0.223
TrpMet: 0.071 ± 0.071
TrpAsn: 0.639 ± 0.204
TrpPro: 0.568 ± 0.237
TrpGln: 0.568 ± 0.202
TrpArg: 1.206 ± 0.359
TrpSer: 1.064 ± 0.269
TrpThr: 0.78 ± 0.243
TrpVal: 0.497 ± 0.179
TrpTrp: 0.071 ± 0.068
TrpTyr: 0.355 ± 0.136
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.264 ± 0.503
TyrCys: 0.0 ± 0.0
TyrAsp: 1.916 ± 0.399
TyrGlu: 1.277 ± 0.34
TyrPhe: 0.851 ± 0.323
TyrGly: 2.838 ± 0.491
TyrHis: 0.355 ± 0.167
TyrIle: 0.497 ± 0.185
TyrLys: 0.71 ± 0.253
TyrLeu: 1.561 ± 0.252
TyrMet: 0.213 ± 0.123
TyrAsn: 0.993 ± 0.344
TyrPro: 1.49 ± 0.348
TyrGln: 0.213 ± 0.118
TyrArg: 1.277 ± 0.268
TyrSer: 1.348 ± 0.334
TyrThr: 2.554 ± 0.441
TyrVal: 1.845 ± 0.39
TyrTrp: 0.142 ± 0.091
TyrTyr: 0.426 ± 0.176
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (14095 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
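Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is only a sketch under assumptions; the page does not describe the exact procedure, so the use of the standard deviation, the function names, the fixed seed, and the one-letter toy sequences are all illustrative choices.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        for a, b in zip(seq, seq[1:]):
            total += 1
            if a + b == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)

toy_proteins = ["MAAL", "AAK", "MKV"]
error_aa = bootstrap_error(toy_proteins, "AA")
error_ww = bootstrap_error(toy_proteins, "WW")  # absent pair: no variation
```

A dipeptide that never occurs has a frequency of zero in every resample, which matches the `0.0 ± 0.0` entries in the table.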

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski