Amino acid dipeptide frequency for Rhodococcus phage ReqiPine5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
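The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_permille`, the choice to count overlapping pairs within each protein, and the mapping of every non-standard letter to `X` (Xaa) are assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the standard 20 is treated as X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: overlapping pairs are counted, and never across two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_EXPECTED = 1000.0 / 400

# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAG", "AAGA"]
freqs = dipeptide_permille(toy)
```

For the toy input, six dipeptides are counted in total, so the returned frequencies sum to 1000‰.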

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
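As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰ (1/400). The helper name `describe` and the simple over/under labels are illustrative assumptions; the two input values are the AlaAla and CysTyr entries of the table.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 0.25% expressed in per mille


def describe(permille):
    """Return (fraction, fold change vs. uniform expectation, label)."""
    fraction = permille * 1e-3           # per mille -> fraction
    fold = permille / UNIFORM_PERMILLE   # above 1 means more frequent than expected
    if fold > 1:
        label = "over"
    elif fold < 1:
        label = "under"
    else:
        label = "as expected"
    return fraction, fold, label


ala_ala = describe(14.367)  # AlaAla
cys_tyr = describe(0.055)   # CysTyr
```

Under this uniform baseline, AlaAla comes out as overrepresented and CysTyr as underrepresented.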

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency ± error in ‰.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 14.367 ± 1.032 | 0.877 ± 0.251 | 7.677 ± 0.866 | 7.293 ± 0.811 | 2.522 ± 0.366 | 8.555 ± 0.73 | 2.139 ± 0.341 | 6.361 ± 0.643 | 4.222 ± 0.486 | 9.432 ± 0.598 | 3.29 ± 0.477 | 3.126 ± 0.49 | 5.922 ± 0.675 | 3.071 ± 0.445 | 5.868 ± 0.592 | 5.813 ± 0.52 | 7.184 ± 0.641 | 9.706 ± 0.743 | 2.084 ± 0.454 | 2.577 ± 0.443 | 0.0 ± 0.0
Cys | 1.316 ± 0.266 | 0.274 ± 0.135 | 0.768 ± 0.223 | 0.274 ± 0.131 | 0.274 ± 0.142 | 0.932 ± 0.248 | 0.274 ± 0.121 | 0.165 ± 0.094 | 0.11 ± 0.103 | 0.548 ± 0.186 | 0.0 ± 0.0 | 0.165 ± 0.096 | 0.658 ± 0.215 | 0.274 ± 0.111 | 0.658 ± 0.241 | 0.768 ± 0.195 | 0.548 ± 0.156 | 0.603 ± 0.198 | 0.11 ± 0.086 | 0.055 ± 0.054 | 0.0 ± 0.0
Asp | 7.129 ± 0.711 | 0.494 ± 0.144 | 4.497 ± 0.589 | 4.88 ± 0.605 | 1.097 ± 0.267 | 6.032 ± 0.518 | 1.7 ± 0.337 | 2.852 ± 0.393 | 1.919 ± 0.321 | 5.703 ± 0.621 | 1.097 ± 0.233 | 1.426 ± 0.284 | 5.648 ± 0.589 | 2.468 ± 0.339 | 4.606 ± 0.631 | 3.126 ± 0.36 | 4.497 ± 0.515 | 4.935 ± 0.505 | 1.371 ± 0.288 | 1.7 ± 0.295 | 0.0 ± 0.0
Glu | 7.293 ± 0.873 | 0.603 ± 0.183 | 4.497 ± 0.544 | 3.729 ± 0.52 | 1.919 ± 0.307 | 3.729 ± 0.512 | 1.316 ± 0.247 | 2.961 ± 0.493 | 2.358 ± 0.412 | 5.155 ± 0.529 | 1.316 ± 0.257 | 1.919 ± 0.29 | 2.632 ± 0.411 | 1.81 ± 0.342 | 3.948 ± 0.428 | 2.577 ± 0.359 | 3.839 ± 0.369 | 5.045 ± 0.523 | 1.316 ± 0.261 | 1.316 ± 0.284 | 0.0 ± 0.0
Phe | 3.455 ± 0.428 | 0.055 ± 0.051 | 2.139 ± 0.29 | 1.59 ± 0.327 | 0.987 ± 0.275 | 3.235 ± 0.451 | 0.603 ± 0.167 | 1.206 ± 0.279 | 0.494 ± 0.175 | 2.632 ± 0.377 | 0.494 ± 0.169 | 0.823 ± 0.277 | 1.919 ± 0.336 | 1.206 ± 0.262 | 1.59 ± 0.29 | 1.755 ± 0.344 | 2.632 ± 0.379 | 2.522 ± 0.323 | 0.11 ± 0.086 | 0.494 ± 0.162 | 0.0 ± 0.0
Gly | 8.006 ± 1.064 | 0.877 ± 0.264 | 5.209 ± 0.566 | 3.784 ± 0.434 | 3.345 ± 0.514 | 8.061 ± 1.056 | 1.535 ± 0.305 | 4.168 ± 0.925 | 3.784 ± 0.471 | 7.567 ± 0.723 | 2.303 ± 0.357 | 2.248 ± 0.392 | 4.222 ± 0.436 | 2.961 ± 0.357 | 6.142 ± 0.591 | 4.332 ± 0.447 | 6.087 ± 0.523 | 7.019 ± 0.66 | 2.193 ± 0.359 | 2.852 ± 0.419 | 0.0 ± 0.0
His | 1.59 ± 0.32 | 0.274 ± 0.111 | 0.823 ± 0.244 | 1.261 ± 0.331 | 0.932 ± 0.225 | 1.59 ± 0.328 | 0.494 ± 0.186 | 1.042 ± 0.214 | 0.165 ± 0.093 | 2.303 ± 0.341 | 0.548 ± 0.155 | 0.329 ± 0.118 | 1.097 ± 0.29 | 0.329 ± 0.147 | 1.81 ± 0.409 | 0.823 ± 0.176 | 1.152 ± 0.248 | 1.7 ± 0.329 | 0.548 ± 0.125 | 0.768 ± 0.195 | 0.0 ± 0.0
Ile | 5.593 ± 0.563 | 0.11 ± 0.068 | 4.826 ± 0.387 | 3.784 ± 0.411 | 1.206 ± 0.3 | 4.99 ± 0.859 | 1.371 ± 0.323 | 1.755 ± 0.362 | 1.152 ± 0.286 | 3.839 ± 0.445 | 0.548 ± 0.155 | 1.042 ± 0.247 | 2.413 ± 0.343 | 1.371 ± 0.271 | 3.839 ± 0.467 | 2.413 ± 0.332 | 3.51 ± 0.473 | 4.332 ± 0.503 | 0.823 ± 0.167 | 0.932 ± 0.252 | 0.0 ± 0.0
Lys | 4.332 ± 0.579 | 0.274 ± 0.128 | 1.7 ± 0.328 | 1.535 ± 0.338 | 0.987 ± 0.214 | 3.235 ± 0.475 | 0.439 ± 0.156 | 2.193 ± 0.357 | 1.097 ± 0.247 | 2.742 ± 0.442 | 0.877 ± 0.272 | 1.097 ± 0.252 | 1.426 ± 0.306 | 0.384 ± 0.162 | 1.59 ± 0.302 | 2.029 ± 0.306 | 3.126 ± 0.453 | 2.906 ± 0.334 | 0.713 ± 0.207 | 1.097 ± 0.227 | 0.0 ± 0.0
Leu | 8.39 ± 0.697 | 0.877 ± 0.187 | 6.306 ± 0.385 | 4.113 ± 0.443 | 2.852 ± 0.375 | 8.938 ± 1.058 | 1.152 ± 0.277 | 3.4 ± 0.409 | 2.522 ± 0.358 | 5.429 ± 0.526 | 1.152 ± 0.229 | 2.413 ± 0.335 | 5.1 ± 0.709 | 1.755 ± 0.278 | 5.264 ± 0.606 | 4.442 ± 0.464 | 5.758 ± 0.622 | 6.8 ± 0.616 | 0.494 ± 0.188 | 1.755 ± 0.286 | 0.0 ± 0.0
Met | 2.742 ± 0.369 | 0.11 ± 0.086 | 1.316 ± 0.293 | 1.316 ± 0.29 | 0.713 ± 0.219 | 1.919 ± 0.296 | 0.274 ± 0.138 | 0.932 ± 0.23 | 0.603 ± 0.191 | 1.481 ± 0.282 | 0.603 ± 0.195 | 0.548 ± 0.166 | 1.645 ± 0.329 | 0.439 ± 0.154 | 1.042 ± 0.202 | 2.029 ± 0.267 | 2.687 ± 0.406 | 1.316 ± 0.223 | 0.329 ± 0.116 | 0.823 ± 0.171 | 0.0 ± 0.0
Asn | 3.071 ± 0.404 | 0.329 ± 0.123 | 2.084 ± 0.261 | 1.206 ± 0.281 | 0.658 ± 0.173 | 2.303 ± 0.403 | 0.713 ± 0.196 | 1.481 ± 0.247 | 1.316 ± 0.392 | 2.084 ± 0.314 | 0.494 ± 0.143 | 1.206 ± 0.295 | 1.919 ± 0.329 | 0.603 ± 0.181 | 1.535 ± 0.315 | 1.097 ± 0.215 | 1.755 ± 0.274 | 2.358 ± 0.544 | 0.494 ± 0.133 | 0.603 ± 0.196 | 0.0 ± 0.0
Pro | 6.087 ± 0.621 | 0.274 ± 0.123 | 4.387 ± 0.561 | 5.045 ± 0.576 | 1.645 ± 0.332 | 5.429 ± 0.494 | 0.932 ± 0.234 | 3.016 ± 0.409 | 1.7 ± 0.24 | 3.51 ± 0.41 | 1.59 ± 0.307 | 1.59 ± 0.311 | 2.906 ± 0.426 | 1.261 ± 0.266 | 2.577 ± 0.316 | 3.948 ± 0.466 | 4.99 ± 0.586 | 4.222 ± 0.45 | 0.713 ± 0.177 | 1.097 ± 0.262 | 0.0 ± 0.0
Gln | 3.071 ± 0.5 | 0.439 ± 0.13 | 1.426 ± 0.366 | 1.481 ± 0.293 | 1.535 ± 0.264 | 1.974 ± 0.315 | 0.439 ± 0.146 | 1.426 ± 0.25 | 0.713 ± 0.226 | 2.797 ± 0.37 | 0.932 ± 0.218 | 0.823 ± 0.282 | 1.042 ± 0.264 | 0.548 ± 0.152 | 2.084 ± 0.282 | 1.316 ± 0.216 | 2.358 ± 0.342 | 2.248 ± 0.31 | 0.658 ± 0.228 | 0.658 ± 0.213 | 0.0 ± 0.0
Arg | 6.69 ± 0.674 | 0.768 ± 0.187 | 4.222 ± 0.673 | 3.016 ± 0.374 | 1.59 ± 0.286 | 4.551 ± 0.463 | 1.316 ± 0.289 | 2.687 ± 0.444 | 2.139 ± 0.25 | 5.155 ± 0.517 | 1.755 ± 0.322 | 1.481 ± 0.28 | 3.784 ± 0.64 | 1.7 ± 0.314 | 5.484 ± 0.726 | 3.839 ± 0.571 | 5.155 ± 0.633 | 5.538 ± 0.535 | 1.755 ± 0.359 | 1.81 ± 0.366 | 0.0 ± 0.0
Ser | 6.251 ± 0.55 | 0.548 ± 0.134 | 2.139 ± 0.376 | 2.797 ± 0.344 | 1.481 ± 0.318 | 5.813 ± 0.554 | 0.658 ± 0.204 | 2.797 ± 0.445 | 2.029 ± 0.306 | 3.948 ± 0.57 | 1.316 ± 0.287 | 1.864 ± 0.312 | 2.906 ± 0.339 | 1.755 ± 0.275 | 2.961 ± 0.535 | 2.797 ± 0.389 | 3.071 ± 0.443 | 4.497 ± 0.455 | 1.645 ± 0.295 | 1.481 ± 0.344 | 0.0 ± 0.0
Thr | 9.103 ± 0.717 | 0.713 ± 0.275 | 4.277 ± 0.526 | 4.387 ± 0.497 | 2.358 ± 0.302 | 5.868 ± 0.563 | 1.097 ± 0.283 | 4.168 ± 0.385 | 2.632 ± 0.401 | 4.551 ± 0.534 | 1.316 ± 0.284 | 1.59 ± 0.353 | 5.264 ± 0.621 | 1.59 ± 0.437 | 4.551 ± 0.569 | 3.619 ± 0.39 | 5.593 ± 0.67 | 6.855 ± 0.501 | 1.645 ± 0.286 | 2.248 ± 0.458 | 0.0 ± 0.0
Val | 8.609 ± 0.841 | 0.329 ± 0.126 | 5.648 ± 0.532 | 4.826 ± 0.515 | 2.522 ± 0.329 | 6.526 ± 0.508 | 2.029 ± 0.297 | 5.155 ± 0.571 | 3.345 ± 0.411 | 6.361 ± 0.694 | 2.522 ± 0.352 | 2.358 ± 0.366 | 4.497 ± 0.446 | 2.742 ± 0.378 | 5.264 ± 0.562 | 4.113 ± 0.615 | 6.197 ± 0.419 | 7.184 ± 0.71 | 1.919 ± 0.516 | 1.755 ± 0.35 | 0.0 ± 0.0
Trp | 2.248 ± 0.343 | 0.219 ± 0.114 | 1.481 ± 0.429 | 1.864 ± 0.295 | 0.658 ± 0.167 | 1.371 ± 0.275 | 0.548 ± 0.15 | 1.261 ± 0.285 | 0.603 ± 0.175 | 1.206 ± 0.253 | 0.219 ± 0.121 | 0.603 ± 0.236 | 0.658 ± 0.208 | 0.658 ± 0.186 | 1.261 ± 0.262 | 0.658 ± 0.212 | 1.81 ± 0.261 | 1.645 ± 0.319 | 0.274 ± 0.129 | 0.384 ± 0.114 | 0.0 ± 0.0
Tyr | 2.742 ± 0.317 | 0.329 ± 0.132 | 1.755 ± 0.369 | 1.097 ± 0.245 | 0.658 ± 0.204 | 1.645 ± 0.332 | 0.548 ± 0.157 | 0.823 ± 0.21 | 0.877 ± 0.197 | 2.522 ± 0.402 | 0.439 ± 0.113 | 0.658 ± 0.182 | 1.261 ± 0.324 | 1.042 ± 0.212 | 2.358 ± 0.443 | 1.261 ± 0.357 | 1.481 ± 0.333 | 2.468 ± 0.419 | 0.439 ± 0.149 | 0.329 ± 0.128 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 84 proteins (18237 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
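A protein-level bootstrap can be sketched as follows. The note only states that bootstrapping (x100) was done at the protein level; resampling whole proteins with replacement and reporting the sample standard deviation of the replicates is an assumption about how such an error is typically obtained, and the function names `pair_permille` and `bootstrap_error` as well as the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    return statistics.stdev(replicates)


# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MAAG", "AAGA", "GGAA", "MKLV"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins are in the set.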

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski