Amino acid dipepetide frequency for Ralstonia phage Gerry

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
21.707AlaAla: 21.707 ± 1.38
0.704AlaCys: 0.704 ± 0.215
6.442AlaAsp: 6.442 ± 0.551
8.986AlaGlu: 8.986 ± 0.97
3.952AlaPhe: 3.952 ± 0.358
12.451AlaGly: 12.451 ± 1.475
2.598AlaHis: 2.598 ± 0.4
4.98AlaIle: 4.98 ± 0.484
5.738AlaLys: 5.738 ± 0.762
10.069AlaLeu: 10.069 ± 0.69
3.898AlaMet: 3.898 ± 0.444
3.952AlaAsn: 3.952 ± 0.556
7.579AlaPro: 7.579 ± 1.164
6.171AlaGln: 6.171 ± 0.531
8.12AlaArg: 8.12 ± 0.654
7.146AlaSer: 7.146 ± 0.649
5.792AlaThr: 5.792 ± 0.563
7.687AlaVal: 7.687 ± 0.581
1.678AlaTrp: 1.678 ± 0.292
3.843AlaTyr: 3.843 ± 0.483
0.0AlaXaa: 0.0 ± 0.0
Cys
1.137CysAla: 1.137 ± 0.368
0.217CysCys: 0.217 ± 0.112
0.271CysAsp: 0.271 ± 0.118
0.325CysGlu: 0.325 ± 0.147
0.217CysPhe: 0.217 ± 0.143
1.029CysGly: 1.029 ± 0.54
0.271CysHis: 0.271 ± 0.157
0.433CysIle: 0.433 ± 0.184
0.162CysLys: 0.162 ± 0.099
0.758CysLeu: 0.758 ± 0.26
0.271CysMet: 0.271 ± 0.128
0.271CysAsn: 0.271 ± 0.137
0.271CysPro: 0.271 ± 0.133
0.108CysGln: 0.108 ± 0.074
0.704CysArg: 0.704 ± 0.319
0.433CysSer: 0.433 ± 0.196
0.379CysThr: 0.379 ± 0.167
0.325CysVal: 0.325 ± 0.141
0.162CysTrp: 0.162 ± 0.095
0.271CysTyr: 0.271 ± 0.204
0.0CysXaa: 0.0 ± 0.0
Asp
8.607AspAla: 8.607 ± 0.716
0.541AspCys: 0.541 ± 0.247
3.898AspAsp: 3.898 ± 0.659
3.952AspGlu: 3.952 ± 0.388
2.111AspPhe: 2.111 ± 0.371
7.633AspGly: 7.633 ± 0.797
1.029AspHis: 1.029 ± 0.18
2.869AspIle: 2.869 ± 0.396
2.274AspLys: 2.274 ± 0.342
3.898AspLeu: 3.898 ± 0.472
1.516AspMet: 1.516 ± 0.243
1.732AspAsn: 1.732 ± 0.304
2.707AspPro: 2.707 ± 0.347
1.462AspGln: 1.462 ± 0.252
4.439AspArg: 4.439 ± 0.481
2.219AspSer: 2.219 ± 0.444
2.707AspThr: 2.707 ± 0.289
3.627AspVal: 3.627 ± 0.422
0.92AspTrp: 0.92 ± 0.229
1.786AspTyr: 1.786 ± 0.274
0.0AspXaa: 0.0 ± 0.0
Glu
8.715GluAla: 8.715 ± 1.099
0.487GluCys: 0.487 ± 0.156
3.519GluAsp: 3.519 ± 0.442
4.006GluGlu: 4.006 ± 0.539
2.707GluPhe: 2.707 ± 0.409
4.764GluGly: 4.764 ± 0.739
1.299GluHis: 1.299 ± 0.244
3.194GluIle: 3.194 ± 0.4
3.248GluLys: 3.248 ± 0.621
6.063GluLeu: 6.063 ± 0.558
1.895GluMet: 1.895 ± 0.371
2.328GluAsn: 2.328 ± 0.329
2.761GluPro: 2.761 ± 0.453
3.573GluGln: 3.573 ± 0.559
6.55GluArg: 6.55 ± 0.816
2.869GluSer: 2.869 ± 0.353
2.219GluThr: 2.219 ± 0.315
2.382GluVal: 2.382 ± 0.366
1.137GluTrp: 1.137 ± 0.259
1.949GluTyr: 1.949 ± 0.404
0.0GluXaa: 0.0 ± 0.0
Phe
3.356PheAla: 3.356 ± 0.353
0.325PheCys: 0.325 ± 0.135
3.031PheAsp: 3.031 ± 0.366
1.462PheGlu: 1.462 ± 0.268
0.866PhePhe: 0.866 ± 0.233
2.923PheGly: 2.923 ± 0.393
0.541PheHis: 0.541 ± 0.156
1.462PheIle: 1.462 ± 0.333
1.137PheLys: 1.137 ± 0.24
1.949PheLeu: 1.949 ± 0.263
0.974PheMet: 0.974 ± 0.185
1.137PheAsn: 1.137 ± 0.24
0.92PhePro: 0.92 ± 0.233
0.92PheGln: 0.92 ± 0.168
2.003PheArg: 2.003 ± 0.376
2.057PheSer: 2.057 ± 0.328
1.299PheThr: 1.299 ± 0.263
3.086PheVal: 3.086 ± 0.352
0.271PheTrp: 0.271 ± 0.108
1.191PheTyr: 1.191 ± 0.201
0.0PheXaa: 0.0 ± 0.0
Gly
11.476GlyAla: 11.476 ± 1.003
1.137GlyCys: 1.137 ± 0.559
5.901GlyAsp: 5.901 ± 0.569
6.767GlyGlu: 6.767 ± 0.761
2.49GlyPhe: 2.49 ± 0.331
8.445GlyGly: 8.445 ± 1.066
1.299GlyHis: 1.299 ± 0.288
4.222GlyIle: 4.222 ± 0.997
5.089GlyLys: 5.089 ± 0.536
5.034GlyLeu: 5.034 ± 0.472
2.057GlyMet: 2.057 ± 0.38
3.302GlyAsn: 3.302 ± 0.514
1.949GlyPro: 1.949 ± 0.288
2.977GlyGln: 2.977 ± 0.415
5.522GlyArg: 5.522 ± 0.559
4.06GlySer: 4.06 ± 0.714
5.846GlyThr: 5.846 ± 0.705
5.251GlyVal: 5.251 ± 0.48
1.516GlyTrp: 1.516 ± 0.394
2.165GlyTyr: 2.165 ± 0.422
0.0GlyXaa: 0.0 ± 0.0
His
2.598HisAla: 2.598 ± 0.39
0.217HisCys: 0.217 ± 0.114
1.462HisAsp: 1.462 ± 0.302
1.57HisGlu: 1.57 ± 0.268
0.92HisPhe: 0.92 ± 0.22
0.92HisGly: 0.92 ± 0.191
0.487HisHis: 0.487 ± 0.188
0.758HisIle: 0.758 ± 0.199
0.487HisLys: 0.487 ± 0.154
0.866HisLeu: 0.866 ± 0.179
0.379HisMet: 0.379 ± 0.133
0.704HisAsn: 0.704 ± 0.188
0.541HisPro: 0.541 ± 0.149
0.595HisGln: 0.595 ± 0.136
1.245HisArg: 1.245 ± 0.267
1.191HisSer: 1.191 ± 0.241
0.92HisThr: 0.92 ± 0.208
1.299HisVal: 1.299 ± 0.236
0.162HisTrp: 0.162 ± 0.09
0.379HisTyr: 0.379 ± 0.144
0.0HisXaa: 0.0 ± 0.0
Ile
5.576IleAla: 5.576 ± 0.621
0.271IleCys: 0.271 ± 0.164
2.761IleAsp: 2.761 ± 0.388
4.764IleGlu: 4.764 ± 0.7
1.083IlePhe: 1.083 ± 0.247
4.06IleGly: 4.06 ± 0.394
0.65IleHis: 0.65 ± 0.204
1.786IleIle: 1.786 ± 0.351
2.707IleLys: 2.707 ± 0.564
2.111IleLeu: 2.111 ± 0.282
0.92IleMet: 0.92 ± 0.249
1.949IleAsn: 1.949 ± 0.314
1.841IlePro: 1.841 ± 0.305
2.057IleGln: 2.057 ± 0.332
2.869IleArg: 2.869 ± 0.457
2.382IleSer: 2.382 ± 0.365
2.815IleThr: 2.815 ± 0.47
3.302IleVal: 3.302 ± 0.477
0.379IleTrp: 0.379 ± 0.133
0.974IleTyr: 0.974 ± 0.206
0.0IleXaa: 0.0 ± 0.0
Lys
7.254LysAla: 7.254 ± 0.73
0.271LysCys: 0.271 ± 0.135
2.49LysAsp: 2.49 ± 0.386
3.031LysGlu: 3.031 ± 0.519
1.245LysPhe: 1.245 ± 0.33
3.14LysGly: 3.14 ± 0.63
0.65LysHis: 0.65 ± 0.194
2.165LysIle: 2.165 ± 0.388
2.165LysLys: 2.165 ± 0.339
3.573LysLeu: 3.573 ± 0.39
1.191LysMet: 1.191 ± 0.224
1.191LysAsn: 1.191 ± 0.243
2.815LysPro: 2.815 ± 0.414
2.761LysGln: 2.761 ± 0.529
3.952LysArg: 3.952 ± 0.622
2.436LysSer: 2.436 ± 0.355
2.923LysThr: 2.923 ± 0.387
3.086LysVal: 3.086 ± 0.354
0.92LysTrp: 0.92 ± 0.206
0.812LysTyr: 0.812 ± 0.157
0.0LysXaa: 0.0 ± 0.0
Leu
7.416LeuAla: 7.416 ± 0.587
0.541LeuCys: 0.541 ± 0.222
5.305LeuAsp: 5.305 ± 0.544
4.114LeuGlu: 4.114 ± 0.513
2.328LeuPhe: 2.328 ± 0.327
5.955LeuGly: 5.955 ± 0.578
1.516LeuHis: 1.516 ± 0.239
3.465LeuIle: 3.465 ± 0.572
4.168LeuLys: 4.168 ± 0.554
4.439LeuLeu: 4.439 ± 0.489
1.732LeuMet: 1.732 ± 0.247
2.815LeuAsn: 2.815 ± 0.337
4.71LeuPro: 4.71 ± 0.821
1.895LeuGln: 1.895 ± 0.265
5.576LeuArg: 5.576 ± 0.656
4.818LeuSer: 4.818 ± 0.549
4.439LeuThr: 4.439 ± 0.489
4.114LeuVal: 4.114 ± 0.648
0.65LeuTrp: 0.65 ± 0.22
1.516LeuTyr: 1.516 ± 0.265
0.0LeuXaa: 0.0 ± 0.0
Met
2.436MetAla: 2.436 ± 0.388
0.108MetCys: 0.108 ± 0.071
1.191MetAsp: 1.191 ± 0.249
1.245MetGlu: 1.245 ± 0.26
0.812MetPhe: 0.812 ± 0.205
2.111MetGly: 2.111 ± 0.353
0.812MetHis: 0.812 ± 0.259
0.595MetIle: 0.595 ± 0.148
1.841MetLys: 1.841 ± 0.326
2.057MetLeu: 2.057 ± 0.269
0.866MetMet: 0.866 ± 0.152
1.083MetAsn: 1.083 ± 0.243
1.624MetPro: 1.624 ± 0.314
1.407MetGln: 1.407 ± 0.282
2.111MetArg: 2.111 ± 0.397
1.895MetSer: 1.895 ± 0.363
2.057MetThr: 2.057 ± 0.311
1.029MetVal: 1.029 ± 0.269
0.433MetTrp: 0.433 ± 0.152
0.595MetTyr: 0.595 ± 0.165
0.0MetXaa: 0.0 ± 0.0
Asn
4.439AsnAla: 4.439 ± 0.424
0.217AsnCys: 0.217 ± 0.114
2.111AsnAsp: 2.111 ± 0.354
1.895AsnGlu: 1.895 ± 0.29
1.029AsnPhe: 1.029 ± 0.283
4.222AsnGly: 4.222 ± 0.548
0.433AsnHis: 0.433 ± 0.144
1.516AsnIle: 1.516 ± 0.248
1.299AsnLys: 1.299 ± 0.26
2.165AsnLeu: 2.165 ± 0.298
0.758AsnMet: 0.758 ± 0.193
1.299AsnAsn: 1.299 ± 0.252
2.111AsnPro: 2.111 ± 0.377
1.57AsnGln: 1.57 ± 0.338
2.057AsnArg: 2.057 ± 0.304
1.732AsnSer: 1.732 ± 0.296
1.949AsnThr: 1.949 ± 0.376
2.707AsnVal: 2.707 ± 0.33
0.704AsnTrp: 0.704 ± 0.155
0.541AsnTyr: 0.541 ± 0.186
0.0AsnXaa: 0.0 ± 0.0
Pro
8.228ProAla: 8.228 ± 1.186
0.271ProCys: 0.271 ± 0.133
3.14ProAsp: 3.14 ± 0.429
3.086ProGlu: 3.086 ± 0.369
1.245ProPhe: 1.245 ± 0.266
3.843ProGly: 3.843 ± 0.489
0.65ProHis: 0.65 ± 0.201
2.003ProIle: 2.003 ± 0.363
2.219ProLys: 2.219 ± 0.35
2.923ProLeu: 2.923 ± 0.391
0.866ProMet: 0.866 ± 0.257
1.462ProAsn: 1.462 ± 0.23
2.274ProPro: 2.274 ± 0.319
1.732ProGln: 1.732 ± 0.371
2.49ProArg: 2.49 ± 0.34
3.789ProSer: 3.789 ± 0.587
2.977ProThr: 2.977 ± 0.378
3.086ProVal: 3.086 ± 0.369
0.379ProTrp: 0.379 ± 0.131
0.866ProTyr: 0.866 ± 0.223
0.0ProXaa: 0.0 ± 0.0
Gln
6.225GlnAla: 6.225 ± 0.851
0.108GlnCys: 0.108 ± 0.079
1.949GlnAsp: 1.949 ± 0.288
2.436GlnGlu: 2.436 ± 0.424
0.92GlnPhe: 0.92 ± 0.209
3.248GlnGly: 3.248 ± 0.531
0.704GlnHis: 0.704 ± 0.175
1.841GlnIle: 1.841 ± 0.32
1.841GlnLys: 1.841 ± 0.332
3.302GlnLeu: 3.302 ± 0.401
1.462GlnMet: 1.462 ± 0.358
1.137GlnAsn: 1.137 ± 0.282
1.624GlnPro: 1.624 ± 0.264
3.573GlnGln: 3.573 ± 0.714
2.544GlnArg: 2.544 ± 0.497
2.003GlnSer: 2.003 ± 0.332
1.895GlnThr: 1.895 ± 0.355
3.248GlnVal: 3.248 ± 0.351
0.65GlnTrp: 0.65 ± 0.179
1.299GlnTyr: 1.299 ± 0.284
0.0GlnXaa: 0.0 ± 0.0
Arg
8.391ArgAla: 8.391 ± 0.617
0.595ArgCys: 0.595 ± 0.202
4.006ArgAsp: 4.006 ± 0.534
4.331ArgGlu: 4.331 ± 0.581
2.869ArgPhe: 2.869 ± 0.432
4.331ArgGly: 4.331 ± 0.539
1.191ArgHis: 1.191 ± 0.235
3.681ArgIle: 3.681 ± 0.417
3.194ArgLys: 3.194 ± 0.371
6.117ArgLeu: 6.117 ± 0.502
2.328ArgMet: 2.328 ± 0.404
2.707ArgAsn: 2.707 ± 0.38
2.923ArgPro: 2.923 ± 0.446
3.356ArgGln: 3.356 ± 0.601
4.547ArgArg: 4.547 ± 0.499
3.086ArgSer: 3.086 ± 0.411
3.031ArgThr: 3.031 ± 0.427
4.926ArgVal: 4.926 ± 0.555
0.92ArgTrp: 0.92 ± 0.189
2.653ArgTyr: 2.653 ± 0.371
0.0ArgXaa: 0.0 ± 0.0
Ser
7.146SerAla: 7.146 ± 0.876
0.487SerCys: 0.487 ± 0.201
3.302SerAsp: 3.302 ± 0.391
2.815SerGlu: 2.815 ± 0.411
1.407SerPhe: 1.407 ± 0.287
4.547SerGly: 4.547 ± 0.655
1.462SerHis: 1.462 ± 0.298
2.328SerIle: 2.328 ± 0.409
2.436SerLys: 2.436 ± 0.279
4.114SerLeu: 4.114 ± 0.334
1.299SerMet: 1.299 ± 0.318
1.895SerAsn: 1.895 ± 0.354
2.923SerPro: 2.923 ± 0.511
2.382SerGln: 2.382 ± 0.353
3.465SerArg: 3.465 ± 0.455
2.761SerSer: 2.761 ± 0.432
3.41SerThr: 3.41 ± 0.391
3.248SerVal: 3.248 ± 0.378
1.353SerTrp: 1.353 ± 0.293
1.353SerTyr: 1.353 ± 0.258
0.0SerXaa: 0.0 ± 0.0
Thr
6.009ThrAla: 6.009 ± 0.529
0.541ThrCys: 0.541 ± 0.221
2.544ThrAsp: 2.544 ± 0.335
4.006ThrGlu: 4.006 ± 0.411
1.732ThrPhe: 1.732 ± 0.339
4.71ThrGly: 4.71 ± 0.612
1.029ThrHis: 1.029 ± 0.228
3.086ThrIle: 3.086 ± 0.459
3.194ThrLys: 3.194 ± 0.331
4.331ThrLeu: 4.331 ± 0.503
1.353ThrMet: 1.353 ± 0.295
1.732ThrAsn: 1.732 ± 0.316
3.14ThrPro: 3.14 ± 0.33
1.57ThrGln: 1.57 ± 0.346
2.977ThrArg: 2.977 ± 0.461
2.977ThrSer: 2.977 ± 0.45
3.681ThrThr: 3.681 ± 0.582
4.385ThrVal: 4.385 ± 0.44
0.65ThrTrp: 0.65 ± 0.174
1.624ThrTyr: 1.624 ± 0.347
0.0ThrXaa: 0.0 ± 0.0
Val
8.012ValAla: 8.012 ± 0.622
0.487ValCys: 0.487 ± 0.175
4.168ValAsp: 4.168 ± 0.449
4.818ValGlu: 4.818 ± 0.449
1.57ValPhe: 1.57 ± 0.371
4.764ValGly: 4.764 ± 0.436
0.487ValHis: 0.487 ± 0.149
2.923ValIle: 2.923 ± 0.448
3.248ValLys: 3.248 ± 0.759
3.302ValLeu: 3.302 ± 0.321
1.732ValMet: 1.732 ± 0.286
2.544ValAsn: 2.544 ± 0.333
3.519ValPro: 3.519 ± 0.404
2.436ValGln: 2.436 ± 0.297
4.601ValArg: 4.601 ± 0.453
3.735ValSer: 3.735 ± 0.457
4.385ValThr: 4.385 ± 0.463
3.898ValVal: 3.898 ± 0.482
0.974ValTrp: 0.974 ± 0.25
1.57ValTyr: 1.57 ± 0.227
0.0ValXaa: 0.0 ± 0.0
Trp
1.083TrpAla: 1.083 ± 0.241
0.162TrpCys: 0.162 ± 0.102
0.65TrpAsp: 0.65 ± 0.224
0.595TrpGlu: 0.595 ± 0.217
0.595TrpPhe: 0.595 ± 0.212
1.299TrpGly: 1.299 ± 0.274
0.325TrpHis: 0.325 ± 0.16
0.758TrpIle: 0.758 ± 0.17
0.595TrpLys: 0.595 ± 0.176
1.678TrpLeu: 1.678 ± 0.378
0.433TrpMet: 0.433 ± 0.153
0.433TrpAsn: 0.433 ± 0.154
0.487TrpPro: 0.487 ± 0.198
0.541TrpGln: 0.541 ± 0.145
1.191TrpArg: 1.191 ± 0.29
1.029TrpSer: 1.029 ± 0.235
1.029TrpThr: 1.029 ± 0.207
0.974TrpVal: 0.974 ± 0.179
0.217TrpTrp: 0.217 ± 0.115
0.271TrpTyr: 0.271 ± 0.132
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.681TyrAla: 3.681 ± 0.449
0.325TyrCys: 0.325 ± 0.163
1.786TyrAsp: 1.786 ± 0.236
1.462TyrGlu: 1.462 ± 0.253
0.812TyrPhe: 0.812 ± 0.217
2.111TyrGly: 2.111 ± 0.344
0.325TyrHis: 0.325 ± 0.123
1.191TyrIle: 1.191 ± 0.272
1.029TyrLys: 1.029 ± 0.19
2.761TyrLeu: 2.761 ± 0.439
0.325TyrMet: 0.325 ± 0.097
1.083TyrAsn: 1.083 ± 0.279
0.812TyrPro: 0.812 ± 0.169
0.92TyrGln: 0.92 ± 0.18
2.219TyrArg: 2.219 ± 0.307
1.516TyrSer: 1.516 ± 0.259
1.516TyrThr: 1.516 ± 0.283
1.516TyrVal: 1.516 ± 0.254
0.271TyrTrp: 0.271 ± 0.196
0.595TyrTyr: 0.595 ± 0.151
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (18474 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski