Amino acid dipeptide frequency for Ralstonia phage Heva

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
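As a minimal sketch of how such a frequency could be computed, the Python below counts overlapping residue pairs in one-letter protein sequences. The function name `dipeptide_permille`, the mapping of every non-standard letter to `X` (Xaa), and the choice to normalise by the number of counted pairs are assumptions made for illustration; the database may use a different procedure or denominator.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real phage proteins): pairs are MA, AA, AK and AA, AX.
freqs = dipeptide_permille(["MAAK", "AAU"])
print(freqs["AA"])  # 400.0
```

In the toy input, `AA` accounts for two of the five counted pairs, hence 400 ‰.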

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
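The uniform expectation and the over/under labelling can be sketched as follows. The helper names are hypothetical, and whether the table's colouring uses 400 or 441 dipeptides as its baseline is not stated here, so the alphabet size is left as a parameter. The two example values are taken from the table.

```python
def expected_permille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def representation(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"    # more frequent than expected (red, per the text)
    if observed_permille < expected:
        return "under"   # less frequent than expected (blue, per the text)
    return "as expected"

print(expected_permille())     # 2.5
print(representation(21.253))  # AlaAla -> over
print(representation(0.333))   # CysCys -> under
```

With Xaa counted as a 21st residue, `expected_permille(21)` gives roughly 2.27 ‰ instead of 2.5 ‰.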

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
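For illustration, the unit conversion is a single multiplication. The function names below are hypothetical, and the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value_permille / 10.0

ala_ala = 21.253                          # AlaAla frequency in per mille
fraction = permille_to_fraction(ala_ala)  # about 0.021253
percent = permille_to_percent(ala_ala)    # about 2.1253 %
```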

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 21.253 ± 1.47
AlaCys: 1.165 ± 0.353
AlaAsp: 6.437 ± 0.589
AlaGlu: 8.324 ± 0.974
AlaPhe: 3.44 ± 0.443
AlaGly: 11.986 ± 1.118
AlaHis: 2.053 ± 0.338
AlaIle: 5.383 ± 0.566
AlaLys: 6.381 ± 0.757
AlaLeu: 10.044 ± 0.707
AlaMet: 3.607 ± 0.401
AlaAsn: 4.439 ± 0.603
AlaPro: 7.547 ± 1.053
AlaGln: 6.825 ± 0.533
AlaArg: 8.546 ± 0.825
AlaSer: 7.269 ± 0.803
AlaThr: 5.438 ± 0.553
AlaVal: 7.547 ± 0.603
AlaTrp: 1.498 ± 0.243
AlaTyr: 2.886 ± 0.381
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.499 ± 0.206
CysCys: 0.333 ± 0.146
CysAsp: 0.555 ± 0.232
CysGlu: 0.333 ± 0.206
CysPhe: 0.222 ± 0.149
CysGly: 0.666 ± 0.27
CysHis: 0.499 ± 0.217
CysIle: 0.222 ± 0.115
CysLys: 0.333 ± 0.139
CysLeu: 0.61 ± 0.266
CysMet: 0.333 ± 0.173
CysAsn: 0.333 ± 0.159
CysPro: 0.166 ± 0.107
CysGln: 0.222 ± 0.133
CysArg: 0.721 ± 0.3
CysSer: 0.832 ± 0.305
CysThr: 0.222 ± 0.137
CysVal: 0.555 ± 0.194
CysTrp: 0.166 ± 0.108
CysTyr: 0.333 ± 0.238
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.601 ± 0.67
AspCys: 0.388 ± 0.216
AspAsp: 3.773 ± 0.625
AspGlu: 4.217 ± 0.569
AspPhe: 1.776 ± 0.318
AspGly: 7.325 ± 0.773
AspHis: 1.332 ± 0.227
AspIle: 2.83 ± 0.367
AspLys: 2.109 ± 0.353
AspLeu: 3.607 ± 0.482
AspMet: 1.609 ± 0.282
AspAsn: 2.164 ± 0.334
AspPro: 2.664 ± 0.421
AspGln: 1.443 ± 0.25
AspArg: 4.606 ± 0.416
AspSer: 2.608 ± 0.39
AspThr: 2.775 ± 0.294
AspVal: 4.661 ± 0.526
AspTrp: 0.832 ± 0.21
AspTyr: 1.998 ± 0.306
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.547 ± 0.844
GluCys: 0.499 ± 0.203
GluAsp: 3.551 ± 0.418
GluGlu: 3.607 ± 0.525
GluPhe: 2.997 ± 0.421
GluGly: 4.384 ± 0.716
GluHis: 1.554 ± 0.306
GluIle: 3.607 ± 0.385
GluLys: 2.83 ± 0.733
GluLeu: 5.438 ± 0.545
GluMet: 1.332 ± 0.224
GluAsn: 1.998 ± 0.341
GluPro: 2.775 ± 0.671
GluGln: 3.662 ± 0.425
GluArg: 6.603 ± 0.799
GluSer: 2.83 ± 0.407
GluThr: 2.442 ± 0.411
GluVal: 2.997 ± 0.416
GluTrp: 0.721 ± 0.182
GluTyr: 1.831 ± 0.413
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.997 ± 0.395
PheCys: 0.166 ± 0.096
PheAsp: 3.163 ± 0.386
PheGlu: 1.72 ± 0.333
PhePhe: 0.555 ± 0.196
PheGly: 2.719 ± 0.344
PheHis: 0.388 ± 0.17
PheIle: 1.332 ± 0.304
PheLys: 1.221 ± 0.285
PheLeu: 1.887 ± 0.338
PheMet: 0.943 ± 0.223
PheAsn: 1.11 ± 0.236
PhePro: 1.165 ± 0.209
PheGln: 0.999 ± 0.206
PheArg: 1.942 ± 0.483
PheSer: 2.109 ± 0.351
PheThr: 1.443 ± 0.257
PheVal: 2.386 ± 0.347
PheTrp: 0.277 ± 0.105
PheTyr: 1.11 ± 0.221
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.82 ± 1.278
GlyCys: 0.61 ± 0.243
GlyAsp: 6.159 ± 0.63
GlyGlu: 5.549 ± 0.71
GlyPhe: 2.608 ± 0.345
GlyGly: 8.49 ± 1.099
GlyHis: 1.831 ± 0.372
GlyIle: 3.329 ± 0.4
GlyLys: 4.883 ± 0.587
GlyLeu: 5.105 ± 0.517
GlyMet: 2.109 ± 0.461
GlyAsn: 2.997 ± 0.646
GlyPro: 1.942 ± 0.272
GlyGln: 3.218 ± 0.454
GlyArg: 6.548 ± 0.71
GlySer: 4.661 ± 0.742
GlyThr: 5.272 ± 0.624
GlyVal: 5.938 ± 0.639
GlyTrp: 1.11 ± 0.228
GlyTyr: 2.109 ± 0.354
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.608 ± 0.369
HisCys: 0.277 ± 0.117
HisAsp: 1.387 ± 0.318
HisGlu: 1.387 ± 0.326
HisPhe: 0.832 ± 0.193
HisGly: 1.443 ± 0.272
HisHis: 0.721 ± 0.301
HisIle: 0.999 ± 0.214
HisLys: 0.777 ± 0.241
HisLeu: 1.165 ± 0.266
HisMet: 0.555 ± 0.208
HisAsn: 0.444 ± 0.172
HisPro: 0.721 ± 0.196
HisGln: 0.61 ± 0.155
HisArg: 1.831 ± 0.385
HisSer: 0.943 ± 0.243
HisThr: 0.777 ± 0.222
HisVal: 1.332 ± 0.336
HisTrp: 0.111 ± 0.092
HisTyr: 0.555 ± 0.183
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.27 ± 0.563
IleCys: 0.277 ± 0.176
IleAsp: 2.775 ± 0.388
IleGlu: 4.162 ± 0.465
IlePhe: 0.943 ± 0.235
IleGly: 3.884 ± 0.449
IleHis: 0.666 ± 0.185
IleIle: 1.554 ± 0.317
IleLys: 2.22 ± 0.341
IleLeu: 1.998 ± 0.269
IleMet: 1.221 ± 0.323
IleAsn: 1.831 ± 0.307
IlePro: 1.887 ± 0.328
IleGln: 1.72 ± 0.37
IleArg: 3.274 ± 0.579
IleSer: 2.275 ± 0.369
IleThr: 2.22 ± 0.359
IleVal: 3.163 ± 0.47
IleTrp: 0.222 ± 0.098
IleTyr: 0.943 ± 0.218
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.38 ± 0.469
LysCys: 0.444 ± 0.196
LysAsp: 2.886 ± 0.402
LysGlu: 2.941 ± 0.727
LysPhe: 1.332 ± 0.294
LysGly: 3.274 ± 0.59
LysHis: 0.555 ± 0.152
LysIle: 1.942 ± 0.425
LysLys: 2.275 ± 0.331
LysLeu: 4.051 ± 0.542
LysMet: 1.221 ± 0.337
LysAsn: 1.443 ± 0.29
LysPro: 2.553 ± 0.394
LysGln: 2.608 ± 0.365
LysArg: 3.995 ± 0.51
LysSer: 2.442 ± 0.428
LysThr: 3.163 ± 0.438
LysVal: 2.886 ± 0.363
LysTrp: 0.832 ± 0.231
LysTyr: 0.777 ± 0.197
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.436 ± 0.608
LeuCys: 0.444 ± 0.181
LeuAsp: 5.716 ± 0.603
LeuGlu: 3.829 ± 0.604
LeuPhe: 1.887 ± 0.271
LeuGly: 6.215 ± 0.585
LeuHis: 1.443 ± 0.224
LeuIle: 3.496 ± 0.545
LeuLys: 4.273 ± 0.676
LeuLeu: 4.217 ± 0.569
LeuMet: 1.887 ± 0.283
LeuAsn: 2.275 ± 0.311
LeuPro: 3.995 ± 0.733
LeuGln: 2.22 ± 0.339
LeuArg: 5.438 ± 0.522
LeuSer: 4.55 ± 0.42
LeuThr: 4.051 ± 0.521
LeuVal: 3.995 ± 0.631
LeuTrp: 0.666 ± 0.266
LeuTyr: 1.609 ± 0.258
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.553 ± 0.343
MetCys: 0.166 ± 0.113
MetAsp: 1.332 ± 0.279
MetGlu: 1.054 ± 0.191
MetPhe: 0.777 ± 0.236
MetGly: 1.942 ± 0.324
MetHis: 0.444 ± 0.153
MetIle: 0.721 ± 0.153
MetLys: 1.942 ± 0.324
MetLeu: 1.942 ± 0.259
MetMet: 0.777 ± 0.174
MetAsn: 1.221 ± 0.3
MetPro: 1.665 ± 0.317
MetGln: 1.332 ± 0.292
MetArg: 2.22 ± 0.435
MetSer: 2.053 ± 0.329
MetThr: 1.665 ± 0.309
MetVal: 1.332 ± 0.304
MetTrp: 0.444 ± 0.167
MetTyr: 0.555 ± 0.135
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.051 ± 0.567
AsnCys: 0.277 ± 0.151
AsnAsp: 2.164 ± 0.305
AsnGlu: 2.22 ± 0.366
AsnPhe: 0.999 ± 0.236
AsnGly: 3.773 ± 0.506
AsnHis: 0.555 ± 0.213
AsnIle: 1.498 ± 0.241
AsnLys: 1.276 ± 0.266
AsnLeu: 2.331 ± 0.286
AsnMet: 0.666 ± 0.195
AsnAsn: 1.165 ± 0.298
AsnPro: 2.386 ± 0.355
AsnGln: 1.554 ± 0.326
AsnArg: 2.109 ± 0.295
AsnSer: 1.831 ± 0.32
AsnThr: 2.22 ± 0.472
AsnVal: 2.886 ± 0.403
AsnTrp: 0.721 ± 0.19
AsnTyr: 0.555 ± 0.222
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.49 ± 1.424
ProCys: 0.333 ± 0.165
ProAsp: 2.997 ± 0.444
ProGlu: 3.052 ± 0.458
ProPhe: 1.276 ± 0.33
ProGly: 3.94 ± 0.551
ProHis: 0.555 ± 0.21
ProIle: 1.887 ± 0.319
ProLys: 2.664 ± 0.397
ProLeu: 2.886 ± 0.404
ProMet: 0.832 ± 0.3
ProAsn: 1.443 ± 0.365
ProPro: 1.887 ± 0.351
ProGln: 1.665 ± 0.417
ProArg: 2.886 ± 0.454
ProSer: 3.107 ± 0.535
ProThr: 2.608 ± 0.394
ProVal: 3.052 ± 0.351
ProTrp: 0.277 ± 0.107
ProTyr: 0.999 ± 0.296
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.381 ± 0.842
GlnCys: 0.111 ± 0.083
GlnAsp: 1.887 ± 0.316
GlnGlu: 2.442 ± 0.381
GlnPhe: 0.999 ± 0.254
GlnGly: 2.997 ± 0.418
GlnHis: 0.999 ± 0.265
GlnIle: 2.164 ± 0.35
GlnLys: 1.942 ± 0.378
GlnLeu: 3.274 ± 0.411
GlnMet: 1.443 ± 0.428
GlnAsn: 1.276 ± 0.41
GlnPro: 1.998 ± 0.355
GlnGln: 3.829 ± 0.728
GlnArg: 2.997 ± 0.52
GlnSer: 1.498 ± 0.307
GlnThr: 1.665 ± 0.373
GlnVal: 2.997 ± 0.42
GlnTrp: 0.721 ± 0.227
GlnTyr: 1.165 ± 0.237
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.822 ± 0.735
ArgCys: 0.499 ± 0.231
ArgAsp: 3.995 ± 0.424
ArgGlu: 4.772 ± 0.657
ArgPhe: 3.163 ± 0.433
ArgGly: 5.05 ± 0.602
ArgHis: 1.831 ± 0.466
ArgIle: 3.496 ± 0.446
ArgLys: 3.551 ± 0.539
ArgLeu: 5.66 ± 0.548
ArgMet: 2.109 ± 0.348
ArgAsn: 2.886 ± 0.369
ArgPro: 2.775 ± 0.423
ArgGln: 3.551 ± 0.699
ArgArg: 5.216 ± 0.778
ArgSer: 3.662 ± 0.438
ArgThr: 2.83 ± 0.371
ArgVal: 5.549 ± 0.756
ArgTrp: 1.332 ± 0.304
ArgTyr: 2.664 ± 0.365
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.77 ± 0.84
SerCys: 0.499 ± 0.235
SerAsp: 3.496 ± 0.372
SerGlu: 3.274 ± 0.384
SerPhe: 1.776 ± 0.28
SerGly: 4.883 ± 0.599
SerHis: 1.165 ± 0.246
SerIle: 2.164 ± 0.305
SerLys: 2.664 ± 0.375
SerLeu: 3.773 ± 0.342
SerMet: 1.165 ± 0.347
SerAsn: 2.109 ± 0.321
SerPro: 2.775 ± 0.335
SerGln: 2.22 ± 0.376
SerArg: 3.995 ± 0.451
SerSer: 2.775 ± 0.41
SerThr: 3.607 ± 0.476
SerVal: 3.718 ± 0.39
SerTrp: 1.054 ± 0.22
SerTyr: 1.332 ± 0.273
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.827 ± 0.625
ThrCys: 0.333 ± 0.173
ThrAsp: 2.386 ± 0.387
ThrGlu: 3.44 ± 0.412
ThrPhe: 1.221 ± 0.304
ThrGly: 5.272 ± 0.644
ThrHis: 1.054 ± 0.199
ThrIle: 2.664 ± 0.346
ThrLys: 2.608 ± 0.325
ThrLeu: 4.217 ± 0.584
ThrMet: 1.609 ± 0.297
ThrAsn: 1.72 ± 0.388
ThrPro: 2.941 ± 0.443
ThrGln: 1.332 ± 0.284
ThrArg: 2.886 ± 0.409
ThrSer: 2.886 ± 0.581
ThrThr: 3.052 ± 0.522
ThrVal: 4.328 ± 0.519
ThrTrp: 0.61 ± 0.148
ThrTyr: 1.665 ± 0.317
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.658 ± 0.673
ValCys: 0.943 ± 0.312
ValAsp: 4.106 ± 0.483
ValGlu: 4.55 ± 0.575
ValPhe: 1.665 ± 0.305
ValGly: 5.105 ± 0.578
ValHis: 1.11 ± 0.296
ValIle: 2.497 ± 0.436
ValLys: 3.385 ± 0.542
ValLeu: 3.44 ± 0.386
ValMet: 1.831 ± 0.287
ValAsn: 2.775 ± 0.414
ValPro: 3.884 ± 0.443
ValGln: 2.608 ± 0.351
ValArg: 5.161 ± 0.615
ValSer: 4.55 ± 0.461
ValThr: 4.162 ± 0.587
ValVal: 4.384 ± 0.603
ValTrp: 0.999 ± 0.345
ValTyr: 1.609 ± 0.306
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.221 ± 0.23
TrpCys: 0.111 ± 0.088
TrpAsp: 0.61 ± 0.24
TrpGlu: 0.555 ± 0.233
TrpPhe: 0.555 ± 0.205
TrpGly: 0.999 ± 0.184
TrpHis: 0.388 ± 0.184
TrpIle: 0.721 ± 0.177
TrpLys: 0.499 ± 0.19
TrpLeu: 1.276 ± 0.263
TrpMet: 0.388 ± 0.134
TrpAsn: 0.61 ± 0.185
TrpPro: 0.333 ± 0.157
TrpGln: 0.388 ± 0.137
TrpArg: 1.276 ± 0.344
TrpSer: 0.888 ± 0.316
TrpThr: 0.777 ± 0.193
TrpVal: 0.943 ± 0.167
TrpTrp: 0.222 ± 0.157
TrpTyr: 0.333 ± 0.174
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.941 ± 0.342
TyrCys: 0.277 ± 0.126
TyrAsp: 1.942 ± 0.268
TyrGlu: 1.776 ± 0.259
TyrPhe: 0.61 ± 0.182
TyrGly: 1.942 ± 0.348
TyrHis: 0.333 ± 0.122
TyrIle: 0.999 ± 0.229
TyrLys: 1.165 ± 0.215
TyrLeu: 2.719 ± 0.373
TyrMet: 0.444 ± 0.188
TyrAsn: 0.999 ± 0.258
TyrPro: 0.777 ± 0.166
TyrGln: 0.777 ± 0.188
TyrArg: 2.164 ± 0.342
TyrSer: 1.443 ± 0.354
TyrThr: 1.665 ± 0.342
TyrVal: 1.776 ± 0.342
TyrTrp: 0.277 ± 0.176
TyrTyr: 0.499 ± 0.135
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (18022 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
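A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. Using the sample standard deviation as the error, the fixed seed, and the function names are all assumptions; the note above only states that 100 bootstrap rounds at the protein level were used.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (in per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)

# Toy sequences for illustration only (not the phage proteome).
proteins = ["MAAKAA", "MKKLV", "MAALAA", "MGGSA"]
error = bootstrap_error(proteins, "AA")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.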

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski