Amino acid dipeptide frequency for Escherichia virus Rtp

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
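As a rough illustration of the calculation described above, the following Python sketch counts overlapping pairs of adjacent residues and reports them in per mille. The names `dipeptide_permille` and `classify`, the one-letter alphabet, the normalisation by the total number of pairs, and the plain comparison against 2.5‰ are assumptions made for this example; the page does not state how the database computes or colours the values.

```python
from collections import Counter

# The 20 standard amino acids (one-letter code); anything else becomes "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Frequency expected if all 400 standard dipeptides were equally likely:
# 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(a + b for a, b in zip(clean, clean[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of observed pairs.
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Hypothetical colouring rule: red above expectation, blue below."""
    if freq_permille > expected:
        return "red"
    if freq_permille < expected:
        return "blue"
    return None
```

Under these assumptions, `classify(8.42)` (the AlaAla value) gives `"red"` and `classify(0.653)` (the AlaCys value) gives `"blue"`.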

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.42 ± 1.339
AlaCys: 0.653 ± 0.242
AlaAsp: 4.21 ± 0.452
AlaGlu: 5.372 ± 0.766
AlaPhe: 3.049 ± 0.448
AlaGly: 6.533 ± 0.705
AlaHis: 1.089 ± 0.32
AlaIle: 7.186 ± 0.78
AlaLys: 6.17 ± 0.939
AlaLeu: 6.315 ± 0.844
AlaMet: 2.323 ± 0.418
AlaAsn: 4.501 ± 0.553
AlaPro: 1.96 ± 0.421
AlaGln: 3.339 ± 0.532
AlaArg: 5.372 ± 0.923
AlaSer: 5.081 ± 0.605
AlaThr: 4.138 ± 0.632
AlaVal: 5.226 ± 0.715
AlaTrp: 0.726 ± 0.262
AlaTyr: 2.395 ± 0.383
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.016 ± 0.321
CysCys: 0.218 ± 0.136
CysAsp: 1.161 ± 0.293
CysGlu: 0.726 ± 0.304
CysPhe: 0.29 ± 0.149
CysGly: 1.161 ± 0.4
CysHis: 0.145 ± 0.113
CysIle: 0.581 ± 0.254
CysLys: 0.871 ± 0.249
CysLeu: 0.653 ± 0.214
CysMet: 0.508 ± 0.213
CysAsn: 0.798 ± 0.275
CysPro: 0.145 ± 0.11
CysGln: 0.145 ± 0.11
CysArg: 0.944 ± 0.315
CysSer: 1.016 ± 0.294
CysThr: 0.798 ± 0.276
CysVal: 0.581 ± 0.21
CysTrp: 0.218 ± 0.134
CysTyr: 0.218 ± 0.134
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.864 ± 0.595
AspCys: 0.653 ± 0.236
AspAsp: 3.847 ± 0.705
AspGlu: 4.21 ± 0.478
AspPhe: 2.323 ± 0.477
AspGly: 7.622 ± 0.892
AspHis: 0.871 ± 0.326
AspIle: 3.847 ± 0.609
AspLys: 4.065 ± 0.546
AspLeu: 5.299 ± 0.546
AspMet: 1.524 ± 0.369
AspAsn: 2.686 ± 0.461
AspPro: 1.815 ± 0.362
AspGln: 0.944 ± 0.255
AspArg: 2.395 ± 0.512
AspSer: 3.992 ± 0.636
AspThr: 3.049 ± 0.471
AspVal: 3.775 ± 0.511
AspTrp: 1.016 ± 0.23
AspTyr: 3.121 ± 0.437
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.662 ± 0.91
GluCys: 1.161 ± 0.373
GluAsp: 3.484 ± 0.489
GluGlu: 3.702 ± 0.844
GluPhe: 3.267 ± 0.482
GluGly: 3.339 ± 0.508
GluHis: 0.871 ± 0.251
GluIle: 4.573 ± 0.613
GluLys: 3.847 ± 0.662
GluLeu: 5.444 ± 0.607
GluMet: 2.904 ± 0.617
GluAsn: 3.121 ± 0.567
GluPro: 1.887 ± 0.333
GluGln: 2.831 ± 0.569
GluArg: 2.613 ± 0.472
GluSer: 4.791 ± 0.675
GluThr: 3.339 ± 0.467
GluVal: 5.081 ± 0.742
GluTrp: 0.581 ± 0.206
GluTyr: 2.323 ± 0.419
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.25 ± 0.415
PheCys: 0.798 ± 0.242
PheAsp: 3.484 ± 0.406
PheGlu: 3.194 ± 0.543
PhePhe: 1.161 ± 0.245
PheGly: 3.63 ± 0.595
PheHis: 0.944 ± 0.3
PheIle: 2.468 ± 0.361
PheLys: 2.395 ± 0.451
PheLeu: 2.468 ± 0.467
PheMet: 0.871 ± 0.222
PheAsn: 2.25 ± 0.524
PhePro: 1.452 ± 0.269
PheGln: 1.742 ± 0.353
PheArg: 1.815 ± 0.328
PheSer: 2.541 ± 0.502
PheThr: 2.758 ± 0.408
PheVal: 2.033 ± 0.309
PheTrp: 0.363 ± 0.127
PheTyr: 0.944 ± 0.265
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.791 ± 0.779
GlyCys: 1.742 ± 0.397
GlyAsp: 4.138 ± 0.59
GlyGlu: 4.791 ± 0.525
GlyPhe: 2.831 ± 0.336
GlyGly: 4.864 ± 0.935
GlyHis: 0.726 ± 0.315
GlyIle: 6.243 ± 0.658
GlyLys: 5.952 ± 0.769
GlyLeu: 6.751 ± 0.748
GlyMet: 1.887 ± 0.438
GlyAsn: 3.63 ± 0.633
GlyPro: 0.944 ± 0.276
GlyGln: 1.96 ± 0.362
GlyArg: 2.904 ± 0.423
GlySer: 6.025 ± 0.836
GlyThr: 3.702 ± 0.647
GlyVal: 5.807 ± 0.606
GlyTrp: 1.161 ± 0.294
GlyTyr: 3.557 ± 0.473
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.726 ± 0.297
HisCys: 0.29 ± 0.145
HisAsp: 0.944 ± 0.247
HisGlu: 0.798 ± 0.284
HisPhe: 0.508 ± 0.194
HisGly: 1.161 ± 0.395
HisHis: 0.363 ± 0.171
HisIle: 1.379 ± 0.356
HisLys: 1.67 ± 0.343
HisLeu: 1.161 ± 0.343
HisMet: 0.653 ± 0.278
HisAsn: 0.726 ± 0.308
HisPro: 0.218 ± 0.124
HisGln: 0.581 ± 0.244
HisArg: 0.871 ± 0.296
HisSer: 0.363 ± 0.188
HisThr: 0.871 ± 0.268
HisVal: 0.653 ± 0.176
HisTrp: 0.218 ± 0.13
HisTyr: 0.581 ± 0.248
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.17 ± 0.777
IleCys: 0.581 ± 0.214
IleAsp: 5.299 ± 0.611
IleGlu: 3.775 ± 0.466
IlePhe: 1.597 ± 0.302
IleGly: 3.847 ± 0.578
IleHis: 1.089 ± 0.337
IleIle: 3.412 ± 0.615
IleLys: 5.88 ± 0.706
IleLeu: 3.194 ± 0.5
IleMet: 1.524 ± 0.343
IleAsn: 4.283 ± 0.636
IlePro: 2.613 ± 0.504
IleGln: 2.468 ± 0.518
IleArg: 2.686 ± 0.466
IleSer: 5.444 ± 0.969
IleThr: 4.718 ± 0.575
IleVal: 4.428 ± 0.6
IleTrp: 0.944 ± 0.241
IleTyr: 3.049 ± 0.555
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.186 ± 1.009
LysCys: 0.436 ± 0.181
LysAsp: 4.428 ± 0.551
LysGlu: 4.718 ± 0.594
LysPhe: 3.267 ± 0.548
LysGly: 3.194 ± 0.472
LysHis: 0.871 ± 0.278
LysIle: 4.428 ± 0.419
LysLys: 4.065 ± 0.735
LysLeu: 4.718 ± 0.708
LysMet: 3.049 ± 0.501
LysAsn: 2.541 ± 0.56
LysPro: 1.815 ± 0.404
LysGln: 2.613 ± 0.427
LysArg: 2.613 ± 0.585
LysSer: 4.138 ± 0.666
LysThr: 4.283 ± 0.493
LysVal: 4.428 ± 0.593
LysTrp: 0.726 ± 0.21
LysTyr: 2.686 ± 0.489
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.735 ± 0.824
LeuCys: 0.944 ± 0.244
LeuAsp: 4.355 ± 0.537
LeuGlu: 4.936 ± 0.675
LeuPhe: 2.758 ± 0.405
LeuGly: 4.718 ± 0.698
LeuHis: 1.307 ± 0.433
LeuIle: 4.791 ± 0.53
LeuLys: 3.775 ± 0.531
LeuLeu: 3.92 ± 0.527
LeuMet: 1.161 ± 0.26
LeuAsn: 2.976 ± 0.443
LeuPro: 2.831 ± 0.381
LeuGln: 2.468 ± 0.571
LeuArg: 3.775 ± 0.527
LeuSer: 6.461 ± 0.741
LeuThr: 4.718 ± 0.558
LeuVal: 5.444 ± 0.622
LeuTrp: 0.436 ± 0.244
LeuTyr: 2.105 ± 0.403
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.267 ± 0.432
MetCys: 0.218 ± 0.136
MetAsp: 1.161 ± 0.322
MetGlu: 1.234 ± 0.327
MetPhe: 1.234 ± 0.3
MetGly: 1.161 ± 0.287
MetHis: 0.436 ± 0.19
MetIle: 2.323 ± 0.416
MetLys: 2.105 ± 0.543
MetLeu: 1.597 ± 0.4
MetMet: 0.726 ± 0.308
MetAsn: 1.67 ± 0.386
MetPro: 0.363 ± 0.15
MetGln: 0.871 ± 0.26
MetArg: 1.887 ± 0.426
MetSer: 1.742 ± 0.311
MetThr: 2.105 ± 0.329
MetVal: 1.887 ± 0.419
MetTrp: 0.073 ± 0.074
MetTyr: 0.653 ± 0.194
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.646 ± 0.607
AsnCys: 0.726 ± 0.306
AsnAsp: 3.194 ± 0.408
AsnGlu: 3.412 ± 0.586
AsnPhe: 1.815 ± 0.301
AsnGly: 5.662 ± 1.144
AsnHis: 0.871 ± 0.275
AsnIle: 2.541 ± 0.445
AsnLys: 2.831 ± 0.554
AsnLeu: 2.686 ± 0.574
AsnMet: 1.234 ± 0.232
AsnAsn: 3.194 ± 0.543
AsnPro: 1.742 ± 0.307
AsnGln: 2.25 ± 0.389
AsnArg: 2.395 ± 0.403
AsnSer: 3.992 ± 0.658
AsnThr: 2.25 ± 0.409
AsnVal: 4.355 ± 0.453
AsnTrp: 1.016 ± 0.28
AsnTyr: 1.67 ± 0.341
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.831 ± 0.423
ProCys: 0.363 ± 0.21
ProAsp: 2.033 ± 0.512
ProGlu: 2.831 ± 0.46
ProPhe: 1.452 ± 0.266
ProGly: 1.67 ± 0.327
ProHis: 0.436 ± 0.178
ProIle: 1.815 ± 0.269
ProLys: 1.452 ± 0.318
ProLeu: 1.815 ± 0.374
ProMet: 0.653 ± 0.216
ProAsn: 1.597 ± 0.346
ProPro: 0.798 ± 0.266
ProGln: 1.161 ± 0.335
ProArg: 1.524 ± 0.365
ProSer: 1.379 ± 0.331
ProThr: 1.524 ± 0.392
ProVal: 3.194 ± 0.525
ProTrp: 0.436 ± 0.233
ProTyr: 1.234 ± 0.309
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.847 ± 0.888
GlnCys: 0.363 ± 0.16
GlnAsp: 1.379 ± 0.331
GlnGlu: 2.613 ± 0.482
GlnPhe: 1.452 ± 0.327
GlnGly: 2.033 ± 0.435
GlnHis: 0.436 ± 0.158
GlnIle: 2.904 ± 0.49
GlnLys: 2.178 ± 0.411
GlnLeu: 2.25 ± 0.459
GlnMet: 1.089 ± 0.22
GlnAsn: 1.815 ± 0.357
GlnPro: 0.944 ± 0.301
GlnGln: 2.904 ± 1.319
GlnArg: 1.67 ± 0.328
GlnSer: 2.686 ± 0.401
GlnThr: 1.524 ± 0.321
GlnVal: 2.976 ± 0.4
GlnTrp: 0.363 ± 0.173
GlnTyr: 1.452 ± 0.324
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.065 ± 0.503
ArgCys: 0.726 ± 0.348
ArgAsp: 2.831 ± 0.496
ArgGlu: 3.049 ± 0.47
ArgPhe: 2.178 ± 0.362
ArgGly: 2.976 ± 0.461
ArgHis: 0.363 ± 0.166
ArgIle: 3.484 ± 0.587
ArgLys: 4.21 ± 0.638
ArgLeu: 4.501 ± 0.541
ArgMet: 1.016 ± 0.296
ArgAsn: 2.033 ± 0.385
ArgPro: 1.597 ± 0.409
ArgGln: 1.67 ± 0.533
ArgArg: 2.758 ± 0.431
ArgSer: 2.976 ± 0.437
ArgThr: 2.033 ± 0.371
ArgVal: 3.702 ± 0.532
ArgTrp: 0.508 ± 0.193
ArgTyr: 2.395 ± 0.333
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.226 ± 0.695
SerCys: 0.508 ± 0.196
SerAsp: 4.936 ± 0.57
SerGlu: 4.355 ± 0.565
SerPhe: 3.412 ± 0.444
SerGly: 6.823 ± 0.861
SerHis: 1.452 ± 0.323
SerIle: 3.847 ± 0.398
SerLys: 3.484 ± 0.603
SerLeu: 5.226 ± 0.683
SerMet: 1.597 ± 0.358
SerAsn: 3.63 ± 0.568
SerPro: 2.541 ± 0.365
SerGln: 2.033 ± 0.363
SerArg: 2.976 ± 0.57
SerSer: 5.517 ± 1.337
SerThr: 4.21 ± 0.64
SerVal: 5.517 ± 0.609
SerTrp: 0.653 ± 0.222
SerTyr: 2.468 ± 0.401
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.009 ± 0.754
ThrCys: 0.581 ± 0.182
ThrAsp: 2.613 ± 0.493
ThrGlu: 2.758 ± 0.39
ThrPhe: 2.395 ± 0.482
ThrGly: 6.098 ± 0.799
ThrHis: 0.726 ± 0.268
ThrIle: 3.92 ± 0.505
ThrLys: 2.758 ± 0.538
ThrLeu: 3.992 ± 0.548
ThrMet: 0.871 ± 0.23
ThrAsn: 3.92 ± 0.48
ThrPro: 2.178 ± 0.324
ThrGln: 2.613 ± 0.374
ThrArg: 2.831 ± 0.478
ThrSer: 3.339 ± 0.429
ThrThr: 3.339 ± 0.663
ThrVal: 4.355 ± 0.444
ThrTrp: 0.29 ± 0.127
ThrTyr: 2.613 ± 0.495
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.154 ± 0.668
ValCys: 0.653 ± 0.238
ValAsp: 4.864 ± 0.512
ValGlu: 5.226 ± 0.9
ValPhe: 2.758 ± 0.417
ValGly: 4.355 ± 0.642
ValHis: 0.726 ± 0.22
ValIle: 3.992 ± 0.535
ValLys: 5.226 ± 0.627
ValLeu: 4.138 ± 0.62
ValMet: 1.96 ± 0.463
ValAsn: 4.718 ± 0.578
ValPro: 2.468 ± 0.446
ValGln: 2.395 ± 0.584
ValArg: 4.283 ± 0.674
ValSer: 5.517 ± 0.625
ValThr: 4.283 ± 0.662
ValVal: 6.243 ± 0.964
ValTrp: 0.726 ± 0.209
ValTyr: 3.194 ± 0.462
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.508 ± 0.208
TrpCys: 0.145 ± 0.102
TrpAsp: 0.653 ± 0.198
TrpGlu: 0.581 ± 0.266
TrpPhe: 0.726 ± 0.216
TrpGly: 0.871 ± 0.207
TrpHis: 0.508 ± 0.175
TrpIle: 0.871 ± 0.265
TrpLys: 0.944 ± 0.214
TrpLeu: 0.944 ± 0.227
TrpMet: 0.29 ± 0.112
TrpAsn: 0.29 ± 0.151
TrpPro: 0.363 ± 0.164
TrpGln: 0.218 ± 0.122
TrpArg: 0.726 ± 0.206
TrpSer: 0.944 ± 0.349
TrpThr: 0.436 ± 0.181
TrpVal: 0.581 ± 0.202
TrpTrp: 0.073 ± 0.083
TrpTyr: 0.363 ± 0.135
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.686 ± 0.437
TyrCys: 0.436 ± 0.191
TyrAsp: 2.904 ± 0.504
TyrGlu: 2.541 ± 0.453
TyrPhe: 1.089 ± 0.297
TyrGly: 2.686 ± 0.386
TyrHis: 0.508 ± 0.171
TyrIle: 2.395 ± 0.362
TyrLys: 2.323 ± 0.519
TyrLeu: 2.541 ± 0.398
TyrMet: 0.798 ± 0.21
TyrAsn: 2.178 ± 0.342
TyrPro: 1.524 ± 0.372
TyrGln: 1.67 ± 0.343
TyrArg: 2.033 ± 0.327
TyrSer: 2.468 ± 0.437
TyrThr: 3.121 ± 0.442
TyrVal: 2.613 ± 0.433
TyrTrp: 0.508 ± 0.183
TyrTyr: 1.597 ± 0.33
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (13777 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
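A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, the use of the sample standard deviation, and the normalisation by the number of residue pairs are assumptions for illustration; the note above only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over all adjacent residue pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            if a + b == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residue pairs keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.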

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski