Amino acid dipepetide frequency for Gordonia phage Tiamoceli

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.476AlaAla: 18.476 ± 1.48
0.672AlaCys: 0.672 ± 0.19
9.63AlaAsp: 9.63 ± 0.898
9.294AlaGlu: 9.294 ± 1.185
3.079AlaPhe: 3.079 ± 0.574
12.709AlaGly: 12.709 ± 0.955
2.24AlaHis: 2.24 ± 0.385
6.327AlaIle: 6.327 ± 0.832
5.039AlaLys: 5.039 ± 0.776
9.462AlaLeu: 9.462 ± 0.659
1.792AlaMet: 1.792 ± 0.354
3.191AlaAsn: 3.191 ± 0.393
5.487AlaPro: 5.487 ± 0.542
4.479AlaGln: 4.479 ± 0.538
7.502AlaArg: 7.502 ± 0.966
6.383AlaSer: 6.383 ± 0.525
7.67AlaThr: 7.67 ± 0.692
8.342AlaVal: 8.342 ± 1.171
2.351AlaTrp: 2.351 ± 0.28
2.463AlaTyr: 2.463 ± 0.477
0.0AlaXaa: 0.0 ± 0.0
Cys
0.84CysAla: 0.84 ± 0.233
0.112CysCys: 0.112 ± 0.122
0.336CysAsp: 0.336 ± 0.132
0.728CysGlu: 0.728 ± 0.225
0.28CysPhe: 0.28 ± 0.115
0.728CysGly: 0.728 ± 0.256
0.224CysHis: 0.224 ± 0.119
0.168CysIle: 0.168 ± 0.101
0.28CysLys: 0.28 ± 0.131
0.504CysLeu: 0.504 ± 0.166
0.112CysMet: 0.112 ± 0.071
0.336CysAsn: 0.336 ± 0.147
0.728CysPro: 0.728 ± 0.2
0.112CysGln: 0.112 ± 0.075
0.84CysArg: 0.84 ± 0.263
0.28CysSer: 0.28 ± 0.118
0.616CysThr: 0.616 ± 0.215
0.392CysVal: 0.392 ± 0.13
0.168CysTrp: 0.168 ± 0.09
0.168CysTyr: 0.168 ± 0.099
0.0CysXaa: 0.0 ± 0.0
Asp
7.334AspAla: 7.334 ± 0.947
0.616AspCys: 0.616 ± 0.152
7.558AspAsp: 7.558 ± 1.082
5.655AspGlu: 5.655 ± 1.012
1.288AspPhe: 1.288 ± 0.261
7.726AspGly: 7.726 ± 0.689
1.232AspHis: 1.232 ± 0.28
1.344AspIle: 1.344 ± 0.339
1.904AspLys: 1.904 ± 0.366
5.991AspLeu: 5.991 ± 0.645
1.008AspMet: 1.008 ± 0.242
1.344AspAsn: 1.344 ± 0.261
5.879AspPro: 5.879 ± 0.678
3.135AspGln: 3.135 ± 0.395
5.767AspArg: 5.767 ± 0.753
3.079AspSer: 3.079 ± 0.442
3.863AspThr: 3.863 ± 0.558
4.479AspVal: 4.479 ± 0.571
1.792AspTrp: 1.792 ± 0.346
1.736AspTyr: 1.736 ± 0.281
0.0AspXaa: 0.0 ± 0.0
Glu
7.222GluAla: 7.222 ± 0.76
0.392GluCys: 0.392 ± 0.158
2.967GluAsp: 2.967 ± 0.615
1.4GluGlu: 1.4 ± 0.342
1.904GluPhe: 1.904 ± 0.387
3.695GluGly: 3.695 ± 0.626
1.176GluHis: 1.176 ± 0.243
2.967GluIle: 2.967 ± 0.48
0.952GluLys: 0.952 ± 0.21
4.983GluLeu: 4.983 ± 0.694
1.176GluMet: 1.176 ± 0.264
1.68GluAsn: 1.68 ± 0.289
3.247GluPro: 3.247 ± 0.529
4.031GluGln: 4.031 ± 0.665
4.367GluArg: 4.367 ± 0.647
2.911GluSer: 2.911 ± 0.432
3.079GluThr: 3.079 ± 0.389
4.591GluVal: 4.591 ± 0.524
1.736GluTrp: 1.736 ± 0.382
2.296GluTyr: 2.296 ± 0.384
0.0GluXaa: 0.0 ± 0.0
Phe
3.639PheAla: 3.639 ± 0.547
0.392PheCys: 0.392 ± 0.172
2.128PheAsp: 2.128 ± 0.407
1.4PheGlu: 1.4 ± 0.257
0.952PhePhe: 0.952 ± 0.405
2.799PheGly: 2.799 ± 0.371
0.28PheHis: 0.28 ± 0.141
0.672PheIle: 0.672 ± 0.201
0.728PheLys: 0.728 ± 0.431
1.68PheLeu: 1.68 ± 0.288
0.28PheMet: 0.28 ± 0.11
0.728PheAsn: 0.728 ± 0.259
1.288PhePro: 1.288 ± 0.268
1.064PheGln: 1.064 ± 0.192
1.288PheArg: 1.288 ± 0.307
1.512PheSer: 1.512 ± 0.276
2.128PheThr: 2.128 ± 0.445
1.792PheVal: 1.792 ± 0.311
0.728PheTrp: 0.728 ± 0.203
0.336PheTyr: 0.336 ± 0.146
0.0PheXaa: 0.0 ± 0.0
Gly
9.294GlyAla: 9.294 ± 1.22
0.672GlyCys: 0.672 ± 0.22
5.599GlyAsp: 5.599 ± 0.58
5.599GlyGlu: 5.599 ± 0.698
2.184GlyPhe: 2.184 ± 0.46
8.51GlyGly: 8.51 ± 0.799
1.792GlyHis: 1.792 ± 0.379
5.095GlyIle: 5.095 ± 0.687
2.967GlyLys: 2.967 ± 0.643
6.327GlyLeu: 6.327 ± 0.829
1.792GlyMet: 1.792 ± 0.335
3.247GlyAsn: 3.247 ± 0.516
3.471GlyPro: 3.471 ± 0.472
3.303GlyGln: 3.303 ± 0.389
5.935GlyArg: 5.935 ± 0.673
6.271GlySer: 6.271 ± 0.823
6.551GlyThr: 6.551 ± 0.89
6.383GlyVal: 6.383 ± 0.467
2.016GlyTrp: 2.016 ± 0.348
2.575GlyTyr: 2.575 ± 0.391
0.0GlyXaa: 0.0 ± 0.0
His
2.128HisAla: 2.128 ± 0.368
0.112HisCys: 0.112 ± 0.085
1.008HisAsp: 1.008 ± 0.22
1.008HisGlu: 1.008 ± 0.314
0.168HisPhe: 0.168 ± 0.087
1.568HisGly: 1.568 ± 0.303
0.392HisHis: 0.392 ± 0.165
0.84HisIle: 0.84 ± 0.208
0.224HisLys: 0.224 ± 0.112
1.96HisLeu: 1.96 ± 0.367
0.392HisMet: 0.392 ± 0.14
0.336HisAsn: 0.336 ± 0.147
1.736HisPro: 1.736 ± 0.373
0.728HisGln: 0.728 ± 0.242
1.624HisArg: 1.624 ± 0.35
0.784HisSer: 0.784 ± 0.29
1.064HisThr: 1.064 ± 0.257
1.232HisVal: 1.232 ± 0.257
0.392HisTrp: 0.392 ± 0.138
0.448HisTyr: 0.448 ± 0.149
0.0HisXaa: 0.0 ± 0.0
Ile
7.054IleAla: 7.054 ± 0.699
0.28IleCys: 0.28 ± 0.121
3.359IleAsp: 3.359 ± 0.425
3.583IleGlu: 3.583 ± 0.408
0.784IlePhe: 0.784 ± 0.2
4.871IleGly: 4.871 ± 0.664
0.56IleHis: 0.56 ± 0.18
1.848IleIle: 1.848 ± 0.411
1.456IleLys: 1.456 ± 0.267
2.296IleLeu: 2.296 ± 0.392
0.392IleMet: 0.392 ± 0.161
1.12IleAsn: 1.12 ± 0.228
2.855IlePro: 2.855 ± 0.305
0.84IleGln: 0.84 ± 0.282
2.967IleArg: 2.967 ± 0.355
1.96IleSer: 1.96 ± 0.423
3.471IleThr: 3.471 ± 0.416
3.527IleVal: 3.527 ± 0.476
0.784IleTrp: 0.784 ± 0.167
0.952IleTyr: 0.952 ± 0.191
0.0IleXaa: 0.0 ± 0.0
Lys
4.479LysAla: 4.479 ± 0.876
0.336LysCys: 0.336 ± 0.151
1.4LysAsp: 1.4 ± 0.254
0.448LysGlu: 0.448 ± 0.187
0.728LysPhe: 0.728 ± 0.25
2.519LysGly: 2.519 ± 0.413
0.56LysHis: 0.56 ± 0.178
2.072LysIle: 2.072 ± 0.486
0.728LysLys: 0.728 ± 0.176
3.583LysLeu: 3.583 ± 0.45
0.56LysMet: 0.56 ± 0.153
0.448LysAsn: 0.448 ± 0.175
1.568LysPro: 1.568 ± 0.347
0.728LysGln: 0.728 ± 0.211
2.351LysArg: 2.351 ± 0.352
1.4LysSer: 1.4 ± 0.268
1.4LysThr: 1.4 ± 0.279
2.296LysVal: 2.296 ± 0.258
0.448LysTrp: 0.448 ± 0.164
0.952LysTyr: 0.952 ± 0.262
0.0LysXaa: 0.0 ± 0.0
Leu
9.742LeuAla: 9.742 ± 0.8
0.784LeuCys: 0.784 ± 0.267
7.278LeuAsp: 7.278 ± 0.916
2.799LeuGlu: 2.799 ± 0.41
1.848LeuPhe: 1.848 ± 0.357
8.062LeuGly: 8.062 ± 0.887
1.288LeuHis: 1.288 ± 0.341
3.527LeuIle: 3.527 ± 0.348
1.456LeuLys: 1.456 ± 0.207
5.935LeuLeu: 5.935 ± 0.627
1.12LeuMet: 1.12 ± 0.232
2.072LeuAsn: 2.072 ± 0.325
5.039LeuPro: 5.039 ± 0.494
2.967LeuGln: 2.967 ± 0.48
4.591LeuArg: 4.591 ± 0.487
3.303LeuSer: 3.303 ± 0.426
5.767LeuThr: 5.767 ± 0.588
6.551LeuVal: 6.551 ± 0.605
1.456LeuTrp: 1.456 ± 0.302
1.288LeuTyr: 1.288 ± 0.325
0.0LeuXaa: 0.0 ± 0.0
Met
2.128MetAla: 2.128 ± 0.382
0.056MetCys: 0.056 ± 0.058
0.84MetAsp: 0.84 ± 0.18
0.336MetGlu: 0.336 ± 0.116
0.392MetPhe: 0.392 ± 0.131
1.176MetGly: 1.176 ± 0.215
0.224MetHis: 0.224 ± 0.105
0.952MetIle: 0.952 ± 0.255
0.56MetLys: 0.56 ± 0.18
1.624MetLeu: 1.624 ± 0.235
0.448MetMet: 0.448 ± 0.151
0.728MetAsn: 0.728 ± 0.169
1.904MetPro: 1.904 ± 0.293
0.168MetGln: 0.168 ± 0.095
1.344MetArg: 1.344 ± 0.268
1.064MetSer: 1.064 ± 0.257
3.863MetThr: 3.863 ± 0.383
0.84MetVal: 0.84 ± 0.244
0.392MetTrp: 0.392 ± 0.159
0.056MetTyr: 0.056 ± 0.06
0.0MetXaa: 0.0 ± 0.0
Asn
3.527AsnAla: 3.527 ± 0.553
0.336AsnCys: 0.336 ± 0.125
2.351AsnAsp: 2.351 ± 0.344
1.4AsnGlu: 1.4 ± 0.29
0.784AsnPhe: 0.784 ± 0.241
2.743AsnGly: 2.743 ± 0.411
0.28AsnHis: 0.28 ± 0.122
0.504AsnIle: 0.504 ± 0.173
0.784AsnLys: 0.784 ± 0.181
2.24AsnLeu: 2.24 ± 0.355
0.504AsnMet: 0.504 ± 0.213
0.672AsnAsn: 0.672 ± 0.189
2.128AsnPro: 2.128 ± 0.403
0.504AsnGln: 0.504 ± 0.171
1.96AsnArg: 1.96 ± 0.401
1.456AsnSer: 1.456 ± 0.364
2.351AsnThr: 2.351 ± 0.363
2.24AsnVal: 2.24 ± 0.386
0.56AsnTrp: 0.56 ± 0.167
0.616AsnTyr: 0.616 ± 0.184
0.0AsnXaa: 0.0 ± 0.0
Pro
8.398ProAla: 8.398 ± 0.863
0.448ProCys: 0.448 ± 0.154
4.983ProAsp: 4.983 ± 0.624
3.863ProGlu: 3.863 ± 0.584
1.904ProPhe: 1.904 ± 0.303
4.759ProGly: 4.759 ± 0.424
1.176ProHis: 1.176 ± 0.247
3.359ProIle: 3.359 ± 0.338
1.68ProLys: 1.68 ± 0.281
3.247ProLeu: 3.247 ± 0.447
1.232ProMet: 1.232 ± 0.241
1.848ProAsn: 1.848 ± 0.313
3.303ProPro: 3.303 ± 0.509
2.072ProGln: 2.072 ± 0.327
3.191ProArg: 3.191 ± 0.526
2.24ProSer: 2.24 ± 0.321
4.423ProThr: 4.423 ± 0.52
4.871ProVal: 4.871 ± 0.554
1.568ProTrp: 1.568 ± 0.323
1.288ProTyr: 1.288 ± 0.265
0.0ProXaa: 0.0 ± 0.0
Gln
4.703GlnAla: 4.703 ± 0.553
0.392GlnCys: 0.392 ± 0.147
1.456GlnAsp: 1.456 ± 0.224
1.232GlnGlu: 1.232 ± 0.254
1.624GlnPhe: 1.624 ± 0.238
2.799GlnGly: 2.799 ± 0.482
0.896GlnHis: 0.896 ± 0.277
2.296GlnIle: 2.296 ± 0.322
1.008GlnLys: 1.008 ± 0.206
3.135GlnLeu: 3.135 ± 0.399
1.232GlnMet: 1.232 ± 0.252
1.288GlnAsn: 1.288 ± 0.272
2.184GlnPro: 2.184 ± 0.338
2.743GlnGln: 2.743 ± 0.453
2.407GlnArg: 2.407 ± 0.38
1.344GlnSer: 1.344 ± 0.206
2.351GlnThr: 2.351 ± 0.303
3.135GlnVal: 3.135 ± 0.364
0.784GlnTrp: 0.784 ± 0.207
0.896GlnTyr: 0.896 ± 0.219
0.0GlnXaa: 0.0 ± 0.0
Arg
8.398ArgAla: 8.398 ± 0.846
0.504ArgCys: 0.504 ± 0.15
4.479ArgAsp: 4.479 ± 0.753
4.087ArgGlu: 4.087 ± 0.639
1.512ArgPhe: 1.512 ± 0.275
4.311ArgGly: 4.311 ± 0.551
1.568ArgHis: 1.568 ± 0.295
3.191ArgIle: 3.191 ± 0.528
2.519ArgLys: 2.519 ± 0.368
4.871ArgLeu: 4.871 ± 0.475
2.296ArgMet: 2.296 ± 0.343
1.792ArgAsn: 1.792 ± 0.372
3.415ArgPro: 3.415 ± 0.557
2.519ArgGln: 2.519 ± 0.366
7.334ArgArg: 7.334 ± 1.039
3.247ArgSer: 3.247 ± 0.356
4.535ArgThr: 4.535 ± 0.476
5.431ArgVal: 5.431 ± 0.661
1.344ArgTrp: 1.344 ± 0.295
1.96ArgTyr: 1.96 ± 0.333
0.0ArgXaa: 0.0 ± 0.0
Ser
6.607SerAla: 6.607 ± 0.929
0.336SerCys: 0.336 ± 0.164
2.575SerAsp: 2.575 ± 0.371
2.016SerGlu: 2.016 ± 0.385
1.064SerPhe: 1.064 ± 0.222
4.759SerGly: 4.759 ± 0.736
0.672SerHis: 0.672 ± 0.287
2.519SerIle: 2.519 ± 0.326
1.904SerLys: 1.904 ± 0.366
4.591SerLeu: 4.591 ± 0.511
1.064SerMet: 1.064 ± 0.182
1.288SerAsn: 1.288 ± 0.3
3.135SerPro: 3.135 ± 0.382
2.016SerGln: 2.016 ± 0.333
2.911SerArg: 2.911 ± 0.438
2.855SerSer: 2.855 ± 0.507
3.639SerThr: 3.639 ± 0.569
3.023SerVal: 3.023 ± 0.454
1.4SerTrp: 1.4 ± 0.261
1.008SerTyr: 1.008 ± 0.287
0.0SerXaa: 0.0 ± 0.0
Thr
8.454ThrAla: 8.454 ± 0.716
0.616ThrCys: 0.616 ± 0.219
5.431ThrAsp: 5.431 ± 0.562
4.367ThrGlu: 4.367 ± 0.484
1.904ThrPhe: 1.904 ± 0.437
6.887ThrGly: 6.887 ± 0.734
1.176ThrHis: 1.176 ± 0.247
3.583ThrIle: 3.583 ± 0.432
1.736ThrLys: 1.736 ± 0.313
5.543ThrLeu: 5.543 ± 0.494
1.624ThrMet: 1.624 ± 0.288
2.631ThrAsn: 2.631 ± 0.336
5.319ThrPro: 5.319 ± 0.688
1.512ThrGln: 1.512 ± 0.256
4.199ThrArg: 4.199 ± 0.608
3.807ThrSer: 3.807 ± 0.594
5.207ThrThr: 5.207 ± 1.044
5.375ThrVal: 5.375 ± 0.913
0.952ThrTrp: 0.952 ± 0.192
1.288ThrTyr: 1.288 ± 0.259
0.0ThrXaa: 0.0 ± 0.0
Val
8.678ValAla: 8.678 ± 0.604
0.224ValCys: 0.224 ± 0.138
5.935ValAsp: 5.935 ± 0.764
5.599ValGlu: 5.599 ± 0.494
2.128ValPhe: 2.128 ± 0.397
5.767ValGly: 5.767 ± 0.73
1.736ValHis: 1.736 ± 0.338
2.407ValIle: 2.407 ± 0.35
2.24ValLys: 2.24 ± 0.316
4.591ValLeu: 4.591 ± 0.506
1.232ValMet: 1.232 ± 0.25
1.904ValAsn: 1.904 ± 0.359
4.255ValPro: 4.255 ± 0.6
3.023ValGln: 3.023 ± 0.427
4.479ValArg: 4.479 ± 0.425
3.191ValSer: 3.191 ± 0.509
6.607ValThr: 6.607 ± 0.685
4.255ValVal: 4.255 ± 0.572
1.4ValTrp: 1.4 ± 0.255
2.072ValTyr: 2.072 ± 0.329
0.0ValXaa: 0.0 ± 0.0
Trp
2.855TrpAla: 2.855 ± 0.382
0.448TrpCys: 0.448 ± 0.171
1.176TrpAsp: 1.176 ± 0.239
0.672TrpGlu: 0.672 ± 0.213
0.896TrpPhe: 0.896 ± 0.216
1.288TrpGly: 1.288 ± 0.404
0.392TrpHis: 0.392 ± 0.172
0.56TrpIle: 0.56 ± 0.159
0.504TrpLys: 0.504 ± 0.162
2.351TrpLeu: 2.351 ± 0.35
0.392TrpMet: 0.392 ± 0.125
0.728TrpAsn: 0.728 ± 0.275
1.512TrpPro: 1.512 ± 0.254
1.4TrpGln: 1.4 ± 0.229
1.568TrpArg: 1.568 ± 0.325
0.952TrpSer: 0.952 ± 0.32
1.512TrpThr: 1.512 ± 0.298
1.4TrpVal: 1.4 ± 0.273
0.616TrpTrp: 0.616 ± 0.226
0.336TrpTyr: 0.336 ± 0.127
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.631TyrAla: 2.631 ± 0.569
0.224TyrCys: 0.224 ± 0.12
2.24TyrAsp: 2.24 ± 0.394
1.344TyrGlu: 1.344 ± 0.27
0.392TyrPhe: 0.392 ± 0.133
1.904TyrGly: 1.904 ± 0.326
0.28TyrHis: 0.28 ± 0.13
0.728TyrIle: 0.728 ± 0.231
0.336TyrLys: 0.336 ± 0.133
1.96TyrLeu: 1.96 ± 0.34
0.392TyrMet: 0.392 ± 0.124
0.56TyrAsn: 0.56 ± 0.24
1.4TyrPro: 1.4 ± 0.289
0.56TyrGln: 0.56 ± 0.185
2.687TyrArg: 2.687 ± 0.454
1.344TyrSer: 1.344 ± 0.286
1.512TyrThr: 1.512 ± 0.313
1.568TyrVal: 1.568 ± 0.294
0.672TyrTrp: 0.672 ± 0.172
0.504TyrTyr: 0.504 ± 0.192
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 80 proteins (17862 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski