Amino acid dipeptide frequency for Gordonia phage GMA4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteins.
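
As a rough illustration of how such a frequency can be computed, the following minimal sketch counts overlapping residue pairs within each protein and reports them in per mille. The function name, the toy sequences, the one-letter codes (the table uses three-letter codes) and the normalisation by the total number of counted pairs are all assumptions made for this example; the page does not state how the database normalises its values.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are counted with a sliding window of
    length 2 inside each protein, so no pair spans two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy "proteome" in one-letter code, for illustration only.
toy_proteome = ["MAAG", "AAL"]
freqs = dipeptide_per_mille(toy_proteome)
print(freqs["AA"])
```

With a real proteome the sequences would come from a FASTA file, and the resulting dictionary would hold up to 400 (or 441 with Xaa) entries.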

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
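
The arithmetic behind the expected value can be sketched as follows. The function names and the rule of comparing directly against the uniform expectation are assumptions for illustration; the page does not say which threshold it uses for the red and blue marking.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, expected):
    """Label a value relative to the expectation (hypothetical rule)."""
    if observed_per_mille > expected:
        return "over"    # would be marked red
    if observed_per_mille < expected:
        return "under"   # would be marked blue
    return "as expected"


uniform_20 = expected_per_mille(20)   # 1000 / 400
uniform_21 = expected_per_mille(21)   # 1000 / 441, when Xaa is included

# Two values taken from the table below (AlaAla and CysCys).
print(classify(19.282, uniform_20))
print(classify(0.072, uniform_20))
```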

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
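
As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the input is the AlaAla value from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 19.282  # AlaAla value from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```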

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 19.282 ± 1.375
AlaCys: 0.788 ± 0.267
AlaAsp: 7.526 ± 0.643
AlaGlu: 7.24 ± 0.871
AlaPhe: 3.441 ± 0.891
AlaGly: 12.186 ± 1.267
AlaHis: 2.007 ± 0.535
AlaIle: 4.803 ± 0.592
AlaLys: 5.089 ± 0.547
AlaLeu: 9.963 ± 1.053
AlaMet: 2.939 ± 0.53
AlaAsn: 3.082 ± 0.406
AlaPro: 6.953 ± 0.764
AlaGln: 4.874 ± 0.704
AlaArg: 8.602 ± 0.93
AlaSer: 7.526 ± 0.633
AlaThr: 7.24 ± 0.736
AlaVal: 10.465 ± 0.952
AlaTrp: 2.724 ± 0.36
AlaTyr: 1.72 ± 0.39
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.505 ± 0.428
CysCys: 0.072 ± 0.072
CysAsp: 0.502 ± 0.181
CysGlu: 0.502 ± 0.19
CysPhe: 0.287 ± 0.128
CysGly: 0.932 ± 0.266
CysHis: 0.143 ± 0.111
CysIle: 0.143 ± 0.107
CysLys: 0.43 ± 0.206
CysLeu: 0.43 ± 0.217
CysMet: 0.072 ± 0.073
CysAsn: 0.072 ± 0.079
CysPro: 0.932 ± 0.339
CysGln: 0.43 ± 0.22
CysArg: 1.147 ± 0.309
CysSer: 0.43 ± 0.148
CysThr: 0.358 ± 0.166
CysVal: 0.287 ± 0.175
CysTrp: 0.215 ± 0.129
CysTyr: 0.072 ± 0.075
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.171 ± 0.627
AspCys: 0.645 ± 0.209
AspAsp: 5.233 ± 0.814
AspGlu: 4.659 ± 0.527
AspPhe: 1.935 ± 0.267
AspGly: 6.236 ± 0.628
AspHis: 1.434 ± 0.356
AspIle: 3.011 ± 0.449
AspLys: 1.434 ± 0.306
AspLeu: 5.591 ± 0.574
AspMet: 1.434 ± 0.322
AspAsn: 1.792 ± 0.321
AspPro: 4.086 ± 0.484
AspGln: 2.294 ± 0.352
AspArg: 6.093 ± 0.843
AspSer: 2.652 ± 0.377
AspThr: 3.727 ± 0.517
AspVal: 4.587 ± 0.612
AspTrp: 1.649 ± 0.268
AspTyr: 1.29 ± 0.237
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.455 ± 0.718
GluCys: 0.287 ± 0.157
GluAsp: 2.652 ± 0.404
GluGlu: 2.652 ± 0.642
GluPhe: 2.222 ± 0.369
GluGly: 3.369 ± 0.427
GluHis: 1.505 ± 0.312
GluIle: 2.652 ± 0.378
GluLys: 1.864 ± 0.328
GluLeu: 5.734 ± 0.704
GluMet: 1.505 ± 0.31
GluAsn: 1.219 ± 0.246
GluPro: 2.437 ± 0.518
GluGln: 2.724 ± 0.491
GluArg: 4.157 ± 0.609
GluSer: 3.082 ± 0.486
GluThr: 3.154 ± 0.51
GluVal: 3.656 ± 0.506
GluTrp: 1.075 ± 0.244
GluTyr: 0.86 ± 0.203
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.082 ± 0.451
PheCys: 0.43 ± 0.214
PheAsp: 2.222 ± 0.443
PheGlu: 1.577 ± 0.269
PhePhe: 1.004 ± 0.298
PheGly: 2.867 ± 0.458
PheHis: 0.502 ± 0.152
PheIle: 0.788 ± 0.242
PheLys: 1.434 ± 0.354
PheLeu: 1.29 ± 0.314
PheMet: 0.645 ± 0.184
PheAsn: 0.788 ± 0.222
PhePro: 1.219 ± 0.31
PheGln: 0.573 ± 0.168
PheArg: 1.505 ± 0.312
PheSer: 1.004 ± 0.233
PheThr: 2.15 ± 0.355
PheVal: 1.792 ± 0.387
PheTrp: 0.287 ± 0.116
PheTyr: 0.86 ± 0.22
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.752 ± 1.04
GlyCys: 0.86 ± 0.272
GlyAsp: 5.161 ± 0.56
GlyGlu: 4.587 ± 0.494
GlyPhe: 1.935 ± 0.438
GlyGly: 8.53 ± 1.636
GlyHis: 1.72 ± 0.44
GlyIle: 3.584 ± 0.492
GlyLys: 2.795 ± 0.524
GlyLeu: 5.734 ± 0.698
GlyMet: 1.792 ± 0.32
GlyAsn: 2.437 ± 0.358
GlyPro: 4.086 ± 0.66
GlyGln: 3.011 ± 0.386
GlyArg: 6.666 ± 0.779
GlySer: 4.086 ± 0.557
GlyThr: 8.028 ± 1.0
GlyVal: 5.233 ± 0.75
GlyTrp: 2.079 ± 0.419
GlyTyr: 1.864 ± 0.429
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.58 ± 0.434
HisCys: 0.358 ± 0.163
HisAsp: 1.219 ± 0.303
HisGlu: 1.004 ± 0.254
HisPhe: 0.43 ± 0.191
HisGly: 2.15 ± 0.374
HisHis: 0.645 ± 0.22
HisIle: 0.645 ± 0.225
HisLys: 0.645 ± 0.221
HisLeu: 1.864 ± 0.397
HisMet: 0.287 ± 0.179
HisAsn: 0.358 ± 0.132
HisPro: 1.577 ± 0.393
HisGln: 0.788 ± 0.266
HisArg: 1.792 ± 0.346
HisSer: 0.932 ± 0.261
HisThr: 1.362 ± 0.331
HisVal: 1.434 ± 0.365
HisTrp: 0.143 ± 0.095
HisTyr: 0.502 ± 0.178
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.233 ± 0.741
IleCys: 0.43 ± 0.171
IleAsp: 4.946 ± 0.699
IleGlu: 3.297 ± 0.567
IlePhe: 0.645 ± 0.208
IleGly: 3.871 ± 0.584
IleHis: 0.645 ± 0.206
IleIle: 1.649 ± 0.313
IleLys: 1.29 ± 0.313
IleLeu: 3.656 ± 0.541
IleMet: 0.717 ± 0.205
IleAsn: 1.577 ± 0.335
IlePro: 2.365 ± 0.399
IleGln: 1.434 ± 0.395
IleArg: 3.871 ± 0.664
IleSer: 2.007 ± 0.312
IleThr: 3.154 ± 0.541
IleVal: 3.297 ± 0.463
IleTrp: 0.645 ± 0.196
IleTyr: 0.717 ± 0.266
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.157 ± 0.537
LysCys: 0.143 ± 0.092
LysAsp: 2.294 ± 0.423
LysGlu: 1.075 ± 0.292
LysPhe: 1.004 ± 0.284
LysGly: 2.509 ± 0.471
LysHis: 0.645 ± 0.241
LysIle: 1.577 ± 0.43
LysLys: 1.434 ± 0.389
LysLeu: 2.867 ± 0.656
LysMet: 0.573 ± 0.149
LysAsn: 1.29 ± 0.336
LysPro: 2.079 ± 0.31
LysGln: 1.434 ± 0.306
LysArg: 2.724 ± 0.477
LysSer: 1.505 ± 0.256
LysThr: 2.079 ± 0.38
LysVal: 2.509 ± 0.405
LysTrp: 0.43 ± 0.144
LysTyr: 0.86 ± 0.231
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.397 ± 1.089
LeuCys: 0.502 ± 0.202
LeuAsp: 6.81 ± 0.727
LeuGlu: 3.799 ± 0.553
LeuPhe: 2.079 ± 0.402
LeuGly: 7.025 ± 0.678
LeuHis: 1.935 ± 0.418
LeuIle: 5.018 ± 0.512
LeuLys: 2.007 ± 0.372
LeuLeu: 5.519 ± 0.682
LeuMet: 1.29 ± 0.274
LeuAsn: 1.792 ± 0.322
LeuPro: 4.587 ± 0.67
LeuGln: 2.795 ± 0.471
LeuArg: 5.949 ± 0.73
LeuSer: 4.229 ± 0.609
LeuThr: 6.164 ± 0.726
LeuVal: 6.093 ± 0.763
LeuTrp: 1.434 ± 0.295
LeuTyr: 1.505 ± 0.359
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.226 ± 0.413
MetCys: 0.143 ± 0.106
MetAsp: 0.932 ± 0.233
MetGlu: 0.645 ± 0.246
MetPhe: 0.502 ± 0.188
MetGly: 0.788 ± 0.213
MetHis: 0.358 ± 0.176
MetIle: 0.932 ± 0.274
MetLys: 1.29 ± 0.241
MetLeu: 1.29 ± 0.381
MetMet: 0.43 ± 0.189
MetAsn: 0.788 ± 0.349
MetPro: 1.434 ± 0.334
MetGln: 1.004 ± 0.264
MetArg: 1.577 ± 0.296
MetSer: 1.792 ± 0.326
MetThr: 2.222 ± 0.404
MetVal: 1.505 ± 0.311
MetTrp: 0.072 ± 0.06
MetTyr: 0.215 ± 0.167
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.652 ± 0.37
AsnCys: 0.143 ± 0.12
AsnAsp: 1.577 ± 0.322
AsnGlu: 1.29 ± 0.381
AsnPhe: 0.717 ± 0.249
AsnGly: 3.154 ± 0.557
AsnHis: 0.502 ± 0.238
AsnIle: 1.004 ± 0.226
AsnLys: 0.788 ± 0.188
AsnLeu: 2.15 ± 0.37
AsnMet: 0.502 ± 0.148
AsnAsn: 0.717 ± 0.251
AsnPro: 1.935 ± 0.406
AsnGln: 1.505 ± 0.393
AsnArg: 1.649 ± 0.388
AsnSer: 1.505 ± 0.371
AsnThr: 1.577 ± 0.369
AsnVal: 1.792 ± 0.315
AsnTrp: 0.358 ± 0.167
AsnTyr: 0.287 ± 0.124
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.953 ± 0.865
ProCys: 0.645 ± 0.247
ProAsp: 4.372 ± 0.561
ProGlu: 2.58 ± 0.576
ProPhe: 1.72 ± 0.307
ProGly: 4.229 ± 0.479
ProHis: 1.219 ± 0.374
ProIle: 3.011 ± 0.411
ProLys: 1.29 ± 0.355
ProLeu: 4.014 ± 0.538
ProMet: 1.505 ± 0.362
ProAsn: 1.219 ± 0.293
ProPro: 3.011 ± 0.611
ProGln: 2.079 ± 0.464
ProArg: 2.724 ± 0.488
ProSer: 3.441 ± 0.607
ProThr: 4.157 ± 0.463
ProVal: 4.587 ± 0.586
ProTrp: 1.29 ± 0.309
ProTyr: 1.219 ± 0.307
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.731 ± 0.592
GlnCys: 0.43 ± 0.208
GlnAsp: 1.075 ± 0.26
GlnGlu: 1.935 ± 0.35
GlnPhe: 1.004 ± 0.22
GlnGly: 1.935 ± 0.377
GlnHis: 1.004 ± 0.308
GlnIle: 1.434 ± 0.237
GlnLys: 1.219 ± 0.273
GlnLeu: 3.942 ± 0.69
GlnMet: 1.29 ± 0.297
GlnAsn: 0.645 ± 0.205
GlnPro: 2.294 ± 0.555
GlnGln: 1.792 ± 0.662
GlnArg: 2.58 ± 0.402
GlnSer: 1.505 ± 0.325
GlnThr: 2.509 ± 0.372
GlnVal: 3.727 ± 0.601
GlnTrp: 1.075 ± 0.313
GlnTyr: 1.147 ± 0.187
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.53 ± 0.791
ArgCys: 0.645 ± 0.221
ArgAsp: 4.659 ± 0.646
ArgGlu: 4.014 ± 0.569
ArgPhe: 1.649 ± 0.363
ArgGly: 5.448 ± 0.676
ArgHis: 1.864 ± 0.379
ArgIle: 3.942 ± 0.601
ArgLys: 3.154 ± 0.614
ArgLeu: 7.526 ± 0.862
ArgMet: 1.577 ± 0.342
ArgAsn: 1.935 ± 0.366
ArgPro: 3.011 ± 0.446
ArgGln: 3.082 ± 0.53
ArgArg: 8.171 ± 1.191
ArgSer: 3.727 ± 0.523
ArgThr: 4.229 ± 0.524
ArgVal: 4.874 ± 0.646
ArgTrp: 1.72 ± 0.384
ArgTyr: 1.505 ± 0.33
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.881 ± 0.847
SerCys: 0.358 ± 0.152
SerAsp: 3.441 ± 0.416
SerGlu: 2.222 ± 0.462
SerPhe: 1.29 ± 0.319
SerGly: 5.591 ± 0.668
SerHis: 1.075 ± 0.345
SerIle: 2.079 ± 0.351
SerLys: 1.72 ± 0.391
SerLeu: 4.086 ± 0.685
SerMet: 1.362 ± 0.293
SerAsn: 1.505 ± 0.277
SerPro: 2.867 ± 0.509
SerGln: 1.577 ± 0.275
SerArg: 2.509 ± 0.465
SerSer: 3.011 ± 0.576
SerThr: 4.516 ± 0.686
SerVal: 4.157 ± 0.488
SerTrp: 0.932 ± 0.192
SerTyr: 0.717 ± 0.23
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.175 ± 0.612
ThrCys: 0.502 ± 0.176
ThrAsp: 4.946 ± 0.686
ThrGlu: 3.799 ± 0.48
ThrPhe: 1.505 ± 0.279
ThrGly: 6.164 ± 0.956
ThrHis: 1.29 ± 0.371
ThrIle: 4.301 ± 0.615
ThrLys: 1.72 ± 0.383
ThrLeu: 6.021 ± 0.768
ThrMet: 1.004 ± 0.272
ThrAsn: 1.434 ± 0.442
ThrPro: 4.803 ± 0.738
ThrGln: 2.007 ± 0.272
ThrArg: 4.229 ± 0.642
ThrSer: 4.086 ± 0.539
ThrThr: 4.874 ± 0.801
ThrVal: 4.803 ± 0.637
ThrTrp: 1.577 ± 0.406
ThrTyr: 1.29 ± 0.357
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.745 ± 0.945
ValCys: 0.645 ± 0.211
ValAsp: 5.663 ± 0.668
ValGlu: 5.161 ± 0.712
ValPhe: 1.72 ± 0.481
ValGly: 5.591 ± 0.628
ValHis: 1.577 ± 0.368
ValIle: 2.939 ± 0.346
ValLys: 2.437 ± 0.496
ValLeu: 6.953 ± 0.648
ValMet: 1.505 ± 0.267
ValAsn: 2.007 ± 0.378
ValPro: 3.584 ± 0.436
ValGln: 2.294 ± 0.501
ValArg: 5.519 ± 0.574
ValSer: 3.226 ± 0.524
ValThr: 5.448 ± 0.78
ValVal: 6.308 ± 0.838
ValTrp: 1.577 ± 0.356
ValTyr: 1.075 ± 0.298
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.15 ± 0.381
TrpCys: 0.502 ± 0.213
TrpAsp: 1.505 ± 0.313
TrpGlu: 1.075 ± 0.32
TrpPhe: 0.215 ± 0.125
TrpGly: 0.788 ± 0.238
TrpHis: 0.43 ± 0.226
TrpIle: 1.29 ± 0.333
TrpLys: 0.788 ± 0.261
TrpLeu: 2.079 ± 0.369
TrpMet: 0.215 ± 0.093
TrpAsn: 0.573 ± 0.201
TrpPro: 0.86 ± 0.371
TrpGln: 0.86 ± 0.194
TrpArg: 1.935 ± 0.366
TrpSer: 1.147 ± 0.278
TrpThr: 1.792 ± 0.368
TrpVal: 1.29 ± 0.303
TrpTrp: 0.215 ± 0.131
TrpTyr: 0.215 ± 0.117
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.792 ± 0.438
TyrCys: 0.43 ± 0.177
TyrAsp: 1.219 ± 0.282
TyrGlu: 1.147 ± 0.314
TyrPhe: 0.717 ± 0.268
TyrGly: 1.577 ± 0.355
TyrHis: 0.215 ± 0.103
TyrIle: 0.645 ± 0.189
TyrLys: 0.358 ± 0.192
TyrLeu: 1.219 ± 0.227
TyrMet: 0.43 ± 0.142
TyrAsn: 0.573 ± 0.236
TyrPro: 1.147 ± 0.219
TyrGln: 0.717 ± 0.256
TyrArg: 1.864 ± 0.343
TyrSer: 1.075 ± 0.323
TyrThr: 0.788 ± 0.281
TyrVal: 1.577 ± 0.281
TyrTrp: 0.43 ± 0.162
TyrTyr: 0.143 ± 0.089
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (13952 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
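
A protein-level bootstrap of this kind can be sketched roughly as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, the toy sequences and the choice of the standard deviation as the reported error are assumptions for illustration; the page does not specify which spread measure the ± values represent.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille (hypothetical helper)."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, same proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: report the sample standard deviation of the replicates.
    return statistics.stdev(estimates)


# Made-up sequences in one-letter code, for illustration only.
toy_proteome = ["MAAG", "AAL", "MKV", "GAAA"]
print(bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins contribute a correspondingly larger error.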

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski