Amino acid dipepetide frequency for Microbacterium phage Antares

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.8AlaAla: 16.8 ± 1.497
0.769AlaCys: 0.769 ± 0.237
8.4AlaAsp: 8.4 ± 0.768
7.867AlaGlu: 7.867 ± 0.752
3.253AlaPhe: 3.253 ± 0.357
8.755AlaGly: 8.755 ± 0.932
2.425AlaHis: 2.425 ± 0.459
5.383AlaIle: 5.383 ± 0.619
3.431AlaLys: 3.431 ± 0.448
11.713AlaLeu: 11.713 ± 1.314
3.49AlaMet: 3.49 ± 0.442
3.017AlaAsn: 3.017 ± 0.402
6.211AlaPro: 6.211 ± 0.609
4.91AlaGln: 4.91 ± 0.584
7.158AlaArg: 7.158 ± 0.763
7.098AlaSer: 7.098 ± 0.741
8.163AlaThr: 8.163 ± 0.748
6.98AlaVal: 6.98 ± 0.703
3.017AlaTrp: 3.017 ± 0.463
2.721AlaTyr: 2.721 ± 0.399
0.0AlaXaa: 0.0 ± 0.0
Cys
0.592CysAla: 0.592 ± 0.176
0.177CysCys: 0.177 ± 0.137
1.183CysAsp: 1.183 ± 0.315
0.592CysGlu: 0.592 ± 0.215
0.237CysPhe: 0.237 ± 0.098
1.361CysGly: 1.361 ± 0.337
0.296CysHis: 0.296 ± 0.134
0.296CysIle: 0.296 ± 0.152
0.0CysLys: 0.0 ± 0.0
0.355CysLeu: 0.355 ± 0.137
0.177CysMet: 0.177 ± 0.117
0.177CysAsn: 0.177 ± 0.109
0.769CysPro: 0.769 ± 0.215
0.177CysGln: 0.177 ± 0.101
0.651CysArg: 0.651 ± 0.207
0.473CysSer: 0.473 ± 0.283
0.414CysThr: 0.414 ± 0.151
0.592CysVal: 0.592 ± 0.196
0.118CysTrp: 0.118 ± 0.078
0.177CysTyr: 0.177 ± 0.101
0.0CysXaa: 0.0 ± 0.0
Asp
8.518AspAla: 8.518 ± 0.903
0.651AspCys: 0.651 ± 0.242
5.087AspAsp: 5.087 ± 0.752
5.265AspGlu: 5.265 ± 0.746
1.893AspPhe: 1.893 ± 0.317
6.566AspGly: 6.566 ± 0.684
1.361AspHis: 1.361 ± 0.255
2.899AspIle: 2.899 ± 0.459
1.361AspLys: 1.361 ± 0.301
5.797AspLeu: 5.797 ± 0.595
1.538AspMet: 1.538 ± 0.388
1.656AspAsn: 1.656 ± 0.329
4.259AspPro: 4.259 ± 0.517
2.366AspGln: 2.366 ± 0.386
5.087AspArg: 5.087 ± 0.622
3.253AspSer: 3.253 ± 0.419
3.076AspThr: 3.076 ± 0.378
4.91AspVal: 4.91 ± 0.61
0.946AspTrp: 0.946 ± 0.256
1.656AspTyr: 1.656 ± 0.355
0.0AspXaa: 0.0 ± 0.0
Glu
8.577GluAla: 8.577 ± 0.758
0.532GluCys: 0.532 ± 0.187
4.791GluAsp: 4.791 ± 0.62
4.2GluGlu: 4.2 ± 0.55
1.775GluPhe: 1.775 ± 0.345
5.087GluGly: 5.087 ± 0.645
1.715GluHis: 1.715 ± 0.312
1.893GluIle: 1.893 ± 0.455
1.479GluLys: 1.479 ± 0.319
3.135GluLeu: 3.135 ± 0.378
1.183GluMet: 1.183 ± 0.26
1.479GluAsn: 1.479 ± 0.308
4.791GluPro: 4.791 ± 0.968
2.248GluGln: 2.248 ± 0.409
5.915GluArg: 5.915 ± 0.605
2.958GluSer: 2.958 ± 0.459
3.727GluThr: 3.727 ± 0.5
5.679GluVal: 5.679 ± 0.686
1.42GluTrp: 1.42 ± 0.299
2.011GluTyr: 2.011 ± 0.343
0.0GluXaa: 0.0 ± 0.0
Phe
3.135PheAla: 3.135 ± 0.428
0.237PheCys: 0.237 ± 0.16
2.425PheAsp: 2.425 ± 0.324
1.952PheGlu: 1.952 ± 0.329
0.71PhePhe: 0.71 ± 0.2
2.544PheGly: 2.544 ± 0.403
0.355PheHis: 0.355 ± 0.13
1.361PheIle: 1.361 ± 0.351
0.769PheLys: 0.769 ± 0.181
1.479PheLeu: 1.479 ± 0.286
0.592PheMet: 0.592 ± 0.174
0.296PheAsn: 0.296 ± 0.136
1.301PhePro: 1.301 ± 0.279
0.532PheGln: 0.532 ± 0.202
1.361PheArg: 1.361 ± 0.245
0.946PheSer: 0.946 ± 0.286
2.248PheThr: 2.248 ± 0.317
1.775PheVal: 1.775 ± 0.284
0.237PheTrp: 0.237 ± 0.109
0.828PheTyr: 0.828 ± 0.213
0.0PheXaa: 0.0 ± 0.0
Gly
8.282GlyAla: 8.282 ± 0.956
0.532GlyCys: 0.532 ± 0.187
6.034GlyAsp: 6.034 ± 0.631
5.62GlyGlu: 5.62 ± 0.643
2.721GlyPhe: 2.721 ± 0.446
7.453GlyGly: 7.453 ± 0.695
2.189GlyHis: 2.189 ± 0.467
4.259GlyIle: 4.259 ± 0.523
2.721GlyLys: 2.721 ± 0.381
7.158GlyLeu: 7.158 ± 0.949
3.194GlyMet: 3.194 ± 0.449
1.893GlyAsn: 1.893 ± 0.33
3.253GlyPro: 3.253 ± 0.449
3.194GlyGln: 3.194 ± 0.439
5.797GlyArg: 5.797 ± 0.64
5.146GlySer: 5.146 ± 0.632
6.093GlyThr: 6.093 ± 0.633
5.856GlyVal: 5.856 ± 0.589
2.603GlyTrp: 2.603 ± 0.409
2.839GlyTyr: 2.839 ± 0.321
0.0GlyXaa: 0.0 ± 0.0
His
2.07HisAla: 2.07 ± 0.399
0.177HisCys: 0.177 ± 0.094
1.538HisAsp: 1.538 ± 0.275
1.301HisGlu: 1.301 ± 0.262
0.473HisPhe: 0.473 ± 0.183
1.834HisGly: 1.834 ± 0.371
0.355HisHis: 0.355 ± 0.159
0.828HisIle: 0.828 ± 0.249
0.592HisLys: 0.592 ± 0.209
2.07HisLeu: 2.07 ± 0.389
0.355HisMet: 0.355 ± 0.124
0.473HisAsn: 0.473 ± 0.168
1.361HisPro: 1.361 ± 0.311
0.414HisGln: 0.414 ± 0.168
0.946HisArg: 0.946 ± 0.232
0.828HisSer: 0.828 ± 0.224
1.301HisThr: 1.301 ± 0.249
1.301HisVal: 1.301 ± 0.261
0.355HisTrp: 0.355 ± 0.14
0.887HisTyr: 0.887 ± 0.22
0.0HisXaa: 0.0 ± 0.0
Ile
4.673IleAla: 4.673 ± 0.441
0.414IleCys: 0.414 ± 0.182
3.431IleAsp: 3.431 ± 0.488
3.963IleGlu: 3.963 ± 0.449
0.769IlePhe: 0.769 ± 0.2
3.313IleGly: 3.313 ± 0.387
0.592IleHis: 0.592 ± 0.182
1.893IleIle: 1.893 ± 0.451
1.006IleLys: 1.006 ± 0.219
2.484IleLeu: 2.484 ± 0.37
0.946IleMet: 0.946 ± 0.257
1.065IleAsn: 1.065 ± 0.271
3.135IlePro: 3.135 ± 0.573
1.242IleGln: 1.242 ± 0.295
3.313IleArg: 3.313 ± 0.502
2.307IleSer: 2.307 ± 0.364
4.377IleThr: 4.377 ± 0.575
4.022IleVal: 4.022 ± 0.471
0.651IleTrp: 0.651 ± 0.184
0.651IleTyr: 0.651 ± 0.168
0.0IleXaa: 0.0 ± 0.0
Lys
3.786LysAla: 3.786 ± 0.544
0.355LysCys: 0.355 ± 0.151
0.946LysAsp: 0.946 ± 0.24
0.592LysGlu: 0.592 ± 0.208
0.769LysPhe: 0.769 ± 0.276
2.307LysGly: 2.307 ± 0.401
0.532LysHis: 0.532 ± 0.174
1.006LysIle: 1.006 ± 0.234
0.592LysLys: 0.592 ± 0.206
1.065LysLeu: 1.065 ± 0.26
0.71LysMet: 0.71 ± 0.19
0.237LysAsn: 0.237 ± 0.11
1.952LysPro: 1.952 ± 0.381
0.592LysGln: 0.592 ± 0.189
2.425LysArg: 2.425 ± 0.481
1.834LysSer: 1.834 ± 0.312
1.597LysThr: 1.597 ± 0.401
3.017LysVal: 3.017 ± 0.36
0.473LysTrp: 0.473 ± 0.17
0.355LysTyr: 0.355 ± 0.142
0.0LysXaa: 0.0 ± 0.0
Leu
9.82LeuAla: 9.82 ± 0.86
0.296LeuCys: 0.296 ± 0.112
4.851LeuAsp: 4.851 ± 0.607
4.437LeuGlu: 4.437 ± 0.605
1.656LeuPhe: 1.656 ± 0.396
6.566LeuGly: 6.566 ± 0.985
1.242LeuHis: 1.242 ± 0.226
3.963LeuIle: 3.963 ± 0.547
0.651LeuLys: 0.651 ± 0.172
5.915LeuLeu: 5.915 ± 0.704
2.011LeuMet: 2.011 ± 0.354
1.656LeuAsn: 1.656 ± 0.297
5.087LeuPro: 5.087 ± 0.493
2.307LeuGln: 2.307 ± 0.384
4.969LeuArg: 4.969 ± 0.698
5.797LeuSer: 5.797 ± 0.659
6.507LeuThr: 6.507 ± 0.683
6.093LeuVal: 6.093 ± 0.591
1.242LeuTrp: 1.242 ± 0.273
1.183LeuTyr: 1.183 ± 0.271
0.0LeuXaa: 0.0 ± 0.0
Met
3.194MetAla: 3.194 ± 0.483
0.177MetCys: 0.177 ± 0.103
1.065MetAsp: 1.065 ± 0.281
0.414MetGlu: 0.414 ± 0.142
0.355MetPhe: 0.355 ± 0.154
2.011MetGly: 2.011 ± 0.409
0.473MetHis: 0.473 ± 0.162
1.834MetIle: 1.834 ± 0.376
0.355MetLys: 0.355 ± 0.118
1.538MetLeu: 1.538 ± 0.308
0.414MetMet: 0.414 ± 0.165
0.71MetAsn: 0.71 ± 0.213
1.361MetPro: 1.361 ± 0.318
0.769MetGln: 0.769 ± 0.224
1.952MetArg: 1.952 ± 0.299
2.484MetSer: 2.484 ± 0.34
2.721MetThr: 2.721 ± 0.336
1.775MetVal: 1.775 ± 0.298
0.355MetTrp: 0.355 ± 0.143
0.71MetTyr: 0.71 ± 0.172
0.0MetXaa: 0.0 ± 0.0
Asn
3.253AsnAla: 3.253 ± 0.371
0.237AsnCys: 0.237 ± 0.124
0.946AsnAsp: 0.946 ± 0.213
0.946AsnGlu: 0.946 ± 0.246
0.355AsnPhe: 0.355 ± 0.157
3.076AsnGly: 3.076 ± 0.492
0.414AsnHis: 0.414 ± 0.188
0.71AsnIle: 0.71 ± 0.219
0.355AsnLys: 0.355 ± 0.136
2.544AsnLeu: 2.544 ± 0.377
0.355AsnMet: 0.355 ± 0.134
0.532AsnAsn: 0.532 ± 0.157
2.425AsnPro: 2.425 ± 0.581
0.473AsnGln: 0.473 ± 0.169
1.893AsnArg: 1.893 ± 0.329
1.538AsnSer: 1.538 ± 0.335
2.07AsnThr: 2.07 ± 0.359
1.42AsnVal: 1.42 ± 0.248
0.177AsnTrp: 0.177 ± 0.088
0.473AsnTyr: 0.473 ± 0.154
0.0AsnXaa: 0.0 ± 0.0
Pro
7.69ProAla: 7.69 ± 1.008
0.71ProCys: 0.71 ± 0.211
4.614ProAsp: 4.614 ± 0.656
5.856ProGlu: 5.856 ± 0.716
1.538ProPhe: 1.538 ± 0.274
5.442ProGly: 5.442 ± 0.516
1.124ProHis: 1.124 ± 0.268
1.538ProIle: 1.538 ± 0.252
1.538ProLys: 1.538 ± 0.273
4.082ProLeu: 4.082 ± 0.495
1.538ProMet: 1.538 ± 0.33
1.538ProAsn: 1.538 ± 0.365
3.549ProPro: 3.549 ± 0.483
1.715ProGln: 1.715 ± 0.493
2.662ProArg: 2.662 ± 0.355
3.727ProSer: 3.727 ± 0.449
4.496ProThr: 4.496 ± 0.9
4.2ProVal: 4.2 ± 0.524
0.769ProTrp: 0.769 ± 0.203
1.538ProTyr: 1.538 ± 0.281
0.0ProXaa: 0.0 ± 0.0
Gln
3.668GlnAla: 3.668 ± 0.553
0.414GlnCys: 0.414 ± 0.183
1.952GlnAsp: 1.952 ± 0.355
1.242GlnGlu: 1.242 ± 0.228
1.301GlnPhe: 1.301 ± 0.279
2.662GlnGly: 2.662 ± 0.423
0.887GlnHis: 0.887 ± 0.216
1.834GlnIle: 1.834 ± 0.304
0.769GlnLys: 0.769 ± 0.213
1.065GlnLeu: 1.065 ± 0.325
1.242GlnMet: 1.242 ± 0.268
1.183GlnAsn: 1.183 ± 0.286
1.834GlnPro: 1.834 ± 0.328
1.124GlnGln: 1.124 ± 0.268
2.13GlnArg: 2.13 ± 0.398
2.07GlnSer: 2.07 ± 0.287
2.248GlnThr: 2.248 ± 0.363
2.425GlnVal: 2.425 ± 0.478
0.651GlnTrp: 0.651 ± 0.189
1.065GlnTyr: 1.065 ± 0.262
0.0GlnXaa: 0.0 ± 0.0
Arg
7.217ArgAla: 7.217 ± 0.661
0.887ArgCys: 0.887 ± 0.264
4.614ArgAsp: 4.614 ± 0.672
5.028ArgGlu: 5.028 ± 0.669
1.656ArgPhe: 1.656 ± 0.379
6.093ArgGly: 6.093 ± 0.619
1.538ArgHis: 1.538 ± 0.33
2.662ArgIle: 2.662 ± 0.43
2.13ArgLys: 2.13 ± 0.335
6.389ArgLeu: 6.389 ± 0.614
1.715ArgMet: 1.715 ± 0.315
1.242ArgAsn: 1.242 ± 0.257
3.076ArgPro: 3.076 ± 0.453
2.839ArgGln: 2.839 ± 0.472
7.039ArgArg: 7.039 ± 0.814
3.313ArgSer: 3.313 ± 0.445
3.49ArgThr: 3.49 ± 0.499
4.732ArgVal: 4.732 ± 0.578
1.715ArgTrp: 1.715 ± 0.344
2.544ArgTyr: 2.544 ± 0.428
0.0ArgXaa: 0.0 ± 0.0
Ser
8.518SerAla: 8.518 ± 1.201
0.237SerCys: 0.237 ± 0.127
5.087SerAsp: 5.087 ± 0.495
3.668SerGlu: 3.668 ± 0.471
1.538SerPhe: 1.538 ± 0.313
4.91SerGly: 4.91 ± 0.602
1.065SerHis: 1.065 ± 0.215
2.899SerIle: 2.899 ± 0.482
1.775SerLys: 1.775 ± 0.332
4.141SerLeu: 4.141 ± 0.517
1.597SerMet: 1.597 ± 0.317
1.775SerAsn: 1.775 ± 0.302
2.484SerPro: 2.484 ± 0.391
1.242SerGln: 1.242 ± 0.282
3.431SerArg: 3.431 ± 0.46
2.366SerSer: 2.366 ± 0.515
4.377SerThr: 4.377 ± 0.551
4.377SerVal: 4.377 ± 0.625
1.124SerTrp: 1.124 ± 0.226
1.42SerTyr: 1.42 ± 0.405
0.0SerXaa: 0.0 ± 0.0
Thr
8.282ThrAla: 8.282 ± 0.896
0.71ThrCys: 0.71 ± 0.226
4.022ThrAsp: 4.022 ± 0.53
4.318ThrGlu: 4.318 ± 0.566
1.301ThrPhe: 1.301 ± 0.275
6.684ThrGly: 6.684 ± 0.79
1.42ThrHis: 1.42 ± 0.313
3.194ThrIle: 3.194 ± 0.54
2.07ThrLys: 2.07 ± 0.464
5.975ThrLeu: 5.975 ± 0.601
1.301ThrMet: 1.301 ± 0.299
1.42ThrAsn: 1.42 ± 0.281
5.975ThrPro: 5.975 ± 0.81
1.479ThrGln: 1.479 ± 0.25
4.141ThrArg: 4.141 ± 0.511
3.963ThrSer: 3.963 ± 0.463
4.969ThrThr: 4.969 ± 0.566
6.625ThrVal: 6.625 ± 0.537
0.887ThrTrp: 0.887 ± 0.25
1.952ThrTyr: 1.952 ± 0.244
0.0ThrXaa: 0.0 ± 0.0
Val
8.4ValAla: 8.4 ± 0.703
0.769ValCys: 0.769 ± 0.218
3.608ValAsp: 3.608 ± 0.665
4.614ValGlu: 4.614 ± 0.494
1.42ValPhe: 1.42 ± 0.255
6.329ValGly: 6.329 ± 0.663
0.828ValHis: 0.828 ± 0.222
3.786ValIle: 3.786 ± 0.456
2.248ValLys: 2.248 ± 0.351
5.679ValLeu: 5.679 ± 0.532
1.479ValMet: 1.479 ± 0.288
2.544ValAsn: 2.544 ± 0.333
5.206ValPro: 5.206 ± 0.456
2.425ValGln: 2.425 ± 0.464
5.028ValArg: 5.028 ± 0.564
4.851ValSer: 4.851 ± 0.564
6.211ValThr: 6.211 ± 0.521
5.087ValVal: 5.087 ± 0.567
1.656ValTrp: 1.656 ± 0.345
2.248ValTyr: 2.248 ± 0.366
0.0ValXaa: 0.0 ± 0.0
Trp
2.189TrpAla: 2.189 ± 0.385
0.414TrpCys: 0.414 ± 0.156
0.946TrpAsp: 0.946 ± 0.259
1.124TrpGlu: 1.124 ± 0.235
0.946TrpPhe: 0.946 ± 0.234
1.301TrpGly: 1.301 ± 0.274
0.414TrpHis: 0.414 ± 0.143
0.887TrpIle: 0.887 ± 0.285
0.71TrpLys: 0.71 ± 0.19
1.715TrpLeu: 1.715 ± 0.34
0.177TrpMet: 0.177 ± 0.102
0.71TrpAsn: 0.71 ± 0.216
0.71TrpPro: 0.71 ± 0.202
0.592TrpGln: 0.592 ± 0.184
1.715TrpArg: 1.715 ± 0.325
1.301TrpSer: 1.301 ± 0.28
0.828TrpThr: 0.828 ± 0.207
1.538TrpVal: 1.538 ± 0.292
0.473TrpTrp: 0.473 ± 0.153
0.71TrpTyr: 0.71 ± 0.191
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.899TyrAla: 2.899 ± 0.483
0.118TyrCys: 0.118 ± 0.076
2.603TyrAsp: 2.603 ± 0.395
1.597TyrGlu: 1.597 ± 0.29
0.414TyrPhe: 0.414 ± 0.194
2.544TyrGly: 2.544 ± 0.397
0.237TyrHis: 0.237 ± 0.099
1.124TyrIle: 1.124 ± 0.231
0.592TyrLys: 0.592 ± 0.222
1.952TyrLeu: 1.952 ± 0.347
0.592TyrMet: 0.592 ± 0.21
0.592TyrAsn: 0.592 ± 0.155
1.124TyrPro: 1.124 ± 0.225
1.124TyrGln: 1.124 ± 0.272
2.307TyrArg: 2.307 ± 0.339
1.775TyrSer: 1.775 ± 0.393
1.775TyrThr: 1.775 ± 0.289
2.07TyrVal: 2.07 ± 0.354
0.532TyrTrp: 0.532 ± 0.182
0.946TyrTyr: 0.946 ± 0.264
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (16906 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski