Amino acid dipepetide frequency for Gordonia phage Wocket

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.571AlaAla: 16.571 ± 1.148
0.736AlaCys: 0.736 ± 0.194
7.487AlaAsp: 7.487 ± 0.932
7.058AlaGlu: 7.058 ± 0.591
3.13AlaPhe: 3.13 ± 0.483
10.065AlaGly: 10.065 ± 1.068
1.596AlaHis: 1.596 ± 0.344
5.462AlaIle: 5.462 ± 0.548
3.498AlaLys: 3.498 ± 0.753
9.697AlaLeu: 9.697 ± 0.76
2.7AlaMet: 2.7 ± 0.507
3.314AlaAsn: 3.314 ± 0.496
6.014AlaPro: 6.014 ± 0.698
4.419AlaGln: 4.419 ± 0.582
9.144AlaArg: 9.144 ± 0.825
6.26AlaSer: 6.26 ± 0.657
7.426AlaThr: 7.426 ± 0.493
8.408AlaVal: 8.408 ± 0.866
2.148AlaTrp: 2.148 ± 0.308
2.332AlaTyr: 2.332 ± 0.373
0.0AlaXaa: 0.0 ± 0.0
Cys
0.614CysAla: 0.614 ± 0.223
0.123CysCys: 0.123 ± 0.12
0.921CysAsp: 0.921 ± 0.35
0.368CysGlu: 0.368 ± 0.167
0.0CysPhe: 0.0 ± 0.0
1.043CysGly: 1.043 ± 0.303
0.123CysHis: 0.123 ± 0.089
0.245CysIle: 0.245 ± 0.114
0.123CysLys: 0.123 ± 0.086
0.675CysLeu: 0.675 ± 0.242
0.123CysMet: 0.123 ± 0.094
0.491CysAsn: 0.491 ± 0.184
0.736CysPro: 0.736 ± 0.249
0.43CysGln: 0.43 ± 0.155
0.736CysArg: 0.736 ± 0.229
0.43CysSer: 0.43 ± 0.157
0.552CysThr: 0.552 ± 0.186
0.491CysVal: 0.491 ± 0.182
0.123CysTrp: 0.123 ± 0.133
0.061CysTyr: 0.061 ± 0.068
0.0CysXaa: 0.0 ± 0.0
Asp
6.935AspAla: 6.935 ± 0.709
0.552AspCys: 0.552 ± 0.216
6.014AspAsp: 6.014 ± 0.881
4.542AspGlu: 4.542 ± 0.627
2.148AspPhe: 2.148 ± 0.411
6.444AspGly: 6.444 ± 0.743
1.657AspHis: 1.657 ± 0.336
2.762AspIle: 2.762 ± 0.412
1.903AspLys: 1.903 ± 0.261
5.769AspLeu: 5.769 ± 0.604
1.473AspMet: 1.473 ± 0.316
1.841AspAsn: 1.841 ± 0.349
3.989AspPro: 3.989 ± 0.435
2.209AspGln: 2.209 ± 0.402
5.094AspArg: 5.094 ± 0.87
3.56AspSer: 3.56 ± 0.497
4.235AspThr: 4.235 ± 0.578
6.26AspVal: 6.26 ± 0.7
0.982AspTrp: 0.982 ± 0.221
1.841AspTyr: 1.841 ± 0.29
0.0AspXaa: 0.0 ± 0.0
Glu
6.137GluAla: 6.137 ± 0.688
0.368GluCys: 0.368 ± 0.125
3.191GluAsp: 3.191 ± 0.454
3.069GluGlu: 3.069 ± 0.662
1.903GluPhe: 1.903 ± 0.347
3.989GluGly: 3.989 ± 0.476
1.78GluHis: 1.78 ± 0.307
2.271GluIle: 2.271 ± 0.313
1.903GluLys: 1.903 ± 0.399
5.033GluLeu: 5.033 ± 0.691
1.534GluMet: 1.534 ± 0.265
1.166GluAsn: 1.166 ± 0.25
3.866GluPro: 3.866 ± 0.685
3.253GluGln: 3.253 ± 0.552
5.278GluArg: 5.278 ± 0.829
2.394GluSer: 2.394 ± 0.359
3.069GluThr: 3.069 ± 0.354
5.033GluVal: 5.033 ± 0.606
1.289GluTrp: 1.289 ± 0.311
1.473GluTyr: 1.473 ± 0.344
0.0GluXaa: 0.0 ± 0.0
Phe
2.762PheAla: 2.762 ± 0.442
0.368PheCys: 0.368 ± 0.144
2.025PheAsp: 2.025 ± 0.293
2.025PheGlu: 2.025 ± 0.328
0.859PhePhe: 0.859 ± 0.232
2.332PheGly: 2.332 ± 0.342
0.368PheHis: 0.368 ± 0.19
1.289PheIle: 1.289 ± 0.294
1.043PheLys: 1.043 ± 0.22
1.903PheLeu: 1.903 ± 0.46
0.614PheMet: 0.614 ± 0.147
0.798PheAsn: 0.798 ± 0.215
1.718PhePro: 1.718 ± 0.261
0.675PheGln: 0.675 ± 0.209
1.841PheArg: 1.841 ± 0.282
1.412PheSer: 1.412 ± 0.382
1.78PheThr: 1.78 ± 0.306
3.007PheVal: 3.007 ± 0.411
0.184PheTrp: 0.184 ± 0.124
0.491PheTyr: 0.491 ± 0.155
0.0PheXaa: 0.0 ± 0.0
Gly
8.469GlyAla: 8.469 ± 1.014
0.43GlyCys: 0.43 ± 0.164
5.401GlyAsp: 5.401 ± 0.522
4.787GlyGlu: 4.787 ± 0.643
2.946GlyPhe: 2.946 ± 0.591
6.751GlyGly: 6.751 ± 0.96
1.657GlyHis: 1.657 ± 0.305
4.357GlyIle: 4.357 ± 0.575
3.191GlyLys: 3.191 ± 0.549
6.812GlyLeu: 6.812 ± 1.254
1.841GlyMet: 1.841 ± 0.303
2.578GlyAsn: 2.578 ± 0.294
3.56GlyPro: 3.56 ± 0.572
3.007GlyGln: 3.007 ± 0.36
6.26GlyArg: 6.26 ± 0.66
4.971GlySer: 4.971 ± 0.516
4.971GlyThr: 4.971 ± 0.656
5.524GlyVal: 5.524 ± 0.667
2.087GlyTrp: 2.087 ± 0.289
2.7GlyTyr: 2.7 ± 0.475
0.0GlyXaa: 0.0 ± 0.0
His
2.209HisAla: 2.209 ± 0.369
0.184HisCys: 0.184 ± 0.113
1.718HisAsp: 1.718 ± 0.362
1.105HisGlu: 1.105 ± 0.236
0.736HisPhe: 0.736 ± 0.216
1.78HisGly: 1.78 ± 0.325
0.245HisHis: 0.245 ± 0.116
1.412HisIle: 1.412 ± 0.336
0.614HisLys: 0.614 ± 0.209
1.718HisLeu: 1.718 ± 0.303
0.245HisMet: 0.245 ± 0.125
0.368HisAsn: 0.368 ± 0.158
2.025HisPro: 2.025 ± 0.398
0.798HisGln: 0.798 ± 0.2
1.412HisArg: 1.412 ± 0.273
0.675HisSer: 0.675 ± 0.189
1.289HisThr: 1.289 ± 0.301
1.35HisVal: 1.35 ± 0.255
0.614HisTrp: 0.614 ± 0.219
0.552HisTyr: 0.552 ± 0.178
0.0HisXaa: 0.0 ± 0.0
Ile
6.444IleAla: 6.444 ± 0.662
0.43IleCys: 0.43 ± 0.146
4.664IleAsp: 4.664 ± 0.667
3.007IleGlu: 3.007 ± 0.412
0.798IlePhe: 0.798 ± 0.235
3.314IleGly: 3.314 ± 0.627
1.043IleHis: 1.043 ± 0.28
1.412IleIle: 1.412 ± 0.309
1.412IleLys: 1.412 ± 0.534
2.762IleLeu: 2.762 ± 0.346
0.491IleMet: 0.491 ± 0.212
1.227IleAsn: 1.227 ± 0.252
3.191IlePro: 3.191 ± 0.423
1.043IleGln: 1.043 ± 0.236
3.56IleArg: 3.56 ± 0.421
1.964IleSer: 1.964 ± 0.364
3.928IleThr: 3.928 ± 0.546
3.682IleVal: 3.682 ± 0.402
0.43IleTrp: 0.43 ± 0.154
1.105IleTyr: 1.105 ± 0.247
0.0IleXaa: 0.0 ± 0.0
Lys
4.051LysAla: 4.051 ± 0.466
0.061LysCys: 0.061 ± 0.069
1.473LysAsp: 1.473 ± 0.377
1.473LysGlu: 1.473 ± 0.314
0.982LysPhe: 0.982 ± 0.264
2.578LysGly: 2.578 ± 0.534
0.552LysHis: 0.552 ± 0.227
1.596LysIle: 1.596 ± 0.296
1.534LysLys: 1.534 ± 0.315
3.069LysLeu: 3.069 ± 0.445
0.307LysMet: 0.307 ± 0.134
0.921LysAsn: 0.921 ± 0.263
2.639LysPro: 2.639 ± 0.428
0.921LysGln: 0.921 ± 0.28
1.78LysArg: 1.78 ± 0.286
1.841LysSer: 1.841 ± 0.308
1.903LysThr: 1.903 ± 0.381
2.884LysVal: 2.884 ± 0.417
0.614LysTrp: 0.614 ± 0.197
0.736LysTyr: 0.736 ± 0.243
0.0LysXaa: 0.0 ± 0.0
Leu
10.986LeuAla: 10.986 ± 0.92
0.675LeuCys: 0.675 ± 0.198
5.769LeuAsp: 5.769 ± 0.632
3.805LeuGlu: 3.805 ± 0.539
2.025LeuPhe: 2.025 ± 0.297
6.014LeuGly: 6.014 ± 0.694
1.043LeuHis: 1.043 ± 0.24
4.051LeuIle: 4.051 ± 0.492
1.718LeuLys: 1.718 ± 0.322
5.155LeuLeu: 5.155 ± 0.565
1.964LeuMet: 1.964 ± 0.313
2.7LeuAsn: 2.7 ± 0.409
4.48LeuPro: 4.48 ± 0.44
1.841LeuGln: 1.841 ± 0.326
5.217LeuArg: 5.217 ± 0.566
4.726LeuSer: 4.726 ± 0.583
6.567LeuThr: 6.567 ± 0.613
5.953LeuVal: 5.953 ± 0.606
2.087LeuTrp: 2.087 ± 0.297
1.289LeuTyr: 1.289 ± 0.242
0.0LeuXaa: 0.0 ± 0.0
Met
3.437MetAla: 3.437 ± 0.655
0.245MetCys: 0.245 ± 0.11
0.552MetAsp: 0.552 ± 0.17
0.859MetGlu: 0.859 ± 0.224
0.675MetPhe: 0.675 ± 0.208
1.227MetGly: 1.227 ± 0.229
0.368MetHis: 0.368 ± 0.157
0.921MetIle: 0.921 ± 0.243
0.491MetLys: 0.491 ± 0.191
1.596MetLeu: 1.596 ± 0.313
0.307MetMet: 0.307 ± 0.154
0.307MetAsn: 0.307 ± 0.153
1.657MetPro: 1.657 ± 0.346
0.736MetGln: 0.736 ± 0.296
1.903MetArg: 1.903 ± 0.505
1.534MetSer: 1.534 ± 0.288
3.191MetThr: 3.191 ± 0.456
0.736MetVal: 0.736 ± 0.249
0.614MetTrp: 0.614 ± 0.179
0.43MetTyr: 0.43 ± 0.148
0.0MetXaa: 0.0 ± 0.0
Asn
3.069AsnAla: 3.069 ± 0.404
0.184AsnCys: 0.184 ± 0.102
2.087AsnAsp: 2.087 ± 0.302
1.412AsnGlu: 1.412 ± 0.279
0.43AsnPhe: 0.43 ± 0.135
3.007AsnGly: 3.007 ± 0.431
0.614AsnHis: 0.614 ± 0.162
1.227AsnIle: 1.227 ± 0.335
0.859AsnLys: 0.859 ± 0.207
1.964AsnLeu: 1.964 ± 0.307
0.43AsnMet: 0.43 ± 0.163
0.675AsnAsn: 0.675 ± 0.227
2.7AsnPro: 2.7 ± 0.417
0.736AsnGln: 0.736 ± 0.199
2.148AsnArg: 2.148 ± 0.472
1.534AsnSer: 1.534 ± 0.277
2.455AsnThr: 2.455 ± 0.448
1.78AsnVal: 1.78 ± 0.405
0.368AsnTrp: 0.368 ± 0.123
0.675AsnTyr: 0.675 ± 0.217
0.0AsnXaa: 0.0 ± 0.0
Pro
7.426ProAla: 7.426 ± 1.129
0.736ProCys: 0.736 ± 0.254
4.971ProAsp: 4.971 ± 0.657
3.866ProGlu: 3.866 ± 0.464
1.412ProPhe: 1.412 ± 0.239
4.91ProGly: 4.91 ± 0.559
1.473ProHis: 1.473 ± 0.407
3.13ProIle: 3.13 ± 0.473
2.455ProLys: 2.455 ± 0.37
3.621ProLeu: 3.621 ± 0.481
1.841ProMet: 1.841 ± 0.434
2.271ProAsn: 2.271 ± 0.344
3.191ProPro: 3.191 ± 0.51
1.841ProGln: 1.841 ± 0.323
3.253ProArg: 3.253 ± 0.489
3.069ProSer: 3.069 ± 0.448
4.235ProThr: 4.235 ± 0.622
3.375ProVal: 3.375 ± 0.418
1.289ProTrp: 1.289 ± 0.267
1.473ProTyr: 1.473 ± 0.328
0.0ProXaa: 0.0 ± 0.0
Gln
3.253GlnAla: 3.253 ± 0.381
0.061GlnCys: 0.061 ± 0.063
1.412GlnAsp: 1.412 ± 0.307
1.412GlnGlu: 1.412 ± 0.329
1.043GlnPhe: 1.043 ± 0.254
2.025GlnGly: 2.025 ± 0.451
1.227GlnHis: 1.227 ± 0.344
2.087GlnIle: 2.087 ± 0.318
0.798GlnLys: 0.798 ± 0.2
3.191GlnLeu: 3.191 ± 0.406
0.859GlnMet: 0.859 ± 0.203
0.736GlnAsn: 0.736 ± 0.203
2.087GlnPro: 2.087 ± 0.343
1.412GlnGln: 1.412 ± 0.309
2.946GlnArg: 2.946 ± 0.37
1.78GlnSer: 1.78 ± 0.277
2.025GlnThr: 2.025 ± 0.355
2.7GlnVal: 2.7 ± 0.456
1.166GlnTrp: 1.166 ± 0.278
0.982GlnTyr: 0.982 ± 0.302
0.0GlnXaa: 0.0 ± 0.0
Arg
7.733ArgAla: 7.733 ± 0.901
0.921ArgCys: 0.921 ± 0.266
5.769ArgAsp: 5.769 ± 0.577
3.989ArgGlu: 3.989 ± 0.483
1.964ArgPhe: 1.964 ± 0.312
7.058ArgGly: 7.058 ± 0.666
1.841ArgHis: 1.841 ± 0.369
3.437ArgIle: 3.437 ± 0.462
2.455ArgLys: 2.455 ± 0.342
6.26ArgLeu: 6.26 ± 0.626
1.964ArgMet: 1.964 ± 0.353
2.271ArgAsn: 2.271 ± 0.409
3.682ArgPro: 3.682 ± 0.435
2.516ArgGln: 2.516 ± 0.478
7.242ArgArg: 7.242 ± 0.965
3.744ArgSer: 3.744 ± 0.504
4.603ArgThr: 4.603 ± 0.429
4.91ArgVal: 4.91 ± 0.516
1.657ArgTrp: 1.657 ± 0.331
1.412ArgTyr: 1.412 ± 0.299
0.0ArgXaa: 0.0 ± 0.0
Ser
5.892SerAla: 5.892 ± 0.661
0.184SerCys: 0.184 ± 0.114
3.007SerAsp: 3.007 ± 0.43
3.253SerGlu: 3.253 ± 0.378
1.35SerPhe: 1.35 ± 0.235
4.971SerGly: 4.971 ± 0.697
1.105SerHis: 1.105 ± 0.209
2.7SerIle: 2.7 ± 0.383
1.596SerLys: 1.596 ± 0.339
3.437SerLeu: 3.437 ± 0.401
1.227SerMet: 1.227 ± 0.295
1.166SerAsn: 1.166 ± 0.226
2.823SerPro: 2.823 ± 0.355
0.982SerGln: 0.982 ± 0.284
3.989SerArg: 3.989 ± 0.476
2.148SerSer: 2.148 ± 0.51
4.112SerThr: 4.112 ± 0.453
4.726SerVal: 4.726 ± 0.571
1.105SerTrp: 1.105 ± 0.214
0.614SerTyr: 0.614 ± 0.154
0.0SerXaa: 0.0 ± 0.0
Thr
8.163ThrAla: 8.163 ± 0.756
0.491ThrCys: 0.491 ± 0.179
4.787ThrAsp: 4.787 ± 0.723
4.051ThrGlu: 4.051 ± 0.512
2.394ThrPhe: 2.394 ± 0.405
6.014ThrGly: 6.014 ± 0.665
1.473ThrHis: 1.473 ± 0.31
3.13ThrIle: 3.13 ± 0.509
2.271ThrLys: 2.271 ± 0.324
5.953ThrLeu: 5.953 ± 0.637
1.166ThrMet: 1.166 ± 0.24
2.025ThrAsn: 2.025 ± 0.361
4.848ThrPro: 4.848 ± 0.603
1.841ThrGln: 1.841 ± 0.295
4.173ThrArg: 4.173 ± 0.608
3.253ThrSer: 3.253 ± 0.441
4.48ThrThr: 4.48 ± 0.531
6.321ThrVal: 6.321 ± 0.627
1.289ThrTrp: 1.289 ± 0.267
1.534ThrTyr: 1.534 ± 0.342
0.0ThrXaa: 0.0 ± 0.0
Val
8.96ValAla: 8.96 ± 0.783
0.982ValCys: 0.982 ± 0.289
6.076ValAsp: 6.076 ± 0.601
5.217ValGlu: 5.217 ± 0.61
1.718ValPhe: 1.718 ± 0.324
5.339ValGly: 5.339 ± 0.646
1.534ValHis: 1.534 ± 0.292
2.884ValIle: 2.884 ± 0.45
2.578ValLys: 2.578 ± 0.317
5.892ValLeu: 5.892 ± 0.744
1.596ValMet: 1.596 ± 0.388
1.964ValAsn: 1.964 ± 0.359
4.051ValPro: 4.051 ± 0.635
3.007ValGln: 3.007 ± 0.364
5.708ValArg: 5.708 ± 0.707
3.314ValSer: 3.314 ± 0.49
5.769ValThr: 5.769 ± 0.831
7.242ValVal: 7.242 ± 0.82
1.596ValTrp: 1.596 ± 0.367
1.841ValTyr: 1.841 ± 0.282
0.0ValXaa: 0.0 ± 0.0
Trp
1.657TrpAla: 1.657 ± 0.316
0.43TrpCys: 0.43 ± 0.158
1.596TrpAsp: 1.596 ± 0.35
0.921TrpGlu: 0.921 ± 0.304
0.552TrpPhe: 0.552 ± 0.219
1.105TrpGly: 1.105 ± 0.212
0.614TrpHis: 0.614 ± 0.22
0.675TrpIle: 0.675 ± 0.185
0.675TrpLys: 0.675 ± 0.192
2.148TrpLeu: 2.148 ± 0.42
0.552TrpMet: 0.552 ± 0.2
0.859TrpAsn: 0.859 ± 0.287
1.596TrpPro: 1.596 ± 0.264
0.921TrpGln: 0.921 ± 0.251
1.718TrpArg: 1.718 ± 0.271
0.798TrpSer: 0.798 ± 0.196
1.35TrpThr: 1.35 ± 0.266
1.473TrpVal: 1.473 ± 0.286
0.675TrpTrp: 0.675 ± 0.198
0.368TrpTyr: 0.368 ± 0.138
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.7TyrAla: 2.7 ± 0.438
0.245TyrCys: 0.245 ± 0.126
1.289TyrAsp: 1.289 ± 0.315
1.78TyrGlu: 1.78 ± 0.384
0.43TyrPhe: 0.43 ± 0.189
2.332TyrGly: 2.332 ± 0.358
0.736TyrHis: 0.736 ± 0.214
0.859TyrIle: 0.859 ± 0.237
0.798TyrLys: 0.798 ± 0.191
1.412TyrLeu: 1.412 ± 0.299
0.491TyrMet: 0.491 ± 0.163
0.675TyrAsn: 0.675 ± 0.203
0.982TyrPro: 0.982 ± 0.257
0.614TyrGln: 0.614 ± 0.226
2.025TyrArg: 2.025 ± 0.381
1.043TyrSer: 1.043 ± 0.279
1.718TyrThr: 1.718 ± 0.321
1.412TyrVal: 1.412 ± 0.311
0.368TyrTrp: 0.368 ± 0.16
0.675TyrTyr: 0.675 ± 0.21
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (16295 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski