Amino acid dipeptide frequency for Gordonia phage Hotorobo

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
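As an illustration of the calculation described above, here is a minimal Python sketch. It assumes overlapping two-residue windows counted within each protein and normalised by the total number of windows; the exact counting and normalisation used by Proteome-pI are not stated on this page, so the function names (`dipeptide_permille`, `colour`) and these choices are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences.

    Assumption: overlapping windows of length 2, never spanning two proteins,
    normalised by the total number of windows.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Uniform expectation for 400 dipeptides: 0.25%, i.e. 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400

def colour(freq_permille, expected=UNIFORM_PERMILLE):
    """Classify a frequency as over- ('red') or under-represented ('blue')."""
    if freq_permille > expected:
        return "red"
    if freq_permille < expected:
        return "blue"
    return "none"
```

For example, `colour(12.818)` classifies AlaAla as over-represented and `colour(0.827)` classifies AlaCys as under-represented relative to the uniform 2.5‰ baseline.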

All values are presented in per mille (‰); to convert them to fractions, multiply by 10⁻³.
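The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the AlaAla value is taken from the table, and 2.5‰ is the uniform expectation for 400 dipeptides.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value_permille / 10.0

ala_ala = 12.818                                  # AlaAla frequency from the table, in per mille
ala_ala_fraction = permille_to_fraction(ala_ala)  # approximately 0.0128
ala_ala_percent = permille_to_percent(ala_ala)    # approximately 1.28%
fold_over_uniform = ala_ala / 2.5                 # roughly 5.1 times the uniform expectation
```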

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry below gives the dipeptide (first residue, second residue), its frequency and the error, both in ‰.
Ala
AlaAla: 12.818 ± 3.511
AlaCys: 0.827 ± 0.213
AlaAsp: 5.169 ± 0.472
AlaGlu: 6.037 ± 0.548
AlaPhe: 2.853 ± 0.484
AlaGly: 7.443 ± 0.822
AlaHis: 1.116 ± 0.182
AlaIle: 5.127 ± 0.689
AlaLys: 4.507 ± 0.609
AlaLeu: 7.774 ± 0.767
AlaMet: 2.936 ± 0.491
AlaAsn: 3.349 ± 0.521
AlaPro: 4.548 ± 0.598
AlaGln: 4.631 ± 1.021
AlaArg: 5.169 ± 0.508
AlaSer: 6.326 ± 0.898
AlaThr: 5.541 ± 0.565
AlaVal: 5.83 ± 0.643
AlaTrp: 1.943 ± 0.516
AlaTyr: 2.646 ± 0.359
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.744 ± 0.223
CysCys: 0.165 ± 0.091
CysAsp: 0.703 ± 0.195
CysGlu: 0.703 ± 0.195
CysPhe: 0.207 ± 0.09
CysGly: 1.075 ± 0.248
CysHis: 0.165 ± 0.094
CysIle: 0.538 ± 0.153
CysLys: 0.703 ± 0.256
CysLeu: 0.579 ± 0.157
CysMet: 0.124 ± 0.073
CysAsn: 0.538 ± 0.166
CysPro: 0.496 ± 0.165
CysGln: 0.331 ± 0.117
CysArg: 0.579 ± 0.162
CysSer: 0.62 ± 0.218
CysThr: 0.703 ± 0.166
CysVal: 0.662 ± 0.197
CysTrp: 0.207 ± 0.098
CysTyr: 0.455 ± 0.141
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.003 ± 0.516
AspCys: 0.703 ± 0.192
AspAsp: 4.052 ± 0.596
AspGlu: 5.21 ± 0.652
AspPhe: 2.316 ± 0.337
AspGly: 4.3 ± 0.43
AspHis: 1.447 ± 0.275
AspIle: 3.019 ± 0.401
AspLys: 2.936 ± 0.336
AspLeu: 4.218 ± 0.411
AspMet: 1.943 ± 0.251
AspAsn: 2.812 ± 0.325
AspPro: 4.424 ± 0.53
AspGln: 2.026 ± 0.237
AspArg: 2.977 ± 0.508
AspSer: 3.473 ± 0.367
AspThr: 3.225 ± 0.407
AspVal: 3.763 ± 0.427
AspTrp: 1.447 ± 0.21
AspTyr: 2.481 ± 0.36
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.409 ± 0.711
GluCys: 0.827 ± 0.247
GluAsp: 3.887 ± 0.518
GluGlu: 6.285 ± 0.722
GluPhe: 2.688 ± 0.28
GluGly: 4.466 ± 0.562
GluHis: 1.116 ± 0.227
GluIle: 3.97 ± 0.614
GluLys: 3.019 ± 0.318
GluLeu: 5.541 ± 0.676
GluMet: 2.233 ± 0.35
GluAsn: 1.861 ± 0.233
GluPro: 3.184 ± 0.536
GluGln: 3.391 ± 0.433
GluArg: 4.342 ± 0.628
GluSer: 4.3 ± 0.373
GluThr: 3.763 ± 0.367
GluVal: 5.375 ± 0.484
GluTrp: 1.282 ± 0.192
GluTyr: 2.481 ± 0.407
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.308 ± 0.439
PheCys: 0.496 ± 0.159
PheAsp: 2.977 ± 0.349
PheGlu: 2.688 ± 0.38
PhePhe: 0.992 ± 0.249
PheGly: 3.308 ± 0.417
PheHis: 0.455 ± 0.153
PheIle: 1.695 ± 0.243
PheLys: 2.646 ± 0.276
PheLeu: 2.109 ± 0.363
PheMet: 1.158 ± 0.202
PheAsn: 1.282 ± 0.218
PhePro: 1.489 ± 0.246
PheGln: 1.158 ± 0.258
PheArg: 1.53 ± 0.209
PheSer: 2.15 ± 0.319
PheThr: 2.316 ± 0.322
PheVal: 1.613 ± 0.256
PheTrp: 0.455 ± 0.119
PheTyr: 0.703 ± 0.181
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.278 ± 0.868
GlyCys: 0.744 ± 0.211
GlyAsp: 4.59 ± 0.41
GlyGlu: 4.3 ± 0.352
GlyPhe: 3.06 ± 0.387
GlyGly: 6.533 ± 0.586
GlyHis: 1.778 ± 0.229
GlyIle: 4.548 ± 0.361
GlyLys: 5.003 ± 0.427
GlyLeu: 5.375 ± 0.609
GlyMet: 2.357 ± 0.349
GlyAsn: 3.184 ± 0.313
GlyPro: 3.391 ± 0.321
GlyGln: 3.019 ± 0.341
GlyArg: 4.135 ± 0.566
GlySer: 4.673 ± 0.492
GlyThr: 5.251 ± 0.499
GlyVal: 4.879 ± 0.456
GlyTrp: 1.447 ± 0.312
GlyTyr: 2.522 ± 0.408
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.282 ± 0.28
HisCys: 0.165 ± 0.085
HisAsp: 1.116 ± 0.257
HisGlu: 1.24 ± 0.268
HisPhe: 0.413 ± 0.134
HisGly: 1.158 ± 0.273
HisHis: 0.413 ± 0.147
HisIle: 0.91 ± 0.183
HisLys: 0.744 ± 0.214
HisLeu: 1.365 ± 0.259
HisMet: 0.579 ± 0.131
HisAsn: 0.62 ± 0.222
HisPro: 1.199 ± 0.257
HisGln: 0.372 ± 0.125
HisArg: 1.406 ± 0.31
HisSer: 0.951 ± 0.178
HisThr: 1.158 ± 0.201
HisVal: 1.034 ± 0.183
HisTrp: 0.372 ± 0.115
HisTyr: 0.744 ± 0.186
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.169 ± 0.538
IleCys: 0.413 ± 0.15
IleAsp: 3.473 ± 0.35
IleGlu: 4.507 ± 0.548
IlePhe: 1.695 ± 0.262
IleGly: 5.045 ± 0.602
IleHis: 0.538 ± 0.183
IleIle: 2.522 ± 0.385
IleLys: 3.101 ± 0.315
IleLeu: 2.853 ± 0.32
IleMet: 1.406 ± 0.271
IleAsn: 2.109 ± 0.274
IlePro: 2.316 ± 0.337
IleGln: 2.274 ± 0.302
IleArg: 2.936 ± 0.384
IleSer: 2.812 ± 0.37
IleThr: 3.019 ± 0.312
IleVal: 3.928 ± 0.396
IleTrp: 0.868 ± 0.254
IleTyr: 1.24 ± 0.264
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.879 ± 0.535
LysCys: 0.496 ± 0.156
LysAsp: 2.605 ± 0.408
LysGlu: 3.556 ± 0.392
LysPhe: 2.15 ± 0.344
LysGly: 3.597 ± 0.38
LysHis: 0.744 ± 0.217
LysIle: 2.894 ± 0.289
LysLys: 3.515 ± 0.396
LysLeu: 5.417 ± 0.446
LysMet: 1.613 ± 0.25
LysAsn: 2.481 ± 0.335
LysPro: 2.646 ± 0.395
LysGln: 2.192 ± 0.265
LysArg: 3.639 ± 0.505
LysSer: 2.605 ± 0.346
LysThr: 2.357 ± 0.332
LysVal: 3.887 ± 0.389
LysTrp: 0.786 ± 0.208
LysTyr: 1.489 ± 0.236
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.12 ± 0.575
LeuCys: 0.744 ± 0.176
LeuAsp: 5.458 ± 0.547
LeuGlu: 5.83 ± 0.578
LeuPhe: 2.605 ± 0.314
LeuGly: 5.872 ± 0.471
LeuHis: 1.199 ± 0.233
LeuIle: 3.887 ± 0.401
LeuLys: 3.68 ± 0.419
LeuLeu: 5.045 ± 0.559
LeuMet: 2.522 ± 0.33
LeuAsn: 2.77 ± 0.37
LeuPro: 3.887 ± 0.475
LeuGln: 2.481 ± 0.369
LeuArg: 4.507 ± 0.552
LeuSer: 4.673 ± 0.441
LeuThr: 4.714 ± 0.542
LeuVal: 4.548 ± 0.436
LeuTrp: 1.034 ± 0.272
LeuTyr: 1.778 ± 0.317
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.316 ± 0.301
MetCys: 0.289 ± 0.114
MetAsp: 1.447 ± 0.293
MetGlu: 1.654 ± 0.285
MetPhe: 0.91 ± 0.221
MetGly: 1.819 ± 0.274
MetHis: 0.579 ± 0.163
MetIle: 1.323 ± 0.198
MetLys: 1.737 ± 0.246
MetLeu: 2.233 ± 0.304
MetMet: 0.372 ± 0.123
MetAsn: 1.282 ± 0.242
MetPro: 1.861 ± 0.28
MetGln: 1.158 ± 0.287
MetArg: 1.571 ± 0.212
MetSer: 2.192 ± 0.26
MetThr: 2.357 ± 0.306
MetVal: 1.778 ± 0.326
MetTrp: 0.331 ± 0.115
MetTyr: 0.786 ± 0.182
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.466 ± 0.681
AsnCys: 0.207 ± 0.087
AsnAsp: 2.481 ± 0.358
AsnGlu: 2.026 ± 0.294
AsnPhe: 1.282 ± 0.235
AsnGly: 3.639 ± 0.467
AsnHis: 0.744 ± 0.195
AsnIle: 2.15 ± 0.346
AsnLys: 2.357 ± 0.279
AsnLeu: 2.853 ± 0.385
AsnMet: 0.951 ± 0.209
AsnAsn: 1.819 ± 0.334
AsnPro: 2.564 ± 0.372
AsnGln: 1.819 ± 0.311
AsnArg: 1.695 ± 0.284
AsnSer: 2.192 ± 0.293
AsnThr: 2.192 ± 0.288
AsnVal: 2.853 ± 0.36
AsnTrp: 0.579 ± 0.176
AsnTyr: 0.91 ± 0.195
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.127 ± 0.599
ProCys: 0.331 ± 0.126
ProAsp: 4.094 ± 0.556
ProGlu: 4.466 ± 0.691
ProPhe: 1.778 ± 0.208
ProGly: 4.259 ± 0.459
ProHis: 0.62 ± 0.197
ProIle: 2.398 ± 0.34
ProLys: 2.274 ± 0.355
ProLeu: 2.977 ± 0.439
ProMet: 1.323 ± 0.178
ProAsn: 2.357 ± 0.289
ProPro: 3.101 ± 0.451
ProGln: 2.067 ± 0.318
ProArg: 2.109 ± 0.343
ProSer: 2.853 ± 0.395
ProThr: 3.68 ± 0.51
ProVal: 4.3 ± 0.455
ProTrp: 0.91 ± 0.183
ProTyr: 1.902 ± 0.261
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.176 ± 0.865
GlnCys: 0.372 ± 0.157
GlnAsp: 1.613 ± 0.31
GlnGlu: 2.274 ± 0.388
GlnPhe: 1.365 ± 0.235
GlnGly: 2.936 ± 0.372
GlnHis: 0.91 ± 0.19
GlnIle: 2.233 ± 0.352
GlnLys: 1.819 ± 0.299
GlnLeu: 2.894 ± 0.404
GlnMet: 1.158 ± 0.229
GlnAsn: 1.902 ± 0.289
GlnPro: 2.192 ± 0.375
GlnGln: 1.943 ± 0.286
GlnArg: 2.646 ± 0.32
GlnSer: 1.654 ± 0.261
GlnThr: 2.44 ± 0.398
GlnVal: 2.853 ± 0.285
GlnTrp: 0.703 ± 0.216
GlnTyr: 1.199 ± 0.244
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.466 ± 0.523
ArgCys: 0.703 ± 0.164
ArgAsp: 2.646 ± 0.325
ArgGlu: 4.797 ± 0.435
ArgPhe: 1.654 ± 0.21
ArgGly: 3.928 ± 0.494
ArgHis: 1.158 ± 0.239
ArgIle: 2.77 ± 0.391
ArgLys: 3.515 ± 0.431
ArgLeu: 4.548 ± 0.481
ArgMet: 1.819 ± 0.272
ArgAsn: 2.316 ± 0.425
ArgPro: 2.853 ± 0.469
ArgGln: 2.15 ± 0.304
ArgArg: 3.556 ± 0.514
ArgSer: 2.44 ± 0.28
ArgThr: 2.729 ± 0.433
ArgVal: 4.052 ± 0.564
ArgTrp: 1.034 ± 0.198
ArgTyr: 2.026 ± 0.319
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.83 ± 0.621
SerCys: 0.538 ± 0.179
SerAsp: 3.143 ± 0.369
SerGlu: 3.267 ± 0.412
SerPhe: 1.819 ± 0.296
SerGly: 5.045 ± 0.491
SerHis: 0.786 ± 0.196
SerIle: 3.143 ± 0.398
SerLys: 3.225 ± 0.436
SerLeu: 4.755 ± 0.509
SerMet: 1.282 ± 0.278
SerAsn: 2.067 ± 0.253
SerPro: 3.184 ± 0.29
SerGln: 1.943 ± 0.225
SerArg: 3.349 ± 0.393
SerSer: 4.052 ± 0.437
SerThr: 3.225 ± 0.408
SerVal: 4.466 ± 0.492
SerTrp: 0.662 ± 0.152
SerTyr: 2.274 ± 0.323
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.451 ± 0.972
ThrCys: 0.703 ± 0.238
ThrAsp: 3.763 ± 0.452
ThrGlu: 3.721 ± 0.357
ThrPhe: 2.274 ± 0.3
ThrGly: 4.962 ± 0.419
ThrHis: 0.992 ± 0.252
ThrIle: 2.894 ± 0.339
ThrLys: 2.688 ± 0.291
ThrLeu: 4.466 ± 0.427
ThrMet: 0.992 ± 0.221
ThrAsn: 2.274 ± 0.288
ThrPro: 4.052 ± 0.544
ThrGln: 1.819 ± 0.265
ThrArg: 2.729 ± 0.275
ThrSer: 3.68 ± 0.406
ThrThr: 4.259 ± 0.586
ThrVal: 5.086 ± 0.463
ThrTrp: 1.53 ± 0.259
ThrTyr: 1.819 ± 0.369
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.699 ± 0.682
ValCys: 0.662 ± 0.191
ValAsp: 5.045 ± 0.468
ValGlu: 4.673 ± 0.378
ValPhe: 2.688 ± 0.356
ValGly: 4.424 ± 0.487
ValHis: 1.323 ± 0.279
ValIle: 4.052 ± 0.417
ValLys: 3.639 ± 0.399
ValLeu: 4.59 ± 0.352
ValMet: 2.026 ± 0.299
ValAsn: 2.605 ± 0.338
ValPro: 3.391 ± 0.361
ValGln: 2.481 ± 0.334
ValArg: 3.308 ± 0.409
ValSer: 3.97 ± 0.447
ValThr: 5.458 ± 0.422
ValVal: 4.921 ± 0.365
ValTrp: 1.323 ± 0.236
ValTyr: 1.778 ± 0.279
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.158 ± 0.365
TrpCys: 0.289 ± 0.1
TrpAsp: 1.695 ± 0.255
TrpGlu: 1.24 ± 0.222
TrpPhe: 0.496 ± 0.134
TrpGly: 1.365 ± 0.2
TrpHis: 0.289 ± 0.132
TrpIle: 0.91 ± 0.175
TrpLys: 1.075 ± 0.237
TrpLeu: 1.323 ± 0.213
TrpMet: 0.248 ± 0.097
TrpAsn: 0.744 ± 0.192
TrpPro: 0.786 ± 0.209
TrpGln: 0.703 ± 0.141
TrpArg: 1.075 ± 0.222
TrpSer: 1.116 ± 0.188
TrpThr: 0.827 ± 0.135
TrpVal: 1.282 ± 0.267
TrpTrp: 0.165 ± 0.085
TrpTyr: 0.744 ± 0.201
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.894 ± 0.418
TyrCys: 0.62 ± 0.179
TyrAsp: 1.819 ± 0.305
TyrGlu: 1.943 ± 0.311
TyrPhe: 1.282 ± 0.256
TyrGly: 2.894 ± 0.378
TyrHis: 0.868 ± 0.186
TyrIle: 1.24 ± 0.305
TyrLys: 1.365 ± 0.271
TyrLeu: 2.44 ± 0.428
TyrMet: 0.91 ± 0.198
TyrAsn: 1.406 ± 0.248
TyrPro: 1.365 ± 0.264
TyrGln: 1.323 ± 0.246
TyrArg: 1.902 ± 0.358
TyrSer: 1.323 ± 0.213
TyrThr: 1.985 ± 0.284
TyrVal: 1.943 ± 0.305
TyrTrp: 0.455 ± 0.123
TyrTyr: 1.034 ± 0.224
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 108 proteins (24185 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
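A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is taken as the standard deviation across replicates. The function names, the fixed seed and the choice of standard deviation are hypothetical; the exact procedure used by Proteome-pI is not described on this page.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins):
    """Per mille frequency of each overlapping dipeptide in a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of a dipeptide frequency over protein-level resamples.

    Requires n_boot >= 2, since a sample standard deviation needs two values.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.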

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski