Amino acid dipepetide frequency for Gordonia phage Yakult

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.9AlaAla: 10.9 ± 1.428
0.919AlaCys: 0.919 ± 0.249
6.566AlaAsp: 6.566 ± 0.582
4.924AlaGlu: 4.924 ± 0.57
4.596AlaPhe: 4.596 ± 0.662
8.142AlaGly: 8.142 ± 0.708
2.101AlaHis: 2.101 ± 0.437
6.303AlaIle: 6.303 ± 0.623
5.187AlaLys: 5.187 ± 0.729
8.207AlaLeu: 8.207 ± 0.609
3.349AlaMet: 3.349 ± 0.596
3.349AlaAsn: 3.349 ± 0.424
4.662AlaPro: 4.662 ± 0.456
4.531AlaGln: 4.531 ± 0.683
6.238AlaArg: 6.238 ± 0.816
4.99AlaSer: 4.99 ± 0.492
7.091AlaThr: 7.091 ± 0.976
6.96AlaVal: 6.96 ± 0.795
2.035AlaTrp: 2.035 ± 0.365
3.349AlaTyr: 3.349 ± 0.401
0.0AlaXaa: 0.0 ± 0.0
Cys
0.985CysAla: 0.985 ± 0.326
0.0CysCys: 0.0 ± 0.0
0.985CysAsp: 0.985 ± 0.261
0.394CysGlu: 0.394 ± 0.158
0.525CysPhe: 0.525 ± 0.204
1.248CysGly: 1.248 ± 0.302
0.197CysHis: 0.197 ± 0.119
0.394CysIle: 0.394 ± 0.178
0.263CysLys: 0.263 ± 0.129
0.394CysLeu: 0.394 ± 0.14
0.197CysMet: 0.197 ± 0.104
0.394CysAsn: 0.394 ± 0.153
0.263CysPro: 0.263 ± 0.161
0.394CysGln: 0.394 ± 0.166
0.985CysArg: 0.985 ± 0.292
0.46CysSer: 0.46 ± 0.188
0.46CysThr: 0.46 ± 0.165
0.722CysVal: 0.722 ± 0.28
0.263CysTrp: 0.263 ± 0.135
0.46CysTyr: 0.46 ± 0.183
0.0CysXaa: 0.0 ± 0.0
Asp
7.682AspAla: 7.682 ± 0.765
0.591AspCys: 0.591 ± 0.164
5.187AspAsp: 5.187 ± 0.867
4.859AspGlu: 4.859 ± 0.723
2.364AspPhe: 2.364 ± 0.342
4.531AspGly: 4.531 ± 0.506
1.51AspHis: 1.51 ± 0.292
3.086AspIle: 3.086 ± 0.452
3.02AspLys: 3.02 ± 0.47
5.515AspLeu: 5.515 ± 0.863
2.561AspMet: 2.561 ± 0.377
1.707AspAsn: 1.707 ± 0.298
5.778AspPro: 5.778 ± 0.632
1.707AspGln: 1.707 ± 0.378
3.48AspArg: 3.48 ± 0.466
3.283AspSer: 3.283 ± 0.581
4.596AspThr: 4.596 ± 0.735
4.137AspVal: 4.137 ± 0.51
1.576AspTrp: 1.576 ± 0.367
1.838AspTyr: 1.838 ± 0.383
0.0AspXaa: 0.0 ± 0.0
Glu
5.844GluAla: 5.844 ± 0.811
0.657GluCys: 0.657 ± 0.212
3.874GluAsp: 3.874 ± 0.619
3.414GluGlu: 3.414 ± 0.503
1.773GluPhe: 1.773 ± 0.349
4.99GluGly: 4.99 ± 0.547
1.445GluHis: 1.445 ± 0.319
1.576GluIle: 1.576 ± 0.286
1.838GluLys: 1.838 ± 0.45
5.121GluLeu: 5.121 ± 0.681
1.445GluMet: 1.445 ± 0.328
1.445GluAsn: 1.445 ± 0.348
2.364GluPro: 2.364 ± 0.436
2.692GluGln: 2.692 ± 0.385
4.268GluArg: 4.268 ± 0.765
2.626GluSer: 2.626 ± 0.353
2.298GluThr: 2.298 ± 0.44
3.94GluVal: 3.94 ± 0.676
1.051GluTrp: 1.051 ± 0.287
2.495GluTyr: 2.495 ± 0.456
0.0GluXaa: 0.0 ± 0.0
Phe
2.692PheAla: 2.692 ± 0.363
0.591PheCys: 0.591 ± 0.234
2.626PheAsp: 2.626 ± 0.412
1.51PheGlu: 1.51 ± 0.265
0.788PhePhe: 0.788 ± 0.19
2.955PheGly: 2.955 ± 0.518
0.788PheHis: 0.788 ± 0.19
1.379PheIle: 1.379 ± 0.284
1.838PheLys: 1.838 ± 0.322
2.429PheLeu: 2.429 ± 0.416
0.657PheMet: 0.657 ± 0.177
1.116PheAsn: 1.116 ± 0.334
1.116PhePro: 1.116 ± 0.227
0.919PheGln: 0.919 ± 0.231
2.429PheArg: 2.429 ± 0.378
1.707PheSer: 1.707 ± 0.33
1.97PheThr: 1.97 ± 0.298
2.035PheVal: 2.035 ± 0.403
0.328PheTrp: 0.328 ± 0.138
0.657PheTyr: 0.657 ± 0.186
0.0PheXaa: 0.0 ± 0.0
Gly
8.076GlyAla: 8.076 ± 1.408
1.051GlyCys: 1.051 ± 0.276
6.5GlyAsp: 6.5 ± 1.013
4.137GlyGlu: 4.137 ± 0.568
2.561GlyPhe: 2.561 ± 0.321
6.894GlyGly: 6.894 ± 1.297
2.101GlyHis: 2.101 ± 0.349
5.121GlyIle: 5.121 ± 0.832
4.334GlyLys: 4.334 ± 0.692
5.909GlyLeu: 5.909 ± 0.636
1.576GlyMet: 1.576 ± 0.366
2.298GlyAsn: 2.298 ± 0.348
3.546GlyPro: 3.546 ± 1.267
2.889GlyGln: 2.889 ± 0.518
5.318GlyArg: 5.318 ± 0.735
4.531GlySer: 4.531 ± 0.537
5.975GlyThr: 5.975 ± 0.64
7.091GlyVal: 7.091 ± 0.8
1.707GlyTrp: 1.707 ± 0.356
2.758GlyTyr: 2.758 ± 0.364
0.0GlyXaa: 0.0 ± 0.0
His
1.576HisAla: 1.576 ± 0.368
0.46HisCys: 0.46 ± 0.204
1.904HisAsp: 1.904 ± 0.418
1.773HisGlu: 1.773 ± 0.318
0.591HisPhe: 0.591 ± 0.208
1.773HisGly: 1.773 ± 0.366
0.46HisHis: 0.46 ± 0.164
0.722HisIle: 0.722 ± 0.205
0.985HisLys: 0.985 ± 0.326
1.51HisLeu: 1.51 ± 0.298
0.854HisMet: 0.854 ± 0.215
0.657HisAsn: 0.657 ± 0.204
1.116HisPro: 1.116 ± 0.314
0.591HisGln: 0.591 ± 0.23
1.313HisArg: 1.313 ± 0.296
1.182HisSer: 1.182 ± 0.228
1.445HisThr: 1.445 ± 0.296
1.182HisVal: 1.182 ± 0.248
0.722HisTrp: 0.722 ± 0.212
0.263HisTyr: 0.263 ± 0.132
0.0HisXaa: 0.0 ± 0.0
Ile
5.778IleAla: 5.778 ± 0.664
0.197IleCys: 0.197 ± 0.109
3.677IleAsp: 3.677 ± 0.484
3.349IleGlu: 3.349 ± 0.445
0.788IlePhe: 0.788 ± 0.19
4.137IleGly: 4.137 ± 0.686
1.051IleHis: 1.051 ± 0.235
1.838IleIle: 1.838 ± 0.332
1.904IleLys: 1.904 ± 0.379
2.889IleLeu: 2.889 ± 0.323
0.657IleMet: 0.657 ± 0.203
1.576IleAsn: 1.576 ± 0.314
3.02IlePro: 3.02 ± 0.47
1.248IleGln: 1.248 ± 0.39
3.283IleArg: 3.283 ± 0.458
1.904IleSer: 1.904 ± 0.351
2.035IleThr: 2.035 ± 0.329
4.071IleVal: 4.071 ± 0.494
0.985IleTrp: 0.985 ± 0.26
0.854IleTyr: 0.854 ± 0.223
0.0IleXaa: 0.0 ± 0.0
Lys
5.844LysAla: 5.844 ± 0.853
0.066LysCys: 0.066 ± 0.075
2.167LysAsp: 2.167 ± 0.415
1.904LysGlu: 1.904 ± 0.418
1.248LysPhe: 1.248 ± 0.262
4.465LysGly: 4.465 ± 0.803
1.116LysHis: 1.116 ± 0.261
2.101LysIle: 2.101 ± 0.279
2.561LysLys: 2.561 ± 0.527
3.611LysLeu: 3.611 ± 0.446
0.788LysMet: 0.788 ± 0.208
1.248LysAsn: 1.248 ± 0.326
1.97LysPro: 1.97 ± 0.414
1.576LysGln: 1.576 ± 0.358
4.596LysArg: 4.596 ± 0.634
2.626LysSer: 2.626 ± 0.444
2.232LysThr: 2.232 ± 0.489
2.823LysVal: 2.823 ± 0.387
0.591LysTrp: 0.591 ± 0.181
1.773LysTyr: 1.773 ± 0.317
0.0LysXaa: 0.0 ± 0.0
Leu
7.42LeuAla: 7.42 ± 0.519
0.657LeuCys: 0.657 ± 0.214
5.515LeuAsp: 5.515 ± 0.653
4.334LeuGlu: 4.334 ± 0.658
1.904LeuPhe: 1.904 ± 0.341
5.318LeuGly: 5.318 ± 0.738
1.248LeuHis: 1.248 ± 0.305
3.677LeuIle: 3.677 ± 0.478
3.546LeuLys: 3.546 ± 0.684
5.515LeuLeu: 5.515 ± 0.746
2.035LeuMet: 2.035 ± 0.365
2.232LeuAsn: 2.232 ± 0.387
4.924LeuPro: 4.924 ± 0.507
2.889LeuGln: 2.889 ± 0.47
7.485LeuArg: 7.485 ± 0.87
5.318LeuSer: 5.318 ± 0.626
4.924LeuThr: 4.924 ± 0.526
5.384LeuVal: 5.384 ± 0.468
1.773LeuTrp: 1.773 ± 0.375
1.97LeuTyr: 1.97 ± 0.428
0.0LeuXaa: 0.0 ± 0.0
Met
2.561MetAla: 2.561 ± 0.364
0.197MetCys: 0.197 ± 0.098
0.985MetAsp: 0.985 ± 0.265
1.051MetGlu: 1.051 ± 0.211
0.854MetPhe: 0.854 ± 0.299
2.035MetGly: 2.035 ± 0.358
0.657MetHis: 0.657 ± 0.339
1.576MetIle: 1.576 ± 0.302
1.51MetLys: 1.51 ± 0.305
2.232MetLeu: 2.232 ± 0.318
0.657MetMet: 0.657 ± 0.229
0.657MetAsn: 0.657 ± 0.223
1.051MetPro: 1.051 ± 0.241
1.182MetGln: 1.182 ± 0.243
2.035MetArg: 2.035 ± 0.336
2.626MetSer: 2.626 ± 0.377
2.495MetThr: 2.495 ± 0.486
1.445MetVal: 1.445 ± 0.343
0.722MetTrp: 0.722 ± 0.201
0.394MetTyr: 0.394 ± 0.142
0.0MetXaa: 0.0 ± 0.0
Asn
3.546AsnAla: 3.546 ± 0.693
0.263AsnCys: 0.263 ± 0.157
1.904AsnAsp: 1.904 ± 0.502
1.51AsnGlu: 1.51 ± 0.302
1.313AsnPhe: 1.313 ± 0.315
2.429AsnGly: 2.429 ± 0.397
0.394AsnHis: 0.394 ± 0.176
0.854AsnIle: 0.854 ± 0.183
1.51AsnLys: 1.51 ± 0.302
1.904AsnLeu: 1.904 ± 0.412
0.854AsnMet: 0.854 ± 0.234
0.854AsnAsn: 0.854 ± 0.282
2.495AsnPro: 2.495 ± 0.356
0.919AsnGln: 0.919 ± 0.26
1.838AsnArg: 1.838 ± 0.325
1.51AsnSer: 1.51 ± 0.259
2.101AsnThr: 2.101 ± 0.387
2.035AsnVal: 2.035 ± 0.334
0.525AsnTrp: 0.525 ± 0.177
0.591AsnTyr: 0.591 ± 0.194
0.0AsnXaa: 0.0 ± 0.0
Pro
6.763ProAla: 6.763 ± 0.667
0.657ProCys: 0.657 ± 0.184
4.268ProAsp: 4.268 ± 0.598
3.086ProGlu: 3.086 ± 0.446
1.51ProPhe: 1.51 ± 0.307
5.253ProGly: 5.253 ± 0.738
0.985ProHis: 0.985 ± 0.257
2.101ProIle: 2.101 ± 0.377
2.889ProLys: 2.889 ± 0.447
3.349ProLeu: 3.349 ± 0.554
1.313ProMet: 1.313 ± 0.328
1.313ProAsn: 1.313 ± 0.288
3.02ProPro: 3.02 ± 0.488
2.167ProGln: 2.167 ± 0.679
2.758ProArg: 2.758 ± 0.521
3.086ProSer: 3.086 ± 0.348
2.626ProThr: 2.626 ± 0.455
3.874ProVal: 3.874 ± 0.553
1.182ProTrp: 1.182 ± 0.237
1.248ProTyr: 1.248 ± 0.301
0.0ProXaa: 0.0 ± 0.0
Gln
4.137GlnAla: 4.137 ± 0.624
0.328GlnCys: 0.328 ± 0.154
2.298GlnAsp: 2.298 ± 0.408
1.445GlnGlu: 1.445 ± 0.28
1.313GlnPhe: 1.313 ± 0.332
4.465GlnGly: 4.465 ± 1.85
0.525GlnHis: 0.525 ± 0.168
2.035GlnIle: 2.035 ± 0.352
0.854GlnLys: 0.854 ± 0.292
2.823GlnLeu: 2.823 ± 0.439
0.919GlnMet: 0.919 ± 0.195
0.657GlnAsn: 0.657 ± 0.176
1.641GlnPro: 1.641 ± 0.278
1.313GlnGln: 1.313 ± 0.281
3.677GlnArg: 3.677 ± 0.536
1.773GlnSer: 1.773 ± 0.327
2.429GlnThr: 2.429 ± 0.421
2.429GlnVal: 2.429 ± 0.43
1.182GlnTrp: 1.182 ± 0.269
1.445GlnTyr: 1.445 ± 0.313
0.0GlnXaa: 0.0 ± 0.0
Arg
7.091ArgAla: 7.091 ± 0.986
0.854ArgCys: 0.854 ± 0.308
4.596ArgAsp: 4.596 ± 0.564
4.268ArgGlu: 4.268 ± 0.778
2.035ArgPhe: 2.035 ± 0.422
5.647ArgGly: 5.647 ± 0.663
1.313ArgHis: 1.313 ± 0.272
2.823ArgIle: 2.823 ± 0.465
2.889ArgLys: 2.889 ± 0.436
7.157ArgLeu: 7.157 ± 0.71
2.298ArgMet: 2.298 ± 0.41
2.167ArgAsn: 2.167 ± 0.32
2.561ArgPro: 2.561 ± 0.486
3.48ArgGln: 3.48 ± 0.438
5.581ArgArg: 5.581 ± 0.71
3.874ArgSer: 3.874 ± 0.518
3.94ArgThr: 3.94 ± 0.551
4.465ArgVal: 4.465 ± 0.631
2.101ArgTrp: 2.101 ± 0.409
2.101ArgTyr: 2.101 ± 0.354
0.0ArgXaa: 0.0 ± 0.0
Ser
4.793SerAla: 4.793 ± 0.512
0.263SerCys: 0.263 ± 0.138
3.546SerAsp: 3.546 ± 0.507
3.874SerGlu: 3.874 ± 0.567
1.182SerPhe: 1.182 ± 0.289
5.253SerGly: 5.253 ± 0.54
0.985SerHis: 0.985 ± 0.233
2.364SerIle: 2.364 ± 0.344
2.495SerLys: 2.495 ± 0.374
5.384SerLeu: 5.384 ± 0.637
1.379SerMet: 1.379 ± 0.314
1.904SerAsn: 1.904 ± 0.358
3.349SerPro: 3.349 ± 0.446
2.035SerGln: 2.035 ± 0.387
3.086SerArg: 3.086 ± 0.427
3.217SerSer: 3.217 ± 0.454
3.02SerThr: 3.02 ± 0.44
3.94SerVal: 3.94 ± 0.608
1.379SerTrp: 1.379 ± 0.231
1.576SerTyr: 1.576 ± 0.354
0.0SerXaa: 0.0 ± 0.0
Thr
5.778ThrAla: 5.778 ± 0.774
0.722ThrCys: 0.722 ± 0.264
3.94ThrAsp: 3.94 ± 0.512
1.97ThrGlu: 1.97 ± 0.329
1.379ThrPhe: 1.379 ± 0.305
7.091ThrGly: 7.091 ± 1.393
1.051ThrHis: 1.051 ± 0.267
3.217ThrIle: 3.217 ± 0.533
2.758ThrLys: 2.758 ± 0.362
4.99ThrLeu: 4.99 ± 0.416
2.035ThrMet: 2.035 ± 0.414
1.379ThrAsn: 1.379 ± 0.269
3.677ThrPro: 3.677 ± 0.518
1.97ThrGln: 1.97 ± 0.315
4.202ThrArg: 4.202 ± 0.474
3.152ThrSer: 3.152 ± 0.412
4.399ThrThr: 4.399 ± 0.656
5.187ThrVal: 5.187 ± 0.611
1.116ThrTrp: 1.116 ± 0.238
1.445ThrTyr: 1.445 ± 0.356
0.0ThrXaa: 0.0 ± 0.0
Val
7.879ValAla: 7.879 ± 0.688
0.854ValCys: 0.854 ± 0.3
4.399ValAsp: 4.399 ± 0.562
3.48ValGlu: 3.48 ± 0.403
2.167ValPhe: 2.167 ± 0.418
4.005ValGly: 4.005 ± 0.53
1.379ValHis: 1.379 ± 0.335
3.414ValIle: 3.414 ± 0.475
3.02ValLys: 3.02 ± 0.514
4.793ValLeu: 4.793 ± 0.562
1.51ValMet: 1.51 ± 0.247
2.823ValAsn: 2.823 ± 0.425
4.334ValPro: 4.334 ± 0.59
3.02ValGln: 3.02 ± 0.397
5.121ValArg: 5.121 ± 0.588
4.137ValSer: 4.137 ± 0.61
5.45ValThr: 5.45 ± 0.54
4.859ValVal: 4.859 ± 0.627
2.561ValTrp: 2.561 ± 0.401
1.51ValTyr: 1.51 ± 0.251
0.0ValXaa: 0.0 ± 0.0
Trp
2.035TrpAla: 2.035 ± 0.477
0.46TrpCys: 0.46 ± 0.15
2.035TrpAsp: 2.035 ± 0.301
1.641TrpGlu: 1.641 ± 0.35
0.788TrpPhe: 0.788 ± 0.201
1.313TrpGly: 1.313 ± 0.408
0.919TrpHis: 0.919 ± 0.242
0.066TrpIle: 0.066 ± 0.066
0.985TrpLys: 0.985 ± 0.193
2.035TrpLeu: 2.035 ± 0.355
0.854TrpMet: 0.854 ± 0.271
1.116TrpAsn: 1.116 ± 0.244
0.919TrpPro: 0.919 ± 0.241
0.985TrpGln: 0.985 ± 0.268
1.116TrpArg: 1.116 ± 0.274
1.576TrpSer: 1.576 ± 0.277
0.657TrpThr: 0.657 ± 0.209
1.838TrpVal: 1.838 ± 0.399
0.591TrpTrp: 0.591 ± 0.197
0.788TrpTyr: 0.788 ± 0.224
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.02TyrAla: 3.02 ± 0.481
0.197TyrCys: 0.197 ± 0.123
2.035TyrAsp: 2.035 ± 0.308
2.232TyrGlu: 2.232 ± 0.39
0.854TyrPhe: 0.854 ± 0.207
2.429TyrGly: 2.429 ± 0.323
0.854TyrHis: 0.854 ± 0.275
0.591TyrIle: 0.591 ± 0.17
0.919TyrLys: 0.919 ± 0.199
2.364TyrLeu: 2.364 ± 0.373
0.919TyrMet: 0.919 ± 0.247
0.46TyrAsn: 0.46 ± 0.196
1.641TyrPro: 1.641 ± 0.273
1.182TyrGln: 1.182 ± 0.262
2.429TyrArg: 2.429 ± 0.585
1.445TyrSer: 1.445 ± 0.328
1.313TyrThr: 1.313 ± 0.258
2.298TyrVal: 2.298 ± 0.374
0.328TyrTrp: 0.328 ± 0.165
0.788TyrTyr: 0.788 ± 0.188
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (15231 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski