Amino acid dipepetide frequency for Acinetobacter phage fEg-Aba01

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.505AlaAla: 7.505 ± 1.0
1.347AlaCys: 1.347 ± 0.445
2.983AlaAsp: 2.983 ± 0.578
6.35AlaGlu: 6.35 ± 0.9
2.79AlaPhe: 2.79 ± 0.46
5.388AlaGly: 5.388 ± 0.937
1.155AlaHis: 1.155 ± 0.282
4.811AlaIle: 4.811 ± 0.923
6.735AlaLys: 6.735 ± 0.832
7.89AlaLeu: 7.89 ± 0.922
2.79AlaMet: 2.79 ± 0.648
3.368AlaAsn: 3.368 ± 0.988
2.309AlaPro: 2.309 ± 0.546
4.041AlaGln: 4.041 ± 0.825
3.175AlaArg: 3.175 ± 0.644
4.041AlaSer: 4.041 ± 0.683
6.928AlaThr: 6.928 ± 0.894
4.426AlaVal: 4.426 ± 0.732
1.058AlaTrp: 1.058 ± 0.317
2.502AlaTyr: 2.502 ± 0.498
0.0AlaXaa: 0.0 ± 0.0
Cys
0.674CysAla: 0.674 ± 0.307
0.385CysCys: 0.385 ± 0.193
0.577CysAsp: 0.577 ± 0.228
0.77CysGlu: 0.77 ± 0.377
0.577CysPhe: 0.577 ± 0.222
0.962CysGly: 0.962 ± 0.399
0.289CysHis: 0.289 ± 0.192
0.674CysIle: 0.674 ± 0.255
0.77CysLys: 0.77 ± 0.276
0.77CysLeu: 0.77 ± 0.277
0.192CysMet: 0.192 ± 0.154
0.481CysAsn: 0.481 ± 0.279
0.481CysPro: 0.481 ± 0.173
0.289CysGln: 0.289 ± 0.184
0.674CysArg: 0.674 ± 0.268
0.77CysSer: 0.77 ± 0.394
0.0CysThr: 0.0 ± 0.0
0.962CysVal: 0.962 ± 0.304
0.0CysTrp: 0.0 ± 0.0
0.385CysTyr: 0.385 ± 0.198
0.0CysXaa: 0.0 ± 0.0
Asp
4.522AspAla: 4.522 ± 0.68
0.674AspCys: 0.674 ± 0.287
4.522AspAsp: 4.522 ± 0.779
4.426AspGlu: 4.426 ± 0.621
2.502AspPhe: 2.502 ± 0.459
5.677AspGly: 5.677 ± 0.712
1.347AspHis: 1.347 ± 0.394
3.271AspIle: 3.271 ± 0.512
4.234AspLys: 4.234 ± 0.538
4.907AspLeu: 4.907 ± 0.637
1.443AspMet: 1.443 ± 0.348
2.598AspAsn: 2.598 ± 0.582
2.502AspPro: 2.502 ± 0.544
1.732AspGln: 1.732 ± 0.353
1.924AspArg: 1.924 ± 0.409
4.522AspSer: 4.522 ± 0.678
2.598AspThr: 2.598 ± 0.613
3.945AspVal: 3.945 ± 0.526
0.962AspTrp: 0.962 ± 0.321
1.732AspTyr: 1.732 ± 0.319
0.0AspXaa: 0.0 ± 0.0
Glu
4.715GluAla: 4.715 ± 0.52
0.674GluCys: 0.674 ± 0.288
2.887GluAsp: 2.887 ± 0.551
3.945GluGlu: 3.945 ± 0.744
2.983GluPhe: 2.983 ± 0.586
4.041GluGly: 4.041 ± 0.608
0.962GluHis: 0.962 ± 0.306
4.426GluIle: 4.426 ± 0.678
5.196GluLys: 5.196 ± 0.777
6.735GluLeu: 6.735 ± 0.836
1.828GluMet: 1.828 ± 0.422
3.56GluAsn: 3.56 ± 0.5
1.732GluPro: 1.732 ± 0.435
4.522GluGln: 4.522 ± 0.655
3.175GluArg: 3.175 ± 0.546
4.234GluSer: 4.234 ± 0.701
4.041GluThr: 4.041 ± 0.594
3.849GluVal: 3.849 ± 0.632
1.058GluTrp: 1.058 ± 0.235
3.271GluTyr: 3.271 ± 0.569
0.0GluXaa: 0.0 ± 0.0
Phe
2.405PheAla: 2.405 ± 0.465
0.192PheCys: 0.192 ± 0.113
2.502PheAsp: 2.502 ± 0.466
2.79PheGlu: 2.79 ± 0.479
2.117PhePhe: 2.117 ± 0.505
2.598PheGly: 2.598 ± 0.408
0.674PheHis: 0.674 ± 0.204
2.117PheIle: 2.117 ± 0.605
2.79PheLys: 2.79 ± 0.502
3.464PheLeu: 3.464 ± 0.59
1.251PheMet: 1.251 ± 0.326
2.309PheAsn: 2.309 ± 0.535
1.155PhePro: 1.155 ± 0.291
1.251PheGln: 1.251 ± 0.283
1.155PheArg: 1.155 ± 0.299
1.636PheSer: 1.636 ± 0.456
2.598PheThr: 2.598 ± 0.483
1.347PheVal: 1.347 ± 0.334
0.674PheTrp: 0.674 ± 0.333
1.636PheTyr: 1.636 ± 0.365
0.0PheXaa: 0.0 ± 0.0
Gly
5.677GlyAla: 5.677 ± 0.825
0.577GlyCys: 0.577 ± 0.232
3.368GlyAsp: 3.368 ± 0.484
3.368GlyGlu: 3.368 ± 0.557
2.598GlyPhe: 2.598 ± 0.58
4.426GlyGly: 4.426 ± 0.561
0.77GlyHis: 0.77 ± 0.329
4.426GlyIle: 4.426 ± 0.478
5.484GlyLys: 5.484 ± 0.801
5.196GlyLeu: 5.196 ± 0.755
2.405GlyMet: 2.405 ± 0.432
3.271GlyAsn: 3.271 ± 0.741
1.539GlyPro: 1.539 ± 0.489
2.309GlyGln: 2.309 ± 0.551
3.849GlyArg: 3.849 ± 0.591
3.753GlySer: 3.753 ± 0.478
2.887GlyThr: 2.887 ± 0.47
3.849GlyVal: 3.849 ± 0.572
0.577GlyTrp: 0.577 ± 0.32
3.368GlyTyr: 3.368 ± 0.679
0.0GlyXaa: 0.0 ± 0.0
His
0.77HisAla: 0.77 ± 0.269
0.192HisCys: 0.192 ± 0.139
0.962HisAsp: 0.962 ± 0.22
0.866HisGlu: 0.866 ± 0.306
0.577HisPhe: 0.577 ± 0.228
1.828HisGly: 1.828 ± 0.471
0.385HisHis: 0.385 ± 0.165
1.155HisIle: 1.155 ± 0.457
0.577HisLys: 0.577 ± 0.257
2.213HisLeu: 2.213 ± 0.446
0.192HisMet: 0.192 ± 0.154
0.577HisAsn: 0.577 ± 0.189
0.577HisPro: 0.577 ± 0.228
0.481HisGln: 0.481 ± 0.268
0.577HisArg: 0.577 ± 0.231
0.962HisSer: 0.962 ± 0.446
0.674HisThr: 0.674 ± 0.244
1.251HisVal: 1.251 ± 0.426
0.289HisTrp: 0.289 ± 0.176
0.866HisTyr: 0.866 ± 0.34
0.0HisXaa: 0.0 ± 0.0
Ile
5.292IleAla: 5.292 ± 0.596
0.866IleCys: 0.866 ± 0.33
5.292IleAsp: 5.292 ± 0.805
5.003IleGlu: 5.003 ± 0.721
1.732IlePhe: 1.732 ± 0.412
4.137IleGly: 4.137 ± 0.508
0.866IleHis: 0.866 ± 0.349
2.983IleIle: 2.983 ± 0.51
4.522IleLys: 4.522 ± 0.734
4.234IleLeu: 4.234 ± 0.509
1.058IleMet: 1.058 ± 0.266
3.271IleAsn: 3.271 ± 0.772
2.502IlePro: 2.502 ± 0.457
2.887IleGln: 2.887 ± 0.693
2.598IleArg: 2.598 ± 0.563
4.234IleSer: 4.234 ± 0.556
3.849IleThr: 3.849 ± 0.539
3.753IleVal: 3.753 ± 0.808
0.481IleTrp: 0.481 ± 0.275
2.021IleTyr: 2.021 ± 0.525
0.0IleXaa: 0.0 ± 0.0
Lys
7.216LysAla: 7.216 ± 1.095
0.481LysCys: 0.481 ± 0.262
4.715LysAsp: 4.715 ± 0.749
5.1LysGlu: 5.1 ± 0.773
2.502LysPhe: 2.502 ± 0.506
3.656LysGly: 3.656 ± 0.602
1.443LysHis: 1.443 ± 0.409
4.137LysIle: 4.137 ± 0.646
4.234LysLys: 4.234 ± 0.727
7.601LysLeu: 7.601 ± 0.69
1.924LysMet: 1.924 ± 0.401
4.907LysAsn: 4.907 ± 0.803
2.887LysPro: 2.887 ± 0.581
2.887LysGln: 2.887 ± 0.479
3.753LysArg: 3.753 ± 0.619
4.522LysSer: 4.522 ± 0.667
4.137LysThr: 4.137 ± 0.548
6.158LysVal: 6.158 ± 0.745
0.962LysTrp: 0.962 ± 0.294
2.021LysTyr: 2.021 ± 0.456
0.0LysXaa: 0.0 ± 0.0
Leu
8.179LeuAla: 8.179 ± 1.243
0.77LeuCys: 0.77 ± 0.316
6.639LeuAsp: 6.639 ± 0.622
7.505LeuGlu: 7.505 ± 0.82
3.175LeuPhe: 3.175 ± 0.6
4.426LeuGly: 4.426 ± 0.79
0.962LeuHis: 0.962 ± 0.347
6.735LeuIle: 6.735 ± 1.008
6.928LeuLys: 6.928 ± 0.958
7.601LeuLeu: 7.601 ± 1.218
2.309LeuMet: 2.309 ± 0.538
3.656LeuAsn: 3.656 ± 0.696
2.983LeuPro: 2.983 ± 0.597
4.426LeuGln: 4.426 ± 0.705
4.618LeuArg: 4.618 ± 0.859
7.12LeuSer: 7.12 ± 1.114
6.062LeuThr: 6.062 ± 0.743
4.522LeuVal: 4.522 ± 0.62
0.481LeuTrp: 0.481 ± 0.203
2.79LeuTyr: 2.79 ± 0.392
0.0LeuXaa: 0.0 ± 0.0
Met
1.924MetAla: 1.924 ± 0.424
0.481MetCys: 0.481 ± 0.193
1.636MetAsp: 1.636 ± 0.41
1.155MetGlu: 1.155 ± 0.335
0.962MetPhe: 0.962 ± 0.297
1.251MetGly: 1.251 ± 0.394
0.289MetHis: 0.289 ± 0.198
1.058MetIle: 1.058 ± 0.255
1.443MetLys: 1.443 ± 0.35
2.309MetLeu: 2.309 ± 0.558
0.385MetMet: 0.385 ± 0.201
1.539MetAsn: 1.539 ± 0.356
0.77MetPro: 0.77 ± 0.25
1.443MetGln: 1.443 ± 0.457
2.021MetArg: 2.021 ± 0.453
2.117MetSer: 2.117 ± 0.374
2.117MetThr: 2.117 ± 0.378
1.636MetVal: 1.636 ± 0.479
0.192MetTrp: 0.192 ± 0.107
0.385MetTyr: 0.385 ± 0.173
0.0MetXaa: 0.0 ± 0.0
Asn
5.1AsnAla: 5.1 ± 0.963
0.096AsnCys: 0.096 ± 0.087
3.271AsnAsp: 3.271 ± 0.662
2.79AsnGlu: 2.79 ± 0.63
0.866AsnPhe: 0.866 ± 0.361
4.33AsnGly: 4.33 ± 0.616
1.347AsnHis: 1.347 ± 0.367
2.598AsnIle: 2.598 ± 0.52
4.522AsnLys: 4.522 ± 0.65
4.811AsnLeu: 4.811 ± 0.605
0.962AsnMet: 0.962 ± 0.325
2.598AsnAsn: 2.598 ± 0.543
3.753AsnPro: 3.753 ± 0.646
2.502AsnGln: 2.502 ± 0.499
2.405AsnArg: 2.405 ± 0.497
2.405AsnSer: 2.405 ± 0.478
2.117AsnThr: 2.117 ± 0.406
2.117AsnVal: 2.117 ± 0.403
0.77AsnTrp: 0.77 ± 0.342
1.443AsnTyr: 1.443 ± 0.46
0.0AsnXaa: 0.0 ± 0.0
Pro
3.368ProAla: 3.368 ± 0.79
1.058ProCys: 1.058 ± 0.383
1.732ProAsp: 1.732 ± 0.524
2.598ProGlu: 2.598 ± 0.555
1.155ProPhe: 1.155 ± 0.296
0.866ProGly: 0.866 ± 0.351
0.962ProHis: 0.962 ± 0.311
2.117ProIle: 2.117 ± 0.481
3.368ProLys: 3.368 ± 0.73
3.271ProLeu: 3.271 ± 0.634
0.866ProMet: 0.866 ± 0.268
2.213ProAsn: 2.213 ± 0.537
0.962ProPro: 0.962 ± 0.339
1.636ProGln: 1.636 ± 0.393
0.77ProArg: 0.77 ± 0.246
2.983ProSer: 2.983 ± 0.52
1.828ProThr: 1.828 ± 0.504
2.213ProVal: 2.213 ± 0.465
0.096ProTrp: 0.096 ± 0.097
1.443ProTyr: 1.443 ± 0.432
0.0ProXaa: 0.0 ± 0.0
Gln
5.484GlnAla: 5.484 ± 1.056
0.674GlnCys: 0.674 ± 0.274
2.213GlnAsp: 2.213 ± 0.638
2.887GlnGlu: 2.887 ± 0.509
1.828GlnPhe: 1.828 ± 0.395
1.636GlnGly: 1.636 ± 0.594
0.481GlnHis: 0.481 ± 0.187
2.598GlnIle: 2.598 ± 0.617
3.464GlnLys: 3.464 ± 0.818
5.773GlnLeu: 5.773 ± 0.849
1.347GlnMet: 1.347 ± 0.422
2.694GlnAsn: 2.694 ± 0.617
1.636GlnPro: 1.636 ± 0.424
2.502GlnGln: 2.502 ± 0.805
1.828GlnArg: 1.828 ± 0.504
2.694GlnSer: 2.694 ± 0.474
3.368GlnThr: 3.368 ± 0.618
2.309GlnVal: 2.309 ± 0.405
0.481GlnTrp: 0.481 ± 0.213
1.347GlnTyr: 1.347 ± 0.317
0.0GlnXaa: 0.0 ± 0.0
Arg
3.079ArgAla: 3.079 ± 0.513
0.577ArgCys: 0.577 ± 0.277
3.656ArgAsp: 3.656 ± 0.57
3.56ArgGlu: 3.56 ± 0.82
2.117ArgPhe: 2.117 ± 0.519
2.694ArgGly: 2.694 ± 0.496
0.866ArgHis: 0.866 ± 0.339
2.79ArgIle: 2.79 ± 0.544
3.464ArgLys: 3.464 ± 0.649
4.811ArgLeu: 4.811 ± 0.612
1.155ArgMet: 1.155 ± 0.298
2.021ArgAsn: 2.021 ± 0.555
1.828ArgPro: 1.828 ± 0.404
1.251ArgGln: 1.251 ± 0.447
2.405ArgArg: 2.405 ± 0.474
2.983ArgSer: 2.983 ± 0.632
2.694ArgThr: 2.694 ± 0.599
2.502ArgVal: 2.502 ± 0.514
0.674ArgTrp: 0.674 ± 0.196
1.924ArgTyr: 1.924 ± 0.375
0.0ArgXaa: 0.0 ± 0.0
Ser
4.137SerAla: 4.137 ± 0.576
0.192SerCys: 0.192 ± 0.137
3.368SerAsp: 3.368 ± 0.56
4.715SerGlu: 4.715 ± 0.735
2.502SerPhe: 2.502 ± 0.505
5.773SerGly: 5.773 ± 0.639
0.962SerHis: 0.962 ± 0.312
3.945SerIle: 3.945 ± 0.696
4.522SerLys: 4.522 ± 0.644
5.484SerLeu: 5.484 ± 0.592
2.117SerMet: 2.117 ± 0.447
4.137SerAsn: 4.137 ± 0.681
1.828SerPro: 1.828 ± 0.435
2.79SerGln: 2.79 ± 0.453
3.079SerArg: 3.079 ± 0.509
5.003SerSer: 5.003 ± 0.767
2.983SerThr: 2.983 ± 0.561
2.79SerVal: 2.79 ± 0.57
0.577SerTrp: 0.577 ± 0.228
2.502SerTyr: 2.502 ± 0.431
0.0SerXaa: 0.0 ± 0.0
Thr
4.041ThrAla: 4.041 ± 0.589
0.77ThrCys: 0.77 ± 0.345
2.887ThrAsp: 2.887 ± 0.564
3.175ThrGlu: 3.175 ± 0.561
2.021ThrPhe: 2.021 ± 0.536
4.33ThrGly: 4.33 ± 0.639
0.577ThrHis: 0.577 ± 0.246
4.907ThrIle: 4.907 ± 0.91
4.041ThrLys: 4.041 ± 0.475
6.158ThrLeu: 6.158 ± 0.634
1.058ThrMet: 1.058 ± 0.296
2.983ThrAsn: 2.983 ± 0.514
2.405ThrPro: 2.405 ± 0.567
4.137ThrGln: 4.137 ± 0.741
2.405ThrArg: 2.405 ± 0.469
3.271ThrSer: 3.271 ± 0.48
3.849ThrThr: 3.849 ± 0.843
3.464ThrVal: 3.464 ± 0.593
0.866ThrTrp: 0.866 ± 0.302
1.924ThrTyr: 1.924 ± 0.49
0.0ThrXaa: 0.0 ± 0.0
Val
3.656ValAla: 3.656 ± 0.721
0.289ValCys: 0.289 ± 0.19
3.849ValAsp: 3.849 ± 0.693
3.753ValGlu: 3.753 ± 0.598
2.021ValPhe: 2.021 ± 0.419
3.56ValGly: 3.56 ± 0.519
0.481ValHis: 0.481 ± 0.231
4.234ValIle: 4.234 ± 0.651
4.811ValLys: 4.811 ± 0.718
4.041ValLeu: 4.041 ± 0.75
1.058ValMet: 1.058 ± 0.314
2.887ValAsn: 2.887 ± 0.679
2.502ValPro: 2.502 ± 0.436
3.656ValGln: 3.656 ± 0.597
3.271ValArg: 3.271 ± 0.508
2.887ValSer: 2.887 ± 0.535
3.464ValThr: 3.464 ± 0.684
3.945ValVal: 3.945 ± 0.652
1.251ValTrp: 1.251 ± 0.294
2.309ValTyr: 2.309 ± 0.419
0.0ValXaa: 0.0 ± 0.0
Trp
1.251TrpAla: 1.251 ± 0.419
0.192TrpCys: 0.192 ± 0.127
0.77TrpAsp: 0.77 ± 0.321
0.77TrpGlu: 0.77 ± 0.252
0.289TrpPhe: 0.289 ± 0.191
0.674TrpGly: 0.674 ± 0.236
0.289TrpHis: 0.289 ± 0.171
0.962TrpIle: 0.962 ± 0.306
0.866TrpLys: 0.866 ± 0.336
1.347TrpLeu: 1.347 ± 0.385
0.192TrpMet: 0.192 ± 0.125
0.577TrpAsn: 0.577 ± 0.221
0.192TrpPro: 0.192 ± 0.123
0.674TrpGln: 0.674 ± 0.279
0.962TrpArg: 0.962 ± 0.335
1.058TrpSer: 1.058 ± 0.314
0.481TrpThr: 0.481 ± 0.223
0.481TrpVal: 0.481 ± 0.258
0.192TrpTrp: 0.192 ± 0.119
0.385TrpTyr: 0.385 ± 0.182
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.694TyrAla: 2.694 ± 0.704
0.192TyrCys: 0.192 ± 0.146
2.213TyrAsp: 2.213 ± 0.402
2.309TyrGlu: 2.309 ± 0.543
1.347TyrPhe: 1.347 ± 0.342
1.443TyrGly: 1.443 ± 0.584
0.866TyrHis: 0.866 ± 0.332
1.636TyrIle: 1.636 ± 0.382
3.271TyrLys: 3.271 ± 0.488
3.271TyrLeu: 3.271 ± 0.59
0.481TyrMet: 0.481 ± 0.208
1.155TyrAsn: 1.155 ± 0.279
0.962TyrPro: 0.962 ± 0.322
1.828TyrGln: 1.828 ± 0.52
2.405TyrArg: 2.405 ± 0.411
2.213TyrSer: 2.213 ± 0.491
2.598TyrThr: 2.598 ± 0.575
2.405TyrVal: 2.405 ± 0.434
0.962TyrTrp: 0.962 ± 0.314
1.636TyrTyr: 1.636 ± 0.542
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (10394 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski