Amino acid dipeptide frequency for Salmonella phage templet

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
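As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is not the code used by Proteome-pI: the function name `dipeptide_frequencies`, the toy sequences, and the choice to normalize by the total number of dipeptides are all assumptions, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Overlapping pairs are counted within each protein only, so no pair
    spans two different proteins. Normalizing by the total number of
    counted pairs is an assumption made for this sketch.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences invented for illustration (not taken from the phage proteome).
toy_proteins = ["MAAG", "AAK"]
freqs = dipeptide_frequencies(toy_proteins)
```

With these two toy sequences there are five dipeptides in total, two of which are AA, so `freqs["AA"]` comes out as 400 ‰.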

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones in blue.
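The arithmetic behind the expected value, and a simple over/under comparison, can be sketched as follows. The page does not state the exact rule used for the red/blue marking, so the plain comparison against the uniform expectation in `classify` is an assumption; the four observed values are copied from the table.

```python
# Uniform expectation: every ordered pair of residues equally likely.
N_STANDARD = 20
expected_permille = 1000.0 / N_STANDARD ** 2           # 400 dipeptides -> 2.5 per mille
expected_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2     # 441 dipeptides -> about 2.27 per mille

# A few observed values (per mille) copied from the table.
observed = {
    "AlaAla": 11.346,
    "LeuLeu": 6.372,
    "TrpTrp": 0.389,
    "CysCys": 0.155,
}


def classify(value, expected=expected_permille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


labels = {pair: classify(value) for pair, value in observed.items()}
```

Under this assumed rule, AlaAla and LeuLeu are overrepresented while TrpTrp and CysCys are underrepresented, which mainly reflects how common or rare the individual amino acids are.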

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Residue order (rows: first residue; columns: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.346 ± 1.953
AlaCys: 1.166 ± 0.265
AlaAsp: 5.984 ± 0.67
AlaGlu: 6.528 ± 0.993
AlaPhe: 3.808 ± 0.571
AlaGly: 7.46 ± 0.747
AlaHis: 1.787 ± 0.49
AlaIle: 3.575 ± 0.57
AlaLys: 5.207 ± 0.89
AlaLeu: 7.771 ± 0.962
AlaMet: 2.565 ± 0.537
AlaAsn: 3.419 ± 0.532
AlaPro: 3.186 ± 0.468
AlaGln: 3.031 ± 0.593
AlaArg: 4.119 ± 0.479
AlaSer: 5.984 ± 1.02
AlaThr: 5.207 ± 0.722
AlaVal: 7.616 ± 0.906
AlaTrp: 1.088 ± 0.225
AlaTyr: 3.031 ± 0.447
AlaXaa: 0.078 ± 0.08
Cys
CysAla: 0.699 ± 0.206
CysCys: 0.155 ± 0.119
CysAsp: 0.699 ± 0.227
CysGlu: 1.243 ± 0.36
CysPhe: 0.311 ± 0.16
CysGly: 0.622 ± 0.191
CysHis: 0.155 ± 0.11
CysIle: 0.233 ± 0.12
CysLys: 1.01 ± 0.346
CysLeu: 0.933 ± 0.295
CysMet: 0.466 ± 0.205
CysAsn: 0.466 ± 0.239
CysPro: 0.233 ± 0.114
CysGln: 0.155 ± 0.114
CysArg: 0.933 ± 0.291
CysSer: 0.466 ± 0.208
CysThr: 0.544 ± 0.152
CysVal: 0.622 ± 0.226
CysTrp: 0.311 ± 0.162
CysTyr: 0.233 ± 0.122
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.683 ± 0.641
AspCys: 0.777 ± 0.221
AspAsp: 3.886 ± 0.534
AspGlu: 3.73 ± 0.589
AspPhe: 2.953 ± 0.505
AspGly: 5.906 ± 0.799
AspHis: 0.777 ± 0.221
AspIle: 3.264 ± 0.371
AspLys: 3.186 ± 0.405
AspLeu: 4.818 ± 0.555
AspMet: 1.321 ± 0.296
AspAsn: 2.72 ± 0.525
AspPro: 1.71 ± 0.408
AspGln: 0.466 ± 0.194
AspArg: 2.875 ± 0.463
AspSer: 3.575 ± 0.431
AspThr: 4.119 ± 0.563
AspVal: 4.119 ± 0.427
AspTrp: 0.933 ± 0.318
AspTyr: 2.021 ± 0.443
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.606 ± 0.909
GluCys: 0.311 ± 0.151
GluAsp: 3.575 ± 0.584
GluGlu: 4.585 ± 0.779
GluPhe: 3.342 ± 0.824
GluGly: 4.74 ± 0.653
GluHis: 0.933 ± 0.263
GluIle: 3.497 ± 0.422
GluLys: 3.886 ± 0.567
GluLeu: 5.984 ± 0.76
GluMet: 2.72 ± 0.491
GluAsn: 2.254 ± 0.436
GluPro: 1.71 ± 0.522
GluGln: 3.108 ± 0.635
GluArg: 3.73 ± 0.792
GluSer: 3.808 ± 0.546
GluThr: 3.575 ± 0.543
GluVal: 4.507 ± 0.628
GluTrp: 0.777 ± 0.263
GluTyr: 1.787 ± 0.419
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.72 ± 0.565
PheCys: 0.389 ± 0.165
PheAsp: 3.342 ± 0.525
PheGlu: 2.642 ± 0.564
PhePhe: 0.544 ± 0.184
PheGly: 3.031 ± 0.437
PheHis: 0.622 ± 0.206
PheIle: 2.487 ± 0.542
PheLys: 1.71 ± 0.337
PheLeu: 2.487 ± 0.546
PheMet: 0.311 ± 0.159
PheAsn: 1.321 ± 0.382
PhePro: 1.71 ± 0.469
PheGln: 1.399 ± 0.362
PheArg: 2.254 ± 0.333
PheSer: 2.254 ± 0.528
PheThr: 3.575 ± 0.651
PheVal: 2.565 ± 0.521
PheTrp: 0.777 ± 0.233
PheTyr: 1.166 ± 0.345
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.227 ± 0.771
GlyCys: 0.855 ± 0.26
GlyAsp: 4.119 ± 0.732
GlyGlu: 5.44 ± 0.952
GlyPhe: 3.108 ± 0.543
GlyGly: 6.062 ± 0.834
GlyHis: 1.477 ± 0.469
GlyIle: 3.264 ± 0.402
GlyLys: 5.207 ± 0.614
GlyLeu: 5.751 ± 0.527
GlyMet: 2.098 ± 0.576
GlyAsn: 3.652 ± 0.526
GlyPro: 1.787 ± 0.442
GlyGln: 3.108 ± 0.5
GlyArg: 4.663 ± 0.627
GlySer: 4.818 ± 0.716
GlyThr: 4.119 ± 0.638
GlyVal: 5.673 ± 0.709
GlyTrp: 0.933 ± 0.308
GlyTyr: 2.953 ± 0.457
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.166 ± 0.33
HisCys: 0.466 ± 0.194
HisAsp: 0.855 ± 0.225
HisGlu: 0.855 ± 0.257
HisPhe: 0.777 ± 0.299
HisGly: 0.699 ± 0.263
HisHis: 0.699 ± 0.337
HisIle: 1.321 ± 0.291
HisLys: 0.855 ± 0.247
HisLeu: 1.321 ± 0.27
HisMet: 0.389 ± 0.165
HisAsn: 0.466 ± 0.186
HisPro: 1.088 ± 0.304
HisGln: 0.933 ± 0.245
HisArg: 1.088 ± 0.288
HisSer: 1.01 ± 0.272
HisThr: 1.088 ± 0.42
HisVal: 0.544 ± 0.18
HisTrp: 0.078 ± 0.076
HisTyr: 0.699 ± 0.258
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.352 ± 0.711
IleCys: 0.699 ± 0.238
IleAsp: 3.886 ± 0.548
IleGlu: 3.031 ± 0.636
IlePhe: 1.166 ± 0.34
IleGly: 3.342 ± 0.398
IleHis: 0.699 ± 0.176
IleIle: 2.021 ± 0.463
IleLys: 2.72 ± 0.508
IleLeu: 3.031 ± 0.544
IleMet: 1.243 ± 0.336
IleAsn: 1.943 ± 0.454
IlePro: 2.72 ± 0.447
IleGln: 1.787 ± 0.479
IleArg: 2.331 ± 0.288
IleSer: 3.186 ± 0.513
IleThr: 4.119 ± 0.621
IleVal: 3.186 ± 0.472
IleTrp: 0.777 ± 0.251
IleTyr: 1.321 ± 0.372
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.207 ± 0.8
LysCys: 0.699 ± 0.312
LysAsp: 3.652 ± 0.571
LysGlu: 4.119 ± 0.667
LysPhe: 2.176 ± 0.345
LysGly: 3.808 ± 0.5
LysHis: 1.088 ± 0.291
LysIle: 1.477 ± 0.345
LysLys: 3.186 ± 0.597
LysLeu: 5.284 ± 0.638
LysMet: 2.875 ± 0.549
LysAsn: 2.565 ± 0.437
LysPro: 2.254 ± 0.486
LysGln: 2.409 ± 0.462
LysArg: 4.041 ± 0.595
LysSer: 2.798 ± 0.518
LysThr: 4.041 ± 0.512
LysVal: 3.963 ± 0.554
LysTrp: 0.777 ± 0.239
LysTyr: 2.798 ± 0.441
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.994 ± 0.765
LeuCys: 0.933 ± 0.272
LeuAsp: 3.963 ± 0.624
LeuGlu: 4.585 ± 0.72
LeuPhe: 1.71 ± 0.432
LeuGly: 4.585 ± 0.604
LeuHis: 1.166 ± 0.326
LeuIle: 4.274 ± 0.543
LeuLys: 6.062 ± 0.753
LeuLeu: 6.372 ± 0.672
LeuMet: 2.254 ± 0.464
LeuAsn: 4.585 ± 0.666
LeuPro: 3.808 ± 0.603
LeuGln: 2.565 ± 0.351
LeuArg: 5.362 ± 0.689
LeuSer: 4.974 ± 0.592
LeuThr: 5.595 ± 0.468
LeuVal: 5.673 ± 0.578
LeuTrp: 1.399 ± 0.356
LeuTyr: 2.176 ± 0.344
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.487 ± 0.386
MetCys: 0.233 ± 0.123
MetAsp: 1.477 ± 0.436
MetGlu: 1.321 ± 0.277
MetPhe: 1.166 ± 0.272
MetGly: 1.865 ± 0.373
MetHis: 0.389 ± 0.186
MetIle: 0.933 ± 0.27
MetLys: 1.321 ± 0.363
MetLeu: 1.787 ± 0.419
MetMet: 0.622 ± 0.266
MetAsn: 1.243 ± 0.317
MetPro: 1.243 ± 0.385
MetGln: 0.933 ± 0.251
MetArg: 1.477 ± 0.316
MetSer: 1.943 ± 0.405
MetThr: 2.331 ± 0.369
MetVal: 1.787 ± 0.375
MetTrp: 0.466 ± 0.175
MetTyr: 0.777 ± 0.245
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.419 ± 0.497
AsnCys: 0.544 ± 0.225
AsnAsp: 2.72 ± 0.404
AsnGlu: 2.331 ± 0.385
AsnPhe: 1.865 ± 0.36
AsnGly: 4.43 ± 0.698
AsnHis: 0.466 ± 0.206
AsnIle: 3.031 ± 0.445
AsnLys: 1.865 ± 0.43
AsnLeu: 3.652 ± 0.351
AsnMet: 0.466 ± 0.211
AsnAsn: 2.176 ± 0.433
AsnPro: 1.632 ± 0.395
AsnGln: 1.477 ± 0.328
AsnArg: 2.565 ± 0.357
AsnSer: 1.943 ± 0.34
AsnThr: 2.254 ± 0.371
AsnVal: 3.886 ± 0.418
AsnTrp: 0.699 ± 0.22
AsnTyr: 1.243 ± 0.311
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.72 ± 0.523
ProCys: 0.389 ± 0.203
ProAsp: 2.72 ± 0.535
ProGlu: 3.808 ± 0.555
ProPhe: 1.632 ± 0.349
ProGly: 2.953 ± 0.494
ProHis: 0.855 ± 0.223
ProIle: 1.632 ± 0.347
ProLys: 2.72 ± 0.479
ProLeu: 3.419 ± 0.504
ProMet: 0.855 ± 0.284
ProAsn: 1.088 ± 0.412
ProPro: 1.321 ± 0.297
ProGln: 1.166 ± 0.287
ProArg: 2.176 ± 0.384
ProSer: 1.943 ± 0.369
ProThr: 1.477 ± 0.296
ProVal: 3.652 ± 0.57
ProTrp: 0.466 ± 0.237
ProTyr: 1.632 ± 0.399
ProXaa: 0.078 ± 0.084
Gln
GlnAla: 4.274 ± 0.616
GlnCys: 0.233 ± 0.118
GlnAsp: 1.554 ± 0.333
GlnGlu: 2.331 ± 0.583
GlnPhe: 1.321 ± 0.36
GlnGly: 2.021 ± 0.461
GlnHis: 0.544 ± 0.21
GlnIle: 1.554 ± 0.386
GlnLys: 2.098 ± 0.376
GlnLeu: 2.72 ± 0.469
GlnMet: 1.166 ± 0.286
GlnAsn: 1.787 ± 0.346
GlnPro: 1.943 ± 0.313
GlnGln: 1.943 ± 0.446
GlnArg: 1.632 ± 0.363
GlnSer: 1.943 ± 0.335
GlnThr: 1.632 ± 0.303
GlnVal: 2.72 ± 0.368
GlnTrp: 0.622 ± 0.204
GlnTyr: 1.477 ± 0.296
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.74 ± 0.498
ArgCys: 0.466 ± 0.174
ArgAsp: 3.342 ± 0.409
ArgGlu: 3.652 ± 0.633
ArgPhe: 2.021 ± 0.442
ArgGly: 4.352 ± 0.574
ArgHis: 1.01 ± 0.275
ArgIle: 3.186 ± 0.484
ArgLys: 3.963 ± 0.552
ArgLeu: 4.274 ± 0.525
ArgMet: 1.787 ± 0.345
ArgAsn: 3.108 ± 0.532
ArgPro: 2.021 ± 0.342
ArgGln: 3.031 ± 0.533
ArgArg: 4.585 ± 0.688
ArgSer: 2.021 ± 0.246
ArgThr: 2.642 ± 0.404
ArgVal: 4.119 ± 0.53
ArgTrp: 1.01 ± 0.258
ArgTyr: 1.477 ± 0.449
ArgXaa: 0.078 ± 0.086
Ser
SerAla: 5.984 ± 1.002
SerCys: 0.389 ± 0.239
SerAsp: 2.953 ± 0.511
SerGlu: 3.031 ± 0.541
SerPhe: 2.176 ± 0.418
SerGly: 6.683 ± 0.548
SerHis: 0.777 ± 0.198
SerIle: 2.565 ± 0.497
SerLys: 2.487 ± 0.419
SerLeu: 5.44 ± 0.632
SerMet: 1.166 ± 0.306
SerAsn: 2.72 ± 0.352
SerPro: 1.787 ± 0.448
SerGln: 1.943 ± 0.39
SerArg: 3.186 ± 0.553
SerSer: 3.575 ± 0.559
SerThr: 4.352 ± 0.569
SerVal: 5.207 ± 0.796
SerTrp: 0.933 ± 0.212
SerTyr: 1.943 ± 0.373
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.062 ± 0.679
ThrCys: 0.389 ± 0.185
ThrAsp: 4.352 ± 0.597
ThrGlu: 3.342 ± 0.459
ThrPhe: 2.953 ± 0.509
ThrGly: 5.751 ± 0.77
ThrHis: 1.01 ± 0.275
ThrIle: 2.875 ± 0.468
ThrLys: 3.497 ± 0.56
ThrLeu: 4.896 ± 0.73
ThrMet: 1.166 ± 0.358
ThrAsn: 1.787 ± 0.39
ThrPro: 4.119 ± 0.529
ThrGln: 1.943 ± 0.351
ThrArg: 3.264 ± 0.434
ThrSer: 4.585 ± 0.582
ThrThr: 4.196 ± 0.575
ThrVal: 4.818 ± 0.732
ThrTrp: 0.855 ± 0.339
ThrTyr: 2.565 ± 0.467
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.383 ± 0.893
ValCys: 0.777 ± 0.2
ValAsp: 3.963 ± 0.475
ValGlu: 5.751 ± 0.606
ValPhe: 2.254 ± 0.501
ValGly: 4.196 ± 0.659
ValHis: 0.855 ± 0.217
ValIle: 4.43 ± 0.609
ValLys: 5.051 ± 0.76
ValLeu: 4.818 ± 0.589
ValMet: 0.933 ± 0.297
ValAsn: 3.497 ± 0.667
ValPro: 2.72 ± 0.656
ValGln: 2.021 ± 0.332
ValArg: 3.497 ± 0.519
ValSer: 5.751 ± 0.678
ValThr: 6.45 ± 0.863
ValVal: 5.284 ± 0.81
ValTrp: 1.01 ± 0.303
ValTyr: 2.565 ± 0.342
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.933 ± 0.416
TrpCys: 0.155 ± 0.118
TrpAsp: 0.777 ± 0.27
TrpGlu: 0.389 ± 0.172
TrpPhe: 0.777 ± 0.315
TrpGly: 1.088 ± 0.278
TrpHis: 0.311 ± 0.19
TrpIle: 0.544 ± 0.202
TrpLys: 0.622 ± 0.246
TrpLeu: 1.943 ± 0.452
TrpMet: 0.389 ± 0.158
TrpAsn: 0.622 ± 0.281
TrpPro: 0.544 ± 0.24
TrpGln: 0.622 ± 0.224
TrpArg: 1.321 ± 0.384
TrpSer: 0.622 ± 0.191
TrpThr: 0.855 ± 0.254
TrpVal: 1.399 ± 0.309
TrpTrp: 0.389 ± 0.323
TrpTyr: 0.311 ± 0.191
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.875 ± 0.474
TyrCys: 0.466 ± 0.182
TyrAsp: 2.021 ± 0.561
TyrGlu: 2.72 ± 0.444
TyrPhe: 1.243 ± 0.332
TyrGly: 2.72 ± 0.356
TyrHis: 0.777 ± 0.24
TyrIle: 1.399 ± 0.306
TyrLys: 2.642 ± 0.504
TyrLeu: 2.331 ± 0.448
TyrMet: 0.855 ± 0.18
TyrAsn: 1.243 ± 0.319
TyrPro: 1.166 ± 0.355
TyrGln: 1.477 ± 0.357
TyrArg: 1.787 ± 0.534
TyrSer: 2.098 ± 0.388
TyrThr: 2.254 ± 0.394
TyrVal: 1.865 ± 0.313
TyrTrp: 0.233 ± 0.191
TyrTyr: 1.399 ± 0.289
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.078 ± 0.08
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.078 ± 0.084
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.078 ± 0.086
XaaTyr: 0.0 ± 0.0
XaaXaa: 9.559 ± 6.386
Statistics based on 59 proteins (12869 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
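One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement and recompute the frequency each time. The sketch below follows that reading; it is not the database's actual procedure. The helper names, the toy sequences, the fixed seed, and the use of the standard deviation across replicates as the reported error are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement.
    Using the standard deviation as the error is an assumption.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences invented for illustration (one-letter codes).
toy = ["MAAGK", "AAKL", "MKKA", "GAAA"]
err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins in the proteome.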

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski