Amino acid dipeptide frequency for Staphylococcus phage tp310-3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
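As a rough illustration of what such a calculation could look like, the sketch below counts overlapping adjacent residue pairs in a list of protein sequences and reports them in per mille. It is not the Proteome-pI implementation: the function names are hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), pairs are assumed not to span two proteins, and normalizing by the total number of dipeptides is an assumption that the database may not share.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(sequences):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs are taken inside one protein only, never across proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille of all counted dipeptides."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}
```

For example, `dipeptide_permille(["AAK", "KA"])` gives three dipeptides (AA, AK, KA), each with one third of the total.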

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
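The expectation and the red/blue marking can be sketched as follows. This is only an assumed reading of the rule: the page does not say whether the threshold is based on 400 or 441 dipeptides, nor whether the bootstrap error is taken into account, so the function names and the simple comparison below are hypothetical.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected frequency (in per mille) if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = uniform_expectation_permille(alphabet_size)
    if observed_permille > expected:
        return "over"   # shown in red on the page
    if observed_permille < expected:
        return "under"  # shown in blue on the page
    return "as expected"
```

With 20 amino acids the expectation is 2.5‰, so LysLeu (8.464‰) would be classified as overrepresented and AlaCys (0.36‰) as underrepresented. With 21 symbols the expectation drops to about 2.27‰.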

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
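The unit conversion is simple arithmetic; a minimal sketch (with hypothetical helper names) is:

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0
```

For instance, the AlaAla value of 1.891‰ corresponds to a fraction of 0.001891, and 2.5‰ corresponds to 0.25%.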

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry: dipeptide (first residue, second residue): frequency ± error, in ‰
Ala
AlaAla: 1.891 ± 1.014
AlaCys: 0.36 ± 0.178
AlaAsp: 1.711 ± 0.371
AlaGlu: 3.241 ± 0.691
AlaPhe: 2.251 ± 0.63
AlaGly: 3.872 ± 0.851
AlaHis: 0.63 ± 0.289
AlaIle: 5.042 ± 0.761
AlaLys: 4.862 ± 0.596
AlaLeu: 5.042 ± 0.904
AlaMet: 1.801 ± 0.497
AlaAsn: 3.061 ± 0.582
AlaPro: 1.171 ± 0.275
AlaGln: 2.521 ± 0.484
AlaArg: 3.602 ± 0.422
AlaSer: 3.422 ± 0.616
AlaThr: 3.962 ± 0.6
AlaVal: 3.602 ± 0.548
AlaTrp: 0.27 ± 0.132
AlaTyr: 2.341 ± 0.379
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.36 ± 0.16
CysCys: 0.0 ± 0.0
CysAsp: 0.27 ± 0.192
CysGlu: 0.45 ± 0.264
CysPhe: 0.54 ± 0.265
CysGly: 0.36 ± 0.217
CysHis: 0.18 ± 0.113
CysIle: 0.81 ± 0.256
CysLys: 0.36 ± 0.187
CysLeu: 0.54 ± 0.169
CysMet: 0.0 ± 0.0
CysAsn: 0.27 ± 0.145
CysPro: 0.0 ± 0.0
CysGln: 0.18 ± 0.114
CysArg: 0.63 ± 0.274
CysSer: 0.54 ± 0.223
CysThr: 0.27 ± 0.146
CysVal: 0.45 ± 0.197
CysTrp: 0.0 ± 0.0
CysTyr: 0.54 ± 0.18
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.061 ± 0.513
AspCys: 0.54 ± 0.184
AspAsp: 3.872 ± 0.707
AspGlu: 5.312 ± 0.804
AspPhe: 2.701 ± 0.434
AspGly: 3.872 ± 0.667
AspHis: 0.81 ± 0.256
AspIle: 4.412 ± 0.641
AspLys: 5.853 ± 0.846
AspLeu: 5.042 ± 0.725
AspMet: 1.891 ± 0.372
AspAsn: 3.692 ± 0.555
AspPro: 1.351 ± 0.451
AspGln: 0.9 ± 0.234
AspArg: 2.341 ± 0.551
AspSer: 3.332 ± 0.529
AspThr: 4.052 ± 0.709
AspVal: 3.512 ± 0.752
AspTrp: 0.45 ± 0.22
AspTyr: 3.151 ± 0.383
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.962 ± 0.702
GluCys: 0.63 ± 0.252
GluAsp: 3.962 ± 0.668
GluGlu: 6.753 ± 1.335
GluPhe: 3.061 ± 0.536
GluGly: 2.881 ± 0.442
GluHis: 1.711 ± 0.359
GluIle: 6.213 ± 1.0
GluLys: 6.933 ± 1.057
GluLeu: 7.834 ± 0.862
GluMet: 2.701 ± 0.527
GluAsn: 4.952 ± 0.725
GluPro: 2.071 ± 0.338
GluGln: 4.322 ± 0.83
GluArg: 3.782 ± 0.61
GluSer: 4.142 ± 0.641
GluThr: 3.422 ± 0.594
GluVal: 4.232 ± 0.64
GluTrp: 0.72 ± 0.24
GluTyr: 4.052 ± 0.932
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.341 ± 0.394
PheCys: 0.27 ± 0.178
PheAsp: 2.611 ± 0.456
PheGlu: 2.431 ± 0.403
PhePhe: 0.81 ± 0.213
PheGly: 2.611 ± 0.586
PheHis: 1.08 ± 0.282
PheIle: 2.881 ± 0.693
PheLys: 3.782 ± 0.531
PheLeu: 2.431 ± 0.512
PheMet: 1.351 ± 0.374
PheAsn: 3.782 ± 0.494
PhePro: 0.9 ± 0.313
PheGln: 1.08 ± 0.274
PheArg: 1.621 ± 0.301
PheSer: 2.071 ± 0.477
PheThr: 2.611 ± 0.518
PheVal: 1.981 ± 0.435
PheTrp: 0.18 ± 0.115
PheTyr: 1.351 ± 0.344
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.151 ± 0.887
GlyCys: 0.45 ± 0.194
GlyAsp: 2.791 ± 0.469
GlyGlu: 3.241 ± 0.414
GlyPhe: 2.701 ± 0.614
GlyGly: 3.422 ± 0.816
GlyHis: 0.99 ± 0.247
GlyIle: 4.412 ± 0.981
GlyLys: 6.483 ± 0.776
GlyLeu: 5.673 ± 0.989
GlyMet: 1.351 ± 0.455
GlyAsn: 3.332 ± 0.558
GlyPro: 0.9 ± 0.36
GlyGln: 1.891 ± 0.543
GlyArg: 2.431 ± 0.442
GlySer: 3.332 ± 0.527
GlyThr: 3.962 ± 0.475
GlyVal: 3.241 ± 0.721
GlyTrp: 1.08 ± 0.318
GlyTyr: 3.151 ± 0.514
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.9 ± 0.327
HisCys: 0.18 ± 0.131
HisAsp: 0.9 ± 0.287
HisGlu: 1.531 ± 0.367
HisPhe: 0.99 ± 0.229
HisGly: 0.54 ± 0.242
HisHis: 0.36 ± 0.267
HisIle: 1.891 ± 0.451
HisLys: 1.351 ± 0.3
HisLeu: 1.711 ± 0.421
HisMet: 0.27 ± 0.167
HisAsn: 1.08 ± 0.291
HisPro: 0.18 ± 0.138
HisGln: 0.54 ± 0.18
HisArg: 0.72 ± 0.194
HisSer: 1.171 ± 0.284
HisThr: 0.99 ± 0.403
HisVal: 0.72 ± 0.226
HisTrp: 0.27 ± 0.134
HisTyr: 0.99 ± 0.328
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.232 ± 0.563
IleCys: 0.36 ± 0.178
IleAsp: 4.772 ± 0.586
IleGlu: 7.023 ± 0.906
IlePhe: 2.791 ± 0.569
IleGly: 3.872 ± 0.668
IleHis: 1.171 ± 0.314
IleIle: 4.502 ± 0.651
IleLys: 7.744 ± 0.92
IleLeu: 4.682 ± 0.599
IleMet: 1.981 ± 0.491
IleAsn: 6.123 ± 0.845
IlePro: 1.801 ± 0.302
IleGln: 3.422 ± 0.579
IleArg: 3.061 ± 0.558
IleSer: 4.772 ± 0.658
IleThr: 4.952 ± 0.555
IleVal: 4.952 ± 0.608
IleTrp: 1.08 ± 0.412
IleTyr: 2.791 ± 0.552
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.033 ± 0.681
LysCys: 0.36 ± 0.185
LysAsp: 5.402 ± 0.603
LysGlu: 7.654 ± 1.097
LysPhe: 4.232 ± 0.584
LysGly: 6.843 ± 0.78
LysHis: 1.801 ± 0.486
LysIle: 6.753 ± 0.677
LysLys: 7.924 ± 1.034
LysLeu: 8.464 ± 0.889
LysMet: 2.521 ± 0.43
LysAsn: 5.132 ± 0.656
LysPro: 2.971 ± 0.6
LysGln: 4.412 ± 0.749
LysArg: 3.782 ± 0.453
LysSer: 5.763 ± 0.68
LysThr: 6.033 ± 0.915
LysVal: 5.132 ± 0.72
LysTrp: 1.261 ± 0.342
LysTyr: 4.232 ± 0.597
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.862 ± 0.713
LeuCys: 0.54 ± 0.221
LeuAsp: 5.583 ± 0.739
LeuGlu: 6.573 ± 0.875
LeuPhe: 2.971 ± 0.472
LeuGly: 4.682 ± 0.897
LeuHis: 0.9 ± 0.292
LeuIle: 5.312 ± 0.532
LeuLys: 8.374 ± 1.067
LeuLeu: 6.483 ± 0.799
LeuMet: 1.08 ± 0.301
LeuAsn: 5.493 ± 0.557
LeuPro: 2.701 ± 0.466
LeuGln: 2.881 ± 0.353
LeuArg: 4.142 ± 0.554
LeuSer: 5.943 ± 0.68
LeuThr: 4.052 ± 0.565
LeuVal: 4.142 ± 0.584
LeuTrp: 0.81 ± 0.241
LeuTyr: 2.881 ± 0.443
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.261 ± 0.364
MetCys: 0.27 ± 0.151
MetAsp: 1.351 ± 0.311
MetGlu: 1.891 ± 0.541
MetPhe: 0.99 ± 0.318
MetGly: 0.72 ± 0.383
MetHis: 0.45 ± 0.207
MetIle: 1.531 ± 0.343
MetLys: 3.512 ± 0.619
MetLeu: 1.711 ± 0.397
MetMet: 0.81 ± 0.293
MetAsn: 1.981 ± 0.44
MetPro: 1.441 ± 0.405
MetGln: 0.81 ± 0.291
MetArg: 1.351 ± 0.42
MetSer: 2.161 ± 0.341
MetThr: 1.531 ± 0.35
MetVal: 1.351 ± 0.285
MetTrp: 0.54 ± 0.197
MetTyr: 1.08 ± 0.272
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.422 ± 0.616
AsnCys: 0.36 ± 0.14
AsnAsp: 3.872 ± 0.712
AsnGlu: 6.123 ± 1.036
AsnPhe: 1.531 ± 0.345
AsnGly: 5.132 ± 0.797
AsnHis: 0.81 ± 0.271
AsnIle: 5.042 ± 0.589
AsnLys: 5.763 ± 0.748
AsnLeu: 5.042 ± 0.737
AsnMet: 1.711 ± 0.28
AsnAsn: 4.862 ± 0.65
AsnPro: 2.161 ± 0.401
AsnGln: 2.881 ± 0.413
AsnArg: 2.701 ± 0.513
AsnSer: 3.151 ± 0.483
AsnThr: 3.782 ± 0.477
AsnVal: 3.151 ± 0.549
AsnTrp: 1.261 ± 0.426
AsnTyr: 2.611 ± 0.402
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.9 ± 0.267
ProCys: 0.0 ± 0.0
ProAsp: 1.441 ± 0.452
ProGlu: 1.801 ± 0.498
ProPhe: 0.99 ± 0.356
ProGly: 1.261 ± 0.277
ProHis: 0.63 ± 0.245
ProIle: 2.161 ± 0.358
ProLys: 2.611 ± 0.619
ProLeu: 2.341 ± 0.541
ProMet: 1.08 ± 0.257
ProAsn: 1.711 ± 0.429
ProPro: 0.9 ± 0.246
ProGln: 0.72 ± 0.241
ProArg: 0.9 ± 0.273
ProSer: 1.801 ± 0.415
ProThr: 1.981 ± 0.455
ProVal: 1.531 ± 0.305
ProTrp: 0.18 ± 0.117
ProTyr: 0.81 ± 0.255
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.332 ± 0.515
GlnCys: 0.54 ± 0.237
GlnAsp: 2.251 ± 0.596
GlnGlu: 3.061 ± 0.672
GlnPhe: 0.99 ± 0.241
GlnGly: 1.621 ± 0.352
GlnHis: 0.72 ± 0.224
GlnIle: 2.701 ± 0.417
GlnLys: 3.602 ± 0.664
GlnLeu: 2.881 ± 0.382
GlnMet: 0.99 ± 0.327
GlnAsn: 2.161 ± 0.383
GlnPro: 0.81 ± 0.229
GlnGln: 1.351 ± 0.391
GlnArg: 2.071 ± 0.403
GlnSer: 2.071 ± 0.516
GlnThr: 1.801 ± 0.337
GlnVal: 2.161 ± 0.487
GlnTrp: 0.18 ± 0.114
GlnTyr: 2.341 ± 0.416
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.431 ± 0.518
ArgCys: 0.36 ± 0.214
ArgAsp: 3.241 ± 0.522
ArgGlu: 4.322 ± 0.556
ArgPhe: 1.621 ± 0.35
ArgGly: 1.981 ± 0.468
ArgHis: 1.261 ± 0.274
ArgIle: 3.061 ± 0.513
ArgLys: 4.232 ± 0.65
ArgLeu: 3.512 ± 0.532
ArgMet: 1.171 ± 0.356
ArgAsn: 2.881 ± 0.415
ArgPro: 0.81 ± 0.233
ArgGln: 1.351 ± 0.386
ArgArg: 1.891 ± 0.452
ArgSer: 2.161 ± 0.411
ArgThr: 2.431 ± 0.38
ArgVal: 2.791 ± 0.428
ArgTrp: 0.45 ± 0.185
ArgTyr: 2.611 ± 0.5
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.962 ± 0.647
SerCys: 0.27 ± 0.166
SerAsp: 4.412 ± 0.642
SerGlu: 4.682 ± 0.835
SerPhe: 2.521 ± 0.595
SerGly: 3.512 ± 0.823
SerHis: 1.261 ± 0.321
SerIle: 5.222 ± 0.563
SerLys: 5.583 ± 0.969
SerLeu: 4.322 ± 0.585
SerMet: 1.171 ± 0.306
SerAsn: 4.412 ± 0.591
SerPro: 0.81 ± 0.244
SerGln: 1.801 ± 0.549
SerArg: 2.431 ± 0.48
SerSer: 3.512 ± 0.668
SerThr: 3.422 ± 0.461
SerVal: 2.881 ± 0.65
SerTrp: 0.54 ± 0.17
SerTyr: 2.521 ± 0.572
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.241 ± 0.464
ThrCys: 0.36 ± 0.173
ThrAsp: 5.042 ± 0.68
ThrGlu: 4.232 ± 0.798
ThrPhe: 1.711 ± 0.33
ThrGly: 4.412 ± 0.778
ThrHis: 1.08 ± 0.233
ThrIle: 4.862 ± 0.653
ThrLys: 5.312 ± 0.64
ThrLeu: 3.602 ± 0.442
ThrMet: 1.351 ± 0.349
ThrAsn: 3.241 ± 0.636
ThrPro: 1.981 ± 0.481
ThrGln: 1.351 ± 0.299
ThrArg: 2.701 ± 0.512
ThrSer: 3.602 ± 0.607
ThrThr: 3.692 ± 0.481
ThrVal: 3.872 ± 0.57
ThrTrp: 0.9 ± 0.269
ThrTyr: 3.332 ± 0.695
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.701 ± 0.519
ValCys: 0.45 ± 0.219
ValAsp: 3.061 ± 0.548
ValGlu: 3.872 ± 0.643
ValPhe: 2.071 ± 0.486
ValGly: 3.872 ± 0.798
ValHis: 0.36 ± 0.189
ValIle: 4.772 ± 0.628
ValLys: 6.393 ± 0.868
ValLeu: 4.862 ± 0.647
ValMet: 1.621 ± 0.475
ValAsn: 3.602 ± 0.508
ValPro: 1.711 ± 0.321
ValGln: 1.801 ± 0.406
ValArg: 1.801 ± 0.496
ValSer: 3.332 ± 0.624
ValThr: 4.502 ± 0.842
ValVal: 4.412 ± 0.651
ValTrp: 0.36 ± 0.16
ValTyr: 2.161 ± 0.377
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.36 ± 0.179
TrpCys: 0.09 ± 0.086
TrpAsp: 0.81 ± 0.305
TrpGlu: 0.45 ± 0.164
TrpPhe: 0.54 ± 0.177
TrpGly: 0.45 ± 0.168
TrpHis: 0.09 ± 0.086
TrpIle: 1.171 ± 0.254
TrpLys: 0.9 ± 0.296
TrpLeu: 0.63 ± 0.305
TrpMet: 0.63 ± 0.217
TrpAsn: 1.08 ± 0.456
TrpPro: 0.18 ± 0.117
TrpGln: 0.72 ± 0.17
TrpArg: 0.45 ± 0.181
TrpSer: 0.45 ± 0.174
TrpThr: 0.54 ± 0.172
TrpVal: 1.08 ± 0.285
TrpTrp: 0.27 ± 0.14
TrpTyr: 0.54 ± 0.232
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.251 ± 0.379
TyrCys: 0.36 ± 0.185
TyrAsp: 2.791 ± 0.433
TyrGlu: 3.692 ± 0.543
TyrPhe: 2.161 ± 0.455
TyrGly: 2.071 ± 0.489
TyrHis: 0.99 ± 0.279
TyrIle: 3.332 ± 0.574
TyrLys: 5.132 ± 0.689
TyrLeu: 3.602 ± 0.677
TyrMet: 1.171 ± 0.332
TyrAsn: 2.521 ± 0.47
TyrPro: 0.9 ± 0.233
TyrGln: 2.701 ± 0.488
TyrArg: 2.161 ± 0.563
TyrSer: 2.521 ± 0.544
TyrThr: 1.891 ± 0.448
TyrVal: 2.521 ± 0.428
TyrTrp: 0.63 ± 0.23
TyrTyr: 1.711 ± 0.51
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (11107 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
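One plausible reading of this note is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function names are hypothetical, and using the standard deviation as the error measure is an assumption; the page does not state which statistic is reported.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille of all dipeptides counted."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)
```

Resampling at the protein level keeps each protein's internal composition intact, so the error reflects variation between proteins rather than between individual residues.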

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski