Amino acid dipepetide frequency for Streptococcus satellite phage Javan410

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.503AlaAla: 0.503 ± 0.339
1.761AlaCys: 1.761 ± 0.621
3.019AlaAsp: 3.019 ± 0.836
3.774AlaGlu: 3.774 ± 0.953
1.761AlaPhe: 1.761 ± 0.453
2.264AlaGly: 2.264 ± 0.774
0.0AlaHis: 0.0 ± 0.0
4.528AlaIle: 4.528 ± 1.658
4.528AlaLys: 4.528 ± 1.404
3.522AlaLeu: 3.522 ± 0.947
1.761AlaMet: 1.761 ± 0.679
2.516AlaAsn: 2.516 ± 0.715
1.006AlaPro: 1.006 ± 0.427
2.767AlaGln: 2.767 ± 1.03
2.013AlaArg: 2.013 ± 0.788
3.774AlaSer: 3.774 ± 1.061
3.774AlaThr: 3.774 ± 1.269
4.277AlaVal: 4.277 ± 1.157
0.755AlaTrp: 0.755 ± 0.373
1.509AlaTyr: 1.509 ± 0.546
0.0AlaXaa: 0.0 ± 0.0
Cys
0.755CysAla: 0.755 ± 0.525
0.0CysCys: 0.0 ± 0.0
0.755CysAsp: 0.755 ± 0.44
0.252CysGlu: 0.252 ± 0.299
0.0CysPhe: 0.0 ± 0.0
1.258CysGly: 1.258 ± 0.497
1.006CysHis: 1.006 ± 0.594
0.503CysIle: 0.503 ± 0.3
0.252CysLys: 0.252 ± 0.298
1.006CysLeu: 1.006 ± 0.61
0.0CysMet: 0.0 ± 0.0
0.252CysAsn: 0.252 ± 0.19
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.252CysArg: 0.252 ± 0.229
0.252CysSer: 0.252 ± 0.231
0.0CysThr: 0.0 ± 0.0
0.252CysVal: 0.252 ± 0.299
0.0CysTrp: 0.0 ± 0.0
0.252CysTyr: 0.252 ± 0.221
0.0CysXaa: 0.0 ± 0.0
Asp
2.767AspAla: 2.767 ± 1.035
0.755AspCys: 0.755 ± 0.407
3.774AspAsp: 3.774 ± 1.247
4.528AspGlu: 4.528 ± 0.899
4.78AspPhe: 4.78 ± 1.361
2.264AspGly: 2.264 ± 0.773
1.006AspHis: 1.006 ± 0.463
3.019AspIle: 3.019 ± 0.877
6.541AspLys: 6.541 ± 1.242
5.786AspLeu: 5.786 ± 0.905
1.006AspMet: 1.006 ± 0.486
6.541AspAsn: 6.541 ± 0.938
0.503AspPro: 0.503 ± 0.286
1.761AspGln: 1.761 ± 0.633
1.258AspArg: 1.258 ± 0.564
3.522AspSer: 3.522 ± 1.001
3.774AspThr: 3.774 ± 0.848
2.516AspVal: 2.516 ± 0.865
0.0AspTrp: 0.0 ± 0.0
6.038AspTyr: 6.038 ± 1.403
0.0AspXaa: 0.0 ± 0.0
Glu
5.031GluAla: 5.031 ± 1.151
0.0GluCys: 0.0 ± 0.0
4.277GluAsp: 4.277 ± 0.718
10.314GluGlu: 10.314 ± 1.919
2.013GluPhe: 2.013 ± 0.629
2.013GluGly: 2.013 ± 0.62
1.258GluHis: 1.258 ± 0.682
7.296GluIle: 7.296 ± 1.842
8.805GluLys: 8.805 ± 1.49
8.05GluLeu: 8.05 ± 1.312
2.264GluMet: 2.264 ± 0.792
5.283GluAsn: 5.283 ± 1.269
1.006GluPro: 1.006 ± 0.512
2.264GluGln: 2.264 ± 0.81
1.761GluArg: 1.761 ± 0.673
4.277GluSer: 4.277 ± 1.052
4.277GluThr: 4.277 ± 0.852
4.528GluVal: 4.528 ± 1.099
1.258GluTrp: 1.258 ± 0.479
2.013GluTyr: 2.013 ± 0.874
0.0GluXaa: 0.0 ± 0.0
Phe
1.761PheAla: 1.761 ± 0.846
0.252PheCys: 0.252 ± 0.19
3.522PheAsp: 3.522 ± 1.014
3.774PheGlu: 3.774 ± 1.204
1.761PhePhe: 1.761 ± 0.834
1.258PheGly: 1.258 ± 0.352
0.252PheHis: 0.252 ± 0.23
4.025PheIle: 4.025 ± 0.779
3.774PheLys: 3.774 ± 1.405
4.528PheLeu: 4.528 ± 1.405
1.761PheMet: 1.761 ± 0.672
3.27PheAsn: 3.27 ± 0.818
1.006PhePro: 1.006 ± 0.51
0.503PheGln: 0.503 ± 0.309
0.503PheArg: 0.503 ± 0.379
4.277PheSer: 4.277 ± 1.055
1.761PheThr: 1.761 ± 0.566
2.264PheVal: 2.264 ± 0.899
0.0PheTrp: 0.0 ± 0.0
2.516PheTyr: 2.516 ± 1.029
0.0PheXaa: 0.0 ± 0.0
Gly
2.013GlyAla: 2.013 ± 0.775
0.0GlyCys: 0.0 ± 0.0
2.516GlyAsp: 2.516 ± 1.037
2.264GlyGlu: 2.264 ± 0.982
2.013GlyPhe: 2.013 ± 0.881
2.516GlyGly: 2.516 ± 0.925
1.258GlyHis: 1.258 ± 0.581
3.27GlyIle: 3.27 ± 0.812
3.774GlyLys: 3.774 ± 1.123
6.038GlyLeu: 6.038 ± 1.265
0.755GlyMet: 0.755 ± 0.437
2.767GlyAsn: 2.767 ± 0.742
0.252GlyPro: 0.252 ± 0.304
0.755GlyGln: 0.755 ± 0.37
2.264GlyArg: 2.264 ± 0.596
3.27GlySer: 3.27 ± 0.718
2.264GlyThr: 2.264 ± 0.69
2.264GlyVal: 2.264 ± 0.743
0.252GlyTrp: 0.252 ± 0.19
1.761GlyTyr: 1.761 ± 0.698
0.0GlyXaa: 0.0 ± 0.0
His
0.755HisAla: 0.755 ± 0.69
0.0HisCys: 0.0 ± 0.0
0.503HisAsp: 0.503 ± 0.283
1.006HisGlu: 1.006 ± 0.66
1.006HisPhe: 1.006 ± 0.523
0.252HisGly: 0.252 ± 0.25
0.503HisHis: 0.503 ± 0.387
0.755HisIle: 0.755 ± 0.495
1.258HisLys: 1.258 ± 0.48
1.006HisLeu: 1.006 ± 0.485
1.258HisMet: 1.258 ± 0.641
0.503HisAsn: 0.503 ± 0.358
0.252HisPro: 0.252 ± 0.19
0.252HisGln: 0.252 ± 0.234
0.252HisArg: 0.252 ± 0.249
0.503HisSer: 0.503 ± 0.296
1.258HisThr: 1.258 ± 0.486
0.252HisVal: 0.252 ± 0.25
0.0HisTrp: 0.0 ± 0.0
1.006HisTyr: 1.006 ± 0.445
0.0HisXaa: 0.0 ± 0.0
Ile
4.277IleAla: 4.277 ± 1.087
0.252IleCys: 0.252 ± 0.299
5.031IleAsp: 5.031 ± 1.185
5.283IleGlu: 5.283 ± 1.089
3.27IlePhe: 3.27 ± 1.608
3.774IleGly: 3.774 ± 1.031
0.503IleHis: 0.503 ± 0.324
7.799IleIle: 7.799 ± 1.478
6.541IleLys: 6.541 ± 1.796
6.289IleLeu: 6.289 ± 1.234
0.755IleMet: 0.755 ± 0.442
4.528IleAsn: 4.528 ± 0.862
2.264IlePro: 2.264 ± 0.676
2.264IleGln: 2.264 ± 0.576
3.019IleArg: 3.019 ± 0.664
5.031IleSer: 5.031 ± 1.419
6.541IleThr: 6.541 ± 1.439
4.277IleVal: 4.277 ± 1.367
0.252IleTrp: 0.252 ± 0.231
2.013IleTyr: 2.013 ± 0.538
0.0IleXaa: 0.0 ± 0.0
Lys
4.78LysAla: 4.78 ± 1.455
0.503LysCys: 0.503 ± 0.401
4.277LysAsp: 4.277 ± 1.082
8.302LysGlu: 8.302 ± 2.082
4.025LysPhe: 4.025 ± 1.019
2.516LysGly: 2.516 ± 0.72
1.761LysHis: 1.761 ± 0.759
7.296LysIle: 7.296 ± 1.187
14.088LysLys: 14.088 ± 2.754
8.05LysLeu: 8.05 ± 1.525
3.774LysMet: 3.774 ± 1.043
7.296LysAsn: 7.296 ± 1.418
3.774LysPro: 3.774 ± 0.964
7.296LysGln: 7.296 ± 1.477
2.264LysArg: 2.264 ± 0.784
6.289LysSer: 6.289 ± 1.129
7.296LysThr: 7.296 ± 1.254
4.528LysVal: 4.528 ± 0.862
1.006LysTrp: 1.006 ± 0.539
3.522LysTyr: 3.522 ± 1.12
0.0LysXaa: 0.0 ± 0.0
Leu
4.277LeuAla: 4.277 ± 0.987
0.755LeuCys: 0.755 ± 0.432
7.044LeuAsp: 7.044 ± 1.45
5.283LeuGlu: 5.283 ± 1.126
4.277LeuPhe: 4.277 ± 1.438
5.535LeuGly: 5.535 ± 1.792
0.503LeuHis: 0.503 ± 0.36
8.05LeuIle: 8.05 ± 1.521
10.314LeuLys: 10.314 ± 1.884
9.057LeuLeu: 9.057 ± 1.776
2.264LeuMet: 2.264 ± 0.62
4.025LeuAsn: 4.025 ± 0.898
3.019LeuPro: 3.019 ± 0.738
2.264LeuGln: 2.264 ± 0.873
2.264LeuArg: 2.264 ± 0.884
6.541LeuSer: 6.541 ± 0.92
5.786LeuThr: 5.786 ± 1.419
5.031LeuVal: 5.031 ± 1.157
1.509LeuTrp: 1.509 ± 0.621
4.025LeuTyr: 4.025 ± 1.159
0.0LeuXaa: 0.0 ± 0.0
Met
1.761MetAla: 1.761 ± 0.662
0.252MetCys: 0.252 ± 0.23
1.006MetAsp: 1.006 ± 0.594
2.516MetGlu: 2.516 ± 0.818
0.755MetPhe: 0.755 ± 0.458
1.006MetGly: 1.006 ± 0.507
1.006MetHis: 1.006 ± 0.578
2.767MetIle: 2.767 ± 1.32
4.025MetLys: 4.025 ± 1.053
1.006MetLeu: 1.006 ± 0.502
2.013MetMet: 2.013 ± 0.681
1.509MetAsn: 1.509 ± 0.536
0.503MetPro: 0.503 ± 0.297
1.006MetGln: 1.006 ± 0.671
0.755MetArg: 0.755 ± 0.431
1.761MetSer: 1.761 ± 0.85
2.013MetThr: 2.013 ± 0.8
1.258MetVal: 1.258 ± 0.684
0.252MetTrp: 0.252 ± 0.304
1.006MetTyr: 1.006 ± 0.565
0.0MetXaa: 0.0 ± 0.0
Asn
3.522AsnAla: 3.522 ± 0.923
0.0AsnCys: 0.0 ± 0.0
3.27AsnAsp: 3.27 ± 0.851
6.038AsnGlu: 6.038 ± 0.988
2.767AsnPhe: 2.767 ± 0.975
4.78AsnGly: 4.78 ± 1.185
1.006AsnHis: 1.006 ± 0.469
3.774AsnIle: 3.774 ± 0.834
5.535AsnLys: 5.535 ± 1.085
6.038AsnLeu: 6.038 ± 1.088
2.013AsnMet: 2.013 ± 0.78
4.78AsnAsn: 4.78 ± 1.27
3.27AsnPro: 3.27 ± 0.732
1.258AsnGln: 1.258 ± 0.59
2.516AsnArg: 2.516 ± 0.814
4.025AsnSer: 4.025 ± 1.049
3.27AsnThr: 3.27 ± 0.932
2.516AsnVal: 2.516 ± 0.746
0.755AsnTrp: 0.755 ± 0.383
4.025AsnTyr: 4.025 ± 0.879
0.0AsnXaa: 0.0 ± 0.0
Pro
1.761ProAla: 1.761 ± 0.763
0.0ProCys: 0.0 ± 0.0
2.264ProAsp: 2.264 ± 0.698
3.019ProGlu: 3.019 ± 0.643
1.509ProPhe: 1.509 ± 0.725
0.0ProGly: 0.0 ± 0.0
0.252ProHis: 0.252 ± 0.234
0.755ProIle: 0.755 ± 0.427
3.019ProLys: 3.019 ± 0.711
2.013ProLeu: 2.013 ± 0.729
0.503ProMet: 0.503 ± 0.307
2.013ProAsn: 2.013 ± 0.658
0.755ProPro: 0.755 ± 0.436
0.503ProGln: 0.503 ± 0.285
1.509ProArg: 1.509 ± 0.778
2.264ProSer: 2.264 ± 0.692
3.522ProThr: 3.522 ± 0.829
2.516ProVal: 2.516 ± 0.735
0.0ProTrp: 0.0 ± 0.0
2.013ProTyr: 2.013 ± 0.671
0.0ProXaa: 0.0 ± 0.0
Gln
2.013GlnAla: 2.013 ± 0.818
0.503GlnCys: 0.503 ± 0.374
3.522GlnAsp: 3.522 ± 0.91
1.006GlnGlu: 1.006 ± 0.569
0.755GlnPhe: 0.755 ± 0.378
1.761GlnGly: 1.761 ± 0.52
0.0GlnHis: 0.0 ± 0.0
3.019GlnIle: 3.019 ± 0.816
4.277GlnLys: 4.277 ± 1.204
5.535GlnLeu: 5.535 ± 1.064
0.252GlnMet: 0.252 ± 0.272
1.761GlnAsn: 1.761 ± 0.655
1.761GlnPro: 1.761 ± 0.746
2.264GlnGln: 2.264 ± 0.941
1.006GlnArg: 1.006 ± 0.759
1.258GlnSer: 1.258 ± 0.43
2.013GlnThr: 2.013 ± 0.938
2.264GlnVal: 2.264 ± 0.64
0.0GlnTrp: 0.0 ± 0.0
2.516GlnTyr: 2.516 ± 0.918
0.0GlnXaa: 0.0 ± 0.0
Arg
0.503ArgAla: 0.503 ± 0.286
0.252ArgCys: 0.252 ± 0.229
2.516ArgAsp: 2.516 ± 0.894
2.767ArgGlu: 2.767 ± 1.164
1.006ArgPhe: 1.006 ± 0.474
1.258ArgGly: 1.258 ± 0.417
0.252ArgHis: 0.252 ± 0.23
2.264ArgIle: 2.264 ± 0.885
3.27ArgLys: 3.27 ± 0.866
2.767ArgLeu: 2.767 ± 0.815
0.503ArgMet: 0.503 ± 0.391
2.013ArgAsn: 2.013 ± 0.723
1.006ArgPro: 1.006 ± 0.314
2.264ArgGln: 2.264 ± 0.862
0.503ArgArg: 0.503 ± 0.335
1.509ArgSer: 1.509 ± 0.565
1.761ArgThr: 1.761 ± 0.543
1.258ArgVal: 1.258 ± 0.525
0.503ArgTrp: 0.503 ± 0.384
1.258ArgTyr: 1.258 ± 0.5
0.0ArgXaa: 0.0 ± 0.0
Ser
5.031SerAla: 5.031 ± 1.06
0.755SerCys: 0.755 ± 0.405
5.031SerAsp: 5.031 ± 0.885
4.78SerGlu: 4.78 ± 0.907
1.509SerPhe: 1.509 ± 0.616
2.264SerGly: 2.264 ± 0.721
0.755SerHis: 0.755 ± 0.418
4.277SerIle: 4.277 ± 0.717
6.792SerLys: 6.792 ± 1.155
4.277SerLeu: 4.277 ± 0.948
2.516SerMet: 2.516 ± 1.265
4.025SerAsn: 4.025 ± 1.001
2.013SerPro: 2.013 ± 0.567
3.522SerGln: 3.522 ± 0.813
1.509SerArg: 1.509 ± 0.516
11.069SerSer: 11.069 ± 4.063
4.025SerThr: 4.025 ± 0.748
5.535SerVal: 5.535 ± 1.115
0.755SerTrp: 0.755 ± 0.351
3.522SerTyr: 3.522 ± 0.993
0.0SerXaa: 0.0 ± 0.0
Thr
2.767ThrAla: 2.767 ± 0.802
0.0ThrCys: 0.0 ± 0.0
3.522ThrAsp: 3.522 ± 0.92
5.535ThrGlu: 5.535 ± 0.937
3.522ThrPhe: 3.522 ± 0.736
2.767ThrGly: 2.767 ± 0.737
0.503ThrHis: 0.503 ± 0.326
5.031ThrIle: 5.031 ± 1.034
5.535ThrLys: 5.535 ± 1.353
5.283ThrLeu: 5.283 ± 1.326
1.761ThrMet: 1.761 ± 0.76
3.774ThrAsn: 3.774 ± 1.413
3.019ThrPro: 3.019 ± 0.931
3.019ThrGln: 3.019 ± 0.914
2.013ThrArg: 2.013 ± 0.853
3.522ThrSer: 3.522 ± 1.354
4.528ThrThr: 4.528 ± 0.964
4.277ThrVal: 4.277 ± 1.079
0.755ThrTrp: 0.755 ± 0.338
1.006ThrTyr: 1.006 ± 0.396
0.0ThrXaa: 0.0 ± 0.0
Val
3.27ValAla: 3.27 ± 0.662
0.503ValCys: 0.503 ± 0.319
2.767ValAsp: 2.767 ± 0.922
4.025ValGlu: 4.025 ± 1.7
2.264ValPhe: 2.264 ± 0.736
2.264ValGly: 2.264 ± 0.51
0.503ValHis: 0.503 ± 0.3
3.019ValIle: 3.019 ± 1.057
4.277ValLys: 4.277 ± 1.253
6.038ValLeu: 6.038 ± 1.14
1.509ValMet: 1.509 ± 0.472
3.522ValAsn: 3.522 ± 0.953
3.019ValPro: 3.019 ± 0.827
2.264ValGln: 2.264 ± 0.976
1.509ValArg: 1.509 ± 0.642
6.038ValSer: 6.038 ± 1.135
3.27ValThr: 3.27 ± 0.661
2.516ValVal: 2.516 ± 0.99
0.0ValTrp: 0.0 ± 0.0
3.522ValTyr: 3.522 ± 0.867
0.0ValXaa: 0.0 ± 0.0
Trp
0.252TrpAla: 0.252 ± 0.221
0.0TrpCys: 0.0 ± 0.0
0.503TrpAsp: 0.503 ± 0.261
0.503TrpGlu: 0.503 ± 0.363
0.503TrpPhe: 0.503 ± 0.396
0.503TrpGly: 0.503 ± 0.296
0.0TrpHis: 0.0 ± 0.0
0.503TrpIle: 0.503 ± 0.38
0.252TrpLys: 0.252 ± 0.19
0.755TrpLeu: 0.755 ± 0.369
0.0TrpMet: 0.0 ± 0.0
0.755TrpAsn: 0.755 ± 0.357
0.252TrpPro: 0.252 ± 0.231
0.252TrpGln: 0.252 ± 0.19
0.252TrpArg: 0.252 ± 0.19
1.258TrpSer: 1.258 ± 0.68
0.0TrpThr: 0.0 ± 0.0
1.509TrpVal: 1.509 ± 0.606
0.252TrpTrp: 0.252 ± 0.231
0.503TrpTyr: 0.503 ± 0.286
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.013TyrAla: 2.013 ± 1.465
0.503TyrCys: 0.503 ± 0.3
3.019TyrAsp: 3.019 ± 1.182
3.019TyrGlu: 3.019 ± 1.051
3.27TyrPhe: 3.27 ± 1.036
2.013TyrGly: 2.013 ± 0.587
0.252TyrHis: 0.252 ± 0.249
2.013TyrIle: 2.013 ± 0.702
5.786TyrLys: 5.786 ± 1.336
4.78TyrLeu: 4.78 ± 0.894
1.258TyrMet: 1.258 ± 0.586
4.025TyrAsn: 4.025 ± 1.042
1.509TyrPro: 1.509 ± 0.59
1.006TyrGln: 1.006 ± 0.453
2.264TyrArg: 2.264 ± 0.789
3.522TyrSer: 3.522 ± 0.769
1.006TyrThr: 1.006 ± 0.64
2.264TyrVal: 2.264 ± 0.759
0.503TyrTrp: 0.503 ± 0.312
1.509TyrTyr: 1.509 ± 0.671
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (3976 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski