Amino acid dipeptide frequency for Streptococcus satellite phage Javan2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred in proteins at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the table marks dipeptides that occur more often than expected in red and those that are underrepresented in blue.
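
The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build this table: the function names, the one-letter alphabet, the mapping of any non-standard letter to X (for Xaa), and the choice to normalize by the total number of overlapping dipeptides are all assumptions, and the database may normalize differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands in for Xaa (assumption)
EXPECTED_PERMILLE = 1000.0 / 400    # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def representation(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Toy input (hypothetical sequences, not taken from this proteome).
toy_proteins = ["MKLLE", "ALE"]
freqs = dipeptide_permille(toy_proteins)
print(len(freqs), round(freqs["LE"], 3), representation(freqs["LE"]))
```

In this sketch "over" and "under" correspond to the red and blue marking, using 2.5‰ as the threshold.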

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
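
As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the input value is the GluLeu entry from the table.

```python
def permille_to_fraction(value_permille):
    """Per mille -> fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (1% equals 10 per mille)."""
    return value_permille / 10.0


glu_leu = 13.423  # GluLeu frequency from the table, in per mille
print(permille_to_fraction(glu_leu))   # roughly 0.0134
print(permille_to_percent(glu_leu))    # roughly 1.34 (percent)

# The uniform expectation of 1/400 expressed in per mille.
uniform_permille = 1000.0 / 400
print(uniform_permille)
```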

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.479 ± 0.647
AlaCys: 0.0 ± 0.0
AlaAsp: 1.438 ± 0.842
AlaGlu: 3.835 ± 1.129
AlaPhe: 1.438 ± 0.88
AlaGly: 1.918 ± 1.021
AlaHis: 1.438 ± 0.907
AlaIle: 7.67 ± 2.195
AlaLys: 2.876 ± 1.375
AlaLeu: 8.629 ± 1.5
AlaMet: 2.397 ± 1.476
AlaAsn: 3.835 ± 1.2
AlaPro: 2.397 ± 0.787
AlaGln: 2.397 ± 1.003
AlaArg: 2.876 ± 1.102
AlaSer: 2.876 ± 1.133
AlaThr: 1.918 ± 1.244
AlaVal: 2.876 ± 1.044
AlaTrp: 1.438 ± 0.782
AlaTyr: 1.438 ± 1.021
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.479 ± 0.37
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.438 ± 0.788
CysLys: 0.479 ± 0.412
CysLeu: 0.479 ± 0.37
CysMet: 0.0 ± 0.0
CysAsn: 0.479 ± 0.581
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.479 ± 0.412
CysSer: 0.959 ± 0.793
CysThr: 0.959 ± 0.729
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.959 ± 0.594
AspCys: 0.0 ± 0.0
AspAsp: 1.438 ± 1.026
AspGlu: 3.356 ± 1.571
AspPhe: 2.876 ± 0.94
AspGly: 2.397 ± 0.918
AspHis: 0.479 ± 0.37
AspIle: 7.191 ± 1.459
AspLys: 4.314 ± 1.422
AspLeu: 3.356 ± 1.316
AspMet: 1.438 ± 0.799
AspAsn: 3.356 ± 1.439
AspPro: 0.479 ± 0.37
AspGln: 0.959 ± 0.505
AspArg: 3.835 ± 1.061
AspSer: 2.876 ± 1.007
AspThr: 3.835 ± 1.824
AspVal: 1.438 ± 1.014
AspTrp: 0.959 ± 0.884
AspTyr: 2.397 ± 1.247
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.794 ± 1.477
GluCys: 0.959 ± 0.605
GluAsp: 3.835 ± 1.158
GluGlu: 6.711 ± 1.726
GluPhe: 2.397 ± 0.789
GluGly: 1.438 ± 0.946
GluHis: 1.918 ± 1.313
GluIle: 5.753 ± 2.18
GluLys: 5.273 ± 1.977
GluLeu: 13.423 ± 1.523
GluMet: 1.438 ± 0.838
GluAsn: 2.876 ± 0.973
GluPro: 2.397 ± 1.154
GluGln: 7.191 ± 1.912
GluArg: 7.191 ± 2.032
GluSer: 2.876 ± 0.982
GluThr: 6.232 ± 1.903
GluVal: 5.753 ± 1.881
GluTrp: 0.479 ± 0.412
GluTyr: 2.397 ± 0.707
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.438 ± 0.713
PheCys: 0.0 ± 0.0
PheAsp: 1.918 ± 0.728
PheGlu: 3.835 ± 1.571
PhePhe: 0.479 ± 0.497
PheGly: 2.397 ± 1.347
PheHis: 1.438 ± 0.505
PheIle: 1.918 ± 0.813
PheLys: 1.918 ± 0.902
PheLeu: 4.314 ± 1.805
PheMet: 0.0 ± 0.0
PheAsn: 1.438 ± 0.636
PhePro: 2.397 ± 1.773
PheGln: 0.479 ± 0.473
PheArg: 3.356 ± 1.143
PheSer: 1.918 ± 1.197
PheThr: 1.438 ± 0.505
PheVal: 0.479 ± 0.412
PheTrp: 0.0 ± 0.0
PheTyr: 1.918 ± 1.041
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.876 ± 1.252
GlyCys: 0.479 ± 0.412
GlyAsp: 0.959 ± 0.884
GlyGlu: 2.876 ± 1.218
GlyPhe: 1.438 ± 0.825
GlyGly: 3.356 ± 1.197
GlyHis: 0.959 ± 0.591
GlyIle: 5.753 ± 1.425
GlyLys: 4.794 ± 2.017
GlyLeu: 5.753 ± 1.66
GlyMet: 1.918 ± 1.168
GlyAsn: 2.397 ± 1.216
GlyPro: 0.0 ± 0.0
GlyGln: 1.918 ± 0.978
GlyArg: 3.356 ± 1.368
GlySer: 1.918 ± 0.958
GlyThr: 3.356 ± 1.214
GlyVal: 1.918 ± 1.061
GlyTrp: 0.959 ± 0.569
GlyTyr: 6.232 ± 1.717
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.835 ± 1.52
HisCys: 0.0 ± 0.0
HisAsp: 1.438 ± 0.758
HisGlu: 2.876 ± 1.036
HisPhe: 0.0 ± 0.0
HisGly: 1.438 ± 0.979
HisHis: 0.0 ± 0.0
HisIle: 0.479 ± 0.412
HisLys: 1.438 ± 0.842
HisLeu: 3.356 ± 1.213
HisMet: 0.0 ± 0.0
HisAsn: 0.959 ± 0.594
HisPro: 0.479 ± 0.607
HisGln: 0.0 ± 0.0
HisArg: 0.479 ± 0.37
HisSer: 1.918 ± 0.659
HisThr: 1.918 ± 0.592
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.479 ± 0.58
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.356 ± 0.895
IleCys: 0.0 ± 0.0
IleAsp: 4.794 ± 1.684
IleGlu: 7.67 ± 2.58
IlePhe: 4.314 ± 0.933
IleGly: 5.273 ± 1.805
IleHis: 1.438 ± 0.505
IleIle: 2.876 ± 1.149
IleLys: 5.273 ± 1.14
IleLeu: 7.191 ± 1.539
IleMet: 1.438 ± 1.21
IleAsn: 3.835 ± 0.976
IlePro: 2.876 ± 1.176
IleGln: 2.876 ± 0.849
IleArg: 3.356 ± 1.015
IleSer: 7.191 ± 1.528
IleThr: 5.273 ± 1.686
IleVal: 3.356 ± 1.343
IleTrp: 0.0 ± 0.0
IleTyr: 2.397 ± 1.016
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.711 ± 1.642
LysCys: 0.0 ± 0.0
LysAsp: 4.794 ± 1.401
LysGlu: 11.026 ± 2.902
LysPhe: 0.959 ± 0.505
LysGly: 5.273 ± 1.774
LysHis: 4.314 ± 1.884
LysIle: 5.273 ± 1.246
LysLys: 9.588 ± 2.482
LysLeu: 8.629 ± 1.348
LysMet: 0.959 ± 0.54
LysAsn: 1.918 ± 0.827
LysPro: 1.918 ± 1.231
LysGln: 3.356 ± 0.915
LysArg: 3.835 ± 1.61
LysSer: 3.356 ± 1.457
LysThr: 4.314 ± 1.109
LysVal: 3.835 ± 0.886
LysTrp: 0.959 ± 0.633
LysTyr: 4.314 ± 1.04
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.711 ± 2.139
LeuCys: 0.479 ± 0.492
LeuAsp: 9.588 ± 2.141
LeuGlu: 15.34 ± 2.1
LeuPhe: 3.356 ± 0.659
LeuGly: 4.314 ± 1.413
LeuHis: 2.876 ± 1.118
LeuIle: 7.191 ± 1.627
LeuLys: 10.067 ± 2.648
LeuLeu: 9.588 ± 2.078
LeuMet: 3.356 ± 0.946
LeuAsn: 5.753 ± 1.367
LeuPro: 4.314 ± 1.133
LeuGln: 5.273 ± 1.679
LeuArg: 3.835 ± 1.641
LeuSer: 9.108 ± 1.184
LeuThr: 4.314 ± 1.733
LeuVal: 5.753 ± 1.444
LeuTrp: 0.959 ± 0.585
LeuTyr: 1.918 ± 0.814
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.356 ± 1.146
MetCys: 0.0 ± 0.0
MetAsp: 0.959 ± 0.632
MetGlu: 0.959 ± 0.585
MetPhe: 0.479 ± 0.586
MetGly: 1.918 ± 0.883
MetHis: 0.0 ± 0.0
MetIle: 1.438 ± 0.804
MetLys: 1.438 ± 0.853
MetLeu: 2.397 ± 1.193
MetMet: 0.479 ± 0.447
MetAsn: 1.438 ± 0.888
MetPro: 0.0 ± 0.0
MetGln: 1.438 ± 0.659
MetArg: 0.479 ± 0.442
MetSer: 0.479 ± 0.58
MetThr: 3.356 ± 1.005
MetVal: 0.479 ± 0.497
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.918 ± 0.871
AsnCys: 1.438 ± 0.96
AsnAsp: 0.959 ± 0.594
AsnGlu: 2.876 ± 0.737
AsnPhe: 1.918 ± 0.697
AsnGly: 3.356 ± 0.969
AsnHis: 0.479 ± 0.447
AsnIle: 4.794 ± 1.381
AsnLys: 4.314 ± 1.363
AsnLeu: 5.273 ± 1.52
AsnMet: 1.438 ± 0.697
AsnAsn: 0.479 ± 0.442
AsnPro: 2.876 ± 0.658
AsnGln: 0.959 ± 0.481
AsnArg: 3.356 ± 1.316
AsnSer: 3.356 ± 1.149
AsnThr: 2.876 ± 1.223
AsnVal: 2.397 ± 0.977
AsnTrp: 0.959 ± 0.521
AsnTyr: 3.356 ± 0.961
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.918 ± 1.339
ProCys: 0.0 ± 0.0
ProAsp: 1.438 ± 1.0
ProGlu: 1.438 ± 0.571
ProPhe: 0.479 ± 0.581
ProGly: 1.918 ± 0.956
ProHis: 0.0 ± 0.0
ProIle: 2.876 ± 1.29
ProLys: 4.794 ± 1.379
ProLeu: 1.918 ± 1.162
ProMet: 0.479 ± 0.486
ProAsn: 0.479 ± 0.37
ProPro: 0.959 ± 0.633
ProGln: 1.438 ± 0.505
ProArg: 1.438 ± 0.885
ProSer: 2.397 ± 1.211
ProThr: 3.835 ± 1.153
ProVal: 0.0 ± 0.0
ProTrp: 0.0 ± 0.0
ProTyr: 1.438 ± 1.032
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.835 ± 1.384
GlnCys: 0.959 ± 0.658
GlnAsp: 1.918 ± 0.743
GlnGlu: 4.794 ± 1.123
GlnPhe: 0.479 ± 0.37
GlnGly: 3.835 ± 1.169
GlnHis: 0.959 ± 0.741
GlnIle: 1.438 ± 0.894
GlnLys: 4.794 ± 1.374
GlnLeu: 6.232 ± 1.124
GlnMet: 0.0 ± 0.0
GlnAsn: 3.356 ± 1.104
GlnPro: 0.479 ± 0.58
GlnGln: 3.835 ± 1.198
GlnArg: 3.356 ± 1.196
GlnSer: 0.959 ± 0.609
GlnThr: 0.959 ± 0.521
GlnVal: 3.356 ± 1.408
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.876 ± 1.538
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.876 ± 1.629
ArgCys: 0.479 ± 0.412
ArgAsp: 1.438 ± 0.723
ArgGlu: 6.232 ± 2.598
ArgPhe: 1.918 ± 0.891
ArgGly: 2.876 ± 1.107
ArgHis: 0.959 ± 0.823
ArgIle: 3.835 ± 1.328
ArgLys: 3.835 ± 1.337
ArgLeu: 8.15 ± 1.994
ArgMet: 0.479 ± 0.686
ArgAsn: 3.356 ± 1.965
ArgPro: 3.356 ± 1.49
ArgGln: 4.314 ± 1.658
ArgArg: 3.356 ± 0.961
ArgSer: 0.959 ± 0.778
ArgThr: 4.794 ± 1.479
ArgVal: 1.918 ± 0.942
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.876 ± 0.88
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.918 ± 0.953
SerCys: 0.479 ± 0.442
SerAsp: 4.794 ± 1.449
SerGlu: 2.876 ± 1.455
SerPhe: 1.918 ± 0.995
SerGly: 3.356 ± 0.835
SerHis: 1.438 ± 0.625
SerIle: 4.314 ± 1.567
SerLys: 6.232 ± 1.123
SerLeu: 8.15 ± 2.012
SerMet: 1.438 ± 0.972
SerAsn: 2.397 ± 1.084
SerPro: 0.959 ± 0.505
SerGln: 3.835 ± 1.48
SerArg: 2.397 ± 1.205
SerSer: 2.397 ± 1.34
SerThr: 2.876 ± 1.059
SerVal: 1.918 ± 1.209
SerTrp: 0.959 ± 0.706
SerTyr: 3.356 ± 2.036
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.876 ± 1.188
ThrCys: 0.0 ± 0.0
ThrAsp: 1.438 ± 1.111
ThrGlu: 4.314 ± 1.271
ThrPhe: 3.835 ± 1.585
ThrGly: 4.794 ± 1.295
ThrHis: 1.438 ± 0.505
ThrIle: 5.753 ± 1.254
ThrLys: 2.876 ± 1.2
ThrLeu: 7.67 ± 1.97
ThrMet: 1.438 ± 0.748
ThrAsn: 3.356 ± 1.724
ThrPro: 1.438 ± 0.816
ThrGln: 1.918 ± 1.081
ThrArg: 3.835 ± 1.212
ThrSer: 2.397 ± 0.935
ThrThr: 1.918 ± 0.853
ThrVal: 6.232 ± 1.843
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.835 ± 1.133
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.356 ± 1.213
ValCys: 0.0 ± 0.0
ValAsp: 1.918 ± 1.142
ValGlu: 1.918 ± 0.995
ValPhe: 1.438 ± 0.864
ValGly: 0.959 ± 0.614
ValHis: 0.479 ± 0.442
ValIle: 2.397 ± 1.221
ValLys: 5.273 ± 1.96
ValLeu: 4.314 ± 1.175
ValMet: 1.438 ± 0.749
ValAsn: 3.835 ± 1.099
ValPro: 0.959 ± 0.54
ValGln: 0.959 ± 0.719
ValArg: 1.918 ± 1.029
ValSer: 4.314 ± 1.275
ValThr: 4.794 ± 1.396
ValVal: 1.918 ± 0.79
ValTrp: 0.959 ± 0.625
ValTyr: 1.438 ± 1.235
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 1.438 ± 0.892
TrpGlu: 0.959 ± 0.665
TrpPhe: 0.0 ± 0.0
TrpGly: 0.479 ± 0.442
TrpHis: 0.0 ± 0.0
TrpIle: 0.479 ± 0.497
TrpLys: 0.959 ± 0.766
TrpLeu: 1.438 ± 0.571
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.959 ± 0.741
TrpArg: 0.959 ± 0.56
TrpSer: 0.959 ± 0.481
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.479 ± 0.37
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.959 ± 0.557
TyrCys: 0.0 ± 0.0
TyrAsp: 1.438 ± 1.02
TyrGlu: 1.438 ± 0.779
TyrPhe: 3.356 ± 1.634
TyrGly: 1.918 ± 0.867
TyrHis: 0.479 ± 0.412
TyrIle: 1.918 ± 0.751
TyrLys: 4.794 ± 1.548
TyrLeu: 3.835 ± 1.013
TyrMet: 0.479 ± 0.412
TyrAsn: 3.835 ± 1.461
TyrPro: 0.959 ± 0.741
TyrGln: 4.314 ± 1.356
TyrArg: 4.794 ± 1.224
TyrSer: 4.794 ± 1.291
TyrThr: 2.397 ± 1.855
TyrVal: 0.959 ± 0.481
TyrTrp: 0.479 ± 0.37
TyrTyr: 2.876 ± 0.897
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (2087 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
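
A protein-level bootstrap of this kind can be sketched as follows. The page does not spell out the exact procedure, so the details here are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the error is taken as the standard deviation across replicates. The function names and toy sequences are hypothetical.

```python
import random
import statistics


def count_dipeptide(seq, dipeptide):
    """Count overlapping occurrences of a two-letter dipeptide in one protein."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency over replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        total = sum(max(len(s) - 1, 0) for s in sample)
        hits = sum(count_dipeptide(s, dipeptide) for s in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy_proteins = ["MKLLE", "ALE", "GGGG"]
print(bootstrap_error(toy_proteins, "LE"))
```

With only 15 proteins in this proteome, resampling at the protein level can move rare dipeptides in or out of a replicate entirely, which would be consistent with errors that are large relative to the small frequencies in the table.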

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski