Amino acid dipepetide frequency for Streptococcus satellite phage Javan387

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
1.557AlaCys: 1.557 ± 0.553
2.802AlaAsp: 2.802 ± 0.896
4.047AlaGlu: 4.047 ± 1.045
1.557AlaPhe: 1.557 ± 0.639
2.802AlaGly: 2.802 ± 0.891
0.311AlaHis: 0.311 ± 0.365
7.472AlaIle: 7.472 ± 1.249
2.802AlaLys: 2.802 ± 0.711
5.915AlaLeu: 5.915 ± 1.112
2.179AlaMet: 2.179 ± 0.485
1.868AlaAsn: 1.868 ± 0.814
2.179AlaPro: 2.179 ± 0.996
2.179AlaGln: 2.179 ± 0.79
1.557AlaArg: 1.557 ± 0.473
0.934AlaSer: 0.934 ± 0.367
3.113AlaThr: 3.113 ± 1.034
1.557AlaVal: 1.557 ± 0.547
0.623AlaTrp: 0.623 ± 0.38
3.113AlaTyr: 3.113 ± 0.752
0.0AlaXaa: 0.0 ± 0.0
Cys
1.557CysAla: 1.557 ± 0.702
0.0CysCys: 0.0 ± 0.0
0.934CysAsp: 0.934 ± 0.509
0.311CysGlu: 0.311 ± 0.246
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.623CysIle: 0.623 ± 0.363
0.0CysLys: 0.0 ± 0.0
0.934CysLeu: 0.934 ± 0.473
0.311CysMet: 0.311 ± 0.271
0.0CysAsn: 0.0 ± 0.0
0.311CysPro: 0.311 ± 0.255
0.311CysGln: 0.311 ± 0.246
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.311CysVal: 0.311 ± 0.246
0.0CysTrp: 0.0 ± 0.0
0.311CysTyr: 0.311 ± 0.28
0.0CysXaa: 0.0 ± 0.0
Asp
0.623AspAla: 0.623 ± 0.394
0.934AspCys: 0.934 ± 0.854
6.227AspAsp: 6.227 ± 2.149
3.736AspGlu: 3.736 ± 1.207
3.425AspPhe: 3.425 ± 1.135
2.491AspGly: 2.491 ± 0.605
0.623AspHis: 0.623 ± 0.343
8.406AspIle: 8.406 ± 0.977
7.161AspLys: 7.161 ± 1.486
8.095AspLeu: 8.095 ± 1.39
1.868AspMet: 1.868 ± 0.594
4.047AspAsn: 4.047 ± 0.982
1.245AspPro: 1.245 ± 0.661
1.245AspGln: 1.245 ± 0.538
1.245AspArg: 1.245 ± 0.51
4.047AspSer: 4.047 ± 1.816
4.67AspThr: 4.67 ± 1.42
0.934AspVal: 0.934 ± 0.367
0.0AspTrp: 0.0 ± 0.0
6.849AspTyr: 6.849 ± 1.427
0.0AspXaa: 0.0 ± 0.0
Glu
6.849GluAla: 6.849 ± 1.69
0.623GluCys: 0.623 ± 0.426
4.047GluAsp: 4.047 ± 1.383
5.293GluGlu: 5.293 ± 2.106
2.491GluPhe: 2.491 ± 0.854
1.245GluGly: 1.245 ± 0.674
2.179GluHis: 2.179 ± 0.961
4.67GluIle: 4.67 ± 1.417
6.849GluLys: 6.849 ± 2.024
10.897GluLeu: 10.897 ± 1.434
2.491GluMet: 2.491 ± 0.671
4.359GluAsn: 4.359 ± 1.073
3.736GluPro: 3.736 ± 1.037
3.736GluGln: 3.736 ± 1.15
3.736GluArg: 3.736 ± 0.975
3.425GluSer: 3.425 ± 1.035
4.047GluThr: 4.047 ± 0.778
4.67GluVal: 4.67 ± 1.472
0.0GluTrp: 0.0 ± 0.0
5.293GluTyr: 5.293 ± 1.288
0.0GluXaa: 0.0 ± 0.0
Phe
0.623PheAla: 0.623 ± 0.538
0.0PheCys: 0.0 ± 0.0
1.868PheAsp: 1.868 ± 0.637
4.047PheGlu: 4.047 ± 1.336
1.245PhePhe: 1.245 ± 0.599
1.245PheGly: 1.245 ± 0.503
0.623PheHis: 0.623 ± 0.45
3.425PheIle: 3.425 ± 1.214
5.915PheLys: 5.915 ± 1.449
1.868PheLeu: 1.868 ± 0.825
1.245PheMet: 1.245 ± 0.519
2.802PheAsn: 2.802 ± 1.005
0.311PhePro: 0.311 ± 0.335
1.245PheGln: 1.245 ± 0.533
1.557PheArg: 1.557 ± 0.854
1.868PheSer: 1.868 ± 0.583
3.736PheThr: 3.736 ± 1.145
0.311PheVal: 0.311 ± 0.335
0.311PheTrp: 0.311 ± 0.246
1.868PheTyr: 1.868 ± 0.866
0.0PheXaa: 0.0 ± 0.0
Gly
2.179GlyAla: 2.179 ± 0.855
0.0GlyCys: 0.0 ± 0.0
3.113GlyAsp: 3.113 ± 0.812
0.934GlyGlu: 0.934 ± 0.506
2.491GlyPhe: 2.491 ± 0.887
1.245GlyGly: 1.245 ± 0.419
0.623GlyHis: 0.623 ± 0.43
3.113GlyIle: 3.113 ± 0.671
4.67GlyLys: 4.67 ± 0.932
4.359GlyLeu: 4.359 ± 1.14
1.245GlyMet: 1.245 ± 0.568
3.113GlyAsn: 3.113 ± 0.702
0.0GlyPro: 0.0 ± 0.0
1.557GlyGln: 1.557 ± 0.689
1.868GlyArg: 1.868 ± 0.722
3.425GlySer: 3.425 ± 1.302
2.491GlyThr: 2.491 ± 0.823
1.245GlyVal: 1.245 ± 0.665
0.311GlyTrp: 0.311 ± 0.246
2.491GlyTyr: 2.491 ± 0.776
0.0GlyXaa: 0.0 ± 0.0
His
2.179HisAla: 2.179 ± 0.812
0.0HisCys: 0.0 ± 0.0
0.934HisAsp: 0.934 ± 0.549
1.245HisGlu: 1.245 ± 0.62
0.623HisPhe: 0.623 ± 0.452
0.311HisGly: 0.311 ± 0.255
0.623HisHis: 0.623 ± 0.359
1.557HisIle: 1.557 ± 0.401
0.934HisLys: 0.934 ± 0.452
2.179HisLeu: 2.179 ± 1.133
0.311HisMet: 0.311 ± 0.343
0.934HisAsn: 0.934 ± 0.656
0.0HisPro: 0.0 ± 0.0
0.311HisGln: 0.311 ± 0.352
0.311HisArg: 0.311 ± 0.246
1.245HisSer: 1.245 ± 0.666
1.557HisThr: 1.557 ± 0.637
0.311HisVal: 0.311 ± 0.246
0.0HisTrp: 0.0 ± 0.0
0.934HisTyr: 0.934 ± 0.408
0.0HisXaa: 0.0 ± 0.0
Ile
4.359IleAla: 4.359 ± 1.288
0.311IleCys: 0.311 ± 0.246
7.161IleAsp: 7.161 ± 1.211
8.095IleGlu: 8.095 ± 1.145
1.557IlePhe: 1.557 ± 0.558
3.425IleGly: 3.425 ± 1.435
0.934IleHis: 0.934 ± 0.433
6.227IleIle: 6.227 ± 1.321
11.519IleLys: 11.519 ± 2.044
4.047IleLeu: 4.047 ± 1.103
2.491IleMet: 2.491 ± 0.844
6.849IleAsn: 6.849 ± 1.712
2.802IlePro: 2.802 ± 1.009
3.736IleGln: 3.736 ± 0.845
2.802IleArg: 2.802 ± 0.721
6.227IleSer: 6.227 ± 1.418
4.67IleThr: 4.67 ± 0.721
3.113IleVal: 3.113 ± 0.927
0.311IleTrp: 0.311 ± 0.246
3.113IleTyr: 3.113 ± 0.879
0.0IleXaa: 0.0 ± 0.0
Lys
6.227LysAla: 6.227 ± 1.469
0.0LysCys: 0.0 ± 0.0
6.538LysAsp: 6.538 ± 1.626
13.387LysGlu: 13.387 ± 2.552
1.245LysPhe: 1.245 ± 0.55
3.113LysGly: 3.113 ± 1.358
2.802LysHis: 2.802 ± 0.928
7.161LysIle: 7.161 ± 2.111
11.208LysLys: 11.208 ± 3.228
7.783LysLeu: 7.783 ± 1.802
3.425LysMet: 3.425 ± 1.15
5.293LysAsn: 5.293 ± 1.217
3.113LysPro: 3.113 ± 1.329
2.802LysGln: 2.802 ± 0.791
6.227LysArg: 6.227 ± 1.605
6.227LysSer: 6.227 ± 1.143
6.227LysThr: 6.227 ± 1.39
3.736LysVal: 3.736 ± 0.934
0.623LysTrp: 0.623 ± 0.375
4.047LysTyr: 4.047 ± 0.895
0.0LysXaa: 0.0 ± 0.0
Leu
4.981LeuAla: 4.981 ± 1.305
0.0LeuCys: 0.0 ± 0.0
8.717LeuAsp: 8.717 ± 2.029
8.095LeuGlu: 8.095 ± 1.434
1.868LeuPhe: 1.868 ± 0.845
4.981LeuGly: 4.981 ± 1.197
1.245LeuHis: 1.245 ± 0.647
8.095LeuIle: 8.095 ± 1.339
7.472LeuLys: 7.472 ± 1.856
13.387LeuLeu: 13.387 ± 2.351
2.491LeuMet: 2.491 ± 0.779
8.717LeuAsn: 8.717 ± 1.624
4.67LeuPro: 4.67 ± 1.195
4.047LeuGln: 4.047 ± 0.669
3.425LeuArg: 3.425 ± 0.928
5.604LeuSer: 5.604 ± 1.022
8.095LeuThr: 8.095 ± 1.589
3.736LeuVal: 3.736 ± 0.843
0.311LeuTrp: 0.311 ± 0.285
3.113LeuTyr: 3.113 ± 0.937
0.0LeuXaa: 0.0 ± 0.0
Met
2.802MetAla: 2.802 ± 0.86
0.311MetCys: 0.311 ± 0.326
1.557MetAsp: 1.557 ± 0.545
1.868MetGlu: 1.868 ± 0.74
0.623MetPhe: 0.623 ± 0.329
0.623MetGly: 0.623 ± 0.439
0.0MetHis: 0.0 ± 0.0
1.557MetIle: 1.557 ± 0.588
4.359MetLys: 4.359 ± 1.165
1.868MetLeu: 1.868 ± 0.887
0.0MetMet: 0.0 ± 0.0
2.491MetAsn: 2.491 ± 0.778
0.0MetPro: 0.0 ± 0.0
0.623MetGln: 0.623 ± 0.421
1.868MetArg: 1.868 ± 0.66
0.934MetSer: 0.934 ± 0.518
1.557MetThr: 1.557 ± 0.496
1.245MetVal: 1.245 ± 0.634
0.0MetTrp: 0.0 ± 0.0
0.934MetTyr: 0.934 ± 0.508
0.0MetXaa: 0.0 ± 0.0
Asn
4.359AsnAla: 4.359 ± 0.901
1.557AsnCys: 1.557 ± 0.515
4.67AsnAsp: 4.67 ± 1.16
5.915AsnGlu: 5.915 ± 0.958
2.802AsnPhe: 2.802 ± 0.801
2.802AsnGly: 2.802 ± 0.858
1.868AsnHis: 1.868 ± 0.61
3.425AsnIle: 3.425 ± 0.994
5.915AsnLys: 5.915 ± 1.28
5.915AsnLeu: 5.915 ± 1.165
0.623AsnMet: 0.623 ± 0.494
3.113AsnAsn: 3.113 ± 0.874
2.802AsnPro: 2.802 ± 0.893
2.491AsnGln: 2.491 ± 1.012
1.868AsnArg: 1.868 ± 0.624
4.67AsnSer: 4.67 ± 0.843
2.491AsnThr: 2.491 ± 0.753
3.113AsnVal: 3.113 ± 0.97
0.311AsnTrp: 0.311 ± 0.285
4.981AsnTyr: 4.981 ± 1.412
0.0AsnXaa: 0.0 ± 0.0
Pro
0.934ProAla: 0.934 ± 0.368
0.0ProCys: 0.0 ± 0.0
2.802ProAsp: 2.802 ± 0.834
2.179ProGlu: 2.179 ± 0.725
1.245ProPhe: 1.245 ± 0.537
0.311ProGly: 0.311 ± 0.297
0.623ProHis: 0.623 ± 0.359
0.934ProIle: 0.934 ± 0.522
4.67ProLys: 4.67 ± 1.089
2.802ProLeu: 2.802 ± 0.711
0.311ProMet: 0.311 ± 0.335
2.491ProAsn: 2.491 ± 0.782
1.245ProPro: 1.245 ± 0.746
0.311ProGln: 0.311 ± 0.255
1.868ProArg: 1.868 ± 0.695
1.245ProSer: 1.245 ± 0.494
1.868ProThr: 1.868 ± 0.749
1.557ProVal: 1.557 ± 0.622
0.0ProTrp: 0.0 ± 0.0
2.802ProTyr: 2.802 ± 1.02
0.0ProXaa: 0.0 ± 0.0
Gln
3.736GlnAla: 3.736 ± 1.138
0.311GlnCys: 0.311 ± 0.255
1.245GlnAsp: 1.245 ± 0.45
2.491GlnGlu: 2.491 ± 0.803
2.179GlnPhe: 2.179 ± 0.678
1.557GlnGly: 1.557 ± 0.721
0.311GlnHis: 0.311 ± 0.255
2.802GlnIle: 2.802 ± 0.995
1.557GlnLys: 1.557 ± 0.62
2.802GlnLeu: 2.802 ± 0.786
1.245GlnMet: 1.245 ± 0.561
1.245GlnAsn: 1.245 ± 0.578
0.623GlnPro: 0.623 ± 0.562
2.802GlnGln: 2.802 ± 0.798
1.868GlnArg: 1.868 ± 0.828
1.868GlnSer: 1.868 ± 0.844
1.245GlnThr: 1.245 ± 0.587
2.491GlnVal: 2.491 ± 0.862
0.623GlnTrp: 0.623 ± 0.443
1.868GlnTyr: 1.868 ± 0.79
0.0GlnXaa: 0.0 ± 0.0
Arg
0.623ArgAla: 0.623 ± 0.369
0.0ArgCys: 0.0 ± 0.0
2.802ArgAsp: 2.802 ± 0.833
2.179ArgGlu: 2.179 ± 0.577
1.868ArgPhe: 1.868 ± 0.694
4.047ArgGly: 4.047 ± 0.921
0.623ArgHis: 0.623 ± 0.57
3.425ArgIle: 3.425 ± 0.886
5.604ArgLys: 5.604 ± 1.204
7.472ArgLeu: 7.472 ± 1.332
0.311ArgMet: 0.311 ± 0.342
0.934ArgAsn: 0.934 ± 0.607
0.623ArgPro: 0.623 ± 0.446
0.934ArgGln: 0.934 ± 0.393
0.934ArgArg: 0.934 ± 0.559
1.868ArgSer: 1.868 ± 0.604
2.179ArgThr: 2.179 ± 0.703
2.802ArgVal: 2.802 ± 0.937
0.311ArgTrp: 0.311 ± 0.315
0.623ArgTyr: 0.623 ± 0.383
0.0ArgXaa: 0.0 ± 0.0
Ser
1.557SerAla: 1.557 ± 0.667
0.0SerCys: 0.0 ± 0.0
4.359SerAsp: 4.359 ± 1.117
3.736SerGlu: 3.736 ± 0.913
3.113SerPhe: 3.113 ± 0.818
1.868SerGly: 1.868 ± 0.802
0.934SerHis: 0.934 ± 0.532
6.849SerIle: 6.849 ± 2.032
6.227SerLys: 6.227 ± 1.412
5.293SerLeu: 5.293 ± 0.782
0.311SerMet: 0.311 ± 0.335
4.981SerAsn: 4.981 ± 1.461
1.868SerPro: 1.868 ± 0.747
1.868SerGln: 1.868 ± 0.692
2.179SerArg: 2.179 ± 0.571
2.179SerSer: 2.179 ± 0.634
2.491SerThr: 2.491 ± 1.235
1.868SerVal: 1.868 ± 0.779
0.311SerTrp: 0.311 ± 0.315
4.67SerTyr: 4.67 ± 1.673
0.0SerXaa: 0.0 ± 0.0
Thr
1.557ThrAla: 1.557 ± 0.565
0.0ThrCys: 0.0 ± 0.0
3.736ThrAsp: 3.736 ± 0.97
6.849ThrGlu: 6.849 ± 1.602
4.047ThrPhe: 4.047 ± 1.093
3.425ThrGly: 3.425 ± 0.925
0.623ThrHis: 0.623 ± 0.45
6.227ThrIle: 6.227 ± 1.275
4.67ThrLys: 4.67 ± 1.109
8.095ThrLeu: 8.095 ± 1.737
1.245ThrMet: 1.245 ± 0.611
3.113ThrAsn: 3.113 ± 1.125
2.179ThrPro: 2.179 ± 0.773
1.557ThrGln: 1.557 ± 0.6
2.802ThrArg: 2.802 ± 0.818
2.802ThrSer: 2.802 ± 1.019
4.67ThrThr: 4.67 ± 1.34
4.047ThrVal: 4.047 ± 1.319
0.623ThrTrp: 0.623 ± 0.407
1.868ThrTyr: 1.868 ± 0.803
0.0ThrXaa: 0.0 ± 0.0
Val
2.491ValAla: 2.491 ± 0.903
0.0ValCys: 0.0 ± 0.0
1.868ValAsp: 1.868 ± 0.559
1.868ValGlu: 1.868 ± 0.735
1.557ValPhe: 1.557 ± 0.705
1.557ValGly: 1.557 ± 0.605
0.934ValHis: 0.934 ± 0.636
3.113ValIle: 3.113 ± 0.845
3.736ValLys: 3.736 ± 0.852
4.047ValLeu: 4.047 ± 0.926
0.934ValMet: 0.934 ± 0.617
4.359ValAsn: 4.359 ± 1.255
0.311ValPro: 0.311 ± 0.326
0.623ValGln: 0.623 ± 0.42
0.623ValArg: 0.623 ± 0.473
3.736ValSer: 3.736 ± 1.146
4.359ValThr: 4.359 ± 1.221
1.868ValVal: 1.868 ± 0.835
0.623ValTrp: 0.623 ± 0.369
1.868ValTyr: 1.868 ± 0.629
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.311TrpAsp: 0.311 ± 0.285
0.934TrpGlu: 0.934 ± 0.513
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.311TrpHis: 0.311 ± 0.246
0.311TrpIle: 0.311 ± 0.28
0.623TrpLys: 0.623 ± 0.329
0.934TrpLeu: 0.934 ± 0.577
0.0TrpMet: 0.0 ± 0.0
0.311TrpAsn: 0.311 ± 0.246
0.0TrpPro: 0.0 ± 0.0
0.311TrpGln: 0.311 ± 0.315
0.0TrpArg: 0.0 ± 0.0
0.934TrpSer: 0.934 ± 0.368
0.311TrpThr: 0.311 ± 0.311
0.311TrpVal: 0.311 ± 0.311
0.311TrpTrp: 0.311 ± 0.285
0.311TrpTyr: 0.311 ± 0.315
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.934TyrAla: 0.934 ± 0.559
0.311TyrCys: 0.311 ± 0.335
2.179TyrAsp: 2.179 ± 1.399
3.425TyrGlu: 3.425 ± 1.503
2.179TyrPhe: 2.179 ± 0.739
3.425TyrGly: 3.425 ± 1.097
0.311TyrHis: 0.311 ± 0.255
4.359TyrIle: 4.359 ± 1.34
5.293TyrLys: 5.293 ± 1.989
4.67TyrLeu: 4.67 ± 0.889
1.868TyrMet: 1.868 ± 0.864
4.67TyrAsn: 4.67 ± 1.241
2.179TyrPro: 2.179 ± 0.721
2.179TyrGln: 2.179 ± 0.845
3.425TyrArg: 3.425 ± 1.094
3.425TyrSer: 3.425 ± 0.793
4.359TyrThr: 4.359 ± 1.043
1.245TyrVal: 1.245 ± 0.58
0.623TyrTrp: 0.623 ± 0.366
1.245TyrTyr: 1.245 ± 0.573
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 22 proteins (3213 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski