Amino acid dipeptide frequency for Streptococcus satellite phage Javan148

Apart from single amino acid frequencies, one can also calculate amino acid dipeptide frequencies, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red and all under-represented ones in blue in the table.
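The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function names, the toy sequences, and the choice to normalise by the total number of overlapping pairs are assumptions, and non-standard residues are not mapped to Xaa here.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs are counted within each protein, never across protein boundaries.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
EXPECTED = 1000.0 / (len(AMINO_ACIDS) ** 2)


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"


toy = ["MKKLA", "MKLL"]  # hypothetical sequences, not taken from Javan148
freqs = dipeptide_per_mille(toy)
```

With this convention, a table value such as LeuGlu (9.939‰) would be labelled "over" and AlaHis (0.203‰) "under".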

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
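As a small worked example of the unit conversion, the following sketch converts per-mille values to fractions and percentages. The helper names are hypothetical; the sample values are the LeuGlu entry from the table and the 2.5‰ uniform expectation.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


leu_glu_fraction = per_mille_to_fraction(9.939)  # LeuGlu entry from the table
uniform_percent = per_mille_to_percent(2.5)      # uniform expectation for 400 dipeptides
```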

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.811 ± 0.34
AlaCys: 0.609 ± 0.293
AlaAsp: 4.26 ± 1.026
AlaGlu: 5.274 ± 1.081
AlaPhe: 3.245 ± 0.744
AlaGly: 1.826 ± 0.589
AlaHis: 0.203 ± 0.188
AlaIle: 6.288 ± 1.178
AlaLys: 4.462 ± 0.878
AlaLeu: 5.477 ± 1.083
AlaMet: 1.623 ± 0.47
AlaAsn: 4.665 ± 0.881
AlaPro: 1.42 ± 0.485
AlaGln: 2.434 ± 0.475
AlaArg: 2.231 ± 0.557
AlaSer: 2.637 ± 0.924
AlaThr: 3.854 ± 0.546
AlaVal: 3.043 ± 0.811
AlaTrp: 0.811 ± 0.524
AlaTyr: 1.42 ± 0.424
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.014 ± 0.368
CysCys: 0.203 ± 0.203
CysAsp: 0.406 ± 0.313
CysGlu: 0.203 ± 0.203
CysPhe: 0.203 ± 0.188
CysGly: 0.811 ± 0.529
CysHis: 0.203 ± 0.176
CysIle: 0.203 ± 0.182
CysLys: 0.203 ± 0.177
CysLeu: 1.014 ± 0.419
CysMet: 0.0 ± 0.0
CysAsn: 0.609 ± 0.318
CysPro: 0.406 ± 0.344
CysGln: 0.203 ± 0.203
CysArg: 0.203 ± 0.203
CysSer: 0.203 ± 0.225
CysThr: 0.203 ± 0.239
CysVal: 0.406 ± 0.227
CysTrp: 0.0 ± 0.0
CysTyr: 0.406 ± 0.279
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.42 ± 0.532
AspCys: 1.217 ± 0.553
AspAsp: 3.043 ± 0.719
AspGlu: 3.651 ± 0.86
AspPhe: 2.637 ± 0.833
AspGly: 2.434 ± 0.975
AspHis: 0.203 ± 0.186
AspIle: 6.491 ± 1.07
AspLys: 5.477 ± 0.857
AspLeu: 5.882 ± 0.896
AspMet: 1.014 ± 0.415
AspAsn: 2.434 ± 0.703
AspPro: 1.014 ± 0.413
AspGln: 1.826 ± 0.503
AspArg: 3.043 ± 0.734
AspSer: 5.071 ± 1.089
AspThr: 2.637 ± 0.901
AspVal: 2.028 ± 0.515
AspTrp: 0.203 ± 0.207
AspTyr: 4.868 ± 1.341
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.274 ± 0.958
GluCys: 1.217 ± 0.643
GluAsp: 3.651 ± 0.926
GluGlu: 6.085 ± 1.91
GluPhe: 3.043 ± 1.219
GluGly: 2.84 ± 0.916
GluHis: 3.043 ± 0.742
GluIle: 6.288 ± 1.306
GluLys: 5.68 ± 0.776
GluLeu: 9.128 ± 1.111
GluMet: 2.028 ± 0.642
GluAsn: 2.231 ± 0.485
GluPro: 2.637 ± 0.67
GluGln: 4.26 ± 1.007
GluArg: 3.245 ± 0.937
GluSer: 1.623 ± 0.541
GluThr: 4.868 ± 1.006
GluVal: 3.448 ± 0.689
GluTrp: 1.217 ± 0.426
GluTyr: 3.043 ± 0.836
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.623 ± 0.467
PheCys: 0.0 ± 0.0
PheAsp: 2.84 ± 0.717
PheGlu: 3.043 ± 0.718
PhePhe: 1.826 ± 0.467
PheGly: 1.623 ± 0.435
PheHis: 1.623 ± 0.455
PheIle: 5.071 ± 1.127
PheLys: 3.651 ± 0.825
PheLeu: 3.245 ± 0.802
PheMet: 0.609 ± 0.343
PheAsn: 2.434 ± 0.681
PhePro: 1.014 ± 0.445
PheGln: 1.217 ± 0.396
PheArg: 2.434 ± 0.627
PheSer: 2.637 ± 0.698
PheThr: 2.84 ± 0.833
PheVal: 2.231 ± 0.497
PheTrp: 0.0 ± 0.0
PheTyr: 1.217 ± 0.429
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.434 ± 0.742
GlyCys: 0.203 ± 0.182
GlyAsp: 3.651 ± 1.285
GlyGlu: 2.434 ± 0.71
GlyPhe: 1.826 ± 0.562
GlyGly: 1.623 ± 0.595
GlyHis: 0.609 ± 0.39
GlyIle: 4.26 ± 0.807
GlyLys: 3.854 ± 1.03
GlyLeu: 6.085 ± 1.041
GlyMet: 1.217 ± 0.431
GlyAsn: 1.826 ± 0.538
GlyPro: 0.203 ± 0.181
GlyGln: 1.826 ± 0.816
GlyArg: 1.623 ± 0.406
GlySer: 1.826 ± 0.592
GlyThr: 2.434 ± 0.697
GlyVal: 4.057 ± 0.968
GlyTrp: 0.609 ± 0.416
GlyTyr: 3.854 ± 0.82
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.826 ± 0.821
HisCys: 0.203 ± 0.182
HisAsp: 0.406 ± 0.258
HisGlu: 0.203 ± 0.214
HisPhe: 0.406 ± 0.254
HisGly: 1.623 ± 0.59
HisHis: 0.406 ± 0.415
HisIle: 1.826 ± 0.516
HisLys: 1.623 ± 0.59
HisLeu: 2.028 ± 0.756
HisMet: 0.406 ± 0.263
HisAsn: 0.406 ± 0.267
HisPro: 0.811 ± 0.572
HisGln: 1.217 ± 0.604
HisArg: 0.406 ± 0.249
HisSer: 0.609 ± 0.388
HisThr: 0.811 ± 0.327
HisVal: 0.609 ± 0.332
HisTrp: 0.203 ± 0.203
HisTyr: 1.217 ± 0.603
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.071 ± 0.758
IleCys: 0.609 ± 0.308
IleAsp: 7.099 ± 0.918
IleGlu: 6.288 ± 1.12
IlePhe: 3.043 ± 0.706
IleGly: 2.231 ± 0.657
IleHis: 0.811 ± 0.479
IleIle: 5.882 ± 1.25
IleLys: 9.736 ± 1.379
IleLeu: 4.462 ± 0.694
IleMet: 1.217 ± 0.479
IleAsn: 3.854 ± 0.84
IlePro: 2.637 ± 0.771
IleGln: 2.434 ± 0.586
IleArg: 3.245 ± 0.658
IleSer: 6.897 ± 1.194
IleThr: 5.274 ± 0.882
IleVal: 2.637 ± 0.658
IleTrp: 0.203 ± 0.208
IleTyr: 3.854 ± 0.779
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.722 ± 1.219
LysCys: 0.203 ± 0.239
LysAsp: 4.26 ± 0.963
LysGlu: 8.519 ± 1.404
LysPhe: 3.043 ± 0.655
LysGly: 3.854 ± 0.963
LysHis: 2.028 ± 0.465
LysIle: 4.26 ± 0.935
LysLys: 8.114 ± 1.585
LysLeu: 6.897 ± 1.079
LysMet: 2.231 ± 0.62
LysAsn: 4.868 ± 0.989
LysPro: 3.854 ± 0.851
LysGln: 4.868 ± 0.907
LysArg: 4.057 ± 0.783
LysSer: 4.057 ± 1.146
LysThr: 4.868 ± 1.213
LysVal: 6.085 ± 1.231
LysTrp: 0.609 ± 0.304
LysTyr: 3.854 ± 1.004
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.882 ± 1.333
LeuCys: 0.811 ± 0.369
LeuAsp: 5.071 ± 0.803
LeuGlu: 9.939 ± 1.379
LeuPhe: 3.854 ± 0.787
LeuGly: 6.694 ± 1.154
LeuHis: 0.609 ± 0.328
LeuIle: 8.114 ± 1.318
LeuLys: 9.331 ± 1.474
LeuLeu: 8.316 ± 1.409
LeuMet: 2.637 ± 0.602
LeuAsn: 6.694 ± 1.201
LeuPro: 4.462 ± 0.996
LeuGln: 3.245 ± 0.477
LeuArg: 2.434 ± 0.551
LeuSer: 7.505 ± 1.403
LeuThr: 4.057 ± 0.77
LeuVal: 5.071 ± 1.128
LeuTrp: 1.42 ± 0.488
LeuTyr: 3.854 ± 0.849
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.637 ± 0.735
MetCys: 0.0 ± 0.0
MetAsp: 1.42 ± 0.511
MetGlu: 1.217 ± 0.399
MetPhe: 0.811 ± 0.386
MetGly: 0.203 ± 0.176
MetHis: 0.0 ± 0.0
MetIle: 1.42 ± 0.465
MetLys: 2.028 ± 0.484
MetLeu: 3.245 ± 0.78
MetMet: 0.203 ± 0.209
MetAsn: 2.231 ± 0.722
MetPro: 0.203 ± 0.177
MetGln: 0.406 ± 0.418
MetArg: 1.623 ± 0.518
MetSer: 1.42 ± 0.444
MetThr: 2.84 ± 0.7
MetVal: 0.811 ± 0.392
MetTrp: 0.0 ± 0.0
MetTyr: 0.203 ± 0.239
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.854 ± 0.75
AsnCys: 0.0 ± 0.0
AsnAsp: 2.637 ± 0.794
AsnGlu: 3.245 ± 0.799
AsnPhe: 1.826 ± 0.545
AsnGly: 3.651 ± 0.999
AsnHis: 1.623 ± 0.501
AsnIle: 4.665 ± 1.002
AsnLys: 4.868 ± 0.972
AsnLeu: 5.071 ± 1.065
AsnMet: 1.014 ± 0.39
AsnAsn: 3.043 ± 0.737
AsnPro: 2.434 ± 0.553
AsnGln: 2.84 ± 0.726
AsnArg: 3.854 ± 0.727
AsnSer: 2.231 ± 0.513
AsnThr: 3.245 ± 0.803
AsnVal: 2.231 ± 0.699
AsnTrp: 0.203 ± 0.185
AsnTyr: 3.043 ± 0.78
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.014 ± 0.439
ProCys: 0.203 ± 0.225
ProAsp: 1.014 ± 0.547
ProGlu: 2.637 ± 0.767
ProPhe: 1.42 ± 0.558
ProGly: 1.014 ± 0.455
ProHis: 0.203 ± 0.182
ProIle: 2.028 ± 0.681
ProLys: 2.84 ± 0.794
ProLeu: 2.637 ± 0.756
ProMet: 0.609 ± 0.312
ProAsn: 2.028 ± 0.833
ProPro: 0.609 ± 0.343
ProGln: 1.014 ± 0.414
ProArg: 2.028 ± 0.65
ProSer: 2.231 ± 0.648
ProThr: 2.84 ± 0.489
ProVal: 2.028 ± 0.66
ProTrp: 0.406 ± 0.258
ProTyr: 1.42 ± 0.534
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.651 ± 0.756
GlnCys: 0.406 ± 0.267
GlnAsp: 1.826 ± 0.59
GlnGlu: 3.043 ± 0.722
GlnPhe: 0.811 ± 0.33
GlnGly: 2.434 ± 0.746
GlnHis: 0.609 ± 0.313
GlnIle: 2.434 ± 0.65
GlnLys: 4.057 ± 0.884
GlnLeu: 5.882 ± 0.996
GlnMet: 1.217 ± 0.655
GlnAsn: 2.434 ± 0.66
GlnPro: 1.217 ± 0.523
GlnGln: 2.231 ± 0.498
GlnArg: 3.043 ± 0.516
GlnSer: 2.434 ± 0.477
GlnThr: 1.42 ± 0.522
GlnVal: 4.26 ± 0.875
GlnTrp: 0.203 ± 0.239
GlnTyr: 1.42 ± 0.379
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.623 ± 0.598
ArgCys: 0.406 ± 0.247
ArgAsp: 2.434 ± 0.547
ArgGlu: 2.231 ± 0.602
ArgPhe: 2.028 ± 0.453
ArgGly: 1.623 ± 0.604
ArgHis: 1.217 ± 0.416
ArgIle: 2.434 ± 0.648
ArgLys: 5.68 ± 1.131
ArgLeu: 6.694 ± 0.82
ArgMet: 1.217 ± 0.559
ArgAsn: 1.826 ± 0.672
ArgPro: 1.217 ± 0.618
ArgGln: 3.043 ± 0.674
ArgArg: 1.623 ± 0.57
ArgSer: 3.448 ± 0.695
ArgThr: 2.434 ± 0.516
ArgVal: 2.637 ± 0.632
ArgTrp: 0.609 ± 0.369
ArgTyr: 3.043 ± 0.803
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.231 ± 0.598
SerCys: 0.203 ± 0.203
SerAsp: 5.274 ± 0.792
SerGlu: 3.245 ± 0.894
SerPhe: 2.637 ± 0.638
SerGly: 2.637 ± 0.679
SerHis: 0.811 ± 0.371
SerIle: 3.854 ± 1.077
SerLys: 4.868 ± 1.045
SerLeu: 7.302 ± 1.367
SerMet: 1.623 ± 0.698
SerAsn: 2.434 ± 0.796
SerPro: 1.42 ± 0.485
SerGln: 3.651 ± 0.833
SerArg: 2.84 ± 0.868
SerSer: 3.245 ± 0.991
SerThr: 3.651 ± 0.863
SerVal: 3.854 ± 0.98
SerTrp: 0.406 ± 0.249
SerTyr: 2.84 ± 0.914
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.448 ± 0.789
ThrCys: 0.0 ± 0.0
ThrAsp: 1.217 ± 0.436
ThrGlu: 3.854 ± 0.643
ThrPhe: 1.42 ± 0.54
ThrGly: 4.26 ± 0.735
ThrHis: 1.42 ± 0.535
ThrIle: 5.477 ± 0.999
ThrLys: 4.26 ± 0.961
ThrLeu: 6.288 ± 1.093
ThrMet: 1.217 ± 0.606
ThrAsn: 3.043 ± 1.001
ThrPro: 2.231 ± 0.879
ThrGln: 2.84 ± 0.616
ThrArg: 3.448 ± 0.78
ThrSer: 2.84 ± 0.623
ThrThr: 2.84 ± 0.919
ThrVal: 3.043 ± 0.797
ThrTrp: 1.014 ± 0.375
ThrTyr: 4.057 ± 0.752
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.434 ± 0.519
ValCys: 0.203 ± 0.203
ValAsp: 2.84 ± 0.781
ValGlu: 4.057 ± 1.002
ValPhe: 3.448 ± 0.753
ValGly: 2.028 ± 0.515
ValHis: 0.203 ± 0.203
ValIle: 4.26 ± 0.778
ValLys: 5.274 ± 0.83
ValLeu: 6.085 ± 1.204
ValMet: 1.42 ± 0.47
ValAsn: 3.854 ± 0.684
ValPro: 1.014 ± 0.421
ValGln: 2.028 ± 0.808
ValArg: 2.028 ± 0.725
ValSer: 4.665 ± 0.965
ValThr: 3.854 ± 0.902
ValVal: 3.651 ± 0.906
ValTrp: 0.811 ± 0.402
ValTyr: 1.826 ± 0.521
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.609 ± 0.312
TrpCys: 0.0 ± 0.0
TrpAsp: 1.014 ± 0.462
TrpGlu: 0.811 ± 0.329
TrpPhe: 0.609 ± 0.325
TrpGly: 0.203 ± 0.177
TrpHis: 0.0 ± 0.0
TrpIle: 0.203 ± 0.228
TrpLys: 0.203 ± 0.182
TrpLeu: 1.826 ± 0.536
TrpMet: 0.0 ± 0.0
TrpAsn: 0.811 ± 0.385
TrpPro: 0.0 ± 0.0
TrpGln: 0.406 ± 0.279
TrpArg: 0.406 ± 0.292
TrpSer: 0.609 ± 0.331
TrpThr: 0.203 ± 0.182
TrpVal: 1.217 ± 0.439
TrpTrp: 0.609 ± 0.326
TrpTyr: 0.406 ± 0.293
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.014 ± 0.489
TyrCys: 0.406 ± 0.25
TyrAsp: 2.231 ± 0.838
TyrGlu: 4.868 ± 0.891
TyrPhe: 3.043 ± 0.606
TyrGly: 2.84 ± 0.623
TyrHis: 1.42 ± 0.534
TyrIle: 2.028 ± 0.594
TyrLys: 3.043 ± 0.808
TyrLeu: 2.434 ± 0.579
TyrMet: 1.217 ± 0.553
TyrAsn: 3.854 ± 0.721
TyrPro: 1.42 ± 0.558
TyrGln: 3.043 ± 0.617
TyrArg: 3.651 ± 0.806
TyrSer: 2.84 ± 0.923
TyrThr: 3.245 ± 0.501
TyrVal: 2.637 ± 0.672
TyrTrp: 0.609 ± 0.428
TyrTyr: 2.84 ± 0.852
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 32 proteins (4931 amino acids)
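
Each table entry above contains the pattern "FirstSecond: value ± error" with three-letter residue codes. A minimal sketch for reading such lines into tuples follows; the regular expression and function name are illustrative assumptions, and any text before the residue pair on a line is ignored.

```python
import re

# Two three-letter residue codes, a colon, the value, "±", and the error.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)


def parse_entry(line):
    """Return (first, second, value, error), or None if the line is not an entry."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


lines = ["Leu", "LeuGlu: 9.939 ± 1.379", "LeuHis: 0.609 ± 0.328"]
entries = [e for e in map(parse_entry, lines) if e is not None]
# Keep only dipeptides above the 2.5 per mille uniform expectation.
over = [(a + b, v) for a, b, v, _ in entries if v > 2.5]
```

Row labels such as "Leu" do not match the pattern and are skipped.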

Note: The error was estimated by bootstrapping (×100) at the protein level.
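
Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is taken as the error. This is an assumed procedure for illustration only; the exact resampling scheme and the spread statistic used by Proteome-pI (population standard deviation is used here) are not stated in the source, and the function names are hypothetical.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Frequency of one dipeptide in per mille over all overlapping pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)


toy = ["MKKLA", "AAAA", "MKLL"]  # hypothetical sequences, not from Javan148
kk_error = bootstrap_error(toy, "KK")
```

A dipeptide that never occurs gets an error of zero under this scheme, which matches the "0.0 ± 0.0" entries in the table.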

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski