Amino acid dipeptide frequency for Streptococcus satellite phage Javan218

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
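The counting described above can be sketched in Python. This is only an illustration under stated assumptions: the function name `dipeptide_per_mille`, the toy sequences, and the choice to normalize by the total number of overlapping residue pairs are not taken from the page, which does not say how the frequencies are normalized. One-letter codes are used, with `X` standing in for Xaa.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all 441 pairs.

    Assumption: the frequency is the pair count divided by the total
    number of overlapping pairs in the proteome.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy sequences, not the Javan218 proteome.
toy = ["MKKLLE", "MKEL"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
uniform = 1000.0 / 400
overrepresented = sorted(p for p, f in freqs.items() if f > uniform)
```

With real data, `proteins` would hold the 19 protein sequences of the proteome, and `overrepresented` would correspond to the cells marked in red if the threshold is indeed the uniform expectation.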

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
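As a small worked example of the unit conversion, the sketch below converts two values from the table (LeuLys and AlaAla) from per mille to fractions and percent, and compares them with the uniform expectation of 0.25% (2.5‰). The helper names are hypothetical, and using the uniform expectation as the red/blue threshold is an assumption based on the description above.

```python
PER_MILLE = 1e-3  # one per mille as a fraction


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * PER_MILLE


def per_mille_to_percent(value):
    """Convert a per mille value to percent (10 per mille = 1%)."""
    return value / 10.0


# Uniform expectation for 400 equally likely dipeptides, in per mille.
uniform = 1000.0 / 400

# Two values copied from the table, in per mille.
table_values = {"LeuLys": 13.277, "AlaAla": 0.648}

labels = {}
for name, value in table_values.items():
    # Assumption: "more than expected" means above the uniform expectation.
    labels[name] = "over" if value > uniform else "under"
    print(name, per_mille_to_fraction(value), per_mille_to_percent(value), labels[name])
```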

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.648AlaAla: 0.648 ± 0.406
0.972AlaCys: 0.972 ± 0.451
2.915AlaAsp: 2.915 ± 0.733
5.505AlaGlu: 5.505 ± 1.33
1.295AlaPhe: 1.295 ± 0.533
1.295AlaGly: 1.295 ± 0.673
0.972AlaHis: 0.972 ± 0.505
4.21AlaIle: 4.21 ± 0.804
7.448AlaLys: 7.448 ± 2.032
2.915AlaLeu: 2.915 ± 0.888
2.267AlaMet: 2.267 ± 0.722
3.238AlaAsn: 3.238 ± 1.004
0.648AlaPro: 0.648 ± 0.409
2.591AlaGln: 2.591 ± 1.058
3.562AlaArg: 3.562 ± 0.855
4.858AlaSer: 4.858 ± 1.268
3.886AlaThr: 3.886 ± 1.188
3.562AlaVal: 3.562 ± 0.851
0.648AlaTrp: 0.648 ± 0.472
2.267AlaTyr: 2.267 ± 0.822
0.0AlaXaa: 0.0 ± 0.0
Cys
0.648CysAla: 0.648 ± 0.387
0.0CysCys: 0.0 ± 0.0
0.972CysAsp: 0.972 ± 0.503
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.943CysGly: 1.943 ± 0.924
0.324CysHis: 0.324 ± 0.324
0.972CysIle: 0.972 ± 0.717
0.648CysLys: 0.648 ± 0.424
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.972CysAsn: 0.972 ± 0.545
1.295CysPro: 1.295 ± 0.731
0.324CysGln: 0.324 ± 0.306
0.648CysArg: 0.648 ± 0.426
1.943CysSer: 1.943 ± 0.776
0.324CysThr: 0.324 ± 0.292
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.324CysTyr: 0.324 ± 0.292
0.0CysXaa: 0.0 ± 0.0
Asp
0.972AspAla: 0.972 ± 0.615
0.648AspCys: 0.648 ± 0.372
3.886AspAsp: 3.886 ± 0.932
4.21AspGlu: 4.21 ± 1.447
3.886AspPhe: 3.886 ± 0.891
2.591AspGly: 2.591 ± 1.224
1.295AspHis: 1.295 ± 0.656
8.096AspIle: 8.096 ± 1.648
6.153AspLys: 6.153 ± 1.258
3.886AspLeu: 3.886 ± 0.99
1.619AspMet: 1.619 ± 0.661
4.534AspAsn: 4.534 ± 1.223
0.324AspPro: 0.324 ± 0.339
0.324AspGln: 0.324 ± 0.317
0.972AspArg: 0.972 ± 0.58
2.591AspSer: 2.591 ± 0.921
2.915AspThr: 2.915 ± 0.702
2.591AspVal: 2.591 ± 0.946
0.324AspTrp: 0.324 ± 0.288
5.505AspTyr: 5.505 ± 1.393
0.0AspXaa: 0.0 ± 0.0
Glu
6.153GluAla: 6.153 ± 1.693
0.972GluCys: 0.972 ± 0.484
4.858GluAsp: 4.858 ± 1.41
8.744GluGlu: 8.744 ± 1.999
2.267GluPhe: 2.267 ± 0.631
3.238GluGly: 3.238 ± 1.266
1.295GluHis: 1.295 ± 0.753
6.801GluIle: 6.801 ± 2.105
7.124GluLys: 7.124 ± 1.138
10.687GluLeu: 10.687 ± 1.822
0.324GluMet: 0.324 ± 0.341
5.181GluAsn: 5.181 ± 1.192
3.238GluPro: 3.238 ± 0.743
3.886GluGln: 3.886 ± 0.986
4.858GluArg: 4.858 ± 1.55
4.21GluSer: 4.21 ± 1.472
2.915GluThr: 2.915 ± 1.06
4.534GluVal: 4.534 ± 1.051
1.619GluTrp: 1.619 ± 0.718
2.915GluTyr: 2.915 ± 0.849
0.0GluXaa: 0.0 ± 0.0
Phe
0.972PheAla: 0.972 ± 0.461
0.972PheCys: 0.972 ± 0.544
2.591PheAsp: 2.591 ± 0.643
3.886PheGlu: 3.886 ± 1.083
0.972PhePhe: 0.972 ± 0.699
1.619PheGly: 1.619 ± 0.552
0.972PheHis: 0.972 ± 0.459
4.858PheIle: 4.858 ± 1.665
3.562PheLys: 3.562 ± 1.116
3.886PheLeu: 3.886 ± 0.876
0.324PheMet: 0.324 ± 0.272
0.972PheAsn: 0.972 ± 0.54
0.648PhePro: 0.648 ± 0.439
0.972PheGln: 0.972 ± 0.689
1.619PheArg: 1.619 ± 0.838
2.591PheSer: 2.591 ± 0.974
1.619PheThr: 1.619 ± 0.463
0.648PheVal: 0.648 ± 0.379
0.648PheTrp: 0.648 ± 0.365
1.619PheTyr: 1.619 ± 0.662
0.0PheXaa: 0.0 ± 0.0
Gly
2.267GlyAla: 2.267 ± 0.842
0.0GlyCys: 0.0 ± 0.0
1.295GlyAsp: 1.295 ± 0.642
3.562GlyGlu: 3.562 ± 1.053
2.267GlyPhe: 2.267 ± 0.943
1.943GlyGly: 1.943 ± 0.665
0.972GlyHis: 0.972 ± 0.476
4.21GlyIle: 4.21 ± 1.255
2.915GlyLys: 2.915 ± 0.641
4.534GlyLeu: 4.534 ± 1.236
0.648GlyMet: 0.648 ± 0.48
2.915GlyAsn: 2.915 ± 0.861
1.943GlyPro: 1.943 ± 0.753
0.648GlyGln: 0.648 ± 0.438
0.972GlyArg: 0.972 ± 0.483
1.943GlySer: 1.943 ± 0.812
1.619GlyThr: 1.619 ± 0.555
3.238GlyVal: 3.238 ± 0.804
0.648GlyTrp: 0.648 ± 0.487
3.886GlyTyr: 3.886 ± 1.0
0.0GlyXaa: 0.0 ± 0.0
His
0.972HisAla: 0.972 ± 0.675
0.324HisCys: 0.324 ± 0.324
0.324HisAsp: 0.324 ± 0.31
0.648HisGlu: 0.648 ± 0.406
1.295HisPhe: 1.295 ± 0.672
1.619HisGly: 1.619 ± 0.632
0.324HisHis: 0.324 ± 0.324
1.619HisIle: 1.619 ± 0.721
1.619HisLys: 1.619 ± 0.697
1.619HisLeu: 1.619 ± 0.682
0.324HisMet: 0.324 ± 0.306
2.267HisAsn: 2.267 ± 0.8
0.0HisPro: 0.0 ± 0.0
0.648HisGln: 0.648 ± 0.47
0.324HisArg: 0.324 ± 0.293
0.648HisSer: 0.648 ± 0.346
1.943HisThr: 1.943 ± 0.815
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.648HisTyr: 0.648 ± 0.421
0.0HisXaa: 0.0 ± 0.0
Ile
3.886IleAla: 3.886 ± 1.261
0.648IleCys: 0.648 ± 0.445
7.448IleAsp: 7.448 ± 1.955
5.505IleGlu: 5.505 ± 1.233
3.238IlePhe: 3.238 ± 0.79
3.238IleGly: 3.238 ± 0.939
0.324IleHis: 0.324 ± 0.292
3.562IleIle: 3.562 ± 1.096
8.096IleLys: 8.096 ± 1.101
4.534IleLeu: 4.534 ± 0.982
1.943IleMet: 1.943 ± 0.766
5.181IleAsn: 5.181 ± 1.196
2.591IlePro: 2.591 ± 1.066
2.591IleGln: 2.591 ± 1.14
1.619IleArg: 1.619 ± 0.689
5.505IleSer: 5.505 ± 1.231
3.562IleThr: 3.562 ± 0.964
3.562IleVal: 3.562 ± 1.143
0.0IleTrp: 0.0 ± 0.0
2.267IleTyr: 2.267 ± 0.73
0.0IleXaa: 0.0 ± 0.0
Lys
7.772LysAla: 7.772 ± 2.076
1.295LysCys: 1.295 ± 0.602
5.829LysAsp: 5.829 ± 1.31
10.687LysGlu: 10.687 ± 2.0
1.295LysPhe: 1.295 ± 0.434
5.181LysGly: 5.181 ± 1.413
1.295LysHis: 1.295 ± 0.55
4.21LysIle: 4.21 ± 1.026
10.687LysLys: 10.687 ± 1.43
9.067LysLeu: 9.067 ± 1.604
1.943LysMet: 1.943 ± 0.722
10.039LysAsn: 10.039 ± 1.136
2.267LysPro: 2.267 ± 0.808
5.505LysGln: 5.505 ± 1.239
5.505LysArg: 5.505 ± 1.284
7.448LysSer: 7.448 ± 1.026
7.124LysThr: 7.124 ± 1.174
4.858LysVal: 4.858 ± 0.92
0.648LysTrp: 0.648 ± 0.448
5.181LysTyr: 5.181 ± 1.684
0.0LysXaa: 0.0 ± 0.0
Leu
5.181LeuAla: 5.181 ± 1.212
0.648LeuCys: 0.648 ± 0.475
5.505LeuAsp: 5.505 ± 1.165
10.039LeuGlu: 10.039 ± 1.564
2.591LeuPhe: 2.591 ± 0.879
4.858LeuGly: 4.858 ± 1.051
0.972LeuHis: 0.972 ± 0.537
5.829LeuIle: 5.829 ± 1.278
13.277LeuLys: 13.277 ± 2.123
8.096LeuLeu: 8.096 ± 2.252
3.238LeuMet: 3.238 ± 0.887
6.477LeuAsn: 6.477 ± 1.207
1.943LeuPro: 1.943 ± 0.741
4.858LeuGln: 4.858 ± 0.792
2.591LeuArg: 2.591 ± 0.983
4.21LeuSer: 4.21 ± 0.766
5.181LeuThr: 5.181 ± 1.236
2.915LeuVal: 2.915 ± 0.927
1.295LeuTrp: 1.295 ± 0.564
3.238LeuTyr: 3.238 ± 0.581
0.0LeuXaa: 0.0 ± 0.0
Met
1.619MetAla: 1.619 ± 0.734
0.0MetCys: 0.0 ± 0.0
0.972MetAsp: 0.972 ± 0.694
1.943MetGlu: 1.943 ± 0.763
1.295MetPhe: 1.295 ± 0.757
1.295MetGly: 1.295 ± 0.578
0.0MetHis: 0.0 ± 0.0
0.648MetIle: 0.648 ± 0.433
1.943MetLys: 1.943 ± 0.707
1.943MetLeu: 1.943 ± 0.821
0.0MetMet: 0.0 ± 0.0
1.295MetAsn: 1.295 ± 0.463
0.0MetPro: 0.0 ± 0.0
0.648MetGln: 0.648 ± 0.448
0.972MetArg: 0.972 ± 0.413
0.972MetSer: 0.972 ± 0.56
1.943MetThr: 1.943 ± 0.709
0.972MetVal: 0.972 ± 0.696
0.0MetTrp: 0.0 ± 0.0
0.972MetTyr: 0.972 ± 0.445
0.0MetXaa: 0.0 ± 0.0
Asn
5.829AsnAla: 5.829 ± 1.028
1.295AsnCys: 1.295 ± 1.016
3.562AsnAsp: 3.562 ± 0.906
4.21AsnGlu: 4.21 ± 1.29
3.886AsnPhe: 3.886 ± 1.15
3.886AsnGly: 3.886 ± 1.064
0.324AsnHis: 0.324 ± 0.306
2.591AsnIle: 2.591 ± 0.795
5.505AsnLys: 5.505 ± 1.319
5.181AsnLeu: 5.181 ± 1.124
1.295AsnMet: 1.295 ± 0.638
5.181AsnAsn: 5.181 ± 1.678
1.943AsnPro: 1.943 ± 0.854
3.238AsnGln: 3.238 ± 1.109
2.267AsnArg: 2.267 ± 0.919
4.21AsnSer: 4.21 ± 1.142
4.534AsnThr: 4.534 ± 2.139
4.534AsnVal: 4.534 ± 1.07
0.972AsnTrp: 0.972 ± 0.563
3.886AsnTyr: 3.886 ± 0.969
0.0AsnXaa: 0.0 ± 0.0
Pro
1.619ProAla: 1.619 ± 0.486
0.0ProCys: 0.0 ± 0.0
0.648ProAsp: 0.648 ± 0.365
1.943ProGlu: 1.943 ± 0.758
1.295ProPhe: 1.295 ± 0.515
0.324ProGly: 0.324 ± 0.341
0.0ProHis: 0.0 ± 0.0
0.648ProIle: 0.648 ± 0.412
4.858ProLys: 4.858 ± 1.205
1.943ProLeu: 1.943 ± 0.764
0.324ProMet: 0.324 ± 0.341
1.619ProAsn: 1.619 ± 0.604
1.619ProPro: 1.619 ± 0.948
0.972ProGln: 0.972 ± 0.698
1.295ProArg: 1.295 ± 0.492
1.943ProSer: 1.943 ± 0.941
0.648ProThr: 0.648 ± 0.471
3.562ProVal: 3.562 ± 1.059
0.324ProTrp: 0.324 ± 0.297
1.295ProTyr: 1.295 ± 0.601
0.0ProXaa: 0.0 ± 0.0
Gln
4.21GlnAla: 4.21 ± 1.226
0.324GlnCys: 0.324 ± 0.341
1.295GlnAsp: 1.295 ± 0.503
3.562GlnGlu: 3.562 ± 0.955
0.648GlnPhe: 0.648 ± 0.649
0.648GlnGly: 0.648 ± 0.495
0.648GlnHis: 0.648 ± 0.586
1.619GlnIle: 1.619 ± 0.792
4.858GlnLys: 4.858 ± 1.055
5.505GlnLeu: 5.505 ± 1.104
0.648GlnMet: 0.648 ± 0.443
1.619GlnAsn: 1.619 ± 0.789
2.267GlnPro: 2.267 ± 0.623
4.21GlnGln: 4.21 ± 1.145
1.943GlnArg: 1.943 ± 0.583
2.915GlnSer: 2.915 ± 0.841
1.943GlnThr: 1.943 ± 0.665
1.295GlnVal: 1.295 ± 0.606
0.324GlnTrp: 0.324 ± 0.292
1.943GlnTyr: 1.943 ± 0.726
0.0GlnXaa: 0.0 ± 0.0
Arg
2.915ArgAla: 2.915 ± 0.775
0.324ArgCys: 0.324 ± 0.306
2.591ArgAsp: 2.591 ± 0.822
3.562ArgGlu: 3.562 ± 0.803
1.295ArgPhe: 1.295 ± 0.467
1.295ArgGly: 1.295 ± 0.612
0.972ArgHis: 0.972 ± 0.43
3.562ArgIle: 3.562 ± 0.942
6.477ArgLys: 6.477 ± 1.345
3.562ArgLeu: 3.562 ± 0.946
0.324ArgMet: 0.324 ± 0.33
2.267ArgAsn: 2.267 ± 0.718
0.324ArgPro: 0.324 ± 0.292
3.238ArgGln: 3.238 ± 0.891
2.915ArgArg: 2.915 ± 0.963
2.267ArgSer: 2.267 ± 0.658
1.295ArgThr: 1.295 ± 0.552
1.619ArgVal: 1.619 ± 0.944
0.648ArgTrp: 0.648 ± 0.425
1.295ArgTyr: 1.295 ± 0.51
0.0ArgXaa: 0.0 ± 0.0
Ser
1.943SerAla: 1.943 ± 1.082
0.324SerCys: 0.324 ± 0.324
5.505SerAsp: 5.505 ± 0.865
5.829SerGlu: 5.829 ± 0.946
0.972SerPhe: 0.972 ± 0.537
1.943SerGly: 1.943 ± 0.693
1.295SerHis: 1.295 ± 0.717
4.534SerIle: 4.534 ± 1.031
5.181SerLys: 5.181 ± 1.699
8.096SerLeu: 8.096 ± 1.161
0.972SerMet: 0.972 ± 0.444
3.238SerAsn: 3.238 ± 1.51
1.295SerPro: 1.295 ± 0.954
1.295SerGln: 1.295 ± 0.509
4.21SerArg: 4.21 ± 1.161
5.505SerSer: 5.505 ± 1.764
2.591SerThr: 2.591 ± 0.66
2.591SerVal: 2.591 ± 0.73
0.648SerTrp: 0.648 ± 0.397
2.915SerTyr: 2.915 ± 0.729
0.0SerXaa: 0.0 ± 0.0
Thr
2.591ThrAla: 2.591 ± 0.725
0.324ThrCys: 0.324 ± 0.281
1.619ThrAsp: 1.619 ± 0.853
4.858ThrGlu: 4.858 ± 1.479
2.915ThrPhe: 2.915 ± 0.897
2.267ThrGly: 2.267 ± 1.017
1.295ThrHis: 1.295 ± 0.567
5.505ThrIle: 5.505 ± 0.974
6.477ThrLys: 6.477 ± 1.331
5.505ThrLeu: 5.505 ± 1.463
0.324ThrMet: 0.324 ± 0.281
4.21ThrAsn: 4.21 ± 0.824
1.943ThrPro: 1.943 ± 0.547
0.972ThrGln: 0.972 ± 0.457
1.295ThrArg: 1.295 ± 0.662
1.295ThrSer: 1.295 ± 0.71
4.534ThrThr: 4.534 ± 2.212
3.886ThrVal: 3.886 ± 1.218
0.648ThrTrp: 0.648 ± 0.53
1.619ThrTyr: 1.619 ± 0.663
0.0ThrXaa: 0.0 ± 0.0
Val
1.943ValAla: 1.943 ± 0.661
1.295ValCys: 1.295 ± 0.574
2.591ValAsp: 2.591 ± 0.836
1.943ValGlu: 1.943 ± 0.634
1.943ValPhe: 1.943 ± 0.848
0.648ValGly: 0.648 ± 0.411
2.267ValHis: 2.267 ± 0.995
3.562ValIle: 3.562 ± 1.073
4.534ValLys: 4.534 ± 0.899
5.181ValLeu: 5.181 ± 0.986
0.972ValMet: 0.972 ± 0.479
3.562ValAsn: 3.562 ± 1.372
1.943ValPro: 1.943 ± 0.74
1.943ValGln: 1.943 ± 0.591
1.619ValArg: 1.619 ± 0.789
3.238ValSer: 3.238 ± 1.014
3.886ValThr: 3.886 ± 0.943
1.943ValVal: 1.943 ± 0.666
0.0ValTrp: 0.0 ± 0.0
4.21ValTyr: 4.21 ± 0.744
0.0ValXaa: 0.0 ± 0.0
Trp
0.972TrpAla: 0.972 ± 0.379
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.295TrpGlu: 1.295 ± 0.74
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.324TrpHis: 0.324 ± 0.297
0.0TrpIle: 0.0 ± 0.0
0.972TrpLys: 0.972 ± 0.558
1.619TrpLeu: 1.619 ± 0.718
0.324TrpMet: 0.324 ± 0.393
0.648TrpAsn: 0.648 ± 0.385
0.0TrpPro: 0.0 ± 0.0
1.295TrpGln: 1.295 ± 0.557
0.324TrpArg: 0.324 ± 0.317
0.648TrpSer: 0.648 ± 0.412
0.324TrpThr: 0.324 ± 0.306
1.619TrpVal: 1.619 ± 0.59
0.0TrpTrp: 0.0 ± 0.0
0.324TrpTyr: 0.324 ± 0.293
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.591TyrAla: 2.591 ± 0.807
0.972TyrCys: 0.972 ± 0.461
2.915TyrAsp: 2.915 ± 1.002
3.886TyrGlu: 3.886 ± 1.288
2.591TyrPhe: 2.591 ± 1.099
2.915TyrGly: 2.915 ± 0.716
1.295TyrHis: 1.295 ± 0.572
2.267TyrIle: 2.267 ± 0.843
5.505TyrLys: 5.505 ± 1.539
5.181TyrLeu: 5.181 ± 1.0
1.295TyrMet: 1.295 ± 0.681
2.915TyrAsn: 2.915 ± 0.799
0.648TyrPro: 0.648 ± 0.394
2.267TyrGln: 2.267 ± 1.067
3.238TyrArg: 3.238 ± 0.939
1.943TyrSer: 1.943 ± 0.804
1.295TyrThr: 1.295 ± 0.674
1.295TyrVal: 1.295 ± 0.57
1.295TyrTrp: 1.295 ± 0.578
3.238TyrTyr: 3.238 ± 1.339
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 19 proteins (3089 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
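A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure: the function names, the toy sequences, the fixed seed, and the use of the standard deviation across replicates as the reported error are all assumptions. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is taken as the error.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not the Javan218 proteome.
toy = ["MKKLLE", "MKEL", "AKKA", "LLKK"]
err = bootstrap_error(toy, "KK")
print(f"KK: {pair_per_mille(toy, 'KK'):.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins; with only 19 proteins, this helps explain why many errors in the table are large relative to the values.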

You can download the dipeptide statistics above (along with other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski