Amino acid dipepetide frequency for Streptococcus satellite phage Javan66

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.847AlaAla: 2.847 ± 0.827
0.712AlaCys: 0.712 ± 0.425
5.338AlaAsp: 5.338 ± 1.595
5.338AlaGlu: 5.338 ± 1.204
2.135AlaPhe: 2.135 ± 0.722
4.27AlaGly: 4.27 ± 1.403
0.356AlaHis: 0.356 ± 0.317
6.406AlaIle: 6.406 ± 1.255
6.762AlaLys: 6.762 ± 1.411
9.609AlaLeu: 9.609 ± 2.16
1.779AlaMet: 1.779 ± 0.574
2.491AlaAsn: 2.491 ± 0.969
1.779AlaPro: 1.779 ± 0.727
2.491AlaGln: 2.491 ± 0.858
3.203AlaArg: 3.203 ± 0.864
3.203AlaSer: 3.203 ± 1.02
3.559AlaThr: 3.559 ± 0.617
2.491AlaVal: 2.491 ± 0.813
0.356AlaTrp: 0.356 ± 0.372
2.847AlaTyr: 2.847 ± 1.123
0.0AlaXaa: 0.0 ± 0.0
Cys
1.068CysAla: 1.068 ± 0.496
0.0CysCys: 0.0 ± 0.0
0.356CysAsp: 0.356 ± 0.391
0.356CysGlu: 0.356 ± 0.352
0.0CysPhe: 0.0 ± 0.0
0.712CysGly: 0.712 ± 0.519
0.356CysHis: 0.356 ± 0.352
0.712CysIle: 0.712 ± 0.51
1.068CysLys: 1.068 ± 0.539
0.356CysLeu: 0.356 ± 0.352
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.356CysPro: 0.356 ± 0.324
0.356CysGln: 0.356 ± 0.384
0.356CysArg: 0.356 ± 0.352
0.712CysSer: 0.712 ± 0.422
0.356CysThr: 0.356 ± 0.372
0.356CysVal: 0.356 ± 0.324
0.0CysTrp: 0.0 ± 0.0
0.356CysTyr: 0.356 ± 0.391
0.0CysXaa: 0.0 ± 0.0
Asp
2.135AspAla: 2.135 ± 0.624
0.712AspCys: 0.712 ± 0.52
2.135AspAsp: 2.135 ± 0.66
3.915AspGlu: 3.915 ± 0.804
2.135AspPhe: 2.135 ± 0.862
2.491AspGly: 2.491 ± 1.083
0.712AspHis: 0.712 ± 0.445
6.406AspIle: 6.406 ± 1.273
7.117AspLys: 7.117 ± 1.138
4.982AspLeu: 4.982 ± 1.216
2.491AspMet: 2.491 ± 0.946
4.27AspAsn: 4.27 ± 1.345
0.712AspPro: 0.712 ± 0.457
1.068AspGln: 1.068 ± 0.454
1.068AspArg: 1.068 ± 0.479
2.135AspSer: 2.135 ± 0.904
2.491AspThr: 2.491 ± 1.009
2.491AspVal: 2.491 ± 1.113
0.712AspTrp: 0.712 ± 0.435
2.491AspTyr: 2.491 ± 1.097
0.0AspXaa: 0.0 ± 0.0
Glu
6.406GluAla: 6.406 ± 1.514
0.712GluCys: 0.712 ± 0.432
2.847GluAsp: 2.847 ± 1.169
5.338GluGlu: 5.338 ± 1.619
2.847GluPhe: 2.847 ± 1.078
2.135GluGly: 2.135 ± 0.905
0.356GluHis: 0.356 ± 0.372
7.117GluIle: 7.117 ± 2.027
9.609GluLys: 9.609 ± 1.167
10.32GluLeu: 10.32 ± 1.682
3.559GluMet: 3.559 ± 1.011
2.847GluAsn: 2.847 ± 0.881
1.068GluPro: 1.068 ± 0.507
6.406GluGln: 6.406 ± 1.957
6.762GluArg: 6.762 ± 2.337
3.203GluSer: 3.203 ± 0.939
4.27GluThr: 4.27 ± 0.771
2.135GluVal: 2.135 ± 0.691
1.068GluTrp: 1.068 ± 0.529
3.203GluTyr: 3.203 ± 0.957
0.0GluXaa: 0.0 ± 0.0
Phe
2.491PheAla: 2.491 ± 0.681
1.068PheCys: 1.068 ± 0.582
2.847PheAsp: 2.847 ± 1.054
3.915PheGlu: 3.915 ± 1.055
1.779PhePhe: 1.779 ± 0.557
2.847PheGly: 2.847 ± 0.855
1.068PheHis: 1.068 ± 0.463
3.203PheIle: 3.203 ± 0.888
2.847PheLys: 2.847 ± 0.781
2.847PheLeu: 2.847 ± 1.154
0.356PheMet: 0.356 ± 0.328
1.068PheAsn: 1.068 ± 0.591
1.068PhePro: 1.068 ± 0.538
1.423PheGln: 1.423 ± 0.62
2.491PheArg: 2.491 ± 0.708
2.135PheSer: 2.135 ± 1.012
1.779PheThr: 1.779 ± 0.702
1.423PheVal: 1.423 ± 0.545
0.356PheTrp: 0.356 ± 0.348
1.423PheTyr: 1.423 ± 0.622
0.0PheXaa: 0.0 ± 0.0
Gly
2.491GlyAla: 2.491 ± 0.607
0.0GlyCys: 0.0 ± 0.0
0.712GlyAsp: 0.712 ± 0.447
2.491GlyGlu: 2.491 ± 1.014
1.779GlyPhe: 1.779 ± 0.781
2.847GlyGly: 2.847 ± 1.112
0.712GlyHis: 0.712 ± 0.357
4.27GlyIle: 4.27 ± 1.292
3.915GlyLys: 3.915 ± 0.864
8.541GlyLeu: 8.541 ± 1.753
1.779GlyMet: 1.779 ± 0.792
1.423GlyAsn: 1.423 ± 0.758
0.0GlyPro: 0.0 ± 0.0
1.779GlyGln: 1.779 ± 0.832
2.135GlyArg: 2.135 ± 0.579
3.203GlySer: 3.203 ± 0.85
1.423GlyThr: 1.423 ± 0.507
2.847GlyVal: 2.847 ± 0.944
1.068GlyTrp: 1.068 ± 0.617
3.203GlyTyr: 3.203 ± 0.985
0.0GlyXaa: 0.0 ± 0.0
His
1.068HisAla: 1.068 ± 0.883
0.0HisCys: 0.0 ± 0.0
0.356HisAsp: 0.356 ± 0.306
0.356HisGlu: 0.356 ± 0.43
0.356HisPhe: 0.356 ± 0.43
2.135HisGly: 2.135 ± 0.829
1.779HisHis: 1.779 ± 1.168
1.423HisIle: 1.423 ± 0.664
1.068HisLys: 1.068 ± 0.697
3.203HisLeu: 3.203 ± 0.918
0.0HisMet: 0.0 ± 0.0
1.068HisAsn: 1.068 ± 0.577
0.712HisPro: 0.712 ± 0.452
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
1.068HisThr: 1.068 ± 0.427
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.712HisTyr: 0.712 ± 0.434
0.0HisXaa: 0.0 ± 0.0
Ile
4.27IleAla: 4.27 ± 1.023
1.423IleCys: 1.423 ± 0.751
7.117IleAsp: 7.117 ± 1.256
5.338IleGlu: 5.338 ± 1.597
2.847IlePhe: 2.847 ± 0.93
1.779IleGly: 1.779 ± 0.754
1.423IleHis: 1.423 ± 0.664
4.27IleIle: 4.27 ± 0.965
4.27IleLys: 4.27 ± 0.992
8.897IleLeu: 8.897 ± 1.879
0.712IleMet: 0.712 ± 0.529
2.135IleAsn: 2.135 ± 0.708
2.847IlePro: 2.847 ± 1.017
3.559IleGln: 3.559 ± 1.174
6.05IleArg: 6.05 ± 1.135
4.626IleSer: 4.626 ± 1.321
5.694IleThr: 5.694 ± 1.52
1.779IleVal: 1.779 ± 0.761
1.779IleTrp: 1.779 ± 0.672
2.847IleTyr: 2.847 ± 0.682
0.0IleXaa: 0.0 ± 0.0
Lys
10.32LysAla: 10.32 ± 2.042
0.356LysCys: 0.356 ± 0.348
3.203LysAsp: 3.203 ± 1.507
11.388LysGlu: 11.388 ± 1.691
3.559LysPhe: 3.559 ± 0.761
3.203LysGly: 3.203 ± 0.913
1.068LysHis: 1.068 ± 0.741
4.626LysIle: 4.626 ± 0.94
9.253LysLys: 9.253 ± 2.057
7.117LysLeu: 7.117 ± 1.257
2.135LysMet: 2.135 ± 0.862
7.473LysAsn: 7.473 ± 1.41
3.915LysPro: 3.915 ± 0.826
5.338LysGln: 5.338 ± 1.109
3.559LysArg: 3.559 ± 1.049
3.559LysSer: 3.559 ± 1.123
7.829LysThr: 7.829 ± 1.344
6.05LysVal: 6.05 ± 1.56
0.356LysTrp: 0.356 ± 0.372
1.423LysTyr: 1.423 ± 0.845
0.0LysXaa: 0.0 ± 0.0
Leu
9.253LeuAla: 9.253 ± 1.748
0.712LeuCys: 0.712 ± 0.523
8.185LeuAsp: 8.185 ± 1.26
12.811LeuGlu: 12.811 ± 2.269
4.982LeuPhe: 4.982 ± 1.395
6.406LeuGly: 6.406 ± 2.025
0.712LeuHis: 0.712 ± 0.46
4.982LeuIle: 4.982 ± 1.259
13.167LeuLys: 13.167 ± 2.455
13.879LeuLeu: 13.879 ± 2.237
3.203LeuMet: 3.203 ± 0.826
3.915LeuAsn: 3.915 ± 1.09
2.491LeuPro: 2.491 ± 1.188
5.694LeuGln: 5.694 ± 1.062
3.203LeuArg: 3.203 ± 1.049
5.338LeuSer: 5.338 ± 1.328
8.541LeuThr: 8.541 ± 1.29
2.847LeuVal: 2.847 ± 0.928
0.712LeuTrp: 0.712 ± 0.705
3.559LeuTyr: 3.559 ± 1.287
0.0LeuXaa: 0.0 ± 0.0
Met
3.203MetAla: 3.203 ± 1.009
0.0MetCys: 0.0 ± 0.0
1.423MetAsp: 1.423 ± 0.733
2.491MetGlu: 2.491 ± 0.701
0.0MetPhe: 0.0 ± 0.0
0.356MetGly: 0.356 ± 0.305
0.0MetHis: 0.0 ± 0.0
1.779MetIle: 1.779 ± 0.645
1.423MetLys: 1.423 ± 0.854
2.491MetLeu: 2.491 ± 1.079
0.0MetMet: 0.0 ± 0.0
1.779MetAsn: 1.779 ± 0.914
0.712MetPro: 0.712 ± 0.532
0.356MetGln: 0.356 ± 0.305
1.423MetArg: 1.423 ± 0.487
0.712MetSer: 0.712 ± 0.486
2.491MetThr: 2.491 ± 0.751
1.068MetVal: 1.068 ± 0.678
0.356MetTrp: 0.356 ± 0.391
1.068MetTyr: 1.068 ± 0.589
0.0MetXaa: 0.0 ± 0.0
Asn
2.491AsnAla: 2.491 ± 0.841
0.0AsnCys: 0.0 ± 0.0
1.779AsnAsp: 1.779 ± 0.798
3.203AsnGlu: 3.203 ± 1.19
2.491AsnPhe: 2.491 ± 0.859
2.491AsnGly: 2.491 ± 0.905
0.712AsnHis: 0.712 ± 0.452
2.491AsnIle: 2.491 ± 0.797
5.338AsnLys: 5.338 ± 0.767
4.27AsnLeu: 4.27 ± 0.881
0.712AsnMet: 0.712 ± 0.357
3.203AsnAsn: 3.203 ± 0.857
0.712AsnPro: 0.712 ± 0.44
2.491AsnGln: 2.491 ± 0.817
2.491AsnArg: 2.491 ± 0.929
2.491AsnSer: 2.491 ± 1.016
2.135AsnThr: 2.135 ± 1.115
1.423AsnVal: 1.423 ± 0.906
0.356AsnTrp: 0.356 ± 0.384
2.135AsnTyr: 2.135 ± 0.76
0.0AsnXaa: 0.0 ± 0.0
Pro
2.135ProAla: 2.135 ± 0.732
0.0ProCys: 0.0 ± 0.0
1.779ProAsp: 1.779 ± 0.545
1.779ProGlu: 1.779 ± 0.918
0.712ProPhe: 0.712 ± 0.471
0.356ProGly: 0.356 ± 0.348
0.356ProHis: 0.356 ± 0.306
1.068ProIle: 1.068 ± 0.617
2.491ProLys: 2.491 ± 1.082
2.491ProLeu: 2.491 ± 1.081
0.0ProMet: 0.0 ± 0.0
0.0ProAsn: 0.0 ± 0.0
0.356ProPro: 0.356 ± 0.429
2.847ProGln: 2.847 ± 0.953
2.135ProArg: 2.135 ± 0.819
1.423ProSer: 1.423 ± 0.594
1.423ProThr: 1.423 ± 0.574
0.712ProVal: 0.712 ± 0.44
0.356ProTrp: 0.356 ± 0.329
1.423ProTyr: 1.423 ± 0.603
0.0ProXaa: 0.0 ± 0.0
Gln
3.559GlnAla: 3.559 ± 1.36
0.356GlnCys: 0.356 ± 0.352
2.135GlnAsp: 2.135 ± 0.878
3.915GlnGlu: 3.915 ± 0.895
1.068GlnPhe: 1.068 ± 0.673
2.847GlnGly: 2.847 ± 0.747
1.068GlnHis: 1.068 ± 0.687
3.915GlnIle: 3.915 ± 1.276
3.915GlnLys: 3.915 ± 0.936
7.473GlnLeu: 7.473 ± 1.566
0.356GlnMet: 0.356 ± 0.305
2.135GlnAsn: 2.135 ± 0.797
2.135GlnPro: 2.135 ± 1.193
3.915GlnGln: 3.915 ± 0.883
2.847GlnArg: 2.847 ± 0.598
2.491GlnSer: 2.491 ± 0.751
4.27GlnThr: 4.27 ± 1.395
3.915GlnVal: 3.915 ± 1.039
0.356GlnTrp: 0.356 ± 0.305
0.712GlnTyr: 0.712 ± 0.589
0.0GlnXaa: 0.0 ± 0.0
Arg
2.847ArgAla: 2.847 ± 0.888
0.356ArgCys: 0.356 ± 0.294
2.135ArgAsp: 2.135 ± 0.74
5.694ArgGlu: 5.694 ± 1.613
1.779ArgPhe: 1.779 ± 0.669
3.203ArgGly: 3.203 ± 1.18
0.356ArgHis: 0.356 ± 0.294
3.203ArgIle: 3.203 ± 0.956
3.559ArgLys: 3.559 ± 0.892
4.626ArgLeu: 4.626 ± 1.019
0.356ArgMet: 0.356 ± 0.373
1.779ArgAsn: 1.779 ± 0.685
1.068ArgPro: 1.068 ± 0.718
4.982ArgGln: 4.982 ± 1.35
3.559ArgArg: 3.559 ± 0.996
2.491ArgSer: 2.491 ± 1.054
2.135ArgThr: 2.135 ± 0.987
2.135ArgVal: 2.135 ± 0.837
0.712ArgTrp: 0.712 ± 0.497
2.491ArgTyr: 2.491 ± 0.728
0.0ArgXaa: 0.0 ± 0.0
Ser
2.847SerAla: 2.847 ± 0.916
0.0SerCys: 0.0 ± 0.0
4.982SerAsp: 4.982 ± 0.84
4.626SerGlu: 4.626 ± 1.557
0.712SerPhe: 0.712 ± 0.648
1.068SerGly: 1.068 ± 0.503
1.068SerHis: 1.068 ± 0.541
5.694SerIle: 5.694 ± 1.43
4.27SerLys: 4.27 ± 1.368
4.626SerLeu: 4.626 ± 0.804
0.712SerMet: 0.712 ± 0.456
1.779SerAsn: 1.779 ± 0.718
0.356SerPro: 0.356 ± 0.305
1.423SerGln: 1.423 ± 0.751
1.779SerArg: 1.779 ± 0.697
1.779SerSer: 1.779 ± 0.681
3.203SerThr: 3.203 ± 1.202
4.27SerVal: 4.27 ± 1.033
0.712SerTrp: 0.712 ± 0.43
3.915SerTyr: 3.915 ± 1.085
0.0SerXaa: 0.0 ± 0.0
Thr
2.847ThrAla: 2.847 ± 1.037
0.356ThrCys: 0.356 ± 0.348
2.847ThrAsp: 2.847 ± 0.835
3.559ThrGlu: 3.559 ± 1.005
2.135ThrPhe: 2.135 ± 0.532
4.626ThrGly: 4.626 ± 1.46
1.068ThrHis: 1.068 ± 0.644
5.694ThrIle: 5.694 ± 1.23
6.406ThrLys: 6.406 ± 1.439
7.473ThrLeu: 7.473 ± 1.1
2.491ThrMet: 2.491 ± 1.382
2.135ThrAsn: 2.135 ± 0.817
1.423ThrPro: 1.423 ± 0.646
4.27ThrGln: 4.27 ± 1.433
2.491ThrArg: 2.491 ± 0.893
2.491ThrSer: 2.491 ± 1.129
3.559ThrThr: 3.559 ± 0.979
5.338ThrVal: 5.338 ± 1.587
0.356ThrTrp: 0.356 ± 0.305
2.491ThrTyr: 2.491 ± 0.789
0.0ThrXaa: 0.0 ± 0.0
Val
2.135ValAla: 2.135 ± 0.552
0.356ValCys: 0.356 ± 0.324
1.423ValAsp: 1.423 ± 0.705
2.491ValGlu: 2.491 ± 0.989
2.847ValPhe: 2.847 ± 1.317
1.423ValGly: 1.423 ± 0.756
0.356ValHis: 0.356 ± 0.417
3.203ValIle: 3.203 ± 1.055
4.982ValLys: 4.982 ± 1.69
4.27ValLeu: 4.27 ± 1.154
0.356ValMet: 0.356 ± 0.372
2.135ValAsn: 2.135 ± 0.82
1.068ValPro: 1.068 ± 0.533
1.423ValGln: 1.423 ± 0.529
1.779ValArg: 1.779 ± 0.764
3.559ValSer: 3.559 ± 0.95
4.982ValThr: 4.982 ± 1.245
2.491ValVal: 2.491 ± 0.85
0.356ValTrp: 0.356 ± 0.349
3.559ValTyr: 3.559 ± 0.961
0.0ValXaa: 0.0 ± 0.0
Trp
1.423TrpAla: 1.423 ± 0.746
0.356TrpCys: 0.356 ± 0.352
0.0TrpAsp: 0.0 ± 0.0
1.423TrpGlu: 1.423 ± 0.624
0.356TrpPhe: 0.356 ± 0.352
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.356TrpIle: 0.356 ± 0.384
1.068TrpLys: 1.068 ± 0.529
2.135TrpLeu: 2.135 ± 0.631
0.356TrpMet: 0.356 ± 0.352
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.712TrpGln: 0.712 ± 0.449
0.0TrpArg: 0.0 ± 0.0
0.712TrpSer: 0.712 ± 0.435
0.712TrpThr: 0.712 ± 0.44
0.712TrpVal: 0.712 ± 0.532
0.0TrpTrp: 0.0 ± 0.0
0.356TrpTyr: 0.356 ± 0.352
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.135TyrAla: 2.135 ± 0.685
0.356TyrCys: 0.356 ± 0.352
1.068TyrAsp: 1.068 ± 0.712
1.779TyrGlu: 1.779 ± 0.888
3.915TyrPhe: 3.915 ± 1.263
1.423TyrGly: 1.423 ± 0.529
1.779TyrHis: 1.779 ± 0.702
2.847TyrIle: 2.847 ± 0.879
3.559TyrLys: 3.559 ± 1.387
4.982TyrLeu: 4.982 ± 1.174
1.423TyrMet: 1.423 ± 0.565
1.779TyrAsn: 1.779 ± 0.717
1.068TyrPro: 1.068 ± 0.517
2.847TyrGln: 2.847 ± 1.016
2.135TyrArg: 2.135 ± 0.867
3.559TyrSer: 3.559 ± 0.914
2.135TyrThr: 2.135 ± 0.782
0.712TyrVal: 0.712 ± 0.425
0.712TyrTrp: 0.712 ± 0.453
1.779TyrTyr: 1.779 ± 0.562
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (2811 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski