Amino acid dipepetide frequency for Freshwater phage uvFW-CGR-AMDFOS-S50-C341

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.919AlaAla: 16.919 ± 3.27
1.025AlaCys: 1.025 ± 0.482
3.332AlaAsp: 3.332 ± 0.879
5.896AlaGlu: 5.896 ± 1.26
5.896AlaPhe: 5.896 ± 1.607
7.69AlaGly: 7.69 ± 1.583
1.025AlaHis: 1.025 ± 0.581
6.665AlaIle: 6.665 ± 1.455
3.076AlaLys: 3.076 ± 0.825
8.459AlaLeu: 8.459 ± 2.42
5.383AlaMet: 5.383 ± 1.288
6.409AlaAsn: 6.409 ± 1.4
4.102AlaPro: 4.102 ± 1.32
4.102AlaGln: 4.102 ± 1.45
1.538AlaArg: 1.538 ± 0.71
6.921AlaSer: 6.921 ± 1.442
9.997AlaThr: 9.997 ± 1.558
11.023AlaVal: 11.023 ± 1.704
1.794AlaTrp: 1.794 ± 0.795
3.332AlaTyr: 3.332 ± 0.794
0.0AlaXaa: 0.0 ± 0.0
Cys
1.538CysAla: 1.538 ± 0.601
0.256CysCys: 0.256 ± 0.298
0.513CysAsp: 0.513 ± 0.374
0.769CysGlu: 0.769 ± 0.404
0.256CysPhe: 0.256 ± 0.248
1.025CysGly: 1.025 ± 0.763
0.256CysHis: 0.256 ± 0.278
0.769CysIle: 0.769 ± 0.595
0.513CysLys: 0.513 ± 0.369
0.769CysLeu: 0.769 ± 0.36
0.256CysMet: 0.256 ± 0.205
0.256CysAsn: 0.256 ± 0.248
0.0CysPro: 0.0 ± 0.0
0.513CysGln: 0.513 ± 0.384
0.0CysArg: 0.0 ± 0.0
0.513CysSer: 0.513 ± 0.339
0.513CysThr: 0.513 ± 0.4
1.025CysVal: 1.025 ± 0.536
0.0CysTrp: 0.0 ± 0.0
0.769CysTyr: 0.769 ± 0.496
0.0CysXaa: 0.0 ± 0.0
Asp
3.845AspAla: 3.845 ± 1.123
0.513AspCys: 0.513 ± 0.384
2.82AspAsp: 2.82 ± 0.591
2.307AspGlu: 2.307 ± 0.75
1.538AspPhe: 1.538 ± 0.513
2.563AspGly: 2.563 ± 0.936
0.769AspHis: 0.769 ± 0.436
3.589AspIle: 3.589 ± 1.238
2.307AspLys: 2.307 ± 0.777
6.409AspLeu: 6.409 ± 1.133
1.538AspMet: 1.538 ± 0.829
2.051AspAsn: 2.051 ± 0.718
3.332AspPro: 3.332 ± 1.126
1.538AspGln: 1.538 ± 0.597
2.82AspArg: 2.82 ± 1.01
2.563AspSer: 2.563 ± 0.81
2.82AspThr: 2.82 ± 0.737
4.102AspVal: 4.102 ± 0.804
1.025AspTrp: 1.025 ± 0.468
2.563AspTyr: 2.563 ± 1.168
0.0AspXaa: 0.0 ± 0.0
Glu
5.383GluAla: 5.383 ± 1.249
0.0GluCys: 0.0 ± 0.0
1.794GluAsp: 1.794 ± 0.657
4.614GluGlu: 4.614 ± 1.603
1.538GluPhe: 1.538 ± 0.505
2.82GluGly: 2.82 ± 0.637
0.256GluHis: 0.256 ± 0.278
3.076GluIle: 3.076 ± 0.697
1.538GluLys: 1.538 ± 0.561
4.871GluLeu: 4.871 ± 1.399
1.282GluMet: 1.282 ± 0.668
1.794GluAsn: 1.794 ± 0.491
3.076GluPro: 3.076 ± 0.992
2.563GluGln: 2.563 ± 0.983
1.794GluArg: 1.794 ± 0.612
2.307GluSer: 2.307 ± 0.686
4.358GluThr: 4.358 ± 1.197
3.589GluVal: 3.589 ± 0.954
0.513GluTrp: 0.513 ± 0.293
1.794GluTyr: 1.794 ± 0.827
0.0GluXaa: 0.0 ± 0.0
Phe
6.665PheAla: 6.665 ± 1.835
0.0PheCys: 0.0 ± 0.0
2.307PheAsp: 2.307 ± 0.743
2.307PheGlu: 2.307 ± 0.684
0.513PhePhe: 0.513 ± 0.371
3.845PheGly: 3.845 ± 0.856
1.025PheHis: 1.025 ± 0.491
1.794PheIle: 1.794 ± 0.549
2.307PheLys: 2.307 ± 0.801
3.589PheLeu: 3.589 ± 1.102
0.769PheMet: 0.769 ± 0.377
3.076PheAsn: 3.076 ± 0.705
1.538PhePro: 1.538 ± 0.844
1.538PheGln: 1.538 ± 0.634
1.282PheArg: 1.282 ± 0.567
2.051PheSer: 2.051 ± 0.681
4.871PheThr: 4.871 ± 1.341
1.538PheVal: 1.538 ± 0.563
0.0PheTrp: 0.0 ± 0.0
0.769PheTyr: 0.769 ± 0.429
0.0PheXaa: 0.0 ± 0.0
Gly
8.716GlyAla: 8.716 ± 1.403
0.0GlyCys: 0.0 ± 0.0
3.076GlyAsp: 3.076 ± 0.718
2.051GlyGlu: 2.051 ± 0.62
3.589GlyPhe: 3.589 ± 1.08
6.665GlyGly: 6.665 ± 1.35
0.513GlyHis: 0.513 ± 0.359
5.896GlyIle: 5.896 ± 1.433
3.589GlyLys: 3.589 ± 1.259
5.896GlyLeu: 5.896 ± 1.095
0.769GlyMet: 0.769 ± 0.549
2.051GlyAsn: 2.051 ± 0.654
3.076GlyPro: 3.076 ± 0.613
3.076GlyGln: 3.076 ± 1.068
3.332GlyArg: 3.332 ± 1.081
5.896GlySer: 5.896 ± 0.967
6.152GlyThr: 6.152 ± 1.563
4.871GlyVal: 4.871 ± 1.491
1.538GlyTrp: 1.538 ± 0.64
2.307GlyTyr: 2.307 ± 0.682
0.0GlyXaa: 0.0 ± 0.0
His
0.769HisAla: 0.769 ± 0.893
0.256HisCys: 0.256 ± 0.278
1.025HisAsp: 1.025 ± 0.486
0.256HisGlu: 0.256 ± 0.234
0.256HisPhe: 0.256 ± 0.278
0.769HisGly: 0.769 ± 0.431
0.256HisHis: 0.256 ± 0.298
0.256HisIle: 0.256 ± 0.234
0.513HisLys: 0.513 ± 0.321
0.256HisLeu: 0.256 ± 0.298
0.513HisMet: 0.513 ± 0.384
0.513HisAsn: 0.513 ± 0.379
1.282HisPro: 1.282 ± 0.505
1.282HisGln: 1.282 ± 0.578
0.513HisArg: 0.513 ± 0.373
0.256HisSer: 0.256 ± 0.225
0.513HisThr: 0.513 ± 0.367
0.256HisVal: 0.256 ± 0.298
0.256HisTrp: 0.256 ± 0.237
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
6.665IleAla: 6.665 ± 1.077
0.769IleCys: 0.769 ± 0.575
3.589IleAsp: 3.589 ± 0.951
1.538IleGlu: 1.538 ± 0.661
3.589IlePhe: 3.589 ± 0.965
4.102IleGly: 4.102 ± 1.009
0.256IleHis: 0.256 ± 0.298
2.563IleIle: 2.563 ± 0.878
2.563IleLys: 2.563 ± 0.917
1.538IleLeu: 1.538 ± 0.725
1.538IleMet: 1.538 ± 0.47
1.794IleAsn: 1.794 ± 0.7
3.332IlePro: 3.332 ± 1.216
2.051IleGln: 2.051 ± 0.792
2.307IleArg: 2.307 ± 0.644
3.589IleSer: 3.589 ± 1.024
5.127IleThr: 5.127 ± 0.949
3.589IleVal: 3.589 ± 0.902
1.025IleTrp: 1.025 ± 0.453
2.563IleTyr: 2.563 ± 0.521
0.0IleXaa: 0.0 ± 0.0
Lys
5.127LysAla: 5.127 ± 1.864
0.256LysCys: 0.256 ± 0.22
3.076LysAsp: 3.076 ± 1.157
2.82LysGlu: 2.82 ± 0.771
1.538LysPhe: 1.538 ± 0.61
3.332LysGly: 3.332 ± 1.117
0.0LysHis: 0.0 ± 0.0
1.538LysIle: 1.538 ± 0.498
1.538LysLys: 1.538 ± 0.692
3.076LysLeu: 3.076 ± 1.596
1.282LysMet: 1.282 ± 0.588
2.307LysAsn: 2.307 ± 0.883
0.769LysPro: 0.769 ± 0.462
1.794LysGln: 1.794 ± 0.754
2.051LysArg: 2.051 ± 0.86
3.076LysSer: 3.076 ± 0.699
2.307LysThr: 2.307 ± 0.629
3.332LysVal: 3.332 ± 1.097
1.025LysTrp: 1.025 ± 0.444
1.282LysTyr: 1.282 ± 0.57
0.0LysXaa: 0.0 ± 0.0
Leu
11.279LeuAla: 11.279 ± 1.772
0.769LeuCys: 0.769 ± 0.467
4.102LeuAsp: 4.102 ± 1.063
2.563LeuGlu: 2.563 ± 1.019
4.102LeuPhe: 4.102 ± 1.113
6.921LeuGly: 6.921 ± 1.082
0.769LeuHis: 0.769 ± 0.462
3.589LeuIle: 3.589 ± 0.883
3.332LeuLys: 3.332 ± 0.854
5.127LeuLeu: 5.127 ± 1.086
1.282LeuMet: 1.282 ± 0.48
4.102LeuAsn: 4.102 ± 0.788
3.076LeuPro: 3.076 ± 0.699
2.307LeuGln: 2.307 ± 0.627
3.589LeuArg: 3.589 ± 0.741
6.409LeuSer: 6.409 ± 1.035
7.434LeuThr: 7.434 ± 1.949
3.332LeuVal: 3.332 ± 1.28
0.769LeuTrp: 0.769 ± 0.383
3.332LeuTyr: 3.332 ± 1.108
0.0LeuXaa: 0.0 ± 0.0
Met
3.845MetAla: 3.845 ± 0.915
1.282MetCys: 1.282 ± 0.644
0.513MetAsp: 0.513 ± 0.303
1.282MetGlu: 1.282 ± 0.575
1.025MetPhe: 1.025 ± 0.463
2.307MetGly: 2.307 ± 1.058
0.513MetHis: 0.513 ± 0.355
1.538MetIle: 1.538 ± 0.544
2.563MetLys: 2.563 ± 0.989
1.538MetLeu: 1.538 ± 0.574
1.538MetMet: 1.538 ± 0.665
1.282MetAsn: 1.282 ± 0.415
1.538MetPro: 1.538 ± 0.415
1.282MetGln: 1.282 ± 0.583
1.025MetArg: 1.025 ± 0.489
1.794MetSer: 1.794 ± 0.572
3.076MetThr: 3.076 ± 1.098
1.025MetVal: 1.025 ± 0.423
0.0MetTrp: 0.0 ± 0.0
1.025MetTyr: 1.025 ± 0.47
0.0MetXaa: 0.0 ± 0.0
Asn
3.589AsnAla: 3.589 ± 1.31
0.513AsnCys: 0.513 ± 0.369
1.794AsnAsp: 1.794 ± 0.775
1.538AsnGlu: 1.538 ± 0.623
2.051AsnPhe: 2.051 ± 0.567
3.589AsnGly: 3.589 ± 0.893
0.256AsnHis: 0.256 ± 0.265
2.307AsnIle: 2.307 ± 0.713
1.794AsnLys: 1.794 ± 0.718
3.589AsnLeu: 3.589 ± 0.965
1.794AsnMet: 1.794 ± 0.854
2.563AsnAsn: 2.563 ± 0.608
3.845AsnPro: 3.845 ± 0.795
2.307AsnGln: 2.307 ± 0.862
2.051AsnArg: 2.051 ± 0.588
2.563AsnSer: 2.563 ± 0.732
4.614AsnThr: 4.614 ± 1.209
2.82AsnVal: 2.82 ± 0.66
0.256AsnTrp: 0.256 ± 0.269
3.076AsnTyr: 3.076 ± 0.953
0.0AsnXaa: 0.0 ± 0.0
Pro
5.64ProAla: 5.64 ± 1.161
0.256ProCys: 0.256 ± 0.258
3.589ProAsp: 3.589 ± 1.192
3.845ProGlu: 3.845 ± 0.807
1.282ProPhe: 1.282 ± 0.475
3.845ProGly: 3.845 ± 1.085
0.0ProHis: 0.0 ± 0.0
2.82ProIle: 2.82 ± 0.826
2.307ProLys: 2.307 ± 0.681
3.845ProLeu: 3.845 ± 0.766
2.051ProMet: 2.051 ± 0.724
2.051ProAsn: 2.051 ± 0.608
1.282ProPro: 1.282 ± 0.576
2.307ProGln: 2.307 ± 0.698
2.563ProArg: 2.563 ± 0.758
2.051ProSer: 2.051 ± 0.566
4.358ProThr: 4.358 ± 1.377
4.614ProVal: 4.614 ± 1.259
0.769ProTrp: 0.769 ± 0.409
1.282ProTyr: 1.282 ± 0.405
0.0ProXaa: 0.0 ± 0.0
Gln
4.358GlnAla: 4.358 ± 0.886
0.769GlnCys: 0.769 ± 0.493
1.794GlnAsp: 1.794 ± 0.614
2.82GlnGlu: 2.82 ± 0.759
2.307GlnPhe: 2.307 ± 0.944
3.076GlnGly: 3.076 ± 0.821
0.769GlnHis: 0.769 ± 0.467
2.307GlnIle: 2.307 ± 0.738
2.307GlnLys: 2.307 ± 0.83
4.102GlnLeu: 4.102 ± 0.775
1.282GlnMet: 1.282 ± 0.548
1.025GlnAsn: 1.025 ± 0.491
2.563GlnPro: 2.563 ± 0.834
1.794GlnGln: 1.794 ± 0.616
1.794GlnArg: 1.794 ± 0.685
1.025GlnSer: 1.025 ± 0.534
3.332GlnThr: 3.332 ± 0.691
3.845GlnVal: 3.845 ± 0.948
0.513GlnTrp: 0.513 ± 0.399
1.025GlnTyr: 1.025 ± 0.409
0.0GlnXaa: 0.0 ± 0.0
Arg
3.332ArgAla: 3.332 ± 0.815
1.025ArgCys: 1.025 ± 0.523
1.282ArgAsp: 1.282 ± 0.624
1.282ArgGlu: 1.282 ± 0.402
2.051ArgPhe: 2.051 ± 0.747
2.82ArgGly: 2.82 ± 0.685
0.0ArgHis: 0.0 ± 0.0
1.282ArgIle: 1.282 ± 0.571
2.051ArgLys: 2.051 ± 0.814
4.358ArgLeu: 4.358 ± 0.97
1.025ArgMet: 1.025 ± 0.668
2.563ArgAsn: 2.563 ± 0.766
3.845ArgPro: 3.845 ± 0.979
2.051ArgGln: 2.051 ± 0.707
2.563ArgArg: 2.563 ± 1.0
1.025ArgSer: 1.025 ± 0.607
3.076ArgThr: 3.076 ± 0.767
2.051ArgVal: 2.051 ± 0.795
0.769ArgTrp: 0.769 ± 0.355
1.025ArgTyr: 1.025 ± 0.478
0.0ArgXaa: 0.0 ± 0.0
Ser
6.152SerAla: 6.152 ± 1.134
0.0SerCys: 0.0 ± 0.0
4.102SerAsp: 4.102 ± 0.937
2.82SerGlu: 2.82 ± 0.973
2.051SerPhe: 2.051 ± 0.713
5.383SerGly: 5.383 ± 1.773
0.256SerHis: 0.256 ± 0.278
3.076SerIle: 3.076 ± 1.316
2.051SerLys: 2.051 ± 0.932
4.102SerLeu: 4.102 ± 0.907
2.307SerMet: 2.307 ± 0.612
2.82SerAsn: 2.82 ± 1.058
3.332SerPro: 3.332 ± 0.997
2.82SerGln: 2.82 ± 0.94
2.82SerArg: 2.82 ± 1.018
3.076SerSer: 3.076 ± 1.114
5.64SerThr: 5.64 ± 1.018
4.102SerVal: 4.102 ± 1.129
1.282SerTrp: 1.282 ± 0.462
1.794SerTyr: 1.794 ± 0.61
0.0SerXaa: 0.0 ± 0.0
Thr
7.434ThrAla: 7.434 ± 1.054
1.282ThrCys: 1.282 ± 0.641
3.589ThrAsp: 3.589 ± 0.954
4.358ThrGlu: 4.358 ± 1.278
3.845ThrPhe: 3.845 ± 1.1
5.896ThrGly: 5.896 ± 1.071
1.794ThrHis: 1.794 ± 0.792
5.127ThrIle: 5.127 ± 1.129
3.332ThrLys: 3.332 ± 1.077
5.896ThrLeu: 5.896 ± 1.399
1.794ThrMet: 1.794 ± 0.606
3.845ThrAsn: 3.845 ± 0.869
4.102ThrPro: 4.102 ± 1.083
4.102ThrGln: 4.102 ± 0.826
3.076ThrArg: 3.076 ± 0.799
6.409ThrSer: 6.409 ± 0.888
9.997ThrThr: 9.997 ± 2.222
6.665ThrVal: 6.665 ± 1.793
1.282ThrTrp: 1.282 ± 0.556
3.332ThrTyr: 3.332 ± 1.185
0.0ThrXaa: 0.0 ± 0.0
Val
8.203ValAla: 8.203 ± 1.585
0.513ValCys: 0.513 ± 0.43
5.64ValAsp: 5.64 ± 1.64
3.845ValGlu: 3.845 ± 1.105
3.076ValPhe: 3.076 ± 0.912
4.614ValGly: 4.614 ± 0.737
0.769ValHis: 0.769 ± 0.893
3.589ValIle: 3.589 ± 0.856
2.307ValLys: 2.307 ± 0.813
3.845ValLeu: 3.845 ± 0.919
1.794ValMet: 1.794 ± 0.746
3.332ValAsn: 3.332 ± 1.192
4.614ValPro: 4.614 ± 1.201
2.563ValGln: 2.563 ± 0.635
1.538ValArg: 1.538 ± 0.773
6.665ValSer: 6.665 ± 1.613
4.614ValThr: 4.614 ± 0.747
5.383ValVal: 5.383 ± 1.217
1.538ValTrp: 1.538 ± 0.777
1.794ValTyr: 1.794 ± 0.765
0.0ValXaa: 0.0 ± 0.0
Trp
1.538TrpAla: 1.538 ± 0.557
0.0TrpCys: 0.0 ± 0.0
0.513TrpAsp: 0.513 ± 0.305
0.769TrpGlu: 0.769 ± 0.431
0.256TrpPhe: 0.256 ± 0.248
0.513TrpGly: 0.513 ± 0.374
0.0TrpHis: 0.0 ± 0.0
0.513TrpIle: 0.513 ± 0.314
1.025TrpLys: 1.025 ± 0.446
2.051TrpLeu: 2.051 ± 0.71
0.256TrpMet: 0.256 ± 0.221
0.769TrpAsn: 0.769 ± 0.322
0.769TrpPro: 0.769 ± 0.44
1.282TrpGln: 1.282 ± 0.529
1.025TrpArg: 1.025 ± 0.535
0.769TrpSer: 0.769 ± 0.398
1.282TrpThr: 1.282 ± 0.485
1.282TrpVal: 1.282 ± 0.412
0.256TrpTrp: 0.256 ± 0.248
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.589TyrAla: 3.589 ± 1.043
1.025TyrCys: 1.025 ± 0.58
2.563TyrAsp: 2.563 ± 0.653
1.282TyrGlu: 1.282 ± 0.62
1.025TyrPhe: 1.025 ± 0.531
1.282TyrGly: 1.282 ± 0.611
0.513TyrHis: 0.513 ± 0.313
1.794TyrIle: 1.794 ± 0.708
0.513TyrLys: 0.513 ± 0.446
4.871TyrLeu: 4.871 ± 1.083
1.282TyrMet: 1.282 ± 0.534
2.307TyrAsn: 2.307 ± 0.87
1.282TyrPro: 1.282 ± 0.703
1.538TyrGln: 1.538 ± 0.516
1.794TyrArg: 1.794 ± 0.628
1.025TyrSer: 1.025 ± 0.545
3.332TyrThr: 3.332 ± 1.012
1.794TyrVal: 1.794 ± 0.631
0.256TyrTrp: 0.256 ± 0.237
1.025TyrTyr: 1.025 ± 0.623
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 22 proteins (3902 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski