Amino acid dipeptide frequency for Okra yellow crinkle virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
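As a minimal sketch of how such a frequency could be computed, the function below counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_per_mille`, the use of one-letter codes, and the choice not to count pairs spanning two proteins are assumptions made for illustration; the database's exact procedure is not described here.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: sequences use one-letter amino acid codes, and pairs are
    counted within each protein only (never across protein boundaries).
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real viral proteins: pairs are MK, KK, KA, AK, KK.
toy = ["MKKA", "AKK"]
freqs = dipeptide_per_mille(toy)
print(freqs["KK"])  # 2 of 5 pairs
```

The toy sequences are made up; with real data, the input would be the protein sequences of the proteome.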

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
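The comparison against the uniform expectation can be sketched as follows. The threshold of exactly 1000/400 = 2.5‰ and the helper name `classify` are assumptions for illustration; the cutoff the site actually uses for its red and blue marking is not stated. The three sample values are taken from the table.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille = 0.25 %


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation.

    "over" would correspond to red and "under" to blue (assumed mapping).
    """
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Values copied from the table (per mille).
sample = {"AlaAla": 9.191, "AlaCys": 0.919, "SerSer": 11.949}
labels = {pair: classify(value) for pair, value in sample.items()}
print(labels)
```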

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. For example, 9.191‰ corresponds to a fraction of 0.009191, or about 0.92%.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.191AlaAla: 9.191 ± 3.733
0.919AlaCys: 0.919 ± 0.863
0.919AlaAsp: 0.919 ± 0.674
2.757AlaGlu: 2.757 ± 1.409
0.0AlaPhe: 0.0 ± 0.0
1.838AlaGly: 1.838 ± 1.347
0.919AlaHis: 0.919 ± 0.911
3.676AlaIle: 3.676 ± 1.067
4.596AlaLys: 4.596 ± 0.996
5.515AlaLeu: 5.515 ± 1.381
1.838AlaMet: 1.838 ± 1.267
3.676AlaAsn: 3.676 ± 1.634
1.838AlaPro: 1.838 ± 1.347
2.757AlaGln: 2.757 ± 1.699
6.434AlaArg: 6.434 ± 3.106
4.596AlaSer: 4.596 ± 2.119
5.515AlaThr: 5.515 ± 2.236
1.838AlaVal: 1.838 ± 1.289
1.838AlaTrp: 1.838 ± 1.347
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.919CysAla: 0.919 ± 0.957
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.919CysGlu: 0.919 ± 0.863
0.919CysPhe: 0.919 ± 0.911
1.838CysGly: 1.838 ± 1.013
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.838CysLys: 1.838 ± 1.726
1.838CysLeu: 1.838 ± 1.389
1.838CysMet: 1.838 ± 1.142
1.838CysAsn: 1.838 ± 0.971
2.757CysPro: 2.757 ± 1.92
0.919CysGln: 0.919 ± 0.674
0.919CysArg: 0.919 ± 0.674
2.757CysSer: 2.757 ± 1.852
1.838CysThr: 1.838 ± 0.726
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.757AspAla: 2.757 ± 2.021
0.0AspCys: 0.0 ± 0.0
1.838AspAsp: 1.838 ± 1.013
0.919AspGlu: 0.919 ± 0.863
0.919AspPhe: 0.919 ± 0.863
1.838AspGly: 1.838 ± 1.347
0.919AspHis: 0.919 ± 0.957
1.838AspIle: 1.838 ± 1.123
2.757AspLys: 2.757 ± 1.103
7.353AspLeu: 7.353 ± 2.95
0.0AspMet: 0.0 ± 0.0
1.838AspAsn: 1.838 ± 1.232
1.838AspPro: 1.838 ± 0.971
1.838AspGln: 1.838 ± 0.726
2.757AspArg: 2.757 ± 1.446
6.434AspSer: 6.434 ± 1.941
2.757AspThr: 2.757 ± 1.133
6.434AspVal: 6.434 ± 1.54
1.838AspTrp: 1.838 ± 1.013
0.919AspTyr: 0.919 ± 0.674
0.0AspXaa: 0.0 ± 0.0
Glu
6.434GluAla: 6.434 ± 1.554
0.0GluCys: 0.0 ± 0.0
2.757GluAsp: 2.757 ± 1.409
4.596GluGlu: 4.596 ± 2.767
2.757GluPhe: 2.757 ± 1.409
5.515GluGly: 5.515 ± 1.388
0.0GluHis: 0.0 ± 0.0
0.919GluIle: 0.919 ± 0.911
0.919GluLys: 0.919 ± 0.674
4.596GluLeu: 4.596 ± 1.894
0.0GluMet: 0.0 ± 0.0
4.596GluAsn: 4.596 ± 2.118
4.596GluPro: 4.596 ± 1.636
0.0GluGln: 0.0 ± 0.0
0.0GluArg: 0.0 ± 0.0
2.757GluSer: 2.757 ± 1.92
0.919GluThr: 0.919 ± 0.911
1.838GluVal: 1.838 ± 1.378
2.757GluTrp: 2.757 ± 1.429
0.919GluTyr: 0.919 ± 0.674
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
4.596PheAsp: 4.596 ± 2.177
0.919PheGlu: 0.919 ± 0.674
1.838PhePhe: 1.838 ± 0.726
0.919PheGly: 0.919 ± 0.863
2.757PheHis: 2.757 ± 1.151
1.838PheIle: 1.838 ± 0.997
2.757PheLys: 2.757 ± 0.868
4.596PheLeu: 4.596 ± 2.008
0.919PheMet: 0.919 ± 0.674
3.676PheAsn: 3.676 ± 1.625
1.838PhePro: 1.838 ± 0.971
4.596PheGln: 4.596 ± 1.597
3.676PheArg: 3.676 ± 2.435
3.676PheSer: 3.676 ± 2.025
0.0PheThr: 0.0 ± 0.0
0.919PheVal: 0.919 ± 0.863
0.919PheTrp: 0.919 ± 0.863
1.838PheTyr: 1.838 ± 1.232
0.0PheXaa: 0.0 ± 0.0
Gly
2.757GlyAla: 2.757 ± 2.021
3.676GlyCys: 3.676 ± 1.541
4.596GlyAsp: 4.596 ± 1.793
4.596GlyGlu: 4.596 ± 2.77
1.838GlyPhe: 1.838 ± 1.222
2.757GlyGly: 2.757 ± 1.103
0.919GlyHis: 0.919 ± 0.674
3.676GlyIle: 3.676 ± 1.95
4.596GlyLys: 4.596 ± 1.741
1.838GlyLeu: 1.838 ± 1.232
0.0GlyMet: 0.0 ± 0.0
3.676GlyAsn: 3.676 ± 2.583
3.676GlyPro: 3.676 ± 1.452
2.757GlyGln: 2.757 ± 1.168
2.757GlyArg: 2.757 ± 1.437
3.676GlySer: 3.676 ± 1.942
1.838GlyThr: 1.838 ± 1.232
1.838GlyVal: 1.838 ± 1.822
0.0GlyTrp: 0.0 ± 0.0
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.838HisAla: 1.838 ± 1.142
1.838HisCys: 1.838 ± 1.222
0.0HisAsp: 0.0 ± 0.0
2.757HisGlu: 2.757 ± 1.151
1.838HisPhe: 1.838 ± 1.347
1.838HisGly: 1.838 ± 1.222
1.838HisHis: 1.838 ± 1.232
4.596HisIle: 4.596 ± 2.583
2.757HisLys: 2.757 ± 1.443
1.838HisLeu: 1.838 ± 0.997
0.919HisMet: 0.919 ± 1.204
2.757HisAsn: 2.757 ± 1.437
1.838HisPro: 1.838 ± 0.997
0.919HisGln: 0.919 ± 0.863
2.757HisArg: 2.757 ± 2.031
2.757HisSer: 2.757 ± 1.852
2.757HisThr: 2.757 ± 2.589
2.757HisVal: 2.757 ± 0.868
0.0HisTrp: 0.0 ± 0.0
0.919HisTyr: 0.919 ± 0.674
0.0HisXaa: 0.0 ± 0.0
Ile
0.919IleAla: 0.919 ± 0.957
0.919IleCys: 0.919 ± 0.674
5.515IleAsp: 5.515 ± 2.319
0.0IleGlu: 0.0 ± 0.0
3.676IlePhe: 3.676 ± 1.991
0.919IleGly: 0.919 ± 0.957
1.838IleHis: 1.838 ± 1.822
3.676IleIle: 3.676 ± 1.415
7.353IleLys: 7.353 ± 1.698
0.919IleLeu: 0.919 ± 0.863
0.919IleMet: 0.919 ± 0.957
2.757IleAsn: 2.757 ± 1.396
0.919IlePro: 0.919 ± 0.674
3.676IleGln: 3.676 ± 1.677
7.353IleArg: 7.353 ± 3.318
4.596IleSer: 4.596 ± 2.906
2.757IleThr: 2.757 ± 1.787
1.838IleVal: 1.838 ± 0.726
0.919IleTrp: 0.919 ± 0.911
0.919IleTyr: 0.919 ± 0.911
0.0IleXaa: 0.0 ± 0.0
Lys
4.596LysAla: 4.596 ± 1.926
2.757LysCys: 2.757 ± 1.217
2.757LysAsp: 2.757 ± 2.021
3.676LysGlu: 3.676 ± 1.677
0.919LysPhe: 0.919 ± 0.911
1.838LysGly: 1.838 ± 1.013
0.919LysHis: 0.919 ± 0.674
4.596LysIle: 4.596 ± 2.177
3.676LysLys: 3.676 ± 1.793
0.919LysLeu: 0.919 ± 0.674
0.0LysMet: 0.0 ± 0.0
3.676LysAsn: 3.676 ± 1.984
2.757LysPro: 2.757 ± 0.876
0.919LysGln: 0.919 ± 0.899
4.596LysArg: 4.596 ± 4.315
5.515LysSer: 5.515 ± 1.621
1.838LysThr: 1.838 ± 1.013
3.676LysVal: 3.676 ± 2.464
0.0LysTrp: 0.0 ± 0.0
5.515LysTyr: 5.515 ± 1.381
0.0LysXaa: 0.0 ± 0.0
Leu
1.838LeuAla: 1.838 ± 1.761
1.838LeuCys: 1.838 ± 1.347
1.838LeuAsp: 1.838 ± 1.013
4.596LeuGlu: 4.596 ± 2.325
0.0LeuPhe: 0.0 ± 0.0
8.272LeuGly: 8.272 ± 1.247
3.676LeuHis: 3.676 ± 1.408
4.596LeuIle: 4.596 ± 2.271
8.272LeuLys: 8.272 ± 2.018
3.676LeuLeu: 3.676 ± 1.916
0.0LeuMet: 0.0 ± 0.0
4.596LeuAsn: 4.596 ± 2.487
2.757LeuPro: 2.757 ± 1.133
3.676LeuGln: 3.676 ± 1.634
3.676LeuArg: 3.676 ± 2.794
6.434LeuSer: 6.434 ± 2.383
3.676LeuThr: 3.676 ± 1.111
3.676LeuVal: 3.676 ± 0.915
0.0LeuTrp: 0.0 ± 0.0
3.676LeuTyr: 3.676 ± 1.625
0.0LeuXaa: 0.0 ± 0.0
Met
1.838MetAla: 1.838 ± 0.726
0.919MetCys: 0.919 ± 1.34
2.757MetAsp: 2.757 ± 1.289
0.0MetGlu: 0.0 ± 0.0
1.838MetPhe: 1.838 ± 1.726
1.838MetGly: 1.838 ± 1.378
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
0.919MetLeu: 0.919 ± 0.899
0.0MetMet: 0.0 ± 0.0
0.919MetAsn: 0.919 ± 0.863
0.919MetPro: 0.919 ± 0.674
0.919MetGln: 0.919 ± 0.957
0.919MetArg: 0.919 ± 1.34
0.919MetSer: 0.919 ± 0.863
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
1.838MetTrp: 1.838 ± 0.971
3.676MetTyr: 3.676 ± 3.452
0.0MetXaa: 0.0 ± 0.0
Asn
3.676AsnAla: 3.676 ± 1.677
2.757AsnCys: 2.757 ± 1.133
1.838AsnAsp: 1.838 ± 0.726
3.676AsnGlu: 3.676 ± 0.983
1.838AsnPhe: 1.838 ± 1.123
3.676AsnGly: 3.676 ± 1.225
6.434AsnHis: 6.434 ± 3.234
0.919AsnIle: 0.919 ± 0.674
0.919AsnLys: 0.919 ± 0.674
4.596AsnLeu: 4.596 ± 2.747
1.838AsnMet: 1.838 ± 1.649
1.838AsnAsn: 1.838 ± 0.997
3.676AsnPro: 3.676 ± 1.048
0.919AsnGln: 0.919 ± 0.674
1.838AsnArg: 1.838 ± 1.289
1.838AsnSer: 1.838 ± 1.726
5.515AsnThr: 5.515 ± 1.28
5.515AsnVal: 5.515 ± 1.907
0.0AsnTrp: 0.0 ± 0.0
3.676AsnTyr: 3.676 ± 1.942
0.0AsnXaa: 0.0 ± 0.0
Pro
2.757ProAla: 2.757 ± 1.133
1.838ProCys: 1.838 ± 1.142
1.838ProAsp: 1.838 ± 1.142
2.757ProGlu: 2.757 ± 1.409
2.757ProPhe: 2.757 ± 1.787
1.838ProGly: 1.838 ± 0.726
4.596ProHis: 4.596 ± 2.607
2.757ProIle: 2.757 ± 1.852
1.838ProLys: 1.838 ± 1.347
3.676ProLeu: 3.676 ± 1.809
1.838ProMet: 1.838 ± 1.11
4.596ProAsn: 4.596 ± 1.917
0.919ProPro: 0.919 ± 0.674
4.596ProGln: 4.596 ± 1.135
4.596ProArg: 4.596 ± 1.335
4.596ProSer: 4.596 ± 2.119
4.596ProThr: 4.596 ± 1.718
3.676ProVal: 3.676 ± 2.464
1.838ProTrp: 1.838 ± 1.378
1.838ProTyr: 1.838 ± 1.726
0.0ProXaa: 0.0 ± 0.0
Gln
2.757GlnAla: 2.757 ± 0.876
0.0GlnCys: 0.0 ± 0.0
1.838GlnAsp: 1.838 ± 1.232
1.838GlnGlu: 1.838 ± 0.726
0.919GlnPhe: 0.919 ± 0.674
1.838GlnGly: 1.838 ± 1.013
2.757GlnHis: 2.757 ± 2.085
2.757GlnIle: 2.757 ± 1.533
1.838GlnLys: 1.838 ± 1.797
2.757GlnLeu: 2.757 ± 1.852
0.0GlnMet: 0.0 ± 0.0
0.919GlnAsn: 0.919 ± 0.899
3.676GlnPro: 3.676 ± 2.892
0.919GlnGln: 0.919 ± 1.34
3.676GlnArg: 3.676 ± 1.305
3.676GlnSer: 3.676 ± 1.677
4.596GlnThr: 4.596 ± 1.958
5.515GlnVal: 5.515 ± 1.335
0.0GlnTrp: 0.0 ± 0.0
0.919GlnTyr: 0.919 ± 0.674
0.0GlnXaa: 0.0 ± 0.0
Arg
4.596ArgAla: 4.596 ± 1.443
2.757ArgCys: 2.757 ± 1.359
5.515ArgAsp: 5.515 ± 2.236
2.757ArgGlu: 2.757 ± 1.706
7.353ArgPhe: 7.353 ± 2.096
3.676ArgGly: 3.676 ± 1.541
2.757ArgHis: 2.757 ± 1.92
2.757ArgIle: 2.757 ± 1.852
1.838ArgLys: 1.838 ± 1.123
3.676ArgLeu: 3.676 ± 2.089
3.676ArgMet: 3.676 ± 2.583
0.0ArgAsn: 0.0 ± 0.0
6.434ArgPro: 6.434 ± 2.183
1.838ArgGln: 1.838 ± 1.217
7.353ArgArg: 7.353 ± 4.191
4.596ArgSer: 4.596 ± 1.718
6.434ArgThr: 6.434 ± 3.923
2.757ArgVal: 2.757 ± 1.787
0.0ArgTrp: 0.0 ± 0.0
1.838ArgTyr: 1.838 ± 1.142
0.0ArgXaa: 0.0 ± 0.0
Ser
2.757SerAla: 2.757 ± 1.429
0.0SerCys: 0.0 ± 0.0
4.596SerAsp: 4.596 ± 1.731
2.757SerGlu: 2.757 ± 1.429
2.757SerPhe: 2.757 ± 0.977
0.919SerGly: 0.919 ± 0.863
2.757SerHis: 2.757 ± 2.08
3.676SerIle: 3.676 ± 1.753
4.596SerLys: 4.596 ± 2.21
3.676SerLeu: 3.676 ± 1.305
0.919SerMet: 0.919 ± 1.34
5.515SerAsn: 5.515 ± 2.177
10.11SerPro: 10.11 ± 2.109
2.757SerGln: 2.757 ± 1.852
6.434SerArg: 6.434 ± 2.213
11.949SerSer: 11.949 ± 6.586
7.353SerThr: 7.353 ± 3.283
6.434SerVal: 6.434 ± 3.339
0.919SerTrp: 0.919 ± 0.863
3.676SerTyr: 3.676 ± 2.025
0.0SerXaa: 0.0 ± 0.0
Thr
6.434ThrAla: 6.434 ± 2.103
0.919ThrCys: 0.919 ± 0.863
0.919ThrAsp: 0.919 ± 0.674
2.757ThrGlu: 2.757 ± 1.536
2.757ThrPhe: 2.757 ± 1.706
5.515ThrGly: 5.515 ± 2.45
4.596ThrHis: 4.596 ± 2.298
1.838ThrIle: 1.838 ± 0.997
1.838ThrLys: 1.838 ± 1.013
4.596ThrLeu: 4.596 ± 1.894
0.919ThrMet: 0.919 ± 0.674
3.676ThrAsn: 3.676 ± 1.074
5.515ThrPro: 5.515 ± 1.705
1.838ThrGln: 1.838 ± 1.289
3.676ThrArg: 3.676 ± 1.805
4.596ThrSer: 4.596 ± 3.005
4.596ThrThr: 4.596 ± 3.053
3.676ThrVal: 3.676 ± 3.452
0.919ThrTrp: 0.919 ± 1.34
2.757ThrTyr: 2.757 ± 1.814
0.0ThrXaa: 0.0 ± 0.0
Val
0.919ValAla: 0.919 ± 0.911
0.0ValCys: 0.0 ± 0.0
1.838ValAsp: 1.838 ± 1.347
1.838ValGlu: 1.838 ± 0.971
3.676ValPhe: 3.676 ± 2.011
1.838ValGly: 1.838 ± 1.378
0.919ValHis: 0.919 ± 0.899
4.596ValIle: 4.596 ± 1.602
1.838ValLys: 1.838 ± 0.726
6.434ValLeu: 6.434 ± 2.602
1.838ValMet: 1.838 ± 1.142
0.919ValAsn: 0.919 ± 0.899
3.676ValPro: 3.676 ± 1.443
5.515ValGln: 5.515 ± 3.218
3.676ValArg: 3.676 ± 2.681
4.596ValSer: 4.596 ± 1.431
4.596ValThr: 4.596 ± 3.39
2.757ValVal: 2.757 ± 1.429
1.838ValTrp: 1.838 ± 1.123
3.676ValTyr: 3.676 ± 1.452
0.0ValXaa: 0.0 ± 0.0
Trp
2.757TrpAla: 2.757 ± 2.021
0.0TrpCys: 0.0 ± 0.0
0.919TrpAsp: 0.919 ± 0.899
0.919TrpGlu: 0.919 ± 0.911
0.0TrpPhe: 0.0 ± 0.0
0.919TrpGly: 0.919 ± 0.674
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.919TrpLeu: 0.919 ± 0.863
0.919TrpMet: 0.919 ± 0.863
0.919TrpAsn: 0.919 ± 1.34
0.0TrpPro: 0.0 ± 0.0
0.919TrpGln: 0.919 ± 0.674
1.838TrpArg: 1.838 ± 1.013
1.838TrpSer: 1.838 ± 1.467
1.838TrpThr: 1.838 ± 1.123
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.919TrpTyr: 0.919 ± 0.674
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.919TyrAla: 0.919 ± 0.863
0.0TyrCys: 0.0 ± 0.0
0.919TyrAsp: 0.919 ± 0.863
1.838TyrGlu: 1.838 ± 1.142
3.676TyrPhe: 3.676 ± 0.915
1.838TyrGly: 1.838 ± 0.726
0.919TyrHis: 0.919 ± 0.674
3.676TyrIle: 3.676 ± 1.677
0.919TyrLys: 0.919 ± 0.674
6.434TyrLeu: 6.434 ± 2.17
0.919TyrMet: 0.919 ± 1.036
4.596TyrAsn: 4.596 ± 0.996
0.0TyrPro: 0.0 ± 0.0
0.919TyrGln: 0.919 ± 0.863
3.676TyrArg: 3.676 ± 3.452
2.757TyrSer: 2.757 ± 1.437
0.919TyrThr: 0.919 ± 0.863
1.838TyrVal: 1.838 ± 0.971
0.0TyrTrp: 0.0 ± 0.0
0.919TyrTyr: 0.919 ± 0.957
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1089 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
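A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the fixed seed, and the use of the sample standard deviation as the error are assumptions; the source only states that bootstrapping (×100) was done at the protein level.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, counting pairs within proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation across resamples.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not the real proteome.
toy = ["MKKA", "AKK", "KKKK"]
err_kk = bootstrap_error(toy, "KK")
err_ww = bootstrap_error(toy, "WW")  # pair absent from every protein
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so a proteome with only a few proteins can show large errors for dipeptides concentrated in one of them.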

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski