Amino acid dipeptide frequency for Jujube mosaic-associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
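As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping residue pairs and reports them in per mille. It is not the database's own code: the function names are hypothetical, and it assumes that pairs are counted only within a protein (never across two proteins) and that any residue outside the 20 standard one-letter codes is mapped to Xaa.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
# Assumption: every other symbol is treated as Xaa.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        residues = [THREE_LETTER.get(aa, "Xaa") for aa in seq.upper()]
        # zip pairs residue i with residue i+1; no pair spans two proteins.
        for first, second in zip(residues, residues[1:]):
            counts[first + second] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille of all counted pairs."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input for illustration only, not the real proteome.
example = ["MAAL", "ALX"]
freqs = dipeptide_per_mille(example)
```

With the toy input, five pairs are counted in total and AlaLeu occurs twice, so `freqs["AlaLeu"]` is 400 ‰. Pairs that were never observed are simply absent from the result and correspond to the 0.0 entries in the table.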

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
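The uniform expectation and the per mille unit described above can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the over/under rule is a plain comparison with the uniform expectation, which is an assumption about how the colouring is decided.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def per_mille_to_fraction(value_per_mille):
    """Convert a table value in per mille to a plain fraction."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=None):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if expected is None:
        expected = expected_per_mille()
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


uniform_20 = expected_per_mille()    # 1000 / 400 = 2.5 per mille (0.25%)
uniform_21 = expected_per_mille(21)  # 1000 / 441, about 2.27 per mille
asp_leu = classify(10.979)           # AspLeu value from the table: "over"
cys_ala = classify(0.439)            # CysAla value from the table: "under"
```

For example, the AlaAla entry of 4.831 ‰ corresponds to a fraction of about 0.004831, i.e. roughly 0.48% of all dipeptides.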

For more information, see the sequence space article on Wikipedia.

Frequency ± error (‰) for each dipeptide, grouped by first residue; within each group the second residue runs in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 4.831 ± 2.047
AlaCys: 1.318 ± 0.748
AlaAsp: 3.953 ± 1.909
AlaGlu: 4.831 ± 1.633
AlaPhe: 2.196 ± 1.103
AlaGly: 2.635 ± 1.247
AlaHis: 0.0 ± 0.0
AlaIle: 3.513 ± 1.326
AlaLys: 2.635 ± 2.319
AlaLeu: 3.953 ± 1.57
AlaMet: 3.074 ± 1.745
AlaAsn: 3.513 ± 2.385
AlaPro: 2.196 ± 1.736
AlaGln: 3.513 ± 1.773
AlaArg: 3.513 ± 2.651
AlaSer: 3.953 ± 1.912
AlaThr: 6.148 ± 1.633
AlaVal: 6.588 ± 3.798
AlaTrp: 0.439 ± 0.249
AlaTyr: 3.074 ± 0.864
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.439 ± 0.249
CysCys: 0.439 ± 0.249
CysAsp: 0.0 ± 0.0
CysGlu: 0.439 ± 0.249
CysPhe: 0.878 ± 0.499
CysGly: 0.878 ± 0.499
CysHis: 0.878 ± 0.499
CysIle: 0.878 ± 0.499
CysLys: 2.635 ± 1.496
CysLeu: 0.439 ± 0.249
CysMet: 0.439 ± 0.249
CysAsn: 0.439 ± 0.249
CysPro: 1.318 ± 0.748
CysGln: 0.878 ± 0.499
CysArg: 1.318 ± 0.748
CysSer: 1.318 ± 0.748
CysThr: 0.439 ± 0.249
CysVal: 1.318 ± 0.555
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.709 ± 1.233
AspCys: 0.439 ± 0.249
AspAsp: 4.392 ± 1.806
AspGlu: 1.757 ± 0.997
AspPhe: 1.757 ± 0.597
AspGly: 2.635 ± 1.076
AspHis: 1.318 ± 0.748
AspIle: 3.513 ± 1.226
AspLys: 2.635 ± 1.231
AspLeu: 10.979 ± 1.299
AspMet: 0.0 ± 0.0
AspAsn: 2.196 ± 1.103
AspPro: 5.27 ± 2.43
AspGln: 5.27 ± 1.646
AspArg: 3.513 ± 1.526
AspSer: 3.953 ± 1.648
AspThr: 3.074 ± 1.118
AspVal: 3.513 ± 1.575
AspTrp: 1.757 ± 0.597
AspTyr: 1.757 ± 0.597
AspXaa: 0.0 ± 0.0

Glu
GluAla: 2.196 ± 2.311
GluCys: 0.439 ± 0.249
GluAsp: 5.709 ± 0.835
GluGlu: 6.148 ± 3.315
GluPhe: 1.318 ± 0.87
GluGly: 3.513 ± 1.194
GluHis: 2.196 ± 1.246
GluIle: 6.588 ± 2.422
GluLys: 4.392 ± 2.652
GluLeu: 6.148 ± 0.681
GluMet: 0.878 ± 1.036
GluAsn: 3.953 ± 1.691
GluPro: 0.878 ± 0.499
GluGln: 5.709 ± 1.723
GluArg: 3.953 ± 0.934
GluSer: 4.831 ± 1.59
GluThr: 3.953 ± 1.57
GluVal: 1.757 ± 1.113
GluTrp: 0.878 ± 0.499
GluTyr: 2.196 ± 1.555
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.074 ± 1.142
PheCys: 0.878 ± 0.499
PheAsp: 1.318 ± 0.748
PheGlu: 2.196 ± 2.246
PhePhe: 0.439 ± 0.249
PheGly: 1.318 ± 0.555
PheHis: 1.318 ± 0.87
PheIle: 2.635 ± 1.496
PheLys: 3.074 ± 1.681
PheLeu: 2.196 ± 0.942
PheMet: 0.439 ± 0.249
PheAsn: 1.757 ± 0.997
PhePro: 1.757 ± 0.997
PheGln: 1.757 ± 1.022
PheArg: 0.878 ± 0.499
PheSer: 1.318 ± 0.748
PheThr: 1.757 ± 0.997
PheVal: 0.878 ± 0.499
PheTrp: 0.439 ± 0.249
PheTyr: 0.878 ± 0.499
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.635 ± 1.695
GlyCys: 0.439 ± 0.249
GlyAsp: 4.831 ± 1.068
GlyGlu: 3.074 ± 1.118
GlyPhe: 0.878 ± 0.906
GlyGly: 3.953 ± 1.124
GlyHis: 0.878 ± 0.499
GlyIle: 2.635 ± 1.496
GlyLys: 3.074 ± 1.246
GlyLeu: 5.27 ± 1.63
GlyMet: 1.318 ± 1.147
GlyAsn: 2.196 ± 1.246
GlyPro: 4.831 ± 2.205
GlyGln: 0.878 ± 0.499
GlyArg: 3.513 ± 1.038
GlySer: 3.074 ± 0.844
GlyThr: 3.513 ± 1.694
GlyVal: 3.513 ± 1.226
GlyTrp: 0.439 ± 0.249
GlyTyr: 2.635 ± 1.857
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.0 ± 0.0
HisCys: 0.878 ± 0.499
HisAsp: 3.074 ± 1.745
HisGlu: 0.878 ± 0.974
HisPhe: 0.878 ± 0.499
HisGly: 1.318 ± 0.555
HisHis: 0.878 ± 0.619
HisIle: 3.953 ± 1.124
HisLys: 0.439 ± 0.249
HisLeu: 1.318 ± 0.748
HisMet: 0.439 ± 0.249
HisAsn: 0.439 ± 1.125
HisPro: 0.439 ± 0.249
HisGln: 1.318 ± 0.998
HisArg: 2.196 ± 1.981
HisSer: 1.757 ± 1.948
HisThr: 0.0 ± 0.0
HisVal: 1.757 ± 0.997
HisTrp: 0.878 ± 0.499
HisTyr: 0.878 ± 0.619
HisXaa: 0.0 ± 0.0

Ile
IleAla: 7.027 ± 3.062
IleCys: 1.318 ± 0.748
IleAsp: 3.074 ± 1.745
IleGlu: 4.392 ± 1.233
IlePhe: 0.878 ± 0.499
IleGly: 4.392 ± 1.081
IleHis: 2.196 ± 0.728
IleIle: 3.074 ± 1.118
IleLys: 1.757 ± 1.446
IleLeu: 6.148 ± 2.602
IleMet: 0.439 ± 0.763
IleAsn: 3.953 ± 1.57
IlePro: 4.392 ± 1.867
IleGln: 3.953 ± 1.487
IleArg: 3.074 ± 1.351
IleSer: 2.635 ± 2.093
IleThr: 3.513 ± 1.994
IleVal: 3.074 ± 1.213
IleTrp: 0.0 ± 0.0
IleTyr: 0.878 ± 0.974
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.513 ± 1.843
LysCys: 2.196 ± 1.246
LysAsp: 3.074 ± 1.187
LysGlu: 4.392 ± 2.493
LysPhe: 2.635 ± 1.496
LysGly: 3.953 ± 1.787
LysHis: 1.757 ± 0.83
LysIle: 2.635 ± 1.496
LysLys: 5.709 ± 1.702
LysLeu: 3.953 ± 1.691
LysMet: 2.196 ± 0.914
LysAsn: 2.196 ± 2.019
LysPro: 3.074 ± 1.925
LysGln: 4.831 ± 4.444
LysArg: 1.318 ± 0.748
LysSer: 3.074 ± 1.681
LysThr: 2.196 ± 0.863
LysVal: 4.392 ± 2.248
LysTrp: 0.878 ± 0.974
LysTyr: 3.074 ± 0.844
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.27 ± 3.055
LeuCys: 0.439 ± 0.763
LeuAsp: 6.148 ± 1.465
LeuGlu: 8.783 ± 4.628
LeuPhe: 5.27 ± 2.149
LeuGly: 5.27 ± 2.322
LeuHis: 1.757 ± 1.948
LeuIle: 3.513 ± 0.815
LeuLys: 8.783 ± 2.257
LeuLeu: 5.709 ± 2.572
LeuMet: 0.0 ± 0.888
LeuAsn: 3.513 ± 1.34
LeuPro: 6.148 ± 2.031
LeuGln: 7.027 ± 3.217
LeuArg: 4.392 ± 1.548
LeuSer: 7.466 ± 1.223
LeuThr: 3.953 ± 2.881
LeuVal: 5.709 ± 3.247
LeuTrp: 0.439 ± 1.128
LeuTyr: 3.513 ± 1.455
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.318 ± 1.148
MetCys: 0.878 ± 0.499
MetAsp: 0.878 ± 0.499
MetGlu: 1.318 ± 0.555
MetPhe: 0.0 ± 0.0
MetGly: 0.439 ± 0.249
MetHis: 0.0 ± 0.0
MetIle: 2.635 ± 0.91
MetLys: 0.878 ± 0.499
MetLeu: 2.635 ± 1.496
MetMet: 2.196 ± 0.728
MetAsn: 1.757 ± 0.597
MetPro: 1.318 ± 0.748
MetGln: 0.878 ± 0.499
MetArg: 0.439 ± 0.249
MetSer: 3.074 ± 2.334
MetThr: 2.196 ± 0.924
MetVal: 1.318 ± 0.555
MetTrp: 0.0 ± 0.0
MetTyr: 0.878 ± 0.499
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.196 ± 1.326
AsnCys: 1.318 ± 0.748
AsnAsp: 2.635 ± 0.91
AsnGlu: 2.196 ± 1.246
AsnPhe: 3.074 ± 1.325
AsnGly: 1.757 ± 0.597
AsnHis: 0.439 ± 1.125
AsnIle: 1.318 ± 0.87
AsnLys: 2.196 ± 0.942
AsnLeu: 4.392 ± 1.267
AsnMet: 1.757 ± 0.997
AsnAsn: 2.196 ± 1.555
AsnPro: 3.074 ± 1.125
AsnGln: 2.196 ± 0.876
AsnArg: 2.196 ± 1.149
AsnSer: 2.196 ± 0.728
AsnThr: 3.953 ± 1.957
AsnVal: 1.757 ± 1.238
AsnTrp: 0.878 ± 0.619
AsnTyr: 3.074 ± 3.05
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.392 ± 1.848
ProCys: 0.439 ± 0.249
ProAsp: 7.466 ± 2.81
ProGlu: 3.953 ± 1.067
ProPhe: 1.318 ± 0.748
ProGly: 3.074 ± 1.118
ProHis: 1.318 ± 0.748
ProIle: 2.196 ± 2.019
ProLys: 3.074 ± 0.961
ProLeu: 5.27 ± 1.742
ProMet: 0.878 ± 0.499
ProAsn: 2.196 ± 1.246
ProPro: 5.27 ± 2.285
ProGln: 1.318 ± 0.748
ProArg: 3.074 ± 1.118
ProSer: 3.953 ± 1.341
ProThr: 2.635 ± 0.838
ProVal: 2.196 ± 1.246
ProTrp: 0.878 ± 0.499
ProTyr: 0.878 ± 0.499
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.953 ± 1.296
GlnCys: 0.878 ± 0.499
GlnAsp: 1.757 ± 0.997
GlnGlu: 3.074 ± 1.198
GlnPhe: 0.878 ± 0.974
GlnGly: 2.635 ± 0.91
GlnHis: 3.074 ± 1.76
GlnIle: 5.709 ± 1.72
GlnLys: 4.831 ± 1.18
GlnLeu: 5.709 ± 2.66
GlnMet: 2.635 ± 0.832
GlnAsn: 2.635 ± 1.247
GlnPro: 1.757 ± 0.83
GlnGln: 3.074 ± 2.851
GlnArg: 2.635 ± 1.286
GlnSer: 1.318 ± 2.324
GlnThr: 2.196 ± 0.728
GlnVal: 3.513 ± 2.522
GlnTrp: 1.318 ± 0.748
GlnTyr: 3.074 ± 1.745
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.074 ± 2.005
ArgCys: 0.439 ± 0.249
ArgAsp: 1.757 ± 1.652
ArgGlu: 3.074 ± 0.961
ArgPhe: 1.757 ± 1.113
ArgGly: 2.635 ± 1.109
ArgHis: 0.878 ± 0.499
ArgIle: 2.196 ± 0.863
ArgLys: 3.513 ± 1.795
ArgLeu: 5.27 ± 2.043
ArgMet: 2.635 ± 1.052
ArgAsn: 1.318 ± 0.847
ArgPro: 3.953 ± 0.891
ArgGln: 2.196 ± 0.728
ArgArg: 3.953 ± 5.014
ArgSer: 4.392 ± 0.998
ArgThr: 3.953 ± 3.363
ArgVal: 3.953 ± 1.124
ArgTrp: 1.318 ± 0.555
ArgTyr: 0.878 ± 0.619
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.27 ± 1.894
SerCys: 0.439 ± 0.249
SerAsp: 5.709 ± 2.315
SerGlu: 4.392 ± 3.083
SerPhe: 1.318 ± 0.87
SerGly: 2.635 ± 1.997
SerHis: 2.196 ± 1.149
SerIle: 3.513 ± 1.721
SerLys: 3.513 ± 2.695
SerLeu: 5.27 ± 5.679
SerMet: 0.439 ± 0.249
SerAsn: 3.074 ± 2.139
SerPro: 2.635 ± 1.231
SerGln: 2.196 ± 1.326
SerArg: 4.831 ± 2.992
SerSer: 4.392 ± 1.845
SerThr: 5.709 ± 1.898
SerVal: 1.757 ± 0.597
SerTrp: 1.757 ± 0.861
SerTyr: 3.074 ± 1.246
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.27 ± 0.836
ThrCys: 0.0 ± 0.0
ThrAsp: 2.635 ± 1.496
ThrGlu: 7.466 ± 3.329
ThrPhe: 1.757 ± 0.997
ThrGly: 5.27 ± 1.819
ThrHis: 0.878 ± 1.334
ThrIle: 4.831 ± 1.22
ThrLys: 1.757 ± 1.046
ThrLeu: 6.148 ± 2.412
ThrMet: 2.196 ± 0.728
ThrAsn: 3.074 ± 2.597
ThrPro: 1.757 ± 1.022
ThrGln: 2.196 ± 1.981
ThrArg: 2.196 ± 0.728
ThrSer: 3.953 ± 1.067
ThrThr: 4.392 ± 1.671
ThrVal: 2.196 ± 0.728
ThrTrp: 0.439 ± 0.249
ThrTyr: 1.318 ± 0.748
ThrXaa: 0.0 ± 0.0

Val
ValAla: 1.757 ± 1.046
ValCys: 1.757 ± 0.997
ValAsp: 3.953 ± 1.266
ValGlu: 2.635 ± 3.82
ValPhe: 1.757 ± 0.997
ValGly: 2.196 ± 0.924
ValHis: 0.878 ± 0.499
ValIle: 2.635 ± 1.341
ValLys: 3.074 ± 1.681
ValLeu: 6.588 ± 2.034
ValMet: 1.757 ± 0.597
ValAsn: 1.318 ± 1.207
ValPro: 3.953 ± 1.308
ValGln: 3.074 ± 1.871
ValArg: 1.757 ± 0.83
ValSer: 3.513 ± 2.173
ValThr: 4.392 ± 1.17
ValVal: 1.757 ± 1.113
ValTrp: 1.318 ± 0.748
ValTyr: 3.074 ± 1.325
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.757 ± 0.861
TrpCys: 0.0 ± 0.0
TrpAsp: 0.439 ± 0.249
TrpGlu: 1.318 ± 0.748
TrpPhe: 0.0 ± 0.0
TrpGly: 1.757 ± 0.597
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.318 ± 0.748
TrpLeu: 2.635 ± 1.231
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.757 ± 0.997
TrpArg: 1.757 ± 0.597
TrpSer: 0.439 ± 0.249
TrpThr: 0.439 ± 0.249
TrpVal: 0.439 ± 0.763
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.439 ± 1.125
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.196 ± 0.728
TyrCys: 0.439 ± 0.249
TyrAsp: 2.635 ± 1.38
TyrGlu: 1.318 ± 0.87
TyrPhe: 1.318 ± 0.998
TyrGly: 1.318 ± 0.998
TyrHis: 0.878 ± 0.499
TyrIle: 2.196 ± 1.246
TyrLys: 1.757 ± 0.861
TyrLeu: 3.513 ± 0.815
TyrMet: 0.878 ± 0.499
TyrAsn: 3.074 ± 1.125
TyrPro: 1.757 ± 1.022
TyrGln: 2.635 ± 0.859
TyrArg: 2.635 ± 1.877
TyrSer: 3.513 ± 2.476
TyrThr: 1.318 ± 0.847
TyrVal: 1.757 ± 1.046
TyrTrp: 0.439 ± 0.249
TyrTyr: 1.318 ± 0.998
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 5 proteins (2278 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
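The page does not describe the bootstrap procedure in detail. One plausible reading, sketched below, is to resample whole proteins with replacement, recompute the dipeptide frequencies for each replicate, and report the spread across replicates. The function names, the use of the population standard deviation, and the one-letter pair keys are all assumptions made for this illustration.

```python
import random
from collections import Counter
from statistics import pstdev


def pair_per_mille(proteins):
    """Dipeptide frequencies (per mille), keyed by one-letter pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, replicates=100, seed=0):
    """Spread of each dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)
    samples = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        resampled = [rng.choice(proteins) for _ in proteins]
        samples.append(pair_per_mille(resampled))
    # Collect every pair seen in any replicate; missing pairs count as 0.
    pairs = set().union(*samples)
    return {p: pstdev([s.get(p, 0.0) for s in samples]) for p in pairs}


# Toy input for illustration only, not the real proteome.
toy_proteins = ["MAAL", "ALKK", "GGAL"]
errors = bootstrap_error(toy_proteins, replicates=100, seed=1)
```

With only 5 proteins in the proteome, each replicate can differ substantially from the original set, which may explain why some errors in the table are as large as the frequencies themselves.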

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski