Amino acid dipeptide frequency for Hibiscus chlorotic ringspot virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 1/400 = 0.25%. This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. On this scale the 0.25% expectation corresponds to 2.5‰.
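The calculation described above can be sketched in Python. This is only an illustration, not the code used to build this page: the function names are hypothetical, sequences are assumed to be given as one-letter codes, and pairs are assumed to be counted within each protein (never across two proteins), which the page does not state.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def expected_uniform_per_mille(n_residue_types=20):
    """Uniform expectation: 1000 / 400 = 2.5 per mille for 20 residue types."""
    return 1000.0 / n_residue_types ** 2


def classify(freq_per_mille, n_residue_types=20):
    """Label a frequency relative to the uniform expectation."""
    expected = expected_uniform_per_mille(n_residue_types)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    print(expected_uniform_per_mille())  # 2.5
    print(classify(11.206))              # AlaAla value from the table: over
    print(classify(0.534))               # AlaGln value from the table: under
```

Dipeptides that never occur are simply absent from the returned dictionary, so a lookup such as freqs.get("CP", 0.0) treats them as 0.0‰.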

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue. Second residues follow the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa.

Ala
AlaAla: 11.206 ± 4.489
AlaCys: 3.202 ± 0.749
AlaAsp: 2.668 ± 0.906
AlaGlu: 4.269 ± 1.956
AlaPhe: 4.269 ± 1.51
AlaGly: 4.269 ± 1.856
AlaHis: 1.067 ± 0.498
AlaIle: 8.004 ± 1.804
AlaLys: 2.668 ± 1.202
AlaLeu: 8.004 ± 2.427
AlaMet: 1.601 ± 0.598
AlaAsn: 3.202 ± 1.507
AlaPro: 4.803 ± 1.452
AlaGln: 0.534 ± 0.35
AlaArg: 3.735 ± 1.076
AlaSer: 7.471 ± 1.133
AlaThr: 4.803 ± 2.523
AlaVal: 7.471 ± 2.445
AlaTrp: 1.067 ± 0.633
AlaTyr: 1.067 ± 0.699
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.601 ± 0.598
CysCys: 0.534 ± 0.587
CysAsp: 1.067 ± 0.498
CysGlu: 0.534 ± 0.525
CysPhe: 0.534 ± 0.757
CysGly: 2.668 ± 1.322
CysHis: 0.534 ± 0.587
CysIle: 1.601 ± 0.652
CysLys: 2.134 ± 0.724
CysLeu: 3.202 ± 1.214
CysMet: 1.067 ± 0.633
CysAsn: 0.534 ± 0.525
CysPro: 0.0 ± 0.0
CysGln: 1.067 ± 0.539
CysArg: 1.601 ± 0.681
CysSer: 1.067 ± 0.699
CysThr: 0.0 ± 0.0
CysVal: 2.668 ± 1.4
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.735 ± 1.623
AspCys: 0.534 ± 0.35
AspAsp: 2.134 ± 0.995
AspGlu: 2.668 ± 1.395
AspPhe: 2.134 ± 0.755
AspGly: 2.668 ± 1.141
AspHis: 0.0 ± 0.0
AspIle: 1.601 ± 0.694
AspLys: 2.668 ± 1.137
AspLeu: 1.067 ± 0.735
AspMet: 1.601 ± 0.604
AspAsn: 0.534 ± 0.525
AspPro: 4.803 ± 1.702
AspGln: 0.534 ± 0.35
AspArg: 1.067 ± 0.498
AspSer: 4.269 ± 1.846
AspThr: 3.735 ± 0.438
AspVal: 2.668 ± 1.062
AspTrp: 0.0 ± 0.0
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.735 ± 1.663
GluCys: 1.067 ± 0.699
GluAsp: 0.534 ± 0.35
GluGlu: 4.803 ± 2.487
GluPhe: 2.134 ± 0.962
GluGly: 2.134 ± 0.995
GluHis: 2.668 ± 1.383
GluIle: 3.735 ± 1.457
GluLys: 1.067 ± 0.498
GluLeu: 6.937 ± 2.287
GluMet: 0.0 ± 0.0
GluAsn: 2.134 ± 0.955
GluPro: 4.269 ± 1.242
GluGln: 1.601 ± 1.761
GluArg: 4.269 ± 1.209
GluSer: 2.134 ± 0.945
GluThr: 2.134 ± 0.647
GluVal: 3.735 ± 1.61
GluTrp: 1.067 ± 0.699
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.668 ± 1.425
PheCys: 1.067 ± 0.699
PheAsp: 3.735 ± 1.134
PheGlu: 2.668 ± 1.425
PhePhe: 2.668 ± 1.062
PheGly: 4.803 ± 1.584
PheHis: 1.601 ± 1.451
PheIle: 2.134 ± 1.917
PheLys: 1.067 ± 0.699
PheLeu: 3.735 ± 2.041
PheMet: 0.534 ± 0.823
PheAsn: 2.134 ± 0.995
PhePro: 1.601 ± 0.598
PheGln: 3.202 ± 1.723
PheArg: 1.067 ± 0.782
PheSer: 2.134 ± 0.8
PheThr: 0.534 ± 0.525
PheVal: 2.134 ± 1.597
PheTrp: 0.0 ± 0.0
PheTyr: 1.067 ± 0.699
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.269 ± 1.263
GlyCys: 1.067 ± 0.699
GlyAsp: 2.668 ± 0.933
GlyGlu: 3.202 ± 1.362
GlyPhe: 2.134 ± 0.958
GlyGly: 6.403 ± 3.246
GlyHis: 1.067 ± 0.699
GlyIle: 2.668 ± 0.831
GlyLys: 5.336 ± 1.238
GlyLeu: 5.336 ± 1.554
GlyMet: 3.735 ± 0.897
GlyAsn: 2.134 ± 0.872
GlyPro: 1.601 ± 0.598
GlyGln: 2.134 ± 1.508
GlyArg: 5.336 ± 1.34
GlySer: 2.134 ± 1.01
GlyThr: 5.336 ± 1.716
GlyVal: 4.269 ± 0.989
GlyTrp: 1.067 ± 0.782
GlyTyr: 3.202 ± 0.749
GlyXaa: 0.0 ± 0.0

His
HisAla: 3.735 ± 2.046
HisCys: 0.534 ± 0.525
HisAsp: 1.601 ± 1.3
HisGlu: 1.067 ± 0.637
HisPhe: 2.134 ± 1.474
HisGly: 0.0 ± 0.0
HisHis: 1.601 ± 0.694
HisIle: 0.0 ± 0.0
HisLys: 0.534 ± 0.846
HisLeu: 3.202 ± 1.025
HisMet: 0.0 ± 0.0
HisAsn: 1.067 ± 0.836
HisPro: 1.067 ± 0.633
HisGln: 0.534 ± 0.35
HisArg: 2.668 ± 1.425
HisSer: 1.067 ± 0.699
HisThr: 2.134 ± 1.48
HisVal: 1.601 ± 0.681
HisTrp: 1.067 ± 0.699
HisTyr: 1.601 ± 0.8
HisXaa: 0.0 ± 0.0

Ile
IleAla: 10.139 ± 2.392
IleCys: 0.0 ± 0.0
IleAsp: 0.534 ± 0.587
IleGlu: 1.601 ± 0.962
IlePhe: 1.601 ± 1.306
IleGly: 2.668 ± 0.596
IleHis: 0.534 ± 0.713
IleIle: 1.601 ± 0.845
IleLys: 3.735 ± 1.682
IleLeu: 3.735 ± 1.341
IleMet: 2.134 ± 1.082
IleAsn: 3.735 ± 1.663
IlePro: 3.202 ± 0.81
IleGln: 1.601 ± 0.822
IleArg: 2.668 ± 0.764
IleSer: 3.202 ± 0.749
IleThr: 4.803 ± 1.529
IleVal: 1.601 ± 0.939
IleTrp: 0.0 ± 0.0
IleTyr: 1.067 ± 0.539
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.668 ± 1.355
LysCys: 0.534 ± 0.713
LysAsp: 1.067 ± 0.498
LysGlu: 2.668 ± 1.202
LysPhe: 1.601 ± 0.822
LysGly: 5.87 ± 1.745
LysHis: 1.067 ± 0.836
LysIle: 3.202 ± 0.71
LysLys: 1.601 ± 0.965
LysLeu: 6.403 ± 2.217
LysMet: 1.067 ± 1.085
LysAsn: 1.601 ± 0.961
LysPro: 3.735 ± 2.055
LysGln: 0.534 ± 0.525
LysArg: 3.735 ± 0.438
LysSer: 3.202 ± 2.345
LysThr: 1.067 ± 0.735
LysVal: 5.336 ± 1.594
LysTrp: 1.067 ± 1.426
LysTyr: 1.601 ± 0.604
LysXaa: 0.534 ± 0.35

Leu
LeuAla: 11.206 ± 2.281
LeuCys: 2.668 ± 1.214
LeuAsp: 2.134 ± 1.399
LeuGlu: 5.87 ± 2.113
LeuPhe: 4.803 ± 2.264
LeuGly: 3.735 ± 1.877
LeuHis: 2.668 ± 1.214
LeuIle: 4.803 ± 3.06
LeuLys: 8.004 ± 2.78
LeuLeu: 13.874 ± 6.567
LeuMet: 1.601 ± 1.421
LeuAsn: 3.202 ± 1.187
LeuPro: 4.803 ± 2.164
LeuGln: 5.336 ± 3.089
LeuArg: 9.605 ± 2.931
LeuSer: 11.74 ± 6.263
LeuThr: 5.336 ± 1.541
LeuVal: 6.403 ± 0.947
LeuTrp: 2.134 ± 1.416
LeuTyr: 2.134 ± 0.778
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.202 ± 0.891
MetCys: 0.0 ± 0.0
MetAsp: 0.534 ± 0.35
MetGlu: 1.601 ± 1.159
MetPhe: 0.0 ± 0.0
MetGly: 1.601 ± 0.694
MetHis: 0.534 ± 0.35
MetIle: 1.067 ± 0.539
MetLys: 1.601 ± 0.868
MetLeu: 3.735 ± 1.245
MetMet: 1.067 ± 1.057
MetAsn: 0.0 ± 0.0
MetPro: 0.534 ± 0.587
MetGln: 0.0 ± 0.0
MetArg: 4.269 ± 1.27
MetSer: 1.601 ± 1.306
MetThr: 0.534 ± 0.587
MetVal: 1.067 ± 0.633
MetTrp: 1.067 ± 1.174
MetTyr: 1.067 ± 0.498
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.067 ± 0.498
AsnCys: 1.067 ± 0.735
AsnAsp: 2.668 ± 1.519
AsnGlu: 1.601 ± 0.822
AsnPhe: 1.067 ± 0.836
AsnGly: 1.601 ± 0.681
AsnHis: 0.534 ± 0.525
AsnIle: 3.202 ± 1.345
AsnLys: 0.534 ± 0.846
AsnLeu: 2.668 ± 1.163
AsnMet: 0.534 ± 0.451
AsnAsn: 1.601 ± 1.049
AsnPro: 2.668 ± 1.071
AsnGln: 0.0 ± 0.0
AsnArg: 2.668 ± 1.072
AsnSer: 3.202 ± 1.246
AsnThr: 2.134 ± 1.232
AsnVal: 2.668 ± 0.938
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.534 ± 0.525
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.601 ± 0.604
ProCys: 2.668 ± 0.607
ProAsp: 2.668 ± 0.932
ProGlu: 1.601 ± 0.822
ProPhe: 1.601 ± 0.652
ProGly: 5.336 ± 1.223
ProHis: 1.601 ± 0.965
ProIle: 2.134 ± 1.642
ProLys: 0.534 ± 0.525
ProLeu: 4.803 ± 2.685
ProMet: 1.067 ± 0.57
ProAsn: 0.0 ± 0.0
ProPro: 1.067 ± 1.174
ProGln: 3.202 ± 1.602
ProArg: 7.471 ± 2.847
ProSer: 1.067 ± 1.05
ProThr: 8.004 ± 2.467
ProVal: 6.403 ± 0.834
ProTrp: 2.134 ± 1.467
ProTyr: 0.534 ± 0.35
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.668 ± 1.88
GlnCys: 0.534 ± 0.35
GlnAsp: 0.0 ± 0.0
GlnGlu: 0.534 ± 0.35
GlnPhe: 0.534 ± 0.35
GlnGly: 2.134 ± 0.679
GlnHis: 1.067 ± 0.539
GlnIle: 1.601 ± 0.961
GlnLys: 2.134 ± 1.467
GlnLeu: 3.735 ± 2.073
GlnMet: 2.668 ± 0.932
GlnAsn: 0.534 ± 0.757
GlnPro: 2.668 ± 1.163
GlnGln: 1.067 ± 0.959
GlnArg: 2.134 ± 0.735
GlnSer: 4.269 ± 1.881
GlnThr: 1.067 ± 0.754
GlnVal: 1.067 ± 0.637
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.534 ± 0.525
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 7.471 ± 1.325
ArgCys: 1.601 ± 0.694
ArgAsp: 3.735 ± 1.489
ArgGlu: 2.668 ± 1.095
ArgPhe: 2.134 ± 0.995
ArgGly: 3.735 ± 1.223
ArgHis: 1.601 ± 0.768
ArgIle: 2.134 ± 1.093
ArgLys: 2.134 ± 1.341
ArgLeu: 7.471 ± 2.565
ArgMet: 1.601 ± 0.74
ArgAsn: 2.668 ± 0.837
ArgPro: 3.202 ± 1.265
ArgGln: 2.134 ± 1.729
ArgArg: 7.471 ± 3.272
ArgSer: 4.803 ± 1.781
ArgThr: 4.269 ± 1.458
ArgVal: 4.803 ± 1.439
ArgTrp: 2.668 ± 1.031
ArgTyr: 3.202 ± 1.362
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.668 ± 0.607
SerCys: 0.534 ± 0.587
SerAsp: 2.134 ± 1.021
SerGlu: 2.668 ± 1.38
SerPhe: 4.269 ± 1.909
SerGly: 4.803 ± 1.444
SerHis: 2.668 ± 1.273
SerIle: 3.735 ± 0.438
SerLys: 6.403 ± 1.301
SerLeu: 13.34 ± 5.917
SerMet: 2.668 ± 0.736
SerAsn: 3.202 ± 0.752
SerPro: 4.803 ± 1.628
SerGln: 2.668 ± 1.118
SerArg: 2.134 ± 1.25
SerSer: 6.937 ± 2.997
SerThr: 3.202 ± 2.319
SerVal: 4.803 ± 1.793
SerTrp: 0.534 ± 0.587
SerTyr: 1.601 ± 0.768
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.202 ± 0.873
ThrCys: 1.601 ± 1.159
ThrAsp: 1.601 ± 0.668
ThrGlu: 3.202 ± 1.111
ThrPhe: 1.601 ± 0.868
ThrGly: 3.735 ± 1.699
ThrHis: 3.202 ± 1.187
ThrIle: 2.668 ± 2.137
ThrLys: 1.601 ± 0.74
ThrLeu: 5.87 ± 3.793
ThrMet: 1.067 ± 1.174
ThrAsn: 0.534 ± 0.35
ThrPro: 7.471 ± 2.276
ThrGln: 1.601 ± 1.119
ThrArg: 3.202 ± 0.934
ThrSer: 3.735 ± 1.643
ThrThr: 2.668 ± 1.543
ThrVal: 2.668 ± 1.062
ThrTrp: 1.601 ± 0.668
ThrTyr: 2.134 ± 0.724
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.336 ± 1.812
ValCys: 2.134 ± 1.274
ValAsp: 5.87 ± 1.869
ValGlu: 5.336 ± 1.499
ValPhe: 5.336 ± 1.907
ValGly: 5.336 ± 1.713
ValHis: 2.668 ± 1.04
ValIle: 2.134 ± 0.928
ValLys: 3.202 ± 0.749
ValLeu: 5.87 ± 1.153
ValMet: 0.0 ± 0.0
ValAsn: 1.601 ± 0.868
ValPro: 1.601 ± 1.072
ValGln: 1.067 ± 1.05
ValArg: 3.735 ± 1.049
ValSer: 7.471 ± 2.847
ValThr: 2.668 ± 1.519
ValVal: 5.87 ± 1.789
ValTrp: 1.067 ± 0.637
ValTyr: 1.067 ± 0.699
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 2.134 ± 1.634
TrpCys: 0.0 ± 0.0
TrpAsp: 0.534 ± 0.525
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.534 ± 0.587
TrpIle: 1.067 ± 0.539
TrpLys: 1.067 ± 0.637
TrpLeu: 4.803 ± 0.911
TrpMet: 0.534 ± 0.587
TrpAsn: 0.534 ± 0.35
TrpPro: 1.067 ± 0.959
TrpGln: 0.534 ± 0.35
TrpArg: 1.067 ± 0.633
TrpSer: 1.067 ± 0.539
TrpThr: 0.0 ± 0.0
TrpVal: 1.067 ± 0.637
TrpTrp: 1.601 ± 0.985
TrpTyr: 1.067 ± 1.426
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.067 ± 0.498
TyrCys: 1.067 ± 0.633
TyrAsp: 0.534 ± 0.35
TyrGlu: 1.601 ± 0.681
TyrPhe: 0.0 ± 0.0
TyrGly: 1.067 ± 0.498
TyrHis: 0.0 ± 0.0
TyrIle: 1.067 ± 0.637
TyrLys: 2.668 ± 1.251
TyrLeu: 4.269 ± 2.501
TyrMet: 0.0 ± 0.0
TyrAsn: 1.067 ± 0.498
TyrPro: 0.534 ± 0.35
TyrGln: 1.601 ± 0.822
TyrArg: 1.601 ± 0.681
TyrSer: 3.202 ± 0.873
TyrThr: 0.534 ± 0.35
TyrVal: 1.067 ± 0.637
TyrTrp: 0.534 ± 0.587
TyrTyr: 0.534 ± 0.35
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.534 ± 0.35
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 7 proteins (1875 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
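Protein-level bootstrapping of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure actually used for this page: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is taken to be the sample standard deviation across replicates. The function names, the fixed seed and the choice of standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein only (an assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the spread of one dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKLLAA", "AALLK", "MAAAL"]  # hypothetical sequences
    print(bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much a frequency depends on which proteins happen to be in the set. With only 7 proteins, as here, a single protein can shift a frequency noticeably, which is in line with the large errors on some entries.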

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski