Amino acid dipeptide frequency for Dielma fastidiosa

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
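As an illustration only, the following Python sketch shows one way such frequencies could be computed. It is not the code used to build this table. It assumes one-letter sequence codes, overlapping windows of two residues, pairs that never span two proteins, and counts pooled over the whole proteome before normalising to per mille. The function names dipeptide_counts and dipeptide_per_mille are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumption: the input
# sequences use one-letter codes, unlike the three-letter labels in the table).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(sequence):
    """Count overlapping dipeptides in one protein sequence.

    Any residue outside the 20 standard ones is mapped to 'X' (Xaa).
    """
    residues = [aa if aa in STANDARD else "X" for aa in sequence.upper()]
    return Counter(a + b for a, b in zip(residues, residues[1:]))


def dipeptide_per_mille(sequences):
    """Pool counts over all proteins and return frequencies in per mille."""
    total = Counter()
    for seq in sequences:
        total.update(dipeptide_counts(seq))
    n_pairs = sum(total.values())
    if n_pairs == 0:
        return {}
    return {pair: 1000.0 * count / n_pairs for pair, count in total.items()}
```

A protein of length n contributes n - 1 dipeptides under these assumptions, so single-residue sequences contribute nothing.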

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
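The arithmetic behind the expected value can be sketched in a few lines of Python. With 20 amino acids there are 20 × 20 = 400 ordered pairs, so a uniform expectation is 1/400 = 0.25%, or 2.5‰ in the units of the table. The helper names below are hypothetical. The simple greater-than or less-than comparison is an assumption about how "more than expected" could be decided, not a description of the colouring rule actually used on the page.

```python
def uniform_expectation_per_mille(alphabet_size):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille(20)  # 20 * 20 = 400 possible dipeptides

# Two values taken from the table: LeuLeu and TrpTrp.
for name, value in {"LeuLeu": 10.321, "TrpTrp": 0.085}.items():
    # value * 1e-3 converts per mille to a plain fraction
    print(name, value * 1e-3, classify(value, expected))
```

Using 21 instead of 20 gives the expectation when Xaa is counted as a 21st amino acid (441 possible dipeptides).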

All values are presented in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.321 ± 0.108
AlaCys: 1.293 ± 0.031
AlaAsp: 4.769 ± 0.07
AlaGlu: 4.899 ± 0.072
AlaPhe: 3.423 ± 0.06
AlaGly: 5.122 ± 0.077
AlaHis: 1.29 ± 0.035
AlaIle: 5.907 ± 0.086
AlaLys: 4.727 ± 0.069
AlaLeu: 7.84 ± 0.09
AlaMet: 2.735 ± 0.054
AlaAsn: 3.157 ± 0.061
AlaPro: 1.754 ± 0.042
AlaGln: 2.521 ± 0.051
AlaArg: 2.341 ± 0.053
AlaSer: 4.55 ± 0.069
AlaThr: 2.811 ± 0.071
AlaVal: 5.757 ± 0.077
AlaTrp: 0.673 ± 0.025
AlaTyr: 3.282 ± 0.059
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.135 ± 0.033
CysCys: 0.317 ± 0.019
CysAsp: 0.936 ± 0.031
CysGlu: 1.067 ± 0.035
CysPhe: 0.8 ± 0.03
CysGly: 1.275 ± 0.039
CysHis: 0.332 ± 0.019
CysIle: 1.214 ± 0.036
CysLys: 0.693 ± 0.025
CysLeu: 1.55 ± 0.042
CysMet: 0.473 ± 0.023
CysAsn: 0.607 ± 0.027
CysPro: 0.549 ± 0.025
CysGln: 0.491 ± 0.025
CysArg: 0.595 ± 0.026
CysSer: 1.012 ± 0.031
CysThr: 0.807 ± 0.029
CysVal: 1.054 ± 0.034
CysTrp: 0.168 ± 0.014
CysTyr: 0.557 ± 0.023
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.289 ± 0.073
AspCys: 0.918 ± 0.035
AspAsp: 3.229 ± 0.061
AspGlu: 5.623 ± 0.082
AspPhe: 2.861 ± 0.05
AspGly: 3.673 ± 0.071
AspHis: 1.184 ± 0.035
AspIle: 4.166 ± 0.067
AspLys: 3.681 ± 0.065
AspLeu: 5.446 ± 0.081
AspMet: 1.609 ± 0.036
AspAsn: 2.445 ± 0.051
AspPro: 1.829 ± 0.046
AspGln: 2.211 ± 0.049
AspArg: 1.978 ± 0.039
AspSer: 2.968 ± 0.064
AspThr: 2.877 ± 0.053
AspVal: 3.708 ± 0.067
AspTrp: 0.557 ± 0.026
AspTyr: 2.819 ± 0.053
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.837 ± 0.082
GluCys: 0.964 ± 0.029
GluAsp: 3.584 ± 0.069
GluGlu: 5.135 ± 0.094
GluPhe: 2.552 ± 0.049
GluGly: 3.707 ± 0.066
GluHis: 1.356 ± 0.04
GluIle: 5.666 ± 0.074
GluLys: 5.495 ± 0.073
GluLeu: 7.071 ± 0.098
GluMet: 2.37 ± 0.045
GluAsn: 4.017 ± 0.064
GluPro: 1.761 ± 0.039
GluGln: 2.538 ± 0.052
GluArg: 2.762 ± 0.056
GluSer: 3.126 ± 0.06
GluThr: 3.556 ± 0.067
GluVal: 4.344 ± 0.078
GluTrp: 0.616 ± 0.025
GluTyr: 2.611 ± 0.057
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.059 ± 0.054
PheCys: 0.625 ± 0.025
PheAsp: 2.97 ± 0.055
PheGlu: 2.715 ± 0.05
PhePhe: 1.817 ± 0.045
PheGly: 2.785 ± 0.056
PheHis: 0.765 ± 0.029
PheIle: 3.727 ± 0.067
PheLys: 2.816 ± 0.058
PheLeu: 3.631 ± 0.071
PheMet: 1.387 ± 0.041
PheAsn: 2.411 ± 0.043
PhePro: 1.257 ± 0.035
PheGln: 1.18 ± 0.032
PheArg: 1.193 ± 0.037
PheSer: 2.86 ± 0.053
PheThr: 2.634 ± 0.053
PheVal: 2.744 ± 0.052
PheTrp: 0.354 ± 0.019
PheTyr: 2.008 ± 0.048
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.075 ± 0.083
GlyCys: 1.336 ± 0.037
GlyAsp: 2.894 ± 0.054
GlyGlu: 3.678 ± 0.068
GlyPhe: 2.986 ± 0.052
GlyGly: 4.356 ± 0.111
GlyHis: 1.155 ± 0.035
GlyIle: 5.856 ± 0.087
GlyLys: 4.381 ± 0.063
GlyLeu: 5.943 ± 0.087
GlyMet: 2.125 ± 0.052
GlyAsn: 2.947 ± 0.058
GlyPro: 1.183 ± 0.06
GlyGln: 1.698 ± 0.043
GlyArg: 2.254 ± 0.046
GlySer: 4.167 ± 0.078
GlyThr: 3.373 ± 0.068
GlyVal: 4.246 ± 0.072
GlyTrp: 0.688 ± 0.029
GlyTyr: 3.216 ± 0.054
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.322 ± 0.034
HisCys: 0.389 ± 0.02
HisAsp: 1.215 ± 0.033
HisGlu: 1.591 ± 0.043
HisPhe: 0.931 ± 0.027
HisGly: 1.39 ± 0.043
HisHis: 0.593 ± 0.03
HisIle: 1.334 ± 0.037
HisLys: 0.928 ± 0.028
HisLeu: 1.891 ± 0.045
HisMet: 0.497 ± 0.024
HisAsn: 0.752 ± 0.028
HisPro: 0.855 ± 0.027
HisGln: 0.927 ± 0.03
HisArg: 0.768 ± 0.026
HisSer: 0.969 ± 0.027
HisThr: 0.955 ± 0.033
HisVal: 1.246 ± 0.032
HisTrp: 0.199 ± 0.013
HisTyr: 0.93 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.289 ± 0.089
IleCys: 1.265 ± 0.031
IleAsp: 5.289 ± 0.069
IleGlu: 5.773 ± 0.083
IlePhe: 3.095 ± 0.064
IleGly: 5.126 ± 0.083
IleHis: 1.697 ± 0.052
IleIle: 5.959 ± 0.085
IleLys: 5.274 ± 0.079
IleLeu: 7.684 ± 0.111
IleMet: 2.095 ± 0.044
IleAsn: 4.083 ± 0.067
IlePro: 2.791 ± 0.053
IleGln: 2.849 ± 0.059
IleArg: 2.964 ± 0.056
IleSer: 5.167 ± 0.077
IleThr: 4.332 ± 0.073
IleVal: 5.033 ± 0.068
IleTrp: 0.563 ± 0.025
IleTyr: 3.025 ± 0.054
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.663 ± 0.078
LysCys: 0.714 ± 0.025
LysAsp: 4.048 ± 0.07
LysGlu: 5.554 ± 0.074
LysPhe: 1.755 ± 0.038
LysGly: 3.936 ± 0.074
LysHis: 1.318 ± 0.035
LysIle: 4.983 ± 0.07
LysLys: 5.309 ± 0.078
LysLeu: 6.034 ± 0.082
LysMet: 2.253 ± 0.047
LysAsn: 3.554 ± 0.068
LysPro: 2.085 ± 0.044
LysGln: 3.17 ± 0.053
LysArg: 2.891 ± 0.062
LysSer: 3.288 ± 0.051
LysThr: 3.669 ± 0.055
LysVal: 3.994 ± 0.06
LysTrp: 0.535 ± 0.023
LysTyr: 2.51 ± 0.047
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.627 ± 0.098
LeuCys: 1.742 ± 0.044
LeuAsp: 5.681 ± 0.08
LeuGlu: 5.53 ± 0.076
LeuPhe: 4.492 ± 0.079
LeuGly: 5.705 ± 0.089
LeuHis: 1.817 ± 0.044
LeuIle: 8.353 ± 0.102
LeuLys: 7.361 ± 0.082
LeuLeu: 10.321 ± 0.142
LeuMet: 3.27 ± 0.058
LeuAsn: 5.488 ± 0.073
LeuPro: 3.639 ± 0.056
LeuGln: 2.983 ± 0.058
LeuArg: 3.577 ± 0.062
LeuSer: 7.102 ± 0.089
LeuThr: 5.202 ± 0.072
LeuVal: 5.568 ± 0.077
LeuTrp: 0.678 ± 0.023
LeuTyr: 3.529 ± 0.055
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.355 ± 0.043
MetCys: 0.335 ± 0.02
MetAsp: 1.701 ± 0.041
MetGlu: 1.796 ± 0.044
MetPhe: 1.077 ± 0.032
MetGly: 1.865 ± 0.048
MetHis: 0.558 ± 0.024
MetIle: 2.773 ± 0.058
MetLys: 2.826 ± 0.043
MetLeu: 3.234 ± 0.064
MetMet: 1.072 ± 0.035
MetAsn: 2.07 ± 0.043
MetPro: 1.148 ± 0.031
MetGln: 1.183 ± 0.032
MetArg: 1.19 ± 0.039
MetSer: 1.946 ± 0.045
MetThr: 1.443 ± 0.033
MetVal: 1.788 ± 0.041
MetTrp: 0.169 ± 0.012
MetTyr: 0.742 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.625 ± 0.058
AsnCys: 0.68 ± 0.027
AsnAsp: 3.162 ± 0.059
AsnGlu: 4.154 ± 0.063
AsnPhe: 1.814 ± 0.037
AsnGly: 3.467 ± 0.068
AsnHis: 1.088 ± 0.035
AsnIle: 3.547 ± 0.07
AsnLys: 3.059 ± 0.05
AsnLeu: 4.25 ± 0.068
AsnMet: 1.299 ± 0.036
AsnAsn: 2.415 ± 0.066
AsnPro: 1.756 ± 0.041
AsnGln: 2.297 ± 0.052
AsnArg: 1.893 ± 0.049
AsnSer: 2.477 ± 0.053
AsnThr: 2.521 ± 0.051
AsnVal: 3.115 ± 0.059
AsnTrp: 0.438 ± 0.02
AsnTyr: 2.199 ± 0.045
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.401 ± 0.051
ProCys: 0.441 ± 0.021
ProAsp: 1.815 ± 0.04
ProGlu: 2.272 ± 0.048
ProPhe: 1.599 ± 0.033
ProGly: 1.535 ± 0.037
ProHis: 0.59 ± 0.021
ProIle: 2.753 ± 0.053
ProLys: 1.768 ± 0.043
ProLeu: 3.115 ± 0.067
ProMet: 0.981 ± 0.031
ProAsn: 1.41 ± 0.041
ProPro: 0.584 ± 0.024
ProGln: 1.1 ± 0.033
ProArg: 0.803 ± 0.03
ProSer: 1.795 ± 0.041
ProThr: 1.538 ± 0.04
ProVal: 2.269 ± 0.053
ProTrp: 0.274 ± 0.015
ProTyr: 1.459 ± 0.039
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.866 ± 0.054
GlnCys: 0.584 ± 0.022
GlnAsp: 1.551 ± 0.039
GlnGlu: 2.162 ± 0.052
GlnPhe: 1.502 ± 0.045
GlnGly: 2.185 ± 0.057
GlnHis: 0.715 ± 0.025
GlnIle: 2.949 ± 0.054
GlnLys: 2.411 ± 0.052
GlnLeu: 4.318 ± 0.064
GlnMet: 1.135 ± 0.036
GlnAsn: 1.611 ± 0.042
GlnPro: 1.21 ± 0.04
GlnGln: 1.672 ± 0.048
GlnArg: 1.712 ± 0.045
GlnSer: 2.038 ± 0.044
GlnThr: 1.786 ± 0.043
GlnVal: 2.04 ± 0.045
GlnTrp: 0.383 ± 0.019
GlnTyr: 1.405 ± 0.042
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.125 ± 0.05
ArgCys: 0.63 ± 0.026
ArgAsp: 1.872 ± 0.046
ArgGlu: 2.589 ± 0.049
ArgPhe: 1.821 ± 0.047
ArgGly: 1.74 ± 0.049
ArgHis: 0.794 ± 0.027
ArgIle: 3.104 ± 0.057
ArgLys: 2.713 ± 0.058
ArgLeu: 3.859 ± 0.069
ArgMet: 1.291 ± 0.04
ArgAsn: 1.76 ± 0.038
ArgPro: 0.965 ± 0.031
ArgGln: 1.597 ± 0.042
ArgArg: 1.646 ± 0.044
ArgSer: 2.107 ± 0.042
ArgThr: 1.632 ± 0.039
ArgVal: 2.201 ± 0.045
ArgTrp: 0.348 ± 0.017
ArgTyr: 1.831 ± 0.043
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.512 ± 0.062
SerCys: 0.882 ± 0.029
SerAsp: 3.537 ± 0.064
SerGlu: 4.146 ± 0.055
SerPhe: 2.846 ± 0.057
SerGly: 4.383 ± 0.076
SerHis: 1.081 ± 0.033
SerIle: 4.818 ± 0.07
SerLys: 3.762 ± 0.058
SerLeu: 5.863 ± 0.08
SerMet: 1.876 ± 0.043
SerAsn: 2.63 ± 0.059
SerPro: 1.439 ± 0.036
SerGln: 1.978 ± 0.045
SerArg: 2.106 ± 0.049
SerSer: 3.625 ± 0.072
SerThr: 2.808 ± 0.062
SerVal: 3.989 ± 0.063
SerTrp: 0.519 ± 0.023
SerTyr: 2.48 ± 0.054
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.15 ± 0.064
ThrCys: 0.715 ± 0.025
ThrAsp: 2.726 ± 0.058
ThrGlu: 2.662 ± 0.058
ThrPhe: 2.24 ± 0.047
ThrGly: 3.407 ± 0.079
ThrHis: 1.045 ± 0.031
ThrIle: 4.52 ± 0.07
ThrLys: 2.765 ± 0.056
ThrLeu: 5.495 ± 0.078
ThrMet: 1.495 ± 0.039
ThrAsn: 2.169 ± 0.044
ThrPro: 1.96 ± 0.05
ThrGln: 1.622 ± 0.036
ThrArg: 1.649 ± 0.038
ThrSer: 3.073 ± 0.064
ThrThr: 2.614 ± 0.062
ThrVal: 3.682 ± 0.073
ThrTrp: 0.451 ± 0.024
ThrTyr: 2.219 ± 0.052
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.01 ± 0.078
ValCys: 1.079 ± 0.031
ValAsp: 3.873 ± 0.06
ValGlu: 4.242 ± 0.07
ValPhe: 2.832 ± 0.058
ValGly: 3.692 ± 0.07
ValHis: 1.104 ± 0.036
ValIle: 5.333 ± 0.082
ValLys: 4.583 ± 0.067
ValLeu: 6.775 ± 0.089
ValMet: 2.001 ± 0.043
ValAsn: 3.38 ± 0.059
ValPro: 1.94 ± 0.038
ValGln: 1.819 ± 0.039
ValArg: 2.242 ± 0.05
ValSer: 4.388 ± 0.063
ValThr: 3.412 ± 0.069
ValVal: 4.239 ± 0.078
ValTrp: 0.503 ± 0.022
ValTyr: 2.64 ± 0.048
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.497 ± 0.025
TrpCys: 0.145 ± 0.012
TrpAsp: 0.428 ± 0.021
TrpGlu: 0.463 ± 0.021
TrpPhe: 0.411 ± 0.02
TrpGly: 0.551 ± 0.026
TrpHis: 0.216 ± 0.016
TrpIle: 0.768 ± 0.026
TrpLys: 0.433 ± 0.019
TrpLeu: 1.078 ± 0.036
TrpMet: 0.245 ± 0.015
TrpAsn: 0.393 ± 0.02
TrpPro: 0.225 ± 0.015
TrpGln: 0.472 ± 0.023
TrpArg: 0.308 ± 0.016
TrpSer: 0.44 ± 0.021
TrpThr: 0.389 ± 0.022
TrpVal: 0.554 ± 0.023
TrpTrp: 0.085 ± 0.009
TrpTyr: 0.463 ± 0.026
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.012 ± 0.054
TyrCys: 0.577 ± 0.025
TyrAsp: 2.791 ± 0.052
TyrGlu: 3.021 ± 0.058
TyrPhe: 2.083 ± 0.05
TyrGly: 2.657 ± 0.053
TyrHis: 0.982 ± 0.03
TyrIle: 2.624 ± 0.05
TyrLys: 2.282 ± 0.052
TyrLeu: 4.264 ± 0.064
TyrMet: 1.062 ± 0.03
TyrAsn: 1.954 ± 0.051
TyrPro: 1.574 ± 0.036
TyrGln: 1.927 ± 0.044
TyrArg: 1.755 ± 0.046
TyrSer: 2.182 ± 0.051
TyrThr: 2.301 ± 0.05
TyrVal: 2.514 ± 0.049
TyrTrp: 0.363 ± 0.019
TyrTyr: 2.199 ± 0.057
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3467 proteins (1067974 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
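To make the note concrete, here is a minimal Python sketch of a protein-level bootstrap. It is a guess at the general procedure, not the database's actual implementation. It assumes that whole proteins are resampled with replacement, that the dipeptide frequency is recomputed for each of the 100 replicates, and that the reported ± value corresponds to the standard deviation across replicates. The function names pair_counts and bootstrap_error, and the seed parameter, are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(sequence):
    """Overlapping dipeptide counts for one protein (one-letter codes assumed)."""
    return Counter(a + b for a, b in zip(sequence, sequence[1:]))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per-mille frequency of `pair`.

    Each round draws len(sequences) proteins with replacement, so proteins
    (not individual dipeptides) are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(counts[pair] for counts in sample)
        total = sum(sum(counts.values()) for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than single dipeptides keeps residues from the same protein together, which is one plausible reading of "at the protein level".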

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski