Amino acid dipepetide frequency for Plasmodium falciparum (isolate HB3)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.883AlaAla: 0.883 ± 0.029
0.482AlaCys: 0.482 ± 0.015
1.104AlaAsp: 1.104 ± 0.026
1.195AlaGlu: 1.195 ± 0.028
0.945AlaPhe: 0.945 ± 0.017
0.726AlaGly: 0.726 ± 0.023
0.594AlaHis: 0.594 ± 0.021
1.597AlaIle: 1.597 ± 0.027
1.786AlaLys: 1.786 ± 0.031
1.902AlaLeu: 1.902 ± 0.031
0.365AlaMet: 0.365 ± 0.011
1.579AlaAsn: 1.579 ± 0.028
0.601AlaPro: 0.601 ± 0.024
0.713AlaGln: 0.713 ± 0.015
0.592AlaArg: 0.592 ± 0.016
1.524AlaSer: 1.524 ± 0.028
0.971AlaThr: 0.971 ± 0.023
0.869AlaVal: 0.869 ± 0.022
0.132AlaTrp: 0.132 ± 0.007
1.093AlaTyr: 1.093 ± 0.018
0.0AlaXaa: 0.0 ± 0.0
Cys
0.527CysAla: 0.527 ± 0.016
0.318CysCys: 0.318 ± 0.011
1.3CysAsp: 1.3 ± 0.024
1.148CysGlu: 1.148 ± 0.023
0.823CysPhe: 0.823 ± 0.014
0.708CysGly: 0.708 ± 0.018
0.327CysHis: 0.327 ± 0.01
1.729CysIle: 1.729 ± 0.024
1.626CysLys: 1.626 ± 0.029
1.522CysLeu: 1.522 ± 0.022
0.358CysMet: 0.358 ± 0.011
1.872CysAsn: 1.872 ± 0.028
0.474CysPro: 0.474 ± 0.016
0.348CysGln: 0.348 ± 0.012
0.488CysArg: 0.488 ± 0.014
1.424CysSer: 1.424 ± 0.027
0.932CysThr: 0.932 ± 0.019
0.897CysVal: 0.897 ± 0.017
0.069CysTrp: 0.069 ± 0.005
0.817CysTyr: 0.817 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
1.237AspAla: 1.237 ± 0.024
0.709AspCys: 0.709 ± 0.019
7.255AspAsp: 7.255 ± 0.124
5.978AspGlu: 5.978 ± 0.064
2.114AspPhe: 2.114 ± 0.025
1.775AspGly: 1.775 ± 0.032
1.528AspHis: 1.528 ± 0.035
7.421AspIle: 7.421 ± 0.063
6.82AspLys: 6.82 ± 0.066
3.804AspLeu: 3.804 ± 0.042
1.721AspMet: 1.721 ± 0.028
9.171AspAsn: 9.171 ± 0.107
1.028AspPro: 1.028 ± 0.026
1.442AspGln: 1.442 ± 0.027
1.26AspArg: 1.26 ± 0.028
3.202AspSer: 3.202 ± 0.034
2.756AspThr: 2.756 ± 0.034
2.888AspVal: 2.888 ± 0.033
0.232AspTrp: 0.232 ± 0.011
2.853AspTyr: 2.853 ± 0.034
0.0AspXaa: 0.0 ± 0.0
Glu
1.447GluAla: 1.447 ± 0.026
1.103GluCys: 1.103 ± 0.021
4.866GluAsp: 4.866 ± 0.062
8.629GluGlu: 8.629 ± 0.24
2.01GluPhe: 2.01 ± 0.029
2.126GluGly: 2.126 ± 0.05
1.856GluHis: 1.856 ± 0.029
5.443GluIle: 5.443 ± 0.076
10.805GluLys: 10.805 ± 0.1
4.595GluLeu: 4.595 ± 0.057
1.392GluMet: 1.392 ± 0.023
9.029GluAsn: 9.029 ± 0.083
1.02GluPro: 1.02 ± 0.02
2.751GluGln: 2.751 ± 0.047
2.155GluArg: 2.155 ± 0.037
3.37GluSer: 3.37 ± 0.049
2.412GluThr: 2.412 ± 0.031
2.254GluVal: 2.254 ± 0.09
0.506GluTrp: 0.506 ± 0.021
3.716GluTyr: 3.716 ± 0.039
0.0GluXaa: 0.0 ± 0.0
Phe
0.801PheAla: 0.801 ± 0.016
0.952PheCys: 0.952 ± 0.016
2.452PheAsp: 2.452 ± 0.029
2.31PheGlu: 2.31 ± 0.032
3.365PhePhe: 3.365 ± 0.054
1.152PheGly: 1.152 ± 0.025
1.139PheHis: 1.139 ± 0.017
4.261PheIle: 4.261 ± 0.052
3.64PheLys: 3.64 ± 0.038
5.025PheLeu: 5.025 ± 0.06
0.915PheMet: 0.915 ± 0.017
4.29PheAsn: 4.29 ± 0.048
1.02PhePro: 1.02 ± 0.019
1.112PheGln: 1.112 ± 0.019
0.981PheArg: 0.981 ± 0.017
3.244PheSer: 3.244 ± 0.037
1.573PheThr: 1.573 ± 0.023
1.979PheVal: 1.979 ± 0.024
0.208PheTrp: 0.208 ± 0.008
2.852PheTyr: 2.852 ± 0.038
0.0PheXaa: 0.0 ± 0.0
Gly
0.816GlyAla: 0.816 ± 0.021
0.541GlyCys: 0.541 ± 0.016
2.24GlyAsp: 2.24 ± 0.04
1.926GlyGlu: 1.926 ± 0.038
1.079GlyPhe: 1.079 ± 0.02
1.507GlyGly: 1.507 ± 0.037
0.63GlyHis: 0.63 ± 0.016
2.412GlyIle: 2.412 ± 0.03
3.124GlyLys: 3.124 ± 0.042
1.902GlyLeu: 1.902 ± 0.029
0.593GlyMet: 0.593 ± 0.014
3.151GlyAsn: 3.151 ± 0.048
0.528GlyPro: 0.528 ± 0.018
0.721GlyGln: 0.721 ± 0.021
0.903GlyArg: 0.903 ± 0.02
2.054GlySer: 2.054 ± 0.052
1.5GlyThr: 1.5 ± 0.033
1.326GlyVal: 1.326 ± 0.026
0.171GlyTrp: 0.171 ± 0.006
1.448GlyTyr: 1.448 ± 0.022
0.0GlyXaa: 0.0 ± 0.0
His
0.478HisAla: 0.478 ± 0.019
0.309HisCys: 0.309 ± 0.011
1.416HisAsp: 1.416 ± 0.024
1.324HisGlu: 1.324 ± 0.02
1.299HisPhe: 1.299 ± 0.025
0.622HisGly: 0.622 ± 0.016
0.795HisHis: 0.795 ± 0.032
2.909HisIle: 2.909 ± 0.042
2.34HisLys: 2.34 ± 0.031
1.908HisLeu: 1.908 ± 0.021
0.861HisMet: 0.861 ± 0.016
3.468HisAsn: 3.468 ± 0.053
0.556HisPro: 0.556 ± 0.013
0.515HisGln: 0.515 ± 0.013
0.529HisArg: 0.529 ± 0.015
1.421HisSer: 1.421 ± 0.021
1.18HisThr: 1.18 ± 0.019
1.137HisVal: 1.137 ± 0.022
0.098HisTrp: 0.098 ± 0.005
1.059HisTyr: 1.059 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
1.496IleAla: 1.496 ± 0.027
2.071IleCys: 2.071 ± 0.027
4.809IleAsp: 4.809 ± 0.043
5.183IleGlu: 5.183 ± 0.054
4.68IlePhe: 4.68 ± 0.058
2.169IleGly: 2.169 ± 0.03
2.634IleHis: 2.634 ± 0.034
8.321IleIle: 8.321 ± 0.081
10.613IleLys: 10.613 ± 0.081
8.241IleLeu: 8.241 ± 0.073
1.641IleMet: 1.641 ± 0.021
12.862IleAsn: 12.862 ± 0.118
2.417IlePro: 2.417 ± 0.044
2.985IleGln: 2.985 ± 0.033
2.261IleArg: 2.261 ± 0.028
6.151IleSer: 6.151 ± 0.049
3.469IleThr: 3.469 ± 0.031
2.905IleVal: 2.905 ± 0.057
0.521IleTrp: 0.521 ± 0.016
6.497IleTyr: 6.497 ± 0.071
0.0IleXaa: 0.0 ± 0.0
Lys
1.838LysAla: 1.838 ± 0.033
2.084LysCys: 2.084 ± 0.037
6.952LysAsp: 6.952 ± 0.057
10.273LysGlu: 10.273 ± 0.094
3.155LysPhe: 3.155 ± 0.033
3.438LysGly: 3.438 ± 0.042
2.401LysHis: 2.401 ± 0.031
9.494LysIle: 9.494 ± 0.065
20.03LysLys: 20.03 ± 0.184
7.388LysLeu: 7.388 ± 0.066
2.6LysMet: 2.6 ± 0.033
17.051LysAsn: 17.051 ± 0.119
1.536LysPro: 1.536 ± 0.03
3.162LysGln: 3.162 ± 0.038
4.238LysArg: 4.238 ± 0.046
6.2LysSer: 6.2 ± 0.052
4.225LysThr: 4.225 ± 0.033
3.35LysVal: 3.35 ± 0.041
0.68LysTrp: 0.68 ± 0.023
7.021LysTyr: 7.021 ± 0.067
0.0LysXaa: 0.0 ± 0.0
Leu
1.556LeuAla: 1.556 ± 0.028
1.784LeuCys: 1.784 ± 0.024
3.644LeuAsp: 3.644 ± 0.034
4.474LeuGlu: 4.474 ± 0.057
4.43LeuPhe: 4.43 ± 0.052
1.971LeuGly: 1.971 ± 0.028
1.864LeuHis: 1.864 ± 0.025
6.201LeuIle: 6.201 ± 0.06
9.075LeuLys: 9.075 ± 0.072
7.456LeuLeu: 7.456 ± 0.072
1.278LeuMet: 1.278 ± 0.019
8.821LeuAsn: 8.821 ± 0.074
1.882LeuPro: 1.882 ± 0.033
2.41LeuGln: 2.41 ± 0.031
2.347LeuArg: 2.347 ± 0.029
5.618LeuSer: 5.618 ± 0.052
2.874LeuThr: 2.874 ± 0.035
2.345LeuVal: 2.345 ± 0.041
0.481LeuTrp: 0.481 ± 0.014
4.953LeuTyr: 4.953 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
0.384MetAla: 0.384 ± 0.012
0.46MetCys: 0.46 ± 0.013
1.606MetAsp: 1.606 ± 0.024
1.515MetGlu: 1.515 ± 0.025
0.873MetPhe: 0.873 ± 0.017
0.608MetGly: 0.608 ± 0.018
0.434MetHis: 0.434 ± 0.013
1.529MetIle: 1.529 ± 0.024
2.888MetLys: 2.888 ± 0.034
1.648MetLeu: 1.648 ± 0.023
0.484MetMet: 0.484 ± 0.012
3.989MetAsn: 3.989 ± 0.078
0.379MetPro: 0.379 ± 0.011
0.514MetGln: 0.514 ± 0.012
0.55MetArg: 0.55 ± 0.014
1.451MetSer: 1.451 ± 0.023
0.679MetThr: 0.679 ± 0.018
0.682MetVal: 0.682 ± 0.015
0.121MetTrp: 0.121 ± 0.006
1.191MetTyr: 1.191 ± 0.023
0.0MetXaa: 0.0 ± 0.0
Asn
1.953AsnAla: 1.953 ± 0.028
1.594AsnCys: 1.594 ± 0.025
10.114AsnAsp: 10.114 ± 0.12
9.617AsnGlu: 9.617 ± 0.09
4.787AsnPhe: 4.787 ± 0.047
3.167AsnGly: 3.167 ± 0.052
2.851AsnHis: 2.851 ± 0.051
16.11AsnIle: 16.11 ± 0.138
15.513AsnLys: 15.513 ± 0.123
7.555AsnLeu: 7.555 ± 0.061
4.083AsnMet: 4.083 ± 0.073
32.713AsnAsn: 32.713 ± 0.534
1.741AsnPro: 1.741 ± 0.028
2.878AsnGln: 2.878 ± 0.035
2.466AsnArg: 2.466 ± 0.034
7.668AsnSer: 7.668 ± 0.083
5.556AsnThr: 5.556 ± 0.058
6.295AsnVal: 6.295 ± 0.061
0.339AsnTrp: 0.339 ± 0.01
7.031AsnTyr: 7.031 ± 0.073
0.0AsnXaa: 0.0 ± 0.0
Pro
0.439ProAla: 0.439 ± 0.017
0.43ProCys: 0.43 ± 0.014
0.891ProAsp: 0.891 ± 0.034
1.139ProGlu: 1.139 ± 0.063
1.262ProPhe: 1.262 ± 0.024
0.563ProGly: 0.563 ± 0.018
0.542ProHis: 0.542 ± 0.013
1.693ProIle: 1.693 ± 0.027
1.721ProLys: 1.721 ± 0.031
1.881ProLeu: 1.881 ± 0.027
0.366ProMet: 0.366 ± 0.01
2.055ProAsn: 2.055 ± 0.031
0.835ProPro: 0.835 ± 0.033
0.689ProGln: 0.689 ± 0.019
0.544ProArg: 0.544 ± 0.013
1.633ProSer: 1.633 ± 0.029
1.027ProThr: 1.027 ± 0.024
0.81ProVal: 0.81 ± 0.024
0.137ProTrp: 0.137 ± 0.006
1.329ProTyr: 1.329 ± 0.022
0.0ProXaa: 0.0 ± 0.0
Gln
0.589GlnAla: 0.589 ± 0.016
0.419GlnCys: 0.419 ± 0.014
1.412GlnAsp: 1.412 ± 0.023
2.061GlnGlu: 2.061 ± 0.043
0.965GlnPhe: 0.965 ± 0.017
0.775GlnGly: 0.775 ± 0.016
0.747GlnHis: 0.747 ± 0.019
2.482GlnIle: 2.482 ± 0.034
3.595GlnLys: 3.595 ± 0.039
1.933GlnLeu: 1.933 ± 0.026
0.703GlnMet: 0.703 ± 0.016
4.303GlnAsn: 4.303 ± 0.05
0.548GlnPro: 0.548 ± 0.017
1.168GlnGln: 1.168 ± 0.05
0.828GlnArg: 0.828 ± 0.017
1.483GlnSer: 1.483 ± 0.023
1.286GlnThr: 1.286 ± 0.026
0.942GlnVal: 0.942 ± 0.017
0.174GlnTrp: 0.174 ± 0.008
1.36GlnTyr: 1.36 ± 0.024
0.0GlnXaa: 0.0 ± 0.0
Arg
0.644ArgAla: 0.644 ± 0.017
0.447ArgCys: 0.447 ± 0.013
1.542ArgAsp: 1.542 ± 0.031
1.797ArgGlu: 1.797 ± 0.032
0.967ArgPhe: 0.967 ± 0.019
0.986ArgGly: 0.986 ± 0.023
0.571ArgHis: 0.571 ± 0.013
2.143ArgIle: 2.143 ± 0.026
3.915ArgLys: 3.915 ± 0.048
1.792ArgLeu: 1.792 ± 0.028
0.542ArgMet: 0.542 ± 0.014
3.415ArgAsn: 3.415 ± 0.039
0.442ArgPro: 0.442 ± 0.014
0.712ArgGln: 0.712 ± 0.016
1.449ArgArg: 1.449 ± 0.033
1.612ArgSer: 1.612 ± 0.027
1.14ArgThr: 1.14 ± 0.019
0.887ArgVal: 0.887 ± 0.018
0.207ArgTrp: 0.207 ± 0.01
1.335ArgTyr: 1.335 ± 0.02
0.0ArgXaa: 0.0 ± 0.0
Ser
1.389SerAla: 1.389 ± 0.029
1.243SerCys: 1.243 ± 0.021
4.071SerAsp: 4.071 ± 0.053
3.534SerGlu: 3.534 ± 0.045
3.542SerPhe: 3.542 ± 0.032
2.102SerGly: 2.102 ± 0.05
1.483SerHis: 1.483 ± 0.022
5.455SerIle: 5.455 ± 0.048
5.668SerLys: 5.668 ± 0.044
5.344SerLeu: 5.344 ± 0.051
1.181SerMet: 1.181 ± 0.02
8.154SerAsn: 8.154 ± 0.091
1.416SerPro: 1.416 ± 0.034
1.613SerGln: 1.613 ± 0.025
1.58SerArg: 1.58 ± 0.023
6.493SerSer: 6.493 ± 0.075
3.085SerThr: 3.085 ± 0.034
2.718SerVal: 2.718 ± 0.062
0.281SerTrp: 0.281 ± 0.009
3.787SerTyr: 3.787 ± 0.035
0.0SerXaa: 0.0 ± 0.0
Thr
0.916ThrAla: 0.916 ± 0.024
1.005ThrCys: 1.005 ± 0.019
2.071ThrAsp: 2.071 ± 0.031
2.162ThrGlu: 2.162 ± 0.07
2.179ThrPhe: 2.179 ± 0.027
1.123ThrGly: 1.123 ± 0.023
1.218ThrHis: 1.218 ± 0.021
3.102ThrIle: 3.102 ± 0.034
4.081ThrLys: 4.081 ± 0.04
3.422ThrLeu: 3.422 ± 0.038
0.698ThrMet: 0.698 ± 0.017
5.723ThrAsn: 5.723 ± 0.062
1.162ThrPro: 1.162 ± 0.026
1.354ThrGln: 1.354 ± 0.022
0.928ThrArg: 0.928 ± 0.015
3.23ThrSer: 3.23 ± 0.036
2.277ThrThr: 2.277 ± 0.04
1.363ThrVal: 1.363 ± 0.026
0.219ThrTrp: 0.219 ± 0.008
2.79ThrTyr: 2.79 ± 0.037
0.0ThrXaa: 0.0 ± 0.0
Val
0.977ValAla: 0.977 ± 0.032
0.827ValCys: 0.827 ± 0.017
2.723ValAsp: 2.723 ± 0.038
2.984ValGlu: 2.984 ± 0.098
1.574ValPhe: 1.574 ± 0.023
1.312ValGly: 1.312 ± 0.026
1.237ValHis: 1.237 ± 0.02
3.043ValIle: 3.043 ± 0.04
3.561ValLys: 3.561 ± 0.04
3.35ValLeu: 3.35 ± 0.035
0.676ValMet: 0.676 ± 0.014
4.079ValAsn: 4.079 ± 0.051
1.171ValPro: 1.171 ± 0.044
1.387ValGln: 1.387 ± 0.023
0.98ValArg: 0.98 ± 0.017
2.605ValSer: 2.605 ± 0.042
1.621ValThr: 1.621 ± 0.039
1.78ValVal: 1.78 ± 0.055
0.233ValTrp: 0.233 ± 0.009
1.953ValTyr: 1.953 ± 0.025
0.0ValXaa: 0.0 ± 0.0
Trp
0.152TrpAla: 0.152 ± 0.008
0.098TrpCys: 0.098 ± 0.006
0.298TrpAsp: 0.298 ± 0.011
0.322TrpGlu: 0.322 ± 0.013
0.272TrpPhe: 0.272 ± 0.011
0.223TrpGly: 0.223 ± 0.009
0.083TrpHis: 0.083 ± 0.004
0.509TrpIle: 0.509 ± 0.018
0.658TrpLys: 0.658 ± 0.02
0.433TrpLeu: 0.433 ± 0.013
0.107TrpMet: 0.107 ± 0.006
0.546TrpAsn: 0.546 ± 0.015
0.096TrpPro: 0.096 ± 0.005
0.087TrpGln: 0.087 ± 0.004
0.183TrpArg: 0.183 ± 0.007
0.316TrpSer: 0.316 ± 0.011
0.2TrpThr: 0.2 ± 0.008
0.232TrpVal: 0.232 ± 0.01
0.091TrpTrp: 0.091 ± 0.007
0.214TrpTyr: 0.214 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.13TyrAla: 1.13 ± 0.019
0.832TyrCys: 0.832 ± 0.019
4.429TyrAsp: 4.429 ± 0.045
3.915TyrGlu: 3.915 ± 0.04
2.951TyrPhe: 2.951 ± 0.037
1.521TyrGly: 1.521 ± 0.025
1.291TyrHis: 1.291 ± 0.022
6.068TyrIle: 6.068 ± 0.069
5.655TyrLys: 5.655 ± 0.058
4.364TyrLeu: 4.364 ± 0.045
1.422TyrMet: 1.422 ± 0.02
7.76TyrAsn: 7.76 ± 0.088
1.136TyrPro: 1.136 ± 0.022
1.208TyrGln: 1.208 ± 0.022
1.284TyrArg: 1.284 ± 0.023
3.441TyrSer: 3.441 ± 0.038
2.259TyrThr: 2.259 ± 0.026
2.416TyrVal: 2.416 ± 0.027
0.22TyrTrp: 0.22 ± 0.008
3.409TyrTyr: 3.409 ± 0.048
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5452 proteins (3784517 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski