Amino acid dipeptide frequency for Plasmodium falciparum (isolate Palo Alto / Uganda)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.
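As a minimal sketch of how such frequencies could be computed (an illustration, not necessarily the exact procedure used by Proteome-pI): count every overlapping pair of adjacent residues within each protein, never pairing residues across protein boundaries, and divide by the total number of pairs. The function name `dipeptide_frequencies` and the toy sequences are hypothetical; one-letter codes are used, and any letter outside the 20 standard ones is counted as X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Pairs are overlapping (positions i and i+1) and never span two proteins.
    Letters outside the 20 standard ones are counted as 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome)
freqs = dipeptide_frequencies(["MKKNN", "NNK"])
print(freqs)
```

Because the frequencies are normalised by the total number of pairs, they sum to 1000‰ over all observed dipeptides.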

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
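The arithmetic above can be sketched as follows. The uniform expectation is 1/400 = 0.0025, i.e. 2.5‰ (about 2.27‰ if Xaa is counted, 1/441). The example values are copied from the table on this page; the helper names `per_mille_to_fraction` and `classify`, and the plain comparison against the uniform expectation, are assumptions made for illustration and may differ from how the page assigns its colours.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, n_residues=20):
    """Compare an observed frequency with the uniform expectation 1/n^2."""
    expected = 1.0 / (n_residues ** 2)  # 0.0025 for 20 residues
    observed = per_mille_to_fraction(value_per_mille)
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


# Values taken from the table (per mille)
observed = {"AsnAsn": 32.83, "LysLys": 20.112, "AlaAla": 0.881, "TrpTrp": 0.087}
for pair, value in observed.items():
    print(pair, per_mille_to_fraction(value), classify(value))
```

Passing `n_residues=21` switches the expectation to 1/441 for the case where Xaa is treated as a 21st residue.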

For more information, see the sequence space article on Wikipedia.

Dipeptides are grouped by first residue; within each group the second residue follows the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa. Each entry is given as frequency ± error (‰).

Ala
AlaAla: 0.881 ± 0.028
AlaCys: 0.487 ± 0.014
AlaAsp: 1.048 ± 0.021
AlaGlu: 1.165 ± 0.025
AlaPhe: 0.927 ± 0.017
AlaGly: 0.726 ± 0.017
AlaHis: 0.585 ± 0.014
AlaIle: 1.614 ± 0.026
AlaLys: 1.743 ± 0.026
AlaLeu: 1.898 ± 0.027
AlaMet: 0.363 ± 0.012
AlaAsn: 1.572 ± 0.025
AlaPro: 0.574 ± 0.018
AlaGln: 0.693 ± 0.017
AlaArg: 0.588 ± 0.015
AlaSer: 1.475 ± 0.025
AlaThr: 0.976 ± 0.023
AlaVal: 0.861 ± 0.02
AlaTrp: 0.131 ± 0.005
AlaTyr: 1.085 ± 0.02
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.527 ± 0.014
CysCys: 0.328 ± 0.01
CysAsp: 1.272 ± 0.022
CysGlu: 1.114 ± 0.021
CysPhe: 0.861 ± 0.016
CysGly: 0.71 ± 0.018
CysHis: 0.327 ± 0.009
CysIle: 1.783 ± 0.028
CysLys: 1.635 ± 0.027
CysLeu: 1.557 ± 0.023
CysMet: 0.369 ± 0.01
CysAsn: 1.888 ± 0.026
CysPro: 0.466 ± 0.014
CysGln: 0.347 ± 0.011
CysArg: 0.495 ± 0.015
CysSer: 1.436 ± 0.022
CysThr: 0.937 ± 0.019
CysVal: 0.901 ± 0.017
CysTrp: 0.069 ± 0.005
CysTyr: 0.845 ± 0.017
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.208 ± 0.021
AspCys: 0.68 ± 0.015
AspAsp: 6.812 ± 0.097
AspGlu: 5.79 ± 0.063
AspPhe: 2.135 ± 0.024
AspGly: 1.704 ± 0.032
AspHis: 1.475 ± 0.026
AspIle: 7.36 ± 0.054
AspLys: 6.718 ± 0.057
AspLeu: 3.745 ± 0.034
AspMet: 1.706 ± 0.023
AspAsn: 8.952 ± 0.088
AspPro: 0.998 ± 0.021
AspGln: 1.428 ± 0.023
AspArg: 1.211 ± 0.027
AspSer: 3.127 ± 0.035
AspThr: 2.671 ± 0.03
AspVal: 2.814 ± 0.036
AspTrp: 0.221 ± 0.009
AspTyr: 2.821 ± 0.029
AspXaa: 0.0 ± 0.0

Glu
GluAla: 1.404 ± 0.03
GluCys: 1.099 ± 0.022
GluAsp: 4.723 ± 0.063
GluGlu: 8.028 ± 0.168
GluPhe: 2.021 ± 0.028
GluGly: 2.001 ± 0.038
GluHis: 1.839 ± 0.024
GluIle: 5.335 ± 0.058
GluLys: 10.67 ± 0.09
GluLeu: 4.525 ± 0.054
GluMet: 1.366 ± 0.019
GluAsn: 8.999 ± 0.075
GluPro: 0.978 ± 0.022
GluGln: 2.647 ± 0.034
GluArg: 2.091 ± 0.037
GluSer: 3.275 ± 0.044
GluThr: 2.38 ± 0.036
GluVal: 2.095 ± 0.073
GluTrp: 0.479 ± 0.02
GluTyr: 3.716 ± 0.035
GluXaa: 0.0 ± 0.0

Phe
PheAla: 0.808 ± 0.014
PheCys: 0.98 ± 0.017
PheAsp: 2.479 ± 0.031
PheGlu: 2.341 ± 0.028
PhePhe: 3.518 ± 0.049
PheGly: 1.149 ± 0.025
PheHis: 1.163 ± 0.016
PheIle: 4.423 ± 0.051
PheLys: 3.739 ± 0.037
PheLeu: 5.17 ± 0.059
PheMet: 0.933 ± 0.018
PheAsn: 4.417 ± 0.045
PhePro: 1.019 ± 0.019
PheGln: 1.138 ± 0.017
PheArg: 0.98 ± 0.018
PheSer: 3.304 ± 0.034
PheThr: 1.594 ± 0.025
PheVal: 2.013 ± 0.025
PheTrp: 0.212 ± 0.008
PheTyr: 2.984 ± 0.036
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 0.816 ± 0.023
GlyCys: 0.522 ± 0.013
GlyAsp: 2.074 ± 0.04
GlyGlu: 1.858 ± 0.035
GlyPhe: 1.075 ± 0.019
GlyGly: 1.469 ± 0.039
GlyHis: 0.626 ± 0.014
GlyIle: 2.39 ± 0.029
GlyLys: 3.048 ± 0.039
GlyLeu: 1.959 ± 0.028
GlyMet: 0.587 ± 0.016
GlyAsn: 3.068 ± 0.038
GlyPro: 0.538 ± 0.016
GlyGln: 0.667 ± 0.014
GlyArg: 0.889 ± 0.019
GlySer: 1.961 ± 0.039
GlyThr: 1.456 ± 0.028
GlyVal: 1.296 ± 0.02
GlyTrp: 0.167 ± 0.006
GlyTyr: 1.462 ± 0.024
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.459 ± 0.012
HisCys: 0.307 ± 0.009
HisAsp: 1.391 ± 0.02
HisGlu: 1.316 ± 0.017
HisPhe: 1.349 ± 0.021
HisGly: 0.606 ± 0.015
HisHis: 0.762 ± 0.024
HisIle: 2.998 ± 0.037
HisLys: 2.332 ± 0.028
HisLeu: 1.929 ± 0.023
HisMet: 0.874 ± 0.02
HisAsn: 3.425 ± 0.045
HisPro: 0.55 ± 0.013
HisGln: 0.536 ± 0.014
HisArg: 0.528 ± 0.013
HisSer: 1.429 ± 0.02
HisThr: 1.174 ± 0.02
HisVal: 1.148 ± 0.022
HisTrp: 0.093 ± 0.005
HisTyr: 1.082 ± 0.018
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.477 ± 0.028
IleCys: 2.102 ± 0.029
IleAsp: 4.822 ± 0.047
IleGlu: 5.178 ± 0.051
IlePhe: 4.912 ± 0.059
IleGly: 2.192 ± 0.03
IleHis: 2.672 ± 0.033
IleIle: 8.627 ± 0.08
IleLys: 10.791 ± 0.083
IleLeu: 8.516 ± 0.072
IleMet: 1.686 ± 0.023
IleAsn: 13.025 ± 0.116
IlePro: 2.385 ± 0.042
IleGln: 2.987 ± 0.036
IleArg: 2.314 ± 0.028
IleSer: 6.237 ± 0.046
IleThr: 3.53 ± 0.033
IleVal: 2.86 ± 0.04
IleTrp: 0.514 ± 0.013
IleTyr: 6.69 ± 0.075
IleXaa: 0.0 ± 0.0

Lys
LysAla: 1.804 ± 0.029
LysCys: 2.082 ± 0.034
LysAsp: 6.927 ± 0.056
LysGlu: 10.129 ± 0.087
LysPhe: 3.212 ± 0.031
LysGly: 3.396 ± 0.035
LysHis: 2.406 ± 0.027
LysIle: 9.669 ± 0.08
LysLys: 20.112 ± 0.178
LysLeu: 7.484 ± 0.05
LysMet: 2.606 ± 0.03
LysAsn: 17.182 ± 0.112
LysPro: 1.502 ± 0.024
LysGln: 3.142 ± 0.035
LysArg: 4.269 ± 0.049
LysSer: 6.2 ± 0.045
LysThr: 4.225 ± 0.043
LysVal: 3.346 ± 0.034
LysTrp: 0.643 ± 0.018
LysTyr: 7.103 ± 0.066
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 1.552 ± 0.025
LeuCys: 1.83 ± 0.026
LeuAsp: 3.603 ± 0.037
LeuGlu: 4.369 ± 0.044
LeuPhe: 4.604 ± 0.051
LeuGly: 1.986 ± 0.03
LeuHis: 1.892 ± 0.025
LeuIle: 6.371 ± 0.056
LeuLys: 9.218 ± 0.062
LeuLeu: 7.652 ± 0.071
LeuMet: 1.328 ± 0.02
LeuAsn: 8.947 ± 0.066
LeuPro: 1.816 ± 0.027
LeuGln: 2.395 ± 0.031
LeuArg: 2.328 ± 0.029
LeuSer: 5.671 ± 0.049
LeuThr: 2.905 ± 0.033
LeuVal: 2.342 ± 0.035
LeuTrp: 0.478 ± 0.013
LeuTyr: 5.135 ± 0.052
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.383 ± 0.012
MetCys: 0.477 ± 0.012
MetAsp: 1.574 ± 0.023
MetGlu: 1.511 ± 0.022
MetPhe: 0.91 ± 0.014
MetGly: 0.578 ± 0.014
MetHis: 0.444 ± 0.012
MetIle: 1.578 ± 0.021
MetLys: 2.901 ± 0.031
MetLeu: 1.708 ± 0.021
MetMet: 0.513 ± 0.015
MetAsn: 4.009 ± 0.074
MetPro: 0.386 ± 0.011
MetGln: 0.525 ± 0.012
MetArg: 0.551 ± 0.011
MetSer: 1.412 ± 0.022
MetThr: 0.677 ± 0.012
MetVal: 0.696 ± 0.015
MetTrp: 0.124 ± 0.006
MetTyr: 1.208 ± 0.022
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.919 ± 0.027
AsnCys: 1.613 ± 0.025
AsnAsp: 9.963 ± 0.082
AsnGlu: 9.573 ± 0.081
AsnPhe: 4.878 ± 0.045
AsnGly: 3.022 ± 0.042
AsnHis: 2.84 ± 0.039
AsnIle: 16.448 ± 0.134
AsnLys: 15.585 ± 0.103
AsnLeu: 7.692 ± 0.06
AsnMet: 4.115 ± 0.064
AsnAsn: 32.83 ± 0.474
AsnPro: 1.721 ± 0.027
AsnGln: 2.868 ± 0.041
AsnArg: 2.478 ± 0.032
AsnSer: 7.719 ± 0.085
AsnThr: 5.529 ± 0.052
AsnVal: 6.264 ± 0.05
AsnTrp: 0.335 ± 0.01
AsnTyr: 7.202 ± 0.071
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.436 ± 0.016
ProCys: 0.445 ± 0.015
ProAsp: 0.819 ± 0.016
ProGlu: 1.09 ± 0.049
ProPhe: 1.24 ± 0.019
ProGly: 0.546 ± 0.016
ProHis: 0.544 ± 0.012
ProIle: 1.689 ± 0.026
ProLys: 1.668 ± 0.028
ProLeu: 1.863 ± 0.025
ProMet: 0.37 ± 0.011
ProAsn: 2.018 ± 0.028
ProPro: 0.804 ± 0.029
ProGln: 0.692 ± 0.017
ProArg: 0.537 ± 0.015
ProSer: 1.599 ± 0.026
ProThr: 0.998 ± 0.021
ProVal: 0.773 ± 0.017
ProTrp: 0.134 ± 0.006
ProTyr: 1.318 ± 0.02
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 0.593 ± 0.014
GlnCys: 0.415 ± 0.011
GlnAsp: 1.39 ± 0.023
GlnGlu: 1.956 ± 0.026
GlnPhe: 0.971 ± 0.017
GlnGly: 0.749 ± 0.016
GlnHis: 0.75 ± 0.018
GlnIle: 2.483 ± 0.029
GlnLys: 3.593 ± 0.039
GlnLeu: 1.917 ± 0.025
GlnMet: 0.693 ± 0.015
GlnAsn: 4.293 ± 0.049
GlnPro: 0.541 ± 0.016
GlnGln: 1.13 ± 0.036
GlnArg: 0.822 ± 0.015
GlnSer: 1.432 ± 0.019
GlnThr: 1.28 ± 0.024
GlnVal: 0.945 ± 0.018
GlnTrp: 0.163 ± 0.006
GlnTyr: 1.396 ± 0.019
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 0.628 ± 0.015
ArgCys: 0.453 ± 0.012
ArgAsp: 1.502 ± 0.033
ArgGlu: 1.745 ± 0.031
ArgPhe: 0.985 ± 0.016
ArgGly: 0.972 ± 0.021
ArgHis: 0.568 ± 0.014
ArgIle: 2.18 ± 0.026
ArgLys: 3.958 ± 0.043
ArgLeu: 1.76 ± 0.025
ArgMet: 0.534 ± 0.011
ArgAsn: 3.435 ± 0.035
ArgPro: 0.444 ± 0.013
ArgGln: 0.701 ± 0.015
ArgArg: 1.416 ± 0.028
ArgSer: 1.603 ± 0.027
ArgThr: 1.137 ± 0.019
ArgVal: 0.887 ± 0.015
ArgTrp: 0.197 ± 0.009
ArgTyr: 1.337 ± 0.021
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 1.364 ± 0.023
SerCys: 1.251 ± 0.019
SerAsp: 3.896 ± 0.04
SerGlu: 3.439 ± 0.04
SerPhe: 3.596 ± 0.04
SerGly: 2.012 ± 0.044
SerHis: 1.513 ± 0.023
SerIle: 5.539 ± 0.048
SerLys: 5.706 ± 0.048
SerLeu: 5.397 ± 0.046
SerMet: 1.201 ± 0.019
SerAsn: 8.205 ± 0.086
SerPro: 1.361 ± 0.02
SerGln: 1.616 ± 0.021
SerArg: 1.57 ± 0.027
SerSer: 6.423 ± 0.075
SerThr: 3.086 ± 0.034
SerVal: 2.614 ± 0.043
SerTrp: 0.276 ± 0.008
SerTyr: 3.878 ± 0.04
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 0.928 ± 0.022
ThrCys: 1.005 ± 0.016
ThrAsp: 2.005 ± 0.025
ThrGlu: 2.044 ± 0.047
ThrPhe: 2.235 ± 0.027
ThrGly: 1.11 ± 0.024
ThrHis: 1.234 ± 0.021
ThrIle: 3.144 ± 0.03
ThrLys: 4.07 ± 0.038
ThrLeu: 3.407 ± 0.034
ThrMet: 0.683 ± 0.013
ThrAsn: 5.781 ± 0.047
ThrPro: 1.153 ± 0.026
ThrGln: 1.346 ± 0.022
ThrArg: 0.936 ± 0.017
ThrSer: 3.234 ± 0.036
ThrThr: 2.268 ± 0.036
ThrVal: 1.32 ± 0.024
ThrTrp: 0.216 ± 0.009
ThrTyr: 2.819 ± 0.031
ThrXaa: 0.0 ± 0.0

Val
ValAla: 0.943 ± 0.021
ValCys: 0.835 ± 0.017
ValAsp: 2.632 ± 0.032
ValGlu: 2.768 ± 0.065
ValPhe: 1.613 ± 0.024
ValGly: 1.279 ± 0.024
ValHis: 1.235 ± 0.017
ValIle: 3.069 ± 0.037
ValLys: 3.554 ± 0.034
ValLeu: 3.344 ± 0.036
ValMet: 0.699 ± 0.015
ValAsn: 4.027 ± 0.04
ValPro: 1.112 ± 0.029
ValGln: 1.348 ± 0.024
ValArg: 0.983 ± 0.018
ValSer: 2.601 ± 0.042
ValThr: 1.61 ± 0.029
ValVal: 1.7 ± 0.041
ValTrp: 0.221 ± 0.008
ValTyr: 1.994 ± 0.024
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.15 ± 0.006
TrpCys: 0.097 ± 0.005
TrpAsp: 0.278 ± 0.009
TrpGlu: 0.305 ± 0.012
TrpPhe: 0.256 ± 0.011
TrpGly: 0.212 ± 0.01
TrpHis: 0.08 ± 0.005
TrpIle: 0.485 ± 0.014
TrpLys: 0.636 ± 0.017
TrpLeu: 0.428 ± 0.013
TrpMet: 0.106 ± 0.006
TrpAsn: 0.536 ± 0.013
TrpPro: 0.096 ± 0.005
TrpGln: 0.089 ± 0.006
TrpArg: 0.179 ± 0.007
TrpSer: 0.309 ± 0.009
TrpThr: 0.2 ± 0.008
TrpVal: 0.23 ± 0.008
TrpTrp: 0.087 ± 0.007
TrpTyr: 0.215 ± 0.007
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.113 ± 0.015
TyrCys: 0.861 ± 0.015
TyrAsp: 4.364 ± 0.041
TyrGlu: 3.953 ± 0.035
TyrPhe: 3.065 ± 0.041
TyrGly: 1.506 ± 0.023
TyrHis: 1.33 ± 0.021
TyrIle: 6.332 ± 0.072
TyrLys: 5.764 ± 0.053
TyrLeu: 4.473 ± 0.046
TyrMet: 1.434 ± 0.023
TyrAsn: 7.984 ± 0.083
TyrPro: 1.139 ± 0.019
TyrGln: 1.219 ± 0.019
TyrArg: 1.275 ± 0.021
TyrSer: 3.495 ± 0.035
TyrThr: 2.304 ± 0.029
TyrVal: 2.463 ± 0.03
TyrTrp: 0.213 ± 0.008
TyrTyr: 3.532 ± 0.045
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6037 proteins (3952059 amino acids)

Note: The errors were estimated by bootstrapping (x100) at the protein level.
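A protein-level bootstrap could look like the sketch below: whole proteins (not individual residues) are resampled with replacement, the statistic is recomputed for each replicate, and the spread across replicates is reported as the error. This is an illustration under stated assumptions, not the database's actual code: the function names, the fixed seed, the toy sequences, and the use of the standard deviation as the reported error are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide, one-letter codes, e.g. 'NN'."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std dev)."""
    rng = random.Random(seed)  # fixed seed only for reproducibility
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not taken from the database)
toy = ["MKNNNNK", "MSKKEE", "MNNIKNN", "MLLKKNE"]
mean, err = bootstrap_error(toy, "NN")
print(f"NN: {mean:.3f} ± {err:.3f}")
```

Resampling at the protein level keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every replicate.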

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski