Amino acid dipeptide frequency for Plasmodium falciparum (isolate NF54)

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
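As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping dipeptides within each protein and divides by the total number of dipeptides. It is a minimal sketch, not the pipeline used for this table: the function names, the toy sequences, the one-letter alphabet, and the choice to map every non-standard letter to `X` (standing in for Xaa) are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(sequences):
    """Count overlapping dipeptides within each sequence.

    Any letter outside the 20 standard amino acids is mapped to 'X',
    which plays the role of Xaa here (an assumption of this sketch).
    Dipeptides never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    return counts


def dipeptide_fractions(sequences):
    """Return each observed dipeptide's share of all counted dipeptides."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the proteome.
toy = ["MKKNNK", "NNKX"]
fractions = dipeptide_fractions(toy)

# Uniform expectation for 400 equally likely dipeptides.
uniform_expectation = 1 / 400

for pair, value in sorted(fractions.items()):
    print(pair, round(value, 3))
```

With real data, `sequences` would be the protein sequences of the proteome, and each fraction could be compared against `uniform_expectation` (0.0025, that is 0.25%).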

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
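The unit conversion, and a comparison against the 0.25% uniform expectation mentioned above, can be sketched as follows. The three example values are copied from the table; the function names are hypothetical. Using 2.5‰ as the threshold for over- and underrepresentation is an assumption of this sketch, since the page does not state which baseline drives its colouring.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000 / 400  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def classify(value, baseline=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the baseline (assumed threshold)."""
    if value > baseline:
        return "over"
    if value < baseline:
        return "under"
    return "as expected"


# Values taken from the table, in per mille.
examples = {"AsnAsn": 33.04, "LysLys": 19.897, "TrpHis": 0.076}

for pair, value in examples.items():
    print(pair, per_mille_to_fraction(value), classify(value))
```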

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.933 ± 0.025
AlaCys: 0.471 ± 0.014
AlaAsp: 1.045 ± 0.019
AlaGlu: 1.161 ± 0.025
AlaPhe: 0.937 ± 0.022
AlaGly: 0.753 ± 0.018
AlaHis: 0.574 ± 0.013
AlaIle: 1.635 ± 0.029
AlaLys: 1.761 ± 0.031
AlaLeu: 1.92 ± 0.032
AlaMet: 0.368 ± 0.012
AlaAsn: 1.57 ± 0.022
AlaPro: 0.575 ± 0.017
AlaGln: 0.696 ± 0.016
AlaArg: 0.588 ± 0.014
AlaSer: 1.491 ± 0.025
AlaThr: 0.986 ± 0.022
AlaVal: 0.874 ± 0.021
AlaTrp: 0.133 ± 0.006
AlaTyr: 1.082 ± 0.018
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.522 ± 0.014
CysCys: 0.33 ± 0.01
CysAsp: 1.264 ± 0.022
CysGlu: 1.098 ± 0.016
CysPhe: 0.853 ± 0.017
CysGly: 0.706 ± 0.019
CysHis: 0.333 ± 0.01
CysIle: 1.78 ± 0.026
CysLys: 1.628 ± 0.029
CysLeu: 1.563 ± 0.024
CysMet: 0.369 ± 0.011
CysAsn: 1.902 ± 0.026
CysPro: 0.467 ± 0.016
CysGln: 0.349 ± 0.01
CysArg: 0.497 ± 0.015
CysSer: 1.43 ± 0.021
CysThr: 0.951 ± 0.017
CysVal: 0.9 ± 0.018
CysTrp: 0.069 ± 0.005
CysTyr: 0.847 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.213 ± 0.025
AspCys: 0.681 ± 0.016
AspAsp: 6.709 ± 0.095
AspGlu: 5.777 ± 0.057
AspPhe: 2.137 ± 0.027
AspGly: 1.701 ± 0.03
AspHis: 1.479 ± 0.028
AspIle: 7.368 ± 0.053
AspLys: 6.696 ± 0.059
AspLeu: 3.762 ± 0.036
AspMet: 1.713 ± 0.025
AspAsn: 8.954 ± 0.079
AspPro: 0.99 ± 0.02
AspGln: 1.431 ± 0.024
AspArg: 1.203 ± 0.026
AspSer: 3.136 ± 0.036
AspThr: 2.668 ± 0.029
AspVal: 2.84 ± 0.036
AspTrp: 0.214 ± 0.008
AspTyr: 2.825 ± 0.036
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.436 ± 0.026
GluCys: 1.1 ± 0.02
GluAsp: 4.68 ± 0.062
GluGlu: 8.013 ± 0.153
GluPhe: 2.007 ± 0.026
GluGly: 2.015 ± 0.035
GluHis: 1.828 ± 0.025
GluIle: 5.325 ± 0.062
GluLys: 10.626 ± 0.079
GluLeu: 4.533 ± 0.055
GluMet: 1.389 ± 0.024
GluAsn: 8.965 ± 0.065
GluPro: 0.972 ± 0.02
GluGln: 2.657 ± 0.033
GluArg: 2.089 ± 0.033
GluSer: 3.297 ± 0.043
GluThr: 2.388 ± 0.03
GluVal: 2.084 ± 0.063
GluTrp: 0.474 ± 0.019
GluTyr: 3.716 ± 0.03
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.808 ± 0.015
PheCys: 0.982 ± 0.017
PheAsp: 2.481 ± 0.027
PheGlu: 2.345 ± 0.031
PhePhe: 3.522 ± 0.052
PheGly: 1.148 ± 0.021
PheHis: 1.168 ± 0.018
PheIle: 4.403 ± 0.052
PheLys: 3.736 ± 0.035
PheLeu: 5.156 ± 0.052
PheMet: 0.934 ± 0.015
PheAsn: 4.402 ± 0.04
PhePro: 1.018 ± 0.019
PheGln: 1.135 ± 0.02
PheArg: 1.002 ± 0.016
PheSer: 3.289 ± 0.037
PheThr: 1.583 ± 0.023
PheVal: 1.982 ± 0.03
PheTrp: 0.212 ± 0.008
PheTyr: 2.959 ± 0.039
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.832 ± 0.02
GlyCys: 0.522 ± 0.012
GlyAsp: 2.089 ± 0.042
GlyGlu: 1.854 ± 0.033
GlyPhe: 1.078 ± 0.02
GlyGly: 1.513 ± 0.04
GlyHis: 0.621 ± 0.013
GlyIle: 2.415 ± 0.032
GlyLys: 3.039 ± 0.04
GlyLeu: 1.946 ± 0.029
GlyMet: 0.59 ± 0.014
GlyAsn: 3.059 ± 0.037
GlyPro: 0.548 ± 0.013
GlyGln: 0.667 ± 0.016
GlyArg: 0.874 ± 0.019
GlySer: 1.985 ± 0.036
GlyThr: 1.439 ± 0.031
GlyVal: 1.315 ± 0.023
GlyTrp: 0.168 ± 0.006
GlyTyr: 1.462 ± 0.022
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.462 ± 0.013
HisCys: 0.313 ± 0.01
HisAsp: 1.371 ± 0.023
HisGlu: 1.297 ± 0.022
HisPhe: 1.349 ± 0.02
HisGly: 0.603 ± 0.014
HisHis: 0.756 ± 0.022
HisIle: 2.993 ± 0.039
HisLys: 2.307 ± 0.029
HisLeu: 1.936 ± 0.021
HisMet: 0.871 ± 0.018
HisAsn: 3.45 ± 0.056
HisPro: 0.556 ± 0.012
HisGln: 0.535 ± 0.014
HisArg: 0.532 ± 0.012
HisSer: 1.421 ± 0.021
HisThr: 1.162 ± 0.019
HisVal: 1.163 ± 0.03
HisTrp: 0.091 ± 0.005
HisTyr: 1.08 ± 0.019
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.502 ± 0.025
IleCys: 2.1 ± 0.028
IleAsp: 4.855 ± 0.048
IleGlu: 5.176 ± 0.055
IlePhe: 4.89 ± 0.053
IleGly: 2.171 ± 0.03
IleHis: 2.667 ± 0.031
IleIle: 8.611 ± 0.074
IleLys: 10.791 ± 0.072
IleLeu: 8.54 ± 0.077
IleMet: 1.686 ± 0.024
IleAsn: 13.002 ± 0.113
IlePro: 2.39 ± 0.04
IleGln: 2.976 ± 0.04
IleArg: 2.291 ± 0.028
IleSer: 6.272 ± 0.048
IleThr: 3.5 ± 0.037
IleVal: 2.876 ± 0.036
IleTrp: 0.513 ± 0.012
IleTyr: 6.688 ± 0.074
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.815 ± 0.027
LysCys: 2.08 ± 0.029
LysAsp: 6.91 ± 0.047
LysGlu: 10.097 ± 0.092
LysPhe: 3.215 ± 0.035
LysGly: 3.39 ± 0.036
LysHis: 2.394 ± 0.028
LysIle: 9.645 ± 0.074
LysLys: 19.897 ± 0.144
LysLeu: 7.474 ± 0.058
LysMet: 2.622 ± 0.031
LysAsn: 17.148 ± 0.103
LysPro: 1.491 ± 0.025
LysGln: 3.138 ± 0.033
LysArg: 4.249 ± 0.043
LysSer: 6.173 ± 0.046
LysThr: 4.226 ± 0.033
LysVal: 3.348 ± 0.034
LysTrp: 0.639 ± 0.017
LysTyr: 7.088 ± 0.057
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.557 ± 0.028
LeuCys: 1.817 ± 0.028
LeuAsp: 3.601 ± 0.04
LeuGlu: 4.377 ± 0.05
LeuPhe: 4.583 ± 0.052
LeuGly: 1.995 ± 0.029
LeuHis: 1.89 ± 0.025
LeuIle: 6.353 ± 0.061
LeuLys: 9.203 ± 0.065
LeuLeu: 7.651 ± 0.067
LeuMet: 1.319 ± 0.02
LeuAsn: 9.016 ± 0.067
LeuPro: 1.818 ± 0.023
LeuGln: 2.403 ± 0.028
LeuArg: 2.321 ± 0.032
LeuSer: 5.712 ± 0.043
LeuThr: 2.925 ± 0.032
LeuVal: 2.359 ± 0.034
LeuTrp: 0.473 ± 0.013
LeuTyr: 5.12 ± 0.051
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.385 ± 0.011
MetCys: 0.465 ± 0.013
MetAsp: 1.569 ± 0.023
MetGlu: 1.505 ± 0.022
MetPhe: 0.917 ± 0.017
MetGly: 0.581 ± 0.016
MetHis: 0.443 ± 0.011
MetIle: 1.59 ± 0.021
MetLys: 2.912 ± 0.031
MetLeu: 1.691 ± 0.024
MetMet: 0.509 ± 0.014
MetAsn: 4.057 ± 0.074
MetPro: 0.391 ± 0.013
MetGln: 0.521 ± 0.012
MetArg: 0.547 ± 0.012
MetSer: 1.422 ± 0.022
MetThr: 0.675 ± 0.014
MetVal: 0.697 ± 0.016
MetTrp: 0.126 ± 0.006
MetTyr: 1.211 ± 0.02
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.907 ± 0.025
AsnCys: 1.613 ± 0.025
AsnAsp: 9.968 ± 0.09
AsnGlu: 9.579 ± 0.068
AsnPhe: 4.87 ± 0.048
AsnGly: 3.042 ± 0.052
AsnHis: 2.846 ± 0.051
AsnIle: 16.426 ± 0.143
AsnLys: 15.532 ± 0.1
AsnLeu: 7.704 ± 0.051
AsnMet: 4.104 ± 0.069
AsnAsn: 33.04 ± 0.513
AsnPro: 1.722 ± 0.026
AsnGln: 2.88 ± 0.044
AsnArg: 2.49 ± 0.033
AsnSer: 7.756 ± 0.083
AsnThr: 5.546 ± 0.055
AsnVal: 6.213 ± 0.057
AsnTrp: 0.33 ± 0.009
AsnTyr: 7.219 ± 0.075
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.44 ± 0.018
ProCys: 0.448 ± 0.017
ProAsp: 0.808 ± 0.019
ProGlu: 1.075 ± 0.043
ProPhe: 1.232 ± 0.019
ProGly: 0.554 ± 0.017
ProHis: 0.545 ± 0.012
ProIle: 1.688 ± 0.024
ProLys: 1.661 ± 0.027
ProLeu: 1.876 ± 0.023
ProMet: 0.374 ± 0.01
ProAsn: 2.009 ± 0.03
ProPro: 0.789 ± 0.03
ProGln: 0.68 ± 0.019
ProArg: 0.531 ± 0.013
ProSer: 1.613 ± 0.028
ProThr: 1.006 ± 0.021
ProVal: 0.769 ± 0.018
ProTrp: 0.137 ± 0.006
ProTyr: 1.323 ± 0.02
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.598 ± 0.013
GlnCys: 0.411 ± 0.01
GlnAsp: 1.391 ± 0.022
GlnGlu: 1.954 ± 0.025
GlnPhe: 0.971 ± 0.015
GlnGly: 0.762 ± 0.015
GlnHis: 0.738 ± 0.015
GlnIle: 2.486 ± 0.03
GlnLys: 3.592 ± 0.035
GlnLeu: 1.921 ± 0.027
GlnMet: 0.701 ± 0.017
GlnAsn: 4.271 ± 0.047
GlnPro: 0.541 ± 0.015
GlnGln: 1.115 ± 0.032
GlnArg: 0.828 ± 0.018
GlnSer: 1.446 ± 0.022
GlnThr: 1.287 ± 0.021
GlnVal: 0.941 ± 0.017
GlnTrp: 0.159 ± 0.007
GlnTyr: 1.393 ± 0.021
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.632 ± 0.014
ArgCys: 0.458 ± 0.012
ArgAsp: 1.508 ± 0.031
ArgGlu: 1.738 ± 0.034
ArgPhe: 0.985 ± 0.017
ArgGly: 0.97 ± 0.023
ArgHis: 0.563 ± 0.012
ArgIle: 2.176 ± 0.029
ArgLys: 3.929 ± 0.037
ArgLeu: 1.745 ± 0.026
ArgMet: 0.547 ± 0.014
ArgAsn: 3.454 ± 0.041
ArgPro: 0.455 ± 0.014
ArgGln: 0.703 ± 0.015
ArgArg: 1.425 ± 0.03
ArgSer: 1.588 ± 0.026
ArgThr: 1.127 ± 0.021
ArgVal: 0.876 ± 0.017
ArgTrp: 0.195 ± 0.009
ArgTyr: 1.349 ± 0.02
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.374 ± 0.025
SerCys: 1.253 ± 0.019
SerAsp: 3.915 ± 0.046
SerGlu: 3.466 ± 0.042
SerPhe: 3.594 ± 0.032
SerGly: 2.02 ± 0.038
SerHis: 1.525 ± 0.026
SerIle: 5.556 ± 0.046
SerLys: 5.697 ± 0.048
SerLeu: 5.409 ± 0.045
SerMet: 1.209 ± 0.019
SerAsn: 8.226 ± 0.097
SerPro: 1.358 ± 0.025
SerGln: 1.618 ± 0.024
SerArg: 1.58 ± 0.026
SerSer: 6.527 ± 0.072
SerThr: 3.089 ± 0.035
SerVal: 2.636 ± 0.037
SerTrp: 0.277 ± 0.009
SerTyr: 3.888 ± 0.036
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.921 ± 0.022
ThrCys: 1.014 ± 0.016
ThrAsp: 2.017 ± 0.027
ThrGlu: 2.05 ± 0.044
ThrPhe: 2.221 ± 0.025
ThrGly: 1.107 ± 0.023
ThrHis: 1.221 ± 0.018
ThrIle: 3.146 ± 0.029
ThrLys: 4.09 ± 0.034
ThrLeu: 3.424 ± 0.038
ThrMet: 0.684 ± 0.015
ThrAsn: 5.737 ± 0.06
ThrPro: 1.149 ± 0.023
ThrGln: 1.347 ± 0.021
ThrArg: 0.939 ± 0.016
ThrSer: 3.216 ± 0.035
ThrThr: 2.266 ± 0.034
ThrVal: 1.331 ± 0.023
ThrTrp: 0.215 ± 0.008
ThrTyr: 2.814 ± 0.03
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.955 ± 0.023
ValCys: 0.841 ± 0.017
ValAsp: 2.651 ± 0.03
ValGlu: 2.766 ± 0.057
ValPhe: 1.599 ± 0.021
ValGly: 1.261 ± 0.023
ValHis: 1.243 ± 0.022
ValIle: 3.092 ± 0.042
ValLys: 3.55 ± 0.034
ValLeu: 3.344 ± 0.034
ValMet: 0.692 ± 0.014
ValAsn: 4.024 ± 0.04
ValPro: 1.089 ± 0.024
ValGln: 1.346 ± 0.02
ValArg: 0.983 ± 0.015
ValSer: 2.626 ± 0.052
ValThr: 1.603 ± 0.027
ValVal: 1.713 ± 0.034
ValTrp: 0.227 ± 0.008
ValTyr: 1.99 ± 0.021
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.15 ± 0.007
TrpCys: 0.094 ± 0.005
TrpAsp: 0.278 ± 0.009
TrpGlu: 0.304 ± 0.011
TrpPhe: 0.254 ± 0.011
TrpGly: 0.212 ± 0.008
TrpHis: 0.076 ± 0.005
TrpIle: 0.486 ± 0.016
TrpLys: 0.65 ± 0.017
TrpLeu: 0.423 ± 0.013
TrpMet: 0.104 ± 0.006
TrpAsn: 0.523 ± 0.012
TrpPro: 0.098 ± 0.005
TrpGln: 0.09 ± 0.005
TrpArg: 0.174 ± 0.007
TrpSer: 0.312 ± 0.011
TrpThr: 0.194 ± 0.007
TrpVal: 0.234 ± 0.008
TrpTrp: 0.084 ± 0.006
TrpTyr: 0.212 ± 0.007
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.108 ± 0.021
TyrCys: 0.865 ± 0.016
TyrAsp: 4.384 ± 0.041
TyrGlu: 3.961 ± 0.037
TyrPhe: 3.049 ± 0.04
TyrGly: 1.511 ± 0.025
TyrHis: 1.337 ± 0.023
TyrIle: 6.327 ± 0.068
TyrLys: 5.742 ± 0.048
TyrLeu: 4.476 ± 0.047
TyrMet: 1.432 ± 0.025
TyrAsn: 7.979 ± 0.069
TyrPro: 1.138 ± 0.02
TyrGln: 1.22 ± 0.017
TyrArg: 1.282 ± 0.022
TyrSer: 3.501 ± 0.037
TyrThr: 2.287 ± 0.025
TyrVal: 2.447 ± 0.027
TyrTrp: 0.219 ± 0.008
TyrTyr: 3.52 ± 0.044
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5920 proteins (3863743 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
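Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is a minimal sketch under stated assumptions, not the database's actual procedure: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are all hypothetical choices.

```python
import random
import statistics


def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (overlapping windows) in per mille."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each replicate draws len(sequences) whole proteins with replacement,
    so residues from one protein always stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code, not real data.
toy_proteome = ["MKKNNK", "NNKNN", "MSTLLK", "KKKK"]
point = dipeptide_per_mille(toy_proteome, "NN")
error = bootstrap_error(toy_proteome, "NN")
print(f"NN: {point:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each protein's composition intact, so the error reflects variation between proteins.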

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski