Amino acid dipepetide frequency for Plasmodium falciparum (isolate Camp / Malaysia)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.806AlaAla: 0.806 ± 0.025
0.481AlaCys: 0.481 ± 0.014
1.038AlaAsp: 1.038 ± 0.02
1.143AlaGlu: 1.143 ± 0.024
0.929AlaPhe: 0.929 ± 0.019
0.694AlaGly: 0.694 ± 0.019
0.583AlaHis: 0.583 ± 0.015
1.595AlaIle: 1.595 ± 0.025
1.729AlaLys: 1.729 ± 0.031
1.898AlaLeu: 1.898 ± 0.032
0.368AlaMet: 0.368 ± 0.011
1.556AlaAsn: 1.556 ± 0.024
0.55AlaPro: 0.55 ± 0.017
0.685AlaGln: 0.685 ± 0.015
0.581AlaArg: 0.581 ± 0.016
1.449AlaSer: 1.449 ± 0.026
0.952AlaThr: 0.952 ± 0.024
0.844AlaVal: 0.844 ± 0.02
0.133AlaTrp: 0.133 ± 0.007
1.104AlaTyr: 1.104 ± 0.017
0.0AlaXaa: 0.0 ± 0.0
Cys
0.525CysAla: 0.525 ± 0.012
0.329CysCys: 0.329 ± 0.011
1.264CysAsp: 1.264 ± 0.023
1.087CysGlu: 1.087 ± 0.016
0.871CysPhe: 0.871 ± 0.016
0.692CysGly: 0.692 ± 0.016
0.329CysHis: 0.329 ± 0.008
1.781CysIle: 1.781 ± 0.029
1.64CysLys: 1.64 ± 0.024
1.562CysLeu: 1.562 ± 0.024
0.371CysMet: 0.371 ± 0.011
1.891CysAsn: 1.891 ± 0.028
0.464CysPro: 0.464 ± 0.014
0.35CysGln: 0.35 ± 0.01
0.492CysArg: 0.492 ± 0.013
1.437CysSer: 1.437 ± 0.021
0.933CysThr: 0.933 ± 0.015
0.901CysVal: 0.901 ± 0.018
0.071CysTrp: 0.071 ± 0.004
0.849CysTyr: 0.849 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
1.187AspAla: 1.187 ± 0.021
0.679AspCys: 0.679 ± 0.014
6.63AspAsp: 6.63 ± 0.092
5.752AspGlu: 5.752 ± 0.055
2.14AspPhe: 2.14 ± 0.028
1.685AspGly: 1.685 ± 0.025
1.444AspHis: 1.444 ± 0.023
7.362AspIle: 7.362 ± 0.054
6.688AspLys: 6.688 ± 0.06
3.752AspLeu: 3.752 ± 0.037
1.704AspMet: 1.704 ± 0.024
8.895AspAsn: 8.895 ± 0.085
0.976AspPro: 0.976 ± 0.017
1.423AspGln: 1.423 ± 0.024
1.176AspArg: 1.176 ± 0.021
3.112AspSer: 3.112 ± 0.033
2.643AspThr: 2.643 ± 0.03
2.769AspVal: 2.769 ± 0.033
0.218AspTrp: 0.218 ± 0.009
2.843AspTyr: 2.843 ± 0.03
0.0AspXaa: 0.0 ± 0.0
Glu
1.392GluAla: 1.392 ± 0.026
1.104GluCys: 1.104 ± 0.02
4.702GluAsp: 4.702 ± 0.051
7.863GluGlu: 7.863 ± 0.115
2.034GluPhe: 2.034 ± 0.023
1.983GluGly: 1.983 ± 0.031
1.843GluHis: 1.843 ± 0.028
5.372GluIle: 5.372 ± 0.052
10.714GluLys: 10.714 ± 0.091
4.531GluLeu: 4.531 ± 0.049
1.381GluMet: 1.381 ± 0.023
9.027GluAsn: 9.027 ± 0.063
0.953GluPro: 0.953 ± 0.023
2.637GluGln: 2.637 ± 0.035
2.095GluArg: 2.095 ± 0.031
3.28GluSer: 3.28 ± 0.037
2.361GluThr: 2.361 ± 0.03
2.016GluVal: 2.016 ± 0.033
0.468GluTrp: 0.468 ± 0.016
3.733GluTyr: 3.733 ± 0.032
0.0GluXaa: 0.0 ± 0.0
Phe
0.807PheAla: 0.807 ± 0.017
0.989PheCys: 0.989 ± 0.017
2.487PheAsp: 2.487 ± 0.031
2.354PheGlu: 2.354 ± 0.025
3.564PhePhe: 3.564 ± 0.051
1.149PheGly: 1.149 ± 0.022
1.164PheHis: 1.164 ± 0.018
4.455PheIle: 4.455 ± 0.052
3.785PheLys: 3.785 ± 0.035
5.244PheLeu: 5.244 ± 0.055
0.941PheMet: 0.941 ± 0.017
4.439PheAsn: 4.439 ± 0.041
1.027PhePro: 1.027 ± 0.019
1.151PheGln: 1.151 ± 0.018
1.002PheArg: 1.002 ± 0.016
3.333PheSer: 3.333 ± 0.033
1.6PheThr: 1.6 ± 0.02
2.022PheVal: 2.022 ± 0.023
0.211PheTrp: 0.211 ± 0.008
3.011PheTyr: 3.011 ± 0.038
0.0PheXaa: 0.0 ± 0.0
Gly
0.789GlyAla: 0.789 ± 0.021
0.518GlyCys: 0.518 ± 0.013
2.005GlyAsp: 2.005 ± 0.036
1.827GlyGlu: 1.827 ± 0.025
1.089GlyPhe: 1.089 ± 0.021
1.417GlyGly: 1.417 ± 0.03
0.622GlyHis: 0.622 ± 0.012
2.368GlyIle: 2.368 ± 0.029
3.041GlyLys: 3.041 ± 0.035
1.921GlyLeu: 1.921 ± 0.027
0.585GlyMet: 0.585 ± 0.015
3.034GlyAsn: 3.034 ± 0.036
0.503GlyPro: 0.503 ± 0.013
0.668GlyGln: 0.668 ± 0.015
0.887GlyArg: 0.887 ± 0.018
1.954GlySer: 1.954 ± 0.031
1.436GlyThr: 1.436 ± 0.025
1.298GlyVal: 1.298 ± 0.026
0.167GlyTrp: 0.167 ± 0.007
1.464GlyTyr: 1.464 ± 0.023
0.0GlyXaa: 0.0 ± 0.0
His
0.454HisAla: 0.454 ± 0.01
0.311HisCys: 0.311 ± 0.01
1.365HisAsp: 1.365 ± 0.022
1.315HisGlu: 1.315 ± 0.02
1.35HisPhe: 1.35 ± 0.021
0.609HisGly: 0.609 ± 0.014
0.753HisHis: 0.753 ± 0.021
3.023HisIle: 3.023 ± 0.035
2.346HisLys: 2.346 ± 0.033
1.943HisLeu: 1.943 ± 0.025
0.867HisMet: 0.867 ± 0.016
3.426HisAsn: 3.426 ± 0.055
0.555HisPro: 0.555 ± 0.012
0.529HisGln: 0.529 ± 0.013
0.521HisArg: 0.521 ± 0.011
1.412HisSer: 1.412 ± 0.02
1.164HisThr: 1.164 ± 0.019
1.11HisVal: 1.11 ± 0.018
0.093HisTrp: 0.093 ± 0.005
1.085HisTyr: 1.085 ± 0.015
0.0HisXaa: 0.0 ± 0.0
Ile
1.464IleAla: 1.464 ± 0.027
2.105IleCys: 2.105 ± 0.028
4.836IleAsp: 4.836 ± 0.048
5.193IleGlu: 5.193 ± 0.052
4.977IlePhe: 4.977 ± 0.053
2.157IleGly: 2.157 ± 0.03
2.671IleHis: 2.671 ± 0.033
8.687IleIle: 8.687 ± 0.07
10.927IleLys: 10.927 ± 0.074
8.585IleLeu: 8.585 ± 0.069
1.674IleMet: 1.674 ± 0.024
13.094IleAsn: 13.094 ± 0.108
2.347IlePro: 2.347 ± 0.028
2.999IleGln: 2.999 ± 0.038
2.332IleArg: 2.332 ± 0.027
6.254IleSer: 6.254 ± 0.046
3.511IleThr: 3.511 ± 0.037
2.864IleVal: 2.864 ± 0.033
0.516IleTrp: 0.516 ± 0.013
6.697IleTyr: 6.697 ± 0.07
0.0IleXaa: 0.0 ± 0.0
Lys
1.796LysAla: 1.796 ± 0.029
2.084LysCys: 2.084 ± 0.03
6.913LysAsp: 6.913 ± 0.056
10.167LysGlu: 10.167 ± 0.086
3.25LysPhe: 3.25 ± 0.029
3.419LysGly: 3.419 ± 0.037
2.402LysHis: 2.402 ± 0.026
9.775LysIle: 9.775 ± 0.066
20.351LysLys: 20.351 ± 0.166
7.512LysLeu: 7.512 ± 0.061
2.646LysMet: 2.646 ± 0.027
17.301LysAsn: 17.301 ± 0.114
1.471LysPro: 1.471 ± 0.022
3.165LysGln: 3.165 ± 0.034
4.321LysArg: 4.321 ± 0.041
6.216LysSer: 6.216 ± 0.047
4.215LysThr: 4.215 ± 0.037
3.343LysVal: 3.343 ± 0.035
0.644LysTrp: 0.644 ± 0.018
7.167LysTyr: 7.167 ± 0.059
0.0LysXaa: 0.0 ± 0.0
Leu
1.546LeuAla: 1.546 ± 0.028
1.837LeuCys: 1.837 ± 0.025
3.613LeuAsp: 3.613 ± 0.035
4.395LeuGlu: 4.395 ± 0.046
4.643LeuPhe: 4.643 ± 0.047
1.962LeuGly: 1.962 ± 0.03
1.911LeuHis: 1.911 ± 0.021
6.422LeuIle: 6.422 ± 0.054
9.26LeuLys: 9.26 ± 0.067
7.703LeuLeu: 7.703 ± 0.065
1.327LeuMet: 1.327 ± 0.018
9.009LeuAsn: 9.009 ± 0.065
1.829LeuPro: 1.829 ± 0.026
2.405LeuGln: 2.405 ± 0.031
2.33LeuArg: 2.33 ± 0.028
5.736LeuSer: 5.736 ± 0.048
2.912LeuThr: 2.912 ± 0.031
2.318LeuVal: 2.318 ± 0.029
0.481LeuTrp: 0.481 ± 0.012
5.171LeuTyr: 5.171 ± 0.048
0.0LeuXaa: 0.0 ± 0.0
Met
0.389MetAla: 0.389 ± 0.011
0.46MetCys: 0.46 ± 0.013
1.562MetAsp: 1.562 ± 0.024
1.529MetGlu: 1.529 ± 0.025
0.912MetPhe: 0.912 ± 0.017
0.585MetGly: 0.585 ± 0.015
0.44MetHis: 0.44 ± 0.011
1.592MetIle: 1.592 ± 0.022
2.905MetLys: 2.905 ± 0.03
1.693MetLeu: 1.693 ± 0.024
0.502MetMet: 0.502 ± 0.013
4.042MetAsn: 4.042 ± 0.068
0.395MetPro: 0.395 ± 0.013
0.527MetGln: 0.527 ± 0.013
0.555MetArg: 0.555 ± 0.012
1.415MetSer: 1.415 ± 0.021
0.68MetThr: 0.68 ± 0.014
0.687MetVal: 0.687 ± 0.014
0.125MetTrp: 0.125 ± 0.005
1.214MetTyr: 1.214 ± 0.021
0.0MetXaa: 0.0 ± 0.0
Asn
1.917AsnAla: 1.917 ± 0.029
1.624AsnCys: 1.624 ± 0.025
9.873AsnAsp: 9.873 ± 0.082
9.599AsnGlu: 9.599 ± 0.071
4.907AsnPhe: 4.907 ± 0.05
2.997AsnGly: 2.997 ± 0.035
2.833AsnHis: 2.833 ± 0.047
16.531AsnIle: 16.531 ± 0.137
15.703AsnLys: 15.703 ± 0.119
7.736AsnLeu: 7.736 ± 0.054
4.114AsnMet: 4.114 ± 0.069
32.874AsnAsn: 32.874 ± 0.489
1.711AsnPro: 1.711 ± 0.026
2.85AsnGln: 2.85 ± 0.037
2.501AsnArg: 2.501 ± 0.031
7.728AsnSer: 7.728 ± 0.08
5.473AsnThr: 5.473 ± 0.052
6.254AsnVal: 6.254 ± 0.06
0.338AsnTrp: 0.338 ± 0.011
7.219AsnTyr: 7.219 ± 0.074
0.0AsnXaa: 0.0 ± 0.0
Pro
0.413ProAla: 0.413 ± 0.013
0.443ProCys: 0.443 ± 0.013
0.817ProAsp: 0.817 ± 0.017
1.032ProGlu: 1.032 ± 0.02
1.23ProPhe: 1.23 ± 0.019
0.532ProGly: 0.532 ± 0.014
0.542ProHis: 0.542 ± 0.013
1.661ProIle: 1.661 ± 0.022
1.635ProLys: 1.635 ± 0.023
1.879ProLeu: 1.879 ± 0.025
0.366ProMet: 0.366 ± 0.01
2.004ProAsn: 2.004 ± 0.034
0.774ProPro: 0.774 ± 0.024
0.671ProGln: 0.671 ± 0.015
0.541ProArg: 0.541 ± 0.014
1.554ProSer: 1.554 ± 0.02
0.955ProThr: 0.955 ± 0.021
0.772ProVal: 0.772 ± 0.015
0.138ProTrp: 0.138 ± 0.006
1.31ProTyr: 1.31 ± 0.02
0.0ProXaa: 0.0 ± 0.0
Gln
0.579GlnAla: 0.579 ± 0.015
0.411GlnCys: 0.411 ± 0.012
1.395GlnAsp: 1.395 ± 0.019
1.957GlnGlu: 1.957 ± 0.025
0.979GlnPhe: 0.979 ± 0.017
0.747GlnGly: 0.747 ± 0.015
0.738GlnHis: 0.738 ± 0.016
2.499GlnIle: 2.499 ± 0.03
3.616GlnLys: 3.616 ± 0.039
1.934GlnLeu: 1.934 ± 0.026
0.698GlnMet: 0.698 ± 0.016
4.272GlnAsn: 4.272 ± 0.048
0.515GlnPro: 0.515 ± 0.013
1.105GlnGln: 1.105 ± 0.03
0.815GlnArg: 0.815 ± 0.016
1.431GlnSer: 1.431 ± 0.023
1.266GlnThr: 1.266 ± 0.021
0.94GlnVal: 0.94 ± 0.015
0.162GlnTrp: 0.162 ± 0.007
1.383GlnTyr: 1.383 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
0.632ArgAla: 0.632 ± 0.017
0.447ArgCys: 0.447 ± 0.012
1.478ArgAsp: 1.478 ± 0.027
1.77ArgGlu: 1.77 ± 0.031
0.993ArgPhe: 0.993 ± 0.018
0.963ArgGly: 0.963 ± 0.017
0.565ArgHis: 0.565 ± 0.015
2.209ArgIle: 2.209 ± 0.024
3.99ArgLys: 3.99 ± 0.04
1.774ArgLeu: 1.774 ± 0.028
0.534ArgMet: 0.534 ± 0.013
3.45ArgAsn: 3.45 ± 0.036
0.433ArgPro: 0.433 ± 0.012
0.699ArgGln: 0.699 ± 0.015
1.424ArgArg: 1.424 ± 0.033
1.608ArgSer: 1.608 ± 0.026
1.14ArgThr: 1.14 ± 0.018
0.887ArgVal: 0.887 ± 0.016
0.202ArgTrp: 0.202 ± 0.007
1.349ArgTyr: 1.349 ± 0.019
0.0ArgXaa: 0.0 ± 0.0
Ser
1.352SerAla: 1.352 ± 0.025
1.237SerCys: 1.237 ± 0.019
3.846SerAsp: 3.846 ± 0.044
3.451SerGlu: 3.451 ± 0.041
3.647SerPhe: 3.647 ± 0.035
1.921SerGly: 1.921 ± 0.035
1.49SerHis: 1.49 ± 0.019
5.542SerIle: 5.542 ± 0.045
5.724SerLys: 5.724 ± 0.05
5.425SerLeu: 5.425 ± 0.042
1.201SerMet: 1.201 ± 0.019
8.191SerAsn: 8.191 ± 0.087
1.338SerPro: 1.338 ± 0.021
1.605SerGln: 1.605 ± 0.026
1.581SerArg: 1.581 ± 0.026
6.394SerSer: 6.394 ± 0.069
3.077SerThr: 3.077 ± 0.034
2.613SerVal: 2.613 ± 0.035
0.282SerTrp: 0.282 ± 0.008
3.913SerTyr: 3.913 ± 0.038
0.0SerXaa: 0.0 ± 0.0
Thr
0.897ThrAla: 0.897 ± 0.02
0.989ThrCys: 0.989 ± 0.018
1.989ThrAsp: 1.989 ± 0.029
2.022ThrGlu: 2.022 ± 0.035
2.235ThrPhe: 2.235 ± 0.027
1.103ThrGly: 1.103 ± 0.023
1.221ThrHis: 1.221 ± 0.018
3.162ThrIle: 3.162 ± 0.034
4.089ThrLys: 4.089 ± 0.033
3.407ThrLeu: 3.407 ± 0.034
0.687ThrMet: 0.687 ± 0.014
5.689ThrAsn: 5.689 ± 0.056
1.125ThrPro: 1.125 ± 0.024
1.322ThrGln: 1.322 ± 0.021
0.941ThrArg: 0.941 ± 0.018
3.173ThrSer: 3.173 ± 0.032
2.197ThrThr: 2.197 ± 0.033
1.315ThrVal: 1.315 ± 0.022
0.211ThrTrp: 0.211 ± 0.008
2.809ThrTyr: 2.809 ± 0.028
0.0ThrXaa: 0.0 ± 0.0
Val
0.909ValAla: 0.909 ± 0.022
0.836ValCys: 0.836 ± 0.015
2.629ValAsp: 2.629 ± 0.031
2.742ValGlu: 2.742 ± 0.044
1.638ValPhe: 1.638 ± 0.023
1.248ValGly: 1.248 ± 0.023
1.254ValHis: 1.254 ± 0.027
3.037ValIle: 3.037 ± 0.029
3.547ValLys: 3.547 ± 0.033
3.356ValLeu: 3.356 ± 0.033
0.69ValMet: 0.69 ± 0.014
4.039ValAsn: 4.039 ± 0.039
1.065ValPro: 1.065 ± 0.018
1.342ValGln: 1.342 ± 0.024
0.983ValArg: 0.983 ± 0.017
2.529ValSer: 2.529 ± 0.031
1.567ValThr: 1.567 ± 0.025
1.652ValVal: 1.652 ± 0.025
0.223ValTrp: 0.223 ± 0.009
2.007ValTyr: 2.007 ± 0.027
0.0ValXaa: 0.0 ± 0.0
Trp
0.148TrpAla: 0.148 ± 0.007
0.095TrpCys: 0.095 ± 0.006
0.277TrpAsp: 0.277 ± 0.01
0.305TrpGlu: 0.305 ± 0.011
0.263TrpPhe: 0.263 ± 0.009
0.214TrpGly: 0.214 ± 0.007
0.078TrpHis: 0.078 ± 0.004
0.485TrpIle: 0.485 ± 0.012
0.643TrpLys: 0.643 ± 0.017
0.425TrpLeu: 0.425 ± 0.012
0.109TrpMet: 0.109 ± 0.006
0.531TrpAsn: 0.531 ± 0.013
0.1TrpPro: 0.1 ± 0.006
0.093TrpGln: 0.093 ± 0.005
0.182TrpArg: 0.182 ± 0.007
0.311TrpSer: 0.311 ± 0.01
0.199TrpThr: 0.199 ± 0.009
0.232TrpVal: 0.232 ± 0.009
0.087TrpTrp: 0.087 ± 0.005
0.217TrpTyr: 0.217 ± 0.008
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.114TyrAla: 1.114 ± 0.016
0.859TyrCys: 0.859 ± 0.018
4.36TyrAsp: 4.36 ± 0.041
3.985TyrGlu: 3.985 ± 0.033
3.086TyrPhe: 3.086 ± 0.04
1.515TyrGly: 1.515 ± 0.024
1.347TyrHis: 1.347 ± 0.024
6.336TyrIle: 6.336 ± 0.066
5.826TyrLys: 5.826 ± 0.048
4.532TyrLeu: 4.532 ± 0.039
1.444TyrMet: 1.444 ± 0.024
8.017TyrAsn: 8.017 ± 0.087
1.138TyrPro: 1.138 ± 0.019
1.217TyrGln: 1.217 ± 0.02
1.285TyrArg: 1.285 ± 0.022
3.504TyrSer: 3.504 ± 0.038
2.303TyrThr: 2.303 ± 0.028
2.455TyrVal: 2.455 ± 0.031
0.222TyrTrp: 0.222 ± 0.008
3.559TyrTyr: 3.559 ± 0.05
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6105 proteins (3936546 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski