Amino acid dipeptide frequency for Plasmodium malariae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
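As an illustration of how such a frequency could be computed, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs within each protein, that pairs never span two proteins, and that any non-standard letter is mapped to X (Xaa). The function name dipeptide_permille and the toy sequences are hypothetical, and this is not necessarily how the values on this page were produced.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real Plasmodium malariae sequences).
freqs = dipeptide_permille(["MKKLNN", "MKU"])
```

With this toy input there are seven pairs in total, two of which are MK, so freqs["MK"] is 2/7 of 1000‰.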

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins at random with equal probability, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
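The expected value and the colouring rule can be sketched in a few lines of Python. This is only an illustration: it assumes a uniform expectation over the 400 standard dipeptides and a simple above/below comparison that ignores the error estimate; the exact rule used to colour the table is not stated here, and the name classify is hypothetical.

```python
N_STANDARD = 20

# Uniform expectation: 1 / 400 of all dipeptides = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / N_STANDARD ** 2

def classify(permille, expected=EXPECTED_PERMILLE):
    """Return the colour a value would get under a plain threshold rule."""
    if permille > expected:
        return "red"    # more frequent than expected
    if permille < expected:
        return "blue"   # underrepresented
    return "neutral"

# Two values taken from the table: LysLys (18.312) and TrpTrp (0.042).
lys_lys_colour = classify(18.312)
trp_trp_colour = classify(0.042)
```

Under this rule LysLys would be marked red and TrpTrp blue.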

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
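For example, the unit conversion can be written as the following small Python sketch. The helper names are hypothetical; the LysLys value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

lys_lys = 18.312                           # LysLys value from the table, in per mille
fraction = permille_to_fraction(lys_lys)   # about 0.018312
percent = permille_to_percent(lys_lys)     # about 1.8312 %
```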

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.142AlaAla: 1.142 ± 0.03
0.535AlaCys: 0.535 ± 0.013
1.541AlaAsp: 1.541 ± 0.03
1.752AlaGlu: 1.752 ± 0.027
1.201AlaPhe: 1.201 ± 0.019
1.058AlaGly: 1.058 ± 0.022
0.802AlaHis: 0.802 ± 0.016
1.988AlaIle: 1.988 ± 0.03
2.521AlaLys: 2.521 ± 0.031
2.403AlaLeu: 2.403 ± 0.027
0.484AlaMet: 0.484 ± 0.012
2.771AlaAsn: 2.771 ± 0.042
0.702AlaPro: 0.702 ± 0.015
0.901AlaGln: 0.901 ± 0.019
0.881AlaArg: 0.881 ± 0.017
2.262AlaSer: 2.262 ± 0.029
1.346AlaThr: 1.346 ± 0.025
1.391AlaVal: 1.391 ± 0.021
0.159AlaTrp: 0.159 ± 0.007
1.397AlaTyr: 1.397 ± 0.019
0.0AlaXaa: 0.0 ± 0.0
Cys
0.675CysAla: 0.675 ± 0.014
0.375CysCys: 0.375 ± 0.011
0.958CysAsp: 0.958 ± 0.016
1.168CysGlu: 1.168 ± 0.019
0.924CysPhe: 0.924 ± 0.017
0.751CysGly: 0.751 ± 0.016
0.341CysHis: 0.341 ± 0.009
1.732CysIle: 1.732 ± 0.022
1.81CysLys: 1.81 ± 0.024
1.605CysLeu: 1.605 ± 0.021
0.372CysMet: 0.372 ± 0.01
1.99CysAsn: 1.99 ± 0.031
0.468CysPro: 0.468 ± 0.014
0.353CysGln: 0.353 ± 0.011
0.632CysArg: 0.632 ± 0.015
1.961CysSer: 1.961 ± 0.027
1.084CysThr: 1.084 ± 0.017
0.988CysVal: 0.988 ± 0.016
0.079CysTrp: 0.079 ± 0.004
0.908CysTyr: 0.908 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
1.671AspAla: 1.671 ± 0.027
0.736AspCys: 0.736 ± 0.014
4.468AspAsp: 4.468 ± 0.071
5.513AspGlu: 5.513 ± 0.066
2.168AspPhe: 2.168 ± 0.027
2.233AspGly: 2.233 ± 0.033
1.034AspHis: 1.034 ± 0.016
5.536AspIle: 5.536 ± 0.042
5.875AspLys: 5.875 ± 0.06
3.747AspLeu: 3.747 ± 0.037
1.354AspMet: 1.354 ± 0.024
5.8AspAsn: 5.8 ± 0.074
0.912AspPro: 0.912 ± 0.017
1.186AspGln: 1.186 ± 0.019
1.799AspArg: 1.799 ± 0.031
3.905AspSer: 3.905 ± 0.039
2.369AspThr: 2.369 ± 0.028
2.938AspVal: 2.938 ± 0.038
0.245AspTrp: 0.245 ± 0.009
2.584AspTyr: 2.584 ± 0.027
0.0AspXaa: 0.0 ± 0.0
Glu
2.127GluAla: 2.127 ± 0.032
1.181GluCys: 1.181 ± 0.021
4.529GluAsp: 4.529 ± 0.062
9.067GluGlu: 9.067 ± 0.147
2.058GluPhe: 2.058 ± 0.028
3.156GluGly: 3.156 ± 0.042
1.702GluHis: 1.702 ± 0.025
5.277GluIle: 5.277 ± 0.051
11.108GluLys: 11.108 ± 0.082
4.972GluLeu: 4.972 ± 0.053
1.584GluMet: 1.584 ± 0.026
8.271GluAsn: 8.271 ± 0.063
0.868GluPro: 0.868 ± 0.018
2.834GluGln: 2.834 ± 0.037
2.95GluArg: 2.95 ± 0.045
3.875GluSer: 3.875 ± 0.059
2.484GluThr: 2.484 ± 0.034
3.064GluVal: 3.064 ± 0.052
0.476GluTrp: 0.476 ± 0.012
3.197GluTyr: 3.197 ± 0.035
0.0GluXaa: 0.0 ± 0.0
Phe
1.073PheAla: 1.073 ± 0.019
0.983PheCys: 0.983 ± 0.017
2.389PheAsp: 2.389 ± 0.029
2.63PheGlu: 2.63 ± 0.03
3.706PhePhe: 3.706 ± 0.05
1.289PheGly: 1.289 ± 0.025
1.117PheHis: 1.117 ± 0.015
4.206PheIle: 4.206 ± 0.049
3.797PheLys: 3.797 ± 0.038
5.508PheLeu: 5.508 ± 0.055
0.908PheMet: 0.908 ± 0.015
3.948PheAsn: 3.948 ± 0.036
1.123PhePro: 1.123 ± 0.019
1.093PheGln: 1.093 ± 0.019
1.318PheArg: 1.318 ± 0.019
3.802PheSer: 3.802 ± 0.036
1.912PheThr: 1.912 ± 0.022
2.148PheVal: 2.148 ± 0.03
0.243PheTrp: 0.243 ± 0.008
2.807PheTyr: 2.807 ± 0.038
0.0PheXaa: 0.0 ± 0.0
Gly
1.313GlyAla: 1.313 ± 0.023
0.683GlyCys: 0.683 ± 0.017
2.003GlyAsp: 2.003 ± 0.03
3.006GlyGlu: 3.006 ± 0.04
1.23GlyPhe: 1.23 ± 0.02
2.592GlyGly: 2.592 ± 0.045
0.726GlyHis: 0.726 ± 0.015
3.019GlyIle: 3.019 ± 0.036
4.504GlyLys: 4.504 ± 0.048
2.272GlyLeu: 2.272 ± 0.029
0.83GlyMet: 0.83 ± 0.019
4.448GlyAsn: 4.448 ± 0.057
0.538GlyPro: 0.538 ± 0.017
0.875GlyGln: 0.875 ± 0.017
1.648GlyArg: 1.648 ± 0.027
3.686GlySer: 3.686 ± 0.056
1.932GlyThr: 1.932 ± 0.027
1.995GlyVal: 1.995 ± 0.026
0.198GlyTrp: 0.198 ± 0.007
1.505GlyTyr: 1.505 ± 0.021
0.0GlyXaa: 0.0 ± 0.0
His
0.645HisAla: 0.645 ± 0.014
0.339HisCys: 0.339 ± 0.01
1.049HisAsp: 1.049 ± 0.016
1.267HisGlu: 1.267 ± 0.019
1.391HisPhe: 1.391 ± 0.019
0.712HisGly: 0.712 ± 0.014
0.524HisHis: 0.524 ± 0.015
2.121HisIle: 2.121 ± 0.025
1.98HisLys: 1.98 ± 0.025
1.944HisLeu: 1.944 ± 0.025
0.621HisMet: 0.621 ± 0.014
2.317HisAsn: 2.317 ± 0.035
0.585HisPro: 0.585 ± 0.026
0.46HisGln: 0.46 ± 0.012
0.705HisArg: 0.705 ± 0.015
1.696HisSer: 1.696 ± 0.025
1.025HisThr: 1.025 ± 0.021
1.174HisVal: 1.174 ± 0.02
0.101HisTrp: 0.101 ± 0.005
1.013HisTyr: 1.013 ± 0.018
0.0HisXaa: 0.0 ± 0.0
Ile
1.948IleAla: 1.948 ± 0.025
1.937IleCys: 1.937 ± 0.026
4.234IleAsp: 4.234 ± 0.04
4.923IleGlu: 4.923 ± 0.051
4.776IlePhe: 4.776 ± 0.05
2.602IleGly: 2.602 ± 0.029
1.855IleHis: 1.855 ± 0.023
7.057IleIle: 7.057 ± 0.062
8.809IleLys: 8.809 ± 0.063
7.949IleLeu: 7.949 ± 0.072
1.482IleMet: 1.482 ± 0.02
9.121IleAsn: 9.121 ± 0.072
2.177IlePro: 2.177 ± 0.033
2.099IleGln: 2.099 ± 0.027
2.637IleArg: 2.637 ± 0.026
6.599IleSer: 6.599 ± 0.057
3.327IleThr: 3.327 ± 0.035
3.218IleVal: 3.218 ± 0.034
0.469IleTrp: 0.469 ± 0.011
4.99IleTyr: 4.99 ± 0.053
0.0IleXaa: 0.0 ± 0.0
Lys
2.286LysAla: 2.286 ± 0.027
2.127LysCys: 2.127 ± 0.025
5.859LysAsp: 5.859 ± 0.06
10.095LysGlu: 10.095 ± 0.089
3.64LysPhe: 3.64 ± 0.03
4.366LysGly: 4.366 ± 0.038
2.174LysHis: 2.174 ± 0.025
9.08LysIle: 9.08 ± 0.062
18.312LysLys: 18.312 ± 0.148
7.576LysLeu: 7.576 ± 0.05
2.341LysMet: 2.341 ± 0.029
13.506LysAsn: 13.506 ± 0.079
1.372LysPro: 1.372 ± 0.022
2.922LysGln: 2.922 ± 0.032
5.114LysArg: 5.114 ± 0.048
6.653LysSer: 6.653 ± 0.054
4.038LysThr: 4.038 ± 0.035
3.868LysVal: 3.868 ± 0.039
0.758LysTrp: 0.758 ± 0.015
6.118LysTyr: 6.118 ± 0.045
0.0LysXaa: 0.0 ± 0.0
Leu
1.999LeuAla: 1.999 ± 0.031
1.817LeuCys: 1.817 ± 0.028
3.636LeuAsp: 3.636 ± 0.037
4.556LeuGlu: 4.556 ± 0.048
4.843LeuPhe: 4.843 ± 0.055
2.458LeuGly: 2.458 ± 0.035
1.97LeuHis: 1.97 ± 0.028
6.158LeuIle: 6.158 ± 0.054
9.102LeuLys: 9.102 ± 0.062
8.13LeuLeu: 8.13 ± 0.076
1.464LeuMet: 1.464 ± 0.021
8.317LeuAsn: 8.317 ± 0.059
2.051LeuPro: 2.051 ± 0.031
2.307LeuGln: 2.307 ± 0.026
2.889LeuArg: 2.889 ± 0.03
6.384LeuSer: 6.384 ± 0.048
3.374LeuThr: 3.374 ± 0.031
2.895LeuVal: 2.895 ± 0.031
0.481LeuTrp: 0.481 ± 0.012
4.359LeuTyr: 4.359 ± 0.04
0.0LeuXaa: 0.0 ± 0.0
Met
0.476MetAla: 0.476 ± 0.013
0.427MetCys: 0.427 ± 0.011
1.212MetAsp: 1.212 ± 0.022
1.613MetGlu: 1.613 ± 0.024
0.873MetPhe: 0.873 ± 0.016
0.746MetGly: 0.746 ± 0.016
0.636MetHis: 0.636 ± 0.015
1.255MetIle: 1.255 ± 0.019
2.484MetLys: 2.484 ± 0.026
1.558MetLeu: 1.558 ± 0.022
0.462MetMet: 0.462 ± 0.014
2.989MetAsn: 2.989 ± 0.053
0.456MetPro: 0.456 ± 0.015
0.658MetGln: 0.658 ± 0.014
0.676MetArg: 0.676 ± 0.015
1.412MetSer: 1.412 ± 0.022
0.699MetThr: 0.699 ± 0.013
0.694MetVal: 0.694 ± 0.015
0.109MetTrp: 0.109 ± 0.005
1.055MetTyr: 1.055 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
3.129AsnAla: 3.129 ± 0.042
2.093AsnCys: 2.093 ± 0.033
7.267AsnAsp: 7.267 ± 0.084
8.932AsnGlu: 8.932 ± 0.067
4.954AsnPhe: 4.954 ± 0.047
4.464AsnGly: 4.464 ± 0.06
1.806AsnHis: 1.806 ± 0.028
10.798AsnIle: 10.798 ± 0.078
11.342AsnLys: 11.342 ± 0.076
7.232AsnLeu: 7.232 ± 0.058
2.706AsnMet: 2.706 ± 0.043
17.69AsnAsn: 17.69 ± 0.315
1.558AsnPro: 1.558 ± 0.029
1.898AsnGln: 1.898 ± 0.027
3.567AsnArg: 3.567 ± 0.042
10.61AsnSer: 10.61 ± 0.142
4.91AsnThr: 4.91 ± 0.052
5.597AsnVal: 5.597 ± 0.05
0.397AsnTrp: 0.397 ± 0.011
6.011AsnTyr: 6.011 ± 0.057
0.0AsnXaa: 0.0 ± 0.0
Pro
0.478ProAla: 0.478 ± 0.013
0.378ProCys: 0.378 ± 0.012
0.836ProAsp: 0.836 ± 0.018
1.009ProGlu: 1.009 ± 0.024
1.313ProPhe: 1.313 ± 0.019
0.611ProGly: 0.611 ± 0.017
0.535ProHis: 0.535 ± 0.013
1.586ProIle: 1.586 ± 0.023
1.511ProLys: 1.511 ± 0.022
2.031ProLeu: 2.031 ± 0.031
0.36ProMet: 0.36 ± 0.012
1.839ProAsn: 1.839 ± 0.03
0.767ProPro: 0.767 ± 0.025
0.615ProGln: 0.615 ± 0.016
0.607ProArg: 0.607 ± 0.012
1.846ProSer: 1.846 ± 0.025
0.936ProThr: 0.936 ± 0.022
0.954ProVal: 0.954 ± 0.017
0.132ProTrp: 0.132 ± 0.006
1.168ProTyr: 1.168 ± 0.021
0.0ProXaa: 0.0 ± 0.0
Gln
0.657GlnAla: 0.657 ± 0.015
0.395GlnCys: 0.395 ± 0.011
1.172GlnAsp: 1.172 ± 0.018
1.875GlnGlu: 1.875 ± 0.031
0.868GlnPhe: 0.868 ± 0.016
0.96GlnGly: 0.96 ± 0.019
0.601GlnHis: 0.601 ± 0.016
2.09GlnIle: 2.09 ± 0.027
3.18GlnLys: 3.18 ± 0.029
1.972GlnLeu: 1.972 ± 0.028
0.674GlnMet: 0.674 ± 0.016
3.268GlnAsn: 3.268 ± 0.039
0.453GlnPro: 0.453 ± 0.014
1.081GlnGln: 1.081 ± 0.043
1.03GlnArg: 1.03 ± 0.019
1.594GlnSer: 1.594 ± 0.021
1.082GlnThr: 1.082 ± 0.018
1.199GlnVal: 1.199 ± 0.022
0.178GlnTrp: 0.178 ± 0.008
1.11GlnTyr: 1.11 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
0.954ArgAla: 0.954 ± 0.016
0.616ArgCys: 0.616 ± 0.014
1.812ArgAsp: 1.812 ± 0.027
2.819ArgGlu: 2.819 ± 0.041
1.04ArgPhe: 1.04 ± 0.017
1.917ArgGly: 1.917 ± 0.03
0.692ArgHis: 0.692 ± 0.016
2.659ArgIle: 2.659 ± 0.025
4.993ArgLys: 4.993 ± 0.047
2.152ArgLeu: 2.152 ± 0.025
0.743ArgMet: 0.743 ± 0.016
4.313ArgAsn: 4.313 ± 0.047
0.489ArgPro: 0.489 ± 0.012
0.811ArgGln: 0.811 ± 0.013
2.074ArgArg: 2.074 ± 0.041
3.19ArgSer: 3.19 ± 0.051
1.614ArgThr: 1.614 ± 0.019
1.408ArgVal: 1.408 ± 0.028
0.204ArgTrp: 0.204 ± 0.008
1.443ArgTyr: 1.443 ± 0.021
0.0ArgXaa: 0.0 ± 0.0
Ser
2.409SerAla: 2.409 ± 0.036
1.649SerCys: 1.649 ± 0.027
4.295SerAsp: 4.295 ± 0.044
4.803SerGlu: 4.803 ± 0.057
3.836SerPhe: 3.836 ± 0.035
3.763SerGly: 3.763 ± 0.049
1.52SerHis: 1.52 ± 0.024
5.827SerIle: 5.827 ± 0.039
6.946SerLys: 6.946 ± 0.052
6.061SerLeu: 6.061 ± 0.042
1.4SerMet: 1.4 ± 0.026
10.484SerAsn: 10.484 ± 0.144
1.596SerPro: 1.596 ± 0.026
1.593SerGln: 1.593 ± 0.021
2.744SerArg: 2.744 ± 0.043
10.069SerSer: 10.069 ± 0.125
4.123SerThr: 4.123 ± 0.042
3.478SerVal: 3.478 ± 0.034
0.382SerTrp: 0.382 ± 0.011
3.91SerTyr: 3.91 ± 0.033
0.0SerXaa: 0.0 ± 0.0
Thr
1.431ThrAla: 1.431 ± 0.028
0.984ThrCys: 0.984 ± 0.018
2.317ThrAsp: 2.317 ± 0.031
2.542ThrGlu: 2.542 ± 0.035
2.101ThrPhe: 2.101 ± 0.023
1.56ThrGly: 1.56 ± 0.022
1.107ThrHis: 1.107 ± 0.022
3.183ThrIle: 3.183 ± 0.035
4.012ThrLys: 4.012 ± 0.041
3.406ThrLeu: 3.406 ± 0.031
0.702ThrMet: 0.702 ± 0.013
5.04ThrAsn: 5.04 ± 0.058
1.205ThrPro: 1.205 ± 0.028
1.199ThrGln: 1.199 ± 0.02
1.352ThrArg: 1.352 ± 0.021
3.693ThrSer: 3.693 ± 0.043
2.429ThrThr: 2.429 ± 0.036
1.908ThrVal: 1.908 ± 0.025
0.283ThrTrp: 0.283 ± 0.008
2.382ThrTyr: 2.382 ± 0.026
0.0ThrXaa: 0.0 ± 0.0
Val
1.228ValAla: 1.228 ± 0.025
0.907ValCys: 0.907 ± 0.016
2.863ValAsp: 2.863 ± 0.034
3.269ValGlu: 3.269 ± 0.053
1.851ValPhe: 1.851 ± 0.022
1.789ValGly: 1.789 ± 0.029
1.388ValHis: 1.388 ± 0.022
3.162ValIle: 3.162 ± 0.034
4.393ValLys: 4.393 ± 0.038
3.824ValLeu: 3.824 ± 0.042
0.696ValMet: 0.696 ± 0.014
4.569ValAsn: 4.569 ± 0.046
1.131ValPro: 1.131 ± 0.018
1.467ValGln: 1.467 ± 0.019
1.54ValArg: 1.54 ± 0.022
3.412ValSer: 3.412 ± 0.036
1.78ValThr: 1.78 ± 0.025
2.089ValVal: 2.089 ± 0.029
0.227ValTrp: 0.227 ± 0.008
2.123ValTyr: 2.123 ± 0.024
0.0ValXaa: 0.0 ± 0.0
Trp
0.143TrpAla: 0.143 ± 0.006
0.104TrpCys: 0.104 ± 0.005
0.273TrpAsp: 0.273 ± 0.009
0.343TrpGlu: 0.343 ± 0.008
0.275TrpPhe: 0.275 ± 0.009
0.265TrpGly: 0.265 ± 0.008
0.084TrpHis: 0.084 ± 0.005
0.606TrpIle: 0.606 ± 0.015
0.669TrpLys: 0.669 ± 0.013
0.463TrpLeu: 0.463 ± 0.012
0.137TrpMet: 0.137 ± 0.006
0.539TrpAsn: 0.539 ± 0.011
0.097TrpPro: 0.097 ± 0.006
0.094TrpGln: 0.094 ± 0.005
0.202TrpArg: 0.202 ± 0.008
0.367TrpSer: 0.367 ± 0.01
0.222TrpThr: 0.222 ± 0.009
0.277TrpVal: 0.277 ± 0.008
0.042TrpTrp: 0.042 ± 0.003
0.199TrpTyr: 0.199 ± 0.008
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.453TyrAla: 1.453 ± 0.017
0.909TyrCys: 0.909 ± 0.017
3.36TyrAsp: 3.36 ± 0.034
3.595TyrGlu: 3.595 ± 0.032
2.951TyrPhe: 2.951 ± 0.037
1.712TyrGly: 1.712 ± 0.024
1.057TyrHis: 1.057 ± 0.019
4.867TyrIle: 4.867 ± 0.052
4.859TyrLys: 4.859 ± 0.044
4.387TyrLeu: 4.387 ± 0.045
1.168TyrMet: 1.168 ± 0.019
5.738TyrAsn: 5.738 ± 0.056
0.965TyrPro: 0.965 ± 0.015
0.992TyrGln: 0.992 ± 0.015
1.578TyrArg: 1.578 ± 0.02
3.873TyrSer: 3.873 ± 0.036
2.147TyrThr: 2.147 ± 0.025
2.425TyrVal: 2.425 ± 0.024
0.241TyrTrp: 0.241 ± 0.008
3.004TyrTyr: 3.004 ± 0.04
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5915 proteins (4135924 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level
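A protein-level bootstrap of this kind could look like the Python sketch below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates. These are assumptions made for illustration; the function names, the seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input (not real Plasmodium malariae sequences).
toy = ["MKKLNN", "MKKK", "AKKA", "MNNK"]
mean_kk, err_kk = bootstrap_error(toy, "KK")
```

Resampling at the protein level keeps each protein's residues together, so the estimate reflects variation between proteins rather than between individual positions.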

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski