Amino acid dipeptide frequency for Phytophthora megakarya

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
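As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter toy sequences, and the choice to count overlapping pairs that never span two proteins are assumptions made for this example; the page does not state how Proteome-pI counts dipeptides.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given proteins.

    Assumption: pairs are overlapping (positions 0-1, 1-2, ...) and are
    counted within each protein only, never across protein boundaries.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input in one-letter code (not taken from the proteome).
toy = ["MKKLA", "AAKL"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

With real data the sequences would come from the proteome's FASTA file, and the one-letter pairs would map to the three-letter labels used in the table (for example `KL` to LysLeu).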

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
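The comparison against the uniform expectation can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and using a simple greater-than or less-than test against 1/400 is an assumption about how the marking is decided; the four observed values are copied from the table.

```python
def expected_per_mille(n_residues=20):
    """Uniform expectation for one dipeptide, in per mille: 1000 / n_residues**2."""
    return 1000.0 / (n_residues ** 2)


def classify(observed, expected):
    """Label a per-mille frequency relative to the uniform expectation."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


expected = expected_per_mille()  # 20 standard amino acids -> 400 dipeptides

# A few values copied from the table (per mille).
observed = {"LeuLeu": 9.284, "AlaAla": 7.918, "CysCys": 0.391, "TrpTrp": 0.243}
labels = {pair: classify(value, expected) for pair, value in observed.items()}

for pair, value in observed.items():
    print(f"{pair}: {value:.3f} vs {expected:.3f} expected -> {labels[pair]}")
```

Calling `expected_per_mille(21)` gives the corresponding expectation when Xaa is included as a 21st residue (1000/441, about 2.27‰).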

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
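A minimal sketch of the unit conversion, using the LeuLeu value from the table; the function names are hypothetical and chosen only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10**-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


leu_leu = 9.284  # LeuLeu frequency from the table, in per mille
fraction = per_mille_to_fraction(leu_leu)
percent = per_mille_to_percent(leu_leu)
print(f"{leu_leu} per mille = {fraction:.6f} (fraction) = {percent:.4f}%")
```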

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error). Rows give the first residue, columns the second residue.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 7.918 ± 0.037 | 1.173 ± 0.011 | 4.375 ± 0.021 | 5.122 ± 0.026 | 3.058 ± 0.019 | 4.374 ± 0.022 | 1.715 ± 0.012 | 3.762 ± 0.021 | 4.52 ± 0.022 | 7.535 ± 0.032 | 2.197 ± 0.014 | 2.909 ± 0.016 | 3.684 ± 0.025 | 3.135 ± 0.019 | 4.984 ± 0.024 | 6.425 ± 0.027 | 5.351 ± 0.024 | 5.727 ± 0.024 | 0.934 ± 0.009 | 1.922 ± 0.013 | 0.002 ± 0.0 |
| Cys | 1.26 ± 0.01 | 0.391 ± 0.007 | 0.953 ± 0.011 | 0.916 ± 0.009 | 0.65 ± 0.007 | 1.289 ± 0.012 | 0.427 ± 0.006 | 0.832 ± 0.008 | 0.792 ± 0.009 | 1.401 ± 0.011 | 0.422 ± 0.006 | 0.596 ± 0.009 | 0.805 ± 0.013 | 0.57 ± 0.008 | 1.001 ± 0.01 | 1.357 ± 0.013 | 0.985 ± 0.012 | 1.27 ± 0.01 | 0.27 ± 0.005 | 0.486 ± 0.006 | 0.0 ± 0.0 |
| Asp | 4.899 ± 0.025 | 0.855 ± 0.01 | 4.545 ± 0.03 | 4.927 ± 0.027 | 2.287 ± 0.014 | 3.737 ± 0.02 | 1.366 ± 0.012 | 2.777 ± 0.017 | 2.564 ± 0.018 | 5.273 ± 0.024 | 1.435 ± 0.011 | 2.085 ± 0.014 | 2.824 ± 0.018 | 2.097 ± 0.015 | 3.294 ± 0.023 | 4.158 ± 0.024 | 2.988 ± 0.016 | 4.423 ± 0.021 | 0.762 ± 0.009 | 1.643 ± 0.013 | 0.001 ± 0.0 |
| Glu | 5.323 ± 0.026 | 1.004 ± 0.01 | 4.389 ± 0.024 | 5.253 ± 0.033 | 2.333 ± 0.015 | 3.235 ± 0.018 | 1.566 ± 0.012 | 3.039 ± 0.019 | 3.737 ± 0.021 | 6.204 ± 0.031 | 1.769 ± 0.013 | 2.573 ± 0.016 | 2.351 ± 0.015 | 2.64 ± 0.019 | 4.359 ± 0.029 | 4.18 ± 0.019 | 3.556 ± 0.021 | 4.388 ± 0.023 | 0.959 ± 0.01 | 1.963 ± 0.013 | 0.001 ± 0.0 |
| Phe | 2.894 ± 0.019 | 0.721 ± 0.009 | 2.3 ± 0.014 | 2.256 ± 0.013 | 1.413 ± 0.014 | 2.65 ± 0.017 | 0.935 ± 0.009 | 1.6 ± 0.015 | 1.708 ± 0.015 | 3.394 ± 0.02 | 0.983 ± 0.009 | 1.393 ± 0.012 | 1.651 ± 0.013 | 1.578 ± 0.013 | 2.303 ± 0.015 | 2.752 ± 0.017 | 2.084 ± 0.015 | 2.898 ± 0.019 | 0.522 ± 0.008 | 1.161 ± 0.01 | 0.0 ± 0.0 |
| Gly | 4.285 ± 0.023 | 1.094 ± 0.009 | 3.525 ± 0.024 | 3.418 ± 0.018 | 2.361 ± 0.015 | 4.199 ± 0.036 | 1.561 ± 0.013 | 2.883 ± 0.019 | 3.108 ± 0.018 | 4.848 ± 0.024 | 1.496 ± 0.013 | 2.403 ± 0.015 | 2.077 ± 0.015 | 1.949 ± 0.014 | 3.559 ± 0.021 | 5.219 ± 0.031 | 3.408 ± 0.023 | 4.221 ± 0.021 | 0.874 ± 0.008 | 1.95 ± 0.013 | 0.001 ± 0.0 |
| His | 1.792 ± 0.013 | 0.462 ± 0.007 | 1.416 ± 0.013 | 1.617 ± 0.013 | 1.117 ± 0.011 | 1.669 ± 0.015 | 0.752 ± 0.011 | 1.094 ± 0.011 | 1.058 ± 0.012 | 2.335 ± 0.016 | 0.56 ± 0.007 | 0.82 ± 0.009 | 1.277 ± 0.01 | 1.072 ± 0.011 | 1.69 ± 0.014 | 1.676 ± 0.013 | 1.202 ± 0.011 | 1.906 ± 0.014 | 0.431 ± 0.006 | 0.746 ± 0.009 | 0.0 ± 0.0 |
| Ile | 3.771 ± 0.021 | 0.814 ± 0.009 | 2.778 ± 0.018 | 2.71 ± 0.016 | 1.639 ± 0.015 | 2.472 ± 0.016 | 1.127 ± 0.011 | 1.793 ± 0.012 | 2.257 ± 0.014 | 3.997 ± 0.021 | 1.067 ± 0.01 | 1.622 ± 0.012 | 2.519 ± 0.015 | 1.994 ± 0.016 | 2.914 ± 0.018 | 3.498 ± 0.02 | 2.543 ± 0.018 | 3.273 ± 0.018 | 0.59 ± 0.008 | 1.28 ± 0.012 | 0.0 ± 0.0 |
| Lys | 4.135 ± 0.021 | 0.878 ± 0.01 | 2.551 ± 0.019 | 3.362 ± 0.022 | 1.845 ± 0.015 | 2.524 ± 0.016 | 1.311 ± 0.011 | 2.469 ± 0.018 | 3.966 ± 0.028 | 5.336 ± 0.026 | 1.483 ± 0.014 | 2.108 ± 0.014 | 2.377 ± 0.018 | 2.269 ± 0.016 | 4.021 ± 0.023 | 3.726 ± 0.021 | 3.413 ± 0.017 | 3.283 ± 0.019 | 0.901 ± 0.009 | 1.635 ± 0.013 | 0.001 ± 0.0 |
| Leu | 7.262 ± 0.029 | 1.578 ± 0.013 | 5.409 ± 0.025 | 6.076 ± 0.027 | 3.283 ± 0.02 | 4.912 ± 0.022 | 2.514 ± 0.021 | 3.533 ± 0.022 | 5.133 ± 0.026 | 9.284 ± 0.044 | 2.205 ± 0.016 | 3.445 ± 0.019 | 4.601 ± 0.025 | 4.403 ± 0.024 | 6.729 ± 0.03 | 6.722 ± 0.028 | 5.004 ± 0.02 | 6.264 ± 0.025 | 1.185 ± 0.01 | 2.417 ± 0.015 | 0.002 ± 0.0 |
| Met | 2.073 ± 0.014 | 0.425 ± 0.007 | 1.561 ± 0.013 | 1.783 ± 0.015 | 0.841 ± 0.01 | 1.293 ± 0.013 | 0.619 ± 0.008 | 1.186 ± 0.011 | 1.542 ± 0.011 | 2.399 ± 0.014 | 0.712 ± 0.008 | 1.016 ± 0.009 | 1.132 ± 0.009 | 1.138 ± 0.01 | 1.617 ± 0.014 | 1.853 ± 0.015 | 1.659 ± 0.011 | 1.641 ± 0.013 | 0.361 ± 0.005 | 0.672 ± 0.008 | 0.0 ± 0.0 |
| Asn | 3.359 ± 0.019 | 0.667 ± 0.008 | 2.148 ± 0.014 | 2.403 ± 0.014 | 1.324 ± 0.011 | 2.778 ± 0.02 | 0.833 ± 0.008 | 1.73 ± 0.013 | 1.814 ± 0.013 | 3.164 ± 0.02 | 0.936 ± 0.009 | 1.502 ± 0.016 | 1.891 ± 0.015 | 1.4 ± 0.014 | 2.187 ± 0.016 | 2.716 ± 0.016 | 2.178 ± 0.015 | 2.809 ± 0.016 | 0.53 ± 0.007 | 1.158 ± 0.01 | 0.0 ± 0.0 |
| Pro | 3.736 ± 0.024 | 0.595 ± 0.008 | 2.743 ± 0.018 | 3.175 ± 0.018 | 1.738 ± 0.014 | 2.764 ± 0.02 | 1.058 ± 0.009 | 1.926 ± 0.013 | 2.388 ± 0.015 | 3.795 ± 0.021 | 1.124 ± 0.011 | 1.698 ± 0.012 | 3.005 ± 0.024 | 1.781 ± 0.013 | 3.01 ± 0.019 | 4.149 ± 0.022 | 3.487 ± 0.022 | 3.476 ± 0.021 | 0.652 ± 0.009 | 1.036 ± 0.011 | 0.002 ± 0.0 |
| Gln | 3.129 ± 0.017 | 0.606 ± 0.007 | 2.23 ± 0.016 | 2.61 ± 0.019 | 1.394 ± 0.012 | 2.025 ± 0.014 | 1.166 ± 0.01 | 1.787 ± 0.012 | 2.074 ± 0.015 | 4.168 ± 0.029 | 1.069 ± 0.01 | 1.436 ± 0.012 | 1.739 ± 0.012 | 2.567 ± 0.029 | 3.232 ± 0.02 | 2.701 ± 0.015 | 2.377 ± 0.015 | 2.686 ± 0.015 | 0.669 ± 0.008 | 1.112 ± 0.01 | 0.0 ± 0.0 |
| Arg | 4.783 ± 0.024 | 1.078 ± 0.011 | 3.414 ± 0.02 | 4.128 ± 0.024 | 2.617 ± 0.016 | 3.557 ± 0.019 | 1.823 ± 0.016 | 3.033 ± 0.02 | 3.692 ± 0.022 | 6.328 ± 0.03 | 1.746 ± 0.014 | 2.324 ± 0.014 | 2.753 ± 0.018 | 2.941 ± 0.019 | 5.513 ± 0.038 | 5.25 ± 0.028 | 3.536 ± 0.018 | 4.559 ± 0.025 | 1.082 ± 0.01 | 1.899 ± 0.013 | 0.001 ± 0.0 |
| Ser | 6.099 ± 0.029 | 1.203 ± 0.011 | 4.759 ± 0.022 | 4.426 ± 0.022 | 2.83 ± 0.016 | 5.048 ± 0.028 | 1.661 ± 0.014 | 3.367 ± 0.019 | 3.905 ± 0.021 | 6.341 ± 0.028 | 1.874 ± 0.012 | 2.844 ± 0.018 | 3.927 ± 0.019 | 2.585 ± 0.017 | 4.866 ± 0.026 | 7.828 ± 0.041 | 5.574 ± 0.029 | 5.239 ± 0.022 | 1.068 ± 0.01 | 1.856 ± 0.014 | 0.001 ± 0.0 |
| Thr | 4.923 ± 0.024 | 0.992 ± 0.01 | 3.089 ± 0.021 | 3.469 ± 0.018 | 2.126 ± 0.014 | 3.527 ± 0.023 | 1.312 ± 0.011 | 2.732 ± 0.02 | 3.345 ± 0.019 | 5.481 ± 0.025 | 1.513 ± 0.012 | 2.256 ± 0.014 | 3.738 ± 0.024 | 2.252 ± 0.015 | 3.769 ± 0.022 | 5.089 ± 0.025 | 4.625 ± 0.026 | 3.918 ± 0.024 | 0.823 ± 0.009 | 1.464 ± 0.011 | 0.001 ± 0.0 |
| Val | 6.007 ± 0.025 | 1.238 ± 0.011 | 4.417 ± 0.022 | 4.595 ± 0.023 | 2.67 ± 0.017 | 3.871 ± 0.023 | 1.797 ± 0.013 | 3.049 ± 0.017 | 3.81 ± 0.022 | 6.645 ± 0.028 | 1.816 ± 0.013 | 2.71 ± 0.018 | 3.312 ± 0.018 | 2.743 ± 0.017 | 4.141 ± 0.021 | 5.094 ± 0.022 | 4.153 ± 0.026 | 5.605 ± 0.028 | 0.94 ± 0.01 | 2.017 ± 0.013 | 0.002 ± 0.0 |
| Trp | 0.913 ± 0.01 | 0.286 ± 0.005 | 0.793 ± 0.008 | 0.8 ± 0.008 | 0.532 ± 0.009 | 0.744 ± 0.011 | 0.355 ± 0.006 | 0.884 ± 0.011 | 0.952 ± 0.01 | 1.37 ± 0.012 | 0.438 ± 0.007 | 0.675 ± 0.009 | 0.463 ± 0.006 | 0.519 ± 0.007 | 1.004 ± 0.01 | 0.994 ± 0.011 | 0.848 ± 0.008 | 0.994 ± 0.011 | 0.243 ± 0.005 | 0.462 ± 0.007 | 0.0 ± 0.0 |
| Tyr | 2.261 ± 0.013 | 0.613 ± 0.008 | 1.543 ± 0.012 | 1.776 ± 0.013 | 1.134 ± 0.01 | 1.768 ± 0.015 | 0.792 ± 0.009 | 1.184 ± 0.01 | 1.229 ± 0.012 | 2.662 ± 0.017 | 0.676 ± 0.008 | 1.111 ± 0.011 | 1.21 ± 0.011 | 1.146 ± 0.011 | 1.863 ± 0.013 | 1.913 ± 0.011 | 1.474 ± 0.015 | 2.051 ± 0.015 | 0.471 ± 0.006 | 0.997 ± 0.01 | 0.0 ± 0.0 |
| Xaa | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.002 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.442 ± 0.053 |

Statistics based on 34747 proteins (10966191 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
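One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the reported error are all assumptions for this example; the note above only states that bootstrapping was done 100 times at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_per_mille(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille frequency
    over n_boot resamples of whole proteins, drawn with replacement.
    n_boot must be at least 2 for the standard deviation to be defined."""
    if not sequences:
        raise ValueError("need at least one protein sequence")
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not residues: each draw keeps a protein intact.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code (not real data).
toy_proteome = ["MKKLAAL", "AAKLLS", "MLLKA", "GLLLA"]
mean_ll, err_ll = bootstrap_per_mille(toy_proteome, "LL")
print(f"LL: {mean_ll:.1f} ± {err_ll:.1f} per mille")
```

Resampling at the protein level keeps residues from the same protein together, so the estimated error reflects protein-to-protein variation rather than treating every dipeptide as an independent observation.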

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski