Amino acid dipepetide frequency for Microthlaspi erraticum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.644AlaAla: 5.644 ± 0.026
1.084AlaCys: 1.084 ± 0.009
3.075AlaAsp: 3.075 ± 0.015
4.271AlaGlu: 4.271 ± 0.019
2.641AlaPhe: 2.641 ± 0.013
3.677AlaGly: 3.677 ± 0.018
1.28AlaHis: 1.28 ± 0.01
3.385AlaIle: 3.385 ± 0.015
4.011AlaLys: 4.011 ± 0.016
6.07AlaLeu: 6.07 ± 0.02
1.747AlaMet: 1.747 ± 0.011
2.451AlaAsn: 2.451 ± 0.013
2.863AlaPro: 2.863 ± 0.017
2.086AlaGln: 2.086 ± 0.012
3.624AlaArg: 3.624 ± 0.016
5.713AlaSer: 5.713 ± 0.02
3.627AlaThr: 3.627 ± 0.016
4.449AlaVal: 4.449 ± 0.019
0.793AlaTrp: 0.793 ± 0.007
1.736AlaTyr: 1.736 ± 0.012
0.0AlaXaa: 0.0 ± 0.0
Cys
0.874CysAla: 0.874 ± 0.009
0.477CysCys: 0.477 ± 0.005
0.967CysAsp: 0.967 ± 0.009
0.936CysGlu: 0.936 ± 0.008
0.93CysPhe: 0.93 ± 0.008
1.325CysGly: 1.325 ± 0.011
0.465CysHis: 0.465 ± 0.005
0.915CysIle: 0.915 ± 0.008
1.192CysLys: 1.192 ± 0.009
1.892CysLeu: 1.892 ± 0.011
0.443CysMet: 0.443 ± 0.005
0.766CysAsn: 0.766 ± 0.007
0.945CysPro: 0.945 ± 0.008
0.559CysGln: 0.559 ± 0.006
1.065CysArg: 1.065 ± 0.008
1.724CysSer: 1.724 ± 0.011
0.8CysThr: 0.8 ± 0.007
1.234CysVal: 1.234 ± 0.01
0.236CysTrp: 0.236 ± 0.004
0.576CysTyr: 0.576 ± 0.005
0.0CysXaa: 0.0 ± 0.0
Asp
3.333AspAla: 3.333 ± 0.017
0.979AspCys: 0.979 ± 0.008
3.795AspAsp: 3.795 ± 0.023
4.293AspGlu: 4.293 ± 0.018
2.474AspPhe: 2.474 ± 0.012
3.719AspGly: 3.719 ± 0.017
1.363AspHis: 1.363 ± 0.011
2.705AspIle: 2.705 ± 0.015
2.783AspLys: 2.783 ± 0.014
5.236AspLeu: 5.236 ± 0.017
1.277AspMet: 1.277 ± 0.009
1.918AspAsn: 1.918 ± 0.012
2.772AspPro: 2.772 ± 0.013
1.885AspGln: 1.885 ± 0.011
2.794AspArg: 2.794 ± 0.016
4.165AspSer: 4.165 ± 0.018
2.232AspThr: 2.232 ± 0.011
3.72AspVal: 3.72 ± 0.014
0.765AspTrp: 0.765 ± 0.007
1.59AspTyr: 1.59 ± 0.01
0.0AspXaa: 0.0 ± 0.0
Glu
4.946GluAla: 4.946 ± 0.021
0.971GluCys: 0.971 ± 0.008
4.317GluAsp: 4.317 ± 0.019
7.468GluGlu: 7.468 ± 0.038
2.451GluPhe: 2.451 ± 0.012
3.571GluGly: 3.571 ± 0.016
1.309GluHis: 1.309 ± 0.01
3.96GluIle: 3.96 ± 0.017
5.165GluLys: 5.165 ± 0.027
5.812GluLeu: 5.812 ± 0.021
1.917GluMet: 1.917 ± 0.012
2.962GluAsn: 2.962 ± 0.016
2.446GluPro: 2.446 ± 0.013
2.056GluGln: 2.056 ± 0.012
3.742GluArg: 3.742 ± 0.018
4.872GluSer: 4.872 ± 0.021
3.642GluThr: 3.642 ± 0.018
4.151GluVal: 4.151 ± 0.019
0.777GluTrp: 0.777 ± 0.007
1.671GluTyr: 1.671 ± 0.011
0.0GluXaa: 0.0 ± 0.0
Phe
2.464PheAla: 2.464 ± 0.013
0.877PheCys: 0.877 ± 0.007
2.393PheAsp: 2.393 ± 0.014
2.366PheGlu: 2.366 ± 0.013
1.912PhePhe: 1.912 ± 0.013
2.9PheGly: 2.9 ± 0.016
1.154PheHis: 1.154 ± 0.008
1.986PheIle: 1.986 ± 0.012
2.302PheLys: 2.302 ± 0.014
4.35PheLeu: 4.35 ± 0.018
1.036PheMet: 1.036 ± 0.008
1.636PheAsn: 1.636 ± 0.01
2.175PhePro: 2.175 ± 0.012
1.533PheGln: 1.533 ± 0.01
2.15PheArg: 2.15 ± 0.012
3.963PheSer: 3.963 ± 0.016
2.232PheThr: 2.232 ± 0.013
2.813PheVal: 2.813 ± 0.014
0.624PheTrp: 0.624 ± 0.007
1.225PheTyr: 1.225 ± 0.009
0.0PheXaa: 0.0 ± 0.0
Gly
3.507GlyAla: 3.507 ± 0.017
1.212GlyCys: 1.212 ± 0.01
3.449GlyAsp: 3.449 ± 0.018
3.979GlyGlu: 3.979 ± 0.017
3.19GlyPhe: 3.19 ± 0.015
5.287GlyGly: 5.287 ± 0.047
1.51GlyHis: 1.51 ± 0.011
3.313GlyIle: 3.313 ± 0.017
4.037GlyLys: 4.037 ± 0.019
5.58GlyLeu: 5.58 ± 0.02
1.468GlyMet: 1.468 ± 0.011
2.921GlyAsn: 2.921 ± 0.014
2.197GlyPro: 2.197 ± 0.013
1.993GlyGln: 1.993 ± 0.011
3.957GlyArg: 3.957 ± 0.021
5.531GlySer: 5.531 ± 0.025
3.043GlyThr: 3.043 ± 0.017
4.125GlyVal: 4.125 ± 0.028
0.869GlyTrp: 0.869 ± 0.007
2.143GlyTyr: 2.143 ± 0.014
0.0GlyXaa: 0.0 ± 0.0
His
1.321HisAla: 1.321 ± 0.009
0.527HisCys: 0.527 ± 0.006
1.142HisAsp: 1.142 ± 0.01
1.256HisGlu: 1.256 ± 0.01
1.043HisPhe: 1.043 ± 0.008
1.772HisGly: 1.772 ± 0.012
1.016HisHis: 1.016 ± 0.013
1.2HisIle: 1.2 ± 0.007
1.262HisLys: 1.262 ± 0.008
2.372HisLeu: 2.372 ± 0.013
0.61HisMet: 0.61 ± 0.007
0.899HisAsn: 0.899 ± 0.007
1.346HisPro: 1.346 ± 0.01
1.145HisGln: 1.145 ± 0.01
1.438HisArg: 1.438 ± 0.011
1.758HisSer: 1.758 ± 0.01
1.012HisThr: 1.012 ± 0.009
1.565HisVal: 1.565 ± 0.009
0.326HisTrp: 0.326 ± 0.005
0.671HisTyr: 0.671 ± 0.006
0.0HisXaa: 0.0 ± 0.0
Ile
3.274IleAla: 3.274 ± 0.014
1.019IleCys: 1.019 ± 0.009
2.947IleAsp: 2.947 ± 0.013
3.307IleGlu: 3.307 ± 0.016
2.08IlePhe: 2.08 ± 0.012
3.229IleGly: 3.229 ± 0.014
1.292IleHis: 1.292 ± 0.01
2.36IleIle: 2.36 ± 0.015
2.91IleLys: 2.91 ± 0.013
4.689IleLeu: 4.689 ± 0.018
1.06IleMet: 1.06 ± 0.008
2.043IleAsn: 2.043 ± 0.012
2.833IlePro: 2.833 ± 0.017
1.813IleGln: 1.813 ± 0.01
2.758IleArg: 2.758 ± 0.014
4.692IleSer: 4.692 ± 0.02
2.655IleThr: 2.655 ± 0.013
3.309IleVal: 3.309 ± 0.015
0.769IleTrp: 0.769 ± 0.008
1.396IleTyr: 1.396 ± 0.01
0.0IleXaa: 0.0 ± 0.0
Lys
4.131LysAla: 4.131 ± 0.02
1.001LysCys: 1.001 ± 0.008
3.317LysAsp: 3.317 ± 0.018
4.864LysGlu: 4.864 ± 0.024
2.195LysPhe: 2.195 ± 0.012
3.498LysGly: 3.498 ± 0.015
1.325LysHis: 1.325 ± 0.01
3.365LysIle: 3.365 ± 0.015
5.235LysLys: 5.235 ± 0.03
6.187LysLeu: 6.187 ± 0.024
1.562LysMet: 1.562 ± 0.011
2.499LysAsn: 2.499 ± 0.013
3.059LysPro: 3.059 ± 0.015
2.301LysGln: 2.301 ± 0.012
3.919LysArg: 3.919 ± 0.017
4.87LysSer: 4.87 ± 0.021
3.427LysThr: 3.427 ± 0.016
3.855LysVal: 3.855 ± 0.015
0.871LysTrp: 0.871 ± 0.007
1.501LysTyr: 1.501 ± 0.01
0.0LysXaa: 0.0 ± 0.0
Leu
6.153LeuAla: 6.153 ± 0.021
1.941LeuCys: 1.941 ± 0.01
5.124LeuAsp: 5.124 ± 0.02
6.171LeuGlu: 6.171 ± 0.021
3.792LeuPhe: 3.792 ± 0.018
5.745LeuGly: 5.745 ± 0.024
2.382LeuHis: 2.382 ± 0.014
4.388LeuIle: 4.388 ± 0.019
6.069LeuLys: 6.069 ± 0.025
9.302LeuLeu: 9.302 ± 0.033
2.155LeuMet: 2.155 ± 0.012
3.588LeuAsn: 3.588 ± 0.016
5.132LeuPro: 5.132 ± 0.02
3.793LeuGln: 3.793 ± 0.017
5.658LeuArg: 5.658 ± 0.02
8.212LeuSer: 8.212 ± 0.029
4.668LeuThr: 4.668 ± 0.019
6.366LeuVal: 6.366 ± 0.024
1.267LeuTrp: 1.267 ± 0.008
2.296LeuTyr: 2.296 ± 0.013
0.0LeuXaa: 0.0 ± 0.0
Met
1.948MetAla: 1.948 ± 0.011
0.361MetCys: 0.361 ± 0.005
1.523MetAsp: 1.523 ± 0.01
2.085MetGlu: 2.085 ± 0.011
0.941MetPhe: 0.941 ± 0.009
1.495MetGly: 1.495 ± 0.01
0.496MetHis: 0.496 ± 0.006
1.277MetIle: 1.277 ± 0.009
1.714MetLys: 1.714 ± 0.011
2.068MetLeu: 2.068 ± 0.012
0.767MetMet: 0.767 ± 0.008
1.004MetAsn: 1.004 ± 0.008
0.997MetPro: 0.997 ± 0.008
0.808MetGln: 0.808 ± 0.007
1.337MetArg: 1.337 ± 0.01
1.992MetSer: 1.992 ± 0.011
1.142MetThr: 1.142 ± 0.008
1.702MetVal: 1.702 ± 0.012
0.286MetTrp: 0.286 ± 0.004
0.604MetTyr: 0.604 ± 0.006
0.0MetXaa: 0.0 ± 0.0
Asn
2.451AsnAla: 2.451 ± 0.011
0.716AsnCys: 0.716 ± 0.006
1.898AsnAsp: 1.898 ± 0.013
2.391AsnGlu: 2.391 ± 0.014
1.677AsnPhe: 1.677 ± 0.01
3.194AsnGly: 3.194 ± 0.016
1.147AsnHis: 1.147 ± 0.009
2.083AsnIle: 2.083 ± 0.013
2.353AsnLys: 2.353 ± 0.012
4.252AsnLeu: 4.252 ± 0.023
1.016AsnMet: 1.016 ± 0.009
2.055AsnAsn: 2.055 ± 0.014
2.466AsnPro: 2.466 ± 0.015
1.672AsnGln: 1.672 ± 0.011
2.27AsnArg: 2.27 ± 0.011
3.423AsnSer: 3.423 ± 0.016
1.917AsnThr: 1.917 ± 0.012
2.648AsnVal: 2.648 ± 0.015
0.559AsnTrp: 0.559 ± 0.005
1.212AsnTyr: 1.212 ± 0.01
0.0AsnXaa: 0.0 ± 0.0
Pro
2.978ProAla: 2.978 ± 0.016
0.825ProCys: 0.825 ± 0.007
2.421ProAsp: 2.421 ± 0.013
3.396ProGlu: 3.396 ± 0.017
2.091ProPhe: 2.091 ± 0.012
2.651ProGly: 2.651 ± 0.014
1.138ProHis: 1.138 ± 0.009
2.325ProIle: 2.325 ± 0.012
2.949ProLys: 2.949 ± 0.018
4.326ProLeu: 4.326 ± 0.017
1.078ProMet: 1.078 ± 0.009
2.262ProAsn: 2.262 ± 0.012
4.36ProPro: 4.36 ± 0.063
1.889ProGln: 1.889 ± 0.012
3.098ProArg: 3.098 ± 0.016
5.054ProSer: 5.054 ± 0.025
2.924ProThr: 2.924 ± 0.019
3.259ProVal: 3.259 ± 0.015
0.633ProTrp: 0.633 ± 0.006
1.441ProTyr: 1.441 ± 0.014
0.0ProXaa: 0.0 ± 0.0
Gln
2.372GlnAla: 2.372 ± 0.013
0.576GlnCys: 0.576 ± 0.006
1.656GlnAsp: 1.656 ± 0.01
2.614GlnGlu: 2.614 ± 0.013
1.272GlnPhe: 1.272 ± 0.01
2.284GlnGly: 2.284 ± 0.014
0.803GlnHis: 0.803 ± 0.008
1.894GlnIle: 1.894 ± 0.011
2.046GlnLys: 2.046 ± 0.013
3.222GlnLeu: 3.222 ± 0.017
0.886GlnMet: 0.886 ± 0.008
1.565GlnAsn: 1.565 ± 0.011
1.785GlnPro: 1.785 ± 0.014
1.907GlnGln: 1.907 ± 0.02
2.418GlnArg: 2.418 ± 0.013
2.78GlnSer: 2.78 ± 0.013
1.906GlnThr: 1.906 ± 0.011
2.268GlnVal: 2.268 ± 0.012
0.471GlnTrp: 0.471 ± 0.006
0.774GlnTyr: 0.774 ± 0.007
0.0GlnXaa: 0.0 ± 0.0
Arg
3.468ArgAla: 3.468 ± 0.017
1.104ArgCys: 1.104 ± 0.009
2.932ArgAsp: 2.932 ± 0.015
3.733ArgGlu: 3.733 ± 0.018
2.626ArgPhe: 2.626 ± 0.012
3.495ArgGly: 3.495 ± 0.019
1.33ArgHis: 1.33 ± 0.01
2.956ArgIle: 2.956 ± 0.015
3.94ArgLys: 3.94 ± 0.017
5.482ArgLeu: 5.482 ± 0.019
1.396ArgMet: 1.396 ± 0.01
2.594ArgAsn: 2.594 ± 0.013
3.07ArgPro: 3.07 ± 0.022
1.913ArgGln: 1.913 ± 0.013
4.619ArgArg: 4.619 ± 0.024
5.359ArgSer: 5.359 ± 0.024
2.744ArgThr: 2.744 ± 0.014
3.723ArgVal: 3.723 ± 0.017
0.844ArgTrp: 0.844 ± 0.008
1.601ArgTyr: 1.601 ± 0.01
0.0ArgXaa: 0.0 ± 0.0
Ser
5.014SerAla: 5.014 ± 0.02
1.635SerCys: 1.635 ± 0.012
4.42SerAsp: 4.42 ± 0.019
4.868SerGlu: 4.868 ± 0.024
3.96SerPhe: 3.96 ± 0.017
5.658SerGly: 5.658 ± 0.026
2.041SerHis: 2.041 ± 0.012
4.228SerIle: 4.228 ± 0.017
5.023SerLys: 5.023 ± 0.019
8.699SerLeu: 8.699 ± 0.03
2.062SerMet: 2.062 ± 0.013
3.631SerAsn: 3.631 ± 0.019
4.872SerPro: 4.872 ± 0.035
2.981SerGln: 2.981 ± 0.015
5.092SerArg: 5.092 ± 0.021
11.416SerSer: 11.416 ± 0.044
4.684SerThr: 4.684 ± 0.02
5.305SerVal: 5.305 ± 0.02
1.224SerTrp: 1.224 ± 0.01
2.341SerTyr: 2.341 ± 0.013
0.0SerXaa: 0.0 ± 0.0
Thr
3.319ThrAla: 3.319 ± 0.016
0.989ThrCys: 0.989 ± 0.008
2.394ThrAsp: 2.394 ± 0.013
3.063ThrGlu: 3.063 ± 0.017
2.156ThrPhe: 2.156 ± 0.013
3.281ThrGly: 3.281 ± 0.017
1.143ThrHis: 1.143 ± 0.009
2.65ThrIle: 2.65 ± 0.015
3.183ThrLys: 3.183 ± 0.016
4.686ThrLeu: 4.686 ± 0.019
1.309ThrMet: 1.309 ± 0.008
2.089ThrAsn: 2.089 ± 0.011
2.736ThrPro: 2.736 ± 0.016
1.691ThrGln: 1.691 ± 0.011
3.016ThrArg: 3.016 ± 0.016
4.66ThrSer: 4.66 ± 0.017
3.384ThrThr: 3.384 ± 0.017
3.483ThrVal: 3.483 ± 0.016
0.824ThrTrp: 0.824 ± 0.008
1.328ThrTyr: 1.328 ± 0.01
0.0ThrXaa: 0.0 ± 0.0
Val
4.6ValAla: 4.6 ± 0.017
1.205ValCys: 1.205 ± 0.009
3.719ValAsp: 3.719 ± 0.014
4.664ValGlu: 4.664 ± 0.018
2.829ValPhe: 2.829 ± 0.014
3.713ValGly: 3.713 ± 0.024
1.44ValHis: 1.44 ± 0.011
3.304ValIle: 3.304 ± 0.018
4.158ValLys: 4.158 ± 0.016
6.142ValLeu: 6.142 ± 0.023
1.652ValMet: 1.652 ± 0.01
2.545ValAsn: 2.545 ± 0.013
3.216ValPro: 3.216 ± 0.015
2.114ValGln: 2.114 ± 0.012
3.325ValArg: 3.325 ± 0.014
5.664ValSer: 5.664 ± 0.017
3.347ValThr: 3.347 ± 0.015
4.898ValVal: 4.898 ± 0.021
0.878ValTrp: 0.878 ± 0.008
2.087ValTyr: 2.087 ± 0.015
0.0ValXaa: 0.0 ± 0.0
Trp
0.742TrpAla: 0.742 ± 0.007
0.251TrpCys: 0.251 ± 0.004
0.783TrpAsp: 0.783 ± 0.007
0.886TrpGlu: 0.886 ± 0.008
0.613TrpPhe: 0.613 ± 0.006
0.731TrpGly: 0.731 ± 0.008
0.279TrpHis: 0.279 ± 0.005
0.806TrpIle: 0.806 ± 0.007
0.985TrpLys: 0.985 ± 0.009
1.195TrpLeu: 1.195 ± 0.01
0.369TrpMet: 0.369 ± 0.004
0.732TrpAsn: 0.732 ± 0.007
0.534TrpPro: 0.534 ± 0.006
0.435TrpGln: 0.435 ± 0.005
1.051TrpArg: 1.051 ± 0.009
1.135TrpSer: 1.135 ± 0.01
0.69TrpThr: 0.69 ± 0.007
0.818TrpVal: 0.818 ± 0.007
0.257TrpTrp: 0.257 ± 0.004
0.376TrpTyr: 0.376 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.69TyrAla: 1.69 ± 0.012
0.572TyrCys: 0.572 ± 0.006
1.524TyrAsp: 1.524 ± 0.01
1.596TyrGlu: 1.596 ± 0.011
1.216TyrPhe: 1.216 ± 0.01
2.084TyrGly: 2.084 ± 0.015
0.767TyrHis: 0.767 ± 0.008
1.337TyrIle: 1.337 ± 0.009
1.666TyrLys: 1.666 ± 0.011
2.643TyrLeu: 2.643 ± 0.016
0.738TyrMet: 0.738 ± 0.006
1.238TyrAsn: 1.238 ± 0.009
1.293TyrPro: 1.293 ± 0.01
0.926TyrGln: 0.926 ± 0.007
1.598TyrArg: 1.598 ± 0.011
2.17TyrSer: 2.17 ± 0.017
1.307TyrThr: 1.307 ± 0.009
1.806TyrVal: 1.806 ± 0.013
0.395TyrTrp: 0.395 ± 0.005
0.987TyrTyr: 0.987 ± 0.011
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.007XaaXaa: 0.007 ± 0.004
Statistics based on 45482 proteins (16402413 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski