Amino acid dipepetide frequency for Planococcus massiliensis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.7AlaAla: 8.7 ± 0.135
0.55AlaCys: 0.55 ± 0.024
4.331AlaAsp: 4.331 ± 0.063
6.637AlaGlu: 6.637 ± 0.112
3.978AlaPhe: 3.978 ± 0.071
6.402AlaGly: 6.402 ± 0.097
1.463AlaHis: 1.463 ± 0.046
6.682AlaIle: 6.682 ± 0.102
5.071AlaLys: 5.071 ± 0.074
8.301AlaLeu: 8.301 ± 0.094
2.56AlaMet: 2.56 ± 0.044
2.949AlaAsn: 2.949 ± 0.065
2.657AlaPro: 2.657 ± 0.058
2.578AlaGln: 2.578 ± 0.056
2.918AlaArg: 2.918 ± 0.057
4.717AlaSer: 4.717 ± 0.073
3.718AlaThr: 3.718 ± 0.057
6.616AlaVal: 6.616 ± 0.086
0.68AlaTrp: 0.68 ± 0.03
2.661AlaTyr: 2.661 ± 0.054
0.0AlaXaa: 0.0 ± 0.0
Cys
0.39CysAla: 0.39 ± 0.024
0.056CysCys: 0.056 ± 0.009
0.293CysAsp: 0.293 ± 0.017
0.384CysGlu: 0.384 ± 0.022
0.242CysPhe: 0.242 ± 0.017
0.576CysGly: 0.576 ± 0.025
0.16CysHis: 0.16 ± 0.012
0.37CysIle: 0.37 ± 0.022
0.235CysLys: 0.235 ± 0.018
0.533CysLeu: 0.533 ± 0.024
0.121CysMet: 0.121 ± 0.012
0.19CysAsn: 0.19 ± 0.015
0.28CysPro: 0.28 ± 0.019
0.204CysGln: 0.204 ± 0.016
0.279CysArg: 0.279 ± 0.017
0.387CysSer: 0.387 ± 0.022
0.305CysThr: 0.305 ± 0.019
0.316CysVal: 0.316 ± 0.018
0.056CysTrp: 0.056 ± 0.008
0.183CysTyr: 0.183 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
4.189AspAla: 4.189 ± 0.07
0.282AspCys: 0.282 ± 0.018
2.372AspAsp: 2.372 ± 0.059
4.557AspGlu: 4.557 ± 0.078
2.74AspPhe: 2.74 ± 0.056
3.683AspGly: 3.683 ± 0.071
1.081AspHis: 1.081 ± 0.03
3.695AspIle: 3.695 ± 0.063
2.47AspLys: 2.47 ± 0.054
5.094AspLeu: 5.094 ± 0.077
1.432AspMet: 1.432 ± 0.037
1.502AspAsn: 1.502 ± 0.045
2.239AspPro: 2.239 ± 0.049
1.911AspGln: 1.911 ± 0.043
2.438AspArg: 2.438 ± 0.054
2.727AspSer: 2.727 ± 0.055
2.304AspThr: 2.304 ± 0.051
3.628AspVal: 3.628 ± 0.068
0.664AspTrp: 0.664 ± 0.026
2.068AspTyr: 2.068 ± 0.047
0.0AspXaa: 0.0 ± 0.0
Glu
6.92GluAla: 6.92 ± 0.096
0.31GluCys: 0.31 ± 0.019
4.107GluAsp: 4.107 ± 0.068
7.857GluGlu: 7.857 ± 0.116
2.946GluPhe: 2.946 ± 0.057
4.809GluGly: 4.809 ± 0.077
1.5GluHis: 1.5 ± 0.038
5.541GluIle: 5.541 ± 0.09
6.283GluLys: 6.283 ± 0.096
7.743GluLeu: 7.743 ± 0.098
2.707GluMet: 2.707 ± 0.054
3.505GluAsn: 3.505 ± 0.065
2.307GluPro: 2.307 ± 0.055
3.58GluGln: 3.58 ± 0.062
3.837GluArg: 3.837 ± 0.081
3.902GluSer: 3.902 ± 0.068
4.356GluThr: 4.356 ± 0.07
5.321GluVal: 5.321 ± 0.091
0.969GluTrp: 0.969 ± 0.033
2.043GluTyr: 2.043 ± 0.049
0.0GluXaa: 0.0 ± 0.0
Phe
3.654PheAla: 3.654 ± 0.063
0.269PheCys: 0.269 ± 0.018
2.568PheAsp: 2.568 ± 0.055
3.169PheGlu: 3.169 ± 0.055
2.361PhePhe: 2.361 ± 0.06
3.633PheGly: 3.633 ± 0.075
1.022PheHis: 1.022 ± 0.036
3.61PheIle: 3.61 ± 0.076
2.333PheLys: 2.333 ± 0.052
4.585PheLeu: 4.585 ± 0.087
1.268PheMet: 1.268 ± 0.041
1.747PheAsn: 1.747 ± 0.045
1.667PhePro: 1.667 ± 0.041
1.59PheGln: 1.59 ± 0.043
1.589PheArg: 1.589 ± 0.043
3.191PheSer: 3.191 ± 0.065
2.62PheThr: 2.62 ± 0.055
3.186PheVal: 3.186 ± 0.06
0.501PheTrp: 0.501 ± 0.028
1.64PheTyr: 1.64 ± 0.039
0.0PheXaa: 0.0 ± 0.0
Gly
5.538GlyAla: 5.538 ± 0.083
0.475GlyCys: 0.475 ± 0.025
3.4GlyAsp: 3.4 ± 0.065
4.894GlyGlu: 4.894 ± 0.076
3.613GlyPhe: 3.613 ± 0.069
5.197GlyGly: 5.197 ± 0.093
1.43GlyHis: 1.43 ± 0.041
6.071GlyIle: 6.071 ± 0.098
4.842GlyLys: 4.842 ± 0.082
6.91GlyLeu: 6.91 ± 0.095
2.328GlyMet: 2.328 ± 0.047
2.631GlyAsn: 2.631 ± 0.062
1.933GlyPro: 1.933 ± 0.045
2.478GlyGln: 2.478 ± 0.056
2.885GlyArg: 2.885 ± 0.058
4.358GlySer: 4.358 ± 0.065
4.265GlyThr: 4.265 ± 0.079
5.008GlyVal: 5.008 ± 0.083
0.81GlyTrp: 0.81 ± 0.032
2.612GlyTyr: 2.612 ± 0.056
0.0GlyXaa: 0.0 ± 0.0
His
1.551HisAla: 1.551 ± 0.043
0.168HisCys: 0.168 ± 0.014
0.958HisAsp: 0.958 ± 0.029
1.45HisGlu: 1.45 ± 0.041
1.051HisPhe: 1.051 ± 0.033
1.4HisGly: 1.4 ± 0.039
0.606HisHis: 0.606 ± 0.027
1.354HisIle: 1.354 ± 0.036
0.839HisLys: 0.839 ± 0.028
2.049HisLeu: 2.049 ± 0.053
0.516HisMet: 0.516 ± 0.022
0.608HisAsn: 0.608 ± 0.027
1.174HisPro: 1.174 ± 0.038
0.825HisGln: 0.825 ± 0.031
0.941HisArg: 0.941 ± 0.032
1.167HisSer: 1.167 ± 0.036
1.014HisThr: 1.014 ± 0.031
1.345HisVal: 1.345 ± 0.038
0.21HisTrp: 0.21 ± 0.016
0.798HisTyr: 0.798 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
6.59IleAla: 6.59 ± 0.088
0.489IleCys: 0.489 ± 0.023
4.175IleAsp: 4.175 ± 0.062
5.755IleGlu: 5.755 ± 0.082
3.044IlePhe: 3.044 ± 0.068
6.011IleGly: 6.011 ± 0.098
1.535IleHis: 1.535 ± 0.043
4.881IleIle: 4.881 ± 0.098
3.415IleLys: 3.415 ± 0.057
6.733IleLeu: 6.733 ± 0.093
1.718IleMet: 1.718 ± 0.044
2.525IleAsn: 2.525 ± 0.052
3.103IlePro: 3.103 ± 0.054
2.771IleGln: 2.771 ± 0.054
3.238IleArg: 3.238 ± 0.054
4.655IleSer: 4.655 ± 0.075
3.796IleThr: 3.796 ± 0.063
5.277IleVal: 5.277 ± 0.086
0.654IleTrp: 0.654 ± 0.028
2.216IleTyr: 2.216 ± 0.049
0.0IleXaa: 0.0 ± 0.0
Lys
4.937LysAla: 4.937 ± 0.08
0.218LysCys: 0.218 ± 0.014
3.204LysAsp: 3.204 ± 0.062
6.293LysGlu: 6.293 ± 0.089
1.924LysPhe: 1.924 ± 0.044
4.065LysGly: 4.065 ± 0.07
1.07LysHis: 1.07 ± 0.033
3.98LysIle: 3.98 ± 0.073
5.062LysLys: 5.062 ± 0.083
5.238LysLeu: 5.238 ± 0.073
2.17LysMet: 2.17 ± 0.046
2.686LysAsn: 2.686 ± 0.054
2.176LysPro: 2.176 ± 0.049
2.614LysGln: 2.614 ± 0.055
3.105LysArg: 3.105 ± 0.058
3.29LysSer: 3.29 ± 0.063
3.501LysThr: 3.501 ± 0.061
4.087LysVal: 4.087 ± 0.072
0.757LysTrp: 0.757 ± 0.031
1.717LysTyr: 1.717 ± 0.049
0.0LysXaa: 0.0 ± 0.0
Leu
8.804LeuAla: 8.804 ± 0.104
0.513LeuCys: 0.513 ± 0.024
4.764LeuAsp: 4.764 ± 0.075
7.225LeuGlu: 7.225 ± 0.093
4.718LeuPhe: 4.718 ± 0.103
6.519LeuGly: 6.519 ± 0.096
1.861LeuHis: 1.861 ± 0.045
6.863LeuIle: 6.863 ± 0.103
6.245LeuLys: 6.245 ± 0.088
10.149LeuLeu: 10.149 ± 0.134
2.755LeuMet: 2.755 ± 0.048
3.87LeuAsn: 3.87 ± 0.067
4.183LeuPro: 4.183 ± 0.074
3.539LeuGln: 3.539 ± 0.065
3.657LeuArg: 3.657 ± 0.063
6.495LeuSer: 6.495 ± 0.084
5.452LeuThr: 5.452 ± 0.077
6.48LeuVal: 6.48 ± 0.095
0.875LeuTrp: 0.875 ± 0.032
3.033LeuTyr: 3.033 ± 0.062
0.0LeuXaa: 0.0 ± 0.0
Met
2.764MetAla: 2.764 ± 0.059
0.11MetCys: 0.11 ± 0.01
1.679MetAsp: 1.679 ± 0.042
2.433MetGlu: 2.433 ± 0.053
1.076MetPhe: 1.076 ± 0.033
1.811MetGly: 1.811 ± 0.044
0.493MetHis: 0.493 ± 0.022
2.001MetIle: 2.001 ± 0.047
2.543MetLys: 2.543 ± 0.052
2.628MetLeu: 2.628 ± 0.052
0.893MetMet: 0.893 ± 0.031
1.466MetAsn: 1.466 ± 0.033
1.241MetPro: 1.241 ± 0.039
0.97MetGln: 0.97 ± 0.034
1.202MetArg: 1.202 ± 0.036
1.594MetSer: 1.594 ± 0.04
1.754MetThr: 1.754 ± 0.042
1.784MetVal: 1.784 ± 0.045
0.212MetTrp: 0.212 ± 0.015
0.72MetTyr: 0.72 ± 0.026
0.0MetXaa: 0.0 ± 0.0
Asn
2.946AsnAla: 2.946 ± 0.063
0.233AsnCys: 0.233 ± 0.016
1.858AsnAsp: 1.858 ± 0.041
3.16AsnGlu: 3.16 ± 0.058
1.601AsnPhe: 1.601 ± 0.047
3.273AsnGly: 3.273 ± 0.07
0.829AsnHis: 0.829 ± 0.026
2.548AsnIle: 2.548 ± 0.052
2.087AsnLys: 2.087 ± 0.047
3.581AsnLeu: 3.581 ± 0.064
1.101AsnMet: 1.101 ± 0.038
1.355AsnAsn: 1.355 ± 0.043
2.058AsnPro: 2.058 ± 0.049
1.372AsnGln: 1.372 ± 0.035
1.965AsnArg: 1.965 ± 0.046
2.174AsnSer: 2.174 ± 0.056
1.787AsnThr: 1.787 ± 0.047
2.595AsnVal: 2.595 ± 0.057
0.44AsnTrp: 0.44 ± 0.022
1.38AsnTyr: 1.38 ± 0.043
0.0AsnXaa: 0.0 ± 0.0
Pro
3.161ProAla: 3.161 ± 0.061
0.165ProCys: 0.165 ± 0.012
2.127ProAsp: 2.127 ± 0.051
3.771ProGlu: 3.771 ± 0.064
1.977ProPhe: 1.977 ± 0.044
2.439ProGly: 2.439 ± 0.056
0.829ProHis: 0.829 ± 0.033
2.664ProIle: 2.664 ± 0.051
2.196ProLys: 2.196 ± 0.048
3.679ProLeu: 3.679 ± 0.063
0.94ProMet: 0.94 ± 0.034
1.507ProAsn: 1.507 ± 0.037
1.022ProPro: 1.022 ± 0.038
1.152ProGln: 1.152 ± 0.037
1.079ProArg: 1.079 ± 0.034
2.244ProSer: 2.244 ± 0.052
1.812ProThr: 1.812 ± 0.045
3.163ProVal: 3.163 ± 0.053
0.364ProTrp: 0.364 ± 0.019
1.351ProTyr: 1.351 ± 0.038
0.0ProXaa: 0.0 ± 0.0
Gln
3.219GlnAla: 3.219 ± 0.06
0.162GlnCys: 0.162 ± 0.013
1.592GlnAsp: 1.592 ± 0.045
2.99GlnGlu: 2.99 ± 0.065
1.549GlnPhe: 1.549 ± 0.038
2.175GlnGly: 2.175 ± 0.056
0.757GlnHis: 0.757 ± 0.025
2.351GlnIle: 2.351 ± 0.053
2.535GlnLys: 2.535 ± 0.057
4.065GlnLeu: 4.065 ± 0.076
1.208GlnMet: 1.208 ± 0.038
1.53GlnAsn: 1.53 ± 0.038
1.232GlnPro: 1.232 ± 0.037
1.896GlnGln: 1.896 ± 0.052
1.559GlnArg: 1.559 ± 0.043
1.947GlnSer: 1.947 ± 0.05
1.922GlnThr: 1.922 ± 0.044
2.307GlnVal: 2.307 ± 0.053
0.404GlnTrp: 0.404 ± 0.02
1.152GlnTyr: 1.152 ± 0.038
0.0GlnXaa: 0.0 ± 0.0
Arg
2.771ArgAla: 2.771 ± 0.068
0.209ArgCys: 0.209 ± 0.015
2.012ArgAsp: 2.012 ± 0.047
3.315ArgGlu: 3.315 ± 0.063
2.147ArgPhe: 2.147 ± 0.044
2.462ArgGly: 2.462 ± 0.055
0.876ArgHis: 0.876 ± 0.031
3.226ArgIle: 3.226 ± 0.062
3.101ArgLys: 3.101 ± 0.062
4.333ArgLeu: 4.333 ± 0.063
1.351ArgMet: 1.351 ± 0.038
1.799ArgAsn: 1.799 ± 0.041
1.437ArgPro: 1.437 ± 0.04
1.679ArgGln: 1.679 ± 0.041
1.846ArgArg: 1.846 ± 0.052
2.324ArgSer: 2.324 ± 0.053
2.191ArgThr: 2.191 ± 0.048
2.681ArgVal: 2.681 ± 0.051
0.388ArgTrp: 0.388 ± 0.02
1.513ArgTyr: 1.513 ± 0.037
0.0ArgXaa: 0.0 ± 0.0
Ser
4.592SerAla: 4.592 ± 0.07
0.327SerCys: 0.327 ± 0.015
2.82SerAsp: 2.82 ± 0.058
4.128SerGlu: 4.128 ± 0.065
3.206SerPhe: 3.206 ± 0.063
4.95SerGly: 4.95 ± 0.071
1.186SerHis: 1.186 ± 0.034
4.582SerIle: 4.582 ± 0.075
3.331SerLys: 3.331 ± 0.063
5.812SerLeu: 5.812 ± 0.088
1.763SerMet: 1.763 ± 0.041
2.077SerAsn: 2.077 ± 0.047
2.175SerPro: 2.175 ± 0.048
1.883SerGln: 1.883 ± 0.044
2.505SerArg: 2.505 ± 0.059
3.79SerSer: 3.79 ± 0.073
3.037SerThr: 3.037 ± 0.051
4.115SerVal: 4.115 ± 0.07
0.647SerTrp: 0.647 ± 0.024
2.127SerTyr: 2.127 ± 0.052
0.0SerXaa: 0.0 ± 0.0
Thr
4.837ThrAla: 4.837 ± 0.075
0.253ThrCys: 0.253 ± 0.014
2.892ThrAsp: 2.892 ± 0.053
3.965ThrGlu: 3.965 ± 0.067
2.376ThrPhe: 2.376 ± 0.048
4.468ThrGly: 4.468 ± 0.067
1.074ThrHis: 1.074 ± 0.033
4.014ThrIle: 4.014 ± 0.064
2.88ThrLys: 2.88 ± 0.058
4.96ThrLeu: 4.96 ± 0.063
1.341ThrMet: 1.341 ± 0.037
1.968ThrAsn: 1.968 ± 0.052
2.289ThrPro: 2.289 ± 0.052
1.425ThrGln: 1.425 ± 0.034
1.858ThrArg: 1.858 ± 0.044
3.007ThrSer: 3.007 ± 0.062
2.658ThrThr: 2.658 ± 0.064
4.27ThrVal: 4.27 ± 0.068
0.496ThrTrp: 0.496 ± 0.022
1.712ThrTyr: 1.712 ± 0.047
0.0ThrXaa: 0.0 ± 0.0
Val
5.447ValAla: 5.447 ± 0.098
0.46ValCys: 0.46 ± 0.022
3.65ValAsp: 3.65 ± 0.066
5.268ValGlu: 5.268 ± 0.09
3.379ValPhe: 3.379 ± 0.062
4.607ValGly: 4.607 ± 0.084
1.353ValHis: 1.353 ± 0.04
5.199ValIle: 5.199 ± 0.085
4.256ValLys: 4.256 ± 0.066
7.204ValLeu: 7.204 ± 0.103
1.985ValMet: 1.985 ± 0.044
2.76ValAsn: 2.76 ± 0.057
2.807ValPro: 2.807 ± 0.05
2.41ValGln: 2.41 ± 0.049
2.733ValArg: 2.733 ± 0.058
4.541ValSer: 4.541 ± 0.065
3.858ValThr: 3.858 ± 0.067
4.938ValVal: 4.938 ± 0.085
0.656ValTrp: 0.656 ± 0.026
2.236ValTyr: 2.236 ± 0.05
0.0ValXaa: 0.0 ± 0.0
Trp
0.687TrpAla: 0.687 ± 0.023
0.058TrpCys: 0.058 ± 0.007
0.477TrpAsp: 0.477 ± 0.025
0.68TrpGlu: 0.68 ± 0.025
0.525TrpPhe: 0.525 ± 0.025
0.673TrpGly: 0.673 ± 0.029
0.21TrpHis: 0.21 ± 0.015
0.823TrpIle: 0.823 ± 0.031
0.688TrpLys: 0.688 ± 0.028
1.181TrpLeu: 1.181 ± 0.045
0.366TrpMet: 0.366 ± 0.02
0.545TrpAsn: 0.545 ± 0.027
0.296TrpPro: 0.296 ± 0.017
0.431TrpGln: 0.431 ± 0.022
0.418TrpArg: 0.418 ± 0.022
0.562TrpSer: 0.562 ± 0.024
0.609TrpThr: 0.609 ± 0.027
0.645TrpVal: 0.645 ± 0.026
0.131TrpTrp: 0.131 ± 0.011
0.285TrpTyr: 0.285 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.473TyrAla: 2.473 ± 0.047
0.25TyrCys: 0.25 ± 0.017
1.797TyrAsp: 1.797 ± 0.044
2.613TyrGlu: 2.613 ± 0.056
1.745TyrPhe: 1.745 ± 0.043
2.526TyrGly: 2.526 ± 0.056
0.689TyrHis: 0.689 ± 0.022
2.127TyrIle: 2.127 ± 0.043
1.682TyrLys: 1.682 ± 0.045
3.222TyrLeu: 3.222 ± 0.06
0.851TyrMet: 0.851 ± 0.027
1.123TyrAsn: 1.123 ± 0.033
1.367TyrPro: 1.367 ± 0.035
1.244TyrGln: 1.244 ± 0.034
1.608TyrArg: 1.608 ± 0.043
1.983TyrSer: 1.983 ± 0.044
1.783TyrThr: 1.783 ± 0.043
1.984TyrVal: 1.984 ± 0.051
0.378TyrTrp: 0.378 ± 0.021
1.227TyrTyr: 1.227 ± 0.038
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3353 proteins (986772 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski