Amino acid dipeptide frequency for Porphyromonas catoniae ATCC 51270

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides should be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
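As an illustration of the calculation described above, the following is a minimal sketch rather than the database's actual pipeline. It assumes that dipeptides are counted as overlapping pairs within each protein (never across protein boundaries) and that sequences use one-letter codes. The function name and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping windows of length 2 taken within
    each protein, so no pair spans two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" (one-letter amino acid codes).
toy_proteome = ["MALLA", "MLLK"]
freqs = dipeptide_per_mille(toy_proteome)

# Uniform expectation for 400 equally likely dipeptides: 0.25% = 2.5 per mille.
uniform_expectation = 1000.0 / 400

# Dipeptides that would be flagged as overrepresented under this baseline.
over = sorted(p for p, f in freqs.items() if f > uniform_expectation)
```

With only seven pairs in the toy input, every observed dipeptide exceeds the 2.5‰ baseline. On a real proteome the comparison becomes informative because most of the 400 pairs are observed many times.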

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
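The unit conversion can be sketched as follows, using the LeuLeu value from the table (11.54‰) and the uniform baseline of 2.5‰. The helper name is hypothetical, and the ratio to the uniform baseline is only one possible way to express over-representation.

```python
PER_MILLE = 1e-3  # 1 per mille = 10^-3


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * PER_MILLE


leu_leu = per_mille_to_fraction(11.54)  # LeuLeu value taken from the table
uniform = per_mille_to_fraction(2.5)    # 1/400 expressed in per mille

# Ratio of the observed frequency to the uniform expectation.
enrichment = leu_leu / uniform
```

Under these assumptions LeuLeu occurs roughly 4.6 times more often than the uniform expectation.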

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.299AlaAla: 5.299 ± 0.127
0.751AlaCys: 0.751 ± 0.042
3.824AlaAsp: 3.824 ± 0.096
5.646AlaGlu: 5.646 ± 0.117
3.299AlaPhe: 3.299 ± 0.075
5.365AlaGly: 5.365 ± 0.106
1.765AlaHis: 1.765 ± 0.056
4.586AlaIle: 4.586 ± 0.092
4.355AlaLys: 4.355 ± 0.09
9.716AlaLeu: 9.716 ± 0.136
1.872AlaMet: 1.872 ± 0.064
2.554AlaAsn: 2.554 ± 0.09
3.304AlaPro: 3.304 ± 0.083
3.293AlaGln: 3.293 ± 0.084
4.672AlaArg: 4.672 ± 0.094
5.723AlaSer: 5.723 ± 0.122
4.594AlaThr: 4.594 ± 0.083
4.765AlaVal: 4.765 ± 0.09
0.891AlaTrp: 0.891 ± 0.044
3.188AlaTyr: 3.188 ± 0.079
0.0AlaXaa: 0.0 ± 0.0
Cys
0.648CysAla: 0.648 ± 0.033
0.095CysCys: 0.095 ± 0.013
0.448CysAsp: 0.448 ± 0.029
0.455CysGlu: 0.455 ± 0.026
0.408CysPhe: 0.408 ± 0.026
0.774CysGly: 0.774 ± 0.046
0.284CysHis: 0.284 ± 0.023
0.57CysIle: 0.57 ± 0.032
0.398CysLys: 0.398 ± 0.027
0.967CysLeu: 0.967 ± 0.045
0.181CysMet: 0.181 ± 0.02
0.288CysAsn: 0.288 ± 0.025
0.526CysPro: 0.526 ± 0.033
0.253CysGln: 0.253 ± 0.022
0.576CysArg: 0.576 ± 0.031
0.665CysSer: 0.665 ± 0.036
0.472CysThr: 0.472 ± 0.032
0.507CysVal: 0.507 ± 0.026
0.081CysTrp: 0.081 ± 0.012
0.346CysTyr: 0.346 ± 0.024
0.0CysXaa: 0.0 ± 0.0
Asp
4.084AspAla: 4.084 ± 0.083
0.417AspCys: 0.417 ± 0.028
2.14AspAsp: 2.14 ± 0.057
3.955AspGlu: 3.955 ± 0.084
2.573AspPhe: 2.573 ± 0.064
3.738AspGly: 3.738 ± 0.086
1.019AspHis: 1.019 ± 0.041
3.238AspIle: 3.238 ± 0.081
2.983AspLys: 2.983 ± 0.088
5.432AspLeu: 5.432 ± 0.088
1.105AspMet: 1.105 ± 0.043
1.739AspAsn: 1.739 ± 0.054
2.33AspPro: 2.33 ± 0.064
1.504AspGln: 1.504 ± 0.045
3.085AspArg: 3.085 ± 0.068
2.942AspSer: 2.942 ± 0.082
2.714AspThr: 2.714 ± 0.062
3.161AspVal: 3.161 ± 0.079
0.689AspTrp: 0.689 ± 0.033
2.58AspTyr: 2.58 ± 0.069
0.0AspXaa: 0.0 ± 0.0
Glu
6.726GluAla: 6.726 ± 0.133
0.484GluCys: 0.484 ± 0.029
3.524GluAsp: 3.524 ± 0.082
5.977GluGlu: 5.977 ± 0.123
1.844GluPhe: 1.844 ± 0.06
5.268GluGly: 5.268 ± 0.094
1.675GluHis: 1.675 ± 0.059
3.524GluIle: 3.524 ± 0.09
3.152GluLys: 3.152 ± 0.094
7.686GluLeu: 7.686 ± 0.118
1.399GluMet: 1.399 ± 0.049
1.83GluAsn: 1.83 ± 0.058
2.082GluPro: 2.082 ± 0.063
3.018GluGln: 3.018 ± 0.083
4.827GluArg: 4.827 ± 0.104
2.975GluSer: 2.975 ± 0.075
3.243GluThr: 3.243 ± 0.081
5.125GluVal: 5.125 ± 0.102
0.691GluTrp: 0.691 ± 0.035
2.401GluTyr: 2.401 ± 0.061
0.0GluXaa: 0.0 ± 0.0
Phe
3.321PheAla: 3.321 ± 0.082
0.396PheCys: 0.396 ± 0.026
2.539PheAsp: 2.539 ± 0.068
2.028PheGlu: 2.028 ± 0.056
1.968PhePhe: 1.968 ± 0.068
3.021PheGly: 3.021 ± 0.078
0.801PheHis: 0.801 ± 0.036
2.633PheIle: 2.633 ± 0.068
1.534PheLys: 1.534 ± 0.062
4.026PheLeu: 4.026 ± 0.1
0.867PheMet: 0.867 ± 0.039
1.36PheAsn: 1.36 ± 0.057
1.689PhePro: 1.689 ± 0.049
1.044PheGln: 1.044 ± 0.043
2.206PheArg: 2.206 ± 0.062
3.361PheSer: 3.361 ± 0.084
2.606PheThr: 2.606 ± 0.076
2.8PheVal: 2.8 ± 0.072
0.414PheTrp: 0.414 ± 0.026
1.415PheTyr: 1.415 ± 0.054
0.0PheXaa: 0.0 ± 0.0
Gly
5.627GlyAla: 5.627 ± 0.105
0.743GlyCys: 0.743 ± 0.039
3.538GlyAsp: 3.538 ± 0.088
4.724GlyGlu: 4.724 ± 0.095
2.968GlyPhe: 2.968 ± 0.07
5.523GlyGly: 5.523 ± 0.117
1.563GlyHis: 1.563 ± 0.057
4.851GlyIle: 4.851 ± 0.11
5.091GlyLys: 5.091 ± 0.102
7.093GlyLeu: 7.093 ± 0.102
1.822GlyMet: 1.822 ± 0.056
2.616GlyAsn: 2.616 ± 0.067
1.249GlyPro: 1.249 ± 0.053
2.535GlyGln: 2.535 ± 0.072
4.298GlyArg: 4.298 ± 0.085
4.527GlySer: 4.527 ± 0.101
4.093GlyThr: 4.093 ± 0.087
5.118GlyVal: 5.118 ± 0.114
0.948GlyTrp: 0.948 ± 0.037
3.542GlyTyr: 3.542 ± 0.075
0.0GlyXaa: 0.0 ± 0.0
His
1.522HisAla: 1.522 ± 0.05
0.245HisCys: 0.245 ± 0.02
0.919HisAsp: 0.919 ± 0.038
1.339HisGlu: 1.339 ± 0.05
1.074HisPhe: 1.074 ± 0.047
1.427HisGly: 1.427 ± 0.054
0.701HisHis: 0.701 ± 0.041
1.522HisIle: 1.522 ± 0.056
0.979HisLys: 0.979 ± 0.041
2.675HisLeu: 2.675 ± 0.074
0.357HisMet: 0.357 ± 0.025
0.824HisAsn: 0.824 ± 0.04
1.451HisPro: 1.451 ± 0.054
0.807HisGln: 0.807 ± 0.04
1.38HisArg: 1.38 ± 0.053
1.479HisSer: 1.479 ± 0.052
1.301HisThr: 1.301 ± 0.048
1.174HisVal: 1.174 ± 0.048
0.303HisTrp: 0.303 ± 0.026
1.068HisTyr: 1.068 ± 0.048
0.0HisXaa: 0.0 ± 0.0
Ile
5.351IleAla: 5.351 ± 0.123
0.646IleCys: 0.646 ± 0.037
3.785IleAsp: 3.785 ± 0.077
4.177IleGlu: 4.177 ± 0.094
2.185IlePhe: 2.185 ± 0.068
4.239IleGly: 4.239 ± 0.105
1.401IleHis: 1.401 ± 0.053
3.557IleIle: 3.557 ± 0.092
2.733IleLys: 2.733 ± 0.076
5.872IleLeu: 5.872 ± 0.111
1.001IleMet: 1.001 ± 0.042
2.296IleAsn: 2.296 ± 0.066
3.152IlePro: 3.152 ± 0.067
2.097IleGln: 2.097 ± 0.067
3.785IleArg: 3.785 ± 0.082
3.941IleSer: 3.941 ± 0.099
3.791IleThr: 3.791 ± 0.082
3.638IleVal: 3.638 ± 0.082
0.45IleTrp: 0.45 ± 0.029
2.294IleTyr: 2.294 ± 0.057
0.0IleXaa: 0.0 ± 0.0
Lys
4.65LysAla: 4.65 ± 0.101
0.253LysCys: 0.253 ± 0.023
3.083LysAsp: 3.083 ± 0.079
4.171LysGlu: 4.171 ± 0.085
1.365LysPhe: 1.365 ± 0.053
4.096LysGly: 4.096 ± 0.082
1.131LysHis: 1.131 ± 0.048
2.868LysIle: 2.868 ± 0.086
3.388LysLys: 3.388 ± 0.103
4.622LysLeu: 4.622 ± 0.094
1.384LysMet: 1.384 ± 0.047
1.67LysAsn: 1.67 ± 0.051
1.958LysPro: 1.958 ± 0.057
2.047LysGln: 2.047 ± 0.06
2.961LysArg: 2.961 ± 0.082
2.897LysSer: 2.897 ± 0.076
2.99LysThr: 2.99 ± 0.07
3.488LysVal: 3.488 ± 0.086
0.501LysTrp: 0.501 ± 0.034
1.939LysTyr: 1.939 ± 0.061
0.0LysXaa: 0.0 ± 0.0
Leu
8.305LeuAla: 8.305 ± 0.126
1.162LeuCys: 1.162 ± 0.048
5.146LeuAsp: 5.146 ± 0.097
6.183LeuGlu: 6.183 ± 0.115
4.408LeuPhe: 4.408 ± 0.11
7.883LeuGly: 7.883 ± 0.14
2.489LeuHis: 2.489 ± 0.069
5.947LeuIle: 5.947 ± 0.105
4.777LeuLys: 4.777 ± 0.093
11.54LeuLeu: 11.54 ± 0.216
2.363LeuMet: 2.363 ± 0.06
3.314LeuAsn: 3.314 ± 0.09
5.713LeuPro: 5.713 ± 0.089
3.934LeuGln: 3.934 ± 0.092
7.838LeuArg: 7.838 ± 0.135
9.206LeuSer: 9.206 ± 0.136
6.228LeuThr: 6.228 ± 0.106
6.163LeuVal: 6.163 ± 0.113
1.156LeuTrp: 1.156 ± 0.051
3.931LeuTyr: 3.931 ± 0.086
0.0LeuXaa: 0.0 ± 0.0
Met
1.947MetAla: 1.947 ± 0.06
0.141MetCys: 0.141 ± 0.015
1.148MetAsp: 1.148 ± 0.047
1.355MetGlu: 1.355 ± 0.045
0.522MetPhe: 0.522 ± 0.031
1.713MetGly: 1.713 ± 0.061
0.483MetHis: 0.483 ± 0.027
1.274MetIle: 1.274 ± 0.053
1.534MetLys: 1.534 ± 0.052
2.201MetLeu: 2.201 ± 0.059
0.586MetMet: 0.586 ± 0.031
1.053MetAsn: 1.053 ± 0.042
1.046MetPro: 1.046 ± 0.042
0.951MetGln: 0.951 ± 0.041
1.425MetArg: 1.425 ± 0.046
1.405MetSer: 1.405 ± 0.054
1.37MetThr: 1.37 ± 0.046
1.184MetVal: 1.184 ± 0.044
0.186MetTrp: 0.186 ± 0.017
0.596MetTyr: 0.596 ± 0.035
0.0MetXaa: 0.0 ± 0.0
Asn
2.732AsnAla: 2.732 ± 0.073
0.257AsnCys: 0.257 ± 0.019
1.725AsnAsp: 1.725 ± 0.062
2.022AsnGlu: 2.022 ± 0.063
1.367AsnPhe: 1.367 ± 0.056
2.325AsnGly: 2.325 ± 0.071
0.686AsnHis: 0.686 ± 0.033
2.308AsnIle: 2.308 ± 0.064
2.011AsnLys: 2.011 ± 0.055
3.311AsnLeu: 3.311 ± 0.098
0.798AsnMet: 0.798 ± 0.031
1.367AsnAsn: 1.367 ± 0.054
2.084AsnPro: 2.084 ± 0.063
1.168AsnGln: 1.168 ± 0.039
1.772AsnArg: 1.772 ± 0.059
1.856AsnSer: 1.856 ± 0.063
1.923AsnThr: 1.923 ± 0.067
2.189AsnVal: 2.189 ± 0.076
0.384AsnTrp: 0.384 ± 0.028
1.658AsnTyr: 1.658 ± 0.064
0.0AsnXaa: 0.0 ± 0.0
Pro
2.98ProAla: 2.98 ± 0.078
0.312ProCys: 0.312 ± 0.024
2.385ProAsp: 2.385 ± 0.071
3.81ProGlu: 3.81 ± 0.089
1.892ProPhe: 1.892 ± 0.052
2.44ProGly: 2.44 ± 0.073
1.01ProHis: 1.01 ± 0.044
2.849ProIle: 2.849 ± 0.071
2.378ProLys: 2.378 ± 0.074
4.415ProLeu: 4.415 ± 0.092
1.062ProMet: 1.062 ± 0.043
1.785ProAsn: 1.785 ± 0.056
1.268ProPro: 1.268 ± 0.063
1.832ProGln: 1.832 ± 0.057
2.199ProArg: 2.199 ± 0.071
3.526ProSer: 3.526 ± 0.09
2.942ProThr: 2.942 ± 0.075
2.408ProVal: 2.408 ± 0.062
0.505ProTrp: 0.505 ± 0.032
1.906ProTyr: 1.906 ± 0.056
0.0ProXaa: 0.0 ± 0.0
Gln
3.13GlnAla: 3.13 ± 0.089
0.253GlnCys: 0.253 ± 0.021
1.78GlnAsp: 1.78 ± 0.058
2.64GlnGlu: 2.64 ± 0.071
1.068GlnPhe: 1.068 ± 0.04
2.695GlnGly: 2.695 ± 0.079
0.836GlnHis: 0.836 ± 0.037
2.223GlnIle: 2.223 ± 0.067
1.606GlnLys: 1.606 ± 0.056
4.09GlnLeu: 4.09 ± 0.087
0.889GlnMet: 0.889 ± 0.035
1.086GlnAsn: 1.086 ± 0.044
1.479GlnPro: 1.479 ± 0.052
1.529GlnGln: 1.529 ± 0.058
2.723GlnArg: 2.723 ± 0.077
2.228GlnSer: 2.228 ± 0.062
2.051GlnThr: 2.051 ± 0.063
2.426GlnVal: 2.426 ± 0.06
0.379GlnTrp: 0.379 ± 0.024
1.174GlnTyr: 1.174 ± 0.046
0.0GlnXaa: 0.0 ± 0.0
Arg
4.593ArgAla: 4.593 ± 0.088
0.429ArgCys: 0.429 ± 0.028
2.869ArgAsp: 2.869 ± 0.077
4.183ArgGlu: 4.183 ± 0.096
2.526ArgPhe: 2.526 ± 0.069
4.107ArgGly: 4.107 ± 0.087
1.494ArgHis: 1.494 ± 0.057
4.046ArgIle: 4.046 ± 0.081
2.992ArgLys: 2.992 ± 0.077
6.945ArgLeu: 6.945 ± 0.114
1.623ArgMet: 1.623 ± 0.05
1.965ArgAsn: 1.965 ± 0.054
2.714ArgPro: 2.714 ± 0.081
2.177ArgGln: 2.177 ± 0.065
3.883ArgArg: 3.883 ± 0.101
3.829ArgSer: 3.829 ± 0.071
3.388ArgThr: 3.388 ± 0.079
3.71ArgVal: 3.71 ± 0.092
0.812ArgTrp: 0.812 ± 0.035
2.916ArgTyr: 2.916 ± 0.075
0.0ArgXaa: 0.0 ± 0.0
Ser
4.867SerAla: 4.867 ± 0.099
0.781SerCys: 0.781 ± 0.04
3.255SerAsp: 3.255 ± 0.078
4.119SerGlu: 4.119 ± 0.08
3.23SerPhe: 3.23 ± 0.081
5.306SerGly: 5.306 ± 0.107
1.396SerHis: 1.396 ± 0.048
4.164SerIle: 4.164 ± 0.085
3.038SerLys: 3.038 ± 0.078
8.093SerLeu: 8.093 ± 0.14
1.396SerMet: 1.396 ± 0.044
2.07SerAsn: 2.07 ± 0.065
3.323SerPro: 3.323 ± 0.077
2.011SerGln: 2.011 ± 0.061
3.44SerArg: 3.44 ± 0.077
5.184SerSer: 5.184 ± 0.115
3.812SerThr: 3.812 ± 0.088
4.245SerVal: 4.245 ± 0.097
0.87SerTrp: 0.87 ± 0.042
3.221SerTyr: 3.221 ± 0.104
0.0SerXaa: 0.0 ± 0.0
Thr
4.451ThrAla: 4.451 ± 0.095
0.419ThrCys: 0.419 ± 0.027
2.988ThrAsp: 2.988 ± 0.074
3.604ThrGlu: 3.604 ± 0.078
2.769ThrPhe: 2.769 ± 0.065
3.945ThrGly: 3.945 ± 0.087
1.244ThrHis: 1.244 ± 0.045
3.617ThrIle: 3.617 ± 0.094
3.009ThrLys: 3.009 ± 0.067
7.117ThrLeu: 7.117 ± 0.125
1.036ThrMet: 1.036 ± 0.042
1.956ThrAsn: 1.956 ± 0.062
3.678ThrPro: 3.678 ± 0.089
2.042ThrGln: 2.042 ± 0.056
2.711ThrArg: 2.711 ± 0.078
4.071ThrSer: 4.071 ± 0.087
3.671ThrThr: 3.671 ± 0.095
3.0ThrVal: 3.0 ± 0.077
0.555ThrTrp: 0.555 ± 0.038
2.277ThrTyr: 2.277 ± 0.063
0.0ThrXaa: 0.0 ± 0.0
Val
5.287ValAla: 5.287 ± 0.109
0.638ValCys: 0.638 ± 0.033
3.354ValAsp: 3.354 ± 0.08
3.972ValGlu: 3.972 ± 0.084
2.456ValPhe: 2.456 ± 0.064
4.71ValGly: 4.71 ± 0.105
1.308ValHis: 1.308 ± 0.052
3.741ValIle: 3.741 ± 0.093
3.007ValLys: 3.007 ± 0.076
6.323ValLeu: 6.323 ± 0.122
1.258ValMet: 1.258 ± 0.049
2.044ValAsn: 2.044 ± 0.05
2.542ValPro: 2.542 ± 0.068
2.061ValGln: 2.061 ± 0.058
3.934ValArg: 3.934 ± 0.075
4.601ValSer: 4.601 ± 0.081
3.64ValThr: 3.64 ± 0.078
4.221ValVal: 4.221 ± 0.108
0.653ValTrp: 0.653 ± 0.038
2.492ValTyr: 2.492 ± 0.065
0.0ValXaa: 0.0 ± 0.0
Trp
0.789TrpAla: 0.789 ± 0.036
0.134TrpCys: 0.134 ± 0.014
0.669TrpAsp: 0.669 ± 0.038
0.731TrpGlu: 0.731 ± 0.039
0.393TrpPhe: 0.393 ± 0.024
0.97TrpGly: 0.97 ± 0.048
0.303TrpHis: 0.303 ± 0.024
0.601TrpIle: 0.601 ± 0.032
0.536TrpLys: 0.536 ± 0.028
1.155TrpLeu: 1.155 ± 0.051
0.336TrpMet: 0.336 ± 0.022
0.4TrpAsn: 0.4 ± 0.027
0.238TrpPro: 0.238 ± 0.019
0.512TrpGln: 0.512 ± 0.031
0.808TrpArg: 0.808 ± 0.042
0.705TrpSer: 0.705 ± 0.036
0.56TrpThr: 0.56 ± 0.03
0.753TrpVal: 0.753 ± 0.037
0.146TrpTrp: 0.146 ± 0.017
0.408TrpTyr: 0.408 ± 0.027
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.154TyrAla: 3.154 ± 0.07
0.386TyrCys: 0.386 ± 0.029
2.309TyrAsp: 2.309 ± 0.07
2.363TyrGlu: 2.363 ± 0.066
1.713TyrPhe: 1.713 ± 0.056
2.923TyrGly: 2.923 ± 0.078
0.956TyrHis: 0.956 ± 0.037
2.271TyrIle: 2.271 ± 0.058
1.959TyrLys: 1.959 ± 0.063
4.408TyrLeu: 4.408 ± 0.114
0.784TyrMet: 0.784 ± 0.034
1.725TyrAsn: 1.725 ± 0.056
2.08TyrPro: 2.08 ± 0.061
1.47TyrGln: 1.47 ± 0.047
2.68TyrArg: 2.68 ± 0.079
2.699TyrSer: 2.699 ± 0.076
2.769TyrThr: 2.769 ± 0.076
2.168TyrVal: 2.168 ± 0.06
0.533TyrTrp: 0.533 ± 0.034
1.958TyrTyr: 1.958 ± 0.067
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1597 proteins (580261 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
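A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the database's implementation. It assumes that whole proteins are resampled with replacement, that 100 replicates are drawn, and that the reported error is the sample standard deviation of the replicate estimates. Function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille (overlapping pairs per protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes).
toy_proteome = ["MALLA", "MLLK", "MKKLLA", "MAAG"]
error = bootstrap_error(toy_proteome, "LL")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects protein-to-protein variation in composition.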

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski