Amino acid dipepetide frequency for Veillonella montpellierensis DNF00314

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.331AlaAla: 6.331 ± 0.169
0.978AlaCys: 0.978 ± 0.047
3.96AlaAsp: 3.96 ± 0.088
3.987AlaGlu: 3.987 ± 0.096
2.878AlaPhe: 2.878 ± 0.07
5.539AlaGly: 5.539 ± 0.146
1.906AlaHis: 1.906 ± 0.077
7.1AlaIle: 7.1 ± 0.145
5.226AlaLys: 5.226 ± 0.13
7.906AlaLeu: 7.906 ± 0.142
2.679AlaMet: 2.679 ± 0.084
3.22AlaAsn: 3.22 ± 0.111
2.261AlaPro: 2.261 ± 0.071
2.69AlaGln: 2.69 ± 0.074
3.146AlaArg: 3.146 ± 0.084
4.271AlaSer: 4.271 ± 0.094
4.582AlaThr: 4.582 ± 0.1
5.773AlaVal: 5.773 ± 0.116
0.638AlaTrp: 0.638 ± 0.038
2.804AlaTyr: 2.804 ± 0.071
0.0AlaXaa: 0.0 ± 0.0
Cys
0.761CysAla: 0.761 ± 0.041
0.16CysCys: 0.16 ± 0.017
0.671CysAsp: 0.671 ± 0.042
0.702CysGlu: 0.702 ± 0.042
0.381CysPhe: 0.381 ± 0.03
1.09CysGly: 1.09 ± 0.052
0.433CysHis: 0.433 ± 0.033
1.094CysIle: 1.094 ± 0.044
0.732CysLys: 0.732 ± 0.047
0.891CysLeu: 0.891 ± 0.042
0.402CysMet: 0.402 ± 0.032
0.443CysAsn: 0.443 ± 0.03
0.52CysPro: 0.52 ± 0.04
0.419CysGln: 0.419 ± 0.029
0.533CysArg: 0.533 ± 0.032
0.607CysSer: 0.607 ± 0.036
0.632CysThr: 0.632 ± 0.038
0.798CysVal: 0.798 ± 0.044
0.089CysTrp: 0.089 ± 0.015
0.323CysTyr: 0.323 ± 0.026
0.0CysXaa: 0.0 ± 0.0
Asp
4.043AspAla: 4.043 ± 0.101
0.605AspCys: 0.605 ± 0.037
3.117AspAsp: 3.117 ± 0.087
3.861AspGlu: 3.861 ± 0.084
2.205AspPhe: 2.205 ± 0.08
4.439AspGly: 4.439 ± 0.229
1.059AspHis: 1.059 ± 0.047
5.282AspIle: 5.282 ± 0.118
3.529AspLys: 3.529 ± 0.117
4.095AspLeu: 4.095 ± 0.109
2.062AspMet: 2.062 ± 0.061
2.362AspAsn: 2.362 ± 0.092
1.786AspPro: 1.786 ± 0.058
1.241AspGln: 1.241 ± 0.058
2.333AspArg: 2.333 ± 0.08
2.854AspSer: 2.854 ± 0.073
3.968AspThr: 3.968 ± 0.099
4.609AspVal: 4.609 ± 0.09
0.557AspTrp: 0.557 ± 0.034
2.435AspTyr: 2.435 ± 0.075
0.0AspXaa: 0.0 ± 0.0
Glu
5.436GluAla: 5.436 ± 0.12
0.702GluCys: 0.702 ± 0.041
3.467GluAsp: 3.467 ± 0.091
5.392GluGlu: 5.392 ± 0.13
2.242GluPhe: 2.242 ± 0.071
3.989GluGly: 3.989 ± 0.087
1.453GluHis: 1.453 ± 0.059
4.524GluIle: 4.524 ± 0.085
4.42GluLys: 4.42 ± 0.112
6.519GluLeu: 6.519 ± 0.122
1.828GluMet: 1.828 ± 0.071
2.816GluAsn: 2.816 ± 0.088
1.803GluPro: 1.803 ± 0.071
2.688GluGln: 2.688 ± 0.078
3.43GluArg: 3.43 ± 0.09
3.258GluSer: 3.258 ± 0.086
3.355GluThr: 3.355 ± 0.097
4.385GluVal: 4.385 ± 0.106
0.628GluTrp: 0.628 ± 0.036
2.323GluTyr: 2.323 ± 0.074
0.0GluXaa: 0.0 ± 0.0
Phe
2.487PheAla: 2.487 ± 0.082
0.493PheCys: 0.493 ± 0.038
2.367PheAsp: 2.367 ± 0.071
1.985PheGlu: 1.985 ± 0.065
1.558PhePhe: 1.558 ± 0.069
2.744PheGly: 2.744 ± 0.088
0.827PheHis: 0.827 ± 0.039
3.04PheIle: 3.04 ± 0.1
2.058PheLys: 2.058 ± 0.073
3.206PheLeu: 3.206 ± 0.11
1.067PheMet: 1.067 ± 0.046
1.643PheAsn: 1.643 ± 0.07
1.26PhePro: 1.26 ± 0.053
1.017PheGln: 1.017 ± 0.047
1.291PheArg: 1.291 ± 0.05
2.646PheSer: 2.646 ± 0.072
2.375PheThr: 2.375 ± 0.077
2.545PheVal: 2.545 ± 0.088
0.338PheTrp: 0.338 ± 0.027
1.312PheTyr: 1.312 ± 0.06
0.0PheXaa: 0.0 ± 0.0
Gly
5.748GlyAla: 5.748 ± 0.135
1.009GlyCys: 1.009 ± 0.05
3.763GlyAsp: 3.763 ± 0.104
3.981GlyGlu: 3.981 ± 0.103
2.721GlyPhe: 2.721 ± 0.076
4.93GlyGly: 4.93 ± 0.115
1.826GlyHis: 1.826 ± 0.069
6.229GlyIle: 6.229 ± 0.131
5.133GlyLys: 5.133 ± 0.203
6.322GlyLeu: 6.322 ± 0.129
2.221GlyMet: 2.221 ± 0.071
2.982GlyAsn: 2.982 ± 0.103
1.774GlyPro: 1.774 ± 0.061
2.29GlyGln: 2.29 ± 0.076
3.195GlyArg: 3.195 ± 0.087
4.049GlySer: 4.049 ± 0.115
4.797GlyThr: 4.797 ± 0.132
5.365GlyVal: 5.365 ± 0.118
0.564GlyTrp: 0.564 ± 0.035
2.874GlyTyr: 2.874 ± 0.094
0.0GlyXaa: 0.0 ± 0.0
His
1.61HisAla: 1.61 ± 0.057
0.373HisCys: 0.373 ± 0.03
1.339HisAsp: 1.339 ± 0.063
1.422HisGlu: 1.422 ± 0.06
0.738HisPhe: 0.738 ± 0.042
1.604HisGly: 1.604 ± 0.061
0.69HisHis: 0.69 ± 0.045
2.323HisIle: 2.323 ± 0.075
1.407HisLys: 1.407 ± 0.06
1.741HisLeu: 1.741 ± 0.06
0.812HisMet: 0.812 ± 0.05
0.97HisAsn: 0.97 ± 0.046
1.042HisPro: 1.042 ± 0.055
0.773HisGln: 0.773 ± 0.045
1.248HisArg: 1.248 ± 0.052
1.227HisSer: 1.227 ± 0.053
1.668HisThr: 1.668 ± 0.059
1.882HisVal: 1.882 ± 0.068
0.265HisTrp: 0.265 ± 0.023
0.885HisTyr: 0.885 ± 0.043
0.0HisXaa: 0.0 ± 0.0
Ile
6.598IleAla: 6.598 ± 0.134
0.978IleCys: 0.978 ± 0.049
5.116IleAsp: 5.116 ± 0.113
5.125IleGlu: 5.125 ± 0.114
2.648IlePhe: 2.648 ± 0.083
6.327IleGly: 6.327 ± 0.157
1.906IleHis: 1.906 ± 0.071
6.194IleIle: 6.194 ± 0.167
4.385IleLys: 4.385 ± 0.106
6.899IleLeu: 6.899 ± 0.166
2.002IleMet: 2.002 ± 0.057
3.353IleAsn: 3.353 ± 0.096
3.5IlePro: 3.5 ± 0.092
2.679IleGln: 2.679 ± 0.075
3.403IleArg: 3.403 ± 0.087
5.027IleSer: 5.027 ± 0.097
5.127IleThr: 5.127 ± 0.109
6.022IleVal: 6.022 ± 0.117
0.539IleTrp: 0.539 ± 0.035
2.597IleTyr: 2.597 ± 0.08
0.0IleXaa: 0.0 ± 0.0
Lys
5.092LysAla: 5.092 ± 0.119
0.41LysCys: 0.41 ± 0.029
4.07LysAsp: 4.07 ± 0.225
5.734LysGlu: 5.734 ± 0.121
1.699LysPhe: 1.699 ± 0.061
4.227LysGly: 4.227 ± 0.096
1.337LysHis: 1.337 ± 0.05
3.997LysIle: 3.997 ± 0.11
4.843LysLys: 4.843 ± 0.135
4.764LysLeu: 4.764 ± 0.105
1.813LysMet: 1.813 ± 0.061
2.999LysAsn: 2.999 ± 0.121
2.081LysPro: 2.081 ± 0.066
2.65LysGln: 2.65 ± 0.07
3.241LysArg: 3.241 ± 0.089
3.287LysSer: 3.287 ± 0.084
3.583LysThr: 3.583 ± 0.119
4.055LysVal: 4.055 ± 0.086
0.545LysTrp: 0.545 ± 0.035
2.014LysTyr: 2.014 ± 0.076
0.0LysXaa: 0.0 ± 0.0
Leu
7.864LeuAla: 7.864 ± 0.148
1.123LeuCys: 1.123 ± 0.052
5.085LeuAsp: 5.085 ± 0.117
5.761LeuGlu: 5.761 ± 0.132
3.307LeuPhe: 3.307 ± 0.099
6.702LeuGly: 6.702 ± 0.137
2.29LeuHis: 2.29 ± 0.075
5.829LeuIle: 5.829 ± 0.143
4.804LeuLys: 4.804 ± 0.106
8.855LeuLeu: 8.855 ± 0.194
2.501LeuMet: 2.501 ± 0.072
3.088LeuAsn: 3.088 ± 0.076
3.705LeuPro: 3.705 ± 0.089
3.85LeuGln: 3.85 ± 0.098
4.002LeuArg: 4.002 ± 0.09
5.906LeuSer: 5.906 ± 0.127
5.177LeuThr: 5.177 ± 0.098
6.407LeuVal: 6.407 ± 0.139
0.854LeuTrp: 0.854 ± 0.047
3.079LeuTyr: 3.079 ± 0.08
0.0LeuXaa: 0.0 ± 0.0
Met
2.897MetAla: 2.897 ± 0.09
0.282MetCys: 0.282 ± 0.024
1.621MetAsp: 1.621 ± 0.067
1.848MetGlu: 1.848 ± 0.057
0.823MetPhe: 0.823 ± 0.05
2.228MetGly: 2.228 ± 0.074
0.63MetHis: 0.63 ± 0.035
2.304MetIle: 2.304 ± 0.081
2.381MetLys: 2.381 ± 0.06
2.385MetLeu: 2.385 ± 0.083
0.85MetMet: 0.85 ± 0.044
1.621MetAsn: 1.621 ± 0.061
1.179MetPro: 1.179 ± 0.05
0.941MetGln: 0.941 ± 0.046
1.264MetArg: 1.264 ± 0.055
1.691MetSer: 1.691 ± 0.061
1.894MetThr: 1.894 ± 0.069
2.039MetVal: 2.039 ± 0.066
0.195MetTrp: 0.195 ± 0.022
0.972MetTyr: 0.972 ± 0.049
0.0MetXaa: 0.0 ± 0.0
Asn
3.026AsnAla: 3.026 ± 0.089
0.421AsnCys: 0.421 ± 0.032
2.261AsnAsp: 2.261 ± 0.084
2.777AsnGlu: 2.777 ± 0.104
1.488AsnPhe: 1.488 ± 0.059
3.218AsnGly: 3.218 ± 0.135
1.179AsnHis: 1.179 ± 0.049
3.34AsnIle: 3.34 ± 0.092
2.729AsnLys: 2.729 ± 0.12
3.461AsnLeu: 3.461 ± 0.1
1.21AsnMet: 1.21 ± 0.048
1.863AsnAsn: 1.863 ± 0.081
1.875AsnPro: 1.875 ± 0.067
1.554AsnGln: 1.554 ± 0.056
2.19AsnArg: 2.19 ± 0.065
2.325AsnSer: 2.325 ± 0.087
2.632AsnThr: 2.632 ± 0.111
2.951AsnVal: 2.951 ± 0.084
0.371AsnTrp: 0.371 ± 0.028
1.71AsnTyr: 1.71 ± 0.073
0.0AsnXaa: 0.0 ± 0.0
Pro
2.348ProAla: 2.348 ± 0.071
0.369ProCys: 0.369 ± 0.027
1.894ProAsp: 1.894 ± 0.063
2.431ProGlu: 2.431 ± 0.09
1.589ProPhe: 1.589 ± 0.055
2.066ProGly: 2.066 ± 0.077
0.993ProHis: 0.993 ± 0.047
2.974ProIle: 2.974 ± 0.081
1.954ProLys: 1.954 ± 0.061
3.421ProLeu: 3.421 ± 0.101
1.063ProMet: 1.063 ± 0.052
1.538ProAsn: 1.538 ± 0.059
0.912ProPro: 0.912 ± 0.048
1.088ProGln: 1.088 ± 0.045
1.26ProArg: 1.26 ± 0.058
2.099ProSer: 2.099 ± 0.065
2.342ProThr: 2.342 ± 0.069
2.866ProVal: 2.866 ± 0.088
0.292ProTrp: 0.292 ± 0.027
1.473ProTyr: 1.473 ± 0.049
0.0ProXaa: 0.0 ± 0.0
Gln
3.204GlnAla: 3.204 ± 0.09
0.361GlnCys: 0.361 ± 0.031
1.768GlnAsp: 1.768 ± 0.068
2.609GlnGlu: 2.609 ± 0.077
1.187GlnPhe: 1.187 ± 0.047
2.323GlnGly: 2.323 ± 0.066
0.777GlnHis: 0.777 ± 0.041
2.449GlnIle: 2.449 ± 0.067
2.035GlnLys: 2.035 ± 0.06
3.604GlnLeu: 3.604 ± 0.102
1.009GlnMet: 1.009 ± 0.043
1.266GlnAsn: 1.266 ± 0.064
1.051GlnPro: 1.051 ± 0.049
1.6GlnGln: 1.6 ± 0.074
1.708GlnArg: 1.708 ± 0.06
2.056GlnSer: 2.056 ± 0.063
1.621GlnThr: 1.621 ± 0.059
2.333GlnVal: 2.333 ± 0.073
0.352GlnTrp: 0.352 ± 0.028
1.384GlnTyr: 1.384 ± 0.063
0.0GlnXaa: 0.0 ± 0.0
Arg
3.125ArgAla: 3.125 ± 0.074
0.483ArgCys: 0.483 ± 0.036
2.298ArgAsp: 2.298 ± 0.072
3.108ArgGlu: 3.108 ± 0.094
1.741ArgPhe: 1.741 ± 0.072
2.617ArgGly: 2.617 ± 0.072
1.156ArgHis: 1.156 ± 0.051
3.701ArgIle: 3.701 ± 0.088
2.721ArgLys: 2.721 ± 0.083
4.571ArgLeu: 4.571 ± 0.098
1.482ArgMet: 1.482 ± 0.058
1.898ArgAsn: 1.898 ± 0.062
1.417ArgPro: 1.417 ± 0.06
1.888ArgGln: 1.888 ± 0.07
2.362ArgArg: 2.362 ± 0.087
2.188ArgSer: 2.188 ± 0.063
2.431ArgThr: 2.431 ± 0.074
3.036ArgVal: 3.036 ± 0.083
0.419ArgTrp: 0.419 ± 0.028
1.763ArgTyr: 1.763 ± 0.058
0.0ArgXaa: 0.0 ± 0.0
Ser
3.774SerAla: 3.774 ± 0.09
0.636SerCys: 0.636 ± 0.041
3.04SerAsp: 3.04 ± 0.084
3.09SerGlu: 3.09 ± 0.084
2.383SerPhe: 2.383 ± 0.077
4.2SerGly: 4.2 ± 0.093
1.492SerHis: 1.492 ± 0.061
5.208SerIle: 5.208 ± 0.101
3.432SerLys: 3.432 ± 0.104
5.605SerLeu: 5.605 ± 0.111
1.842SerMet: 1.842 ± 0.06
2.619SerAsn: 2.619 ± 0.085
1.815SerPro: 1.815 ± 0.064
1.805SerGln: 1.805 ± 0.07
2.275SerArg: 2.275 ± 0.08
3.314SerSer: 3.314 ± 0.09
3.508SerThr: 3.508 ± 0.094
4.314SerVal: 4.314 ± 0.108
0.574SerTrp: 0.574 ± 0.037
2.34SerTyr: 2.34 ± 0.07
0.0SerXaa: 0.0 ± 0.0
Thr
4.288ThrAla: 4.288 ± 0.101
0.709ThrCys: 0.709 ± 0.041
3.365ThrAsp: 3.365 ± 0.093
3.183ThrGlu: 3.183 ± 0.085
2.242ThrPhe: 2.242 ± 0.067
4.694ThrGly: 4.694 ± 0.104
1.324ThrHis: 1.324 ± 0.053
5.438ThrIle: 5.438 ± 0.112
3.527ThrLys: 3.527 ± 0.09
5.647ThrLeu: 5.647 ± 0.116
1.919ThrMet: 1.919 ± 0.061
2.725ThrAsn: 2.725 ± 0.111
2.549ThrPro: 2.549 ± 0.071
1.689ThrGln: 1.689 ± 0.071
2.315ThrArg: 2.315 ± 0.071
3.616ThrSer: 3.616 ± 0.087
3.769ThrThr: 3.769 ± 0.133
5.046ThrVal: 5.046 ± 0.123
0.63ThrTrp: 0.63 ± 0.039
2.462ThrTyr: 2.462 ± 0.086
0.0ThrXaa: 0.0 ± 0.0
Val
6.107ValAla: 6.107 ± 0.142
0.933ValCys: 0.933 ± 0.042
4.42ValAsp: 4.42 ± 0.103
4.679ValGlu: 4.679 ± 0.123
2.487ValPhe: 2.487 ± 0.08
5.452ValGly: 5.452 ± 0.112
1.55ValHis: 1.55 ± 0.055
5.612ValIle: 5.612 ± 0.116
4.329ValLys: 4.329 ± 0.099
6.412ValLeu: 6.412 ± 0.123
1.96ValMet: 1.96 ± 0.066
3.102ValAsn: 3.102 ± 0.092
2.835ValPro: 2.835 ± 0.069
2.112ValGln: 2.112 ± 0.067
2.934ValArg: 2.934 ± 0.091
4.484ValSer: 4.484 ± 0.105
4.965ValThr: 4.965 ± 0.108
5.879ValVal: 5.879 ± 0.13
0.597ValTrp: 0.597 ± 0.033
2.418ValTyr: 2.418 ± 0.06
0.0ValXaa: 0.0 ± 0.0
Trp
0.626TrpAla: 0.626 ± 0.035
0.126TrpCys: 0.126 ± 0.017
0.501TrpAsp: 0.501 ± 0.032
0.468TrpGlu: 0.468 ± 0.035
0.408TrpPhe: 0.408 ± 0.028
0.551TrpGly: 0.551 ± 0.039
0.29TrpHis: 0.29 ± 0.029
0.63TrpIle: 0.63 ± 0.036
0.605TrpLys: 0.605 ± 0.035
0.891TrpLeu: 0.891 ± 0.045
0.292TrpMet: 0.292 ± 0.023
0.481TrpAsn: 0.481 ± 0.034
0.261TrpPro: 0.261 ± 0.026
0.46TrpGln: 0.46 ± 0.034
0.419TrpArg: 0.419 ± 0.032
0.52TrpSer: 0.52 ± 0.036
0.477TrpThr: 0.477 ± 0.034
0.491TrpVal: 0.491 ± 0.034
0.116TrpTrp: 0.116 ± 0.018
0.307TrpTyr: 0.307 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.514TyrAla: 2.514 ± 0.078
0.53TyrCys: 0.53 ± 0.036
2.319TyrAsp: 2.319 ± 0.076
2.516TyrGlu: 2.516 ± 0.069
1.531TyrPhe: 1.531 ± 0.06
2.835TyrGly: 2.835 ± 0.086
0.899TyrHis: 0.899 ± 0.045
3.156TyrIle: 3.156 ± 0.09
2.192TyrLys: 2.192 ± 0.072
2.994TyrLeu: 2.994 ± 0.087
1.026TyrMet: 1.026 ± 0.053
1.591TyrAsn: 1.591 ± 0.065
1.345TyrPro: 1.345 ± 0.058
1.262TyrGln: 1.262 ± 0.056
1.859TyrArg: 1.859 ± 0.055
1.842TyrSer: 1.842 ± 0.065
2.234TyrThr: 2.234 ± 0.066
2.47TyrVal: 2.47 ± 0.076
0.359TyrTrp: 0.359 ± 0.03
1.473TyrTyr: 1.473 ± 0.056
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1513 proteins (482565 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski