Amino acid dipepetide frequency for Anaerotruncus sp. CAG:528

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.841AlaAla: 9.841 ± 0.183
1.458AlaCys: 1.458 ± 0.051
5.566AlaAsp: 5.566 ± 0.096
6.375AlaGlu: 6.375 ± 0.135
3.817AlaPhe: 3.817 ± 0.092
6.137AlaGly: 6.137 ± 0.109
1.184AlaHis: 1.184 ± 0.041
5.401AlaIle: 5.401 ± 0.102
6.314AlaLys: 6.314 ± 0.112
7.621AlaLeu: 7.621 ± 0.133
2.126AlaMet: 2.126 ± 0.062
3.546AlaAsn: 3.546 ± 0.083
2.4AlaPro: 2.4 ± 0.075
2.74AlaGln: 2.74 ± 0.073
2.79AlaArg: 2.79 ± 0.08
4.697AlaSer: 4.697 ± 0.106
3.568AlaThr: 3.568 ± 0.102
8.001AlaVal: 8.001 ± 0.148
0.52AlaTrp: 0.52 ± 0.034
2.96AlaTyr: 2.96 ± 0.083
0.002AlaXaa: 0.002 ± 0.002
Cys
1.711CysAla: 1.711 ± 0.061
0.355CysCys: 0.355 ± 0.028
1.24CysAsp: 1.24 ± 0.051
1.277CysGlu: 1.277 ± 0.046
0.746CysPhe: 0.746 ± 0.039
2.008CysGly: 2.008 ± 0.07
0.269CysHis: 0.269 ± 0.024
1.157CysIle: 1.157 ± 0.046
1.131CysLys: 1.131 ± 0.05
1.189CysLeu: 1.189 ± 0.048
0.376CysMet: 0.376 ± 0.025
0.762CysAsn: 0.762 ± 0.045
0.756CysPro: 0.756 ± 0.046
0.329CysGln: 0.329 ± 0.025
0.756CysArg: 0.756 ± 0.036
1.153CysSer: 1.153 ± 0.049
0.884CysThr: 0.884 ± 0.044
1.342CysVal: 1.342 ± 0.047
0.14CysTrp: 0.14 ± 0.018
0.591CysTyr: 0.591 ± 0.036
0.0CysXaa: 0.0 ± 0.0
Asp
4.339AspAla: 4.339 ± 0.102
1.186AspCys: 1.186 ± 0.049
3.364AspAsp: 3.364 ± 0.084
4.833AspGlu: 4.833 ± 0.115
3.359AspPhe: 3.359 ± 0.084
4.25AspGly: 4.25 ± 0.105
0.626AspHis: 0.626 ± 0.035
4.924AspIle: 4.924 ± 0.086
4.455AspLys: 4.455 ± 0.11
4.428AspLeu: 4.428 ± 0.1
1.824AspMet: 1.824 ± 0.059
2.637AspAsn: 2.637 ± 0.068
1.578AspPro: 1.578 ± 0.055
0.795AspGln: 0.795 ± 0.037
1.662AspArg: 1.662 ± 0.053
3.306AspSer: 3.306 ± 0.091
2.533AspThr: 2.533 ± 0.067
4.192AspVal: 4.192 ± 0.088
0.507AspTrp: 0.507 ± 0.035
3.182AspTyr: 3.182 ± 0.081
0.0AspXaa: 0.0 ± 0.0
Glu
5.159GluAla: 5.159 ± 0.109
0.997GluCys: 0.997 ± 0.051
3.104GluAsp: 3.104 ± 0.09
4.337GluGlu: 4.337 ± 0.13
2.955GluPhe: 2.955 ± 0.084
3.273GluGly: 3.273 ± 0.076
1.04GluHis: 1.04 ± 0.052
5.914GluIle: 5.914 ± 0.12
6.594GluLys: 6.594 ± 0.123
5.684GluLeu: 5.684 ± 0.114
2.142GluMet: 2.142 ± 0.068
4.892GluAsn: 4.892 ± 0.114
1.757GluPro: 1.757 ± 0.065
2.013GluGln: 2.013 ± 0.071
2.715GluArg: 2.715 ± 0.078
3.899GluSer: 3.899 ± 0.098
3.381GluThr: 3.381 ± 0.093
3.381GluVal: 3.381 ± 0.086
0.533GluTrp: 0.533 ± 0.033
3.22GluTyr: 3.22 ± 0.078
0.0GluXaa: 0.0 ± 0.0
Phe
4.11PheAla: 4.11 ± 0.085
0.858PheCys: 0.858 ± 0.041
3.028PheAsp: 3.028 ± 0.079
3.053PheGlu: 3.053 ± 0.065
1.886PhePhe: 1.886 ± 0.068
3.719PheGly: 3.719 ± 0.074
0.573PheHis: 0.573 ± 0.028
3.088PheIle: 3.088 ± 0.077
3.044PheLys: 3.044 ± 0.073
3.337PheLeu: 3.337 ± 0.077
1.044PheMet: 1.044 ± 0.039
2.106PheAsn: 2.106 ± 0.064
1.318PhePro: 1.318 ± 0.04
0.833PheGln: 0.833 ± 0.044
1.364PheArg: 1.364 ± 0.053
3.155PheSer: 3.155 ± 0.079
2.549PheThr: 2.549 ± 0.068
3.175PheVal: 3.175 ± 0.078
0.38PheTrp: 0.38 ± 0.03
1.689PheTyr: 1.689 ± 0.058
0.0PheXaa: 0.0 ± 0.0
Gly
5.985GlyAla: 5.985 ± 0.097
1.382GlyCys: 1.382 ± 0.051
4.053GlyAsp: 4.053 ± 0.084
4.833GlyGlu: 4.833 ± 0.098
3.31GlyPhe: 3.31 ± 0.075
5.546GlyGly: 5.546 ± 0.122
1.02GlyHis: 1.02 ± 0.044
5.721GlyIle: 5.721 ± 0.111
5.93GlyLys: 5.93 ± 0.093
5.277GlyLeu: 5.277 ± 0.108
1.946GlyMet: 1.946 ± 0.065
3.453GlyAsn: 3.453 ± 0.097
1.266GlyPro: 1.266 ± 0.056
1.444GlyGln: 1.444 ± 0.054
2.833GlyArg: 2.833 ± 0.09
4.464GlySer: 4.464 ± 0.099
4.379GlyThr: 4.379 ± 0.116
5.419GlyVal: 5.419 ± 0.114
0.571GlyTrp: 0.571 ± 0.035
2.924GlyTyr: 2.924 ± 0.083
0.002GlyXaa: 0.002 ± 0.002
His
1.037HisAla: 1.037 ± 0.047
0.316HisCys: 0.316 ± 0.025
0.824HisAsp: 0.824 ± 0.04
0.836HisGlu: 0.836 ± 0.045
0.742HisPhe: 0.742 ± 0.037
1.275HisGly: 1.275 ± 0.051
0.324HisHis: 0.324 ± 0.038
1.169HisIle: 1.169 ± 0.044
0.984HisLys: 0.984 ± 0.044
1.062HisLeu: 1.062 ± 0.045
0.366HisMet: 0.366 ± 0.027
0.771HisAsn: 0.771 ± 0.037
0.793HisPro: 0.793 ± 0.04
0.346HisGln: 0.346 ± 0.023
0.656HisArg: 0.656 ± 0.034
1.087HisSer: 1.087 ± 0.041
0.798HisThr: 0.798 ± 0.034
0.658HisVal: 0.658 ± 0.038
0.144HisTrp: 0.144 ± 0.018
0.622HisTyr: 0.622 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
6.463IleAla: 6.463 ± 0.125
1.66IleCys: 1.66 ± 0.054
4.448IleAsp: 4.448 ± 0.094
5.001IleGlu: 5.001 ± 0.111
2.879IlePhe: 2.879 ± 0.081
5.201IleGly: 5.201 ± 0.123
0.977IleHis: 0.977 ± 0.043
5.928IleIle: 5.928 ± 0.127
5.979IleLys: 5.979 ± 0.118
5.33IleLeu: 5.33 ± 0.116
1.911IleMet: 1.911 ± 0.063
3.797IleAsn: 3.797 ± 0.079
2.595IlePro: 2.595 ± 0.077
1.367IleGln: 1.367 ± 0.052
2.548IleArg: 2.548 ± 0.087
5.335IleSer: 5.335 ± 0.104
4.701IleThr: 4.701 ± 0.09
4.901IleVal: 4.901 ± 0.097
0.484IleTrp: 0.484 ± 0.027
2.786IleTyr: 2.786 ± 0.059
0.002IleXaa: 0.002 ± 0.002
Lys
6.574LysAla: 6.574 ± 0.134
1.109LysCys: 1.109 ± 0.043
3.611LysAsp: 3.611 ± 0.086
5.404LysGlu: 5.404 ± 0.118
2.713LysPhe: 2.713 ± 0.07
4.257LysGly: 4.257 ± 0.097
1.089LysHis: 1.089 ± 0.047
6.295LysIle: 6.295 ± 0.112
6.901LysLys: 6.901 ± 0.134
5.841LysLeu: 5.841 ± 0.107
2.315LysMet: 2.315 ± 0.06
4.759LysAsn: 4.759 ± 0.096
2.419LysPro: 2.419 ± 0.074
2.077LysGln: 2.077 ± 0.062
2.99LysArg: 2.99 ± 0.083
5.228LysSer: 5.228 ± 0.099
4.352LysThr: 4.352 ± 0.093
3.946LysVal: 3.946 ± 0.092
0.589LysTrp: 0.589 ± 0.034
3.451LysTyr: 3.451 ± 0.082
0.0LysXaa: 0.0 ± 0.0
Leu
7.168LeuAla: 7.168 ± 0.134
1.706LeuCys: 1.706 ± 0.062
4.893LeuAsp: 4.893 ± 0.108
4.866LeuGlu: 4.866 ± 0.099
3.779LeuPhe: 3.779 ± 0.092
5.935LeuGly: 5.935 ± 0.111
1.202LeuHis: 1.202 ± 0.045
5.643LeuIle: 5.643 ± 0.107
6.025LeuLys: 6.025 ± 0.126
7.085LeuLeu: 7.085 ± 0.148
2.157LeuMet: 2.157 ± 0.056
4.006LeuAsn: 4.006 ± 0.092
3.381LeuPro: 3.381 ± 0.077
2.262LeuGln: 2.262 ± 0.063
3.093LeuArg: 3.093 ± 0.093
6.065LeuSer: 6.065 ± 0.109
4.366LeuThr: 4.366 ± 0.104
4.966LeuVal: 4.966 ± 0.101
0.615LeuTrp: 0.615 ± 0.036
2.888LeuTyr: 2.888 ± 0.092
0.0LeuXaa: 0.0 ± 0.0
Met
2.18MetAla: 2.18 ± 0.065
0.387MetCys: 0.387 ± 0.027
1.297MetAsp: 1.297 ± 0.057
1.353MetGlu: 1.353 ± 0.053
1.037MetPhe: 1.037 ± 0.044
1.733MetGly: 1.733 ± 0.064
0.444MetHis: 0.444 ± 0.026
1.917MetIle: 1.917 ± 0.061
2.22MetLys: 2.22 ± 0.063
2.695MetLeu: 2.695 ± 0.063
0.736MetMet: 0.736 ± 0.042
1.491MetAsn: 1.491 ± 0.05
1.342MetPro: 1.342 ± 0.046
0.926MetGln: 0.926 ± 0.041
1.171MetArg: 1.171 ± 0.046
1.549MetSer: 1.549 ± 0.06
1.246MetThr: 1.246 ± 0.059
1.397MetVal: 1.397 ± 0.05
0.176MetTrp: 0.176 ± 0.018
0.775MetTyr: 0.775 ± 0.043
0.0MetXaa: 0.0 ± 0.0
Asn
4.379AsnAla: 4.379 ± 0.095
0.927AsnCys: 0.927 ± 0.041
2.87AsnAsp: 2.87 ± 0.068
3.55AsnGlu: 3.55 ± 0.088
1.873AsnPhe: 1.873 ± 0.068
4.761AsnGly: 4.761 ± 0.122
0.709AsnHis: 0.709 ± 0.039
3.993AsnIle: 3.993 ± 0.085
3.524AsnLys: 3.524 ± 0.093
3.806AsnLeu: 3.806 ± 0.101
1.342AsnMet: 1.342 ± 0.048
2.4AsnAsn: 2.4 ± 0.085
2.189AsnPro: 2.189 ± 0.058
1.217AsnGln: 1.217 ± 0.052
1.802AsnArg: 1.802 ± 0.06
3.339AsnSer: 3.339 ± 0.089
2.559AsnThr: 2.559 ± 0.091
3.16AsnVal: 3.16 ± 0.078
0.418AsnTrp: 0.418 ± 0.029
2.006AsnTyr: 2.006 ± 0.072
0.0AsnXaa: 0.0 ± 0.0
Pro
2.928ProAla: 2.928 ± 0.08
0.595ProCys: 0.595 ± 0.038
2.044ProAsp: 2.044 ± 0.069
2.691ProGlu: 2.691 ± 0.083
1.657ProPhe: 1.657 ± 0.054
2.182ProGly: 2.182 ± 0.063
0.56ProHis: 0.56 ± 0.032
2.139ProIle: 2.139 ± 0.066
2.124ProLys: 2.124 ± 0.072
2.764ProLeu: 2.764 ± 0.073
0.727ProMet: 0.727 ± 0.033
1.546ProAsn: 1.546 ± 0.055
0.904ProPro: 0.904 ± 0.046
1.471ProGln: 1.471 ± 0.057
1.022ProArg: 1.022 ± 0.043
1.851ProSer: 1.851 ± 0.059
1.629ProThr: 1.629 ± 0.062
2.702ProVal: 2.702 ± 0.058
0.313ProTrp: 0.313 ± 0.024
1.326ProTyr: 1.326 ± 0.049
0.0ProXaa: 0.0 ± 0.0
Gln
2.035GlnAla: 2.035 ± 0.056
0.375GlnCys: 0.375 ± 0.026
1.106GlnAsp: 1.106 ± 0.043
1.326GlnGlu: 1.326 ± 0.058
1.035GlnPhe: 1.035 ± 0.037
1.622GlnGly: 1.622 ± 0.066
0.418GlnHis: 0.418 ± 0.028
2.326GlnIle: 2.326 ± 0.074
2.322GlnLys: 2.322 ± 0.065
2.149GlnLeu: 2.149 ± 0.066
0.838GlnMet: 0.838 ± 0.037
1.72GlnAsn: 1.72 ± 0.051
0.855GlnPro: 0.855 ± 0.044
0.813GlnGln: 0.813 ± 0.046
1.115GlnArg: 1.115 ± 0.043
1.822GlnSer: 1.822 ± 0.057
1.438GlnThr: 1.438 ± 0.048
1.337GlnVal: 1.337 ± 0.055
0.211GlnTrp: 0.211 ± 0.023
1.106GlnTyr: 1.106 ± 0.044
0.0GlnXaa: 0.0 ± 0.0
Arg
2.729ArgAla: 2.729 ± 0.071
0.615ArgCys: 0.615 ± 0.035
1.966ArgAsp: 1.966 ± 0.071
2.862ArgGlu: 2.862 ± 0.073
1.838ArgPhe: 1.838 ± 0.063
2.313ArgGly: 2.313 ± 0.068
0.678ArgHis: 0.678 ± 0.03
2.886ArgIle: 2.886 ± 0.072
2.75ArgLys: 2.75 ± 0.081
3.624ArgLeu: 3.624 ± 0.088
0.96ArgMet: 0.96 ± 0.042
1.722ArgAsn: 1.722 ± 0.056
1.18ArgPro: 1.18 ± 0.044
1.227ArgGln: 1.227 ± 0.044
1.955ArgArg: 1.955 ± 0.071
1.926ArgSer: 1.926 ± 0.054
1.706ArgThr: 1.706 ± 0.057
2.388ArgVal: 2.388 ± 0.076
0.282ArgTrp: 0.282 ± 0.022
1.358ArgTyr: 1.358 ± 0.051
0.0ArgXaa: 0.0 ± 0.0
Ser
6.135SerAla: 6.135 ± 0.115
0.975SerCys: 0.975 ± 0.047
4.037SerAsp: 4.037 ± 0.097
4.292SerGlu: 4.292 ± 0.089
2.948SerPhe: 2.948 ± 0.073
5.683SerGly: 5.683 ± 0.114
0.993SerHis: 0.993 ± 0.038
3.881SerIle: 3.881 ± 0.078
4.268SerLys: 4.268 ± 0.098
5.375SerLeu: 5.375 ± 0.112
1.387SerMet: 1.387 ± 0.052
2.851SerAsn: 2.851 ± 0.091
2.002SerPro: 2.002 ± 0.057
1.764SerGln: 1.764 ± 0.059
2.551SerArg: 2.551 ± 0.08
4.022SerSer: 4.022 ± 0.106
3.044SerThr: 3.044 ± 0.086
5.41SerVal: 5.41 ± 0.112
0.489SerTrp: 0.489 ± 0.032
2.444SerTyr: 2.444 ± 0.079
0.0SerXaa: 0.0 ± 0.0
Thr
5.346ThrAla: 5.346 ± 0.119
0.686ThrCys: 0.686 ± 0.037
3.462ThrAsp: 3.462 ± 0.091
3.408ThrGlu: 3.408 ± 0.085
2.062ThrPhe: 2.062 ± 0.065
4.322ThrGly: 4.322 ± 0.108
0.829ThrHis: 0.829 ± 0.042
3.082ThrIle: 3.082 ± 0.07
3.191ThrLys: 3.191 ± 0.078
4.355ThrLeu: 4.355 ± 0.097
1.084ThrMet: 1.084 ± 0.045
2.088ThrAsn: 2.088 ± 0.065
2.148ThrPro: 2.148 ± 0.068
1.395ThrGln: 1.395 ± 0.053
1.666ThrArg: 1.666 ± 0.056
2.835ThrSer: 2.835 ± 0.088
2.739ThrThr: 2.739 ± 0.109
5.206ThrVal: 5.206 ± 0.136
0.338ThrTrp: 0.338 ± 0.027
1.929ThrTyr: 1.929 ± 0.074
0.0ThrXaa: 0.0 ± 0.0
Val
5.477ValAla: 5.477 ± 0.118
1.668ValCys: 1.668 ± 0.066
3.801ValAsp: 3.801 ± 0.088
3.708ValGlu: 3.708 ± 0.093
3.346ValPhe: 3.346 ± 0.106
3.995ValGly: 3.995 ± 0.102
1.064ValHis: 1.064 ± 0.042
5.324ValIle: 5.324 ± 0.104
4.777ValLys: 4.777 ± 0.106
6.663ValLeu: 6.663 ± 0.117
1.689ValMet: 1.689 ± 0.059
3.488ValAsn: 3.488 ± 0.097
2.757ValPro: 2.757 ± 0.071
1.698ValGln: 1.698 ± 0.055
2.542ValArg: 2.542 ± 0.085
5.41ValSer: 5.41 ± 0.107
3.741ValThr: 3.741 ± 0.1
4.628ValVal: 4.628 ± 0.114
0.555ValTrp: 0.555 ± 0.035
2.628ValTyr: 2.628 ± 0.077
0.0ValXaa: 0.0 ± 0.0
Trp
0.487TrpAla: 0.487 ± 0.029
0.16TrpCys: 0.16 ± 0.02
0.453TrpAsp: 0.453 ± 0.031
0.469TrpGlu: 0.469 ± 0.032
0.367TrpPhe: 0.367 ± 0.03
0.615TrpGly: 0.615 ± 0.042
0.2TrpHis: 0.2 ± 0.021
0.506TrpIle: 0.506 ± 0.031
0.522TrpLys: 0.522 ± 0.03
0.726TrpLeu: 0.726 ± 0.036
0.207TrpMet: 0.207 ± 0.021
0.471TrpAsn: 0.471 ± 0.033
0.136TrpPro: 0.136 ± 0.017
0.34TrpGln: 0.34 ± 0.025
0.327TrpArg: 0.327 ± 0.022
0.516TrpSer: 0.516 ± 0.035
0.322TrpThr: 0.322 ± 0.022
0.435TrpVal: 0.435 ± 0.028
0.102TrpTrp: 0.102 ± 0.014
0.327TrpTyr: 0.327 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.98TyrAla: 2.98 ± 0.074
0.758TyrCys: 0.758 ± 0.037
2.813TyrAsp: 2.813 ± 0.088
2.513TyrGlu: 2.513 ± 0.065
1.962TyrPhe: 1.962 ± 0.061
2.931TyrGly: 2.931 ± 0.074
0.611TyrHis: 0.611 ± 0.033
2.999TyrIle: 2.999 ± 0.074
2.866TyrLys: 2.866 ± 0.079
3.088TyrLeu: 3.088 ± 0.085
0.891TyrMet: 0.891 ± 0.038
2.319TyrAsn: 2.319 ± 0.076
1.397TyrPro: 1.397 ± 0.042
0.911TyrGln: 0.911 ± 0.039
1.407TyrArg: 1.407 ± 0.053
2.89TyrSer: 2.89 ± 0.077
2.239TyrThr: 2.239 ± 0.077
2.317TyrVal: 2.317 ± 0.064
0.322TyrTrp: 0.322 ± 0.026
2.008TyrTyr: 2.008 ± 0.071
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.002XaaAla: 0.002 ± 0.002
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.002XaaPro: 0.002 ± 0.002
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.002XaaSer: 0.002 ± 0.002
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.011XaaXaa: 0.011 ± 0.006
Statistics based on 1705 proteins (549919 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski