Amino acid dipepetide frequency for Candidatus Tokpelaia sp. JSC188

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.736AlaAla: 7.736 ± 0.175
1.05AlaCys: 1.05 ± 0.063
4.409AlaAsp: 4.409 ± 0.121
5.074AlaGlu: 5.074 ± 0.135
3.921AlaPhe: 3.921 ± 0.107
5.814AlaGly: 5.814 ± 0.132
1.839AlaHis: 1.839 ± 0.084
7.348AlaIle: 7.348 ± 0.162
4.441AlaLys: 4.441 ± 0.122
9.152AlaLeu: 9.152 ± 0.175
2.46AlaMet: 2.46 ± 0.093
3.061AlaAsn: 3.061 ± 0.091
2.357AlaPro: 2.357 ± 0.076
2.975AlaGln: 2.975 ± 0.095
5.349AlaArg: 5.349 ± 0.122
4.873AlaSer: 4.873 ± 0.138
4.116AlaThr: 4.116 ± 0.109
5.66AlaVal: 5.66 ± 0.138
0.843AlaTrp: 0.843 ± 0.047
2.431AlaTyr: 2.431 ± 0.097
0.0AlaXaa: 0.0 ± 0.0
Cys
0.94CysAla: 0.94 ± 0.05
0.154CysCys: 0.154 ± 0.02
0.671CysAsp: 0.671 ± 0.045
0.562CysGlu: 0.562 ± 0.04
0.588CysPhe: 0.588 ± 0.04
0.976CysGly: 0.976 ± 0.062
0.313CysHis: 0.313 ± 0.031
0.961CysIle: 0.961 ± 0.054
0.603CysLys: 0.603 ± 0.045
1.065CysLeu: 1.065 ± 0.06
0.216CysMet: 0.216 ± 0.024
0.526CysAsn: 0.526 ± 0.042
0.429CysPro: 0.429 ± 0.04
0.346CysGln: 0.346 ± 0.031
0.651CysArg: 0.651 ± 0.044
0.843CysSer: 0.843 ± 0.052
0.509CysThr: 0.509 ± 0.04
0.583CysVal: 0.583 ± 0.045
0.148CysTrp: 0.148 ± 0.021
0.393CysTyr: 0.393 ± 0.03
0.0CysXaa: 0.0 ± 0.0
Asp
4.264AspAla: 4.264 ± 0.099
0.591AspCys: 0.591 ± 0.04
2.422AspAsp: 2.422 ± 0.105
3.087AspGlu: 3.087 ± 0.117
2.351AspPhe: 2.351 ± 0.083
3.448AspGly: 3.448 ± 0.123
1.209AspHis: 1.209 ± 0.055
5.083AspIle: 5.083 ± 0.123
2.845AspLys: 2.845 ± 0.096
4.909AspLeu: 4.909 ± 0.134
1.594AspMet: 1.594 ± 0.079
2.233AspAsn: 2.233 ± 0.081
2.114AspPro: 2.114 ± 0.084
1.449AspGln: 1.449 ± 0.06
2.922AspArg: 2.922 ± 0.104
2.806AspSer: 2.806 ± 0.086
2.549AspThr: 2.549 ± 0.091
3.427AspVal: 3.427 ± 0.108
0.606AspTrp: 0.606 ± 0.042
1.609AspTyr: 1.609 ± 0.079
0.0AspXaa: 0.0 ± 0.0
Glu
5.11GluAla: 5.11 ± 0.118
0.464GluCys: 0.464 ± 0.039
2.584GluAsp: 2.584 ± 0.103
3.821GluGlu: 3.821 ± 0.128
1.922GluPhe: 1.922 ± 0.081
3.566GluGly: 3.566 ± 0.118
1.301GluHis: 1.301 ± 0.064
5.633GluIle: 5.633 ± 0.135
5.172GluLys: 5.172 ± 0.132
4.879GluLeu: 4.879 ± 0.12
1.665GluMet: 1.665 ± 0.061
2.688GluAsn: 2.688 ± 0.088
1.7GluPro: 1.7 ± 0.074
2.182GluGln: 2.182 ± 0.081
4.125GluArg: 4.125 ± 0.117
2.874GluSer: 2.874 ± 0.103
3.149GluThr: 3.149 ± 0.097
3.693GluVal: 3.693 ± 0.11
0.63GluTrp: 0.63 ± 0.047
1.381GluTyr: 1.381 ± 0.071
0.0GluXaa: 0.0 ± 0.0
Phe
3.451PheAla: 3.451 ± 0.112
0.698PheCys: 0.698 ± 0.047
2.661PheAsp: 2.661 ± 0.081
2.295PheGlu: 2.295 ± 0.081
2.422PhePhe: 2.422 ± 0.112
3.253PheGly: 3.253 ± 0.101
0.926PheHis: 0.926 ± 0.044
3.365PheIle: 3.365 ± 0.122
1.656PheLys: 1.656 ± 0.064
4.329PheLeu: 4.329 ± 0.156
1.091PheMet: 1.091 ± 0.061
1.804PheAsn: 1.804 ± 0.07
1.49PhePro: 1.49 ± 0.066
1.227PheGln: 1.227 ± 0.057
2.185PheArg: 2.185 ± 0.074
4.125PheSer: 4.125 ± 0.117
2.168PheThr: 2.168 ± 0.085
2.806PheVal: 2.806 ± 0.09
0.526PheTrp: 0.526 ± 0.045
1.375PheTyr: 1.375 ± 0.059
0.0PheXaa: 0.0 ± 0.0
Gly
5.346GlyAla: 5.346 ± 0.136
0.858GlyCys: 0.858 ± 0.049
3.353GlyAsp: 3.353 ± 0.117
3.655GlyGlu: 3.655 ± 0.11
3.584GlyPhe: 3.584 ± 0.11
4.817GlyGly: 4.817 ± 0.141
1.597GlyHis: 1.597 ± 0.071
6.405GlyIle: 6.405 ± 0.144
4.486GlyLys: 4.486 ± 0.128
6.582GlyLeu: 6.582 ± 0.141
1.928GlyMet: 1.928 ± 0.077
2.72GlyAsn: 2.72 ± 0.106
2.091GlyPro: 2.091 ± 0.081
2.168GlyGln: 2.168 ± 0.082
4.166GlyArg: 4.166 ± 0.108
3.965GlySer: 3.965 ± 0.12
3.519GlyThr: 3.519 ± 0.111
4.376GlyVal: 4.376 ± 0.13
0.849GlyTrp: 0.849 ± 0.056
2.179GlyTyr: 2.179 ± 0.079
0.0GlyXaa: 0.0 ± 0.0
His
1.946HisAla: 1.946 ± 0.077
0.331HisCys: 0.331 ± 0.03
1.1HisAsp: 1.1 ± 0.066
1.141HisGlu: 1.141 ± 0.058
1.141HisPhe: 1.141 ± 0.054
1.73HisGly: 1.73 ± 0.076
0.63HisHis: 0.63 ± 0.049
1.913HisIle: 1.913 ± 0.072
0.982HisLys: 0.982 ± 0.047
2.162HisLeu: 2.162 ± 0.079
0.556HisMet: 0.556 ± 0.041
1.017HisAsn: 1.017 ± 0.054
1.144HisPro: 1.144 ± 0.06
0.689HisGln: 0.689 ± 0.044
1.28HisArg: 1.28 ± 0.065
1.348HisSer: 1.348 ± 0.07
1.115HisThr: 1.115 ± 0.056
1.357HisVal: 1.357 ± 0.064
0.242HisTrp: 0.242 ± 0.03
0.849HisTyr: 0.849 ± 0.049
0.0HisXaa: 0.0 ± 0.0
Ile
8.83IleAla: 8.83 ± 0.192
1.065IleCys: 1.065 ± 0.064
5.326IleAsp: 5.326 ± 0.145
5.332IleGlu: 5.332 ± 0.143
3.619IlePhe: 3.619 ± 0.119
6.485IleGly: 6.485 ± 0.164
1.715IleHis: 1.715 ± 0.078
6.612IleIle: 6.612 ± 0.177
4.143IleLys: 4.143 ± 0.125
7.742IleLeu: 7.742 ± 0.154
2.04IleMet: 2.04 ± 0.08
3.658IleAsn: 3.658 ± 0.114
3.469IlePro: 3.469 ± 0.112
2.28IleGln: 2.28 ± 0.081
4.823IleArg: 4.823 ± 0.116
6.118IleSer: 6.118 ± 0.139
4.462IleThr: 4.462 ± 0.106
5.607IleVal: 5.607 ± 0.141
0.757IleTrp: 0.757 ± 0.045
2.132IleTyr: 2.132 ± 0.077
0.0IleXaa: 0.0 ± 0.0
Lys
4.749LysAla: 4.749 ± 0.128
0.461LysCys: 0.461 ± 0.042
2.638LysAsp: 2.638 ± 0.087
3.377LysGlu: 3.377 ± 0.106
1.564LysPhe: 1.564 ± 0.075
3.634LysGly: 3.634 ± 0.112
1.112LysHis: 1.112 ± 0.059
5.157LysIle: 5.157 ± 0.131
4.45LysLys: 4.45 ± 0.137
5.098LysLeu: 5.098 ± 0.121
1.366LysMet: 1.366 ± 0.062
3.152LysAsn: 3.152 ± 0.108
2.283LysPro: 2.283 ± 0.088
2.135LysGln: 2.135 ± 0.09
3.803LysArg: 3.803 ± 0.116
3.208LysSer: 3.208 ± 0.104
3.409LysThr: 3.409 ± 0.1
3.167LysVal: 3.167 ± 0.092
0.517LysTrp: 0.517 ± 0.036
1.313LysTyr: 1.313 ± 0.067
0.0LysXaa: 0.0 ± 0.0
Leu
8.64LeuAla: 8.64 ± 0.175
1.204LeuCys: 1.204 ± 0.065
5.335LeuAsp: 5.335 ± 0.144
5.648LeuGlu: 5.648 ± 0.152
3.933LeuPhe: 3.933 ± 0.149
6.275LeuGly: 6.275 ± 0.15
2.176LeuHis: 2.176 ± 0.084
7.638LeuIle: 7.638 ± 0.149
5.63LeuLys: 5.63 ± 0.139
9.188LeuLeu: 9.188 ± 0.211
2.416LeuMet: 2.416 ± 0.094
3.992LeuAsn: 3.992 ± 0.117
4.433LeuPro: 4.433 ± 0.12
3.507LeuGln: 3.507 ± 0.1
5.654LeuArg: 5.654 ± 0.14
7.6LeuSer: 7.6 ± 0.141
4.929LeuThr: 4.929 ± 0.129
6.006LeuVal: 6.006 ± 0.141
0.866LeuTrp: 0.866 ± 0.055
2.605LeuTyr: 2.605 ± 0.085
0.0LeuXaa: 0.0 ± 0.0
Met
2.274MetAla: 2.274 ± 0.083
0.216MetCys: 0.216 ± 0.024
1.088MetAsp: 1.088 ± 0.057
1.431MetGlu: 1.431 ± 0.061
0.757MetPhe: 0.757 ± 0.045
1.547MetGly: 1.547 ± 0.069
0.645MetHis: 0.645 ± 0.047
2.327MetIle: 2.327 ± 0.092
1.549MetLys: 1.549 ± 0.068
2.865MetLeu: 2.865 ± 0.106
0.757MetMet: 0.757 ± 0.052
1.221MetAsn: 1.221 ± 0.066
1.354MetPro: 1.354 ± 0.056
1.177MetGln: 1.177 ± 0.057
1.869MetArg: 1.869 ± 0.075
1.626MetSer: 1.626 ± 0.066
1.612MetThr: 1.612 ± 0.072
1.561MetVal: 1.561 ± 0.06
0.169MetTrp: 0.169 ± 0.022
0.408MetTyr: 0.408 ± 0.036
0.0MetXaa: 0.0 ± 0.0
Asn
3.699AsnAla: 3.699 ± 0.112
0.455AsnCys: 0.455 ± 0.038
2.182AsnAsp: 2.182 ± 0.075
2.363AsnGlu: 2.363 ± 0.095
1.804AsnPhe: 1.804 ± 0.081
3.013AsnGly: 3.013 ± 0.101
0.964AsnHis: 0.964 ± 0.056
3.676AsnIle: 3.676 ± 0.114
2.126AsnLys: 2.126 ± 0.071
4.022AsnLeu: 4.022 ± 0.108
1.15AsnMet: 1.15 ± 0.057
1.813AsnAsn: 1.813 ± 0.089
2.2AsnPro: 2.2 ± 0.076
1.405AsnGln: 1.405 ± 0.059
2.629AsnArg: 2.629 ± 0.085
2.41AsnSer: 2.41 ± 0.086
2.165AsnThr: 2.165 ± 0.076
2.629AsnVal: 2.629 ± 0.09
0.562AsnTrp: 0.562 ± 0.038
1.201AsnTyr: 1.201 ± 0.056
0.0AsnXaa: 0.0 ± 0.0
Pro
2.573ProAla: 2.573 ± 0.1
0.376ProCys: 0.376 ± 0.033
2.241ProAsp: 2.241 ± 0.075
2.496ProGlu: 2.496 ± 0.085
1.958ProPhe: 1.958 ± 0.085
2.744ProGly: 2.744 ± 0.096
0.923ProHis: 0.923 ± 0.049
3.35ProIle: 3.35 ± 0.106
1.89ProLys: 1.89 ± 0.079
3.744ProLeu: 3.744 ± 0.107
0.937ProMet: 0.937 ± 0.049
1.49ProAsn: 1.49 ± 0.061
1.236ProPro: 1.236 ± 0.071
1.461ProGln: 1.461 ± 0.071
1.839ProArg: 1.839 ± 0.085
2.36ProSer: 2.36 ± 0.084
1.94ProThr: 1.94 ± 0.087
2.854ProVal: 2.854 ± 0.087
0.429ProTrp: 0.429 ± 0.036
1.242ProTyr: 1.242 ± 0.061
0.0ProXaa: 0.0 ± 0.0
Gln
3.105GlnAla: 3.105 ± 0.109
0.29GlnCys: 0.29 ± 0.029
1.49GlnAsp: 1.49 ± 0.066
2.064GlnGlu: 2.064 ± 0.075
1.274GlnPhe: 1.274 ± 0.058
2.043GlnGly: 2.043 ± 0.082
0.81GlnHis: 0.81 ± 0.045
3.081GlnIle: 3.081 ± 0.088
2.573GlnLys: 2.573 ± 0.09
3.226GlnLeu: 3.226 ± 0.101
0.958GlnMet: 0.958 ± 0.054
1.57GlnAsn: 1.57 ± 0.068
1.136GlnPro: 1.136 ± 0.059
1.298GlnGln: 1.298 ± 0.075
2.147GlnArg: 2.147 ± 0.07
1.833GlnSer: 1.833 ± 0.067
1.671GlnThr: 1.671 ± 0.07
2.026GlnVal: 2.026 ± 0.081
0.322GlnTrp: 0.322 ± 0.034
0.822GlnTyr: 0.822 ± 0.054
0.0GlnXaa: 0.0 ± 0.0
Arg
4.48ArgAla: 4.48 ± 0.121
0.612ArgCys: 0.612 ± 0.043
2.925ArgAsp: 2.925 ± 0.101
3.761ArgGlu: 3.761 ± 0.126
2.972ArgPhe: 2.972 ± 0.096
3.285ArgGly: 3.285 ± 0.102
1.325ArgHis: 1.325 ± 0.063
5.432ArgIle: 5.432 ± 0.122
3.637ArgLys: 3.637 ± 0.1
6.508ArgLeu: 6.508 ± 0.152
1.629ArgMet: 1.629 ± 0.069
2.617ArgAsn: 2.617 ± 0.091
1.978ArgPro: 1.978 ± 0.087
2.342ArgGln: 2.342 ± 0.084
3.951ArgArg: 3.951 ± 0.1
3.56ArgSer: 3.56 ± 0.099
2.859ArgThr: 2.859 ± 0.092
3.705ArgVal: 3.705 ± 0.115
0.671ArgTrp: 0.671 ± 0.046
1.943ArgTyr: 1.943 ± 0.084
0.0ArgXaa: 0.0 ± 0.0
Ser
4.826SerAla: 4.826 ± 0.121
0.79SerCys: 0.79 ± 0.06
3.336SerAsp: 3.336 ± 0.102
3.546SerGlu: 3.546 ± 0.115
3.123SerPhe: 3.123 ± 0.085
5.207SerGly: 5.207 ± 0.134
1.47SerHis: 1.47 ± 0.069
5.287SerIle: 5.287 ± 0.12
3.137SerLys: 3.137 ± 0.098
6.207SerLeu: 6.207 ± 0.137
1.748SerMet: 1.748 ± 0.068
2.83SerAsn: 2.83 ± 0.096
2.15SerPro: 2.15 ± 0.073
2.067SerGln: 2.067 ± 0.081
3.543SerArg: 3.543 ± 0.104
4.459SerSer: 4.459 ± 0.142
3.013SerThr: 3.013 ± 0.085
4.335SerVal: 4.335 ± 0.11
0.769SerTrp: 0.769 ± 0.056
1.881SerTyr: 1.881 ± 0.08
0.0SerXaa: 0.0 ± 0.0
Thr
4.501ThrAla: 4.501 ± 0.104
0.594ThrCys: 0.594 ± 0.049
2.511ThrAsp: 2.511 ± 0.092
2.836ThrGlu: 2.836 ± 0.087
2.28ThrPhe: 2.28 ± 0.091
4.137ThrGly: 4.137 ± 0.119
1.215ThrHis: 1.215 ± 0.056
4.394ThrIle: 4.394 ± 0.105
2.265ThrLys: 2.265 ± 0.08
5.453ThrLeu: 5.453 ± 0.13
1.183ThrMet: 1.183 ± 0.059
1.925ThrAsn: 1.925 ± 0.077
2.369ThrPro: 2.369 ± 0.084
1.612ThrGln: 1.612 ± 0.069
2.729ThrArg: 2.729 ± 0.093
3.049ThrSer: 3.049 ± 0.094
2.729ThrThr: 2.729 ± 0.092
3.528ThrVal: 3.528 ± 0.103
0.512ThrTrp: 0.512 ± 0.043
1.384ThrTyr: 1.384 ± 0.072
0.0ThrXaa: 0.0 ± 0.0
Val
5.382ValAla: 5.382 ± 0.151
0.748ValCys: 0.748 ± 0.046
3.217ValAsp: 3.217 ± 0.11
3.959ValGlu: 3.959 ± 0.128
2.927ValPhe: 2.927 ± 0.093
4.098ValGly: 4.098 ± 0.134
1.467ValHis: 1.467 ± 0.071
5.565ValIle: 5.565 ± 0.136
2.966ValLys: 2.966 ± 0.102
6.508ValLeu: 6.508 ± 0.146
1.742ValMet: 1.742 ± 0.076
2.54ValAsn: 2.54 ± 0.083
2.534ValPro: 2.534 ± 0.086
1.901ValGln: 1.901 ± 0.078
3.927ValArg: 3.927 ± 0.103
4.385ValSer: 4.385 ± 0.131
3.386ValThr: 3.386 ± 0.112
4.382ValVal: 4.382 ± 0.14
0.603ValTrp: 0.603 ± 0.042
1.493ValTyr: 1.493 ± 0.058
0.0ValXaa: 0.0 ± 0.0
Trp
0.665TrpAla: 0.665 ± 0.051
0.163TrpCys: 0.163 ± 0.022
0.381TrpAsp: 0.381 ± 0.03
0.408TrpGlu: 0.408 ± 0.033
0.473TrpPhe: 0.473 ± 0.044
0.532TrpGly: 0.532 ± 0.037
0.275TrpHis: 0.275 ± 0.03
0.861TrpIle: 0.861 ± 0.048
0.671TrpLys: 0.671 ± 0.046
1.337TrpLeu: 1.337 ± 0.07
0.331TrpMet: 0.331 ± 0.036
0.574TrpAsn: 0.574 ± 0.043
0.449TrpPro: 0.449 ± 0.037
0.538TrpGln: 0.538 ± 0.04
0.804TrpArg: 0.804 ± 0.05
0.559TrpSer: 0.559 ± 0.045
0.464TrpThr: 0.464 ± 0.037
0.58TrpVal: 0.58 ± 0.039
0.195TrpTrp: 0.195 ± 0.027
0.29TrpTyr: 0.29 ± 0.032
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.395TyrAla: 2.395 ± 0.082
0.358TyrCys: 0.358 ± 0.032
1.647TyrAsp: 1.647 ± 0.078
1.686TyrGlu: 1.686 ± 0.071
1.239TyrPhe: 1.239 ± 0.06
2.04TyrGly: 2.04 ± 0.082
0.763TyrHis: 0.763 ± 0.049
2.123TyrIle: 2.123 ± 0.084
1.372TyrLys: 1.372 ± 0.064
2.7TyrLeu: 2.7 ± 0.104
0.683TyrMet: 0.683 ± 0.042
1.035TyrAsn: 1.035 ± 0.062
1.165TyrPro: 1.165 ± 0.065
0.982TyrGln: 0.982 ± 0.054
1.798TyrArg: 1.798 ± 0.077
1.718TyrSer: 1.718 ± 0.071
1.44TyrThr: 1.44 ± 0.066
1.452TyrVal: 1.452 ± 0.065
0.334TyrTrp: 0.334 ± 0.034
0.828TyrTyr: 0.828 ± 0.06
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1066 proteins (338176 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski