Amino acid dipepetide frequency for Candidatus Profftella armatura

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.744AlaAla: 2.744 ± 0.244
0.736AlaCys: 0.736 ± 0.093
1.508AlaAsp: 1.508 ± 0.112
1.876AlaGlu: 1.876 ± 0.159
2.14AlaPhe: 2.14 ± 0.125
2.766AlaGly: 2.766 ± 0.177
1.111AlaHis: 1.111 ± 0.092
5.701AlaIle: 5.701 ± 0.228
3.604AlaLys: 3.604 ± 0.19
4.428AlaLeu: 4.428 ± 0.235
1.14AlaMet: 1.14 ± 0.119
2.633AlaAsn: 2.633 ± 0.122
1.295AlaPro: 1.295 ± 0.096
1.53AlaGln: 1.53 ± 0.111
2.295AlaArg: 2.295 ± 0.165
2.7AlaSer: 2.7 ± 0.185
2.015AlaThr: 2.015 ± 0.174
1.912AlaVal: 1.912 ± 0.117
0.324AlaTrp: 0.324 ± 0.05
1.53AlaTyr: 1.53 ± 0.121
0.0AlaXaa: 0.0 ± 0.0
Cys
0.758CysAla: 0.758 ± 0.07
0.132CysCys: 0.132 ± 0.032
0.603CysAsp: 0.603 ± 0.066
0.463CysGlu: 0.463 ± 0.064
0.662CysPhe: 0.662 ± 0.08
0.927CysGly: 0.927 ± 0.096
0.206CysHis: 0.206 ± 0.043
1.501CysIle: 1.501 ± 0.1
0.964CysLys: 0.964 ± 0.085
0.934CysLeu: 0.934 ± 0.075
0.199CysMet: 0.199 ± 0.042
0.861CysAsn: 0.861 ± 0.078
0.478CysPro: 0.478 ± 0.066
0.272CysGln: 0.272 ± 0.046
0.434CysArg: 0.434 ± 0.063
0.949CysSer: 0.949 ± 0.086
0.537CysThr: 0.537 ± 0.065
0.544CysVal: 0.544 ± 0.063
0.081CysTrp: 0.081 ± 0.025
0.544CysTyr: 0.544 ± 0.072
0.0CysXaa: 0.0 ± 0.0
Asp
1.979AspAla: 1.979 ± 0.119
0.508AspCys: 0.508 ± 0.066
1.545AspAsp: 1.545 ± 0.285
1.935AspGlu: 1.935 ± 0.177
2.369AspPhe: 2.369 ± 0.116
2.089AspGly: 2.089 ± 0.144
0.713AspHis: 0.713 ± 0.08
6.223AspIle: 6.223 ± 0.199
3.663AspLys: 3.663 ± 0.182
4.215AspLeu: 4.215 ± 0.191
0.868AspMet: 0.868 ± 0.08
2.758AspAsn: 2.758 ± 0.145
1.706AspPro: 1.706 ± 0.109
1.096AspGln: 1.096 ± 0.095
1.552AspArg: 1.552 ± 0.109
2.942AspSer: 2.942 ± 0.175
2.052AspThr: 2.052 ± 0.196
1.957AspVal: 1.957 ± 0.12
0.463AspTrp: 0.463 ± 0.068
1.935AspTyr: 1.935 ± 0.145
0.0AspXaa: 0.0 ± 0.0
Glu
2.589GluAla: 2.589 ± 0.161
0.493GluCys: 0.493 ± 0.068
1.876GluAsp: 1.876 ± 0.117
3.038GluGlu: 3.038 ± 0.16
2.369GluPhe: 2.369 ± 0.136
2.229GluGly: 2.229 ± 0.146
0.699GluHis: 0.699 ± 0.08
7.311GluIle: 7.311 ± 0.275
6.311GluLys: 6.311 ± 0.254
5.068GluLeu: 5.068 ± 0.202
1.221GluMet: 1.221 ± 0.114
4.075GluAsn: 4.075 ± 0.264
1.089GluPro: 1.089 ± 0.095
1.295GluGln: 1.295 ± 0.115
2.082GluArg: 2.082 ± 0.153
2.876GluSer: 2.876 ± 0.139
2.023GluThr: 2.023 ± 0.147
2.177GluVal: 2.177 ± 0.155
0.478GluTrp: 0.478 ± 0.061
2.023GluTyr: 2.023 ± 0.12
0.0GluXaa: 0.0 ± 0.0
Phe
1.677PheAla: 1.677 ± 0.107
0.772PheCys: 0.772 ± 0.081
2.574PheAsp: 2.574 ± 0.151
2.346PheGlu: 2.346 ± 0.136
3.244PhePhe: 3.244 ± 0.182
2.633PheGly: 2.633 ± 0.149
1.008PheHis: 1.008 ± 0.085
5.568PheIle: 5.568 ± 0.226
4.141PheLys: 4.141 ± 0.179
4.737PheLeu: 4.737 ± 0.22
1.125PheMet: 1.125 ± 0.096
3.972PheAsn: 3.972 ± 0.184
1.883PhePro: 1.883 ± 0.126
1.236PheGln: 1.236 ± 0.097
1.515PheArg: 1.515 ± 0.136
4.524PheSer: 4.524 ± 0.18
2.03PheThr: 2.03 ± 0.129
1.684PheVal: 1.684 ± 0.105
0.427PheTrp: 0.427 ± 0.064
2.243PheTyr: 2.243 ± 0.136
0.0PheXaa: 0.0 ± 0.0
Gly
2.751GlyAla: 2.751 ± 0.17
0.78GlyCys: 0.78 ± 0.093
2.391GlyAsp: 2.391 ± 0.148
2.67GlyGlu: 2.67 ± 0.177
2.729GlyPhe: 2.729 ± 0.148
3.612GlyGly: 3.612 ± 0.244
1.133GlyHis: 1.133 ± 0.082
6.473GlyIle: 6.473 ± 0.256
4.98GlyLys: 4.98 ± 0.265
4.818GlyLeu: 4.818 ± 0.237
1.449GlyMet: 1.449 ± 0.113
2.655GlyAsn: 2.655 ± 0.184
1.434GlyPro: 1.434 ± 0.111
1.449GlyGln: 1.449 ± 0.123
2.597GlyArg: 2.597 ± 0.175
3.251GlySer: 3.251 ± 0.151
2.516GlyThr: 2.516 ± 0.121
3.089GlyVal: 3.089 ± 0.181
0.493GlyTrp: 0.493 ± 0.067
2.251GlyTyr: 2.251 ± 0.129
0.0GlyXaa: 0.0 ± 0.0
His
1.125HisAla: 1.125 ± 0.091
0.28HisCys: 0.28 ± 0.056
0.655HisAsp: 0.655 ± 0.073
0.758HisGlu: 0.758 ± 0.087
0.736HisPhe: 0.736 ± 0.074
1.295HisGly: 1.295 ± 0.088
0.346HisHis: 0.346 ± 0.055
2.324HisIle: 2.324 ± 0.119
1.464HisLys: 1.464 ± 0.106
1.471HisLeu: 1.471 ± 0.092
0.463HisMet: 0.463 ± 0.064
1.206HisAsn: 1.206 ± 0.102
0.978HisPro: 0.978 ± 0.089
0.493HisGln: 0.493 ± 0.064
0.802HisArg: 0.802 ± 0.084
1.361HisSer: 1.361 ± 0.099
0.691HisThr: 0.691 ± 0.086
0.802HisVal: 0.802 ± 0.085
0.125HisTrp: 0.125 ± 0.029
0.721HisTyr: 0.721 ± 0.073
0.0HisXaa: 0.0 ± 0.0
Ile
5.701IleAla: 5.701 ± 0.261
1.464IleCys: 1.464 ± 0.112
6.215IleAsp: 6.215 ± 0.262
7.032IleGlu: 7.032 ± 0.247
6.024IlePhe: 6.024 ± 0.263
6.458IleGly: 6.458 ± 0.297
2.214IleHis: 2.214 ± 0.132
14.343IleIle: 14.343 ± 0.352
13.144IleLys: 13.144 ± 0.342
12.21IleLeu: 12.21 ± 0.332
2.317IleMet: 2.317 ± 0.183
11.092IleAsn: 11.092 ± 0.454
4.024IlePro: 4.024 ± 0.193
3.42IleGln: 3.42 ± 0.17
4.126IleArg: 4.126 ± 0.197
10.077IleSer: 10.077 ± 0.455
5.796IleThr: 5.796 ± 0.236
4.656IleVal: 4.656 ± 0.223
1.111IleTrp: 1.111 ± 0.117
4.252IleTyr: 4.252 ± 0.19
0.0IleXaa: 0.0 ± 0.0
Lys
3.067LysAla: 3.067 ± 0.148
0.824LysCys: 0.824 ± 0.079
3.428LysAsp: 3.428 ± 0.165
5.061LysGlu: 5.061 ± 0.245
4.59LysPhe: 4.59 ± 0.188
3.406LysGly: 3.406 ± 0.191
1.287LysHis: 1.287 ± 0.093
14.652LysIle: 14.652 ± 0.426
14.454LysLys: 14.454 ± 0.503
10.423LysLeu: 10.423 ± 0.356
2.037LysMet: 2.037 ± 0.174
11.96LysAsn: 11.96 ± 0.532
2.413LysPro: 2.413 ± 0.139
2.192LysGln: 2.192 ± 0.126
3.67LysArg: 3.67 ± 0.264
6.068LysSer: 6.068 ± 0.329
4.06LysThr: 4.06 ± 0.211
3.295LysVal: 3.295 ± 0.192
0.868LysTrp: 0.868 ± 0.099
4.465LysTyr: 4.465 ± 0.228
0.0LysXaa: 0.0 ± 0.0
Leu
4.178LeuAla: 4.178 ± 0.203
1.221LeuCys: 1.221 ± 0.096
4.347LeuAsp: 4.347 ± 0.196
5.612LeuGlu: 5.612 ± 0.25
4.583LeuPhe: 4.583 ± 0.198
4.73LeuGly: 4.73 ± 0.259
2.251LeuHis: 2.251 ± 0.109
10.827LeuIle: 10.827 ± 0.383
9.754LeuLys: 9.754 ± 0.35
10.18LeuLeu: 10.18 ± 0.36
2.133LeuMet: 2.133 ± 0.118
8.025LeuAsn: 8.025 ± 0.294
3.553LeuPro: 3.553 ± 0.136
2.898LeuGln: 2.898 ± 0.16
4.2LeuArg: 4.2 ± 0.241
8.231LeuSer: 8.231 ± 0.36
4.259LeuThr: 4.259 ± 0.204
3.994LeuVal: 3.994 ± 0.201
0.787LeuTrp: 0.787 ± 0.073
3.509LeuTyr: 3.509 ± 0.164
0.0LeuXaa: 0.0 ± 0.0
Met
0.971MetAla: 0.971 ± 0.095
0.206MetCys: 0.206 ± 0.037
0.956MetAsp: 0.956 ± 0.09
1.044MetGlu: 1.044 ± 0.1
0.861MetPhe: 0.861 ± 0.091
1.199MetGly: 1.199 ± 0.11
0.397MetHis: 0.397 ± 0.057
2.243MetIle: 2.243 ± 0.182
2.008MetLys: 2.008 ± 0.126
2.324MetLeu: 2.324 ± 0.165
0.419MetMet: 0.419 ± 0.062
1.53MetAsn: 1.53 ± 0.102
0.919MetPro: 0.919 ± 0.075
0.927MetGln: 0.927 ± 0.085
1.022MetArg: 1.022 ± 0.095
1.596MetSer: 1.596 ± 0.105
1.008MetThr: 1.008 ± 0.109
0.794MetVal: 0.794 ± 0.071
0.125MetTrp: 0.125 ± 0.03
0.552MetTyr: 0.552 ± 0.07
0.0MetXaa: 0.0 ± 0.0
Asn
2.641AsnAla: 2.641 ± 0.155
0.853AsnCys: 0.853 ± 0.074
2.847AsnAsp: 2.847 ± 0.227
3.788AsnGlu: 3.788 ± 0.164
5.046AsnPhe: 5.046 ± 0.233
3.487AsnGly: 3.487 ± 0.169
1.147AsnHis: 1.147 ± 0.093
11.563AsnIle: 11.563 ± 0.561
9.445AsnLys: 9.445 ± 0.423
7.716AsnLeu: 7.716 ± 0.28
1.684AsnMet: 1.684 ± 0.13
7.915AsnAsn: 7.915 ± 0.462
2.295AsnPro: 2.295 ± 0.135
1.986AsnGln: 1.986 ± 0.144
2.346AsnArg: 2.346 ± 0.154
5.487AsnSer: 5.487 ± 0.344
3.45AsnThr: 3.45 ± 0.125
2.449AsnVal: 2.449 ± 0.148
0.824AsnTrp: 0.824 ± 0.084
3.729AsnTyr: 3.729 ± 0.26
0.0AsnXaa: 0.0 ± 0.0
Pro
1.199ProAla: 1.199 ± 0.109
0.39ProCys: 0.39 ± 0.07
1.383ProAsp: 1.383 ± 0.121
1.971ProGlu: 1.971 ± 0.146
1.78ProPhe: 1.78 ± 0.11
2.126ProGly: 2.126 ± 0.143
0.611ProHis: 0.611 ± 0.067
4.163ProIle: 4.163 ± 0.203
2.773ProLys: 2.773 ± 0.137
3.038ProLeu: 3.038 ± 0.129
0.588ProMet: 0.588 ± 0.064
2.317ProAsn: 2.317 ± 0.138
0.794ProPro: 0.794 ± 0.084
0.846ProGln: 0.846 ± 0.078
1.022ProArg: 1.022 ± 0.086
2.008ProSer: 2.008 ± 0.128
1.339ProThr: 1.339 ± 0.101
1.699ProVal: 1.699 ± 0.143
0.324ProTrp: 0.324 ± 0.05
1.228ProTyr: 1.228 ± 0.116
0.0ProXaa: 0.0 ± 0.0
Gln
1.508GlnAla: 1.508 ± 0.119
0.316GlnCys: 0.316 ± 0.049
1.037GlnAsp: 1.037 ± 0.092
1.456GlnGlu: 1.456 ± 0.135
1.111GlnPhe: 1.111 ± 0.104
1.258GlnGly: 1.258 ± 0.098
0.427GlnHis: 0.427 ± 0.061
3.729GlnIle: 3.729 ± 0.191
3.111GlnLys: 3.111 ± 0.176
2.832GlnLeu: 2.832 ± 0.134
0.662GlnMet: 0.662 ± 0.083
1.78GlnAsn: 1.78 ± 0.131
0.662GlnPro: 0.662 ± 0.077
0.839GlnGln: 0.839 ± 0.094
1.206GlnArg: 1.206 ± 0.113
1.545GlnSer: 1.545 ± 0.102
0.971GlnThr: 0.971 ± 0.077
1.258GlnVal: 1.258 ± 0.111
0.272GlnTrp: 0.272 ± 0.041
1.265GlnTyr: 1.265 ± 0.11
0.0GlnXaa: 0.0 ± 0.0
Arg
1.861ArgAla: 1.861 ± 0.15
0.449ArgCys: 0.449 ± 0.057
1.736ArgAsp: 1.736 ± 0.124
2.052ArgGlu: 2.052 ± 0.141
1.846ArgPhe: 1.846 ± 0.161
2.229ArgGly: 2.229 ± 0.182
0.758ArgHis: 0.758 ± 0.079
4.59ArgIle: 4.59 ± 0.276
3.516ArgLys: 3.516 ± 0.214
3.987ArgLeu: 3.987 ± 0.188
0.919ArgMet: 0.919 ± 0.107
2.729ArgAsn: 2.729 ± 0.137
1.14ArgPro: 1.14 ± 0.115
1.096ArgGln: 1.096 ± 0.098
1.795ArgArg: 1.795 ± 0.176
2.273ArgSer: 2.273 ± 0.151
1.427ArgThr: 1.427 ± 0.137
2.133ArgVal: 2.133 ± 0.155
0.368ArgTrp: 0.368 ± 0.053
1.773ArgTyr: 1.773 ± 0.137
0.0ArgXaa: 0.0 ± 0.0
Ser
3.156SerAla: 3.156 ± 0.167
0.949SerCys: 0.949 ± 0.103
3.362SerAsp: 3.362 ± 0.246
3.553SerGlu: 3.553 ± 0.164
3.178SerPhe: 3.178 ± 0.133
4.641SerGly: 4.641 ± 0.167
1.17SerHis: 1.17 ± 0.088
8.812SerIle: 8.812 ± 0.416
6.745SerLys: 6.745 ± 0.348
6.958SerLeu: 6.958 ± 0.271
1.331SerMet: 1.331 ± 0.099
5.487SerAsn: 5.487 ± 0.289
2.163SerPro: 2.163 ± 0.124
1.868SerGln: 1.868 ± 0.128
2.53SerArg: 2.53 ± 0.184
5.164SerSer: 5.164 ± 0.325
2.795SerThr: 2.795 ± 0.175
2.928SerVal: 2.928 ± 0.145
0.537SerTrp: 0.537 ± 0.065
2.692SerTyr: 2.692 ± 0.148
0.0SerXaa: 0.0 ± 0.0
Thr
2.037ThrAla: 2.037 ± 0.142
0.478ThrCys: 0.478 ± 0.065
2.001ThrAsp: 2.001 ± 0.125
2.339ThrGlu: 2.339 ± 0.126
1.721ThrPhe: 1.721 ± 0.122
3.067ThrGly: 3.067 ± 0.162
0.868ThrHis: 0.868 ± 0.075
4.825ThrIle: 4.825 ± 0.184
3.818ThrLys: 3.818 ± 0.172
4.678ThrLeu: 4.678 ± 0.158
0.713ThrMet: 0.713 ± 0.074
3.141ThrAsn: 3.141 ± 0.148
1.692ThrPro: 1.692 ± 0.127
1.508ThrGln: 1.508 ± 0.133
1.729ThrArg: 1.729 ± 0.138
2.714ThrSer: 2.714 ± 0.14
2.052ThrThr: 2.052 ± 0.137
1.92ThrVal: 1.92 ± 0.14
0.368ThrTrp: 0.368 ± 0.047
1.493ThrTyr: 1.493 ± 0.105
0.0ThrXaa: 0.0 ± 0.0
Val
2.001ValAla: 2.001 ± 0.176
0.434ValCys: 0.434 ± 0.051
2.023ValAsp: 2.023 ± 0.189
2.082ValGlu: 2.082 ± 0.155
1.721ValPhe: 1.721 ± 0.117
2.471ValGly: 2.471 ± 0.173
0.868ValHis: 0.868 ± 0.087
5.311ValIle: 5.311 ± 0.265
3.428ValLys: 3.428 ± 0.208
4.281ValLeu: 4.281 ± 0.184
0.846ValMet: 0.846 ± 0.063
2.913ValAsn: 2.913 ± 0.163
1.375ValPro: 1.375 ± 0.116
0.971ValGln: 0.971 ± 0.097
1.861ValArg: 1.861 ± 0.144
2.957ValSer: 2.957 ± 0.14
2.06ValThr: 2.06 ± 0.145
1.912ValVal: 1.912 ± 0.149
0.353ValTrp: 0.353 ± 0.06
1.258ValTyr: 1.258 ± 0.094
0.0ValXaa: 0.0 ± 0.0
Trp
0.316TrpAla: 0.316 ± 0.05
0.147TrpCys: 0.147 ± 0.038
0.368TrpAsp: 0.368 ± 0.043
0.449TrpGlu: 0.449 ± 0.06
0.441TrpPhe: 0.441 ± 0.072
0.478TrpGly: 0.478 ± 0.076
0.154TrpHis: 0.154 ± 0.032
1.206TrpIle: 1.206 ± 0.115
0.927TrpLys: 0.927 ± 0.119
1.0TrpLeu: 1.0 ± 0.1
0.199TrpMet: 0.199 ± 0.043
0.596TrpAsn: 0.596 ± 0.072
0.353TrpPro: 0.353 ± 0.052
0.191TrpGln: 0.191 ± 0.044
0.346TrpArg: 0.346 ± 0.052
0.508TrpSer: 0.508 ± 0.058
0.309TrpThr: 0.309 ± 0.055
0.463TrpVal: 0.463 ± 0.073
0.118TrpTrp: 0.118 ± 0.032
0.346TrpTyr: 0.346 ± 0.046
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.729TyrAla: 1.729 ± 0.102
0.618TyrCys: 0.618 ± 0.081
1.714TyrAsp: 1.714 ± 0.116
1.795TyrGlu: 1.795 ± 0.119
2.192TyrPhe: 2.192 ± 0.126
2.42TyrGly: 2.42 ± 0.125
0.728TyrHis: 0.728 ± 0.078
4.318TyrIle: 4.318 ± 0.173
3.972TyrLys: 3.972 ± 0.216
3.972TyrLeu: 3.972 ± 0.209
0.78TyrMet: 0.78 ± 0.089
3.053TyrAsn: 3.053 ± 0.158
1.353TyrPro: 1.353 ± 0.127
1.133TyrGln: 1.133 ± 0.079
1.545TyrArg: 1.545 ± 0.115
2.788TyrSer: 2.788 ± 0.131
1.773TyrThr: 1.773 ± 0.143
1.456TyrVal: 1.456 ± 0.103
0.471TyrTrp: 0.471 ± 0.072
1.604TyrTyr: 1.604 ± 0.098
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 372 proteins (135952 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski