Amino acid dipepetide frequency for Nanobsidianus stetteri

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.94AlaAla: 1.94 ± 0.129
0.263AlaCys: 0.263 ± 0.039
1.375AlaAsp: 1.375 ± 0.089
2.046AlaGlu: 2.046 ± 0.136
1.638AlaPhe: 1.638 ± 0.092
2.365AlaGly: 2.365 ± 0.125
0.604AlaHis: 0.604 ± 0.067
4.623AlaIle: 4.623 ± 0.154
3.416AlaLys: 3.416 ± 0.154
4.103AlaLeu: 4.103 ± 0.179
0.654AlaMet: 0.654 ± 0.057
1.957AlaAsn: 1.957 ± 0.141
1.062AlaPro: 1.062 ± 0.082
0.922AlaGln: 0.922 ± 0.076
1.537AlaArg: 1.537 ± 0.1
2.203AlaSer: 2.203 ± 0.131
1.66AlaThr: 1.66 ± 0.115
2.035AlaVal: 2.035 ± 0.118
0.324AlaTrp: 0.324 ± 0.035
1.968AlaTyr: 1.968 ± 0.119
0.0AlaXaa: 0.0 ± 0.0
Cys
0.145CysAla: 0.145 ± 0.028
0.022CysCys: 0.022 ± 0.01
0.201CysAsp: 0.201 ± 0.034
0.291CysGlu: 0.291 ± 0.044
0.179CysPhe: 0.179 ± 0.034
0.615CysGly: 0.615 ± 0.081
0.056CysHis: 0.056 ± 0.02
0.565CysIle: 0.565 ± 0.063
0.548CysLys: 0.548 ± 0.058
0.408CysLeu: 0.408 ± 0.051
0.089CysMet: 0.089 ± 0.02
0.514CysAsn: 0.514 ± 0.068
0.47CysPro: 0.47 ± 0.057
0.201CysGln: 0.201 ± 0.042
0.14CysArg: 0.14 ± 0.03
0.509CysSer: 0.509 ± 0.079
0.324CysThr: 0.324 ± 0.045
0.218CysVal: 0.218 ± 0.037
0.045CysTrp: 0.045 ± 0.017
0.369CysTyr: 0.369 ± 0.055
0.0CysXaa: 0.0 ± 0.0
Asp
1.521AspAla: 1.521 ± 0.093
0.263AspCys: 0.263 ± 0.045
2.029AspAsp: 2.029 ± 0.118
3.494AspGlu: 3.494 ± 0.175
2.628AspPhe: 2.628 ± 0.128
1.962AspGly: 1.962 ± 0.135
0.526AspHis: 0.526 ± 0.056
8.145AspIle: 8.145 ± 0.279
5.551AspLys: 5.551 ± 0.23
5.255AspLeu: 5.255 ± 0.184
0.732AspMet: 0.732 ± 0.058
3.108AspAsn: 3.108 ± 0.141
1.996AspPro: 1.996 ± 0.112
0.861AspGln: 0.861 ± 0.067
1.521AspArg: 1.521 ± 0.089
2.069AspSer: 2.069 ± 0.087
1.448AspThr: 1.448 ± 0.092
2.773AspVal: 2.773 ± 0.143
0.542AspTrp: 0.542 ± 0.065
3.254AspTyr: 3.254 ± 0.139
0.0AspXaa: 0.0 ± 0.0
Glu
2.365GluAla: 2.365 ± 0.122
0.38GluCys: 0.38 ± 0.056
3.796GluAsp: 3.796 ± 0.165
6.664GluGlu: 6.664 ± 0.265
3.203GluPhe: 3.203 ± 0.14
3.27GluGly: 3.27 ± 0.157
0.542GluHis: 0.542 ± 0.062
9.085GluIle: 9.085 ± 0.282
9.476GluLys: 9.476 ± 0.337
6.983GluLeu: 6.983 ± 0.256
1.202GluMet: 1.202 ± 0.093
6.032GluAsn: 6.032 ± 0.197
1.465GluPro: 1.465 ± 0.099
1.135GluGln: 1.135 ± 0.097
2.79GluArg: 2.79 ± 0.166
2.969GluSer: 2.969 ± 0.136
2.264GluThr: 2.264 ± 0.118
3.55GluVal: 3.55 ± 0.166
0.587GluTrp: 0.587 ± 0.056
4.612GluTyr: 4.612 ± 0.179
0.0GluXaa: 0.0 ± 0.0
Phe
1.767PheAla: 1.767 ± 0.098
0.28PheCys: 0.28 ± 0.04
2.616PheAsp: 2.616 ± 0.117
3.069PheGlu: 3.069 ± 0.161
2.331PhePhe: 2.331 ± 0.158
2.874PheGly: 2.874 ± 0.16
0.453PheHis: 0.453 ± 0.059
5.797PheIle: 5.797 ± 0.27
3.064PheLys: 3.064 ± 0.15
5.473PheLeu: 5.473 ± 0.297
0.654PheMet: 0.654 ± 0.062
3.382PheAsn: 3.382 ± 0.151
1.532PhePro: 1.532 ± 0.097
0.995PheGln: 0.995 ± 0.078
1.459PheArg: 1.459 ± 0.112
3.343PheSer: 3.343 ± 0.164
1.957PheThr: 1.957 ± 0.125
2.65PheVal: 2.65 ± 0.119
0.363PheTrp: 0.363 ± 0.046
3.399PheTyr: 3.399 ± 0.162
0.0PheXaa: 0.0 ± 0.0
Gly
1.862GlyAla: 1.862 ± 0.127
0.335GlyCys: 0.335 ± 0.052
2.342GlyAsp: 2.342 ± 0.143
3.226GlyGlu: 3.226 ± 0.148
2.41GlyPhe: 2.41 ± 0.119
2.751GlyGly: 2.751 ± 0.147
0.716GlyHis: 0.716 ± 0.084
6.725GlyIle: 6.725 ± 0.175
5.138GlyLys: 5.138 ± 0.191
4.696GlyLeu: 4.696 ± 0.164
0.995GlyMet: 0.995 ± 0.088
3.5GlyAsn: 3.5 ± 0.19
1.364GlyPro: 1.364 ± 0.107
1.37GlyGln: 1.37 ± 0.122
1.957GlyArg: 1.957 ± 0.129
3.203GlySer: 3.203 ± 0.153
2.326GlyThr: 2.326 ± 0.131
2.493GlyVal: 2.493 ± 0.153
0.57GlyTrp: 0.57 ± 0.061
3.701GlyTyr: 3.701 ± 0.145
0.0GlyXaa: 0.0 ± 0.0
His
0.526HisAla: 0.526 ± 0.058
0.073HisCys: 0.073 ± 0.022
0.352HisAsp: 0.352 ± 0.045
0.576HisGlu: 0.576 ± 0.058
0.397HisPhe: 0.397 ± 0.056
0.626HisGly: 0.626 ± 0.068
0.263HisHis: 0.263 ± 0.033
1.168HisIle: 1.168 ± 0.08
0.727HisLys: 0.727 ± 0.066
1.006HisLeu: 1.006 ± 0.076
0.229HisMet: 0.229 ± 0.036
0.509HisAsn: 0.509 ± 0.053
0.598HisPro: 0.598 ± 0.059
0.201HisGln: 0.201 ± 0.034
0.464HisArg: 0.464 ± 0.051
0.581HisSer: 0.581 ± 0.053
0.408HisThr: 0.408 ± 0.056
0.531HisVal: 0.531 ± 0.058
0.117HisTrp: 0.117 ± 0.029
0.498HisTyr: 0.498 ± 0.064
0.0HisXaa: 0.0 ± 0.0
Ile
4.763IleAla: 4.763 ± 0.178
0.626IleCys: 0.626 ± 0.064
6.569IleAsp: 6.569 ± 0.2
8.587IleGlu: 8.587 ± 0.304
6.837IlePhe: 6.837 ± 0.267
5.865IleGly: 5.865 ± 0.205
1.001IleHis: 1.001 ± 0.076
15.642IleIle: 15.642 ± 0.472
12.243IleLys: 12.243 ± 0.35
13.58IleLeu: 13.58 ± 0.347
1.845IleMet: 1.845 ± 0.114
9.655IleAsn: 9.655 ± 0.319
4.003IlePro: 4.003 ± 0.171
2.242IleGln: 2.242 ± 0.113
4.087IleArg: 4.087 ± 0.149
8.274IleSer: 8.274 ± 0.226
4.355IleThr: 4.355 ± 0.148
6.122IleVal: 6.122 ± 0.194
0.682IleTrp: 0.682 ± 0.058
8.073IleTyr: 8.073 ± 0.26
0.0IleXaa: 0.0 ± 0.0
Lys
3.008LysAla: 3.008 ± 0.115
0.537LysCys: 0.537 ± 0.064
6.491LysAsp: 6.491 ± 0.216
9.934LysGlu: 9.934 ± 0.359
3.79LysPhe: 3.79 ± 0.157
3.818LysGly: 3.818 ± 0.164
0.861LysHis: 0.861 ± 0.068
12.501LysIle: 12.501 ± 0.348
9.359LysLys: 9.359 ± 0.302
8.391LysLeu: 8.391 ± 0.255
1.923LysMet: 1.923 ± 0.116
7.363LysAsn: 7.363 ± 0.271
2.113LysPro: 2.113 ± 0.125
1.56LysGln: 1.56 ± 0.103
3.623LysArg: 3.623 ± 0.143
4.087LysSer: 4.087 ± 0.168
2.801LysThr: 2.801 ± 0.13
5.177LysVal: 5.177 ± 0.182
0.643LysTrp: 0.643 ± 0.062
7.184LysTyr: 7.184 ± 0.222
0.0LysXaa: 0.0 ± 0.0
Leu
3.779LeuAla: 3.779 ± 0.147
0.403LeuCys: 0.403 ± 0.05
5.345LeuAsp: 5.345 ± 0.198
7.726LeuGlu: 7.726 ± 0.244
5.155LeuPhe: 5.155 ± 0.263
4.83LeuGly: 4.83 ± 0.2
0.962LeuHis: 0.962 ± 0.085
10.745LeuIle: 10.745 ± 0.36
9.224LeuLys: 9.224 ± 0.228
10.488LeuLeu: 10.488 ± 0.281
1.577LeuMet: 1.577 ± 0.094
7.967LeuAsn: 7.967 ± 0.246
3.472LeuPro: 3.472 ± 0.146
2.27LeuGln: 2.27 ± 0.146
3.511LeuArg: 3.511 ± 0.15
7.346LeuSer: 7.346 ± 0.262
3.768LeuThr: 3.768 ± 0.156
4.288LeuVal: 4.288 ± 0.169
0.632LeuTrp: 0.632 ± 0.063
7.363LeuTyr: 7.363 ± 0.239
0.0LeuXaa: 0.0 ± 0.0
Met
0.772MetAla: 0.772 ± 0.066
0.039MetCys: 0.039 ± 0.014
1.085MetAsp: 1.085 ± 0.09
1.168MetGlu: 1.168 ± 0.072
0.71MetPhe: 0.71 ± 0.062
0.889MetGly: 0.889 ± 0.073
0.212MetHis: 0.212 ± 0.039
1.722MetIle: 1.722 ± 0.111
1.85MetLys: 1.85 ± 0.103
1.498MetLeu: 1.498 ± 0.102
0.224MetMet: 0.224 ± 0.035
1.208MetAsn: 1.208 ± 0.08
0.52MetPro: 0.52 ± 0.06
0.403MetGln: 0.403 ± 0.046
0.615MetArg: 0.615 ± 0.067
0.861MetSer: 0.861 ± 0.075
0.537MetThr: 0.537 ± 0.055
1.051MetVal: 1.051 ± 0.085
0.095MetTrp: 0.095 ± 0.024
0.738MetTyr: 0.738 ± 0.067
0.0MetXaa: 0.0 ± 0.0
Asn
2.102AsnAla: 2.102 ± 0.104
0.43AsnCys: 0.43 ± 0.047
2.98AsnAsp: 2.98 ± 0.144
4.322AsnGlu: 4.322 ± 0.174
3.092AsnPhe: 3.092 ± 0.151
3.802AsnGly: 3.802 ± 0.223
0.514AsnHis: 0.514 ± 0.051
11.986AsnIle: 11.986 ± 0.37
7.575AsnLys: 7.575 ± 0.276
7.726AsnLeu: 7.726 ± 0.296
1.057AsnMet: 1.057 ± 0.082
7.681AsnAsn: 7.681 ± 0.386
2.482AsnPro: 2.482 ± 0.135
2.152AsnGln: 2.152 ± 0.155
2.057AsnArg: 2.057 ± 0.123
4.193AsnSer: 4.193 ± 0.215
2.946AsnThr: 2.946 ± 0.16
3.869AsnVal: 3.869 ± 0.193
0.492AsnTrp: 0.492 ± 0.048
5.149AsnTyr: 5.149 ± 0.245
0.0AsnXaa: 0.0 ± 0.0
Pro
1.258ProAla: 1.258 ± 0.088
0.134ProCys: 0.134 ± 0.028
1.588ProAsp: 1.588 ± 0.105
2.566ProGlu: 2.566 ± 0.126
1.621ProPhe: 1.621 ± 0.089
2.046ProGly: 2.046 ± 0.118
0.363ProHis: 0.363 ± 0.046
3.287ProIle: 3.287 ± 0.136
2.454ProLys: 2.454 ± 0.121
2.874ProLeu: 2.874 ± 0.135
0.419ProMet: 0.419 ± 0.058
2.365ProAsn: 2.365 ± 0.142
1.247ProPro: 1.247 ± 0.124
0.995ProGln: 0.995 ± 0.084
0.744ProArg: 0.744 ± 0.064
2.052ProSer: 2.052 ± 0.114
1.37ProThr: 1.37 ± 0.106
1.795ProVal: 1.795 ± 0.104
0.358ProTrp: 0.358 ± 0.041
2.069ProTyr: 2.069 ± 0.12
0.0ProXaa: 0.0 ± 0.0
Gln
1.062GlnAla: 1.062 ± 0.09
0.201GlnCys: 0.201 ± 0.049
1.057GlnAsp: 1.057 ± 0.076
1.325GlnGlu: 1.325 ± 0.085
0.928GlnPhe: 0.928 ± 0.076
1.068GlnGly: 1.068 ± 0.108
0.19GlnHis: 0.19 ± 0.028
2.516GlnIle: 2.516 ± 0.117
1.604GlnLys: 1.604 ± 0.092
2.158GlnLeu: 2.158 ± 0.15
0.47GlnMet: 0.47 ± 0.05
1.923GlnAsn: 1.923 ± 0.159
0.772GlnPro: 0.772 ± 0.087
0.939GlnGln: 0.939 ± 0.103
0.721GlnArg: 0.721 ± 0.056
1.414GlnSer: 1.414 ± 0.097
1.174GlnThr: 1.174 ± 0.107
1.219GlnVal: 1.219 ± 0.099
0.184GlnTrp: 0.184 ± 0.031
1.906GlnTyr: 1.906 ± 0.145
0.0GlnXaa: 0.0 ± 0.0
Arg
1.454ArgAla: 1.454 ± 0.102
0.196ArgCys: 0.196 ± 0.031
1.778ArgAsp: 1.778 ± 0.105
3.248ArgGlu: 3.248 ± 0.143
1.493ArgPhe: 1.493 ± 0.09
1.834ArgGly: 1.834 ± 0.118
0.358ArgHis: 0.358 ± 0.059
3.779ArgIle: 3.779 ± 0.197
4.003ArgLys: 4.003 ± 0.156
3.17ArgLeu: 3.17 ± 0.157
0.559ArgMet: 0.559 ± 0.064
2.091ArgAsn: 2.091 ± 0.122
0.939ArgPro: 0.939 ± 0.085
0.553ArgGln: 0.553 ± 0.06
1.8ArgArg: 1.8 ± 0.122
1.554ArgSer: 1.554 ± 0.1
1.202ArgThr: 1.202 ± 0.102
1.621ArgVal: 1.621 ± 0.088
0.285ArgTrp: 0.285 ± 0.042
2.018ArgTyr: 2.018 ± 0.108
0.0ArgXaa: 0.0 ± 0.0
Ser
2.024SerAla: 2.024 ± 0.115
0.481SerCys: 0.481 ± 0.069
2.119SerAsp: 2.119 ± 0.106
3.656SerGlu: 3.656 ± 0.13
3.108SerPhe: 3.108 ± 0.153
3.511SerGly: 3.511 ± 0.147
0.531SerHis: 0.531 ± 0.054
7.452SerIle: 7.452 ± 0.235
4.909SerLys: 4.909 ± 0.179
6.289SerLeu: 6.289 ± 0.219
1.168SerMet: 1.168 ± 0.082
4.377SerAsn: 4.377 ± 0.225
1.862SerPro: 1.862 ± 0.107
1.968SerGln: 1.968 ± 0.131
1.705SerArg: 1.705 ± 0.109
4.372SerSer: 4.372 ± 0.243
3.086SerThr: 3.086 ± 0.21
2.616SerVal: 2.616 ± 0.133
0.553SerTrp: 0.553 ± 0.053
3.634SerTyr: 3.634 ± 0.155
0.0SerXaa: 0.0 ± 0.0
Thr
1.688ThrAla: 1.688 ± 0.109
0.285ThrCys: 0.285 ± 0.041
1.56ThrAsp: 1.56 ± 0.111
2.013ThrGlu: 2.013 ± 0.092
2.029ThrPhe: 2.029 ± 0.114
2.521ThrGly: 2.521 ± 0.126
0.458ThrHis: 0.458 ± 0.055
4.786ThrIle: 4.786 ± 0.174
2.672ThrLys: 2.672 ± 0.12
4.187ThrLeu: 4.187 ± 0.146
0.587ThrMet: 0.587 ± 0.056
2.65ThrAsn: 2.65 ± 0.169
1.549ThrPro: 1.549 ± 0.111
1.057ThrGln: 1.057 ± 0.081
1.04ThrArg: 1.04 ± 0.092
2.683ThrSer: 2.683 ± 0.178
2.141ThrThr: 2.141 ± 0.217
2.136ThrVal: 2.136 ± 0.173
0.263ThrTrp: 0.263 ± 0.039
2.622ThrTyr: 2.622 ± 0.13
0.0ThrXaa: 0.0 ± 0.0
Val
2.393ValAla: 2.393 ± 0.132
0.369ValCys: 0.369 ± 0.049
2.65ValAsp: 2.65 ± 0.115
3.528ValGlu: 3.528 ± 0.161
2.192ValPhe: 2.192 ± 0.111
2.985ValGly: 2.985 ± 0.153
0.565ValHis: 0.565 ± 0.056
5.395ValIle: 5.395 ± 0.202
4.746ValLys: 4.746 ± 0.201
4.73ValLeu: 4.73 ± 0.163
0.788ValMet: 0.788 ± 0.061
3.449ValAsn: 3.449 ± 0.17
1.705ValPro: 1.705 ± 0.11
1.085ValGln: 1.085 ± 0.082
1.834ValArg: 1.834 ± 0.138
3.326ValSer: 3.326 ± 0.137
2.287ValThr: 2.287 ± 0.176
2.683ValVal: 2.683 ± 0.159
0.458ValTrp: 0.458 ± 0.058
3.427ValTyr: 3.427 ± 0.154
0.0ValXaa: 0.0 ± 0.0
Trp
0.291TrpAla: 0.291 ± 0.044
0.056TrpCys: 0.056 ± 0.018
0.425TrpAsp: 0.425 ± 0.048
0.621TrpGlu: 0.621 ± 0.062
0.319TrpPhe: 0.319 ± 0.044
0.447TrpGly: 0.447 ± 0.046
0.117TrpHis: 0.117 ± 0.026
0.995TrpIle: 0.995 ± 0.087
0.693TrpLys: 0.693 ± 0.075
0.676TrpLeu: 0.676 ± 0.071
0.151TrpMet: 0.151 ± 0.03
0.576TrpAsn: 0.576 ± 0.053
0.184TrpPro: 0.184 ± 0.035
0.184TrpGln: 0.184 ± 0.031
0.335TrpArg: 0.335 ± 0.045
0.324TrpSer: 0.324 ± 0.04
0.285TrpThr: 0.285 ± 0.04
0.514TrpVal: 0.514 ± 0.056
0.123TrpTrp: 0.123 ± 0.024
0.498TrpTyr: 0.498 ± 0.05
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.979TyrAla: 1.979 ± 0.109
0.537TyrCys: 0.537 ± 0.062
3.321TyrAsp: 3.321 ± 0.128
4.31TyrGlu: 4.31 ± 0.159
3.399TyrPhe: 3.399 ± 0.161
3.6TyrGly: 3.6 ± 0.138
0.559TyrHis: 0.559 ± 0.052
8.134TyrIle: 8.134 ± 0.273
5.909TyrLys: 5.909 ± 0.2
7.156TyrLeu: 7.156 ± 0.249
0.867TyrMet: 0.867 ± 0.078
6.301TyrAsn: 6.301 ± 0.278
2.27TyrPro: 2.27 ± 0.118
1.739TyrGln: 1.739 ± 0.132
1.996TyrArg: 1.996 ± 0.105
4.059TyrSer: 4.059 ± 0.19
2.588TyrThr: 2.588 ± 0.14
3.254TyrVal: 3.254 ± 0.147
0.498TyrTrp: 0.498 ± 0.051
4.886TyrTyr: 4.886 ± 0.262
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 647 proteins (178873 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski