Amino acid dipepetide frequency for Buchnera aphidicola (Thelaxes californica)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.335AlaAla: 2.335 ± 0.14
0.636AlaCys: 0.636 ± 0.072
1.401AlaAsp: 1.401 ± 0.102
1.76AlaGlu: 1.76 ± 0.103
1.577AlaPhe: 1.577 ± 0.116
2.085AlaGly: 2.085 ± 0.129
0.86AlaHis: 0.86 ± 0.089
4.569AlaIle: 4.569 ± 0.172
3.276AlaLys: 3.276 ± 0.133
4.406AlaLeu: 4.406 ± 0.151
1.036AlaMet: 1.036 ± 0.097
2.085AlaAsn: 2.085 ± 0.133
1.164AlaPro: 1.164 ± 0.086
1.685AlaGln: 1.685 ± 0.11
1.563AlaArg: 1.563 ± 0.122
2.538AlaSer: 2.538 ± 0.131
2.051AlaThr: 2.051 ± 0.121
2.159AlaVal: 2.159 ± 0.137
0.345AlaTrp: 0.345 ± 0.049
1.536AlaTyr: 1.536 ± 0.105
0.0AlaXaa: 0.0 ± 0.0
Cys
0.562CysAla: 0.562 ± 0.061
0.162CysCys: 0.162 ± 0.029
0.636CysAsp: 0.636 ± 0.065
0.474CysGlu: 0.474 ± 0.053
0.758CysPhe: 0.758 ± 0.064
0.832CysGly: 0.832 ± 0.09
0.352CysHis: 0.352 ± 0.054
1.807CysIle: 1.807 ± 0.132
1.036CysLys: 1.036 ± 0.088
1.239CysLeu: 1.239 ± 0.097
0.277CysMet: 0.277 ± 0.046
0.86CysAsn: 0.86 ± 0.069
0.494CysPro: 0.494 ± 0.068
0.433CysGln: 0.433 ± 0.048
0.318CysArg: 0.318 ± 0.047
1.11CysSer: 1.11 ± 0.09
0.812CysThr: 0.812 ± 0.069
0.65CysVal: 0.65 ± 0.066
0.122CysTrp: 0.122 ± 0.035
0.521CysTyr: 0.521 ± 0.058
0.0CysXaa: 0.0 ± 0.0
Asp
1.854AspAla: 1.854 ± 0.121
0.562AspCys: 0.562 ± 0.06
1.266AspAsp: 1.266 ± 0.117
1.834AspGlu: 1.834 ± 0.119
2.443AspPhe: 2.443 ± 0.142
1.706AspGly: 1.706 ± 0.12
0.961AspHis: 0.961 ± 0.079
5.733AspIle: 5.733 ± 0.19
2.91AspLys: 2.91 ± 0.148
4.088AspLeu: 4.088 ± 0.178
0.893AspMet: 0.893 ± 0.078
2.328AspAsn: 2.328 ± 0.131
1.184AspPro: 1.184 ± 0.092
1.367AspGln: 1.367 ± 0.104
1.157AspArg: 1.157 ± 0.08
2.335AspSer: 2.335 ± 0.119
1.929AspThr: 1.929 ± 0.102
2.43AspVal: 2.43 ± 0.135
0.332AspTrp: 0.332 ± 0.054
1.624AspTyr: 1.624 ± 0.108
0.0AspXaa: 0.0 ± 0.0
Glu
2.003GluAla: 2.003 ± 0.136
0.589GluCys: 0.589 ± 0.061
1.638GluAsp: 1.638 ± 0.129
2.741GluGlu: 2.741 ± 0.159
1.76GluPhe: 1.76 ± 0.128
2.125GluGly: 2.125 ± 0.136
1.042GluHis: 1.042 ± 0.09
6.47GluIle: 6.47 ± 0.255
6.904GluLys: 6.904 ± 0.226
4.521GluLeu: 4.521 ± 0.187
1.32GluMet: 1.32 ± 0.093
3.648GluAsn: 3.648 ± 0.153
1.029GluPro: 1.029 ± 0.083
1.841GluGln: 1.841 ± 0.111
1.773GluArg: 1.773 ± 0.122
2.714GluSer: 2.714 ± 0.137
2.179GluThr: 2.179 ± 0.145
2.261GluVal: 2.261 ± 0.131
0.44GluTrp: 0.44 ± 0.045
2.098GluTyr: 2.098 ± 0.13
0.0GluXaa: 0.0 ± 0.0
Phe
1.415PheAla: 1.415 ± 0.089
0.893PheCys: 0.893 ± 0.08
2.098PheAsp: 2.098 ± 0.146
2.051PheGlu: 2.051 ± 0.137
4.596PhePhe: 4.596 ± 0.246
2.633PheGly: 2.633 ± 0.129
1.299PheHis: 1.299 ± 0.093
5.712PheIle: 5.712 ± 0.26
3.851PheLys: 3.851 ± 0.17
5.99PheLeu: 5.99 ± 0.284
0.975PheMet: 0.975 ± 0.083
3.743PheAsn: 3.743 ± 0.172
1.902PhePro: 1.902 ± 0.113
2.064PheGln: 2.064 ± 0.125
1.272PheArg: 1.272 ± 0.088
4.819PheSer: 4.819 ± 0.187
2.132PheThr: 2.132 ± 0.11
1.956PheVal: 1.956 ± 0.126
0.541PheTrp: 0.541 ± 0.061
2.531PheTyr: 2.531 ± 0.166
0.0PheXaa: 0.0 ± 0.0
Gly
2.369GlyAla: 2.369 ± 0.131
0.866GlyCys: 0.866 ± 0.071
2.159GlyAsp: 2.159 ± 0.131
2.45GlyGlu: 2.45 ± 0.158
2.389GlyPhe: 2.389 ± 0.149
3.337GlyGly: 3.337 ± 0.221
1.124GlyHis: 1.124 ± 0.099
6.944GlyIle: 6.944 ± 0.268
4.623GlyLys: 4.623 ± 0.198
4.359GlyLeu: 4.359 ± 0.196
1.442GlyMet: 1.442 ± 0.096
2.66GlyAsn: 2.66 ± 0.15
1.184GlyPro: 1.184 ± 0.101
1.435GlyGln: 1.435 ± 0.126
1.861GlyArg: 1.861 ± 0.128
3.046GlySer: 3.046 ± 0.149
2.687GlyThr: 2.687 ± 0.13
2.951GlyVal: 2.951 ± 0.168
0.474GlyTrp: 0.474 ± 0.059
1.787GlyTyr: 1.787 ± 0.111
0.0GlyXaa: 0.0 ± 0.0
His
1.124HisAla: 1.124 ± 0.083
0.393HisCys: 0.393 ± 0.049
0.934HisAsp: 0.934 ± 0.074
0.914HisGlu: 0.914 ± 0.073
1.225HisPhe: 1.225 ± 0.106
1.225HisGly: 1.225 ± 0.1
0.562HisHis: 0.562 ± 0.066
2.497HisIle: 2.497 ± 0.151
1.834HisLys: 1.834 ± 0.106
2.261HisLeu: 2.261 ± 0.119
0.447HisMet: 0.447 ± 0.054
1.868HisAsn: 1.868 ± 0.117
1.151HisPro: 1.151 ± 0.101
1.042HisGln: 1.042 ± 0.09
0.866HisArg: 0.866 ± 0.069
1.57HisSer: 1.57 ± 0.096
1.286HisThr: 1.286 ± 0.091
1.096HisVal: 1.096 ± 0.097
0.237HisTrp: 0.237 ± 0.035
1.015HisTyr: 1.015 ± 0.085
0.0HisXaa: 0.0 ± 0.0
Ile
5.482IleAla: 5.482 ± 0.205
1.584IleCys: 1.584 ± 0.102
5.651IleAsp: 5.651 ± 0.181
6.355IleGlu: 6.355 ± 0.215
6.985IlePhe: 6.985 ± 0.317
6.328IleGly: 6.328 ± 0.23
3.398IleHis: 3.398 ± 0.153
14.125IleIle: 14.125 ± 0.449
12.487IleLys: 12.487 ± 0.365
12.508IleLeu: 12.508 ± 0.327
2.328IleMet: 2.328 ± 0.132
10.166IleAsn: 10.166 ± 0.304
4.305IlePro: 4.305 ± 0.152
5.821IleGln: 5.821 ± 0.225
3.689IleArg: 3.689 ± 0.157
8.352IleSer: 8.352 ± 0.252
6.105IleThr: 6.105 ± 0.183
5.509IleVal: 5.509 ± 0.188
0.846IleTrp: 0.846 ± 0.081
4.46IleTyr: 4.46 ± 0.198
0.0IleXaa: 0.0 ± 0.0
Lys
2.504LysAla: 2.504 ± 0.139
1.029LysCys: 1.029 ± 0.082
3.459LysAsp: 3.459 ± 0.159
5.421LysGlu: 5.421 ± 0.214
3.844LysPhe: 3.844 ± 0.153
3.716LysGly: 3.716 ± 0.16
1.672LysHis: 1.672 ± 0.108
15.932LysIle: 15.932 ± 0.49
21.144LysLys: 21.144 ± 0.632
7.33LysLeu: 7.33 ± 0.248
2.409LysMet: 2.409 ± 0.133
12.866LysAsn: 12.866 ± 0.383
1.76LysPro: 1.76 ± 0.123
3.154LysGln: 3.154 ± 0.158
2.721LysArg: 2.721 ± 0.146
4.69LysSer: 4.69 ± 0.194
4.359LysThr: 4.359 ± 0.192
3.614LysVal: 3.614 ± 0.17
0.975LysTrp: 0.975 ± 0.082
4.44LysTyr: 4.44 ± 0.192
0.0LysXaa: 0.0 ± 0.0
Leu
3.337LeuAla: 3.337 ± 0.146
1.232LeuCys: 1.232 ± 0.077
3.824LeuAsp: 3.824 ± 0.179
4.9LeuGlu: 4.9 ± 0.197
4.853LeuPhe: 4.853 ± 0.236
4.873LeuGly: 4.873 ± 0.181
2.437LeuHis: 2.437 ± 0.121
10.247LeuIle: 10.247 ± 0.32
10.701LeuLys: 10.701 ± 0.324
9.618LeuLeu: 9.618 ± 0.327
2.017LeuMet: 2.017 ± 0.125
7.824LeuAsn: 7.824 ± 0.243
2.822LeuPro: 2.822 ± 0.146
3.58LeuGln: 3.58 ± 0.149
2.917LeuArg: 2.917 ± 0.165
7.574LeuSer: 7.574 ± 0.223
4.183LeuThr: 4.183 ± 0.162
3.831LeuVal: 3.831 ± 0.177
0.724LeuTrp: 0.724 ± 0.079
3.702LeuTyr: 3.702 ± 0.172
0.0LeuXaa: 0.0 ± 0.0
Met
0.88MetAla: 0.88 ± 0.083
0.264MetCys: 0.264 ± 0.044
0.765MetAsp: 0.765 ± 0.079
1.036MetGlu: 1.036 ± 0.078
1.124MetPhe: 1.124 ± 0.109
1.212MetGly: 1.212 ± 0.089
0.65MetHis: 0.65 ± 0.072
2.484MetIle: 2.484 ± 0.133
2.342MetLys: 2.342 ± 0.12
2.376MetLeu: 2.376 ± 0.125
0.562MetMet: 0.562 ± 0.069
2.003MetAsn: 2.003 ± 0.105
0.717MetPro: 0.717 ± 0.081
0.88MetGln: 0.88 ± 0.065
0.772MetArg: 0.772 ± 0.084
1.387MetSer: 1.387 ± 0.113
0.981MetThr: 0.981 ± 0.074
1.022MetVal: 1.022 ± 0.082
0.162MetTrp: 0.162 ± 0.029
0.839MetTyr: 0.839 ± 0.069
0.0MetXaa: 0.0 ± 0.0
Asn
2.552AsnAla: 2.552 ± 0.13
1.022AsnCys: 1.022 ± 0.085
2.768AsnAsp: 2.768 ± 0.124
3.208AsnGlu: 3.208 ± 0.161
4.914AsnPhe: 4.914 ± 0.217
2.504AsnGly: 2.504 ± 0.14
1.726AsnHis: 1.726 ± 0.113
12.948AsnIle: 12.948 ± 0.406
7.892AsnLys: 7.892 ± 0.29
6.382AsnLeu: 6.382 ± 0.214
1.685AsnMet: 1.685 ± 0.098
7.783AsnAsn: 7.783 ± 0.338
2.2AsnPro: 2.2 ± 0.126
3.174AsnGln: 3.174 ± 0.152
1.983AsnArg: 1.983 ± 0.12
4.332AsnSer: 4.332 ± 0.175
4.156AsnThr: 4.156 ± 0.158
3.682AsnVal: 3.682 ± 0.161
0.67AsnTrp: 0.67 ± 0.074
3.005AsnTyr: 3.005 ± 0.141
0.0AsnXaa: 0.0 ± 0.0
Pro
0.954ProAla: 0.954 ± 0.086
0.291ProCys: 0.291 ± 0.049
1.056ProAsp: 1.056 ± 0.097
1.841ProGlu: 1.841 ± 0.105
1.618ProPhe: 1.618 ± 0.103
1.76ProGly: 1.76 ± 0.13
0.873ProHis: 0.873 ± 0.075
3.946ProIle: 3.946 ± 0.18
2.518ProLys: 2.518 ± 0.116
2.816ProLeu: 2.816 ± 0.138
0.751ProMet: 0.751 ± 0.076
1.861ProAsn: 1.861 ± 0.106
0.826ProPro: 0.826 ± 0.07
0.934ProGln: 0.934 ± 0.08
0.697ProArg: 0.697 ± 0.064
1.719ProSer: 1.719 ± 0.111
1.408ProThr: 1.408 ± 0.106
1.462ProVal: 1.462 ± 0.096
0.277ProTrp: 0.277 ± 0.055
1.374ProTyr: 1.374 ± 0.089
0.0ProXaa: 0.0 ± 0.0
Gln
1.658GlnAla: 1.658 ± 0.104
0.501GlnCys: 0.501 ± 0.058
1.509GlnAsp: 1.509 ± 0.099
2.274GlnGlu: 2.274 ± 0.147
1.624GlnPhe: 1.624 ± 0.094
1.631GlnGly: 1.631 ± 0.115
0.927GlnHis: 0.927 ± 0.084
3.878GlnIle: 3.878 ± 0.172
5.063GlnLys: 5.063 ± 0.2
3.824GlnLeu: 3.824 ± 0.163
0.69GlnMet: 0.69 ± 0.056
2.964GlnAsn: 2.964 ± 0.139
1.124GlnPro: 1.124 ± 0.085
1.577GlnGln: 1.577 ± 0.118
1.063GlnArg: 1.063 ± 0.084
2.47GlnSer: 2.47 ± 0.128
1.536GlnThr: 1.536 ± 0.105
1.401GlnVal: 1.401 ± 0.098
0.365GlnTrp: 0.365 ± 0.061
1.848GlnTyr: 1.848 ± 0.114
0.0GlnXaa: 0.0 ± 0.0
Arg
1.387ArgAla: 1.387 ± 0.099
0.406ArgCys: 0.406 ± 0.055
1.144ArgAsp: 1.144 ± 0.086
1.475ArgGlu: 1.475 ± 0.109
1.462ArgPhe: 1.462 ± 0.11
1.699ArgGly: 1.699 ± 0.125
0.697ArgHis: 0.697 ± 0.068
3.824ArgIle: 3.824 ± 0.158
2.998ArgLys: 2.998 ± 0.162
2.592ArgLeu: 2.592 ± 0.13
0.941ArgMet: 0.941 ± 0.084
2.098ArgAsn: 2.098 ± 0.119
0.981ArgPro: 0.981 ± 0.075
1.029ArgGln: 1.029 ± 0.086
1.286ArgArg: 1.286 ± 0.103
1.834ArgSer: 1.834 ± 0.12
1.631ArgThr: 1.631 ± 0.105
1.563ArgVal: 1.563 ± 0.128
0.223ArgTrp: 0.223 ± 0.037
1.069ArgTyr: 1.069 ± 0.081
0.0ArgXaa: 0.0 ± 0.0
Ser
2.782SerAla: 2.782 ± 0.148
0.907SerCys: 0.907 ± 0.087
2.667SerAsp: 2.667 ± 0.121
3.242SerGlu: 3.242 ± 0.133
3.689SerPhe: 3.689 ± 0.199
4.657SerGly: 4.657 ± 0.221
1.584SerHis: 1.584 ± 0.099
8.569SerIle: 8.569 ± 0.312
5.34SerLys: 5.34 ± 0.182
6.443SerLeu: 6.443 ± 0.217
1.455SerMet: 1.455 ± 0.098
4.217SerAsn: 4.217 ± 0.175
1.475SerPro: 1.475 ± 0.089
2.064SerGln: 2.064 ± 0.102
1.949SerArg: 1.949 ± 0.119
4.508SerSer: 4.508 ± 0.185
2.924SerThr: 2.924 ± 0.138
3.316SerVal: 3.316 ± 0.166
0.596SerTrp: 0.596 ± 0.069
2.254SerTyr: 2.254 ± 0.135
0.0SerXaa: 0.0 ± 0.0
Thr
2.118ThrAla: 2.118 ± 0.127
0.643ThrCys: 0.643 ± 0.072
1.78ThrAsp: 1.78 ± 0.099
2.423ThrGlu: 2.423 ± 0.139
2.274ThrPhe: 2.274 ± 0.114
3.134ThrGly: 3.134 ± 0.165
1.124ThrHis: 1.124 ± 0.086
6.03ThrIle: 6.03 ± 0.185
3.885ThrLys: 3.885 ± 0.142
4.88ThrLeu: 4.88 ± 0.168
0.981ThrMet: 0.981 ± 0.077
2.992ThrAsn: 2.992 ± 0.135
1.888ThrPro: 1.888 ± 0.112
1.827ThrGln: 1.827 ± 0.119
1.421ThrArg: 1.421 ± 0.109
3.093ThrSer: 3.093 ± 0.173
2.572ThrThr: 2.572 ± 0.153
2.748ThrVal: 2.748 ± 0.136
0.318ThrTrp: 0.318 ± 0.049
1.618ThrTyr: 1.618 ± 0.104
0.0ThrXaa: 0.0 ± 0.0
Val
2.03ValAla: 2.03 ± 0.121
0.643ValCys: 0.643 ± 0.07
2.166ValAsp: 2.166 ± 0.132
2.288ValGlu: 2.288 ± 0.149
2.301ValPhe: 2.301 ± 0.15
2.538ValGly: 2.538 ± 0.169
1.096ValHis: 1.096 ± 0.1
5.381ValIle: 5.381 ± 0.198
3.736ValLys: 3.736 ± 0.164
4.995ValLeu: 4.995 ± 0.182
1.124ValMet: 1.124 ± 0.092
2.89ValAsn: 2.89 ± 0.152
1.469ValPro: 1.469 ± 0.099
1.807ValGln: 1.807 ± 0.118
1.55ValArg: 1.55 ± 0.102
3.31ValSer: 3.31 ± 0.147
2.382ValThr: 2.382 ± 0.127
2.572ValVal: 2.572 ± 0.155
0.332ValTrp: 0.332 ± 0.051
1.645ValTyr: 1.645 ± 0.109
0.0ValXaa: 0.0 ± 0.0
Trp
0.23TrpAla: 0.23 ± 0.043
0.135TrpCys: 0.135 ± 0.033
0.298TrpAsp: 0.298 ± 0.049
0.426TrpGlu: 0.426 ± 0.072
0.535TrpPhe: 0.535 ± 0.058
0.365TrpGly: 0.365 ± 0.047
0.176TrpHis: 0.176 ± 0.033
1.029TrpIle: 1.029 ± 0.109
1.09TrpLys: 1.09 ± 0.1
0.819TrpLeu: 0.819 ± 0.077
0.271TrpMet: 0.271 ± 0.042
0.805TrpAsn: 0.805 ± 0.073
0.217TrpPro: 0.217 ± 0.036
0.217TrpGln: 0.217 ± 0.04
0.311TrpArg: 0.311 ± 0.046
0.426TrpSer: 0.426 ± 0.054
0.264TrpThr: 0.264 ± 0.041
0.372TrpVal: 0.372 ± 0.054
0.054TrpTrp: 0.054 ± 0.023
0.359TrpTyr: 0.359 ± 0.048
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.489TyrAla: 1.489 ± 0.106
0.697TyrCys: 0.697 ± 0.067
1.719TyrAsp: 1.719 ± 0.113
1.983TyrGlu: 1.983 ± 0.127
2.504TyrPhe: 2.504 ± 0.155
1.787TyrGly: 1.787 ± 0.126
0.988TyrHis: 0.988 ± 0.077
4.359TyrIle: 4.359 ± 0.212
3.411TyrLys: 3.411 ± 0.162
3.75TyrLeu: 3.75 ± 0.197
0.846TyrMet: 0.846 ± 0.084
2.924TyrAsn: 2.924 ± 0.142
1.09TyrPro: 1.09 ± 0.077
2.003TyrGln: 2.003 ± 0.125
1.171TyrArg: 1.171 ± 0.094
2.809TyrSer: 2.809 ± 0.156
2.173TyrThr: 2.173 ± 0.125
1.658TyrVal: 1.658 ± 0.105
0.365TyrTrp: 0.365 ± 0.06
1.503TyrTyr: 1.503 ± 0.125
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 455 proteins (147751 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski