Amino acid dipepetide frequency for cyanobacterium G8-9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.688AlaAla: 4.688 ± 0.288
0.643AlaCys: 0.643 ± 0.132
3.294AlaAsp: 3.294 ± 0.193
3.788AlaGlu: 3.788 ± 0.227
3.492AlaPhe: 3.492 ± 0.217
4.946AlaGly: 4.946 ± 0.296
1.286AlaHis: 1.286 ± 0.117
5.707AlaIle: 5.707 ± 0.219
6.222AlaLys: 6.222 ± 0.325
7.468AlaLeu: 7.468 ± 0.31
2.305AlaMet: 2.305 ± 0.17
2.582AlaAsn: 2.582 ± 0.152
2.127AlaPro: 2.127 ± 0.145
2.384AlaGln: 2.384 ± 0.171
2.443AlaArg: 2.443 ± 0.172
4.837AlaSer: 4.837 ± 0.251
3.68AlaThr: 3.68 ± 0.2
4.56AlaVal: 4.56 ± 0.227
0.445AlaTrp: 0.445 ± 0.067
2.779AlaTyr: 2.779 ± 0.184
0.0AlaXaa: 0.0 ± 0.0
Cys
0.504CysAla: 0.504 ± 0.081
0.109CysCys: 0.109 ± 0.035
0.682CysAsp: 0.682 ± 0.101
0.574CysGlu: 0.574 ± 0.085
0.356CysPhe: 0.356 ± 0.058
0.831CysGly: 0.831 ± 0.101
0.247CysHis: 0.247 ± 0.054
0.504CysIle: 0.504 ± 0.075
0.811CysLys: 0.811 ± 0.095
0.682CysLeu: 0.682 ± 0.083
0.267CysMet: 0.267 ± 0.055
0.455CysAsn: 0.455 ± 0.071
0.336CysPro: 0.336 ± 0.071
0.346CysGln: 0.346 ± 0.06
0.475CysArg: 0.475 ± 0.07
0.603CysSer: 0.603 ± 0.079
0.485CysThr: 0.485 ± 0.079
0.574CysVal: 0.574 ± 0.067
0.109CysTrp: 0.109 ± 0.039
0.277CysTyr: 0.277 ± 0.046
0.0CysXaa: 0.0 ± 0.0
Asp
4.115AspAla: 4.115 ± 0.208
0.386AspCys: 0.386 ± 0.055
2.868AspAsp: 2.868 ± 0.188
4.451AspGlu: 4.451 ± 0.257
2.661AspPhe: 2.661 ± 0.149
3.393AspGly: 3.393 ± 0.193
0.752AspHis: 0.752 ± 0.098
5.163AspIle: 5.163 ± 0.261
4.807AspLys: 4.807 ± 0.259
4.214AspLeu: 4.214 ± 0.209
1.701AspMet: 1.701 ± 0.144
2.582AspAsn: 2.582 ± 0.166
1.869AspPro: 1.869 ± 0.181
0.87AspGln: 0.87 ± 0.108
1.899AspArg: 1.899 ± 0.144
3.007AspSer: 3.007 ± 0.198
3.086AspThr: 3.086 ± 0.214
3.759AspVal: 3.759 ± 0.183
0.504AspTrp: 0.504 ± 0.078
2.463AspTyr: 2.463 ± 0.159
0.0AspXaa: 0.0 ± 0.0
Glu
5.44GluAla: 5.44 ± 0.286
0.772GluCys: 0.772 ± 0.105
3.778GluAsp: 3.778 ± 0.223
5.994GluGlu: 5.994 ± 0.296
2.295GluPhe: 2.295 ± 0.159
4.313GluGly: 4.313 ± 0.25
1.276GluHis: 1.276 ± 0.114
5.401GluIle: 5.401 ± 0.219
6.716GluLys: 6.716 ± 0.304
6.192GluLeu: 6.192 ± 0.313
2.018GluMet: 2.018 ± 0.146
3.749GluAsn: 3.749 ± 0.211
1.573GluPro: 1.573 ± 0.159
2.146GluGln: 2.146 ± 0.143
3.195GluArg: 3.195 ± 0.193
3.591GluSer: 3.591 ± 0.213
3.323GluThr: 3.323 ± 0.188
5.43GluVal: 5.43 ± 0.298
1.365GluTrp: 1.365 ± 0.286
2.582GluTyr: 2.582 ± 0.155
0.0GluXaa: 0.0 ± 0.0
Phe
3.136PheAla: 3.136 ± 0.183
0.386PheCys: 0.386 ± 0.06
3.234PheAsp: 3.234 ± 0.209
2.859PheGlu: 2.859 ± 0.19
2.305PhePhe: 2.305 ± 0.175
3.185PheGly: 3.185 ± 0.214
0.722PheHis: 0.722 ± 0.083
4.055PheIle: 4.055 ± 0.217
3.284PheLys: 3.284 ± 0.161
4.431PheLeu: 4.431 ± 0.273
1.365PheMet: 1.365 ± 0.138
2.216PheAsn: 2.216 ± 0.162
1.484PhePro: 1.484 ± 0.127
1.058PheGln: 1.058 ± 0.085
1.306PheArg: 1.306 ± 0.106
3.581PheSer: 3.581 ± 0.222
2.888PheThr: 2.888 ± 0.189
3.155PheVal: 3.155 ± 0.192
0.376PheTrp: 0.376 ± 0.074
1.701PheTyr: 1.701 ± 0.127
0.0PheXaa: 0.0 ± 0.0
Gly
4.303GlyAla: 4.303 ± 0.246
0.91GlyCys: 0.91 ± 0.105
3.373GlyAsp: 3.373 ± 0.186
3.976GlyGlu: 3.976 ± 0.224
3.541GlyPhe: 3.541 ± 0.19
4.105GlyGly: 4.105 ± 0.271
1.414GlyHis: 1.414 ± 0.139
5.559GlyIle: 5.559 ± 0.219
4.936GlyLys: 4.936 ± 0.237
5.846GlyLeu: 5.846 ± 0.235
2.502GlyMet: 2.502 ± 0.232
2.493GlyAsn: 2.493 ± 0.157
1.335GlyPro: 1.335 ± 0.135
1.691GlyGln: 1.691 ± 0.129
2.512GlyArg: 2.512 ± 0.171
3.749GlySer: 3.749 ± 0.221
3.412GlyThr: 3.412 ± 0.182
4.787GlyVal: 4.787 ± 0.181
0.673GlyTrp: 0.673 ± 0.092
3.086GlyTyr: 3.086 ± 0.185
0.0GlyXaa: 0.0 ± 0.0
His
1.217HisAla: 1.217 ± 0.119
0.297HisCys: 0.297 ± 0.055
0.979HisAsp: 0.979 ± 0.088
1.177HisGlu: 1.177 ± 0.095
1.118HisPhe: 1.118 ± 0.103
1.256HisGly: 1.256 ± 0.126
0.524HisHis: 0.524 ± 0.073
1.355HisIle: 1.355 ± 0.115
1.316HisLys: 1.316 ± 0.111
1.85HisLeu: 1.85 ± 0.143
0.534HisMet: 0.534 ± 0.069
0.9HisAsn: 0.9 ± 0.093
0.841HisPro: 0.841 ± 0.085
0.712HisGln: 0.712 ± 0.078
0.682HisArg: 0.682 ± 0.074
1.405HisSer: 1.405 ± 0.129
1.187HisThr: 1.187 ± 0.139
1.078HisVal: 1.078 ± 0.102
0.227HisTrp: 0.227 ± 0.045
0.811HisTyr: 0.811 ± 0.101
0.0HisXaa: 0.0 ± 0.0
Ile
6.568IleAla: 6.568 ± 0.304
0.564IleCys: 0.564 ± 0.069
4.926IleAsp: 4.926 ± 0.231
6.489IleGlu: 6.489 ± 0.331
3.61IlePhe: 3.61 ± 0.233
5.045IleGly: 5.045 ± 0.212
1.246IleHis: 1.246 ± 0.107
5.638IleIle: 5.638 ± 0.316
6.419IleLys: 6.419 ± 0.229
7.052IleLeu: 7.052 ± 0.238
1.86IleMet: 1.86 ± 0.153
3.432IleAsn: 3.432 ± 0.214
3.145IlePro: 3.145 ± 0.179
2.196IleGln: 2.196 ± 0.151
2.7IleArg: 2.7 ± 0.171
4.827IleSer: 4.827 ± 0.265
4.006IleThr: 4.006 ± 0.188
5.529IleVal: 5.529 ± 0.224
0.623IleTrp: 0.623 ± 0.088
2.71IleTyr: 2.71 ± 0.186
0.0IleXaa: 0.0 ± 0.0
Lys
5.48LysAla: 5.48 ± 0.228
0.692LysCys: 0.692 ± 0.081
3.986LysAsp: 3.986 ± 0.286
7.646LysGlu: 7.646 ± 0.298
2.997LysPhe: 2.997 ± 0.179
4.679LysGly: 4.679 ± 0.247
1.751LysHis: 1.751 ± 0.145
6.053LysIle: 6.053 ± 0.31
8.17LysLys: 8.17 ± 0.522
6.914LysLeu: 6.914 ± 0.277
2.522LysMet: 2.522 ± 0.181
4.016LysAsn: 4.016 ± 0.209
2.878LysPro: 2.878 ± 0.184
2.483LysGln: 2.483 ± 0.159
3.432LysArg: 3.432 ± 0.216
5.045LysSer: 5.045 ± 0.217
4.441LysThr: 4.441 ± 0.252
5.529LysVal: 5.529 ± 0.262
0.841LysTrp: 0.841 ± 0.138
3.205LysTyr: 3.205 ± 0.186
0.0LysXaa: 0.0 ± 0.0
Leu
7.151LeuAla: 7.151 ± 0.32
0.762LeuCys: 0.762 ± 0.089
5.292LeuAsp: 5.292 ± 0.234
6.776LeuGlu: 6.776 ± 0.291
4.906LeuPhe: 4.906 ± 0.244
6.103LeuGly: 6.103 ± 0.249
1.939LeuHis: 1.939 ± 0.143
6.934LeuIle: 6.934 ± 0.28
7.458LeuLys: 7.458 ± 0.313
9.446LeuLeu: 9.446 ± 0.341
2.473LeuMet: 2.473 ± 0.155
3.957LeuAsn: 3.957 ± 0.23
3.858LeuPro: 3.858 ± 0.26
2.512LeuGln: 2.512 ± 0.159
3.284LeuArg: 3.284 ± 0.178
6.785LeuSer: 6.785 ± 0.253
5.331LeuThr: 5.331 ± 0.231
5.668LeuVal: 5.668 ± 0.235
0.92LeuTrp: 0.92 ± 0.098
3.66LeuTyr: 3.66 ± 0.211
0.0LeuXaa: 0.0 ± 0.0
Met
1.988MetAla: 1.988 ± 0.141
0.198MetCys: 0.198 ± 0.042
1.434MetAsp: 1.434 ± 0.123
2.384MetGlu: 2.384 ± 0.271
1.316MetPhe: 1.316 ± 0.128
2.216MetGly: 2.216 ± 0.183
0.663MetHis: 0.663 ± 0.098
2.73MetIle: 2.73 ± 0.182
2.611MetLys: 2.611 ± 0.186
2.69MetLeu: 2.69 ± 0.154
1.039MetMet: 1.039 ± 0.108
1.108MetAsn: 1.108 ± 0.099
0.999MetPro: 0.999 ± 0.095
1.029MetGln: 1.029 ± 0.103
1.108MetArg: 1.108 ± 0.094
1.721MetSer: 1.721 ± 0.141
1.444MetThr: 1.444 ± 0.121
1.741MetVal: 1.741 ± 0.154
0.297MetTrp: 0.297 ± 0.055
0.801MetTyr: 0.801 ± 0.078
0.0MetXaa: 0.0 ± 0.0
Asn
2.977AsnAla: 2.977 ± 0.154
0.415AsnCys: 0.415 ± 0.073
2.453AsnAsp: 2.453 ± 0.172
3.027AsnGlu: 3.027 ± 0.181
1.968AsnPhe: 1.968 ± 0.14
3.294AsnGly: 3.294 ± 0.311
0.89AsnHis: 0.89 ± 0.089
3.749AsnIle: 3.749 ± 0.212
3.403AsnLys: 3.403 ± 0.21
4.342AsnLeu: 4.342 ± 0.214
1.128AsnMet: 1.128 ± 0.118
1.949AsnAsn: 1.949 ± 0.166
2.305AsnPro: 2.305 ± 0.161
1.375AsnGln: 1.375 ± 0.116
1.741AsnArg: 1.741 ± 0.149
2.324AsnSer: 2.324 ± 0.153
2.532AsnThr: 2.532 ± 0.167
2.76AsnVal: 2.76 ± 0.179
0.307AsnTrp: 0.307 ± 0.051
2.067AsnTyr: 2.067 ± 0.151
0.0AsnXaa: 0.0 ± 0.0
Pro
1.8ProAla: 1.8 ± 0.164
0.317ProCys: 0.317 ± 0.065
1.444ProAsp: 1.444 ± 0.12
2.532ProGlu: 2.532 ± 0.174
1.998ProPhe: 1.998 ± 0.144
1.929ProGly: 1.929 ± 0.155
0.712ProHis: 0.712 ± 0.088
2.364ProIle: 2.364 ± 0.153
2.572ProLys: 2.572 ± 0.193
4.036ProLeu: 4.036 ± 0.208
0.9ProMet: 0.9 ± 0.101
1.494ProAsn: 1.494 ± 0.137
1.217ProPro: 1.217 ± 0.324
1.078ProGln: 1.078 ± 0.111
1.187ProArg: 1.187 ± 0.099
2.315ProSer: 2.315 ± 0.136
1.909ProThr: 1.909 ± 0.131
2.453ProVal: 2.453 ± 0.225
0.386ProTrp: 0.386 ± 0.06
1.553ProTyr: 1.553 ± 0.121
0.0ProXaa: 0.0 ± 0.0
Gln
1.919GlnAla: 1.919 ± 0.15
0.237GlnCys: 0.237 ± 0.049
1.494GlnAsp: 1.494 ± 0.125
2.532GlnGlu: 2.532 ± 0.184
1.266GlnPhe: 1.266 ± 0.112
1.751GlnGly: 1.751 ± 0.116
0.564GlnHis: 0.564 ± 0.081
2.315GlnIle: 2.315 ± 0.147
2.7GlnLys: 2.7 ± 0.218
2.601GlnLeu: 2.601 ± 0.14
0.811GlnMet: 0.811 ± 0.08
1.563GlnAsn: 1.563 ± 0.116
0.88GlnPro: 0.88 ± 0.09
0.959GlnGln: 0.959 ± 0.1
1.256GlnArg: 1.256 ± 0.106
1.761GlnSer: 1.761 ± 0.121
1.573GlnThr: 1.573 ± 0.13
1.721GlnVal: 1.721 ± 0.131
0.326GlnTrp: 0.326 ± 0.065
1.157GlnTyr: 1.157 ± 0.104
0.0GlnXaa: 0.0 ± 0.0
Arg
2.552ArgAla: 2.552 ± 0.167
0.317ArgCys: 0.317 ± 0.061
2.057ArgAsp: 2.057 ± 0.134
2.72ArgGlu: 2.72 ± 0.169
1.672ArgPhe: 1.672 ± 0.132
2.374ArgGly: 2.374 ± 0.167
0.633ArgHis: 0.633 ± 0.081
2.73ArgIle: 2.73 ± 0.151
2.878ArgLys: 2.878 ± 0.182
3.551ArgLeu: 3.551 ± 0.179
1.217ArgMet: 1.217 ± 0.127
1.711ArgAsn: 1.711 ± 0.129
1.187ArgPro: 1.187 ± 0.118
1.246ArgGln: 1.246 ± 0.115
1.711ArgArg: 1.711 ± 0.154
1.869ArgSer: 1.869 ± 0.127
1.741ArgThr: 1.741 ± 0.141
2.72ArgVal: 2.72 ± 0.155
0.574ArgTrp: 0.574 ± 0.094
1.612ArgTyr: 1.612 ± 0.12
0.0ArgXaa: 0.0 ± 0.0
Ser
4.046SerAla: 4.046 ± 0.235
0.514SerCys: 0.514 ± 0.076
3.264SerAsp: 3.264 ± 0.2
3.581SerGlu: 3.581 ± 0.203
3.462SerPhe: 3.462 ± 0.196
3.976SerGly: 3.976 ± 0.204
1.325SerHis: 1.325 ± 0.117
5.104SerIle: 5.104 ± 0.255
5.262SerLys: 5.262 ± 0.221
5.707SerLeu: 5.707 ± 0.298
2.087SerMet: 2.087 ± 0.157
2.888SerAsn: 2.888 ± 0.215
1.899SerPro: 1.899 ± 0.131
2.047SerGln: 2.047 ± 0.13
2.334SerArg: 2.334 ± 0.15
4.283SerSer: 4.283 ± 0.217
3.64SerThr: 3.64 ± 0.215
3.976SerVal: 3.976 ± 0.212
0.564SerTrp: 0.564 ± 0.076
2.512SerTyr: 2.512 ± 0.116
0.0SerXaa: 0.0 ± 0.0
Thr
3.6ThrAla: 3.6 ± 0.202
0.564ThrCys: 0.564 ± 0.091
3.126ThrAsp: 3.126 ± 0.191
2.532ThrGlu: 2.532 ± 0.158
2.493ThrPhe: 2.493 ± 0.148
3.195ThrGly: 3.195 ± 0.205
1.266ThrHis: 1.266 ± 0.122
4.679ThrIle: 4.679 ± 0.193
3.957ThrLys: 3.957 ± 0.234
6.479ThrLeu: 6.479 ± 0.326
1.642ThrMet: 1.642 ± 0.116
2.127ThrAsn: 2.127 ± 0.144
2.552ThrPro: 2.552 ± 0.177
1.79ThrGln: 1.79 ± 0.143
1.573ThrArg: 1.573 ± 0.124
3.185ThrSer: 3.185 ± 0.219
3.027ThrThr: 3.027 ± 0.211
3.67ThrVal: 3.67 ± 0.185
0.366ThrTrp: 0.366 ± 0.068
2.364ThrTyr: 2.364 ± 0.168
0.0ThrXaa: 0.0 ± 0.0
Val
4.728ValAla: 4.728 ± 0.243
0.692ValCys: 0.692 ± 0.096
3.818ValAsp: 3.818 ± 0.2
4.886ValGlu: 4.886 ± 0.256
2.532ValPhe: 2.532 ± 0.165
4.51ValGly: 4.51 ± 0.227
1.197ValHis: 1.197 ± 0.112
5.183ValIle: 5.183 ± 0.282
5.282ValLys: 5.282 ± 0.252
6.696ValLeu: 6.696 ± 0.254
1.751ValMet: 1.751 ± 0.12
2.76ValAsn: 2.76 ± 0.174
2.315ValPro: 2.315 ± 0.137
1.949ValGln: 1.949 ± 0.107
2.315ValArg: 2.315 ± 0.142
4.916ValSer: 4.916 ± 0.306
3.561ValThr: 3.561 ± 0.216
5.035ValVal: 5.035 ± 0.269
0.633ValTrp: 0.633 ± 0.084
2.107ValTyr: 2.107 ± 0.158
0.0ValXaa: 0.0 ± 0.0
Trp
0.712TrpAla: 0.712 ± 0.087
0.089TrpCys: 0.089 ± 0.032
0.514TrpAsp: 0.514 ± 0.069
0.524TrpGlu: 0.524 ± 0.079
0.544TrpPhe: 0.544 ± 0.085
0.702TrpGly: 0.702 ± 0.085
0.168TrpHis: 0.168 ± 0.04
0.722TrpIle: 0.722 ± 0.102
0.772TrpLys: 0.772 ± 0.15
0.9TrpLeu: 0.9 ± 0.107
0.396TrpMet: 0.396 ± 0.056
0.999TrpAsn: 0.999 ± 0.276
0.218TrpPro: 0.218 ± 0.048
0.317TrpGln: 0.317 ± 0.056
0.406TrpArg: 0.406 ± 0.065
0.415TrpSer: 0.415 ± 0.063
0.465TrpThr: 0.465 ± 0.072
0.682TrpVal: 0.682 ± 0.082
0.109TrpTrp: 0.109 ± 0.038
0.425TrpTyr: 0.425 ± 0.076
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.75TyrAla: 2.75 ± 0.173
0.366TyrCys: 0.366 ± 0.061
2.532TyrAsp: 2.532 ± 0.17
2.7TyrGlu: 2.7 ± 0.166
2.196TyrPhe: 2.196 ± 0.163
2.404TyrGly: 2.404 ± 0.169
0.89TyrHis: 0.89 ± 0.099
2.651TyrIle: 2.651 ± 0.151
3.185TyrLys: 3.185 ± 0.18
4.115TyrLeu: 4.115 ± 0.225
0.999TyrMet: 0.999 ± 0.108
2.018TyrAsn: 2.018 ± 0.147
1.177TyrPro: 1.177 ± 0.094
1.197TyrGln: 1.197 ± 0.092
1.513TyrArg: 1.513 ± 0.13
2.255TyrSer: 2.255 ± 0.163
2.453TyrThr: 2.453 ± 0.163
2.038TyrVal: 2.038 ± 0.132
0.435TyrTrp: 0.435 ± 0.066
1.79TyrTyr: 1.79 ± 0.153
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 461 proteins (101100 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski