Amino acid dipeptide frequency for Cronobacter phage vB_CsaM_GAP32

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
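As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs within each protein and normalises by the total number of pairs. It assumes one-letter sequence strings, overlapping windows, and pairs that never span two proteins; the page does not state the exact counting procedure, so these are assumptions, and the function name `dipeptide_frequencies` and the toy sequences are purely illustrative.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted with a sliding window of width 2 inside
    each protein, and frequencies are normalised by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # Each sequence is scanned on its own, so no pair spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKAAL", "AAK"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

With real data the sequences would come from the proteome FASTA file, and three-letter names such as AlaAla would be mapped from the one-letter pairs.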

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and under-represented ones are marked in blue.
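The expected value can be worked out directly: 1 in 400 is 0.25%, or 2.5‰. The sketch below computes that baseline and labels a few values copied from the table as over- or under-represented. It assumes the colouring threshold is exactly this uniform expectation, which the page implies but does not spell out; the `classify` helper is hypothetical.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation per dipeptide, in per mille: 1000 / 400.
expected_permille = 1000.0 / N_STANDARD ** 2


def classify(value_permille, expected=expected_permille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Three values taken from the table (per mille).
observed = {"GluLeu": 6.939, "TrpTrp": 0.12, "ThrPro": 2.474}
labels = {name: classify(value) for name, value in observed.items()}
print(expected_permille, labels)
```

If Xaa were included as a 21st residue, the same calculation would use 441 combinations instead of 400.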

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
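For readers who prefer fractions or percentages, the unit conversion is a simple scaling. The sketch below uses the AlaAla value from the table; the helper names are illustrative only.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 4.16  # AlaAla frequency from the table, in per mille
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
print(f"{ala_ala} per mille = {fraction:.5f} as a fraction = {percent:.3f} %")
```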

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.16 ± 0.241
AlaCys: 0.491 ± 0.068
AlaAsp: 3.252 ± 0.194
AlaGlu: 3.901 ± 0.21
AlaPhe: 2.39 ± 0.171
AlaGly: 3.4 ± 0.22
AlaHis: 0.889 ± 0.091
AlaIle: 3.78 ± 0.193
AlaLys: 4.021 ± 0.205
AlaLeu: 4.66 ± 0.244
AlaMet: 1.612 ± 0.134
AlaAsn: 3.169 ± 0.181
AlaPro: 1.751 ± 0.143
AlaGln: 2.048 ± 0.16
AlaArg: 2.288 ± 0.141
AlaSer: 3.724 ± 0.219
AlaThr: 3.298 ± 0.325
AlaVal: 3.891 ± 0.201
AlaTrp: 0.639 ± 0.075
AlaTyr: 2.344 ± 0.168
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.528 ± 0.072
CysCys: 0.222 ± 0.051
CysAsp: 0.602 ± 0.064
CysGlu: 0.945 ± 0.108
CysPhe: 0.574 ± 0.069
CysGly: 0.852 ± 0.095
CysHis: 0.278 ± 0.049
CysIle: 0.815 ± 0.086
CysLys: 0.852 ± 0.114
CysLeu: 0.788 ± 0.092
CysMet: 0.315 ± 0.054
CysAsn: 0.667 ± 0.083
CysPro: 0.528 ± 0.078
CysGln: 0.417 ± 0.068
CysArg: 0.389 ± 0.066
CysSer: 1.001 ± 0.102
CysThr: 0.982 ± 0.099
CysVal: 0.908 ± 0.104
CysTrp: 0.167 ± 0.039
CysTyr: 0.473 ± 0.063
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.975 ± 0.21
AspCys: 0.834 ± 0.1
AspAsp: 4.549 ± 0.244
AspGlu: 5.763 ± 0.282
AspPhe: 3.252 ± 0.186
AspGly: 4.438 ± 0.187
AspHis: 0.973 ± 0.086
AspIle: 4.975 ± 0.25
AspLys: 3.919 ± 0.24
AspLeu: 5.049 ± 0.271
AspMet: 1.77 ± 0.14
AspAsn: 3.613 ± 0.226
AspPro: 2.363 ± 0.146
AspGln: 1.594 ± 0.193
AspArg: 2.335 ± 0.148
AspSer: 4.123 ± 0.194
AspThr: 3.724 ± 0.18
AspVal: 4.475 ± 0.201
AspTrp: 0.945 ± 0.098
AspTyr: 3.243 ± 0.175
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.678 ± 0.215
GluCys: 0.871 ± 0.093
GluAsp: 4.447 ± 0.281
GluGlu: 6.05 ± 0.325
GluPhe: 3.715 ± 0.176
GluGly: 2.983 ± 0.192
GluHis: 1.631 ± 0.128
GluIle: 5.142 ± 0.227
GluLys: 4.595 ± 0.254
GluLeu: 6.939 ± 0.275
GluMet: 2.511 ± 0.197
GluAsn: 4.243 ± 0.228
GluPro: 2.057 ± 0.144
GluGln: 3.141 ± 0.191
GluArg: 3.057 ± 0.187
GluSer: 4.355 ± 0.232
GluThr: 3.799 ± 0.198
GluVal: 4.91 ± 0.225
GluTrp: 0.852 ± 0.106
GluTyr: 3.947 ± 0.172
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.279 ± 0.164
PheCys: 0.565 ± 0.088
PheAsp: 3.419 ± 0.185
PheGlu: 3.789 ± 0.193
PhePhe: 1.797 ± 0.151
PheGly: 2.974 ± 0.165
PheHis: 0.899 ± 0.093
PheIle: 3.196 ± 0.166
PheLys: 3.308 ± 0.173
PheLeu: 3.178 ± 0.195
PheMet: 1.297 ± 0.106
PheAsn: 2.983 ± 0.171
PhePro: 1.204 ± 0.106
PheGln: 1.427 ± 0.129
PheArg: 1.334 ± 0.109
PheSer: 3.076 ± 0.179
PheThr: 2.576 ± 0.17
PheVal: 3.271 ± 0.17
PheTrp: 0.713 ± 0.082
PheTyr: 1.862 ± 0.104
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.9 ± 0.266
GlyCys: 0.713 ± 0.088
GlyAsp: 3.345 ± 0.187
GlyGlu: 3.789 ± 0.204
GlyPhe: 2.437 ± 0.15
GlyGly: 2.826 ± 0.195
GlyHis: 0.973 ± 0.101
GlyIle: 4.178 ± 0.173
GlyLys: 4.253 ± 0.199
GlyLeu: 4.216 ± 0.209
GlyMet: 1.38 ± 0.133
GlyAsn: 3.724 ± 0.319
GlyPro: 0.639 ± 0.071
GlyGln: 1.872 ± 0.16
GlyArg: 2.242 ± 0.152
GlySer: 4.512 ± 0.237
GlyThr: 5.253 ± 0.352
GlyVal: 3.808 ± 0.209
GlyTrp: 0.713 ± 0.085
GlyTyr: 3.094 ± 0.185
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.982 ± 0.097
HisCys: 0.306 ± 0.05
HisAsp: 1.334 ± 0.104
HisGlu: 1.538 ± 0.134
HisPhe: 1.13 ± 0.106
HisGly: 1.408 ± 0.126
HisHis: 0.51 ± 0.072
HisIle: 1.427 ± 0.124
HisLys: 1.39 ± 0.129
HisLeu: 1.334 ± 0.115
HisMet: 0.482 ± 0.064
HisAsn: 1.251 ± 0.109
HisPro: 0.843 ± 0.077
HisGln: 0.537 ± 0.076
HisArg: 0.63 ± 0.073
HisSer: 1.288 ± 0.13
HisThr: 1.093 ± 0.1
HisVal: 1.01 ± 0.094
HisTrp: 0.232 ± 0.045
HisTyr: 0.825 ± 0.092
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.216 ± 0.205
IleCys: 0.908 ± 0.111
IleAsp: 4.938 ± 0.226
IleGlu: 5.522 ± 0.254
IlePhe: 2.539 ± 0.159
IleGly: 3.419 ± 0.175
IleHis: 1.575 ± 0.107
IleIle: 4.688 ± 0.227
IleLys: 5.142 ± 0.25
IleLeu: 5.086 ± 0.231
IleMet: 1.834 ± 0.146
IleAsn: 4.493 ± 0.206
IlePro: 2.761 ± 0.175
IleGln: 2.854 ± 0.164
IleArg: 3.039 ± 0.178
IleSer: 4.836 ± 0.219
IleThr: 4.225 ± 0.254
IleVal: 4.531 ± 0.194
IleTrp: 0.76 ± 0.088
IleTyr: 2.77 ± 0.176
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.567 ± 0.233
LysCys: 0.834 ± 0.107
LysAsp: 4.123 ± 0.216
LysGlu: 5.012 ± 0.237
LysPhe: 3.122 ± 0.179
LysGly: 3.011 ± 0.161
LysHis: 1.723 ± 0.137
LysIle: 4.688 ± 0.201
LysLys: 5.003 ± 0.341
LysLeu: 5.568 ± 0.213
LysMet: 2.298 ± 0.152
LysAsn: 4.262 ± 0.197
LysPro: 2.187 ± 0.134
LysGln: 2.659 ± 0.161
LysArg: 3.02 ± 0.175
LysSer: 4.123 ± 0.192
LysThr: 3.734 ± 0.201
LysVal: 4.679 ± 0.253
LysTrp: 0.658 ± 0.073
LysTyr: 4.03 ± 0.205
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.558 ± 0.275
LeuCys: 0.945 ± 0.113
LeuAsp: 5.161 ± 0.252
LeuGlu: 5.272 ± 0.247
LeuPhe: 3.067 ± 0.17
LeuGly: 3.91 ± 0.229
LeuHis: 1.538 ± 0.109
LeuIle: 4.531 ± 0.207
LeuLys: 5.679 ± 0.257
LeuLeu: 5.642 ± 0.241
LeuMet: 2.177 ± 0.142
LeuAsn: 5.364 ± 0.237
LeuPro: 2.854 ± 0.179
LeuGln: 3.15 ± 0.18
LeuArg: 3.595 ± 0.207
LeuSer: 6.152 ± 0.226
LeuThr: 4.808 ± 0.356
LeuVal: 4.901 ± 0.251
LeuTrp: 0.825 ± 0.074
LeuTyr: 3.391 ± 0.197
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.964 ± 0.125
MetCys: 0.259 ± 0.057
MetAsp: 1.89 ± 0.133
MetGlu: 1.594 ± 0.124
MetPhe: 1.547 ± 0.126
MetGly: 1.279 ± 0.115
MetHis: 0.435 ± 0.058
MetIle: 1.936 ± 0.133
MetLys: 2.325 ± 0.138
MetLeu: 1.872 ± 0.135
MetMet: 0.852 ± 0.087
MetAsn: 2.261 ± 0.147
MetPro: 0.806 ± 0.085
MetGln: 1.038 ± 0.097
MetArg: 1.038 ± 0.099
MetSer: 2.057 ± 0.132
MetThr: 1.566 ± 0.13
MetVal: 1.362 ± 0.133
MetTrp: 0.269 ± 0.052
MetTyr: 1.39 ± 0.126
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.345 ± 0.21
AsnCys: 0.825 ± 0.092
AsnAsp: 3.975 ± 0.179
AsnGlu: 4.243 ± 0.222
AsnPhe: 2.641 ± 0.145
AsnGly: 4.781 ± 0.245
AsnHis: 1.084 ± 0.088
AsnIle: 4.688 ± 0.247
AsnLys: 4.132 ± 0.207
AsnLeu: 4.41 ± 0.188
AsnMet: 1.89 ± 0.133
AsnAsn: 4.002 ± 0.214
AsnPro: 2.566 ± 0.172
AsnGln: 1.844 ± 0.136
AsnArg: 2.427 ± 0.162
AsnSer: 4.716 ± 0.249
AsnThr: 4.021 ± 0.238
AsnVal: 3.928 ± 0.258
AsnTrp: 0.574 ± 0.063
AsnTyr: 2.659 ± 0.165
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.807 ± 0.146
ProCys: 0.334 ± 0.055
ProAsp: 2.641 ± 0.183
ProGlu: 3.057 ± 0.182
ProPhe: 1.603 ± 0.109
ProGly: 1.881 ± 0.124
ProHis: 0.556 ± 0.074
ProIle: 2.075 ± 0.132
ProLys: 1.964 ± 0.129
ProLeu: 2.029 ± 0.132
ProMet: 0.649 ± 0.076
ProAsn: 2.057 ± 0.156
ProPro: 0.695 ± 0.087
ProGln: 1.075 ± 0.111
ProArg: 1.028 ± 0.105
ProSer: 2.261 ± 0.132
ProThr: 1.89 ± 0.138
ProVal: 2.474 ± 0.159
ProTrp: 0.287 ± 0.062
ProTyr: 1.464 ± 0.125
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.094 ± 0.17
GlnCys: 0.426 ± 0.06
GlnAsp: 2.14 ± 0.16
GlnGlu: 2.826 ± 0.18
GlnPhe: 1.677 ± 0.139
GlnGly: 1.695 ± 0.165
GlnHis: 0.806 ± 0.081
GlnIle: 2.613 ± 0.16
GlnLys: 2.39 ± 0.169
GlnLeu: 2.946 ± 0.182
GlnMet: 1.269 ± 0.109
GlnAsn: 2.251 ± 0.186
GlnPro: 0.964 ± 0.098
GlnGln: 1.436 ± 0.13
GlnArg: 1.816 ± 0.121
GlnSer: 2.196 ± 0.163
GlnThr: 1.631 ± 0.139
GlnVal: 1.862 ± 0.123
GlnTrp: 0.491 ± 0.073
GlnTyr: 2.372 ± 0.156
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.973 ± 0.144
ArgCys: 0.473 ± 0.07
ArgAsp: 2.437 ± 0.139
ArgGlu: 2.603 ± 0.179
ArgPhe: 1.834 ± 0.136
ArgGly: 2.14 ± 0.136
ArgHis: 0.676 ± 0.075
ArgIle: 3.057 ± 0.143
ArgLys: 3.002 ± 0.204
ArgLeu: 3.187 ± 0.17
ArgMet: 1.084 ± 0.101
ArgAsn: 2.705 ± 0.146
ArgPro: 0.917 ± 0.11
ArgGln: 1.464 ± 0.103
ArgArg: 1.844 ± 0.132
ArgSer: 2.437 ± 0.146
ArgThr: 2.39 ± 0.174
ArgVal: 2.585 ± 0.141
ArgTrp: 0.528 ± 0.08
ArgTyr: 1.899 ± 0.153
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.03 ± 0.218
SerCys: 0.75 ± 0.097
SerAsp: 4.345 ± 0.222
SerGlu: 4.299 ± 0.216
SerPhe: 3.178 ± 0.159
SerGly: 4.679 ± 0.303
SerHis: 1.038 ± 0.076
SerIle: 5.096 ± 0.231
SerLys: 4.169 ± 0.208
SerLeu: 4.966 ± 0.221
SerMet: 1.436 ± 0.112
SerAsn: 3.826 ± 0.199
SerPro: 1.899 ± 0.133
SerGln: 2.122 ± 0.143
SerArg: 2.381 ± 0.149
SerSer: 4.91 ± 0.264
SerThr: 6.421 ± 0.296
SerVal: 5.04 ± 0.183
SerTrp: 0.806 ± 0.091
SerTyr: 2.937 ± 0.156
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.715 ± 0.278
ThrCys: 0.593 ± 0.093
ThrAsp: 4.206 ± 0.185
ThrGlu: 4.104 ± 0.222
ThrPhe: 3.132 ± 0.224
ThrGly: 4.382 ± 0.437
ThrHis: 1.158 ± 0.107
ThrIle: 4.568 ± 0.237
ThrLys: 3.734 ± 0.228
ThrLeu: 4.929 ± 0.386
ThrMet: 1.473 ± 0.11
ThrAsn: 3.687 ± 0.214
ThrPro: 2.474 ± 0.16
ThrGln: 1.983 ± 0.164
ThrArg: 1.918 ± 0.131
ThrSer: 4.169 ± 0.245
ThrThr: 3.863 ± 0.429
ThrVal: 5.346 ± 0.449
ThrTrp: 0.75 ± 0.086
ThrTyr: 2.983 ± 0.145
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.196 ± 0.197
ValCys: 0.797 ± 0.095
ValAsp: 4.373 ± 0.196
ValGlu: 4.466 ± 0.22
ValPhe: 2.668 ± 0.153
ValGly: 3.317 ± 0.21
ValHis: 1.538 ± 0.134
ValIle: 4.558 ± 0.178
ValLys: 4.271 ± 0.196
ValLeu: 7.088 ± 0.273
ValMet: 1.844 ± 0.136
ValAsn: 4.012 ± 0.244
ValPro: 2.659 ± 0.147
ValGln: 3.104 ± 0.148
ValArg: 2.539 ± 0.167
ValSer: 4.549 ± 0.244
ValThr: 4.206 ± 0.431
ValVal: 4.531 ± 0.205
ValTrp: 0.547 ± 0.076
ValTyr: 3.122 ± 0.199
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.445 ± 0.07
TrpCys: 0.176 ± 0.032
TrpAsp: 0.908 ± 0.101
TrpGlu: 0.732 ± 0.085
TrpPhe: 0.528 ± 0.065
TrpGly: 0.574 ± 0.077
TrpHis: 0.269 ± 0.048
TrpIle: 0.806 ± 0.092
TrpLys: 0.806 ± 0.083
TrpLeu: 0.778 ± 0.091
TrpMet: 0.334 ± 0.054
TrpAsn: 0.76 ± 0.091
TrpPro: 0.167 ± 0.037
TrpGln: 0.389 ± 0.065
TrpArg: 0.454 ± 0.069
TrpSer: 0.76 ± 0.085
TrpThr: 0.769 ± 0.089
TrpVal: 0.843 ± 0.094
TrpTrp: 0.12 ± 0.039
TrpTyr: 0.788 ± 0.089
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.298 ± 0.149
TyrCys: 0.982 ± 0.108
TyrAsp: 3.826 ± 0.193
TyrGlu: 3.419 ± 0.183
TyrPhe: 2.316 ± 0.18
TyrGly: 2.918 ± 0.182
TyrHis: 1.001 ± 0.098
TyrIle: 3.317 ± 0.201
TyrLys: 3.345 ± 0.197
TyrLeu: 2.854 ± 0.152
TyrMet: 1.223 ± 0.113
TyrAsn: 3.419 ± 0.216
TyrPro: 1.492 ± 0.11
TyrGln: 1.76 ± 0.119
TyrArg: 1.862 ± 0.132
TyrSer: 3.085 ± 0.194
TyrThr: 2.918 ± 0.175
TyrVal: 3.03 ± 0.195
TyrTrp: 0.528 ± 0.077
TyrTyr: 2.298 ± 0.172
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 545 proteins (107935 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
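A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is only an illustration under assumptions; the page does not say whether the error is a standard deviation, how pairs are counted, or which random generator is used, and the function names, seed and toy sequences below are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille, using overlapping pairs (assumed)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
toy = ["MKAAL", "AAKAA", "MLLK", "GAAS"]
err = bootstrap_error(toy, "AA")
print(f"AA: {pair_permille(toy, 'AA'):.1f} ± {err:.1f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.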

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski