Amino acid dipepetide frequency for Cronobacter phage vB_CsaM_GAP161

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.827AlaAla: 4.827 ± 0.327
0.652AlaCys: 0.652 ± 0.097
3.893AlaAsp: 3.893 ± 0.261
4.686AlaGlu: 4.686 ± 0.337
2.589AlaPhe: 2.589 ± 0.247
4.633AlaGly: 4.633 ± 0.341
1.356AlaHis: 1.356 ± 0.16
5.056AlaIle: 5.056 ± 0.335
5.214AlaLys: 5.214 ± 0.353
6.06AlaLeu: 6.06 ± 0.339
2.484AlaMet: 2.484 ± 0.203
3.823AlaAsn: 3.823 ± 0.269
2.202AlaPro: 2.202 ± 0.248
2.431AlaGln: 2.431 ± 0.207
3.1AlaArg: 3.1 ± 0.261
3.893AlaSer: 3.893 ± 0.235
4.457AlaThr: 4.457 ± 0.532
5.161AlaVal: 5.161 ± 0.29
1.075AlaTrp: 1.075 ± 0.145
3.153AlaTyr: 3.153 ± 0.201
0.0AlaXaa: 0.0 ± 0.0
Cys
0.934CysAla: 0.934 ± 0.133
0.106CysCys: 0.106 ± 0.047
0.652CysAsp: 0.652 ± 0.112
0.793CysGlu: 0.793 ± 0.134
0.669CysPhe: 0.669 ± 0.113
0.934CysGly: 0.934 ± 0.138
0.247CysHis: 0.247 ± 0.071
0.617CysIle: 0.617 ± 0.112
1.039CysLys: 1.039 ± 0.143
0.705CysLeu: 0.705 ± 0.101
0.317CysMet: 0.317 ± 0.075
0.74CysAsn: 0.74 ± 0.125
0.617CysPro: 0.617 ± 0.119
0.282CysGln: 0.282 ± 0.078
0.528CysArg: 0.528 ± 0.077
0.669CysSer: 0.669 ± 0.096
0.546CysThr: 0.546 ± 0.102
0.74CysVal: 0.74 ± 0.109
0.106CysTrp: 0.106 ± 0.041
0.388CysTyr: 0.388 ± 0.095
0.0CysXaa: 0.0 ± 0.0
Asp
4.474AspAla: 4.474 ± 0.275
0.581AspCys: 0.581 ± 0.09
4.228AspAsp: 4.228 ± 0.316
4.51AspGlu: 4.51 ± 0.269
2.889AspPhe: 2.889 ± 0.25
4.703AspGly: 4.703 ± 0.314
1.145AspHis: 1.145 ± 0.158
4.633AspIle: 4.633 ± 0.295
4.668AspLys: 4.668 ± 0.34
5.461AspLeu: 5.461 ± 0.324
1.744AspMet: 1.744 ± 0.174
3.153AspAsn: 3.153 ± 0.22
2.748AspPro: 2.748 ± 0.234
1.638AspGln: 1.638 ± 0.134
3.224AspArg: 3.224 ± 0.254
3.893AspSer: 3.893 ± 0.232
3.4AspThr: 3.4 ± 0.213
4.545AspVal: 4.545 ± 0.337
1.075AspTrp: 1.075 ± 0.143
3.276AspTyr: 3.276 ± 0.247
0.0AspXaa: 0.0 ± 0.0
Glu
5.373GluAla: 5.373 ± 0.349
0.986GluCys: 0.986 ± 0.128
3.312GluAsp: 3.312 ± 0.197
5.108GluGlu: 5.108 ± 0.366
2.907GluPhe: 2.907 ± 0.204
3.276GluGly: 3.276 ± 0.232
1.356GluHis: 1.356 ± 0.173
5.531GluIle: 5.531 ± 0.323
5.514GluLys: 5.514 ± 0.347
6.253GluLeu: 6.253 ± 0.357
1.973GluMet: 1.973 ± 0.181
3.664GluAsn: 3.664 ± 0.241
1.85GluPro: 1.85 ± 0.187
2.854GluGln: 2.854 ± 0.251
3.276GluArg: 3.276 ± 0.207
3.435GluSer: 3.435 ± 0.241
4.069GluThr: 4.069 ± 0.288
4.069GluVal: 4.069 ± 0.267
1.004GluTrp: 1.004 ± 0.13
2.836GluTyr: 2.836 ± 0.242
0.0GluXaa: 0.0 ± 0.0
Phe
2.554PheAla: 2.554 ± 0.21
0.599PheCys: 0.599 ± 0.11
3.77PheAsp: 3.77 ± 0.278
2.589PheGlu: 2.589 ± 0.252
1.48PhePhe: 1.48 ± 0.159
2.854PheGly: 2.854 ± 0.246
0.599PheHis: 0.599 ± 0.093
2.449PheIle: 2.449 ± 0.203
2.907PheLys: 2.907 ± 0.254
2.537PheLeu: 2.537 ± 0.195
1.286PheMet: 1.286 ± 0.168
2.484PheAsn: 2.484 ± 0.225
1.444PhePro: 1.444 ± 0.169
1.409PheGln: 1.409 ± 0.156
1.779PheArg: 1.779 ± 0.176
2.678PheSer: 2.678 ± 0.221
2.484PheThr: 2.484 ± 0.211
3.118PheVal: 3.118 ± 0.216
0.599PheTrp: 0.599 ± 0.097
1.638PheTyr: 1.638 ± 0.185
0.0PheXaa: 0.0 ± 0.0
Gly
4.016GlyAla: 4.016 ± 0.324
0.916GlyCys: 0.916 ± 0.149
4.51GlyAsp: 4.51 ± 0.341
3.717GlyGlu: 3.717 ± 0.264
3.083GlyPhe: 3.083 ± 0.25
4.087GlyGly: 4.087 ± 0.389
0.916GlyHis: 0.916 ± 0.141
4.016GlyIle: 4.016 ± 0.282
5.285GlyLys: 5.285 ± 0.315
4.668GlyLeu: 4.668 ± 0.343
1.726GlyMet: 1.726 ± 0.182
3.4GlyAsn: 3.4 ± 0.334
0.881GlyPro: 0.881 ± 0.125
1.656GlyGln: 1.656 ± 0.199
2.466GlyArg: 2.466 ± 0.21
4.474GlySer: 4.474 ± 0.327
3.682GlyThr: 3.682 ± 0.405
5.249GlyVal: 5.249 ± 0.332
1.039GlyTrp: 1.039 ± 0.163
3.417GlyTyr: 3.417 ± 0.232
0.0GlyXaa: 0.0 ± 0.0
His
1.268HisAla: 1.268 ± 0.148
0.229HisCys: 0.229 ± 0.087
1.215HisAsp: 1.215 ± 0.159
1.251HisGlu: 1.251 ± 0.152
0.934HisPhe: 0.934 ± 0.125
0.916HisGly: 0.916 ± 0.143
0.44HisHis: 0.44 ± 0.086
1.497HisIle: 1.497 ± 0.184
1.198HisLys: 1.198 ± 0.17
1.215HisLeu: 1.215 ± 0.137
0.458HisMet: 0.458 ± 0.09
1.022HisAsn: 1.022 ± 0.136
0.898HisPro: 0.898 ± 0.111
0.722HisGln: 0.722 ± 0.104
1.039HisArg: 1.039 ± 0.155
0.793HisSer: 0.793 ± 0.128
1.075HisThr: 1.075 ± 0.129
1.515HisVal: 1.515 ± 0.172
0.211HisTrp: 0.211 ± 0.062
0.846HisTyr: 0.846 ± 0.121
0.0HisXaa: 0.0 ± 0.0
Ile
5.091IleAla: 5.091 ± 0.351
0.793IleCys: 0.793 ± 0.142
5.038IleAsp: 5.038 ± 0.307
4.897IleGlu: 4.897 ± 0.295
2.184IlePhe: 2.184 ± 0.21
3.611IleGly: 3.611 ± 0.286
1.18IleHis: 1.18 ± 0.144
3.453IleIle: 3.453 ± 0.222
4.668IleLys: 4.668 ± 0.341
4.474IleLeu: 4.474 ± 0.272
1.603IleMet: 1.603 ± 0.155
3.541IleAsn: 3.541 ± 0.244
2.589IlePro: 2.589 ± 0.232
2.114IleGln: 2.114 ± 0.196
3.963IleArg: 3.963 ± 0.243
3.294IleSer: 3.294 ± 0.25
4.51IleThr: 4.51 ± 0.3
4.756IleVal: 4.756 ± 0.278
0.757IleTrp: 0.757 ± 0.112
2.36IleTyr: 2.36 ± 0.175
0.0IleXaa: 0.0 ± 0.0
Lys
6.112LysAla: 6.112 ± 0.398
0.81LysCys: 0.81 ± 0.118
4.562LysAsp: 4.562 ± 0.323
5.285LysGlu: 5.285 ± 0.315
2.977LysPhe: 2.977 ± 0.269
4.298LysGly: 4.298 ± 0.334
1.691LysHis: 1.691 ± 0.176
4.756LysIle: 4.756 ± 0.278
4.739LysLys: 4.739 ± 0.363
6.359LysLeu: 6.359 ± 0.373
2.202LysMet: 2.202 ± 0.18
4.21LysAsn: 4.21 ± 0.238
2.713LysPro: 2.713 ± 0.256
2.942LysGln: 2.942 ± 0.242
3.752LysArg: 3.752 ± 0.291
3.453LysSer: 3.453 ± 0.277
4.386LysThr: 4.386 ± 0.31
4.827LysVal: 4.827 ± 0.293
1.004LysTrp: 1.004 ± 0.135
3.347LysTyr: 3.347 ± 0.291
0.0LysXaa: 0.0 ± 0.0
Leu
5.337LeuAla: 5.337 ± 0.336
0.81LeuCys: 0.81 ± 0.121
5.425LeuAsp: 5.425 ± 0.346
4.686LeuGlu: 4.686 ± 0.309
3.4LeuPhe: 3.4 ± 0.229
4.351LeuGly: 4.351 ± 0.236
1.533LeuHis: 1.533 ± 0.165
4.545LeuIle: 4.545 ± 0.275
5.743LeuLys: 5.743 ± 0.323
5.003LeuLeu: 5.003 ± 0.341
2.308LeuMet: 2.308 ± 0.197
4.421LeuAsn: 4.421 ± 0.288
3.611LeuPro: 3.611 ± 0.264
2.466LeuGln: 2.466 ± 0.25
3.893LeuArg: 3.893 ± 0.28
5.003LeuSer: 5.003 ± 0.282
4.474LeuThr: 4.474 ± 0.301
5.144LeuVal: 5.144 ± 0.254
0.828LeuTrp: 0.828 ± 0.128
2.977LeuTyr: 2.977 ± 0.241
0.0LeuXaa: 0.0 ± 0.0
Met
1.85MetAla: 1.85 ± 0.182
0.264MetCys: 0.264 ± 0.061
1.621MetAsp: 1.621 ± 0.182
1.797MetGlu: 1.797 ± 0.188
1.392MetPhe: 1.392 ± 0.147
1.638MetGly: 1.638 ± 0.151
0.476MetHis: 0.476 ± 0.089
1.726MetIle: 1.726 ± 0.166
3.312MetLys: 3.312 ± 0.256
2.026MetLeu: 2.026 ± 0.213
0.793MetMet: 0.793 ± 0.112
1.797MetAsn: 1.797 ± 0.203
0.81MetPro: 0.81 ± 0.095
1.11MetGln: 1.11 ± 0.14
1.444MetArg: 1.444 ± 0.161
1.832MetSer: 1.832 ± 0.173
1.621MetThr: 1.621 ± 0.178
1.568MetVal: 1.568 ± 0.157
0.282MetTrp: 0.282 ± 0.069
1.198MetTyr: 1.198 ± 0.146
0.0MetXaa: 0.0 ± 0.0
Asn
4.333AsnAla: 4.333 ± 0.31
0.528AsnCys: 0.528 ± 0.09
3.259AsnAsp: 3.259 ± 0.224
3.77AsnGlu: 3.77 ± 0.259
2.026AsnPhe: 2.026 ± 0.205
4.298AsnGly: 4.298 ± 0.31
1.18AsnHis: 1.18 ± 0.149
3.224AsnIle: 3.224 ± 0.216
3.435AsnLys: 3.435 ± 0.247
4.421AsnLeu: 4.421 ± 0.292
1.356AsnMet: 1.356 ± 0.176
2.73AsnAsn: 2.73 ± 0.253
2.801AsnPro: 2.801 ± 0.226
1.744AsnGln: 1.744 ± 0.207
2.572AsnArg: 2.572 ± 0.209
3.206AsnSer: 3.206 ± 0.222
3.417AsnThr: 3.417 ± 0.33
3.699AsnVal: 3.699 ± 0.287
0.458AsnTrp: 0.458 ± 0.089
1.726AsnTyr: 1.726 ± 0.197
0.0AsnXaa: 0.0 ± 0.0
Pro
2.73ProAla: 2.73 ± 0.253
0.405ProCys: 0.405 ± 0.096
2.995ProAsp: 2.995 ± 0.247
3.312ProGlu: 3.312 ± 0.296
1.568ProPhe: 1.568 ± 0.17
1.85ProGly: 1.85 ± 0.183
0.74ProHis: 0.74 ± 0.136
1.814ProIle: 1.814 ± 0.173
2.378ProLys: 2.378 ± 0.235
2.678ProLeu: 2.678 ± 0.215
0.757ProMet: 0.757 ± 0.12
1.726ProAsn: 1.726 ± 0.145
1.092ProPro: 1.092 ± 0.14
1.092ProGln: 1.092 ± 0.117
1.814ProArg: 1.814 ± 0.192
2.079ProSer: 2.079 ± 0.219
2.325ProThr: 2.325 ± 0.2
2.924ProVal: 2.924 ± 0.225
0.423ProTrp: 0.423 ± 0.076
1.726ProTyr: 1.726 ± 0.169
0.0ProXaa: 0.0 ± 0.0
Gln
2.801GlnAla: 2.801 ± 0.237
0.423GlnCys: 0.423 ± 0.087
1.585GlnAsp: 1.585 ± 0.16
1.955GlnGlu: 1.955 ± 0.187
1.304GlnPhe: 1.304 ± 0.161
1.726GlnGly: 1.726 ± 0.188
0.669GlnHis: 0.669 ± 0.096
2.713GlnIle: 2.713 ± 0.239
2.449GlnLys: 2.449 ± 0.216
2.801GlnLeu: 2.801 ± 0.219
1.18GlnMet: 1.18 ± 0.153
1.55GlnAsn: 1.55 ± 0.158
1.18GlnPro: 1.18 ± 0.142
1.603GlnGln: 1.603 ± 0.22
1.902GlnArg: 1.902 ± 0.206
2.026GlnSer: 2.026 ± 0.209
2.096GlnThr: 2.096 ± 0.17
2.079GlnVal: 2.079 ± 0.16
0.458GlnTrp: 0.458 ± 0.107
1.533GlnTyr: 1.533 ± 0.176
0.0GlnXaa: 0.0 ± 0.0
Arg
2.695ArgAla: 2.695 ± 0.211
0.511ArgCys: 0.511 ± 0.097
3.153ArgAsp: 3.153 ± 0.225
3.611ArgGlu: 3.611 ± 0.307
1.885ArgPhe: 1.885 ± 0.209
3.047ArgGly: 3.047 ± 0.228
0.775ArgHis: 0.775 ± 0.123
3.417ArgIle: 3.417 ± 0.217
3.911ArgLys: 3.911 ± 0.287
3.611ArgLeu: 3.611 ± 0.254
1.515ArgMet: 1.515 ± 0.183
2.695ArgAsn: 2.695 ± 0.187
1.585ArgPro: 1.585 ± 0.14
1.656ArgGln: 1.656 ± 0.163
2.167ArgArg: 2.167 ± 0.234
2.907ArgSer: 2.907 ± 0.22
2.431ArgThr: 2.431 ± 0.189
3.576ArgVal: 3.576 ± 0.235
0.951ArgTrp: 0.951 ± 0.115
1.902ArgTyr: 1.902 ± 0.206
0.0ArgXaa: 0.0 ± 0.0
Ser
3.805SerAla: 3.805 ± 0.263
0.669SerCys: 0.669 ± 0.108
3.699SerAsp: 3.699 ± 0.269
3.664SerGlu: 3.664 ± 0.27
2.801SerPhe: 2.801 ± 0.245
4.985SerGly: 4.985 ± 0.312
0.986SerHis: 0.986 ± 0.134
3.823SerIle: 3.823 ± 0.263
4.351SerLys: 4.351 ± 0.245
4.157SerLeu: 4.157 ± 0.244
1.568SerMet: 1.568 ± 0.176
3.136SerAsn: 3.136 ± 0.25
2.149SerPro: 2.149 ± 0.202
1.762SerGln: 1.762 ± 0.184
2.713SerArg: 2.713 ± 0.21
3.241SerSer: 3.241 ± 0.299
2.942SerThr: 2.942 ± 0.264
4.245SerVal: 4.245 ± 0.27
0.687SerTrp: 0.687 ± 0.121
2.114SerTyr: 2.114 ± 0.188
0.0SerXaa: 0.0 ± 0.0
Thr
4.228ThrAla: 4.228 ± 0.377
0.528ThrCys: 0.528 ± 0.09
3.77ThrAsp: 3.77 ± 0.317
3.682ThrGlu: 3.682 ± 0.217
2.255ThrPhe: 2.255 ± 0.209
4.598ThrGly: 4.598 ± 0.308
1.057ThrHis: 1.057 ± 0.135
4.052ThrIle: 4.052 ± 0.285
4.122ThrLys: 4.122 ± 0.263
4.897ThrLeu: 4.897 ± 0.332
1.409ThrMet: 1.409 ± 0.139
2.783ThrAsn: 2.783 ± 0.248
2.907ThrPro: 2.907 ± 0.273
2.079ThrGln: 2.079 ± 0.228
2.378ThrArg: 2.378 ± 0.187
3.153ThrSer: 3.153 ± 0.236
3.312ThrThr: 3.312 ± 0.292
4.686ThrVal: 4.686 ± 0.334
0.634ThrTrp: 0.634 ± 0.103
2.396ThrTyr: 2.396 ± 0.204
0.0ThrXaa: 0.0 ± 0.0
Val
4.633ValAla: 4.633 ± 0.328
1.022ValCys: 1.022 ± 0.139
5.108ValAsp: 5.108 ± 0.294
5.602ValGlu: 5.602 ± 0.338
2.907ValPhe: 2.907 ± 0.213
3.928ValGly: 3.928 ± 0.32
1.092ValHis: 1.092 ± 0.141
4.104ValIle: 4.104 ± 0.252
5.408ValLys: 5.408 ± 0.309
4.65ValLeu: 4.65 ± 0.295
2.184ValMet: 2.184 ± 0.196
4.104ValAsn: 4.104 ± 0.245
2.554ValPro: 2.554 ± 0.215
2.413ValGln: 2.413 ± 0.238
3.294ValArg: 3.294 ± 0.23
4.175ValSer: 4.175 ± 0.314
4.333ValThr: 4.333 ± 0.343
5.161ValVal: 5.161 ± 0.379
1.004ValTrp: 1.004 ± 0.12
3.435ValTyr: 3.435 ± 0.231
0.0ValXaa: 0.0 ± 0.0
Trp
0.669TrpAla: 0.669 ± 0.113
0.159TrpCys: 0.159 ± 0.056
1.092TrpAsp: 1.092 ± 0.135
0.74TrpGlu: 0.74 ± 0.118
0.617TrpPhe: 0.617 ± 0.096
0.705TrpGly: 0.705 ± 0.114
0.37TrpHis: 0.37 ± 0.071
0.74TrpIle: 0.74 ± 0.1
1.286TrpLys: 1.286 ± 0.165
1.145TrpLeu: 1.145 ± 0.171
0.493TrpMet: 0.493 ± 0.089
0.757TrpAsn: 0.757 ± 0.121
0.247TrpPro: 0.247 ± 0.062
0.546TrpGln: 0.546 ± 0.099
0.617TrpArg: 0.617 ± 0.107
0.705TrpSer: 0.705 ± 0.107
0.652TrpThr: 0.652 ± 0.107
0.934TrpVal: 0.934 ± 0.14
0.159TrpTrp: 0.159 ± 0.048
0.652TrpTyr: 0.652 ± 0.104
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.012TyrAla: 3.012 ± 0.236
0.634TyrCys: 0.634 ± 0.094
3.1TyrAsp: 3.1 ± 0.215
2.995TyrGlu: 2.995 ± 0.236
1.392TyrPhe: 1.392 ± 0.169
2.766TyrGly: 2.766 ± 0.239
0.934TyrHis: 0.934 ± 0.142
2.713TyrIle: 2.713 ± 0.238
2.818TyrLys: 2.818 ± 0.253
2.836TyrLeu: 2.836 ± 0.23
1.18TyrMet: 1.18 ± 0.149
2.449TyrAsn: 2.449 ± 0.212
1.55TyrPro: 1.55 ± 0.172
1.515TyrGln: 1.515 ± 0.164
2.079TyrArg: 2.079 ± 0.223
2.589TyrSer: 2.589 ± 0.22
2.713TyrThr: 2.713 ± 0.248
3.118TyrVal: 3.118 ± 0.223
0.528TyrTrp: 0.528 ± 0.092
1.603TyrTyr: 1.603 ± 0.176
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 275 proteins (56770 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski