Amino acid dipepetide frequency for Campylobacter phage vB_CjeM_Los1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.05AlaAla: 1.05 ± 0.183
0.429AlaCys: 0.429 ± 0.091
2.075AlaAsp: 2.075 ± 0.215
2.004AlaGlu: 2.004 ± 0.246
1.694AlaPhe: 1.694 ± 0.211
1.622AlaGly: 1.622 ± 0.279
0.358AlaHis: 0.358 ± 0.084
3.268AlaIle: 3.268 ± 0.279
2.934AlaLys: 2.934 ± 0.31
3.292AlaLeu: 3.292 ± 0.305
0.787AlaMet: 0.787 ± 0.127
2.648AlaAsn: 2.648 ± 0.242
0.787AlaPro: 0.787 ± 0.127
0.906AlaGln: 0.906 ± 0.151
1.24AlaArg: 1.24 ± 0.159
2.433AlaSer: 2.433 ± 0.266
1.98AlaThr: 1.98 ± 0.3
2.099AlaVal: 2.099 ± 0.247
0.191AlaTrp: 0.191 ± 0.066
1.527AlaTyr: 1.527 ± 0.18
0.0AlaXaa: 0.0 ± 0.0
Cys
0.477CysAla: 0.477 ± 0.099
0.215CysCys: 0.215 ± 0.089
1.193CysAsp: 1.193 ± 0.197
1.145CysGlu: 1.145 ± 0.168
0.406CysPhe: 0.406 ± 0.091
1.193CysGly: 1.193 ± 0.157
0.143CysHis: 0.143 ± 0.055
1.503CysIle: 1.503 ± 0.188
2.051CysLys: 2.051 ± 0.309
1.336CysLeu: 1.336 ± 0.208
0.262CysMet: 0.262 ± 0.07
1.908CysAsn: 1.908 ± 0.283
1.527CysPro: 1.527 ± 0.37
0.501CysGln: 0.501 ± 0.119
0.382CysArg: 0.382 ± 0.084
1.217CysSer: 1.217 ± 0.186
0.763CysThr: 0.763 ± 0.182
0.883CysVal: 0.883 ± 0.139
0.024CysTrp: 0.024 ± 0.023
1.05CysTyr: 1.05 ± 0.175
0.0CysXaa: 0.0 ± 0.0
Asp
2.028AspAla: 2.028 ± 0.214
0.978AspCys: 0.978 ± 0.157
5.176AspAsp: 5.176 ± 0.522
5.224AspGlu: 5.224 ± 0.374
3.817AspPhe: 3.817 ± 0.267
2.886AspGly: 2.886 ± 0.238
0.549AspHis: 0.549 ± 0.105
6.846AspIle: 6.846 ± 0.49
6.202AspLys: 6.202 ± 0.4
5.176AspLeu: 5.176 ± 0.395
1.312AspMet: 1.312 ± 0.17
5.653AspAsn: 5.653 ± 0.405
1.455AspPro: 1.455 ± 0.189
1.121AspGln: 1.121 ± 0.175
1.717AspArg: 1.717 ± 0.203
3.34AspSer: 3.34 ± 0.276
2.362AspThr: 2.362 ± 0.261
2.886AspVal: 2.886 ± 0.305
0.668AspTrp: 0.668 ± 0.121
3.721AspTyr: 3.721 ± 0.31
0.0AspXaa: 0.0 ± 0.0
Glu
2.195GluAla: 2.195 ± 0.234
1.646GluCys: 1.646 ± 0.335
3.196GluAsp: 3.196 ± 0.375
3.435GluGlu: 3.435 ± 0.336
3.626GluPhe: 3.626 ± 0.288
2.266GluGly: 2.266 ± 0.286
0.954GluHis: 0.954 ± 0.141
6.727GluIle: 6.727 ± 0.445
7.49GluLys: 7.49 ± 0.532
7.419GluLeu: 7.419 ± 0.466
1.217GluMet: 1.217 ± 0.158
6.154GluAsn: 6.154 ± 0.39
1.264GluPro: 1.264 ± 0.21
1.36GluGln: 1.36 ± 0.193
1.789GluArg: 1.789 ± 0.224
4.651GluSer: 4.651 ± 0.317
3.22GluThr: 3.22 ± 0.288
4.222GluVal: 4.222 ± 0.364
0.644GluTrp: 0.644 ± 0.113
4.485GluTyr: 4.485 ± 0.287
0.0GluXaa: 0.0 ± 0.0
Phe
1.622PheAla: 1.622 ± 0.22
0.906PheCys: 0.906 ± 0.15
3.817PheAsp: 3.817 ± 0.336
4.055PheGlu: 4.055 ± 0.335
1.741PhePhe: 1.741 ± 0.232
2.409PheGly: 2.409 ± 0.222
0.62PheHis: 0.62 ± 0.106
4.103PheIle: 4.103 ± 0.314
6.584PheLys: 6.584 ± 0.56
3.721PheLeu: 3.721 ± 0.32
1.336PheMet: 1.336 ± 0.201
4.27PheAsn: 4.27 ± 0.315
0.93PhePro: 0.93 ± 0.149
0.954PheGln: 0.954 ± 0.155
1.455PheArg: 1.455 ± 0.202
3.459PheSer: 3.459 ± 0.283
3.244PheThr: 3.244 ± 0.328
2.242PheVal: 2.242 ± 0.204
0.239PheTrp: 0.239 ± 0.067
2.481PheTyr: 2.481 ± 0.265
0.0PheXaa: 0.0 ± 0.0
Gly
2.195GlyAla: 2.195 ± 0.24
0.716GlyCys: 0.716 ± 0.125
3.483GlyAsp: 3.483 ± 0.3
2.051GlyGlu: 2.051 ± 0.259
3.077GlyPhe: 3.077 ± 0.314
2.314GlyGly: 2.314 ± 0.275
1.884GlyHis: 1.884 ± 0.472
4.222GlyIle: 4.222 ± 0.298
4.556GlyLys: 4.556 ± 0.328
3.483GlyLeu: 3.483 ± 0.298
1.145GlyMet: 1.145 ± 0.154
4.055GlyAsn: 4.055 ± 0.302
0.382GlyPro: 0.382 ± 0.09
1.384GlyGln: 1.384 ± 0.179
1.407GlyArg: 1.407 ± 0.213
4.485GlySer: 4.485 ± 0.409
2.672GlyThr: 2.672 ± 0.279
2.624GlyVal: 2.624 ± 0.235
0.191GlyTrp: 0.191 ± 0.07
3.435GlyTyr: 3.435 ± 0.367
0.0GlyXaa: 0.0 ± 0.0
His
0.406HisAla: 0.406 ± 0.112
0.167HisCys: 0.167 ± 0.057
0.787HisAsp: 0.787 ± 0.14
0.62HisGlu: 0.62 ± 0.148
0.811HisPhe: 0.811 ± 0.188
0.692HisGly: 0.692 ± 0.125
0.239HisHis: 0.239 ± 0.081
2.242HisIle: 2.242 ± 0.33
1.884HisLys: 1.884 ± 0.25
1.55HisLeu: 1.55 ± 0.215
0.215HisMet: 0.215 ± 0.073
1.169HisAsn: 1.169 ± 0.184
0.382HisPro: 0.382 ± 0.102
0.239HisGln: 0.239 ± 0.071
0.286HisArg: 0.286 ± 0.077
1.169HisSer: 1.169 ± 0.188
1.121HisThr: 1.121 ± 0.17
0.954HisVal: 0.954 ± 0.222
0.143HisTrp: 0.143 ± 0.059
0.906HisTyr: 0.906 ± 0.161
0.0HisXaa: 0.0 ± 0.0
Ile
2.6IleAla: 2.6 ± 0.248
1.813IleCys: 1.813 ± 0.22
6.321IleAsp: 6.321 ± 0.388
6.083IleGlu: 6.083 ± 0.288
4.341IlePhe: 4.341 ± 0.302
3.578IleGly: 3.578 ± 0.303
1.193IleHis: 1.193 ± 0.199
8.206IleIle: 8.206 ± 0.452
10.305IleLys: 10.305 ± 0.509
7.729IleLeu: 7.729 ± 0.479
1.861IleMet: 1.861 ± 0.183
8.874IleAsn: 8.874 ± 0.477
3.101IlePro: 3.101 ± 0.326
3.006IleGln: 3.006 ± 0.258
2.338IleArg: 2.338 ± 0.219
6.965IleSer: 6.965 ± 0.341
5.51IleThr: 5.51 ± 0.375
4.747IleVal: 4.747 ± 0.416
1.026IleTrp: 1.026 ± 0.133
3.435IleTyr: 3.435 ± 0.26
0.0IleXaa: 0.0 ± 0.0
Lys
3.029LysAla: 3.029 ± 0.299
2.433LysCys: 2.433 ± 0.513
7.013LysAsp: 7.013 ± 0.468
7.943LysGlu: 7.943 ± 0.515
4.604LysPhe: 4.604 ± 0.328
4.771LysGly: 4.771 ± 0.39
2.242LysHis: 2.242 ± 0.28
9.112LysIle: 9.112 ± 0.429
7.729LysLys: 7.729 ± 0.504
9.303LysLeu: 9.303 ± 0.474
2.051LysMet: 2.051 ± 0.212
9.565LysAsn: 9.565 ± 0.508
2.433LysPro: 2.433 ± 0.272
3.554LysGln: 3.554 ± 0.308
2.815LysArg: 2.815 ± 0.381
5.868LysSer: 5.868 ± 0.404
5.367LysThr: 5.367 ± 0.37
4.675LysVal: 4.675 ± 0.342
0.93LysTrp: 0.93 ± 0.143
6.321LysTyr: 6.321 ± 0.368
0.0LysXaa: 0.0 ± 0.0
Leu
3.196LeuAla: 3.196 ± 0.286
1.956LeuCys: 1.956 ± 0.234
5.606LeuAsp: 5.606 ± 0.334
7.013LeuGlu: 7.013 ± 0.516
3.22LeuPhe: 3.22 ± 0.282
5.343LeuGly: 5.343 ± 0.535
1.574LeuHis: 1.574 ± 0.198
6.441LeuIle: 6.441 ± 0.375
9.971LeuLys: 9.971 ± 0.542
7.275LeuLeu: 7.275 ± 0.471
2.218LeuMet: 2.218 ± 0.222
7.037LeuAsn: 7.037 ± 0.394
3.006LeuPro: 3.006 ± 0.286
2.934LeuGln: 2.934 ± 0.299
2.099LeuArg: 2.099 ± 0.183
5.844LeuSer: 5.844 ± 0.365
3.84LeuThr: 3.84 ± 0.298
3.459LeuVal: 3.459 ± 0.266
0.525LeuTrp: 0.525 ± 0.097
4.246LeuTyr: 4.246 ± 0.274
0.0LeuXaa: 0.0 ± 0.0
Met
1.312MetAla: 1.312 ± 0.15
0.549MetCys: 0.549 ± 0.091
1.193MetAsp: 1.193 ± 0.159
1.407MetGlu: 1.407 ± 0.205
1.36MetPhe: 1.36 ± 0.194
1.002MetGly: 1.002 ± 0.156
0.191MetHis: 0.191 ± 0.079
1.384MetIle: 1.384 ± 0.188
2.457MetLys: 2.457 ± 0.25
2.099MetLeu: 2.099 ± 0.248
0.262MetMet: 0.262 ± 0.082
1.717MetAsn: 1.717 ± 0.221
0.501MetPro: 0.501 ± 0.126
0.477MetGln: 0.477 ± 0.115
0.453MetArg: 0.453 ± 0.102
1.598MetSer: 1.598 ± 0.186
0.835MetThr: 0.835 ± 0.143
1.026MetVal: 1.026 ± 0.136
0.215MetTrp: 0.215 ± 0.07
1.097MetTyr: 1.097 ± 0.156
0.0MetXaa: 0.0 ± 0.0
Asn
2.266AsnAla: 2.266 ± 0.241
1.455AsnCys: 1.455 ± 0.208
4.341AsnAsp: 4.341 ± 0.31
5.51AsnGlu: 5.51 ± 0.423
4.127AsnPhe: 4.127 ± 0.317
5.725AsnGly: 5.725 ± 0.454
1.67AsnHis: 1.67 ± 0.203
10.52AsnIle: 10.52 ± 0.567
8.611AsnLys: 8.611 ± 0.462
7.442AsnLeu: 7.442 ± 0.529
2.075AsnMet: 2.075 ± 0.217
7.609AsnAsn: 7.609 ± 0.5
2.123AsnPro: 2.123 ± 0.268
2.123AsnGln: 2.123 ± 0.21
2.242AsnArg: 2.242 ± 0.233
5.129AsnSer: 5.129 ± 0.344
4.079AsnThr: 4.079 ± 0.339
4.723AsnVal: 4.723 ± 0.302
0.453AsnTrp: 0.453 ± 0.106
4.007AsnTyr: 4.007 ± 0.334
0.0AsnXaa: 0.0 ± 0.0
Pro
0.668ProAla: 0.668 ± 0.127
0.215ProCys: 0.215 ± 0.077
1.55ProAsp: 1.55 ± 0.162
1.956ProGlu: 1.956 ± 0.218
1.264ProPhe: 1.264 ± 0.151
1.384ProGly: 1.384 ± 0.162
0.358ProHis: 0.358 ± 0.106
2.648ProIle: 2.648 ± 0.261
2.552ProLys: 2.552 ± 0.27
1.956ProLeu: 1.956 ± 0.234
0.358ProMet: 0.358 ± 0.105
2.29ProAsn: 2.29 ± 0.219
0.596ProPro: 0.596 ± 0.154
0.668ProGln: 0.668 ± 0.152
0.692ProArg: 0.692 ± 0.108
2.576ProSer: 2.576 ± 0.313
1.574ProThr: 1.574 ± 0.191
1.312ProVal: 1.312 ± 0.163
0.167ProTrp: 0.167 ± 0.06
1.384ProTyr: 1.384 ± 0.204
0.0ProXaa: 0.0 ± 0.0
Gln
1.193GlnAla: 1.193 ± 0.22
0.453GlnCys: 0.453 ± 0.107
1.24GlnAsp: 1.24 ± 0.187
1.789GlnGlu: 1.789 ± 0.212
1.646GlnPhe: 1.646 ± 0.21
1.574GlnGly: 1.574 ± 0.206
0.31GlnHis: 0.31 ± 0.094
1.646GlnIle: 1.646 ± 0.206
2.314GlnLys: 2.314 ± 0.279
3.22GlnLeu: 3.22 ± 0.255
0.644GlnMet: 0.644 ± 0.112
2.099GlnAsn: 2.099 ± 0.236
0.692GlnPro: 0.692 ± 0.132
1.264GlnGln: 1.264 ± 0.195
0.763GlnArg: 0.763 ± 0.141
1.479GlnSer: 1.479 ± 0.193
1.312GlnThr: 1.312 ± 0.174
1.36GlnVal: 1.36 ± 0.199
0.191GlnTrp: 0.191 ± 0.068
1.455GlnTyr: 1.455 ± 0.189
0.0GlnXaa: 0.0 ± 0.0
Arg
1.073ArgAla: 1.073 ± 0.161
0.31ArgCys: 0.31 ± 0.094
1.384ArgAsp: 1.384 ± 0.195
1.861ArgGlu: 1.861 ± 0.2
1.336ArgPhe: 1.336 ± 0.165
1.431ArgGly: 1.431 ± 0.181
0.286ArgHis: 0.286 ± 0.094
2.004ArgIle: 2.004 ± 0.224
2.839ArgLys: 2.839 ± 0.261
2.695ArgLeu: 2.695 ± 0.245
0.549ArgMet: 0.549 ± 0.123
1.956ArgAsn: 1.956 ± 0.233
0.549ArgPro: 0.549 ± 0.102
1.026ArgGln: 1.026 ± 0.168
0.716ArgArg: 0.716 ± 0.153
1.574ArgSer: 1.574 ± 0.199
1.622ArgThr: 1.622 ± 0.199
1.479ArgVal: 1.479 ± 0.169
0.215ArgTrp: 0.215 ± 0.077
1.336ArgTyr: 1.336 ± 0.199
0.0ArgXaa: 0.0 ± 0.0
Ser
2.266SerAla: 2.266 ± 0.252
0.692SerCys: 0.692 ± 0.145
4.532SerAsp: 4.532 ± 0.362
4.508SerGlu: 4.508 ± 0.338
4.079SerPhe: 4.079 ± 0.33
3.387SerGly: 3.387 ± 0.385
1.073SerHis: 1.073 ± 0.162
6.798SerIle: 6.798 ± 0.363
7.252SerLys: 7.252 ± 0.419
6.655SerLeu: 6.655 ± 0.367
1.67SerMet: 1.67 ± 0.189
5.558SerAsn: 5.558 ± 0.348
1.407SerPro: 1.407 ± 0.199
1.431SerGln: 1.431 ± 0.169
1.837SerArg: 1.837 ± 0.218
4.795SerSer: 4.795 ± 0.406
3.459SerThr: 3.459 ± 0.291
3.65SerVal: 3.65 ± 0.331
0.62SerTrp: 0.62 ± 0.126
3.316SerTyr: 3.316 ± 0.264
0.0SerXaa: 0.0 ± 0.0
Thr
1.55ThrAla: 1.55 ± 0.202
1.002ThrCys: 1.002 ± 0.178
3.101ThrAsp: 3.101 ± 0.298
3.65ThrGlu: 3.65 ± 0.242
2.934ThrPhe: 2.934 ± 0.256
2.815ThrGly: 2.815 ± 0.29
0.644ThrHis: 0.644 ± 0.138
4.747ThrIle: 4.747 ± 0.327
4.508ThrLys: 4.508 ± 0.4
4.198ThrLeu: 4.198 ± 0.346
0.763ThrMet: 0.763 ± 0.117
4.318ThrAsn: 4.318 ± 0.375
2.409ThrPro: 2.409 ± 0.233
1.55ThrGln: 1.55 ± 0.237
1.24ThrArg: 1.24 ± 0.165
3.459ThrSer: 3.459 ± 0.251
2.815ThrThr: 2.815 ± 0.241
3.006ThrVal: 3.006 ± 0.293
0.406ThrTrp: 0.406 ± 0.095
2.886ThrTyr: 2.886 ± 0.242
0.0ThrXaa: 0.0 ± 0.0
Val
2.218ValAla: 2.218 ± 0.286
1.002ValCys: 1.002 ± 0.187
3.029ValAsp: 3.029 ± 0.42
3.936ValGlu: 3.936 ± 0.328
2.886ValPhe: 2.886 ± 0.302
2.529ValGly: 2.529 ± 0.264
0.453ValHis: 0.453 ± 0.101
4.866ValIle: 4.866 ± 0.339
5.796ValLys: 5.796 ± 0.365
3.96ValLeu: 3.96 ± 0.348
0.811ValMet: 0.811 ± 0.136
3.65ValAsn: 3.65 ± 0.298
1.121ValPro: 1.121 ± 0.158
0.835ValGln: 0.835 ± 0.147
1.264ValArg: 1.264 ± 0.146
3.984ValSer: 3.984 ± 0.385
2.91ValThr: 2.91 ± 0.296
2.862ValVal: 2.862 ± 0.255
0.692ValTrp: 0.692 ± 0.139
2.6ValTyr: 2.6 ± 0.242
0.0ValXaa: 0.0 ± 0.0
Trp
0.358TrpAla: 0.358 ± 0.096
0.239TrpCys: 0.239 ± 0.088
0.668TrpAsp: 0.668 ± 0.102
0.883TrpGlu: 0.883 ± 0.135
0.262TrpPhe: 0.262 ± 0.082
0.358TrpGly: 0.358 ± 0.08
0.334TrpHis: 0.334 ± 0.082
0.596TrpIle: 0.596 ± 0.122
0.501TrpLys: 0.501 ± 0.113
0.525TrpLeu: 0.525 ± 0.116
0.286TrpMet: 0.286 ± 0.088
0.692TrpAsn: 0.692 ± 0.128
0.024TrpPro: 0.024 ± 0.022
0.143TrpGln: 0.143 ± 0.055
0.215TrpArg: 0.215 ± 0.074
0.406TrpSer: 0.406 ± 0.108
0.477TrpThr: 0.477 ± 0.085
0.62TrpVal: 0.62 ± 0.11
0.048TrpTrp: 0.048 ± 0.034
0.453TrpTyr: 0.453 ± 0.12
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.622TyrAla: 1.622 ± 0.195
1.002TyrCys: 1.002 ± 0.143
3.411TyrAsp: 3.411 ± 0.288
2.886TyrGlu: 2.886 ± 0.239
2.958TyrPhe: 2.958 ± 0.272
2.481TyrGly: 2.481 ± 0.257
0.906TyrHis: 0.906 ± 0.138
5.152TyrIle: 5.152 ± 0.365
5.224TyrLys: 5.224 ± 0.375
3.769TyrLeu: 3.769 ± 0.323
1.288TyrMet: 1.288 ± 0.18
5.009TyrAsn: 5.009 ± 0.328
1.479TyrPro: 1.479 ± 0.181
1.169TyrGln: 1.169 ± 0.152
1.288TyrArg: 1.288 ± 0.153
4.556TyrSer: 4.556 ± 0.364
2.839TyrThr: 2.839 ± 0.252
2.576TyrVal: 2.576 ± 0.266
0.501TyrTrp: 0.501 ± 0.099
2.886TyrTyr: 2.886 ± 0.227
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 169 proteins (41923 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski