Amino acid dipepetide frequency for Acinetobacter phage vB_AbaM_B9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.857AlaAla: 2.857 ± 0.557
0.789AlaCys: 0.789 ± 0.212
2.669AlaAsp: 2.669 ± 0.293
3.383AlaGlu: 3.383 ± 0.366
1.767AlaPhe: 1.767 ± 0.269
1.917AlaGly: 1.917 ± 0.264
0.827AlaHis: 0.827 ± 0.188
3.909AlaIle: 3.909 ± 0.419
4.511AlaLys: 4.511 ± 0.51
5.074AlaLeu: 5.074 ± 0.509
1.353AlaMet: 1.353 ± 0.212
2.669AlaAsn: 2.669 ± 0.353
1.09AlaPro: 1.09 ± 0.227
1.804AlaGln: 1.804 ± 0.3
1.917AlaArg: 1.917 ± 0.249
2.969AlaSer: 2.969 ± 0.338
3.383AlaThr: 3.383 ± 0.4
2.782AlaVal: 2.782 ± 0.387
0.714AlaTrp: 0.714 ± 0.147
2.669AlaTyr: 2.669 ± 0.285
0.0AlaXaa: 0.0 ± 0.0
Cys
0.639CysAla: 0.639 ± 0.149
0.338CysCys: 0.338 ± 0.119
1.203CysAsp: 1.203 ± 0.212
0.827CysGlu: 0.827 ± 0.16
1.09CysPhe: 1.09 ± 0.197
0.827CysGly: 0.827 ± 0.183
0.301CysHis: 0.301 ± 0.122
0.714CysIle: 0.714 ± 0.167
1.466CysLys: 1.466 ± 0.252
1.391CysLeu: 1.391 ± 0.246
0.338CysMet: 0.338 ± 0.117
0.94CysAsn: 0.94 ± 0.209
0.413CysPro: 0.413 ± 0.106
0.338CysGln: 0.338 ± 0.113
0.564CysArg: 0.564 ± 0.14
0.865CysSer: 0.865 ± 0.198
0.526CysThr: 0.526 ± 0.157
0.827CysVal: 0.827 ± 0.187
0.226CysTrp: 0.226 ± 0.088
0.827CysTyr: 0.827 ± 0.168
0.0CysXaa: 0.0 ± 0.0
Asp
3.608AspAla: 3.608 ± 0.375
1.052AspCys: 1.052 ± 0.188
5.15AspAsp: 5.15 ± 0.581
5.826AspGlu: 5.826 ± 0.511
3.834AspPhe: 3.834 ± 0.385
4.661AspGly: 4.661 ± 0.43
1.128AspHis: 1.128 ± 0.23
4.774AspIle: 4.774 ± 0.453
4.511AspLys: 4.511 ± 0.44
6.052AspLeu: 6.052 ± 0.464
1.391AspMet: 1.391 ± 0.197
3.458AspAsn: 3.458 ± 0.359
1.203AspPro: 1.203 ± 0.273
2.18AspGln: 2.18 ± 0.295
1.917AspArg: 1.917 ± 0.268
3.684AspSer: 3.684 ± 0.334
2.556AspThr: 2.556 ± 0.378
5.15AspVal: 5.15 ± 0.467
1.316AspTrp: 1.316 ± 0.213
3.721AspTyr: 3.721 ± 0.378
0.0AspXaa: 0.0 ± 0.0
Glu
4.548GluAla: 4.548 ± 0.489
0.752GluCys: 0.752 ± 0.196
4.623GluAsp: 4.623 ± 0.424
6.164GluGlu: 6.164 ± 0.684
3.646GluPhe: 3.646 ± 0.448
3.233GluGly: 3.233 ± 0.312
1.316GluHis: 1.316 ± 0.259
5.225GluIle: 5.225 ± 0.421
5.525GluLys: 5.525 ± 0.554
6.766GluLeu: 6.766 ± 0.475
2.406GluMet: 2.406 ± 0.356
4.435GluAsn: 4.435 ± 0.416
1.24GluPro: 1.24 ± 0.193
3.12GluGln: 3.12 ± 0.389
2.894GluArg: 2.894 ± 0.321
3.759GluSer: 3.759 ± 0.389
3.608GluThr: 3.608 ± 0.317
6.277GluVal: 6.277 ± 0.502
1.128GluTrp: 1.128 ± 0.249
5.037GluTyr: 5.037 ± 0.414
0.0GluXaa: 0.0 ± 0.0
Phe
2.03PheAla: 2.03 ± 0.246
0.639PheCys: 0.639 ± 0.173
3.684PheAsp: 3.684 ± 0.429
3.045PheGlu: 3.045 ± 0.312
1.804PhePhe: 1.804 ± 0.262
3.12PheGly: 3.12 ± 0.383
0.338PheHis: 0.338 ± 0.096
3.458PheIle: 3.458 ± 0.403
3.947PheLys: 3.947 ± 0.41
2.631PheLeu: 2.631 ± 0.338
1.052PheMet: 1.052 ± 0.195
2.969PheAsn: 2.969 ± 0.359
1.128PhePro: 1.128 ± 0.176
1.24PheGln: 1.24 ± 0.205
1.879PheArg: 1.879 ± 0.262
3.308PheSer: 3.308 ± 0.419
2.819PheThr: 2.819 ± 0.324
3.082PheVal: 3.082 ± 0.422
0.601PheTrp: 0.601 ± 0.127
2.669PheTyr: 2.669 ± 0.311
0.0PheXaa: 0.0 ± 0.0
Gly
1.917GlyAla: 1.917 ± 0.266
0.902GlyCys: 0.902 ± 0.201
3.646GlyAsp: 3.646 ± 0.36
4.736GlyGlu: 4.736 ± 0.462
2.969GlyPhe: 2.969 ± 0.305
3.872GlyGly: 3.872 ± 0.527
0.639GlyHis: 0.639 ± 0.142
4.097GlyIle: 4.097 ± 0.47
4.849GlyLys: 4.849 ± 0.373
4.661GlyLeu: 4.661 ± 0.479
0.827GlyMet: 0.827 ± 0.178
3.27GlyAsn: 3.27 ± 0.339
0.15GlyPro: 0.15 ± 0.094
1.654GlyGln: 1.654 ± 0.271
2.518GlyArg: 2.518 ± 0.398
4.21GlySer: 4.21 ± 0.415
3.496GlyThr: 3.496 ± 0.532
5.676GlyVal: 5.676 ± 0.467
1.391GlyTrp: 1.391 ± 0.279
3.045GlyTyr: 3.045 ± 0.39
0.0GlyXaa: 0.0 ± 0.0
His
0.865HisAla: 0.865 ± 0.18
0.413HisCys: 0.413 ± 0.115
1.165HisAsp: 1.165 ± 0.225
1.09HisGlu: 1.09 ± 0.218
0.752HisPhe: 0.752 ± 0.149
1.052HisGly: 1.052 ± 0.23
0.376HisHis: 0.376 ± 0.122
0.94HisIle: 0.94 ± 0.178
1.353HisLys: 1.353 ± 0.258
1.879HisLeu: 1.879 ± 0.296
0.526HisMet: 0.526 ± 0.142
0.977HisAsn: 0.977 ± 0.159
0.752HisPro: 0.752 ± 0.166
0.413HisGln: 0.413 ± 0.132
0.714HisArg: 0.714 ± 0.174
1.24HisSer: 1.24 ± 0.214
1.24HisThr: 1.24 ± 0.245
0.977HisVal: 0.977 ± 0.188
0.338HisTrp: 0.338 ± 0.095
0.902HisTyr: 0.902 ± 0.222
0.0HisXaa: 0.0 ± 0.0
Ile
3.345IleAla: 3.345 ± 0.49
1.09IleCys: 1.09 ± 0.241
5.074IleAsp: 5.074 ± 0.388
5.563IleGlu: 5.563 ± 0.529
1.804IlePhe: 1.804 ± 0.251
4.36IleGly: 4.36 ± 0.43
1.691IleHis: 1.691 ± 0.222
4.886IleIle: 4.886 ± 0.484
5.751IleLys: 5.751 ± 0.509
5.638IleLeu: 5.638 ± 0.488
1.616IleMet: 1.616 ± 0.242
5.601IleAsn: 5.601 ± 0.576
1.804IlePro: 1.804 ± 0.234
2.819IleGln: 2.819 ± 0.325
2.782IleArg: 2.782 ± 0.282
4.21IleSer: 4.21 ± 0.352
4.285IleThr: 4.285 ± 0.478
4.886IleVal: 4.886 ± 0.489
0.677IleTrp: 0.677 ± 0.161
2.443IleTyr: 2.443 ± 0.288
0.0IleXaa: 0.0 ± 0.0
Lys
3.646LysAla: 3.646 ± 0.412
1.165LysCys: 1.165 ± 0.2
5.563LysAsp: 5.563 ± 0.476
6.766LysGlu: 6.766 ± 0.539
2.932LysPhe: 2.932 ± 0.283
5.375LysGly: 5.375 ± 0.56
1.729LysHis: 1.729 ± 0.278
5.338LysIle: 5.338 ± 0.488
5.789LysLys: 5.789 ± 0.542
7.33LysLeu: 7.33 ± 0.533
2.782LysMet: 2.782 ± 0.352
4.699LysAsn: 4.699 ± 0.439
1.729LysPro: 1.729 ± 0.257
3.345LysGln: 3.345 ± 0.352
3.27LysArg: 3.27 ± 0.38
4.623LysSer: 4.623 ± 0.46
4.811LysThr: 4.811 ± 0.407
4.774LysVal: 4.774 ± 0.402
0.865LysTrp: 0.865 ± 0.176
4.135LysTyr: 4.135 ± 0.473
0.0LysXaa: 0.0 ± 0.0
Leu
4.586LeuAla: 4.586 ± 0.441
1.128LeuCys: 1.128 ± 0.248
5.338LeuAsp: 5.338 ± 0.445
6.465LeuGlu: 6.465 ± 0.495
3.947LeuPhe: 3.947 ± 0.372
4.774LeuGly: 4.774 ± 0.415
1.466LeuHis: 1.466 ± 0.264
5.413LeuIle: 5.413 ± 0.492
7.405LeuLys: 7.405 ± 0.571
7.292LeuLeu: 7.292 ± 0.661
1.579LeuMet: 1.579 ± 0.223
5.939LeuAsn: 5.939 ± 0.465
2.443LeuPro: 2.443 ± 0.323
3.12LeuGln: 3.12 ± 0.407
2.556LeuArg: 2.556 ± 0.324
6.39LeuSer: 6.39 ± 0.602
5.601LeuThr: 5.601 ± 0.47
4.999LeuVal: 4.999 ± 0.381
0.902LeuTrp: 0.902 ± 0.168
3.045LeuTyr: 3.045 ± 0.27
0.0LeuXaa: 0.0 ± 0.0
Met
1.316MetAla: 1.316 ± 0.223
0.301MetCys: 0.301 ± 0.101
0.94MetAsp: 0.94 ± 0.201
1.804MetGlu: 1.804 ± 0.295
0.94MetPhe: 0.94 ± 0.158
0.601MetGly: 0.601 ± 0.137
0.301MetHis: 0.301 ± 0.116
1.691MetIle: 1.691 ± 0.281
2.067MetLys: 2.067 ± 0.296
2.105MetLeu: 2.105 ± 0.263
0.564MetMet: 0.564 ± 0.164
1.466MetAsn: 1.466 ± 0.209
0.338MetPro: 0.338 ± 0.092
0.827MetGln: 0.827 ± 0.178
0.94MetArg: 0.94 ± 0.201
2.518MetSer: 2.518 ± 0.268
1.804MetThr: 1.804 ± 0.263
0.752MetVal: 0.752 ± 0.176
0.15MetTrp: 0.15 ± 0.068
0.714MetTyr: 0.714 ± 0.168
0.0MetXaa: 0.0 ± 0.0
Asn
2.932AsnAla: 2.932 ± 0.333
0.601AsnCys: 0.601 ± 0.172
3.834AsnAsp: 3.834 ± 0.389
4.849AsnGlu: 4.849 ± 0.454
2.594AsnPhe: 2.594 ± 0.342
4.135AsnGly: 4.135 ± 0.532
1.466AsnHis: 1.466 ± 0.225
3.909AsnIle: 3.909 ± 0.428
5.262AsnLys: 5.262 ± 0.397
4.774AsnLeu: 4.774 ± 0.413
1.278AsnMet: 1.278 ± 0.252
3.834AsnAsn: 3.834 ± 0.553
2.255AsnPro: 2.255 ± 0.312
2.067AsnGln: 2.067 ± 0.268
2.594AsnArg: 2.594 ± 0.297
3.947AsnSer: 3.947 ± 0.518
3.646AsnThr: 3.646 ± 0.398
3.533AsnVal: 3.533 ± 0.368
0.865AsnTrp: 0.865 ± 0.204
2.894AsnTyr: 2.894 ± 0.341
0.0AsnXaa: 0.0 ± 0.0
Pro
1.278ProAla: 1.278 ± 0.27
0.263ProCys: 0.263 ± 0.104
1.541ProAsp: 1.541 ± 0.267
1.767ProGlu: 1.767 ± 0.247
0.977ProPhe: 0.977 ± 0.156
0.0ProGly: 0.0 ± 0.0
0.564ProHis: 0.564 ± 0.148
2.33ProIle: 2.33 ± 0.317
2.143ProLys: 2.143 ± 0.314
1.767ProLeu: 1.767 ± 0.261
0.526ProMet: 0.526 ± 0.133
1.804ProAsn: 1.804 ± 0.321
0.639ProPro: 0.639 ± 0.201
0.752ProGln: 0.752 ± 0.182
0.902ProArg: 0.902 ± 0.214
2.293ProSer: 2.293 ± 0.306
1.955ProThr: 1.955 ± 0.325
1.541ProVal: 1.541 ± 0.189
0.038ProTrp: 0.038 ± 0.035
1.466ProTyr: 1.466 ± 0.21
0.0ProXaa: 0.0 ± 0.0
Gln
2.218GlnAla: 2.218 ± 0.31
0.489GlnCys: 0.489 ± 0.139
2.105GlnAsp: 2.105 ± 0.319
2.631GlnGlu: 2.631 ± 0.295
1.278GlnPhe: 1.278 ± 0.257
2.33GlnGly: 2.33 ± 0.408
0.677GlnHis: 0.677 ± 0.186
2.293GlnIle: 2.293 ± 0.298
2.33GlnLys: 2.33 ± 0.301
2.819GlnLeu: 2.819 ± 0.5
0.639GlnMet: 0.639 ± 0.134
1.992GlnAsn: 1.992 ± 0.332
1.015GlnPro: 1.015 ± 0.243
1.691GlnGln: 1.691 ± 0.277
1.579GlnArg: 1.579 ± 0.229
1.879GlnSer: 1.879 ± 0.336
1.691GlnThr: 1.691 ± 0.275
2.481GlnVal: 2.481 ± 0.29
0.601GlnTrp: 0.601 ± 0.167
1.767GlnTyr: 1.767 ± 0.289
0.0GlnXaa: 0.0 ± 0.0
Arg
1.504ArgAla: 1.504 ± 0.253
0.639ArgCys: 0.639 ± 0.158
2.067ArgAsp: 2.067 ± 0.242
3.157ArgGlu: 3.157 ± 0.346
1.992ArgPhe: 1.992 ± 0.244
2.18ArgGly: 2.18 ± 0.247
0.865ArgHis: 0.865 ± 0.2
3.157ArgIle: 3.157 ± 0.293
3.233ArgLys: 3.233 ± 0.377
3.421ArgLeu: 3.421 ± 0.325
0.752ArgMet: 0.752 ± 0.185
2.143ArgAsn: 2.143 ± 0.312
0.902ArgPro: 0.902 ± 0.142
1.353ArgGln: 1.353 ± 0.338
0.94ArgArg: 0.94 ± 0.187
2.255ArgSer: 2.255 ± 0.291
1.691ArgThr: 1.691 ± 0.213
2.631ArgVal: 2.631 ± 0.366
0.564ArgTrp: 0.564 ± 0.133
1.691ArgTyr: 1.691 ± 0.233
0.0ArgXaa: 0.0 ± 0.0
Ser
3.27SerAla: 3.27 ± 0.389
0.827SerCys: 0.827 ± 0.151
3.909SerAsp: 3.909 ± 0.413
5.225SerGlu: 5.225 ± 0.395
3.496SerPhe: 3.496 ± 0.387
3.984SerGly: 3.984 ± 0.431
1.052SerHis: 1.052 ± 0.211
4.736SerIle: 4.736 ± 0.451
5.563SerLys: 5.563 ± 0.473
5.488SerLeu: 5.488 ± 0.467
1.353SerMet: 1.353 ± 0.208
3.571SerAsn: 3.571 ± 0.313
1.804SerPro: 1.804 ± 0.252
1.691SerGln: 1.691 ± 0.283
2.33SerArg: 2.33 ± 0.292
4.999SerSer: 4.999 ± 0.506
3.421SerThr: 3.421 ± 0.422
4.999SerVal: 4.999 ± 0.476
0.902SerTrp: 0.902 ± 0.189
3.045SerTyr: 3.045 ± 0.291
0.0SerXaa: 0.0 ± 0.0
Thr
2.894ThrAla: 2.894 ± 0.341
1.015ThrCys: 1.015 ± 0.184
3.233ThrAsp: 3.233 ± 0.323
3.383ThrGlu: 3.383 ± 0.296
2.969ThrPhe: 2.969 ± 0.34
3.721ThrGly: 3.721 ± 0.529
1.278ThrHis: 1.278 ± 0.24
4.06ThrIle: 4.06 ± 0.493
4.285ThrLys: 4.285 ± 0.417
5.037ThrLeu: 5.037 ± 0.401
0.94ThrMet: 0.94 ± 0.203
3.27ThrAsn: 3.27 ± 0.434
2.143ThrPro: 2.143 ± 0.32
2.218ThrGln: 2.218 ± 0.376
1.767ThrArg: 1.767 ± 0.286
4.247ThrSer: 4.247 ± 0.37
3.759ThrThr: 3.759 ± 0.479
3.834ThrVal: 3.834 ± 0.369
0.601ThrTrp: 0.601 ± 0.161
2.669ThrTyr: 2.669 ± 0.329
0.0ThrXaa: 0.0 ± 0.0
Val
3.308ValAla: 3.308 ± 0.361
0.94ValCys: 0.94 ± 0.249
6.315ValAsp: 6.315 ± 0.543
5.225ValGlu: 5.225 ± 0.522
3.458ValPhe: 3.458 ± 0.376
4.435ValGly: 4.435 ± 0.458
0.94ValHis: 0.94 ± 0.208
5.187ValIle: 5.187 ± 0.423
5.789ValLys: 5.789 ± 0.508
4.774ValLeu: 4.774 ± 0.453
1.203ValMet: 1.203 ± 0.224
3.571ValAsn: 3.571 ± 0.431
1.955ValPro: 1.955 ± 0.262
1.579ValGln: 1.579 ± 0.232
2.669ValArg: 2.669 ± 0.299
4.511ValSer: 4.511 ± 0.386
3.984ValThr: 3.984 ± 0.389
5.601ValVal: 5.601 ± 0.528
0.902ValTrp: 0.902 ± 0.217
3.496ValTyr: 3.496 ± 0.377
0.0ValXaa: 0.0 ± 0.0
Trp
0.451TrpAla: 0.451 ± 0.155
0.301TrpCys: 0.301 ± 0.111
0.977TrpAsp: 0.977 ± 0.189
0.865TrpGlu: 0.865 ± 0.197
0.902TrpPhe: 0.902 ± 0.168
0.489TrpGly: 0.489 ± 0.118
0.188TrpHis: 0.188 ± 0.063
1.128TrpIle: 1.128 ± 0.225
1.09TrpLys: 1.09 ± 0.189
1.24TrpLeu: 1.24 ± 0.213
0.376TrpMet: 0.376 ± 0.16
1.391TrpAsn: 1.391 ± 0.23
0.0TrpPro: 0.0 ± 0.0
0.376TrpGln: 0.376 ± 0.126
0.526TrpArg: 0.526 ± 0.137
0.789TrpSer: 0.789 ± 0.177
0.789TrpThr: 0.789 ± 0.164
0.714TrpVal: 0.714 ± 0.173
0.226TrpTrp: 0.226 ± 0.092
0.94TrpTyr: 0.94 ± 0.187
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.842TyrAla: 1.842 ± 0.236
1.015TyrCys: 1.015 ± 0.175
4.172TyrAsp: 4.172 ± 0.48
2.932TyrGlu: 2.932 ± 0.356
2.293TyrPhe: 2.293 ± 0.422
3.458TyrGly: 3.458 ± 0.364
0.827TyrHis: 0.827 ± 0.179
3.082TyrIle: 3.082 ± 0.352
3.796TyrLys: 3.796 ± 0.386
4.247TyrLeu: 4.247 ± 0.383
0.601TyrMet: 0.601 ± 0.147
3.27TyrAsn: 3.27 ± 0.428
1.541TyrPro: 1.541 ± 0.257
1.917TyrGln: 1.917 ± 0.241
1.879TyrArg: 1.879 ± 0.291
2.969TyrSer: 2.969 ± 0.346
2.255TyrThr: 2.255 ± 0.27
4.323TyrVal: 4.323 ± 0.356
0.752TyrTrp: 0.752 ± 0.136
2.744TyrTyr: 2.744 ± 0.311
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 156 proteins (26605 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski