Amino acid dipeptide frequency for Cronobacter phage vB_CsaP_009

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
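A minimal sketch of how such a frequency could be computed is given here. It assumes one-letter sequences, overlapping pairs that never span two proteins, and normalisation by the total number of pairs; the function name `dipeptide_frequencies` and the toy sequences are illustrative only, and the exact normalisation used by the database is not stated on this page.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: frequencies are normalised by the total number of
    overlapping pairs; the database may normalise differently.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein separately,
        # so no pair is formed across a protein boundary.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
```

Here `freqs["KK"]` is the share of `KK` among the six pairs found in the two toy sequences, expressed in per mille.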

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would have a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
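The arithmetic behind the expected value can be sketched as follows. The comparison rule in `classify` is an assumption made for illustration (a plain comparison against the uniform expectation); the page does not state the exact rule used for the red and blue marking.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)

def classify(value_permille, expected_permille=None):
    """Label a value as over- or underrepresented (hypothetical rule)."""
    if expected_permille is None:
        expected_permille = uniform_expectation_permille()
    if value_permille > expected_permille:
        return "over"
    if value_permille < expected_permille:
        return "under"
    return "as expected"

expected_20 = uniform_expectation_permille(20)  # 400 possible dipeptides
expected_21 = uniform_expectation_permille(21)  # 441 when Xaa is included

# Two values taken from the table: GluGlu (6.879) and HisHis (0.108).
glu_glu = classify(6.879)
his_his = classify(0.108)
```

With 20 amino acids the expectation is 2.5‰, so under this simple rule GluGlu counts as overrepresented and HisHis as underrepresented.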

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
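As a small worked example of this conversion, the sketch below parses one table entry of the form `AlaAla: 3.133 ± 0.615` and rescales both numbers to plain fractions. The helper name `parse_entry` and the regular expression are assumptions made for illustration, not part of the database.

```python
import re

# Two three-letter residue codes, a colon, then "value ± error".
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)

def parse_entry(line):
    """Return ((first, second), fraction, fraction_error) for one entry."""
    match = ENTRY.search(line)
    if match is None:
        raise ValueError(f"not a dipeptide entry: {line!r}")
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return (first, second), float(value) * 1e-3, float(error) * 1e-3

pair, fraction, fraction_error = parse_entry("AlaAla: 3.133 ± 0.615")
```

For this entry, 3.133‰ corresponds to a fraction of about 0.003133, or roughly 0.31%.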

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.133 ± 0.615
AlaCys: 0.576 ± 0.204
AlaAsp: 2.377 ± 0.339
AlaGlu: 4.25 ± 0.54
AlaPhe: 2.449 ± 0.309
AlaGly: 3.89 ± 0.568
AlaHis: 0.864 ± 0.188
AlaIle: 3.781 ± 0.388
AlaLys: 3.745 ± 0.353
AlaLeu: 4.394 ± 0.312
AlaMet: 1.729 ± 0.264
AlaAsn: 3.097 ± 0.318
AlaPro: 1.981 ± 0.23
AlaGln: 2.593 ± 0.415
AlaArg: 2.773 ± 0.357
AlaSer: 3.926 ± 0.403
AlaThr: 3.673 ± 0.42
AlaVal: 3.673 ± 0.421
AlaTrp: 0.288 ± 0.099
AlaTyr: 1.801 ± 0.325
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.648 ± 0.178
CysCys: 0.288 ± 0.104
CysAsp: 0.864 ± 0.171
CysGlu: 0.684 ± 0.142
CysPhe: 0.576 ± 0.168
CysGly: 1.044 ± 0.236
CysHis: 0.252 ± 0.111
CysIle: 0.612 ± 0.135
CysLys: 0.648 ± 0.207
CysLeu: 0.792 ± 0.204
CysMet: 0.288 ± 0.115
CysAsn: 0.756 ± 0.166
CysPro: 0.576 ± 0.173
CysGln: 0.36 ± 0.107
CysArg: 0.396 ± 0.117
CysSer: 0.864 ± 0.186
CysThr: 0.684 ± 0.17
CysVal: 0.684 ± 0.137
CysTrp: 0.144 ± 0.072
CysTyr: 0.504 ± 0.153
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.529 ± 0.475
AspCys: 0.792 ± 0.179
AspAsp: 3.89 ± 0.4
AspGlu: 4.646 ± 0.429
AspPhe: 2.665 ± 0.355
AspGly: 5.402 ± 0.61
AspHis: 1.224 ± 0.231
AspIle: 4.934 ± 0.468
AspLys: 5.078 ± 0.489
AspLeu: 5.69 ± 0.44
AspMet: 1.801 ± 0.254
AspAsn: 4.394 ± 0.373
AspPro: 2.089 ± 0.288
AspGln: 1.837 ± 0.26
AspArg: 2.845 ± 0.316
AspSer: 5.258 ± 0.465
AspThr: 3.709 ± 0.448
AspVal: 4.07 ± 0.344
AspTrp: 0.936 ± 0.206
AspTyr: 2.989 ± 0.402
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.186 ± 0.515
GluCys: 0.828 ± 0.233
GluAsp: 4.934 ± 0.545
GluGlu: 6.879 ± 0.637
GluPhe: 2.701 ± 0.299
GluGly: 3.313 ± 0.449
GluHis: 1.188 ± 0.212
GluIle: 4.214 ± 0.416
GluLys: 4.826 ± 0.44
GluLeu: 6.086 ± 0.495
GluMet: 2.917 ± 0.376
GluAsn: 3.998 ± 0.396
GluPro: 1.837 ± 0.269
GluGln: 2.485 ± 0.409
GluArg: 3.421 ± 0.488
GluSer: 5.438 ± 0.532
GluThr: 3.781 ± 0.299
GluVal: 5.546 ± 0.485
GluTrp: 0.828 ± 0.142
GluTyr: 2.953 ± 0.332
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.621 ± 0.26
PheCys: 0.504 ± 0.165
PheAsp: 3.962 ± 0.439
PheGlu: 2.773 ± 0.338
PhePhe: 1.549 ± 0.266
PheGly: 2.737 ± 0.415
PheHis: 0.864 ± 0.178
PheIle: 2.629 ± 0.413
PheLys: 3.745 ± 0.42
PheLeu: 2.989 ± 0.416
PheMet: 1.08 ± 0.192
PheAsn: 3.097 ± 0.342
PhePro: 1.188 ± 0.201
PheGln: 1.26 ± 0.194
PheArg: 1.693 ± 0.258
PheSer: 3.565 ± 0.408
PheThr: 2.089 ± 0.336
PheVal: 2.665 ± 0.382
PheTrp: 0.324 ± 0.12
PheTyr: 1.441 ± 0.318
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.998 ± 0.477
GlyCys: 0.72 ± 0.228
GlyAsp: 4.43 ± 0.448
GlyGlu: 4.682 ± 0.415
GlyPhe: 2.917 ± 0.373
GlyGly: 4.718 ± 0.682
GlyHis: 1.116 ± 0.22
GlyIle: 3.962 ± 0.421
GlyLys: 5.762 ± 0.502
GlyLeu: 5.33 ± 0.437
GlyMet: 2.305 ± 0.321
GlyAsn: 3.853 ± 0.403
GlyPro: 0.468 ± 0.18
GlyGln: 1.801 ± 0.292
GlyArg: 2.449 ± 0.29
GlySer: 5.258 ± 0.526
GlyThr: 3.349 ± 0.424
GlyVal: 4.862 ± 0.439
GlyTrp: 1.333 ± 0.218
GlyTyr: 2.629 ± 0.302
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.648 ± 0.187
HisCys: 0.396 ± 0.131
HisAsp: 1.188 ± 0.183
HisGlu: 1.008 ± 0.196
HisPhe: 0.936 ± 0.162
HisGly: 1.116 ± 0.211
HisHis: 0.108 ± 0.059
HisIle: 1.152 ± 0.224
HisLys: 1.224 ± 0.246
HisLeu: 1.765 ± 0.219
HisMet: 0.396 ± 0.111
HisAsn: 0.828 ± 0.185
HisPro: 0.864 ± 0.177
HisGln: 0.432 ± 0.123
HisArg: 1.513 ± 0.241
HisSer: 1.369 ± 0.266
HisThr: 0.684 ± 0.156
HisVal: 1.116 ± 0.219
HisTrp: 0.216 ± 0.097
HisTyr: 1.08 ± 0.216
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.745 ± 0.313
IleCys: 0.792 ± 0.198
IleAsp: 5.366 ± 0.46
IleGlu: 4.718 ± 0.43
IlePhe: 2.557 ± 0.441
IleGly: 3.998 ± 0.432
IleHis: 1.26 ± 0.252
IleIle: 3.998 ± 0.42
IleLys: 5.474 ± 0.454
IleLeu: 4.79 ± 0.378
IleMet: 1.765 ± 0.224
IleAsn: 4.574 ± 0.465
IlePro: 3.205 ± 0.267
IleGln: 1.765 ± 0.269
IleArg: 2.953 ± 0.359
IleSer: 5.042 ± 0.465
IleThr: 4.286 ± 0.464
IleVal: 3.529 ± 0.433
IleTrp: 0.612 ± 0.151
IleTyr: 2.629 ± 0.342
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.682 ± 0.586
LysCys: 0.648 ± 0.156
LysAsp: 6.41 ± 0.515
LysGlu: 7.023 ± 0.729
LysPhe: 2.449 ± 0.33
LysGly: 5.042 ± 0.427
LysHis: 1.405 ± 0.201
LysIle: 5.33 ± 0.425
LysLys: 6.555 ± 0.571
LysLeu: 6.23 ± 0.483
LysMet: 2.593 ± 0.344
LysAsn: 4.106 ± 0.419
LysPro: 3.277 ± 0.401
LysGln: 2.449 ± 0.364
LysArg: 3.169 ± 0.376
LysSer: 4.466 ± 0.355
LysThr: 4.574 ± 0.414
LysVal: 4.718 ± 0.438
LysTrp: 1.188 ± 0.192
LysTyr: 3.457 ± 0.483
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.178 ± 0.549
LeuCys: 0.972 ± 0.224
LeuAsp: 6.158 ± 0.47
LeuGlu: 5.258 ± 0.448
LeuPhe: 2.773 ± 0.365
LeuGly: 4.574 ± 0.49
LeuHis: 1.297 ± 0.219
LeuIle: 5.51 ± 0.513
LeuLys: 6.447 ± 0.559
LeuLeu: 5.582 ± 0.541
LeuMet: 2.125 ± 0.284
LeuAsn: 4.646 ± 0.419
LeuPro: 2.881 ± 0.339
LeuGln: 2.773 ± 0.34
LeuArg: 3.745 ± 0.393
LeuSer: 6.879 ± 0.442
LeuThr: 4.214 ± 0.403
LeuVal: 4.43 ± 0.444
LeuTrp: 0.612 ± 0.16
LeuTyr: 3.025 ± 0.345
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.053 ± 0.256
MetCys: 0.432 ± 0.147
MetAsp: 1.657 ± 0.236
MetGlu: 1.981 ± 0.222
MetPhe: 1.621 ± 0.311
MetGly: 1.873 ± 0.368
MetHis: 0.54 ± 0.131
MetIle: 1.729 ± 0.301
MetLys: 3.241 ± 0.322
MetLeu: 1.873 ± 0.313
MetMet: 1.188 ± 0.204
MetAsn: 1.729 ± 0.263
MetPro: 1.116 ± 0.201
MetGln: 0.72 ± 0.151
MetArg: 1.297 ± 0.214
MetSer: 2.485 ± 0.314
MetThr: 1.837 ± 0.314
MetVal: 1.585 ± 0.293
MetTrp: 0.288 ± 0.105
MetTyr: 0.792 ± 0.193
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.421 ± 0.367
AsnCys: 0.72 ± 0.156
AsnAsp: 2.845 ± 0.315
AsnGlu: 4.25 ± 0.364
AsnPhe: 2.197 ± 0.268
AsnGly: 4.322 ± 0.38
AsnHis: 1.188 ± 0.233
AsnIle: 3.817 ± 0.368
AsnLys: 5.51 ± 0.337
AsnLeu: 4.934 ± 0.432
AsnMet: 1.765 ± 0.254
AsnAsn: 3.637 ± 0.356
AsnPro: 2.629 ± 0.331
AsnGln: 2.521 ± 0.265
AsnArg: 3.097 ± 0.324
AsnSer: 3.998 ± 0.386
AsnThr: 2.989 ± 0.274
AsnVal: 3.061 ± 0.322
AsnTrp: 0.792 ± 0.163
AsnTyr: 1.765 ± 0.245
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.405 ± 0.272
ProCys: 0.468 ± 0.12
ProAsp: 2.593 ± 0.338
ProGlu: 3.133 ± 0.347
ProPhe: 2.161 ± 0.306
ProGly: 1.369 ± 0.212
ProHis: 0.684 ± 0.131
ProIle: 2.341 ± 0.264
ProLys: 2.413 ± 0.379
ProLeu: 1.945 ± 0.213
ProMet: 1.044 ± 0.186
ProAsn: 2.089 ± 0.266
ProPro: 0.972 ± 0.208
ProGln: 1.116 ± 0.216
ProArg: 0.972 ± 0.19
ProSer: 2.377 ± 0.369
ProThr: 2.161 ± 0.268
ProVal: 2.377 ± 0.375
ProTrp: 0.432 ± 0.109
ProTyr: 1.729 ± 0.254
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.017 ± 0.235
GlnCys: 0.18 ± 0.085
GlnAsp: 1.945 ± 0.281
GlnGlu: 2.629 ± 0.313
GlnPhe: 1.549 ± 0.264
GlnGly: 1.909 ± 0.321
GlnHis: 0.396 ± 0.104
GlnIle: 2.233 ± 0.313
GlnLys: 2.089 ± 0.251
GlnLeu: 2.521 ± 0.331
GlnMet: 1.044 ± 0.245
GlnAsn: 1.981 ± 0.372
GlnPro: 1.08 ± 0.265
GlnGln: 1.08 ± 0.263
GlnArg: 1.693 ± 0.245
GlnSer: 1.369 ± 0.214
GlnThr: 1.477 ± 0.242
GlnVal: 2.557 ± 0.296
GlnTrp: 0.396 ± 0.126
GlnTyr: 1.116 ± 0.237
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.593 ± 0.335
ArgCys: 0.468 ± 0.146
ArgAsp: 3.349 ± 0.321
ArgGlu: 2.917 ± 0.342
ArgPhe: 2.017 ± 0.235
ArgGly: 2.953 ± 0.337
ArgHis: 0.972 ± 0.175
ArgIle: 2.917 ± 0.325
ArgLys: 3.529 ± 0.351
ArgLeu: 3.998 ± 0.457
ArgMet: 1.513 ± 0.257
ArgAsn: 2.701 ± 0.304
ArgPro: 1.152 ± 0.192
ArgGln: 0.828 ± 0.189
ArgArg: 2.197 ± 0.338
ArgSer: 3.169 ± 0.307
ArgThr: 1.909 ± 0.298
ArgVal: 2.953 ± 0.313
ArgTrp: 0.468 ± 0.185
ArgTyr: 1.549 ± 0.265
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.493 ± 0.366
SerCys: 0.936 ± 0.203
SerAsp: 4.322 ± 0.44
SerGlu: 5.078 ± 0.446
SerPhe: 3.745 ± 0.393
SerGly: 5.33 ± 0.534
SerHis: 1.405 ± 0.249
SerIle: 5.042 ± 0.405
SerLys: 6.735 ± 0.793
SerLeu: 5.69 ± 0.537
SerMet: 1.909 ± 0.242
SerAsn: 3.926 ± 0.376
SerPro: 2.557 ± 0.31
SerGln: 2.197 ± 0.317
SerArg: 3.205 ± 0.367
SerSer: 4.898 ± 0.457
SerThr: 4.61 ± 0.42
SerVal: 5.618 ± 0.448
SerTrp: 0.504 ± 0.122
SerTyr: 2.557 ± 0.283
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.241 ± 0.334
ThrCys: 0.432 ± 0.125
ThrAsp: 3.097 ± 0.406
ThrGlu: 3.349 ± 0.318
ThrPhe: 2.485 ± 0.303
ThrGly: 4.97 ± 0.456
ThrHis: 0.9 ± 0.175
ThrIle: 4.646 ± 0.477
ThrLys: 3.565 ± 0.356
ThrLeu: 5.402 ± 0.488
ThrMet: 1.008 ± 0.163
ThrAsn: 2.737 ± 0.274
ThrPro: 2.197 ± 0.272
ThrGln: 1.621 ± 0.273
ThrArg: 1.909 ± 0.261
ThrSer: 4.286 ± 0.36
ThrThr: 2.773 ± 0.339
ThrVal: 3.493 ± 0.51
ThrTrp: 0.756 ± 0.193
ThrTyr: 1.981 ± 0.253
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.025 ± 0.352
ValCys: 0.864 ± 0.213
ValAsp: 4.574 ± 0.366
ValGlu: 4.466 ± 0.408
ValPhe: 2.233 ± 0.32
ValGly: 3.853 ± 0.391
ValHis: 1.297 ± 0.187
ValIle: 4.43 ± 0.431
ValLys: 5.078 ± 0.468
ValLeu: 4.358 ± 0.408
ValMet: 2.017 ± 0.299
ValAsn: 4.286 ± 0.456
ValPro: 2.233 ± 0.273
ValGln: 2.053 ± 0.251
ValArg: 3.133 ± 0.394
ValSer: 5.258 ± 0.545
ValThr: 3.817 ± 0.332
ValVal: 4.502 ± 0.461
ValTrp: 0.684 ± 0.157
ValTyr: 2.413 ± 0.427
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.576 ± 0.135
TrpCys: 0.144 ± 0.063
TrpAsp: 0.864 ± 0.186
TrpGlu: 1.08 ± 0.211
TrpPhe: 0.72 ± 0.184
TrpGly: 0.9 ± 0.157
TrpHis: 0.18 ± 0.077
TrpIle: 0.972 ± 0.17
TrpLys: 0.792 ± 0.152
TrpLeu: 0.792 ± 0.17
TrpMet: 0.288 ± 0.103
TrpAsn: 0.612 ± 0.144
TrpPro: 0.108 ± 0.054
TrpGln: 0.216 ± 0.083
TrpArg: 0.432 ± 0.133
TrpSer: 1.224 ± 0.246
TrpThr: 0.432 ± 0.134
TrpVal: 0.684 ± 0.141
TrpTrp: 0.216 ± 0.099
TrpTyr: 0.288 ± 0.09
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.801 ± 0.236
TyrCys: 0.504 ± 0.126
TyrAsp: 2.989 ± 0.344
TyrGlu: 2.557 ± 0.313
TyrPhe: 1.477 ± 0.243
TyrGly: 2.629 ± 0.315
TyrHis: 0.9 ± 0.187
TyrIle: 2.809 ± 0.284
TyrLys: 3.313 ± 0.392
TyrLeu: 2.917 ± 0.382
TyrMet: 1.224 ± 0.202
TyrAsn: 2.413 ± 0.236
TyrPro: 1.369 ± 0.248
TyrGln: 1.152 ± 0.199
TyrArg: 1.26 ± 0.202
TyrSer: 2.629 ± 0.329
TyrThr: 1.801 ± 0.263
TyrVal: 2.449 ± 0.353
TyrTrp: 0.504 ± 0.124
TyrTyr: 1.441 ± 0.225
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 140 proteins (27768 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
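One way to read this note is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The use of the standard deviation, the helper names `pair_permille` and `bootstrap_error`, and the toy sequences are all assumptions; the page does not give the exact procedure.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical one-letter sequences).
proteins = ["MKKLAEE", "MAKKE", "MEELK", "MLLKKA"]
kk_error = bootstrap_error(proteins, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins.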

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski