Amino acid dipepetide frequency for Vibrio phage vB_ValS_PJ32

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.853AlaAla: 8.853 ± 1.124
0.847AlaCys: 0.847 ± 0.203
4.914AlaAsp: 4.914 ± 0.515
6.523AlaGlu: 6.523 ± 0.71
2.499AlaPhe: 2.499 ± 0.296
4.702AlaGly: 4.702 ± 0.43
1.144AlaHis: 1.144 ± 0.199
5.634AlaIle: 5.634 ± 0.502
6.523AlaLys: 6.523 ± 0.662
8.514AlaLeu: 8.514 ± 0.637
2.796AlaMet: 2.796 ± 0.329
4.363AlaAsn: 4.363 ± 0.45
3.304AlaPro: 3.304 ± 0.412
3.728AlaGln: 3.728 ± 0.531
4.405AlaArg: 4.405 ± 0.442
4.024AlaSer: 4.024 ± 0.346
4.405AlaThr: 4.405 ± 0.578
5.676AlaVal: 5.676 ± 0.669
0.635AlaTrp: 0.635 ± 0.149
2.881AlaTyr: 2.881 ± 0.365
0.0AlaXaa: 0.0 ± 0.0
Cys
0.974CysAla: 0.974 ± 0.196
0.339CysCys: 0.339 ± 0.155
0.508CysAsp: 0.508 ± 0.169
0.72CysGlu: 0.72 ± 0.203
0.339CysPhe: 0.339 ± 0.125
1.017CysGly: 1.017 ± 0.204
0.254CysHis: 0.254 ± 0.093
0.424CysIle: 0.424 ± 0.149
0.466CysLys: 0.466 ± 0.131
1.313CysLeu: 1.313 ± 0.279
0.169CysMet: 0.169 ± 0.082
0.508CysAsn: 0.508 ± 0.153
0.635CysPro: 0.635 ± 0.182
0.297CysGln: 0.297 ± 0.114
0.974CysArg: 0.974 ± 0.18
0.805CysSer: 0.805 ± 0.225
1.059CysThr: 1.059 ± 0.23
0.551CysVal: 0.551 ± 0.163
0.127CysTrp: 0.127 ± 0.074
0.508CysTyr: 0.508 ± 0.137
0.0CysXaa: 0.0 ± 0.0
Asp
5.719AspAla: 5.719 ± 0.395
0.72AspCys: 0.72 ± 0.157
2.584AspAsp: 2.584 ± 0.361
3.601AspGlu: 3.601 ± 0.442
2.626AspPhe: 2.626 ± 0.324
3.516AspGly: 3.516 ± 0.408
1.525AspHis: 1.525 ± 0.267
3.219AspIle: 3.219 ± 0.358
4.194AspLys: 4.194 ± 0.473
7.371AspLeu: 7.371 ± 0.441
1.017AspMet: 1.017 ± 0.207
2.457AspAsn: 2.457 ± 0.295
3.982AspPro: 3.982 ± 0.427
2.923AspGln: 2.923 ± 0.374
3.304AspArg: 3.304 ± 0.318
2.838AspSer: 2.838 ± 0.326
3.262AspThr: 3.262 ± 0.429
3.685AspVal: 3.685 ± 0.458
0.297AspTrp: 0.297 ± 0.101
2.415AspTyr: 2.415 ± 0.301
0.0AspXaa: 0.0 ± 0.0
Glu
6.396GluAla: 6.396 ± 0.607
0.932GluCys: 0.932 ± 0.208
3.855GluAsp: 3.855 ± 0.425
4.067GluGlu: 4.067 ± 0.424
2.33GluPhe: 2.33 ± 0.299
4.405GluGly: 4.405 ± 0.534
1.525GluHis: 1.525 ± 0.248
5.295GluIle: 5.295 ± 0.373
4.151GluLys: 4.151 ± 0.541
7.413GluLeu: 7.413 ± 0.854
1.356GluMet: 1.356 ± 0.229
3.008GluAsn: 3.008 ± 0.335
2.287GluPro: 2.287 ± 0.339
2.796GluGln: 2.796 ± 0.467
3.855GluArg: 3.855 ± 0.478
3.982GluSer: 3.982 ± 0.503
4.533GluThr: 4.533 ± 0.439
4.321GluVal: 4.321 ± 0.426
0.762GluTrp: 0.762 ± 0.181
2.287GluTyr: 2.287 ± 0.302
0.0GluXaa: 0.0 ± 0.0
Phe
2.33PheAla: 2.33 ± 0.31
0.72PheCys: 0.72 ± 0.17
2.881PheAsp: 2.881 ± 0.49
2.076PheGlu: 2.076 ± 0.326
1.017PhePhe: 1.017 ± 0.21
2.076PheGly: 2.076 ± 0.264
0.466PheHis: 0.466 ± 0.148
2.881PheIle: 2.881 ± 0.335
2.881PheLys: 2.881 ± 0.364
3.135PheLeu: 3.135 ± 0.296
0.762PheMet: 0.762 ± 0.153
2.076PheAsn: 2.076 ± 0.242
1.059PhePro: 1.059 ± 0.202
0.847PheGln: 0.847 ± 0.198
1.271PheArg: 1.271 ± 0.213
2.669PheSer: 2.669 ± 0.342
1.991PheThr: 1.991 ± 0.317
2.118PheVal: 2.118 ± 0.314
0.0PheTrp: 0.0 ± 0.0
1.271PheTyr: 1.271 ± 0.172
0.0PheXaa: 0.0 ± 0.0
Gly
5.253GlyAla: 5.253 ± 0.57
0.593GlyCys: 0.593 ± 0.148
4.448GlyAsp: 4.448 ± 0.449
3.812GlyGlu: 3.812 ± 0.409
2.965GlyPhe: 2.965 ± 0.311
4.787GlyGly: 4.787 ± 0.482
1.144GlyHis: 1.144 ± 0.24
3.685GlyIle: 3.685 ± 0.401
3.177GlyLys: 3.177 ± 0.443
6.312GlyLeu: 6.312 ± 0.467
1.059GlyMet: 1.059 ± 0.236
2.965GlyAsn: 2.965 ± 0.356
0.762GlyPro: 0.762 ± 0.187
2.203GlyGln: 2.203 ± 0.278
3.219GlyArg: 3.219 ± 0.378
4.829GlySer: 4.829 ± 0.506
3.77GlyThr: 3.77 ± 0.385
4.278GlyVal: 4.278 ± 0.412
0.72GlyTrp: 0.72 ± 0.181
2.965GlyTyr: 2.965 ± 0.313
0.0GlyXaa: 0.0 ± 0.0
His
1.567HisAla: 1.567 ± 0.238
0.339HisCys: 0.339 ± 0.12
0.805HisAsp: 0.805 ± 0.19
1.228HisGlu: 1.228 ± 0.238
0.508HisPhe: 0.508 ± 0.131
1.398HisGly: 1.398 ± 0.229
0.424HisHis: 0.424 ± 0.151
0.72HisIle: 0.72 ± 0.205
1.61HisLys: 1.61 ± 0.27
1.779HisLeu: 1.779 ± 0.301
0.169HisMet: 0.169 ± 0.067
0.974HisAsn: 0.974 ± 0.237
0.932HisPro: 0.932 ± 0.204
0.381HisGln: 0.381 ± 0.124
1.356HisArg: 1.356 ± 0.233
0.805HisSer: 0.805 ± 0.205
1.356HisThr: 1.356 ± 0.248
1.398HisVal: 1.398 ± 0.253
0.042HisTrp: 0.042 ± 0.046
0.974HisTyr: 0.974 ± 0.206
0.0HisXaa: 0.0 ± 0.0
Ile
4.871IleAla: 4.871 ± 0.546
1.059IleCys: 1.059 ± 0.24
4.914IleAsp: 4.914 ± 0.425
5.295IleGlu: 5.295 ± 0.38
1.567IlePhe: 1.567 ± 0.284
3.177IleGly: 3.177 ± 0.363
0.89IleHis: 0.89 ± 0.191
3.558IleIle: 3.558 ± 0.408
4.575IleLys: 4.575 ± 0.47
4.914IleLeu: 4.914 ± 0.475
1.059IleMet: 1.059 ± 0.21
3.855IleAsn: 3.855 ± 0.393
3.135IlePro: 3.135 ± 0.427
2.584IleGln: 2.584 ± 0.386
4.067IleArg: 4.067 ± 0.398
3.855IleSer: 3.855 ± 0.426
3.982IleThr: 3.982 ± 0.418
3.77IleVal: 3.77 ± 0.349
0.381IleTrp: 0.381 ± 0.127
1.567IleTyr: 1.567 ± 0.304
0.0IleXaa: 0.0 ± 0.0
Lys
6.905LysAla: 6.905 ± 0.644
0.593LysCys: 0.593 ± 0.209
4.829LysAsp: 4.829 ± 0.498
5.253LysGlu: 5.253 ± 0.495
1.356LysPhe: 1.356 ± 0.189
3.982LysGly: 3.982 ± 0.409
1.059LysHis: 1.059 ± 0.224
3.516LysIle: 3.516 ± 0.457
5.126LysLys: 5.126 ± 0.541
6.396LysLeu: 6.396 ± 0.609
1.228LysMet: 1.228 ± 0.224
2.881LysAsn: 2.881 ± 0.31
3.77LysPro: 3.77 ± 0.501
2.881LysGln: 2.881 ± 0.325
3.516LysArg: 3.516 ± 0.36
3.812LysSer: 3.812 ± 0.376
4.405LysThr: 4.405 ± 0.443
3.431LysVal: 3.431 ± 0.283
0.635LysTrp: 0.635 ± 0.154
1.737LysTyr: 1.737 ± 0.34
0.0LysXaa: 0.0 ± 0.0
Leu
8.091LeuAla: 8.091 ± 0.697
1.144LeuCys: 1.144 ± 0.232
6.947LeuAsp: 6.947 ± 0.453
6.312LeuGlu: 6.312 ± 0.526
3.008LeuPhe: 3.008 ± 0.378
5.337LeuGly: 5.337 ± 0.553
2.16LeuHis: 2.16 ± 0.369
7.201LeuIle: 7.201 ± 0.646
6.693LeuLys: 6.693 ± 0.559
7.244LeuLeu: 7.244 ± 0.745
2.33LeuMet: 2.33 ± 0.341
5.719LeuAsn: 5.719 ± 0.504
2.838LeuPro: 2.838 ± 0.302
3.77LeuGln: 3.77 ± 0.495
4.363LeuArg: 4.363 ± 0.36
5.634LeuSer: 5.634 ± 0.449
6.566LeuThr: 6.566 ± 0.605
5.126LeuVal: 5.126 ± 0.444
0.72LeuTrp: 0.72 ± 0.227
2.796LeuTyr: 2.796 ± 0.298
0.0LeuXaa: 0.0 ± 0.0
Met
2.287MetAla: 2.287 ± 0.296
0.339MetCys: 0.339 ± 0.125
0.678MetAsp: 0.678 ± 0.173
0.89MetGlu: 0.89 ± 0.224
0.974MetPhe: 0.974 ± 0.178
1.186MetGly: 1.186 ± 0.212
0.508MetHis: 0.508 ± 0.149
1.313MetIle: 1.313 ± 0.251
1.271MetLys: 1.271 ± 0.245
2.457MetLeu: 2.457 ± 0.35
0.254MetMet: 0.254 ± 0.101
1.144MetAsn: 1.144 ± 0.196
1.059MetPro: 1.059 ± 0.216
1.101MetGln: 1.101 ± 0.214
1.398MetArg: 1.398 ± 0.243
1.228MetSer: 1.228 ± 0.217
1.737MetThr: 1.737 ± 0.25
1.313MetVal: 1.313 ± 0.269
0.254MetTrp: 0.254 ± 0.119
0.381MetTyr: 0.381 ± 0.114
0.0MetXaa: 0.0 ± 0.0
Asn
4.405AsnAla: 4.405 ± 0.455
0.678AsnCys: 0.678 ± 0.203
3.008AsnAsp: 3.008 ± 0.374
3.643AsnGlu: 3.643 ± 0.425
2.626AsnPhe: 2.626 ± 0.362
3.516AsnGly: 3.516 ± 0.455
0.678AsnHis: 0.678 ± 0.156
2.753AsnIle: 2.753 ± 0.349
2.711AsnLys: 2.711 ± 0.293
4.448AsnLeu: 4.448 ± 0.404
1.313AsnMet: 1.313 ± 0.222
3.05AsnAsn: 3.05 ± 0.481
3.177AsnPro: 3.177 ± 0.377
1.779AsnGln: 1.779 ± 0.27
2.415AsnArg: 2.415 ± 0.259
2.415AsnSer: 2.415 ± 0.353
3.474AsnThr: 3.474 ± 0.394
3.219AsnVal: 3.219 ± 0.409
0.551AsnTrp: 0.551 ± 0.142
1.949AsnTyr: 1.949 ± 0.3
0.0AsnXaa: 0.0 ± 0.0
Pro
1.652ProAla: 1.652 ± 0.235
0.424ProCys: 0.424 ± 0.127
3.304ProAsp: 3.304 ± 0.469
3.812ProGlu: 3.812 ± 0.454
1.101ProPhe: 1.101 ± 0.195
2.415ProGly: 2.415 ± 0.316
0.847ProHis: 0.847 ± 0.21
2.118ProIle: 2.118 ± 0.365
3.135ProLys: 3.135 ± 0.43
3.346ProLeu: 3.346 ± 0.383
0.339ProMet: 0.339 ± 0.122
2.287ProAsn: 2.287 ± 0.34
1.356ProPro: 1.356 ± 0.267
1.44ProGln: 1.44 ± 0.276
1.949ProArg: 1.949 ± 0.404
1.906ProSer: 1.906 ± 0.277
3.05ProThr: 3.05 ± 0.356
3.643ProVal: 3.643 ± 0.482
0.381ProTrp: 0.381 ± 0.132
1.44ProTyr: 1.44 ± 0.284
0.0ProXaa: 0.0 ± 0.0
Gln
3.685GlnAla: 3.685 ± 0.535
0.297GlnCys: 0.297 ± 0.12
1.779GlnAsp: 1.779 ± 0.282
2.542GlnGlu: 2.542 ± 0.433
1.228GlnPhe: 1.228 ± 0.192
2.499GlnGly: 2.499 ± 0.311
0.551GlnHis: 0.551 ± 0.138
2.372GlnIle: 2.372 ± 0.323
2.245GlnLys: 2.245 ± 0.344
4.194GlnLeu: 4.194 ± 0.591
1.017GlnMet: 1.017 ± 0.206
1.44GlnAsn: 1.44 ± 0.245
0.974GlnPro: 0.974 ± 0.248
1.483GlnGln: 1.483 ± 0.337
1.949GlnArg: 1.949 ± 0.405
2.372GlnSer: 2.372 ± 0.336
2.499GlnThr: 2.499 ± 0.315
2.372GlnVal: 2.372 ± 0.339
0.339GlnTrp: 0.339 ± 0.141
1.059GlnTyr: 1.059 ± 0.273
0.0GlnXaa: 0.0 ± 0.0
Arg
4.617ArgAla: 4.617 ± 0.565
0.466ArgCys: 0.466 ± 0.143
2.457ArgAsp: 2.457 ± 0.333
4.405ArgGlu: 4.405 ± 0.727
2.203ArgPhe: 2.203 ± 0.326
3.219ArgGly: 3.219 ± 0.407
0.72ArgHis: 0.72 ± 0.176
3.728ArgIle: 3.728 ± 0.372
3.346ArgLys: 3.346 ± 0.421
4.914ArgLeu: 4.914 ± 0.419
1.059ArgMet: 1.059 ± 0.202
2.753ArgAsn: 2.753 ± 0.323
1.779ArgPro: 1.779 ± 0.281
1.61ArgGln: 1.61 ± 0.314
2.965ArgArg: 2.965 ± 0.461
2.542ArgSer: 2.542 ± 0.324
3.219ArgThr: 3.219 ± 0.379
4.405ArgVal: 4.405 ± 0.59
0.297ArgTrp: 0.297 ± 0.107
2.16ArgTyr: 2.16 ± 0.312
0.0ArgXaa: 0.0 ± 0.0
Ser
4.871SerAla: 4.871 ± 0.451
0.424SerCys: 0.424 ± 0.161
2.838SerAsp: 2.838 ± 0.366
3.982SerGlu: 3.982 ± 0.421
2.16SerPhe: 2.16 ± 0.321
4.617SerGly: 4.617 ± 0.485
1.101SerHis: 1.101 ± 0.199
4.236SerIle: 4.236 ± 0.495
3.897SerLys: 3.897 ± 0.464
4.956SerLeu: 4.956 ± 0.495
1.652SerMet: 1.652 ± 0.211
3.304SerAsn: 3.304 ± 0.485
2.245SerPro: 2.245 ± 0.339
1.525SerGln: 1.525 ± 0.277
2.881SerArg: 2.881 ± 0.31
3.558SerSer: 3.558 ± 0.474
4.363SerThr: 4.363 ± 0.43
3.05SerVal: 3.05 ± 0.333
0.169SerTrp: 0.169 ± 0.072
1.949SerTyr: 1.949 ± 0.329
0.0SerXaa: 0.0 ± 0.0
Thr
5.337ThrAla: 5.337 ± 0.575
0.678ThrCys: 0.678 ± 0.166
4.151ThrAsp: 4.151 ± 0.413
4.49ThrGlu: 4.49 ± 0.398
2.372ThrPhe: 2.372 ± 0.274
4.914ThrGly: 4.914 ± 0.383
1.313ThrHis: 1.313 ± 0.217
4.533ThrIle: 4.533 ± 0.484
3.982ThrLys: 3.982 ± 0.39
6.778ThrLeu: 6.778 ± 0.654
1.271ThrMet: 1.271 ± 0.198
2.796ThrAsn: 2.796 ± 0.338
3.135ThrPro: 3.135 ± 0.375
2.16ThrGln: 2.16 ± 0.275
2.923ThrArg: 2.923 ± 0.338
3.516ThrSer: 3.516 ± 0.401
5.041ThrThr: 5.041 ± 0.499
3.77ThrVal: 3.77 ± 0.455
0.593ThrTrp: 0.593 ± 0.149
1.991ThrTyr: 1.991 ± 0.365
0.0ThrXaa: 0.0 ± 0.0
Val
4.914ValAla: 4.914 ± 0.511
0.847ValCys: 0.847 ± 0.179
3.558ValAsp: 3.558 ± 0.354
4.533ValGlu: 4.533 ± 0.492
2.584ValPhe: 2.584 ± 0.373
3.346ValGly: 3.346 ± 0.397
1.144ValHis: 1.144 ± 0.232
3.643ValIle: 3.643 ± 0.298
4.956ValLys: 4.956 ± 0.466
4.321ValLeu: 4.321 ± 0.408
1.61ValMet: 1.61 ± 0.234
3.601ValAsn: 3.601 ± 0.328
2.16ValPro: 2.16 ± 0.32
1.694ValGln: 1.694 ± 0.316
3.643ValArg: 3.643 ± 0.357
4.871ValSer: 4.871 ± 0.46
4.575ValThr: 4.575 ± 0.514
3.685ValVal: 3.685 ± 0.563
0.297ValTrp: 0.297 ± 0.112
1.991ValTyr: 1.991 ± 0.284
0.0ValXaa: 0.0 ± 0.0
Trp
0.678TrpAla: 0.678 ± 0.163
0.085TrpCys: 0.085 ± 0.053
0.381TrpAsp: 0.381 ± 0.142
0.297TrpGlu: 0.297 ± 0.093
0.085TrpPhe: 0.085 ± 0.058
0.678TrpGly: 0.678 ± 0.193
0.127TrpHis: 0.127 ± 0.102
0.72TrpIle: 0.72 ± 0.19
0.339TrpLys: 0.339 ± 0.124
0.678TrpLeu: 0.678 ± 0.18
0.254TrpMet: 0.254 ± 0.096
0.508TrpAsn: 0.508 ± 0.127
0.042TrpPro: 0.042 ± 0.049
0.381TrpGln: 0.381 ± 0.137
0.593TrpArg: 0.593 ± 0.176
0.466TrpSer: 0.466 ± 0.128
0.339TrpThr: 0.339 ± 0.133
0.593TrpVal: 0.593 ± 0.188
0.127TrpTrp: 0.127 ± 0.075
0.254TrpTyr: 0.254 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.008TyrAla: 3.008 ± 0.355
0.466TyrCys: 0.466 ± 0.133
2.415TyrAsp: 2.415 ± 0.304
2.076TyrGlu: 2.076 ± 0.282
0.974TyrPhe: 0.974 ± 0.202
2.16TyrGly: 2.16 ± 0.291
1.186TyrHis: 1.186 ± 0.222
1.525TyrIle: 1.525 ± 0.268
2.033TyrLys: 2.033 ± 0.309
3.516TyrLeu: 3.516 ± 0.403
1.017TyrMet: 1.017 ± 0.216
2.118TyrAsn: 2.118 ± 0.372
1.313TyrPro: 1.313 ± 0.281
1.313TyrGln: 1.313 ± 0.25
1.821TyrArg: 1.821 ± 0.241
1.652TyrSer: 1.652 ± 0.341
2.118TyrThr: 2.118 ± 0.341
1.567TyrVal: 1.567 ± 0.26
0.297TyrTrp: 0.297 ± 0.109
1.228TyrTyr: 1.228 ± 0.268
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 111 proteins (23608 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski