Amino acid dipeptide frequency for Vibrio phage VBM1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
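As an illustration of the calculation described above, here is a minimal Python sketch that counts adjacent residue pairs and reports them in per mille. The function name `dipeptide_permille` and the toy sequences are hypothetical, the sequences use one-letter codes rather than the three-letter codes of the table, and normalizing by the total number of counted pairs is an assumption, since the exact normalization used by Proteome-pI is not stated here.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Pairs are counted within each protein, never across protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    # Assumption: normalize by the total number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not the real Vibrio phage VBM1 proteome.
toy_proteins = ["MKLAAK", "AAGK"]
freqs = dipeptide_permille(toy_proteins)

# Uniform expectation for the 20 x 20 standard dipeptides: 2.5 per mille.
uniform_expectation = 1000.0 / 400
```

With real data, the input would be the protein sequences of the proteome, and each resulting value could be compared against `uniform_expectation`.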

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
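The unit conversion, and the comparison against the uniform expectation, can be sketched as follows. The helper names are hypothetical, and using the uniform value of 2.5‰ as the only threshold is an assumption about how the red/blue marking is decided.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 0.25% expressed in per mille


def permille_to_fraction(value_permille):
    """Convert a per mille value from the table into a plain fraction."""
    return value_permille * 1e-3


def representation(value_permille, expected=UNIFORM_PERMILLE):
    """Classify a dipeptide relative to the assumed uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Values taken from the table: AlaAla = 4.632, CysCys = 0.165 (both in per mille).
ala_ala_fraction = permille_to_fraction(4.632)
ala_ala_class = representation(4.632)
cys_cys_class = representation(0.165)
```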

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.632 ± 0.904
AlaCys: 0.744 ± 0.27
AlaAsp: 3.805 ± 0.388
AlaGlu: 4.136 ± 0.691
AlaPhe: 2.647 ± 0.413
AlaGly: 4.549 ± 0.81
AlaHis: 0.993 ± 0.278
AlaIle: 4.549 ± 0.67
AlaLys: 5.045 ± 0.736
AlaLeu: 5.542 ± 0.719
AlaMet: 1.654 ± 0.514
AlaAsn: 3.143 ± 0.495
AlaPro: 1.985 ± 0.449
AlaGln: 2.481 ± 0.745
AlaArg: 1.985 ± 0.401
AlaSer: 5.624 ± 0.908
AlaThr: 4.218 ± 0.688
AlaVal: 6.369 ± 0.905
AlaTrp: 0.91 ± 0.265
AlaTyr: 1.902 ± 0.298
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.331 ± 0.132
CysCys: 0.165 ± 0.124
CysAsp: 0.827 ± 0.261
CysGlu: 1.323 ± 0.297
CysPhe: 0.414 ± 0.177
CysGly: 0.496 ± 0.195
CysHis: 0.248 ± 0.161
CysIle: 0.414 ± 0.168
CysLys: 0.414 ± 0.194
CysLeu: 0.579 ± 0.169
CysMet: 0.165 ± 0.103
CysAsn: 0.579 ± 0.26
CysPro: 0.331 ± 0.154
CysGln: 0.414 ± 0.169
CysArg: 0.827 ± 0.209
CysSer: 0.744 ± 0.287
CysThr: 0.248 ± 0.125
CysVal: 0.744 ± 0.297
CysTrp: 0.331 ± 0.167
CysTyr: 0.496 ± 0.198
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.211 ± 0.638
AspCys: 0.662 ± 0.203
AspAsp: 3.309 ± 0.414
AspGlu: 5.294 ± 0.906
AspPhe: 3.557 ± 0.545
AspGly: 5.045 ± 0.727
AspHis: 1.075 ± 0.234
AspIle: 3.888 ± 0.421
AspLys: 4.549 ± 0.556
AspLeu: 4.797 ± 0.662
AspMet: 1.572 ± 0.335
AspAsn: 3.309 ± 0.466
AspPro: 2.316 ± 0.371
AspGln: 2.151 ± 0.336
AspArg: 2.068 ± 0.451
AspSer: 4.797 ± 0.58
AspThr: 3.474 ± 0.477
AspVal: 4.136 ± 0.523
AspTrp: 1.489 ± 0.389
AspTyr: 1.737 ± 0.311
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.797 ± 0.553
GluCys: 0.496 ± 0.234
GluAsp: 3.639 ± 0.522
GluGlu: 4.88 ± 0.943
GluPhe: 3.97 ± 0.89
GluGly: 3.557 ± 0.721
GluHis: 0.827 ± 0.215
GluIle: 5.211 ± 0.814
GluLys: 5.707 ± 0.879
GluLeu: 7.692 ± 0.841
GluMet: 1.737 ± 0.443
GluAsn: 4.136 ± 0.61
GluPro: 2.481 ± 0.705
GluGln: 3.474 ± 0.479
GluArg: 3.143 ± 0.561
GluSer: 5.376 ± 0.632
GluThr: 3.97 ± 0.49
GluVal: 4.632 ± 0.624
GluTrp: 1.158 ± 0.351
GluTyr: 2.564 ± 0.522
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.812 ± 0.442
PheCys: 0.662 ± 0.235
PheAsp: 3.391 ± 0.487
PheGlu: 2.73 ± 0.441
PhePhe: 1.654 ± 0.372
PheGly: 3.391 ± 0.599
PheHis: 0.579 ± 0.21
PheIle: 3.226 ± 0.567
PheLys: 3.639 ± 0.57
PheLeu: 2.812 ± 0.468
PheMet: 1.406 ± 0.316
PheAsn: 2.481 ± 0.53
PhePro: 1.075 ± 0.325
PheGln: 1.158 ± 0.294
PheArg: 1.406 ± 0.295
PheSer: 4.384 ± 0.742
PheThr: 2.233 ± 0.364
PheVal: 2.481 ± 0.525
PheTrp: 0.579 ± 0.218
PheTyr: 1.241 ± 0.3
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.797 ± 0.718
GlyCys: 0.414 ± 0.174
GlyAsp: 4.549 ± 0.657
GlyGlu: 4.963 ± 0.438
GlyPhe: 4.301 ± 0.859
GlyGly: 5.045 ± 0.816
GlyHis: 0.579 ± 0.204
GlyIle: 4.218 ± 0.702
GlyLys: 4.715 ± 0.722
GlyLeu: 4.797 ± 0.566
GlyMet: 2.151 ± 0.364
GlyAsn: 3.226 ± 0.581
GlyPro: 0.331 ± 0.154
GlyGln: 2.316 ± 0.316
GlyArg: 4.301 ± 0.593
GlySer: 5.542 ± 0.866
GlyThr: 3.309 ± 0.52
GlyVal: 5.294 ± 0.547
GlyTrp: 0.496 ± 0.21
GlyTyr: 3.639 ± 0.539
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.744 ± 0.23
HisCys: 0.331 ± 0.169
HisAsp: 0.827 ± 0.273
HisGlu: 0.993 ± 0.327
HisPhe: 0.414 ± 0.178
HisGly: 0.91 ± 0.309
HisHis: 0.496 ± 0.206
HisIle: 0.662 ± 0.278
HisLys: 1.241 ± 0.297
HisLeu: 1.572 ± 0.413
HisMet: 0.083 ± 0.084
HisAsn: 0.496 ± 0.205
HisPro: 0.579 ± 0.229
HisGln: 0.993 ± 0.279
HisArg: 0.744 ± 0.221
HisSer: 0.993 ± 0.246
HisThr: 0.993 ± 0.254
HisVal: 0.744 ± 0.186
HisTrp: 0.165 ± 0.108
HisTyr: 0.662 ± 0.197
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.053 ± 0.629
IleCys: 0.91 ± 0.303
IleAsp: 4.384 ± 0.533
IleGlu: 5.873 ± 0.779
IlePhe: 1.985 ± 0.612
IleGly: 3.888 ± 0.715
IleHis: 0.744 ± 0.218
IleIle: 3.06 ± 0.482
IleLys: 5.873 ± 0.579
IleLeu: 5.211 ± 0.608
IleMet: 1.158 ± 0.27
IleAsn: 4.632 ± 0.59
IlePro: 2.73 ± 0.379
IleGln: 2.647 ± 0.364
IleArg: 2.564 ± 0.383
IleSer: 3.97 ± 0.627
IleThr: 4.88 ± 0.548
IleVal: 2.151 ± 0.403
IleTrp: 0.744 ± 0.217
IleTyr: 2.316 ± 0.45
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.624 ± 0.862
LysCys: 0.91 ± 0.253
LysAsp: 3.639 ± 0.6
LysGlu: 5.294 ± 0.714
LysPhe: 3.888 ± 0.533
LysGly: 4.218 ± 0.491
LysHis: 1.406 ± 0.461
LysIle: 4.963 ± 0.626
LysLys: 4.053 ± 0.865
LysLeu: 6.203 ± 0.682
LysMet: 1.737 ± 0.386
LysAsn: 3.557 ± 0.453
LysPro: 4.053 ± 0.503
LysGln: 3.474 ± 0.534
LysArg: 2.895 ± 0.638
LysSer: 4.88 ± 0.719
LysThr: 3.97 ± 0.474
LysVal: 3.391 ± 0.497
LysTrp: 0.827 ± 0.243
LysTyr: 2.481 ± 0.468
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.549 ± 0.36
LeuCys: 0.662 ± 0.234
LeuAsp: 5.624 ± 0.646
LeuGlu: 6.038 ± 0.801
LeuPhe: 2.068 ± 0.415
LeuGly: 5.542 ± 0.769
LeuHis: 1.075 ± 0.312
LeuIle: 6.121 ± 0.633
LeuLys: 6.369 ± 0.887
LeuLeu: 5.542 ± 0.644
LeuMet: 1.82 ± 0.435
LeuAsn: 4.384 ± 0.45
LeuPro: 3.474 ± 0.417
LeuGln: 3.143 ± 0.518
LeuArg: 3.888 ± 0.562
LeuSer: 6.534 ± 0.642
LeuThr: 5.707 ± 0.496
LeuVal: 4.963 ± 0.571
LeuTrp: 0.91 ± 0.292
LeuTyr: 2.233 ± 0.496
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.151 ± 0.378
MetCys: 0.165 ± 0.125
MetAsp: 0.993 ± 0.322
MetGlu: 1.323 ± 0.285
MetPhe: 0.662 ± 0.227
MetGly: 1.075 ± 0.301
MetHis: 0.414 ± 0.155
MetIle: 1.737 ± 0.515
MetLys: 2.068 ± 0.373
MetLeu: 1.737 ± 0.384
MetMet: 0.414 ± 0.227
MetAsn: 1.489 ± 0.288
MetPro: 0.744 ± 0.248
MetGln: 0.993 ± 0.324
MetArg: 1.489 ± 0.354
MetSer: 2.812 ± 0.451
MetThr: 1.241 ± 0.316
MetVal: 1.241 ± 0.34
MetTrp: 0.248 ± 0.145
MetTyr: 0.662 ± 0.217
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.97 ± 0.718
AsnCys: 0.579 ± 0.211
AsnAsp: 3.474 ± 0.514
AsnGlu: 3.143 ± 0.502
AsnPhe: 1.82 ± 0.343
AsnGly: 5.128 ± 0.757
AsnHis: 0.579 ± 0.207
AsnIle: 2.564 ± 0.42
AsnLys: 2.895 ± 0.462
AsnLeu: 4.136 ± 0.513
AsnMet: 1.406 ± 0.382
AsnAsn: 2.481 ± 0.462
AsnPro: 3.391 ± 0.492
AsnGln: 2.481 ± 0.458
AsnArg: 2.068 ± 0.383
AsnSer: 4.384 ± 0.577
AsnThr: 2.151 ± 0.572
AsnVal: 3.226 ± 0.484
AsnTrp: 0.579 ± 0.182
AsnTyr: 1.489 ± 0.316
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.399 ± 0.39
ProCys: 0.165 ± 0.104
ProAsp: 2.564 ± 0.478
ProGlu: 4.053 ± 0.819
ProPhe: 1.572 ± 0.384
ProGly: 1.489 ± 0.398
ProHis: 0.496 ± 0.193
ProIle: 2.564 ± 0.48
ProLys: 2.316 ± 0.5
ProLeu: 1.902 ± 0.348
ProMet: 0.744 ± 0.29
ProAsn: 1.985 ± 0.452
ProPro: 0.744 ± 0.265
ProGln: 0.993 ± 0.32
ProArg: 0.993 ± 0.287
ProSer: 3.143 ± 0.5
ProThr: 2.812 ± 0.392
ProVal: 3.06 ± 0.416
ProTrp: 0.083 ± 0.079
ProTyr: 1.241 ± 0.359
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.647 ± 0.45
GlnCys: 0.083 ± 0.083
GlnAsp: 1.737 ± 0.335
GlnGlu: 3.226 ± 0.568
GlnPhe: 1.654 ± 0.279
GlnGly: 1.902 ± 0.406
GlnHis: 0.165 ± 0.114
GlnIle: 2.481 ± 0.561
GlnLys: 2.812 ± 0.454
GlnLeu: 3.557 ± 0.549
GlnMet: 0.827 ± 0.292
GlnAsn: 1.985 ± 0.441
GlnPro: 1.241 ± 0.301
GlnGln: 2.399 ± 0.527
GlnArg: 1.572 ± 0.334
GlnSer: 3.143 ± 0.687
GlnThr: 2.481 ± 0.412
GlnVal: 2.647 ± 0.672
GlnTrp: 0.662 ± 0.283
GlnTyr: 2.151 ± 0.416
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.647 ± 0.367
ArgCys: 0.662 ± 0.266
ArgAsp: 2.647 ± 0.53
ArgGlu: 2.73 ± 0.546
ArgPhe: 2.151 ± 0.439
ArgGly: 2.978 ± 0.573
ArgHis: 0.662 ± 0.234
ArgIle: 2.481 ± 0.526
ArgLys: 2.812 ± 0.548
ArgLeu: 3.557 ± 0.506
ArgMet: 1.406 ± 0.35
ArgAsn: 2.812 ± 0.386
ArgPro: 0.993 ± 0.305
ArgGln: 1.82 ± 0.301
ArgArg: 1.902 ± 0.5
ArgSer: 2.73 ± 0.409
ArgThr: 1.737 ± 0.361
ArgVal: 3.226 ± 0.644
ArgTrp: 0.414 ± 0.19
ArgTyr: 1.985 ± 0.433
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.549 ± 0.787
SerCys: 0.579 ± 0.288
SerAsp: 6.121 ± 0.597
SerGlu: 6.782 ± 0.763
SerPhe: 2.812 ± 0.524
SerGly: 6.286 ± 0.652
SerHis: 1.323 ± 0.309
SerIle: 5.624 ± 0.667
SerLys: 6.203 ± 0.706
SerLeu: 6.865 ± 0.756
SerMet: 1.902 ± 0.389
SerAsn: 3.474 ± 0.501
SerPro: 2.812 ± 0.507
SerGln: 2.812 ± 0.454
SerArg: 2.978 ± 0.562
SerSer: 6.452 ± 0.824
SerThr: 3.888 ± 0.568
SerVal: 5.376 ± 0.737
SerTrp: 0.662 ± 0.255
SerTyr: 1.985 ± 0.446
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.474 ± 0.655
ThrCys: 0.662 ± 0.191
ThrAsp: 3.226 ± 0.614
ThrGlu: 4.136 ± 0.497
ThrPhe: 2.316 ± 0.542
ThrGly: 5.211 ± 0.775
ThrHis: 1.241 ± 0.329
ThrIle: 4.632 ± 0.559
ThrLys: 3.888 ± 0.505
ThrLeu: 4.963 ± 0.738
ThrMet: 0.827 ± 0.263
ThrAsn: 2.895 ± 0.516
ThrPro: 3.309 ± 0.561
ThrGln: 1.985 ± 0.374
ThrArg: 1.82 ± 0.336
ThrSer: 4.136 ± 0.536
ThrThr: 3.391 ± 0.643
ThrVal: 4.136 ± 0.61
ThrTrp: 0.496 ± 0.17
ThrTyr: 1.489 ± 0.295
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.797 ± 0.648
ValCys: 0.414 ± 0.16
ValAsp: 5.873 ± 0.669
ValGlu: 3.391 ± 0.544
ValPhe: 2.812 ± 0.447
ValGly: 4.963 ± 1.014
ValHis: 0.744 ± 0.247
ValIle: 3.309 ± 0.562
ValLys: 3.805 ± 0.533
ValLeu: 5.128 ± 0.837
ValMet: 1.489 ± 0.358
ValAsn: 2.73 ± 0.373
ValPro: 1.489 ± 0.367
ValGln: 1.985 ± 0.408
ValArg: 2.895 ± 0.409
ValSer: 6.617 ± 0.751
ValThr: 4.549 ± 0.692
ValVal: 4.549 ± 0.708
ValTrp: 0.91 ± 0.325
ValTyr: 2.647 ± 0.442
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.993 ± 0.273
TrpCys: 0.165 ± 0.124
TrpAsp: 0.662 ± 0.353
TrpGlu: 1.075 ± 0.262
TrpPhe: 0.827 ± 0.287
TrpGly: 0.496 ± 0.256
TrpHis: 0.248 ± 0.159
TrpIle: 0.248 ± 0.116
TrpLys: 1.323 ± 0.297
TrpLeu: 1.158 ± 0.255
TrpMet: 0.248 ± 0.138
TrpAsn: 0.662 ± 0.191
TrpPro: 0.579 ± 0.24
TrpGln: 0.331 ± 0.159
TrpArg: 0.496 ± 0.215
TrpSer: 0.744 ± 0.32
TrpThr: 0.91 ± 0.271
TrpVal: 0.579 ± 0.202
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.496 ± 0.183
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.737 ± 0.411
TyrCys: 0.662 ± 0.266
TyrAsp: 3.143 ± 0.572
TyrGlu: 2.068 ± 0.45
TyrPhe: 1.82 ± 0.458
TyrGly: 2.812 ± 0.545
TyrHis: 0.744 ± 0.333
TyrIle: 1.82 ± 0.413
TyrLys: 1.737 ± 0.445
TyrLeu: 2.895 ± 0.533
TyrMet: 0.827 ± 0.253
TyrAsn: 1.489 ± 0.334
TyrPro: 0.662 ± 0.244
TyrGln: 1.323 ± 0.275
TyrArg: 2.399 ± 0.462
TyrSer: 2.481 ± 0.356
TyrThr: 2.068 ± 0.389
TyrVal: 2.151 ± 0.333
TyrTrp: 0.579 ± 0.254
TyrTyr: 1.158 ± 0.281
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (12091 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
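A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names are hypothetical, whole proteins are resampled with replacement, and the spread is reported as a population standard deviation over the resamples. Which spread measure Proteome-pI actually reports after "±" is not stated here.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over all adjacent pairs."""
    count = sum(
        1 for seq in proteins for i in range(len(seq) - 1) if seq[i:i + 2] == pair
    )
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread of the estimate."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the standard deviation across resamples.
    return statistics.pstdev(estimates)


# Hypothetical toy proteins in one-letter code, not the real proteome.
toy_proteins = ["MKLAAK", "AAGK", "GGGG"]
toy_error = bootstrap_error(toy_proteins, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides are never formed across protein boundaries.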

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski