Amino acid dipeptide frequency for Vibrio virus 2019VC1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
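As an illustration of what such a calculation involves, the sketch below counts overlapping adjacent residue pairs in a set of protein sequences and reports them in per mille. The function name, the toy sequences, and the choice to normalise by the total number of counted pairs are assumptions made for this example; the exact normalisation used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: frequencies are normalised by the total number of
    overlapping pairs counted across all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (one-letter codes), not taken from the real proteome.
toy_proteins = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy_proteins)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

In the toy input, the pair KK occurs twice among six counted pairs, so it accounts for one third of the total. No pair is formed from the last residue of one protein and the first residue of the next.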

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
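The comparison against the uniform expectation can be sketched as follows. With 20 amino acids there are 20 × 20 = 400 ordered pairs, so each would have a frequency of 1/400 = 0.25%, i.e. 2.5‰; with Xaa included the baseline drops to 1/441, roughly 2.27‰. The function names are hypothetical, and the simple greater-than/less-than rule is an assumption: the page does not say whether the colouring also takes the error estimate into account.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expected frequency (assumed rule)."""
    if freq_per_mille > expected_per_mille:
        return "over"    # would be marked red
    if freq_per_mille < expected_per_mille:
        return "under"   # would be marked blue
    return "as expected"


expected = uniform_expectation_per_mille()  # 400 possible dipeptides

# Two values taken from the table for this proteome (in per mille).
observed = {"AlaAla": 6.582, "GlyPro": 0.076}
labels = {name: classify(value, expected) for name, value in observed.items()}
print(expected, labels)
```

Under this rule AlaAla (6.582‰) falls above the 2.5‰ baseline and GlyPro (0.076‰) far below it.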

All values are presented in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.582AlaAla: 6.582 ± 0.962
0.681AlaCys: 0.681 ± 0.34
3.858AlaAsp: 3.858 ± 0.531
4.918AlaGlu: 4.918 ± 0.836
3.102AlaPhe: 3.102 ± 0.552
5.598AlaGly: 5.598 ± 0.843
1.21AlaHis: 1.21 ± 0.289
5.447AlaIle: 5.447 ± 0.82
5.901AlaLys: 5.901 ± 0.93
7.565AlaLeu: 7.565 ± 1.019
2.194AlaMet: 2.194 ± 0.527
3.253AlaAsn: 3.253 ± 0.61
2.345AlaPro: 2.345 ± 0.385
4.615AlaGln: 4.615 ± 0.828
4.918AlaArg: 4.918 ± 0.73
4.615AlaSer: 4.615 ± 0.77
4.388AlaThr: 4.388 ± 0.619
5.598AlaVal: 5.598 ± 0.815
0.908AlaTrp: 0.908 ± 0.314
2.724AlaTyr: 2.724 ± 0.462
0.0AlaXaa: 0.0 ± 0.0
Cys
1.059CysAla: 1.059 ± 0.283
0.076CysCys: 0.076 ± 0.065
0.681CysAsp: 0.681 ± 0.177
1.059CysGlu: 1.059 ± 0.354
0.605CysPhe: 0.605 ± 0.221
0.908CysGly: 0.908 ± 0.322
0.151CysHis: 0.151 ± 0.101
0.832CysIle: 0.832 ± 0.31
0.908CysLys: 0.908 ± 0.302
1.059CysLeu: 1.059 ± 0.376
0.454CysMet: 0.454 ± 0.142
0.757CysAsn: 0.757 ± 0.249
0.227CysPro: 0.227 ± 0.17
0.303CysGln: 0.303 ± 0.185
0.605CysArg: 0.605 ± 0.19
0.681CysSer: 0.681 ± 0.281
0.757CysThr: 0.757 ± 0.386
0.832CysVal: 0.832 ± 0.362
0.303CysTrp: 0.303 ± 0.132
0.984CysTyr: 0.984 ± 0.404
0.0CysXaa: 0.0 ± 0.0
Asp
3.631AspAla: 3.631 ± 0.602
0.454AspCys: 0.454 ± 0.237
3.48AspAsp: 3.48 ± 0.628
3.404AspGlu: 3.404 ± 0.55
2.27AspPhe: 2.27 ± 0.402
5.523AspGly: 5.523 ± 0.923
0.605AspHis: 0.605 ± 0.219
3.177AspIle: 3.177 ± 0.683
4.161AspLys: 4.161 ± 0.558
4.539AspLeu: 4.539 ± 0.609
1.286AspMet: 1.286 ± 0.398
2.421AspAsn: 2.421 ± 0.549
1.816AspPro: 1.816 ± 0.527
1.816AspGln: 1.816 ± 0.354
2.724AspArg: 2.724 ± 0.505
2.27AspSer: 2.27 ± 0.427
3.177AspThr: 3.177 ± 0.674
1.967AspVal: 1.967 ± 0.395
0.984AspTrp: 0.984 ± 0.293
1.967AspTyr: 1.967 ± 0.397
0.0AspXaa: 0.0 ± 0.0
Glu
5.598GluAla: 5.598 ± 0.839
1.286GluCys: 1.286 ± 0.348
2.345GluAsp: 2.345 ± 0.458
3.404GluGlu: 3.404 ± 0.624
2.572GluPhe: 2.572 ± 0.482
3.707GluGly: 3.707 ± 0.643
1.664GluHis: 1.664 ± 0.433
4.842GluIle: 4.842 ± 0.738
4.691GluLys: 4.691 ± 0.796
5.825GluLeu: 5.825 ± 0.763
2.724GluMet: 2.724 ± 0.468
3.631GluAsn: 3.631 ± 0.489
1.21GluPro: 1.21 ± 0.293
2.572GluGln: 2.572 ± 0.423
2.194GluArg: 2.194 ± 0.428
4.691GluSer: 4.691 ± 0.761
3.631GluThr: 3.631 ± 0.652
3.934GluVal: 3.934 ± 0.695
1.362GluTrp: 1.362 ± 0.432
2.572GluTyr: 2.572 ± 0.483
0.0GluXaa: 0.0 ± 0.0
Phe
3.026PheAla: 3.026 ± 0.605
0.832PheCys: 0.832 ± 0.333
2.572PheAsp: 2.572 ± 0.646
2.118PheGlu: 2.118 ± 0.605
0.984PhePhe: 0.984 ± 0.326
3.253PheGly: 3.253 ± 0.459
0.984PheHis: 0.984 ± 0.319
2.648PheIle: 2.648 ± 0.433
2.724PheLys: 2.724 ± 0.473
2.951PheLeu: 2.951 ± 0.531
0.681PheMet: 0.681 ± 0.222
1.74PheAsn: 1.74 ± 0.419
1.74PhePro: 1.74 ± 0.474
1.059PheGln: 1.059 ± 0.306
2.043PheArg: 2.043 ± 0.38
2.875PheSer: 2.875 ± 0.493
2.043PheThr: 2.043 ± 0.512
2.572PheVal: 2.572 ± 0.597
0.984PheTrp: 0.984 ± 0.295
0.908PheTyr: 0.908 ± 0.263
0.0PheXaa: 0.0 ± 0.0
Gly
5.674GlyAla: 5.674 ± 0.795
0.908GlyCys: 0.908 ± 0.289
4.539GlyAsp: 4.539 ± 0.784
4.766GlyGlu: 4.766 ± 0.592
2.724GlyPhe: 2.724 ± 0.519
4.993GlyGly: 4.993 ± 0.76
1.362GlyHis: 1.362 ± 0.435
4.464GlyIle: 4.464 ± 0.691
5.523GlyLys: 5.523 ± 0.66
4.993GlyLeu: 4.993 ± 0.622
2.648GlyMet: 2.648 ± 0.447
3.934GlyAsn: 3.934 ± 0.599
0.076GlyPro: 0.076 ± 0.07
2.194GlyGln: 2.194 ± 0.599
3.253GlyArg: 3.253 ± 0.543
3.404GlySer: 3.404 ± 0.539
2.799GlyThr: 2.799 ± 0.441
5.144GlyVal: 5.144 ± 0.688
0.984GlyTrp: 0.984 ± 0.25
2.118GlyTyr: 2.118 ± 0.477
0.0GlyXaa: 0.0 ± 0.0
His
1.286HisAla: 1.286 ± 0.365
0.151HisCys: 0.151 ± 0.101
0.605HisAsp: 0.605 ± 0.213
1.513HisGlu: 1.513 ± 0.407
1.135HisPhe: 1.135 ± 0.381
1.74HisGly: 1.74 ± 0.43
1.059HisHis: 1.059 ± 0.343
1.21HisIle: 1.21 ± 0.389
1.437HisLys: 1.437 ± 0.502
1.513HisLeu: 1.513 ± 0.523
0.454HisMet: 0.454 ± 0.171
0.681HisAsn: 0.681 ± 0.234
1.21HisPro: 1.21 ± 0.475
0.832HisGln: 0.832 ± 0.26
1.135HisArg: 1.135 ± 0.388
1.74HisSer: 1.74 ± 0.469
1.21HisThr: 1.21 ± 0.448
1.059HisVal: 1.059 ± 0.333
0.303HisTrp: 0.303 ± 0.159
1.059HisTyr: 1.059 ± 0.314
0.0HisXaa: 0.0 ± 0.0
Ile
6.052IleAla: 6.052 ± 0.779
1.135IleCys: 1.135 ± 0.415
4.085IleAsp: 4.085 ± 0.842
4.993IleGlu: 4.993 ± 0.656
2.27IlePhe: 2.27 ± 0.493
3.556IleGly: 3.556 ± 0.708
0.908IleHis: 0.908 ± 0.34
3.934IleIle: 3.934 ± 0.496
4.085IleLys: 4.085 ± 0.687
4.312IleLeu: 4.312 ± 0.695
1.513IleMet: 1.513 ± 0.401
3.026IleAsn: 3.026 ± 0.652
3.177IlePro: 3.177 ± 0.705
2.27IleGln: 2.27 ± 0.491
3.404IleArg: 3.404 ± 0.495
4.615IleSer: 4.615 ± 0.589
6.052IleThr: 6.052 ± 0.863
4.539IleVal: 4.539 ± 0.728
0.908IleTrp: 0.908 ± 0.308
2.194IleTyr: 2.194 ± 0.438
0.0IleXaa: 0.0 ± 0.0
Lys
6.279LysAla: 6.279 ± 0.926
0.605LysCys: 0.605 ± 0.217
3.329LysAsp: 3.329 ± 0.59
3.707LysGlu: 3.707 ± 0.615
1.664LysPhe: 1.664 ± 0.315
3.783LysGly: 3.783 ± 0.741
1.816LysHis: 1.816 ± 0.498
3.48LysIle: 3.48 ± 0.659
3.783LysLys: 3.783 ± 0.71
6.128LysLeu: 6.128 ± 1.094
2.875LysMet: 2.875 ± 0.544
2.118LysAsn: 2.118 ± 0.594
3.329LysPro: 3.329 ± 0.601
3.707LysGln: 3.707 ± 0.933
5.977LysArg: 5.977 ± 1.046
6.052LysSer: 6.052 ± 1.109
4.842LysThr: 4.842 ± 0.73
5.75LysVal: 5.75 ± 0.841
1.513LysTrp: 1.513 ± 0.352
2.27LysTyr: 2.27 ± 0.582
0.0LysXaa: 0.0 ± 0.0
Leu
6.204LeuAla: 6.204 ± 0.812
0.908LeuCys: 0.908 ± 0.298
3.253LeuAsp: 3.253 ± 0.554
4.918LeuGlu: 4.918 ± 0.813
2.875LeuPhe: 2.875 ± 0.664
3.253LeuGly: 3.253 ± 0.78
1.589LeuHis: 1.589 ± 0.366
4.918LeuIle: 4.918 ± 0.77
6.355LeuLys: 6.355 ± 1.004
5.523LeuLeu: 5.523 ± 1.229
3.253LeuMet: 3.253 ± 0.787
3.404LeuAsn: 3.404 ± 0.723
3.177LeuPro: 3.177 ± 0.695
2.648LeuGln: 2.648 ± 0.629
6.052LeuArg: 6.052 ± 0.875
7.414LeuSer: 7.414 ± 0.793
5.825LeuThr: 5.825 ± 0.86
4.766LeuVal: 4.766 ± 0.923
0.605LeuTrp: 0.605 ± 0.226
2.648LeuTyr: 2.648 ± 0.803
0.0LeuXaa: 0.0 ± 0.0
Met
2.497MetAla: 2.497 ± 0.513
0.227MetCys: 0.227 ± 0.13
1.664MetAsp: 1.664 ± 0.506
1.513MetGlu: 1.513 ± 0.278
1.135MetPhe: 1.135 ± 0.312
1.21MetGly: 1.21 ± 0.356
0.757MetHis: 0.757 ± 0.261
1.891MetIle: 1.891 ± 0.396
2.497MetLys: 2.497 ± 0.535
1.967MetLeu: 1.967 ± 0.412
0.757MetMet: 0.757 ± 0.237
1.362MetAsn: 1.362 ± 0.295
0.984MetPro: 0.984 ± 0.322
1.286MetGln: 1.286 ± 0.3
3.177MetArg: 3.177 ± 0.787
2.118MetSer: 2.118 ± 0.363
2.27MetThr: 2.27 ± 0.429
1.589MetVal: 1.589 ± 0.316
0.227MetTrp: 0.227 ± 0.126
0.681MetTyr: 0.681 ± 0.229
0.0MetXaa: 0.0 ± 0.0
Asn
3.556AsnAla: 3.556 ± 0.631
0.454AsnCys: 0.454 ± 0.157
1.891AsnAsp: 1.891 ± 0.465
2.421AsnGlu: 2.421 ± 0.386
2.118AsnPhe: 2.118 ± 0.651
4.237AsnGly: 4.237 ± 0.616
0.681AsnHis: 0.681 ± 0.313
2.648AsnIle: 2.648 ± 0.578
4.085AsnLys: 4.085 ± 0.495
3.404AsnLeu: 3.404 ± 0.939
1.891AsnMet: 1.891 ± 0.391
2.118AsnAsn: 2.118 ± 0.467
1.513AsnPro: 1.513 ± 0.472
1.437AsnGln: 1.437 ± 0.327
2.118AsnArg: 2.118 ± 0.631
2.345AsnSer: 2.345 ± 0.492
2.118AsnThr: 2.118 ± 0.482
3.934AsnVal: 3.934 ± 0.908
0.832AsnTrp: 0.832 ± 0.21
1.513AsnTyr: 1.513 ± 0.33
0.0AsnXaa: 0.0 ± 0.0
Pro
4.01ProAla: 4.01 ± 0.653
0.908ProCys: 0.908 ± 0.308
2.648ProAsp: 2.648 ± 0.634
3.177ProGlu: 3.177 ± 0.67
1.362ProPhe: 1.362 ± 0.317
1.362ProGly: 1.362 ± 0.479
0.605ProHis: 0.605 ± 0.213
2.27ProIle: 2.27 ± 0.432
1.362ProLys: 1.362 ± 0.433
2.118ProLeu: 2.118 ± 0.402
1.059ProMet: 1.059 ± 0.417
1.286ProAsn: 1.286 ± 0.545
1.437ProPro: 1.437 ± 0.459
1.21ProGln: 1.21 ± 0.319
1.891ProArg: 1.891 ± 0.581
1.891ProSer: 1.891 ± 0.425
2.194ProThr: 2.194 ± 0.604
2.875ProVal: 2.875 ± 0.564
0.378ProTrp: 0.378 ± 0.176
0.605ProTyr: 0.605 ± 0.218
0.0ProXaa: 0.0 ± 0.0
Gln
2.27GlnAla: 2.27 ± 0.612
0.757GlnCys: 0.757 ± 0.268
2.421GlnAsp: 2.421 ± 0.439
2.421GlnGlu: 2.421 ± 0.604
1.362GlnPhe: 1.362 ± 0.309
2.043GlnGly: 2.043 ± 0.422
1.059GlnHis: 1.059 ± 0.443
3.177GlnIle: 3.177 ± 0.621
2.648GlnLys: 2.648 ± 0.69
3.631GlnLeu: 3.631 ± 0.773
1.135GlnMet: 1.135 ± 0.258
1.589GlnAsn: 1.589 ± 0.368
1.74GlnPro: 1.74 ± 0.514
2.118GlnGln: 2.118 ± 0.443
2.799GlnArg: 2.799 ± 0.449
1.664GlnSer: 1.664 ± 0.42
1.816GlnThr: 1.816 ± 0.412
1.816GlnVal: 1.816 ± 0.393
0.151GlnTrp: 0.151 ± 0.095
1.891GlnTyr: 1.891 ± 0.432
0.0GlnXaa: 0.0 ± 0.0
Arg
3.707ArgAla: 3.707 ± 0.621
0.681ArgCys: 0.681 ± 0.237
1.967ArgAsp: 1.967 ± 0.576
4.464ArgGlu: 4.464 ± 0.704
2.497ArgPhe: 2.497 ± 0.456
4.615ArgGly: 4.615 ± 0.641
1.362ArgHis: 1.362 ± 0.375
4.464ArgIle: 4.464 ± 0.858
7.414ArgLys: 7.414 ± 1.544
4.312ArgLeu: 4.312 ± 0.697
1.21ArgMet: 1.21 ± 0.341
2.27ArgAsn: 2.27 ± 0.536
1.967ArgPro: 1.967 ± 0.512
1.74ArgGln: 1.74 ± 0.347
4.161ArgArg: 4.161 ± 0.569
4.388ArgSer: 4.388 ± 0.829
2.043ArgThr: 2.043 ± 0.33
4.691ArgVal: 4.691 ± 0.786
0.303ArgTrp: 0.303 ± 0.154
1.74ArgTyr: 1.74 ± 0.385
0.0ArgXaa: 0.0 ± 0.0
Ser
4.918SerAla: 4.918 ± 0.614
0.908SerCys: 0.908 ± 0.313
3.631SerAsp: 3.631 ± 0.607
4.539SerGlu: 4.539 ± 0.654
2.724SerPhe: 2.724 ± 0.512
5.901SerGly: 5.901 ± 0.735
1.286SerHis: 1.286 ± 0.446
3.707SerIle: 3.707 ± 0.841
3.102SerLys: 3.102 ± 0.52
6.052SerLeu: 6.052 ± 0.778
1.437SerMet: 1.437 ± 0.371
2.497SerAsn: 2.497 ± 0.526
1.967SerPro: 1.967 ± 0.411
2.345SerGln: 2.345 ± 0.497
4.237SerArg: 4.237 ± 0.669
3.329SerSer: 3.329 ± 0.728
3.404SerThr: 3.404 ± 0.521
5.523SerVal: 5.523 ± 0.665
0.757SerTrp: 0.757 ± 0.378
2.043SerTyr: 2.043 ± 0.564
0.0SerXaa: 0.0 ± 0.0
Thr
6.052ThrAla: 6.052 ± 0.939
0.681ThrCys: 0.681 ± 0.317
1.967ThrAsp: 1.967 ± 0.423
4.01ThrGlu: 4.01 ± 0.62
2.799ThrPhe: 2.799 ± 0.652
5.144ThrGly: 5.144 ± 0.726
1.513ThrHis: 1.513 ± 0.503
4.464ThrIle: 4.464 ± 0.737
3.404ThrLys: 3.404 ± 0.605
4.464ThrLeu: 4.464 ± 0.894
1.437ThrMet: 1.437 ± 0.409
3.934ThrAsn: 3.934 ± 0.653
2.421ThrPro: 2.421 ± 0.502
2.421ThrGln: 2.421 ± 0.571
1.967ThrArg: 1.967 ± 0.459
2.497ThrSer: 2.497 ± 0.426
2.648ThrThr: 2.648 ± 0.59
4.539ThrVal: 4.539 ± 0.746
0.681ThrTrp: 0.681 ± 0.252
1.74ThrTyr: 1.74 ± 0.452
0.0ThrXaa: 0.0 ± 0.0
Val
4.842ValAla: 4.842 ± 0.674
1.135ValCys: 1.135 ± 0.326
3.783ValAsp: 3.783 ± 0.65
4.464ValGlu: 4.464 ± 0.815
2.875ValPhe: 2.875 ± 0.549
3.253ValGly: 3.253 ± 0.534
1.437ValHis: 1.437 ± 0.415
5.447ValIle: 5.447 ± 0.849
6.431ValLys: 6.431 ± 1.223
6.279ValLeu: 6.279 ± 0.799
1.816ValMet: 1.816 ± 0.393
3.253ValAsn: 3.253 ± 0.638
2.27ValPro: 2.27 ± 0.518
2.043ValGln: 2.043 ± 0.536
4.161ValArg: 4.161 ± 0.78
3.858ValSer: 3.858 ± 0.721
5.371ValThr: 5.371 ± 1.115
2.951ValVal: 2.951 ± 0.646
0.378ValTrp: 0.378 ± 0.163
1.362ValTyr: 1.362 ± 0.418
0.0ValXaa: 0.0 ± 0.0
Trp
0.681TrpAla: 0.681 ± 0.248
0.151TrpCys: 0.151 ± 0.09
0.53TrpAsp: 0.53 ± 0.227
0.757TrpGlu: 0.757 ± 0.274
0.757TrpPhe: 0.757 ± 0.236
0.53TrpGly: 0.53 ± 0.209
0.227TrpHis: 0.227 ± 0.118
1.286TrpIle: 1.286 ± 0.396
0.832TrpLys: 0.832 ± 0.288
1.437TrpLeu: 1.437 ± 0.419
0.378TrpMet: 0.378 ± 0.145
0.605TrpAsn: 0.605 ± 0.283
0.454TrpPro: 0.454 ± 0.241
0.454TrpGln: 0.454 ± 0.227
1.589TrpArg: 1.589 ± 0.353
0.908TrpSer: 0.908 ± 0.449
0.227TrpThr: 0.227 ± 0.142
1.059TrpVal: 1.059 ± 0.264
0.151TrpTrp: 0.151 ± 0.106
0.227TrpTyr: 0.227 ± 0.124
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.497TyrAla: 2.497 ± 0.497
0.303TyrCys: 0.303 ± 0.132
2.421TyrAsp: 2.421 ± 0.519
1.967TyrGlu: 1.967 ± 0.473
0.908TyrPhe: 0.908 ± 0.392
1.891TyrGly: 1.891 ± 0.467
1.059TyrHis: 1.059 ± 0.329
2.724TyrIle: 2.724 ± 0.808
1.664TyrLys: 1.664 ± 0.448
1.513TyrLeu: 1.513 ± 0.416
0.53TyrMet: 0.53 ± 0.164
1.437TyrAsn: 1.437 ± 0.414
1.589TyrPro: 1.589 ± 0.387
1.437TyrGln: 1.437 ± 0.405
1.589TyrArg: 1.589 ± 0.441
2.951TyrSer: 2.951 ± 0.58
1.967TyrThr: 1.967 ± 0.376
2.27TyrVal: 2.27 ± 0.457
0.53TyrTrp: 0.53 ± 0.197
0.757TyrTyr: 0.757 ± 0.287
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (13219 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
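One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the 100 estimates is taken as the error. This is a minimal sketch under that assumption, not the database's actual implementation; the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation are all choices made for the example.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (one-letter codes), not taken from the real proteome.
toy_proteins = ["MKKLA", "AKK", "GGGG", "MKLKK"]
error = bootstrap_error(toy_proteins, "KK")
print(round(pair_per_mille(toy_proteins, "KK"), 1), "±", round(error, 1))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.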

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski