Amino acid dipeptide frequency for Vibrio phage JSF20

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
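The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by the database: the function names, the one-letter alphabet, the mapping of unknown residues to "X" (Xaa), the normalisation by the total number of counted pairs, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Dipeptides are overlapping residue pairs counted within each protein,
    never across protein boundaries. Dividing by the total number of counted
    pairs is an assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}

# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

def representation(freq, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"

# Hypothetical toy sequences, not taken from the phage proteome.
toy_proteins = ["MKAAL", "AAGX"]
freqs = dipeptide_per_mille(toy_proteins)
```

With the values from the table, representation(9.832) labels AlaAla as overrepresented and representation(0.684) labels AlaCys as underrepresented, assuming the colouring uses the uniform 2.5‰ threshold.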

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
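As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The function name is hypothetical; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

ala_ala_fraction = per_mille_to_fraction(9.832)  # AlaAla entry from the table
ala_ala_percent = ala_ala_fraction * 100         # the same value as a percentage
```

So the AlaAla value of 9.832‰ corresponds to a fraction of about 0.009832, or roughly 0.98%.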

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.832 ± 1.211
AlaCys: 0.684 ± 0.216
AlaAsp: 5.044 ± 0.713
AlaGlu: 5.985 ± 0.941
AlaPhe: 2.65 ± 0.426
AlaGly: 5.728 ± 0.863
AlaHis: 1.111 ± 0.228
AlaIle: 5.386 ± 0.843
AlaLys: 6.925 ± 0.843
AlaLeu: 7.096 ± 0.923
AlaMet: 3.249 ± 0.667
AlaAsn: 4.446 ± 0.6
AlaPro: 2.308 ± 0.405
AlaGln: 3.762 ± 0.728
AlaArg: 3.847 ± 0.522
AlaSer: 5.215 ± 0.786
AlaThr: 3.933 ± 0.667
AlaVal: 5.643 ± 0.682
AlaTrp: 1.282 ± 0.48
AlaTyr: 3.762 ± 0.702
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.026 ± 0.278
CysCys: 0.171 ± 0.129
CysAsp: 0.513 ± 0.252
CysGlu: 0.855 ± 0.313
CysPhe: 0.855 ± 0.251
CysGly: 0.684 ± 0.244
CysHis: 0.513 ± 0.167
CysIle: 0.598 ± 0.259
CysLys: 0.342 ± 0.175
CysLeu: 0.769 ± 0.249
CysMet: 0.342 ± 0.234
CysAsn: 0.085 ± 0.082
CysPro: 0.684 ± 0.217
CysGln: 0.342 ± 0.179
CysArg: 0.769 ± 0.3
CysSer: 0.342 ± 0.174
CysThr: 0.256 ± 0.139
CysVal: 0.598 ± 0.248
CysTrp: 0.171 ± 0.131
CysTyr: 0.171 ± 0.167
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.301 ± 0.932
AspCys: 0.94 ± 0.294
AspAsp: 3.591 ± 0.743
AspGlu: 3.847 ± 0.562
AspPhe: 2.736 ± 0.599
AspGly: 4.275 ± 0.647
AspHis: 0.769 ± 0.21
AspIle: 3.676 ± 0.562
AspLys: 3.847 ± 0.667
AspLeu: 4.104 ± 0.546
AspMet: 1.71 ± 0.355
AspAsn: 2.479 ± 0.608
AspPro: 2.65 ± 0.688
AspGln: 1.453 ± 0.46
AspArg: 2.223 ± 0.487
AspSer: 3.163 ± 0.612
AspThr: 4.189 ± 0.706
AspVal: 4.446 ± 0.538
AspTrp: 1.111 ± 0.415
AspTyr: 2.65 ± 0.624
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.293 ± 0.984
GluCys: 0.769 ± 0.259
GluAsp: 5.728 ± 0.706
GluGlu: 6.498 ± 0.852
GluPhe: 2.223 ± 0.335
GluGly: 5.472 ± 0.856
GluHis: 1.624 ± 0.462
GluIle: 4.104 ± 0.689
GluLys: 3.42 ± 0.443
GluLeu: 6.327 ± 0.879
GluMet: 2.907 ± 0.546
GluAsn: 2.65 ± 0.458
GluPro: 1.624 ± 0.351
GluGln: 3.762 ± 0.57
GluArg: 3.847 ± 0.555
GluSer: 4.788 ± 0.746
GluThr: 2.992 ± 0.458
GluVal: 4.702 ± 0.65
GluTrp: 1.282 ± 0.298
GluTyr: 2.907 ± 0.435
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.65 ± 0.569
PheCys: 0.684 ± 0.228
PheAsp: 2.736 ± 0.569
PheGlu: 3.163 ± 0.454
PhePhe: 1.282 ± 0.425
PheGly: 3.163 ± 0.588
PheHis: 0.684 ± 0.277
PheIle: 2.394 ± 0.503
PheLys: 3.078 ± 0.626
PheLeu: 3.334 ± 0.667
PheMet: 1.453 ± 0.374
PheAsn: 2.821 ± 0.607
PhePro: 1.026 ± 0.292
PheGln: 1.368 ± 0.311
PheArg: 1.539 ± 0.316
PheSer: 2.736 ± 0.428
PheThr: 2.565 ± 0.584
PheVal: 2.479 ± 0.53
PheTrp: 0.342 ± 0.18
PheTyr: 1.453 ± 0.361
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.327 ± 1.027
GlyCys: 0.427 ± 0.194
GlyAsp: 3.933 ± 0.655
GlyGlu: 5.13 ± 0.654
GlyPhe: 2.565 ± 0.344
GlyGly: 5.215 ± 0.738
GlyHis: 1.453 ± 0.375
GlyIle: 3.249 ± 0.463
GlyLys: 5.301 ± 0.841
GlyLeu: 6.583 ± 1.036
GlyMet: 2.394 ± 0.521
GlyAsn: 3.078 ± 0.471
GlyPro: 0.0 ± 0.0
GlyGln: 2.821 ± 0.408
GlyArg: 3.762 ± 0.441
GlySer: 3.762 ± 0.709
GlyThr: 4.189 ± 0.65
GlyVal: 4.275 ± 0.702
GlyTrp: 0.769 ± 0.295
GlyTyr: 3.334 ± 0.4
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.197 ± 0.28
HisCys: 0.256 ± 0.145
HisAsp: 1.111 ± 0.317
HisGlu: 1.624 ± 0.38
HisPhe: 1.368 ± 0.327
HisGly: 0.94 ± 0.255
HisHis: 0.171 ± 0.109
HisIle: 1.111 ± 0.258
HisLys: 0.855 ± 0.297
HisLeu: 1.368 ± 0.395
HisMet: 0.427 ± 0.224
HisAsn: 0.684 ± 0.248
HisPro: 0.769 ± 0.301
HisGln: 0.342 ± 0.175
HisArg: 1.026 ± 0.322
HisSer: 1.111 ± 0.331
HisThr: 0.684 ± 0.307
HisVal: 1.624 ± 0.339
HisTrp: 0.598 ± 0.25
HisTyr: 0.427 ± 0.197
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.189 ± 0.484
IleCys: 0.769 ± 0.304
IleAsp: 2.907 ± 0.496
IleGlu: 4.104 ± 0.685
IlePhe: 1.026 ± 0.32
IleGly: 3.334 ± 0.453
IleHis: 0.855 ± 0.269
IleIle: 2.907 ± 0.646
IleLys: 4.959 ± 0.591
IleLeu: 4.531 ± 0.607
IleMet: 1.282 ± 0.35
IleAsn: 2.992 ± 0.573
IlePro: 2.65 ± 0.468
IleGln: 2.479 ± 0.579
IleArg: 3.334 ± 0.574
IleSer: 3.163 ± 0.596
IleThr: 3.505 ± 0.579
IleVal: 3.847 ± 0.472
IleTrp: 0.513 ± 0.186
IleTyr: 1.453 ± 0.319
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.412 ± 0.781
LysCys: 0.769 ± 0.351
LysAsp: 4.189 ± 0.619
LysGlu: 4.617 ± 0.739
LysPhe: 3.078 ± 0.518
LysGly: 4.702 ± 0.668
LysHis: 1.368 ± 0.436
LysIle: 2.907 ± 0.375
LysLys: 5.215 ± 0.835
LysLeu: 5.643 ± 0.842
LysMet: 2.479 ± 0.418
LysAsn: 2.565 ± 0.532
LysPro: 3.078 ± 0.691
LysGln: 3.42 ± 0.639
LysArg: 3.933 ± 0.56
LysSer: 3.933 ± 0.581
LysThr: 3.847 ± 0.585
LysVal: 4.446 ± 0.756
LysTrp: 1.026 ± 0.367
LysTyr: 2.736 ± 0.523
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.011 ± 0.928
LeuCys: 0.855 ± 0.352
LeuAsp: 3.847 ± 0.413
LeuGlu: 6.925 ± 0.9
LeuPhe: 2.565 ± 0.567
LeuGly: 5.215 ± 0.925
LeuHis: 1.539 ± 0.383
LeuIle: 5.215 ± 0.581
LeuLys: 7.609 ± 0.796
LeuLeu: 5.557 ± 0.813
LeuMet: 2.052 ± 0.322
LeuAsn: 4.36 ± 0.55
LeuPro: 2.479 ± 0.469
LeuGln: 3.762 ± 0.533
LeuArg: 5.044 ± 0.712
LeuSer: 4.189 ± 0.558
LeuThr: 5.044 ± 0.69
LeuVal: 4.36 ± 0.512
LeuTrp: 1.026 ± 0.274
LeuTyr: 2.65 ± 0.448
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.163 ± 0.527
MetCys: 0.427 ± 0.205
MetAsp: 1.282 ± 0.379
MetGlu: 2.394 ± 0.421
MetPhe: 1.111 ± 0.309
MetGly: 1.795 ± 0.636
MetHis: 0.342 ± 0.183
MetIle: 1.368 ± 0.3
MetLys: 1.881 ± 0.431
MetLeu: 1.966 ± 0.39
MetMet: 0.427 ± 0.159
MetAsn: 1.539 ± 0.4
MetPro: 1.624 ± 0.358
MetGln: 1.624 ± 0.44
MetArg: 1.282 ± 0.349
MetSer: 1.881 ± 0.388
MetThr: 2.65 ± 0.578
MetVal: 2.052 ± 0.333
MetTrp: 0.171 ± 0.123
MetTyr: 0.342 ± 0.188
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.42 ± 0.732
AsnCys: 0.427 ± 0.184
AsnAsp: 2.65 ± 0.42
AsnGlu: 2.565 ± 0.478
AsnPhe: 1.966 ± 0.362
AsnGly: 3.591 ± 0.599
AsnHis: 1.026 ± 0.279
AsnIle: 2.308 ± 0.547
AsnLys: 2.736 ± 0.509
AsnLeu: 4.873 ± 0.755
AsnMet: 1.795 ± 0.48
AsnAsn: 2.223 ± 0.351
AsnPro: 2.992 ± 0.457
AsnGln: 1.539 ± 0.405
AsnArg: 2.308 ± 0.57
AsnSer: 2.479 ± 0.617
AsnThr: 2.394 ± 0.413
AsnVal: 3.42 ± 0.407
AsnTrp: 0.598 ± 0.294
AsnTyr: 1.539 ± 0.354
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.992 ± 0.609
ProCys: 0.256 ± 0.18
ProAsp: 2.65 ± 0.466
ProGlu: 4.531 ± 0.529
ProPhe: 1.624 ± 0.378
ProGly: 0.0 ± 0.0
ProHis: 0.855 ± 0.215
ProIle: 1.368 ± 0.376
ProLys: 1.795 ± 0.445
ProLeu: 2.479 ± 0.398
ProMet: 0.855 ± 0.282
ProAsn: 2.565 ± 0.559
ProPro: 0.684 ± 0.244
ProGln: 1.624 ± 0.442
ProArg: 1.111 ± 0.301
ProSer: 3.078 ± 0.474
ProThr: 1.71 ± 0.357
ProVal: 3.762 ± 0.508
ProTrp: 0.598 ± 0.263
ProTyr: 1.111 ± 0.365
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.386 ± 1.028
GlnCys: 0.256 ± 0.136
GlnAsp: 2.565 ± 0.672
GlnGlu: 3.42 ± 0.674
GlnPhe: 1.795 ± 0.406
GlnGly: 2.565 ± 0.461
GlnHis: 0.513 ± 0.184
GlnIle: 2.137 ± 0.398
GlnLys: 2.223 ± 0.433
GlnLeu: 4.018 ± 0.66
GlnMet: 0.769 ± 0.246
GlnAsn: 0.855 ± 0.302
GlnPro: 0.94 ± 0.245
GlnGln: 2.052 ± 0.556
GlnArg: 2.052 ± 0.379
GlnSer: 2.821 ± 0.467
GlnThr: 2.565 ± 0.503
GlnVal: 3.505 ± 0.532
GlnTrp: 0.598 ± 0.253
GlnTyr: 1.282 ± 0.307
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.018 ± 0.49
ArgCys: 0.598 ± 0.272
ArgAsp: 3.163 ± 0.496
ArgGlu: 3.505 ± 0.632
ArgPhe: 2.394 ± 0.39
ArgGly: 3.505 ± 0.531
ArgHis: 0.427 ± 0.186
ArgIle: 3.163 ± 0.584
ArgLys: 3.591 ± 0.695
ArgLeu: 3.933 ± 0.628
ArgMet: 0.94 ± 0.264
ArgAsn: 2.394 ± 0.463
ArgPro: 2.137 ± 0.407
ArgGln: 2.394 ± 0.533
ArgArg: 2.308 ± 0.39
ArgSer: 2.821 ± 0.478
ArgThr: 2.565 ± 0.444
ArgVal: 3.163 ± 0.575
ArgTrp: 0.684 ± 0.231
ArgTyr: 1.539 ± 0.359
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.617 ± 0.674
SerCys: 0.598 ± 0.237
SerAsp: 3.933 ± 0.613
SerGlu: 3.163 ± 0.516
SerPhe: 2.992 ± 0.51
SerGly: 5.899 ± 0.896
SerHis: 0.855 ± 0.341
SerIle: 3.676 ± 0.545
SerLys: 4.36 ± 0.819
SerLeu: 4.788 ± 0.509
SerMet: 1.624 ± 0.429
SerAsn: 2.736 ± 0.496
SerPro: 2.052 ± 0.345
SerGln: 1.795 ± 0.309
SerArg: 2.052 ± 0.383
SerSer: 3.505 ± 0.877
SerThr: 2.907 ± 0.55
SerVal: 2.821 ± 0.407
SerTrp: 0.427 ± 0.159
SerTyr: 2.565 ± 0.445
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.249 ± 0.582
ThrCys: 0.342 ± 0.153
ThrAsp: 2.565 ± 0.503
ThrGlu: 4.104 ± 0.553
ThrPhe: 4.018 ± 0.707
ThrGly: 5.557 ± 0.737
ThrHis: 1.026 ± 0.249
ThrIle: 3.505 ± 0.595
ThrLys: 3.163 ± 0.754
ThrLeu: 5.301 ± 0.894
ThrMet: 1.282 ± 0.312
ThrAsn: 2.565 ± 0.57
ThrPro: 2.821 ± 0.454
ThrGln: 2.65 ± 0.447
ThrArg: 2.65 ± 0.37
ThrSer: 2.479 ± 0.413
ThrThr: 3.762 ± 0.548
ThrVal: 3.505 ± 0.523
ThrTrp: 0.598 ± 0.316
ThrTyr: 2.223 ± 0.308
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.959 ± 0.511
ValCys: 0.171 ± 0.112
ValAsp: 3.591 ± 0.488
ValGlu: 6.156 ± 0.752
ValPhe: 2.394 ± 0.425
ValGly: 3.42 ± 0.584
ValHis: 1.453 ± 0.325
ValIle: 3.163 ± 0.575
ValLys: 5.728 ± 0.78
ValLeu: 5.13 ± 0.432
ValMet: 1.881 ± 0.572
ValAsn: 3.42 ± 0.587
ValPro: 2.821 ± 0.418
ValGln: 2.479 ± 0.471
ValArg: 3.42 ± 0.663
ValSer: 3.591 ± 0.526
ValThr: 4.446 ± 0.686
ValVal: 4.275 ± 0.721
ValTrp: 0.598 ± 0.157
ValTyr: 2.821 ± 0.455
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.94 ± 0.347
TrpCys: 0.171 ± 0.13
TrpAsp: 0.855 ± 0.282
TrpGlu: 0.769 ± 0.241
TrpPhe: 0.427 ± 0.19
TrpGly: 0.513 ± 0.172
TrpHis: 0.427 ± 0.22
TrpIle: 0.598 ± 0.261
TrpLys: 1.197 ± 0.399
TrpLeu: 1.197 ± 0.325
TrpMet: 0.171 ± 0.103
TrpAsn: 0.684 ± 0.254
TrpPro: 0.342 ± 0.166
TrpGln: 0.427 ± 0.175
TrpArg: 0.684 ± 0.224
TrpSer: 0.855 ± 0.309
TrpThr: 1.282 ± 0.355
TrpVal: 0.855 ± 0.295
TrpTrp: 0.171 ± 0.097
TrpTyr: 0.598 ± 0.264
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.907 ± 0.625
TyrCys: 0.513 ± 0.219
TyrAsp: 2.308 ± 0.365
TyrGlu: 2.565 ± 0.444
TyrPhe: 1.881 ± 0.487
TyrGly: 2.992 ± 0.459
TyrHis: 0.684 ± 0.235
TyrIle: 1.966 ± 0.479
TyrLys: 2.137 ± 0.54
TyrLeu: 2.394 ± 0.461
TyrMet: 1.197 ± 0.279
TyrAsn: 1.71 ± 0.407
TyrPro: 1.966 ± 0.402
TyrGln: 2.223 ± 0.442
TyrArg: 2.052 ± 0.321
TyrSer: 1.453 ± 0.288
TyrThr: 1.881 ± 0.479
TyrVal: 2.137 ± 0.503
TyrTrp: 0.598 ± 0.204
TyrTyr: 0.427 ± 0.199
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (11697 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
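Protein-level bootstrapping can be illustrated as follows. The page does not describe the exact procedure, so this is only a sketch under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of the 100 resamples, and the error is taken as the population standard deviation of those estimates. The function names, the fixed seed, and the one-letter dipeptide code are hypothetical.

```python
import random
import statistics

def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide (one-letter code) in a list of proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair  # True counts as 1, False as 0
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling whole proteins.

    Assumption: the error is the population standard deviation of the
    resampled estimates; the database may use a different estimator.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.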

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski