Amino acid dipepetide frequency for Vibrio phage vB_VhaP_VH-5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.484AlaAla: 8.484 ± 0.974
0.972AlaCys: 0.972 ± 0.256
5.479AlaAsp: 5.479 ± 0.929
5.037AlaGlu: 5.037 ± 0.626
3.093AlaPhe: 3.093 ± 0.481
6.451AlaGly: 6.451 ± 0.877
0.884AlaHis: 0.884 ± 0.226
3.8AlaIle: 3.8 ± 0.488
5.921AlaLys: 5.921 ± 0.832
9.014AlaLeu: 9.014 ± 0.903
2.474AlaMet: 2.474 ± 0.567
4.065AlaAsn: 4.065 ± 0.51
2.651AlaPro: 2.651 ± 0.516
4.86AlaGln: 4.86 ± 0.89
3.358AlaArg: 3.358 ± 0.597
4.153AlaSer: 4.153 ± 0.44
5.479AlaThr: 5.479 ± 0.817
5.391AlaVal: 5.391 ± 0.932
0.619AlaTrp: 0.619 ± 0.254
2.739AlaTyr: 2.739 ± 0.709
0.0AlaXaa: 0.0 ± 0.0
Cys
0.707CysAla: 0.707 ± 0.251
0.088CysCys: 0.088 ± 0.076
0.442CysAsp: 0.442 ± 0.188
0.707CysGlu: 0.707 ± 0.279
0.088CysPhe: 0.088 ± 0.095
1.06CysGly: 1.06 ± 0.358
0.442CysHis: 0.442 ± 0.18
0.53CysIle: 0.53 ± 0.26
0.442CysLys: 0.442 ± 0.178
1.326CysLeu: 1.326 ± 0.357
0.265CysMet: 0.265 ± 0.188
0.353CysAsn: 0.353 ± 0.198
0.177CysPro: 0.177 ± 0.117
0.265CysGln: 0.265 ± 0.134
0.619CysArg: 0.619 ± 0.269
0.619CysSer: 0.619 ± 0.3
1.237CysThr: 1.237 ± 0.311
0.707CysVal: 0.707 ± 0.308
0.177CysTrp: 0.177 ± 0.116
0.619CysTyr: 0.619 ± 0.236
0.0CysXaa: 0.0 ± 0.0
Asp
5.921AspAla: 5.921 ± 0.784
0.884AspCys: 0.884 ± 0.271
3.446AspAsp: 3.446 ± 0.635
3.977AspGlu: 3.977 ± 0.602
2.386AspPhe: 2.386 ± 0.443
3.623AspGly: 3.623 ± 0.557
1.06AspHis: 1.06 ± 0.368
4.153AspIle: 4.153 ± 0.566
2.828AspLys: 2.828 ± 0.442
4.86AspLeu: 4.86 ± 0.7
1.767AspMet: 1.767 ± 0.346
3.005AspAsn: 3.005 ± 0.487
1.856AspPro: 1.856 ± 0.476
0.884AspGln: 0.884 ± 0.243
2.474AspArg: 2.474 ± 0.432
3.005AspSer: 3.005 ± 0.435
3.623AspThr: 3.623 ± 0.531
6.716AspVal: 6.716 ± 0.891
0.972AspTrp: 0.972 ± 0.327
3.181AspTyr: 3.181 ± 0.608
0.0AspXaa: 0.0 ± 0.0
Glu
5.479GluAla: 5.479 ± 0.665
0.972GluCys: 0.972 ± 0.371
4.065GluAsp: 4.065 ± 0.584
4.242GluGlu: 4.242 ± 0.814
2.298GluPhe: 2.298 ± 0.395
4.595GluGly: 4.595 ± 0.752
1.856GluHis: 1.856 ± 0.532
1.856GluIle: 1.856 ± 0.666
3.535GluLys: 3.535 ± 0.746
6.363GluLeu: 6.363 ± 0.91
0.619GluMet: 0.619 ± 0.249
2.209GluAsn: 2.209 ± 0.609
1.591GluPro: 1.591 ± 0.3
4.595GluGln: 4.595 ± 0.707
3.446GluArg: 3.446 ± 0.534
3.181GluSer: 3.181 ± 0.549
1.856GluThr: 1.856 ± 0.426
5.832GluVal: 5.832 ± 0.859
0.707GluTrp: 0.707 ± 0.268
2.916GluTyr: 2.916 ± 0.458
0.0GluXaa: 0.0 ± 0.0
Phe
2.386PheAla: 2.386 ± 0.44
0.53PheCys: 0.53 ± 0.208
2.033PheAsp: 2.033 ± 0.587
2.916PheGlu: 2.916 ± 0.455
0.884PhePhe: 0.884 ± 0.246
2.033PheGly: 2.033 ± 0.374
0.707PheHis: 0.707 ± 0.301
1.944PheIle: 1.944 ± 0.363
2.739PheLys: 2.739 ± 0.487
2.651PheLeu: 2.651 ± 0.461
0.972PheMet: 0.972 ± 0.197
2.121PheAsn: 2.121 ± 0.463
1.237PhePro: 1.237 ± 0.292
1.502PheGln: 1.502 ± 0.299
1.237PheArg: 1.237 ± 0.284
2.298PheSer: 2.298 ± 0.422
2.298PheThr: 2.298 ± 0.44
2.033PheVal: 2.033 ± 0.376
0.353PheTrp: 0.353 ± 0.18
0.53PheTyr: 0.53 ± 0.169
0.0PheXaa: 0.0 ± 0.0
Gly
5.921GlyAla: 5.921 ± 0.658
0.972GlyCys: 0.972 ± 0.272
3.093GlyAsp: 3.093 ± 0.464
3.005GlyGlu: 3.005 ± 0.633
2.209GlyPhe: 2.209 ± 0.398
5.479GlyGly: 5.479 ± 1.157
1.06GlyHis: 1.06 ± 0.236
3.623GlyIle: 3.623 ± 0.492
3.888GlyLys: 3.888 ± 0.47
6.628GlyLeu: 6.628 ± 0.652
2.298GlyMet: 2.298 ± 0.403
2.209GlyAsn: 2.209 ± 0.502
0.0GlyPro: 0.0 ± 0.0
2.739GlyGln: 2.739 ± 0.37
4.419GlyArg: 4.419 ± 0.61
4.33GlySer: 4.33 ± 0.545
6.009GlyThr: 6.009 ± 0.629
7.158GlyVal: 7.158 ± 0.977
1.237GlyTrp: 1.237 ± 0.333
3.712GlyTyr: 3.712 ± 0.544
0.0GlyXaa: 0.0 ± 0.0
His
1.502HisAla: 1.502 ± 0.432
0.088HisCys: 0.088 ± 0.082
1.326HisAsp: 1.326 ± 0.369
0.795HisGlu: 0.795 ± 0.221
1.237HisPhe: 1.237 ± 0.412
1.149HisGly: 1.149 ± 0.194
0.53HisHis: 0.53 ± 0.178
1.679HisIle: 1.679 ± 0.334
1.149HisLys: 1.149 ± 0.273
2.033HisLeu: 2.033 ± 0.499
0.177HisMet: 0.177 ± 0.125
1.237HisAsn: 1.237 ± 0.305
1.326HisPro: 1.326 ± 0.432
0.795HisGln: 0.795 ± 0.236
1.149HisArg: 1.149 ± 0.371
1.06HisSer: 1.06 ± 0.337
1.149HisThr: 1.149 ± 0.364
1.679HisVal: 1.679 ± 0.394
0.177HisTrp: 0.177 ± 0.12
1.06HisTyr: 1.06 ± 0.29
0.0HisXaa: 0.0 ± 0.0
Ile
4.153IleAla: 4.153 ± 0.517
0.177IleCys: 0.177 ± 0.121
3.181IleAsp: 3.181 ± 0.506
2.651IleGlu: 2.651 ± 0.514
1.06IlePhe: 1.06 ± 0.363
3.27IleGly: 3.27 ± 0.624
1.326IleHis: 1.326 ± 0.286
2.209IleIle: 2.209 ± 0.591
2.916IleLys: 2.916 ± 0.465
2.916IleLeu: 2.916 ± 0.483
0.884IleMet: 0.884 ± 0.217
2.916IleAsn: 2.916 ± 0.591
2.298IlePro: 2.298 ± 0.505
2.033IleGln: 2.033 ± 0.49
1.679IleArg: 1.679 ± 0.374
2.298IleSer: 2.298 ± 0.324
3.712IleThr: 3.712 ± 0.616
2.651IleVal: 2.651 ± 0.465
0.088IleTrp: 0.088 ± 0.08
1.326IleTyr: 1.326 ± 0.321
0.0IleXaa: 0.0 ± 0.0
Lys
4.507LysAla: 4.507 ± 0.906
0.884LysCys: 0.884 ± 0.289
4.153LysAsp: 4.153 ± 0.712
3.977LysGlu: 3.977 ± 0.785
1.767LysPhe: 1.767 ± 0.352
4.33LysGly: 4.33 ± 0.532
1.502LysHis: 1.502 ± 0.396
1.149LysIle: 1.149 ± 0.32
1.856LysLys: 1.856 ± 0.432
5.921LysLeu: 5.921 ± 0.709
0.884LysMet: 0.884 ± 0.255
2.298LysAsn: 2.298 ± 0.449
2.474LysPro: 2.474 ± 0.498
2.739LysGln: 2.739 ± 0.564
4.507LysArg: 4.507 ± 0.77
3.535LysSer: 3.535 ± 0.548
2.563LysThr: 2.563 ± 0.376
3.888LysVal: 3.888 ± 0.449
0.884LysTrp: 0.884 ± 0.254
3.181LysTyr: 3.181 ± 0.472
0.0LysXaa: 0.0 ± 0.0
Leu
8.484LeuAla: 8.484 ± 1.05
0.972LeuCys: 0.972 ± 0.282
6.186LeuAsp: 6.186 ± 0.757
6.539LeuGlu: 6.539 ± 0.941
2.298LeuPhe: 2.298 ± 0.354
6.805LeuGly: 6.805 ± 0.791
1.502LeuHis: 1.502 ± 0.394
4.065LeuIle: 4.065 ± 0.554
5.479LeuLys: 5.479 ± 0.764
6.805LeuLeu: 6.805 ± 0.922
1.679LeuMet: 1.679 ± 0.409
3.888LeuAsn: 3.888 ± 0.869
3.446LeuPro: 3.446 ± 0.65
5.744LeuGln: 5.744 ± 0.826
4.065LeuArg: 4.065 ± 0.642
5.921LeuSer: 5.921 ± 0.889
5.302LeuThr: 5.302 ± 0.51
5.302LeuVal: 5.302 ± 0.743
0.795LeuTrp: 0.795 ± 0.266
3.8LeuTyr: 3.8 ± 0.467
0.0LeuXaa: 0.0 ± 0.0
Met
2.033MetAla: 2.033 ± 0.415
0.265MetCys: 0.265 ± 0.126
1.502MetAsp: 1.502 ± 0.458
1.237MetGlu: 1.237 ± 0.387
0.972MetPhe: 0.972 ± 0.228
1.414MetGly: 1.414 ± 0.327
0.707MetHis: 0.707 ± 0.228
0.619MetIle: 0.619 ± 0.236
0.795MetLys: 0.795 ± 0.385
2.033MetLeu: 2.033 ± 0.462
0.707MetMet: 0.707 ± 0.252
1.237MetAsn: 1.237 ± 0.35
0.442MetPro: 0.442 ± 0.199
1.06MetGln: 1.06 ± 0.282
1.591MetArg: 1.591 ± 0.446
2.298MetSer: 2.298 ± 0.65
1.944MetThr: 1.944 ± 0.365
1.237MetVal: 1.237 ± 0.396
0.442MetTrp: 0.442 ± 0.17
1.326MetTyr: 1.326 ± 0.301
0.0MetXaa: 0.0 ± 0.0
Asn
2.916AsnAla: 2.916 ± 0.639
0.53AsnCys: 0.53 ± 0.2
2.033AsnAsp: 2.033 ± 0.377
1.679AsnGlu: 1.679 ± 0.34
1.856AsnPhe: 1.856 ± 0.356
3.181AsnGly: 3.181 ± 0.666
1.237AsnHis: 1.237 ± 0.328
2.828AsnIle: 2.828 ± 0.699
2.298AsnLys: 2.298 ± 0.356
3.535AsnLeu: 3.535 ± 0.437
1.149AsnMet: 1.149 ± 0.258
3.181AsnAsn: 3.181 ± 0.58
3.093AsnPro: 3.093 ± 0.449
1.856AsnGln: 1.856 ± 0.352
2.209AsnArg: 2.209 ± 0.537
3.093AsnSer: 3.093 ± 0.516
3.977AsnThr: 3.977 ± 0.931
3.005AsnVal: 3.005 ± 0.642
0.442AsnTrp: 0.442 ± 0.255
2.033AsnTyr: 2.033 ± 0.423
0.0AsnXaa: 0.0 ± 0.0
Pro
2.651ProAla: 2.651 ± 0.44
0.177ProCys: 0.177 ± 0.128
2.916ProAsp: 2.916 ± 0.502
4.065ProGlu: 4.065 ± 0.749
1.149ProPhe: 1.149 ± 0.29
0.0ProGly: 0.0 ± 0.0
0.884ProHis: 0.884 ± 0.209
1.767ProIle: 1.767 ± 0.421
2.121ProLys: 2.121 ± 0.428
2.298ProLeu: 2.298 ± 0.398
0.972ProMet: 0.972 ± 0.317
1.326ProAsn: 1.326 ± 0.273
1.591ProPro: 1.591 ± 0.464
2.209ProGln: 2.209 ± 0.444
1.856ProArg: 1.856 ± 0.428
2.651ProSer: 2.651 ± 0.612
2.916ProThr: 2.916 ± 0.568
3.977ProVal: 3.977 ± 0.588
0.0ProTrp: 0.0 ± 0.0
1.679ProTyr: 1.679 ± 0.42
0.0ProXaa: 0.0 ± 0.0
Gln
5.391GlnAla: 5.391 ± 0.896
0.265GlnCys: 0.265 ± 0.154
2.651GlnAsp: 2.651 ± 0.517
3.712GlnGlu: 3.712 ± 0.597
1.149GlnPhe: 1.149 ± 0.256
4.065GlnGly: 4.065 ± 0.46
1.06GlnHis: 1.06 ± 0.337
1.326GlnIle: 1.326 ± 0.277
3.093GlnLys: 3.093 ± 0.513
4.419GlnLeu: 4.419 ± 0.776
1.06GlnMet: 1.06 ± 0.24
2.651GlnAsn: 2.651 ± 0.557
1.591GlnPro: 1.591 ± 0.41
3.8GlnGln: 3.8 ± 1.012
2.121GlnArg: 2.121 ± 0.64
2.474GlnSer: 2.474 ± 0.526
2.563GlnThr: 2.563 ± 0.45
3.535GlnVal: 3.535 ± 0.403
0.707GlnTrp: 0.707 ± 0.215
2.209GlnTyr: 2.209 ± 0.534
0.0GlnXaa: 0.0 ± 0.0
Arg
3.623ArgAla: 3.623 ± 0.577
0.353ArgCys: 0.353 ± 0.163
2.298ArgAsp: 2.298 ± 0.434
2.739ArgGlu: 2.739 ± 0.7
2.209ArgPhe: 2.209 ± 0.488
3.712ArgGly: 3.712 ± 0.435
1.502ArgHis: 1.502 ± 0.347
3.005ArgIle: 3.005 ± 0.44
3.8ArgLys: 3.8 ± 0.578
4.419ArgLeu: 4.419 ± 0.592
2.121ArgMet: 2.121 ± 0.586
2.209ArgAsn: 2.209 ± 0.536
1.414ArgPro: 1.414 ± 0.299
1.679ArgGln: 1.679 ± 0.315
4.419ArgArg: 4.419 ± 0.855
3.181ArgSer: 3.181 ± 0.559
4.153ArgThr: 4.153 ± 0.59
3.977ArgVal: 3.977 ± 0.526
0.265ArgTrp: 0.265 ± 0.134
1.767ArgTyr: 1.767 ± 0.311
0.0ArgXaa: 0.0 ± 0.0
Ser
5.391SerAla: 5.391 ± 0.622
0.353SerCys: 0.353 ± 0.18
3.535SerAsp: 3.535 ± 0.682
3.8SerGlu: 3.8 ± 0.492
2.298SerPhe: 2.298 ± 0.338
4.153SerGly: 4.153 ± 0.648
0.707SerHis: 0.707 ± 0.273
2.563SerIle: 2.563 ± 0.584
3.623SerLys: 3.623 ± 0.604
5.391SerLeu: 5.391 ± 0.522
2.121SerMet: 2.121 ± 0.449
2.298SerAsn: 2.298 ± 0.59
2.563SerPro: 2.563 ± 0.308
2.033SerGln: 2.033 ± 0.475
2.916SerArg: 2.916 ± 0.556
4.153SerSer: 4.153 ± 0.594
6.186SerThr: 6.186 ± 1.138
3.623SerVal: 3.623 ± 0.551
0.972SerTrp: 0.972 ± 0.275
2.033SerTyr: 2.033 ± 0.442
0.0SerXaa: 0.0 ± 0.0
Thr
6.363ThrAla: 6.363 ± 0.755
0.53ThrCys: 0.53 ± 0.222
4.242ThrAsp: 4.242 ± 0.582
4.684ThrGlu: 4.684 ± 0.662
2.121ThrPhe: 2.121 ± 0.541
5.832ThrGly: 5.832 ± 0.732
1.414ThrHis: 1.414 ± 0.349
2.386ThrIle: 2.386 ± 0.41
3.27ThrLys: 3.27 ± 0.452
5.744ThrLeu: 5.744 ± 0.701
1.326ThrMet: 1.326 ± 0.271
3.181ThrAsn: 3.181 ± 1.0
4.419ThrPro: 4.419 ± 0.627
3.005ThrGln: 3.005 ± 0.485
3.446ThrArg: 3.446 ± 0.613
3.535ThrSer: 3.535 ± 0.703
5.214ThrThr: 5.214 ± 0.925
4.772ThrVal: 4.772 ± 1.038
1.237ThrTrp: 1.237 ± 0.371
2.033ThrTyr: 2.033 ± 0.538
0.0ThrXaa: 0.0 ± 0.0
Val
5.656ValAla: 5.656 ± 0.91
0.972ValCys: 0.972 ± 0.33
4.86ValAsp: 4.86 ± 0.615
2.916ValGlu: 2.916 ± 0.38
2.209ValPhe: 2.209 ± 0.511
5.125ValGly: 5.125 ± 0.788
1.856ValHis: 1.856 ± 0.407
2.033ValIle: 2.033 ± 0.6
4.419ValLys: 4.419 ± 0.565
7.953ValLeu: 7.953 ± 0.893
1.149ValMet: 1.149 ± 0.282
3.093ValAsn: 3.093 ± 0.538
3.535ValPro: 3.535 ± 0.804
6.186ValGln: 6.186 ± 1.045
4.595ValArg: 4.595 ± 0.662
4.772ValSer: 4.772 ± 0.579
4.772ValThr: 4.772 ± 1.044
5.567ValVal: 5.567 ± 0.977
0.619ValTrp: 0.619 ± 0.211
2.828ValTyr: 2.828 ± 0.41
0.0ValXaa: 0.0 ± 0.0
Trp
0.353TrpAla: 0.353 ± 0.166
0.353TrpCys: 0.353 ± 0.2
0.884TrpAsp: 0.884 ± 0.387
0.884TrpGlu: 0.884 ± 0.341
0.53TrpPhe: 0.53 ± 0.185
0.707TrpGly: 0.707 ± 0.236
0.177TrpHis: 0.177 ± 0.112
0.265TrpIle: 0.265 ± 0.143
0.53TrpLys: 0.53 ± 0.236
1.326TrpLeu: 1.326 ± 0.353
0.265TrpMet: 0.265 ± 0.174
0.972TrpAsn: 0.972 ± 0.301
0.0TrpPro: 0.0 ± 0.0
0.442TrpGln: 0.442 ± 0.161
0.619TrpArg: 0.619 ± 0.227
0.707TrpSer: 0.707 ± 0.223
0.442TrpThr: 0.442 ± 0.172
1.237TrpVal: 1.237 ± 0.303
0.088TrpTrp: 0.088 ± 0.1
0.353TrpTyr: 0.353 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.005TyrAla: 3.005 ± 0.638
0.53TyrCys: 0.53 ± 0.229
2.298TyrAsp: 2.298 ± 0.467
2.651TyrGlu: 2.651 ± 0.511
1.591TyrPhe: 1.591 ± 0.33
3.005TyrGly: 3.005 ± 0.443
0.884TyrHis: 0.884 ± 0.325
1.944TyrIle: 1.944 ± 0.33
2.298TyrLys: 2.298 ± 0.492
3.446TyrLeu: 3.446 ± 0.451
0.884TyrMet: 0.884 ± 0.252
1.679TyrAsn: 1.679 ± 0.401
1.502TyrPro: 1.502 ± 0.276
1.414TyrGln: 1.414 ± 0.502
2.121TyrArg: 2.121 ± 0.462
3.535TyrSer: 3.535 ± 0.598
3.358TyrThr: 3.358 ± 0.557
2.828TyrVal: 2.828 ± 0.454
0.353TyrTrp: 0.353 ± 0.166
1.944TyrTyr: 1.944 ± 0.593
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 32 proteins (11317 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski