Amino acid dipeptide frequency for Vibrio phage vB_VpaM_MAR

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
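The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to be given as one-letter-code strings, dipeptides are counted as overlapping pairs within each protein, and the counts are normalized by the total number of dipeptides. The exact normalization used by Proteome-pI is not stated on this page and may differ.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides, expressed in per mille (0.25% = 2.5 per mille).
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"      # shown in red on the original page
    if freq_permille < expected:
        return "under"     # shown in blue on the original page
    return "as expected"


if __name__ == "__main__":
    toy = ["AAAC", "AC"]  # toy input, not real phage proteins
    for dp, freq in sorted(dipeptide_permille(toy).items()):
        print(dp, round(freq, 3), classify(freq))
```

With the table values below, for example, AlaAla (12.022‰) would be classified as overrepresented and AlaCys (0.864‰) as underrepresented relative to 2.5‰.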

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
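As a small worked example of the unit conversion, the following sketch converts a table value from per mille to a fraction and to percent. The helper names are hypothetical and introduced only for illustration; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille: float) -> float:
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille: float) -> float:
    """Convert per mille to percent (1% = 10 per mille)."""
    return value_permille / 10.0


if __name__ == "__main__":
    ala_ala = 12.022  # AlaAla frequency from the table, in per mille
    print(permille_to_fraction(ala_ala))  # fraction of all dipeptides
    print(permille_to_percent(ala_ala))   # same value in percent
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰, which is the number to compare the table entries against.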

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue; within each group the second residue runs in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 12.022 ± 1.952
AlaCys: 0.864 ± 0.268
AlaAsp: 6.757 ± 0.787
AlaGlu: 9.036 ± 0.876
AlaPhe: 3.457 ± 0.446
AlaGly: 7.936 ± 1.106
AlaHis: 1.571 ± 0.289
AlaIle: 4.479 ± 0.589
AlaLys: 6.757 ± 0.725
AlaLeu: 8.093 ± 0.787
AlaMet: 2.279 ± 0.401
AlaAsn: 4.007 ± 0.529
AlaPro: 4.007 ± 0.482
AlaGln: 4.164 ± 0.618
AlaArg: 5.029 ± 0.482
AlaSer: 5.029 ± 0.709
AlaThr: 4.95 ± 0.655
AlaVal: 6.286 ± 0.731
AlaTrp: 1.414 ± 0.297
AlaTyr: 2.671 ± 0.507
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.629 ± 0.229
CysCys: 0.55 ± 0.196
CysAsp: 0.314 ± 0.137
CysGlu: 0.629 ± 0.18
CysPhe: 0.393 ± 0.189
CysGly: 1.414 ± 0.38
CysHis: 0.236 ± 0.156
CysIle: 0.943 ± 0.273
CysLys: 0.471 ± 0.202
CysLeu: 0.786 ± 0.221
CysMet: 0.157 ± 0.109
CysAsn: 0.629 ± 0.203
CysPro: 0.707 ± 0.221
CysGln: 0.393 ± 0.198
CysArg: 1.336 ± 0.324
CysSer: 0.786 ± 0.284
CysThr: 0.707 ± 0.244
CysVal: 0.943 ± 0.327
CysTrp: 0.157 ± 0.11
CysTyr: 0.157 ± 0.099
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.029 ± 0.569
AspCys: 1.021 ± 0.391
AspAsp: 3.693 ± 0.73
AspGlu: 4.007 ± 0.555
AspPhe: 2.593 ± 0.495
AspGly: 5.264 ± 0.685
AspHis: 0.864 ± 0.31
AspIle: 3.614 ± 0.531
AspLys: 3.064 ± 0.577
AspLeu: 6.757 ± 0.735
AspMet: 1.257 ± 0.336
AspAsn: 2.279 ± 0.478
AspPro: 2.671 ± 0.446
AspGln: 3.379 ± 0.468
AspArg: 3.064 ± 0.499
AspSer: 2.436 ± 0.324
AspThr: 2.043 ± 0.413
AspVal: 4.95 ± 0.638
AspTrp: 0.786 ± 0.241
AspTyr: 2.2 ± 0.445
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.779 ± 0.595
GluCys: 0.864 ± 0.377
GluAsp: 2.593 ± 0.58
GluGlu: 4.872 ± 0.704
GluPhe: 2.907 ± 0.427
GluGly: 5.107 ± 0.502
GluHis: 1.65 ± 0.331
GluIle: 3.614 ± 0.652
GluLys: 2.986 ± 0.496
GluLeu: 7.779 ± 0.873
GluMet: 1.964 ± 0.365
GluAsn: 1.886 ± 0.339
GluPro: 2.593 ± 0.389
GluGln: 4.872 ± 0.646
GluArg: 4.322 ± 0.674
GluSer: 3.457 ± 0.53
GluThr: 4.479 ± 0.526
GluVal: 5.107 ± 0.636
GluTrp: 0.864 ± 0.222
GluTyr: 2.436 ± 0.402
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.064 ± 0.532
PheCys: 0.864 ± 0.3
PheAsp: 3.379 ± 0.528
PheGlu: 2.2 ± 0.416
PhePhe: 1.021 ± 0.315
PheGly: 3.064 ± 0.585
PheHis: 0.707 ± 0.261
PheIle: 1.729 ± 0.35
PheLys: 2.357 ± 0.317
PheLeu: 1.571 ± 0.358
PheMet: 0.864 ± 0.239
PheAsn: 1.336 ± 0.329
PhePro: 0.943 ± 0.283
PheGln: 0.943 ± 0.232
PheArg: 1.571 ± 0.444
PheSer: 2.593 ± 0.511
PheThr: 1.729 ± 0.308
PheVal: 2.514 ± 0.505
PheTrp: 0.55 ± 0.2
PheTyr: 1.1 ± 0.348
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.972 ± 0.981
GlyCys: 0.943 ± 0.282
GlyAsp: 4.636 ± 0.502
GlyGlu: 5.343 ± 0.827
GlyPhe: 3.221 ± 0.505
GlyGly: 5.264 ± 0.55
GlyHis: 1.807 ± 0.415
GlyIle: 3.3 ± 0.448
GlyLys: 5.029 ± 0.672
GlyLeu: 6.993 ± 0.697
GlyMet: 2.671 ± 0.483
GlyAsn: 1.807 ± 0.329
GlyPro: 1.729 ± 0.384
GlyGln: 3.143 ± 0.515
GlyArg: 3.3 ± 0.443
GlySer: 3.536 ± 0.497
GlyThr: 4.636 ± 0.843
GlyVal: 5.893 ± 0.91
GlyTrp: 1.414 ± 0.289
GlyTyr: 2.671 ± 0.502
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.336 ± 0.351
HisCys: 0.55 ± 0.215
HisAsp: 0.943 ± 0.237
HisGlu: 1.807 ± 0.367
HisPhe: 0.55 ± 0.148
HisGly: 1.414 ± 0.386
HisHis: 0.629 ± 0.228
HisIle: 1.886 ± 0.424
HisLys: 1.021 ± 0.292
HisLeu: 1.886 ± 0.339
HisMet: 0.471 ± 0.169
HisAsn: 1.021 ± 0.322
HisPro: 0.629 ± 0.175
HisGln: 0.864 ± 0.248
HisArg: 1.179 ± 0.273
HisSer: 0.55 ± 0.199
HisThr: 1.021 ± 0.324
HisVal: 1.336 ± 0.242
HisTrp: 0.393 ± 0.19
HisTyr: 0.786 ± 0.228
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.793 ± 0.776
IleCys: 0.707 ± 0.228
IleAsp: 3.379 ± 0.464
IleGlu: 3.064 ± 0.462
IlePhe: 0.943 ± 0.266
IleGly: 4.4 ± 0.407
IleHis: 1.1 ± 0.264
IleIle: 2.514 ± 0.379
IleLys: 3.536 ± 0.585
IleLeu: 3.693 ± 0.592
IleMet: 0.943 ± 0.277
IleAsn: 1.807 ± 0.404
IlePro: 2.121 ± 0.36
IleGln: 1.964 ± 0.361
IleArg: 2.279 ± 0.394
IleSer: 3.064 ± 0.499
IleThr: 4.4 ± 0.441
IleVal: 3.143 ± 0.681
IleTrp: 0.157 ± 0.111
IleTyr: 1.021 ± 0.23
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.679 ± 0.747
LysCys: 0.314 ± 0.155
LysAsp: 3.379 ± 0.503
LysGlu: 4.164 ± 0.594
LysPhe: 1.65 ± 0.437
LysGly: 4.007 ± 0.507
LysHis: 1.336 ± 0.39
LysIle: 1.807 ± 0.36
LysLys: 3.143 ± 0.59
LysLeu: 4.086 ± 0.545
LysMet: 1.65 ± 0.354
LysAsn: 2.436 ± 0.38
LysPro: 2.436 ± 0.447
LysGln: 1.886 ± 0.517
LysArg: 2.907 ± 0.48
LysSer: 3.379 ± 0.52
LysThr: 4.007 ± 0.71
LysVal: 5.343 ± 0.717
LysTrp: 1.021 ± 0.272
LysTyr: 1.493 ± 0.452
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 9.429 ± 0.776
LeuCys: 0.471 ± 0.189
LeuAsp: 5.264 ± 0.515
LeuGlu: 5.972 ± 0.628
LeuPhe: 2.986 ± 0.468
LeuGly: 6.286 ± 0.783
LeuHis: 1.729 ± 0.367
LeuIle: 3.85 ± 0.479
LeuLys: 5.264 ± 0.626
LeuLeu: 7.15 ± 0.474
LeuMet: 1.336 ± 0.306
LeuAsn: 5.029 ± 0.689
LeuPro: 3.772 ± 0.652
LeuGln: 4.636 ± 0.571
LeuArg: 6.05 ± 0.661
LeuSer: 5.736 ± 0.78
LeuThr: 5.422 ± 0.51
LeuVal: 5.736 ± 0.699
LeuTrp: 1.1 ± 0.268
LeuTyr: 1.729 ± 0.273
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.064 ± 0.613
MetCys: 0.55 ± 0.193
MetAsp: 1.571 ± 0.309
MetGlu: 1.571 ± 0.408
MetPhe: 0.55 ± 0.185
MetGly: 1.179 ± 0.332
MetHis: 0.471 ± 0.197
MetIle: 0.786 ± 0.215
MetLys: 1.336 ± 0.474
MetLeu: 2.121 ± 0.472
MetMet: 0.55 ± 0.236
MetAsn: 1.1 ± 0.27
MetPro: 1.021 ± 0.231
MetGln: 1.179 ± 0.309
MetArg: 1.179 ± 0.277
MetSer: 1.571 ± 0.402
MetThr: 1.964 ± 0.377
MetVal: 1.414 ± 0.343
MetTrp: 0.393 ± 0.196
MetTyr: 0.236 ± 0.131
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.457 ± 0.493
AsnCys: 0.236 ± 0.123
AsnAsp: 1.886 ± 0.548
AsnGlu: 2.436 ± 0.451
AsnPhe: 0.707 ± 0.247
AsnGly: 3.85 ± 0.47
AsnHis: 1.336 ± 0.266
AsnIle: 1.493 ± 0.302
AsnLys: 1.571 ± 0.35
AsnLeu: 3.536 ± 0.553
AsnMet: 1.021 ± 0.277
AsnAsn: 1.1 ± 0.242
AsnPro: 1.964 ± 0.392
AsnGln: 2.121 ± 0.382
AsnArg: 2.279 ± 0.394
AsnSer: 1.571 ± 0.316
AsnThr: 2.043 ± 0.442
AsnVal: 2.043 ± 0.408
AsnTrp: 0.629 ± 0.222
AsnTyr: 1.257 ± 0.248
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.85 ± 0.55
ProCys: 0.943 ± 0.316
ProAsp: 3.693 ± 0.634
ProGlu: 3.85 ± 0.57
ProPhe: 0.864 ± 0.299
ProGly: 2.514 ± 0.401
ProHis: 0.471 ± 0.177
ProIle: 2.043 ± 0.442
ProLys: 1.493 ± 0.326
ProLeu: 3.379 ± 0.474
ProMet: 0.707 ± 0.197
ProAsn: 1.1 ± 0.229
ProPro: 0.786 ± 0.252
ProGln: 1.964 ± 0.609
ProArg: 1.571 ± 0.421
ProSer: 2.121 ± 0.388
ProThr: 2.357 ± 0.391
ProVal: 3.3 ± 0.51
ProTrp: 0.707 ± 0.278
ProTyr: 0.864 ± 0.3
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 6.05 ± 1.071
GlnCys: 0.236 ± 0.127
GlnAsp: 1.886 ± 0.369
GlnGlu: 4.714 ± 0.637
GlnPhe: 1.886 ± 0.434
GlnGly: 2.986 ± 0.465
GlnHis: 1.1 ± 0.342
GlnIle: 1.964 ± 0.352
GlnLys: 2.121 ± 0.364
GlnLeu: 5.107 ± 0.668
GlnMet: 1.179 ± 0.291
GlnAsn: 1.807 ± 0.441
GlnPro: 1.964 ± 0.382
GlnGln: 2.593 ± 0.764
GlnArg: 2.2 ± 0.4
GlnSer: 2.514 ± 0.359
GlnThr: 2.436 ± 0.369
GlnVal: 4.322 ± 0.593
GlnTrp: 0.786 ± 0.312
GlnTyr: 1.414 ± 0.334
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.343 ± 0.568
ArgCys: 0.314 ± 0.146
ArgAsp: 2.121 ± 0.402
ArgGlu: 4.4 ± 0.62
ArgPhe: 2.436 ± 0.446
ArgGly: 2.829 ± 0.422
ArgHis: 1.336 ± 0.374
ArgIle: 3.379 ± 0.521
ArgLys: 3.693 ± 0.449
ArgLeu: 5.186 ± 0.647
ArgMet: 1.414 ± 0.337
ArgAsn: 1.257 ± 0.289
ArgPro: 1.729 ± 0.432
ArgGln: 3.3 ± 0.499
ArgArg: 3.614 ± 0.494
ArgSer: 2.593 ± 0.495
ArgThr: 2.514 ± 0.383
ArgVal: 4.872 ± 0.526
ArgTrp: 1.257 ± 0.384
ArgTyr: 1.65 ± 0.341
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.657 ± 0.853
SerCys: 0.629 ± 0.203
SerAsp: 3.457 ± 0.513
SerGlu: 2.829 ± 0.456
SerPhe: 2.043 ± 0.354
SerGly: 5.107 ± 0.668
SerHis: 1.179 ± 0.261
SerIle: 2.279 ± 0.447
SerLys: 3.693 ± 0.504
SerLeu: 4.872 ± 0.519
SerMet: 1.65 ± 0.355
SerAsn: 2.043 ± 0.41
SerPro: 2.593 ± 0.466
SerGln: 2.907 ± 0.41
SerArg: 3.221 ± 0.49
SerSer: 2.593 ± 0.499
SerThr: 1.886 ± 0.365
SerVal: 3.064 ± 0.563
SerTrp: 0.629 ± 0.183
SerTyr: 1.65 ± 0.344
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.814 ± 0.664
ThrCys: 0.629 ± 0.189
ThrAsp: 3.3 ± 0.487
ThrGlu: 4.164 ± 0.529
ThrPhe: 1.571 ± 0.322
ThrGly: 4.479 ± 0.614
ThrHis: 0.864 ± 0.272
ThrIle: 2.593 ± 0.406
ThrLys: 3.221 ± 0.493
ThrLeu: 5.343 ± 0.697
ThrMet: 1.1 ± 0.212
ThrAsn: 1.571 ± 0.304
ThrPro: 2.436 ± 0.479
ThrGln: 2.907 ± 0.468
ThrArg: 2.671 ± 0.373
ThrSer: 4.086 ± 0.647
ThrThr: 3.221 ± 0.494
ThrVal: 3.85 ± 0.485
ThrTrp: 0.786 ± 0.24
ThrTyr: 1.886 ± 0.385
ThrXaa: 0.0 ± 0.0

Val
ValAla: 7.229 ± 1.079
ValCys: 1.021 ± 0.274
ValAsp: 5.186 ± 0.76
ValGlu: 3.772 ± 0.53
ValPhe: 2.514 ± 0.321
ValGly: 4.007 ± 0.508
ValHis: 1.1 ± 0.276
ValIle: 5.343 ± 0.656
ValLys: 4.086 ± 0.547
ValLeu: 6.207 ± 0.814
ValMet: 1.886 ± 0.349
ValAsn: 3.221 ± 0.584
ValPro: 2.829 ± 0.401
ValGln: 3.221 ± 0.434
ValArg: 4.243 ± 0.726
ValSer: 3.693 ± 0.602
ValThr: 4.714 ± 0.625
ValVal: 6.207 ± 0.792
ValTrp: 1.257 ± 0.246
ValTyr: 2.121 ± 0.433
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.257 ± 0.29
TrpCys: 0.236 ± 0.14
TrpAsp: 1.414 ± 0.362
TrpGlu: 0.943 ± 0.317
TrpPhe: 1.1 ± 0.256
TrpGly: 0.707 ± 0.19
TrpHis: 0.393 ± 0.165
TrpIle: 0.629 ± 0.202
TrpLys: 0.943 ± 0.262
TrpLeu: 1.257 ± 0.237
TrpMet: 0.079 ± 0.077
TrpAsn: 0.393 ± 0.182
TrpPro: 0.707 ± 0.252
TrpGln: 0.393 ± 0.178
TrpArg: 0.786 ± 0.25
TrpSer: 1.179 ± 0.241
TrpThr: 0.471 ± 0.157
TrpVal: 1.493 ± 0.399
TrpTrp: 0.314 ± 0.144
TrpTyr: 0.157 ± 0.106
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.357 ± 0.462
TyrCys: 0.393 ± 0.188
TyrAsp: 2.121 ± 0.282
TyrGlu: 2.357 ± 0.48
TyrPhe: 0.707 ± 0.267
TyrGly: 1.493 ± 0.352
TyrHis: 0.393 ± 0.247
TyrIle: 0.786 ± 0.278
TyrLys: 1.414 ± 0.295
TyrLeu: 3.143 ± 0.497
TyrMet: 0.55 ± 0.186
TyrAsn: 0.707 ± 0.182
TyrPro: 0.943 ± 0.308
TyrGln: 2.436 ± 0.542
TyrArg: 2.436 ± 0.406
TyrSer: 1.493 ± 0.4
TyrThr: 1.493 ± 0.308
TyrVal: 2.043 ± 0.3
TyrTrp: 0.236 ± 0.115
TyrTyr: 0.864 ± 0.259
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 62 proteins (12728 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
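A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function name `bootstrap_dipeptide_error` is hypothetical, the error is assumed to be the standard deviation over the resampled estimates, and the normalization by the number of dipeptides in each resample is likewise an assumption.

```python
import random
import statistics
from collections import Counter


def bootstrap_dipeptide_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide frequency.

    Whole proteins are resampled with replacement, so the resampling
    unit is the protein rather than the individual residue.
    """
    rng = random.Random(seed)
    # Count overlapping dipeptides once per protein, then reuse the counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported "±" is the standard deviation of the replicates.
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["AAC", "ACCA", "AAAA", "CAC"]  # toy input, not real phage proteins
    mean, err = bootstrap_dipeptide_error(toy, "AA")
    print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins keeps residues from the same protein together, so the spread reflects protein-to-protein variation in composition.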

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski