Amino acid dipeptide frequency for Vibrio phage ValB1MD-2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
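The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_permille`, the one-letter input alphabet, the choice of the total dipeptide count as the denominator, and the toy sequences are all hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter
from itertools import product

# Standard 20 amino acids in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: the denominator is the total number of dipeptides counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X so the alphabet has 21 letters.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs within one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

toy_proteins = ["MKLLA", "AAKL"]  # made-up sequences, not from the phage proteome
freqs = dipeptide_permille(toy_proteins)
overrepresented = sorted(dp for dp, f in freqs.items() if f > EXPECTED_PERMILLE)
print(overrepresented)
```

With real data, the same comparison against 2.5‰ would separate the "red" from the "blue" cells, assuming the site uses the uniform expectation as its threshold.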

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
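As a small worked example of the unit conversion, the sketch below reads one table entry and converts the per mille value to a fraction and a percentage. The helper name `parse_entry` and the returned keys are hypothetical; the pattern accepts an entry such as "AlaAla: 7.372 ± 0.979" and also tolerates a leading copy of the value fused in front of the dipeptide name.

```python
import re

# Matches e.g. "AlaAla: 7.372 ± 0.979"; a leading copy of the value
# ("7.372AlaAla: ...") is tolerated and ignored.
ENTRY = re.compile(
    r"^\s*(?:\d+(?:\.\d+)?)?([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*"
    r"(\d+(?:\.\d+)?)\s*±\s*(\d+(?:\.\d+)?)\s*$"
)


def parse_entry(line):
    """Parse one table entry and add the value as a fraction and a percentage."""
    match = ENTRY.match(line)
    if match is None:
        raise ValueError(f"not a dipeptide entry: {line!r}")
    first, second, value, error = match.groups()
    permille = float(value)
    return {
        "first": first,
        "second": second,
        "permille": permille,
        "error_permille": float(error),
        "fraction": permille * 1e-3,  # per mille -> fraction
        "percent": permille * 0.1,    # per mille -> percent
    }


entry = parse_entry("AlaAla: 7.372 ± 0.979")
print(entry["fraction"], entry["percent"])
```

Read this way, the AlaAla value of 7.372‰ corresponds to a fraction of about 0.0074, or roughly 0.74%, which is about three times the uniform expectation of 0.25%.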

For more information, see the sequence space article on Wikipedia.

Frequency (‰) ± error for each dipeptide, grouped by first residue. Second residue order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.372 ± 0.979
AlaCys: 0.637 ± 0.261
AlaAsp: 4.004 ± 0.598
AlaGlu: 6.189 ± 0.697
AlaPhe: 3.276 ± 0.597
AlaGly: 6.007 ± 0.909
AlaHis: 1.456 ± 0.355
AlaIle: 6.553 ± 0.794
AlaLys: 5.005 ± 1.06
AlaLeu: 7.008 ± 0.888
AlaMet: 1.638 ± 0.364
AlaAsn: 3.64 ± 0.479
AlaPro: 2.184 ± 0.411
AlaGln: 2.548 ± 0.414
AlaArg: 2.821 ± 0.531
AlaSer: 5.369 ± 0.764
AlaThr: 6.553 ± 1.067
AlaVal: 4.55 ± 0.881
AlaTrp: 1.365 ± 0.286
AlaTyr: 2.457 ± 0.369
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.637 ± 0.358
CysCys: 0.091 ± 0.082
CysAsp: 0.637 ± 0.245
CysGlu: 0.455 ± 0.193
CysPhe: 0.364 ± 0.166
CysGly: 0.637 ± 0.263
CysHis: 0.182 ± 0.138
CysIle: 0.637 ± 0.217
CysLys: 0.546 ± 0.209
CysLeu: 1.092 ± 0.375
CysMet: 0.273 ± 0.17
CysAsn: 0.273 ± 0.153
CysPro: 0.546 ± 0.229
CysGln: 0.546 ± 0.224
CysArg: 0.546 ± 0.202
CysSer: 1.001 ± 0.329
CysThr: 0.546 ± 0.237
CysVal: 0.819 ± 0.287
CysTrp: 0.0 ± 0.0
CysTyr: 0.273 ± 0.152
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.277 ± 0.435
AspCys: 0.546 ± 0.225
AspAsp: 3.094 ± 0.511
AspGlu: 3.64 ± 0.644
AspPhe: 3.276 ± 0.485
AspGly: 3.458 ± 0.514
AspHis: 1.092 ± 0.311
AspIle: 4.004 ± 0.678
AspLys: 4.55 ± 0.587
AspLeu: 4.914 ± 0.656
AspMet: 1.456 ± 0.306
AspAsn: 1.638 ± 0.455
AspPro: 2.548 ± 0.567
AspGln: 1.729 ± 0.446
AspArg: 2.639 ± 0.399
AspSer: 4.277 ± 0.645
AspThr: 1.911 ± 0.531
AspVal: 3.003 ± 0.481
AspTrp: 0.819 ± 0.293
AspTyr: 2.184 ± 0.436
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.278 ± 0.618
GluCys: 1.183 ± 0.35
GluAsp: 3.367 ± 0.61
GluGlu: 4.277 ± 0.75
GluPhe: 2.184 ± 0.526
GluGly: 2.73 ± 0.45
GluHis: 1.82 ± 0.438
GluIle: 3.367 ± 0.473
GluLys: 4.823 ± 0.863
GluLeu: 7.918 ± 0.766
GluMet: 2.184 ± 0.441
GluAsn: 3.094 ± 0.546
GluPro: 3.094 ± 0.542
GluGln: 4.368 ± 0.663
GluArg: 3.822 ± 0.726
GluSer: 3.822 ± 0.511
GluThr: 3.731 ± 0.669
GluVal: 5.096 ± 0.621
GluTrp: 1.092 ± 0.273
GluTyr: 2.457 ± 0.407
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.912 ± 0.574
PheCys: 0.364 ± 0.168
PheAsp: 2.366 ± 0.478
PheGlu: 3.458 ± 0.591
PhePhe: 0.91 ± 0.345
PheGly: 2.093 ± 0.531
PheHis: 0.455 ± 0.147
PheIle: 1.911 ± 0.472
PheLys: 2.912 ± 0.476
PheLeu: 2.821 ± 0.562
PheMet: 1.001 ± 0.253
PheAsn: 2.184 ± 0.456
PhePro: 1.365 ± 0.303
PheGln: 1.183 ± 0.274
PheArg: 2.093 ± 0.456
PheSer: 3.64 ± 0.746
PheThr: 2.912 ± 0.541
PheVal: 2.457 ± 0.461
PheTrp: 0.819 ± 0.323
PheTyr: 1.001 ± 0.315
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.458 ± 0.581
GlyCys: 1.183 ± 0.306
GlyAsp: 3.731 ± 0.669
GlyGlu: 4.368 ± 0.755
GlyPhe: 2.093 ± 0.511
GlyGly: 3.367 ± 0.609
GlyHis: 1.638 ± 0.394
GlyIle: 3.822 ± 0.786
GlyLys: 4.368 ± 0.612
GlyLeu: 4.095 ± 0.805
GlyMet: 1.82 ± 0.487
GlyAsn: 2.639 ± 0.382
GlyPro: 0.819 ± 0.289
GlyGln: 2.366 ± 0.515
GlyArg: 3.549 ± 0.61
GlySer: 4.368 ± 0.526
GlyThr: 3.549 ± 0.624
GlyVal: 4.459 ± 0.767
GlyTrp: 1.001 ± 0.272
GlyTyr: 1.729 ± 0.351
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.365 ± 0.358
HisCys: 0.273 ± 0.171
HisAsp: 0.91 ± 0.267
HisGlu: 1.729 ± 0.449
HisPhe: 1.092 ± 0.327
HisGly: 1.82 ± 0.356
HisHis: 0.637 ± 0.263
HisIle: 1.638 ± 0.392
HisLys: 1.183 ± 0.317
HisLeu: 2.275 ± 0.541
HisMet: 0.637 ± 0.25
HisAsn: 0.819 ± 0.319
HisPro: 0.728 ± 0.222
HisGln: 0.546 ± 0.251
HisArg: 1.001 ± 0.3
HisSer: 1.729 ± 0.418
HisThr: 1.547 ± 0.426
HisVal: 1.547 ± 0.364
HisTrp: 0.273 ± 0.15
HisTyr: 0.637 ± 0.292
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.19 ± 0.826
IleCys: 0.364 ± 0.221
IleAsp: 3.276 ± 0.535
IleGlu: 4.732 ± 0.804
IlePhe: 1.365 ± 0.335
IleGly: 3.549 ± 0.538
IleHis: 1.456 ± 0.402
IleIle: 3.094 ± 0.692
IleLys: 4.55 ± 0.687
IleLeu: 4.368 ± 0.605
IleMet: 1.001 ± 0.299
IleAsn: 2.093 ± 0.423
IlePro: 3.458 ± 0.603
IleGln: 2.457 ± 0.41
IleArg: 3.367 ± 0.463
IleSer: 3.458 ± 0.407
IleThr: 3.549 ± 0.65
IleVal: 2.821 ± 0.624
IleTrp: 0.91 ± 0.322
IleTyr: 2.184 ± 0.488
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.28 ± 0.716
LysCys: 0.273 ± 0.164
LysAsp: 3.458 ± 0.476
LysGlu: 4.823 ± 0.623
LysPhe: 2.821 ± 0.436
LysGly: 4.732 ± 0.579
LysHis: 2.002 ± 0.469
LysIle: 4.004 ± 0.679
LysLys: 4.823 ± 1.003
LysLeu: 8.464 ± 0.834
LysMet: 1.183 ± 0.315
LysAsn: 4.095 ± 0.695
LysPro: 3.367 ± 0.588
LysGln: 3.731 ± 0.574
LysArg: 4.732 ± 0.837
LysSer: 4.459 ± 0.799
LysThr: 3.094 ± 0.484
LysVal: 4.368 ± 0.818
LysTrp: 0.728 ± 0.281
LysTyr: 1.547 ± 0.461
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.644 ± 0.828
LeuCys: 1.001 ± 0.3
LeuAsp: 5.005 ± 0.623
LeuGlu: 6.28 ± 0.801
LeuPhe: 3.276 ± 0.676
LeuGly: 4.914 ± 0.717
LeuHis: 1.82 ± 0.557
LeuIle: 4.095 ± 0.501
LeuLys: 8.373 ± 1.059
LeuLeu: 7.281 ± 0.939
LeuMet: 2.366 ± 0.373
LeuAsn: 5.369 ± 0.836
LeuPro: 4.004 ± 0.573
LeuGln: 4.004 ± 0.597
LeuArg: 4.004 ± 0.466
LeuSer: 8.464 ± 0.899
LeuThr: 6.007 ± 0.872
LeuVal: 5.005 ± 0.614
LeuTrp: 1.001 ± 0.354
LeuTyr: 2.275 ± 0.434
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.184 ± 0.594
MetCys: 0.364 ± 0.192
MetAsp: 1.183 ± 0.307
MetGlu: 1.638 ± 0.348
MetPhe: 1.092 ± 0.29
MetGly: 0.91 ± 0.276
MetHis: 0.637 ± 0.194
MetIle: 0.91 ± 0.306
MetLys: 1.547 ± 0.312
MetLeu: 2.275 ± 0.36
MetMet: 0.455 ± 0.206
MetAsn: 1.092 ± 0.279
MetPro: 0.819 ± 0.221
MetGln: 0.91 ± 0.281
MetArg: 0.637 ± 0.253
MetSer: 2.093 ± 0.556
MetThr: 2.639 ± 0.474
MetVal: 1.82 ± 0.367
MetTrp: 0.182 ± 0.159
MetTyr: 0.546 ± 0.204
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.822 ± 0.623
AsnCys: 0.091 ± 0.092
AsnAsp: 2.912 ± 0.529
AsnGlu: 3.276 ± 0.601
AsnPhe: 1.365 ± 0.309
AsnGly: 3.185 ± 0.42
AsnHis: 1.001 ± 0.278
AsnIle: 2.73 ± 0.453
AsnLys: 2.821 ± 0.433
AsnLeu: 5.005 ± 0.748
AsnMet: 1.82 ± 0.397
AsnAsn: 2.275 ± 0.649
AsnPro: 2.002 ± 0.392
AsnGln: 2.639 ± 0.497
AsnArg: 2.639 ± 0.581
AsnSer: 3.276 ± 0.483
AsnThr: 2.912 ± 0.604
AsnVal: 2.457 ± 0.506
AsnTrp: 0.546 ± 0.194
AsnTyr: 1.638 ± 0.39
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.548 ± 0.6
ProCys: 0.546 ± 0.237
ProAsp: 2.821 ± 0.557
ProGlu: 3.731 ± 0.626
ProPhe: 1.729 ± 0.389
ProGly: 2.639 ± 0.639
ProHis: 0.819 ± 0.322
ProIle: 2.912 ± 0.456
ProLys: 2.912 ± 0.68
ProLeu: 2.548 ± 0.445
ProMet: 0.91 ± 0.302
ProAsn: 2.366 ± 0.401
ProPro: 1.365 ± 0.322
ProGln: 1.638 ± 0.384
ProArg: 1.547 ± 0.371
ProSer: 2.093 ± 0.423
ProThr: 2.73 ± 0.486
ProVal: 1.911 ± 0.497
ProTrp: 0.819 ± 0.27
ProTyr: 0.91 ± 0.285
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.822 ± 0.57
GlnCys: 0.364 ± 0.167
GlnAsp: 2.184 ± 0.476
GlnGlu: 2.457 ± 0.523
GlnPhe: 1.729 ± 0.451
GlnGly: 2.002 ± 0.334
GlnHis: 1.365 ± 0.298
GlnIle: 2.366 ± 0.412
GlnLys: 2.184 ± 0.369
GlnLeu: 3.913 ± 0.503
GlnMet: 1.274 ± 0.519
GlnAsn: 2.093 ± 0.315
GlnPro: 1.911 ± 0.333
GlnGln: 2.457 ± 0.577
GlnArg: 3.003 ± 0.555
GlnSer: 3.731 ± 0.576
GlnThr: 1.547 ± 0.392
GlnVal: 3.185 ± 0.594
GlnTrp: 1.092 ± 0.274
GlnTyr: 0.91 ± 0.241
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.913 ± 0.514
ArgCys: 0.273 ± 0.187
ArgAsp: 2.73 ± 0.531
ArgGlu: 2.457 ± 0.469
ArgPhe: 3.458 ± 0.583
ArgGly: 2.093 ± 0.615
ArgHis: 1.092 ± 0.304
ArgIle: 3.458 ± 0.694
ArgLys: 4.004 ± 0.662
ArgLeu: 4.914 ± 0.506
ArgMet: 0.728 ± 0.254
ArgAsn: 2.73 ± 0.442
ArgPro: 1.092 ± 0.357
ArgGln: 2.366 ± 0.456
ArgArg: 2.73 ± 0.675
ArgSer: 3.185 ± 0.533
ArgThr: 2.184 ± 0.414
ArgVal: 3.458 ± 0.57
ArgTrp: 0.728 ± 0.293
ArgTyr: 1.456 ± 0.339
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.369 ± 0.948
SerCys: 0.637 ± 0.258
SerAsp: 3.549 ± 0.403
SerGlu: 5.278 ± 0.607
SerPhe: 2.093 ± 0.345
SerGly: 4.732 ± 0.676
SerHis: 1.456 ± 0.327
SerIle: 4.368 ± 0.601
SerLys: 5.552 ± 0.685
SerLeu: 6.098 ± 0.695
SerMet: 1.82 ± 0.458
SerAsn: 3.367 ± 0.465
SerPro: 2.912 ± 0.329
SerGln: 2.457 ± 0.473
SerArg: 3.094 ± 0.546
SerSer: 3.64 ± 0.55
SerThr: 4.641 ± 0.784
SerVal: 4.732 ± 0.663
SerTrp: 0.546 ± 0.235
SerTyr: 2.366 ± 0.436
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.278 ± 0.706
ThrCys: 0.455 ± 0.235
ThrAsp: 3.549 ± 0.501
ThrGlu: 4.004 ± 0.649
ThrPhe: 2.184 ± 0.386
ThrGly: 3.458 ± 0.546
ThrHis: 1.183 ± 0.337
ThrIle: 3.731 ± 0.597
ThrLys: 5.187 ± 0.761
ThrLeu: 5.643 ± 0.869
ThrMet: 1.092 ± 0.482
ThrAsn: 3.458 ± 0.576
ThrPro: 2.73 ± 0.479
ThrGln: 2.457 ± 0.633
ThrArg: 1.82 ± 0.461
ThrSer: 3.731 ± 0.585
ThrThr: 3.367 ± 0.471
ThrVal: 3.003 ± 0.704
ThrTrp: 1.092 ± 0.26
ThrTyr: 1.911 ± 0.441
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.552 ± 0.637
ValCys: 0.728 ± 0.258
ValAsp: 4.459 ± 0.515
ValGlu: 4.186 ± 0.681
ValPhe: 2.366 ± 0.533
ValGly: 3.549 ± 0.687
ValHis: 1.274 ± 0.36
ValIle: 3.367 ± 0.422
ValLys: 4.095 ± 0.606
ValLeu: 5.552 ± 0.771
ValMet: 1.456 ± 0.438
ValAsn: 3.458 ± 0.569
ValPro: 2.73 ± 0.553
ValGln: 2.457 ± 0.429
ValArg: 2.093 ± 0.448
ValSer: 3.367 ± 0.499
ValThr: 3.913 ± 0.572
ValVal: 5.096 ± 0.784
ValTrp: 1.274 ± 0.272
ValTyr: 2.184 ± 0.435
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.91 ± 0.309
TrpCys: 0.182 ± 0.124
TrpAsp: 0.728 ± 0.272
TrpGlu: 0.91 ± 0.215
TrpPhe: 0.91 ± 0.279
TrpGly: 0.364 ± 0.173
TrpHis: 0.455 ± 0.182
TrpIle: 0.728 ± 0.245
TrpLys: 1.001 ± 0.299
TrpLeu: 1.638 ± 0.381
TrpMet: 0.182 ± 0.112
TrpAsn: 0.728 ± 0.264
TrpPro: 0.637 ± 0.231
TrpGln: 1.001 ± 0.275
TrpArg: 1.092 ± 0.351
TrpSer: 0.728 ± 0.217
TrpThr: 1.001 ± 0.341
TrpVal: 1.183 ± 0.261
TrpTrp: 0.364 ± 0.16
TrpTyr: 0.91 ± 0.285
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.911 ± 0.329
TyrCys: 0.546 ± 0.327
TyrAsp: 1.092 ± 0.263
TyrGlu: 1.82 ± 0.364
TyrPhe: 1.274 ± 0.357
TyrGly: 2.184 ± 0.468
TyrHis: 0.546 ± 0.287
TyrIle: 1.547 ± 0.39
TyrLys: 2.548 ± 0.458
TyrLeu: 3.367 ± 0.613
TyrMet: 0.546 ± 0.205
TyrAsn: 1.001 ± 0.332
TyrPro: 1.183 ± 0.309
TyrGln: 1.729 ± 0.621
TyrArg: 1.82 ± 0.416
TyrSer: 2.275 ± 0.482
TyrThr: 1.001 ± 0.303
TyrVal: 2.184 ± 0.497
TyrTrp: 1.001 ± 0.315
TyrTyr: 0.455 ± 0.22
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (10989 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
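A protein-level bootstrap of this kind can be sketched as follows. This is an assumption-laden illustration rather than the Proteome-pI procedure, which is not described here: the function names, the use of the standard deviation across resamples as the reported error, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, dipeptide, n_resamples=100, seed=0):
    """Mean and standard deviation (per mille) of one dipeptide's frequency
    across bootstrap resamples drawn at the protein level."""
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteins = ["MKLLAKL", "AAKLGG", "GGGSKL", "MSTAV"]  # made-up sequences
mean_kl, sd_kl = bootstrap_permille(toy_proteins, "KL")
print(f"KL: {mean_kl:.3f} ± {sd_kl:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps neighbouring residues together, so the spread reflects variation between proteins in the proteome.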

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski