Amino acid dipepetide frequency for Vibrio phage pYD21-A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.828AlaAla: 7.828 ± 0.981
1.128AlaCys: 1.128 ± 0.262
5.36AlaAsp: 5.36 ± 0.593
5.007AlaGlu: 5.007 ± 0.741
2.327AlaPhe: 2.327 ± 0.368
4.514AlaGly: 4.514 ± 0.513
1.834AlaHis: 1.834 ± 0.381
4.796AlaIle: 4.796 ± 0.579
5.924AlaLys: 5.924 ± 0.786
6.277AlaLeu: 6.277 ± 0.664
3.103AlaMet: 3.103 ± 0.689
3.456AlaAsn: 3.456 ± 0.5
2.751AlaPro: 2.751 ± 0.468
2.892AlaGln: 2.892 ± 0.381
5.078AlaArg: 5.078 ± 0.762
5.36AlaSer: 5.36 ± 0.921
4.796AlaThr: 4.796 ± 0.477
5.078AlaVal: 5.078 ± 0.736
1.199AlaTrp: 1.199 ± 0.308
2.962AlaTyr: 2.962 ± 0.412
0.0AlaXaa: 0.0 ± 0.0
Cys
0.705CysAla: 0.705 ± 0.25
0.353CysCys: 0.353 ± 0.202
0.705CysAsp: 0.705 ± 0.242
1.269CysGlu: 1.269 ± 0.395
0.423CysPhe: 0.423 ± 0.185
1.058CysGly: 1.058 ± 0.256
0.282CysHis: 0.282 ± 0.146
1.058CysIle: 1.058 ± 0.274
0.917CysLys: 0.917 ± 0.225
1.128CysLeu: 1.128 ± 0.344
0.282CysMet: 0.282 ± 0.129
0.917CysAsn: 0.917 ± 0.206
0.423CysPro: 0.423 ± 0.165
0.282CysGln: 0.282 ± 0.139
0.776CysArg: 0.776 ± 0.233
1.058CysSer: 1.058 ± 0.251
0.917CysThr: 0.917 ± 0.283
0.494CysVal: 0.494 ± 0.192
0.282CysTrp: 0.282 ± 0.129
0.705CysTyr: 0.705 ± 0.224
0.0CysXaa: 0.0 ± 0.0
Asp
5.642AspAla: 5.642 ± 0.583
0.635AspCys: 0.635 ± 0.186
3.033AspAsp: 3.033 ± 0.543
5.29AspGlu: 5.29 ± 0.649
1.693AspPhe: 1.693 ± 0.404
5.854AspGly: 5.854 ± 0.599
1.269AspHis: 1.269 ± 0.274
4.161AspIle: 4.161 ± 0.65
4.232AspLys: 4.232 ± 0.539
4.866AspLeu: 4.866 ± 0.53
1.834AspMet: 1.834 ± 0.382
2.892AspAsn: 2.892 ± 0.536
3.033AspPro: 3.033 ± 0.435
1.269AspGln: 1.269 ± 0.264
2.116AspArg: 2.116 ± 0.344
4.302AspSer: 4.302 ± 0.615
3.315AspThr: 3.315 ± 0.43
4.373AspVal: 4.373 ± 0.587
0.423AspTrp: 0.423 ± 0.182
2.892AspTyr: 2.892 ± 0.445
0.0AspXaa: 0.0 ± 0.0
Glu
5.007GluAla: 5.007 ± 0.695
1.199GluCys: 1.199 ± 0.274
3.95GluAsp: 3.95 ± 0.633
4.514GluGlu: 4.514 ± 0.493
4.232GluPhe: 4.232 ± 0.519
3.667GluGly: 3.667 ± 0.509
1.552GluHis: 1.552 ± 0.374
5.148GluIle: 5.148 ± 0.769
3.808GluLys: 3.808 ± 0.466
6.277GluLeu: 6.277 ± 0.67
2.116GluMet: 2.116 ± 0.402
3.244GluAsn: 3.244 ± 0.474
2.68GluPro: 2.68 ± 0.505
3.456GluGln: 3.456 ± 0.514
3.879GluArg: 3.879 ± 0.512
4.302GluSer: 4.302 ± 0.638
3.879GluThr: 3.879 ± 0.547
3.315GluVal: 3.315 ± 0.432
1.411GluTrp: 1.411 ± 0.267
3.103GluTyr: 3.103 ± 0.439
0.0GluXaa: 0.0 ± 0.0
Phe
2.609PheAla: 2.609 ± 0.388
0.564PheCys: 0.564 ± 0.183
3.174PheAsp: 3.174 ± 0.448
2.539PheGlu: 2.539 ± 0.509
1.058PhePhe: 1.058 ± 0.359
2.962PheGly: 2.962 ± 0.474
0.635PheHis: 0.635 ± 0.188
3.244PheIle: 3.244 ± 0.492
1.904PheLys: 1.904 ± 0.323
1.693PheLeu: 1.693 ± 0.332
1.128PheMet: 1.128 ± 0.278
2.327PheAsn: 2.327 ± 0.365
0.987PhePro: 0.987 ± 0.257
0.917PheGln: 0.917 ± 0.28
1.693PheArg: 1.693 ± 0.233
2.116PheSer: 2.116 ± 0.288
2.045PheThr: 2.045 ± 0.365
2.609PheVal: 2.609 ± 0.365
0.353PheTrp: 0.353 ± 0.194
1.128PheTyr: 1.128 ± 0.317
0.0PheXaa: 0.0 ± 0.0
Gly
5.713GlyAla: 5.713 ± 0.77
0.705GlyCys: 0.705 ± 0.227
4.584GlyAsp: 4.584 ± 0.574
4.796GlyGlu: 4.796 ± 0.568
3.174GlyPhe: 3.174 ± 0.425
6.347GlyGly: 6.347 ± 0.883
1.481GlyHis: 1.481 ± 0.384
3.667GlyIle: 3.667 ± 0.542
5.36GlyLys: 5.36 ± 0.763
5.431GlyLeu: 5.431 ± 0.616
2.116GlyMet: 2.116 ± 0.383
4.373GlyAsn: 4.373 ± 0.505
1.411GlyPro: 1.411 ± 0.312
2.045GlyGln: 2.045 ± 0.412
2.398GlyArg: 2.398 ± 0.318
4.655GlySer: 4.655 ± 0.585
4.161GlyThr: 4.161 ± 0.509
6.488GlyVal: 6.488 ± 0.754
1.058GlyTrp: 1.058 ± 0.251
3.244GlyTyr: 3.244 ± 0.443
0.0GlyXaa: 0.0 ± 0.0
His
1.904HisAla: 1.904 ± 0.341
0.071HisCys: 0.071 ± 0.064
1.481HisAsp: 1.481 ± 0.334
1.269HisGlu: 1.269 ± 0.296
0.846HisPhe: 0.846 ± 0.273
2.327HisGly: 2.327 ± 0.405
0.776HisHis: 0.776 ± 0.254
1.34HisIle: 1.34 ± 0.3
1.552HisLys: 1.552 ± 0.408
0.917HisLeu: 0.917 ± 0.393
0.282HisMet: 0.282 ± 0.136
0.846HisAsn: 0.846 ± 0.236
0.705HisPro: 0.705 ± 0.187
0.846HisGln: 0.846 ± 0.208
0.776HisArg: 0.776 ± 0.228
1.481HisSer: 1.481 ± 0.338
0.423HisThr: 0.423 ± 0.144
1.411HisVal: 1.411 ± 0.305
0.212HisTrp: 0.212 ± 0.128
1.058HisTyr: 1.058 ± 0.312
0.0HisXaa: 0.0 ± 0.0
Ile
4.937IleAla: 4.937 ± 0.518
0.705IleCys: 0.705 ± 0.217
5.148IleAsp: 5.148 ± 0.652
5.36IleGlu: 5.36 ± 0.599
1.622IlePhe: 1.622 ± 0.347
4.937IleGly: 4.937 ± 0.561
1.128IleHis: 1.128 ± 0.312
2.398IleIle: 2.398 ± 0.407
4.514IleLys: 4.514 ± 0.574
3.103IleLeu: 3.103 ± 0.496
1.269IleMet: 1.269 ± 0.3
4.091IleAsn: 4.091 ± 0.517
1.904IlePro: 1.904 ± 0.345
1.693IleGln: 1.693 ± 0.309
1.622IleArg: 1.622 ± 0.304
4.161IleSer: 4.161 ± 0.478
4.584IleThr: 4.584 ± 0.54
4.02IleVal: 4.02 ± 0.439
0.705IleTrp: 0.705 ± 0.177
2.045IleTyr: 2.045 ± 0.396
0.0IleXaa: 0.0 ± 0.0
Lys
4.584LysAla: 4.584 ± 0.663
0.846LysCys: 0.846 ± 0.37
3.526LysAsp: 3.526 ± 0.429
4.373LysGlu: 4.373 ± 0.602
2.257LysPhe: 2.257 ± 0.426
2.821LysGly: 2.821 ± 0.47
1.269LysHis: 1.269 ± 0.314
3.738LysIle: 3.738 ± 0.525
4.302LysLys: 4.302 ± 0.65
5.783LysLeu: 5.783 ± 0.611
2.398LysMet: 2.398 ± 0.424
2.468LysAsn: 2.468 ± 0.357
2.609LysPro: 2.609 ± 0.458
3.315LysGln: 3.315 ± 0.527
3.103LysArg: 3.103 ± 0.483
4.796LysSer: 4.796 ± 0.598
3.103LysThr: 3.103 ± 0.442
4.725LysVal: 4.725 ± 0.692
1.199LysTrp: 1.199 ± 0.316
2.327LysTyr: 2.327 ± 0.343
0.0LysXaa: 0.0 ± 0.0
Leu
6.841LeuAla: 6.841 ± 0.765
1.411LeuCys: 1.411 ± 0.31
5.148LeuAsp: 5.148 ± 0.605
6.065LeuGlu: 6.065 ± 0.865
2.116LeuPhe: 2.116 ± 0.405
5.36LeuGly: 5.36 ± 0.517
1.622LeuHis: 1.622 ± 0.316
4.02LeuIle: 4.02 ± 0.416
4.091LeuLys: 4.091 ± 0.533
4.232LeuLeu: 4.232 ± 0.526
1.763LeuMet: 1.763 ± 0.388
3.667LeuAsn: 3.667 ± 0.54
3.033LeuPro: 3.033 ± 0.438
1.904LeuGln: 1.904 ± 0.342
4.091LeuArg: 4.091 ± 0.586
4.796LeuSer: 4.796 ± 0.543
5.148LeuThr: 5.148 ± 0.508
4.584LeuVal: 4.584 ± 0.557
0.635LeuTrp: 0.635 ± 0.169
2.398LeuTyr: 2.398 ± 0.412
0.0LeuXaa: 0.0 ± 0.0
Met
3.385MetAla: 3.385 ± 0.545
0.282MetCys: 0.282 ± 0.136
1.128MetAsp: 1.128 ± 0.384
1.763MetGlu: 1.763 ± 0.466
0.846MetPhe: 0.846 ± 0.212
1.128MetGly: 1.128 ± 0.254
0.635MetHis: 0.635 ± 0.216
2.045MetIle: 2.045 ± 0.407
2.045MetLys: 2.045 ± 0.351
1.975MetLeu: 1.975 ± 0.393
0.635MetMet: 0.635 ± 0.247
1.128MetAsn: 1.128 ± 0.274
1.411MetPro: 1.411 ± 0.339
1.411MetGln: 1.411 ± 0.334
1.34MetArg: 1.34 ± 0.287
2.186MetSer: 2.186 ± 0.427
2.116MetThr: 2.116 ± 0.453
1.622MetVal: 1.622 ± 0.517
0.282MetTrp: 0.282 ± 0.141
1.199MetTyr: 1.199 ± 0.227
0.0MetXaa: 0.0 ± 0.0
Asn
5.572AsnAla: 5.572 ± 0.849
0.635AsnCys: 0.635 ± 0.205
2.962AsnAsp: 2.962 ± 0.569
3.738AsnGlu: 3.738 ± 0.56
0.705AsnPhe: 0.705 ± 0.201
5.078AsnGly: 5.078 ± 0.709
1.269AsnHis: 1.269 ± 0.36
2.468AsnIle: 2.468 ± 0.348
4.373AsnLys: 4.373 ± 0.602
2.892AsnLeu: 2.892 ± 0.364
0.917AsnMet: 0.917 ± 0.204
2.892AsnAsn: 2.892 ± 0.409
2.257AsnPro: 2.257 ± 0.361
2.045AsnGln: 2.045 ± 0.355
1.904AsnArg: 1.904 ± 0.417
3.95AsnSer: 3.95 ± 0.621
2.751AsnThr: 2.751 ± 0.444
3.103AsnVal: 3.103 ± 0.602
0.635AsnTrp: 0.635 ± 0.179
1.622AsnTyr: 1.622 ± 0.315
0.0AsnXaa: 0.0 ± 0.0
Pro
2.468ProAla: 2.468 ± 0.387
0.423ProCys: 0.423 ± 0.199
2.962ProAsp: 2.962 ± 0.459
2.821ProGlu: 2.821 ± 0.503
1.269ProPhe: 1.269 ± 0.319
1.411ProGly: 1.411 ± 0.37
0.635ProHis: 0.635 ± 0.231
2.398ProIle: 2.398 ± 0.371
2.045ProLys: 2.045 ± 0.379
2.398ProLeu: 2.398 ± 0.435
1.128ProMet: 1.128 ± 0.3
2.257ProAsn: 2.257 ± 0.415
1.481ProPro: 1.481 ± 0.299
1.199ProGln: 1.199 ± 0.318
1.34ProArg: 1.34 ± 0.345
2.257ProSer: 2.257 ± 0.357
2.609ProThr: 2.609 ± 0.377
3.315ProVal: 3.315 ± 0.39
0.494ProTrp: 0.494 ± 0.154
1.693ProTyr: 1.693 ± 0.328
0.0ProXaa: 0.0 ± 0.0
Gln
3.033GlnAla: 3.033 ± 0.476
0.635GlnCys: 0.635 ± 0.165
1.552GlnAsp: 1.552 ± 0.355
1.411GlnGlu: 1.411 ± 0.3
1.199GlnPhe: 1.199 ± 0.267
2.327GlnGly: 2.327 ± 0.519
0.635GlnHis: 0.635 ± 0.202
2.468GlnIle: 2.468 ± 0.451
1.904GlnLys: 1.904 ± 0.398
3.667GlnLeu: 3.667 ± 0.472
1.269GlnMet: 1.269 ± 0.278
1.411GlnAsn: 1.411 ± 0.257
1.058GlnPro: 1.058 ± 0.27
2.116GlnGln: 2.116 ± 0.474
2.186GlnArg: 2.186 ± 0.333
1.481GlnSer: 1.481 ± 0.293
2.257GlnThr: 2.257 ± 0.455
2.398GlnVal: 2.398 ± 0.438
0.494GlnTrp: 0.494 ± 0.171
1.552GlnTyr: 1.552 ± 0.306
0.0GlnXaa: 0.0 ± 0.0
Arg
3.808ArgAla: 3.808 ± 0.407
0.846ArgCys: 0.846 ± 0.249
2.257ArgAsp: 2.257 ± 0.355
3.244ArgGlu: 3.244 ± 0.574
2.116ArgPhe: 2.116 ± 0.4
3.244ArgGly: 3.244 ± 0.58
1.058ArgHis: 1.058 ± 0.331
2.539ArgIle: 2.539 ± 0.426
2.186ArgLys: 2.186 ± 0.406
3.738ArgLeu: 3.738 ± 0.481
2.045ArgMet: 2.045 ± 0.352
1.975ArgAsn: 1.975 ± 0.395
1.834ArgPro: 1.834 ± 0.318
2.045ArgGln: 2.045 ± 0.375
2.398ArgArg: 2.398 ± 0.546
1.834ArgSer: 1.834 ± 0.444
3.385ArgThr: 3.385 ± 0.473
3.526ArgVal: 3.526 ± 0.456
0.705ArgTrp: 0.705 ± 0.189
1.904ArgTyr: 1.904 ± 0.399
0.0ArgXaa: 0.0 ± 0.0
Ser
4.937SerAla: 4.937 ± 0.603
1.128SerCys: 1.128 ± 0.284
4.161SerAsp: 4.161 ± 0.481
4.655SerGlu: 4.655 ± 0.475
2.609SerPhe: 2.609 ± 0.498
5.572SerGly: 5.572 ± 0.747
0.917SerHis: 0.917 ± 0.26
3.667SerIle: 3.667 ± 0.645
3.526SerLys: 3.526 ± 0.473
5.783SerLeu: 5.783 ± 0.535
1.693SerMet: 1.693 ± 0.287
4.091SerAsn: 4.091 ± 0.558
2.327SerPro: 2.327 ± 0.395
2.327SerGln: 2.327 ± 0.492
3.103SerArg: 3.103 ± 0.476
3.879SerSer: 3.879 ± 0.559
3.244SerThr: 3.244 ± 0.535
4.937SerVal: 4.937 ± 0.554
1.199SerTrp: 1.199 ± 0.258
2.327SerTyr: 2.327 ± 0.387
0.0SerXaa: 0.0 ± 0.0
Thr
4.796ThrAla: 4.796 ± 0.594
0.987ThrCys: 0.987 ± 0.3
3.244ThrAsp: 3.244 ± 0.582
3.315ThrGlu: 3.315 ± 0.494
2.751ThrPhe: 2.751 ± 0.468
5.854ThrGly: 5.854 ± 0.789
0.917ThrHis: 0.917 ± 0.282
3.95ThrIle: 3.95 ± 0.538
3.244ThrLys: 3.244 ± 0.483
4.584ThrLeu: 4.584 ± 0.631
1.693ThrMet: 1.693 ± 0.495
2.751ThrAsn: 2.751 ± 0.475
3.103ThrPro: 3.103 ± 0.468
1.693ThrGln: 1.693 ± 0.402
2.751ThrArg: 2.751 ± 0.35
4.937ThrSer: 4.937 ± 0.706
3.808ThrThr: 3.808 ± 0.452
3.597ThrVal: 3.597 ± 0.498
0.494ThrTrp: 0.494 ± 0.181
2.116ThrTyr: 2.116 ± 0.488
0.0ThrXaa: 0.0 ± 0.0
Val
4.725ValAla: 4.725 ± 0.628
0.635ValCys: 0.635 ± 0.207
5.078ValAsp: 5.078 ± 0.48
5.642ValGlu: 5.642 ± 0.553
2.257ValPhe: 2.257 ± 0.366
5.219ValGly: 5.219 ± 0.49
0.776ValHis: 0.776 ± 0.259
4.443ValIle: 4.443 ± 0.58
4.514ValLys: 4.514 ± 0.665
3.95ValLeu: 3.95 ± 0.569
1.411ValMet: 1.411 ± 0.278
4.302ValAsn: 4.302 ± 0.616
1.763ValPro: 1.763 ± 0.393
1.552ValGln: 1.552 ± 0.324
3.244ValArg: 3.244 ± 0.444
5.219ValSer: 5.219 ± 0.781
4.796ValThr: 4.796 ± 0.6
3.95ValVal: 3.95 ± 0.628
1.269ValTrp: 1.269 ± 0.257
2.257ValTyr: 2.257 ± 0.39
0.0ValXaa: 0.0 ± 0.0
Trp
0.564TrpAla: 0.564 ± 0.179
0.212TrpCys: 0.212 ± 0.115
0.564TrpAsp: 0.564 ± 0.173
0.987TrpGlu: 0.987 ± 0.359
0.494TrpPhe: 0.494 ± 0.22
0.635TrpGly: 0.635 ± 0.241
0.494TrpHis: 0.494 ± 0.176
0.917TrpIle: 0.917 ± 0.22
0.705TrpLys: 0.705 ± 0.22
1.128TrpLeu: 1.128 ± 0.291
0.353TrpMet: 0.353 ± 0.145
0.776TrpAsn: 0.776 ± 0.238
0.282TrpPro: 0.282 ± 0.171
0.353TrpGln: 0.353 ± 0.15
1.058TrpArg: 1.058 ± 0.302
1.269TrpSer: 1.269 ± 0.301
0.705TrpThr: 0.705 ± 0.23
1.552TrpVal: 1.552 ± 0.38
0.212TrpTrp: 0.212 ± 0.12
0.564TrpTyr: 0.564 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.68TyrAla: 2.68 ± 0.414
0.635TyrCys: 0.635 ± 0.204
3.103TyrAsp: 3.103 ± 0.42
2.68TyrGlu: 2.68 ± 0.409
1.693TyrPhe: 1.693 ± 0.329
2.892TyrGly: 2.892 ± 0.53
1.34TyrHis: 1.34 ± 0.429
1.481TyrIle: 1.481 ± 0.281
2.398TyrLys: 2.398 ± 0.433
3.103TyrLeu: 3.103 ± 0.479
1.058TyrMet: 1.058 ± 0.28
1.904TyrAsn: 1.904 ± 0.339
1.552TyrPro: 1.552 ± 0.326
1.693TyrGln: 1.693 ± 0.361
1.834TyrArg: 1.834 ± 0.347
2.045TyrSer: 2.045 ± 0.334
2.539TyrThr: 2.539 ± 0.411
1.904TyrVal: 1.904 ± 0.386
0.564TyrTrp: 0.564 ± 0.231
1.34TyrTyr: 1.34 ± 0.294
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (14180 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski