Amino acid dipepetide frequency for Escherichia phage VB_EcoS-Golestan

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.156AlaAla: 10.156 ± 1.528
1.359AlaCys: 1.359 ± 0.326
6.294AlaAsp: 6.294 ± 0.716
7.009AlaGlu: 7.009 ± 0.76
4.148AlaPhe: 4.148 ± 0.566
7.724AlaGly: 7.724 ± 0.737
1.573AlaHis: 1.573 ± 0.315
5.364AlaIle: 5.364 ± 0.712
5.65AlaLys: 5.65 ± 0.708
7.367AlaLeu: 7.367 ± 0.718
2.36AlaMet: 2.36 ± 0.414
3.576AlaAsn: 3.576 ± 0.442
3.791AlaPro: 3.791 ± 0.49
3.791AlaGln: 3.791 ± 0.633
4.005AlaArg: 4.005 ± 0.505
6.365AlaSer: 6.365 ± 0.998
6.222AlaThr: 6.222 ± 0.838
6.651AlaVal: 6.651 ± 0.627
1.43AlaTrp: 1.43 ± 0.292
3.004AlaTyr: 3.004 ± 0.506
0.0AlaXaa: 0.0 ± 0.0
Cys
1.001CysAla: 1.001 ± 0.267
0.072CysCys: 0.072 ± 0.072
0.715CysAsp: 0.715 ± 0.207
1.073CysGlu: 1.073 ± 0.311
0.215CysPhe: 0.215 ± 0.124
1.216CysGly: 1.216 ± 0.325
0.143CysHis: 0.143 ± 0.108
0.429CysIle: 0.429 ± 0.158
0.715CysLys: 0.715 ± 0.249
0.572CysLeu: 0.572 ± 0.208
0.0CysMet: 0.0 ± 0.0
0.501CysAsn: 0.501 ± 0.198
0.572CysPro: 0.572 ± 0.236
0.358CysGln: 0.358 ± 0.146
0.787CysArg: 0.787 ± 0.258
0.572CysSer: 0.572 ± 0.245
0.787CysThr: 0.787 ± 0.274
0.787CysVal: 0.787 ± 0.228
0.215CysTrp: 0.215 ± 0.107
0.358CysTyr: 0.358 ± 0.162
0.0CysXaa: 0.0 ± 0.0
Asp
6.794AspAla: 6.794 ± 0.738
0.572AspCys: 0.572 ± 0.187
4.72AspAsp: 4.72 ± 0.661
4.506AspGlu: 4.506 ± 0.628
2.932AspPhe: 2.932 ± 0.45
6.508AspGly: 6.508 ± 0.917
0.715AspHis: 0.715 ± 0.235
3.862AspIle: 3.862 ± 0.415
2.861AspLys: 2.861 ± 0.337
4.577AspLeu: 4.577 ± 0.52
1.43AspMet: 1.43 ± 0.29
2.789AspAsn: 2.789 ± 0.485
1.645AspPro: 1.645 ± 0.335
0.715AspGln: 0.715 ± 0.205
2.718AspArg: 2.718 ± 0.446
2.789AspSer: 2.789 ± 0.397
4.005AspThr: 4.005 ± 0.584
3.934AspVal: 3.934 ± 0.547
1.073AspTrp: 1.073 ± 0.257
2.146AspTyr: 2.146 ± 0.405
0.0AspXaa: 0.0 ± 0.0
Glu
6.365GluAla: 6.365 ± 0.741
0.429GluCys: 0.429 ± 0.177
3.505GluAsp: 3.505 ± 0.511
5.722GluGlu: 5.722 ± 1.034
2.789GluPhe: 2.789 ± 0.419
4.291GluGly: 4.291 ± 0.536
0.93GluHis: 0.93 ± 0.27
3.29GluIle: 3.29 ± 0.524
4.148GluLys: 4.148 ± 0.632
6.008GluLeu: 6.008 ± 0.693
3.004GluMet: 3.004 ± 0.462
1.931GluAsn: 1.931 ± 0.455
2.146GluPro: 2.146 ± 0.435
3.862GluGln: 3.862 ± 0.984
3.719GluArg: 3.719 ± 0.607
3.004GluSer: 3.004 ± 0.428
3.361GluThr: 3.361 ± 0.564
4.863GluVal: 4.863 ± 0.603
1.359GluTrp: 1.359 ± 0.327
2.36GluTyr: 2.36 ± 0.386
0.0GluXaa: 0.0 ± 0.0
Phe
2.718PheAla: 2.718 ± 0.408
0.501PheCys: 0.501 ± 0.202
3.719PheAsp: 3.719 ± 0.623
2.503PheGlu: 2.503 ± 0.43
1.144PhePhe: 1.144 ± 0.301
3.29PheGly: 3.29 ± 0.483
0.501PheHis: 0.501 ± 0.194
2.575PheIle: 2.575 ± 0.412
1.931PheLys: 1.931 ± 0.364
2.503PheLeu: 2.503 ± 0.452
0.572PheMet: 0.572 ± 0.214
2.003PheAsn: 2.003 ± 0.315
1.287PhePro: 1.287 ± 0.324
1.073PheGln: 1.073 ± 0.243
2.146PheArg: 2.146 ± 0.311
3.361PheSer: 3.361 ± 0.612
2.861PheThr: 2.861 ± 0.339
2.861PheVal: 2.861 ± 0.564
0.715PheTrp: 0.715 ± 0.228
1.073PheTyr: 1.073 ± 0.264
0.0PheXaa: 0.0 ± 0.0
Gly
7.438GlyAla: 7.438 ± 0.863
1.359GlyCys: 1.359 ± 0.299
4.434GlyAsp: 4.434 ± 0.477
5.149GlyGlu: 5.149 ± 0.623
3.576GlyPhe: 3.576 ± 0.564
5.793GlyGly: 5.793 ± 0.8
1.287GlyHis: 1.287 ± 0.379
3.218GlyIle: 3.218 ± 0.588
5.078GlyLys: 5.078 ± 0.646
5.293GlyLeu: 5.293 ± 0.718
2.36GlyMet: 2.36 ± 0.531
4.148GlyAsn: 4.148 ± 0.576
2.146GlyPro: 2.146 ± 0.387
2.432GlyGln: 2.432 ± 0.422
3.862GlyArg: 3.862 ± 0.388
5.436GlySer: 5.436 ± 0.929
4.363GlyThr: 4.363 ± 0.607
5.722GlyVal: 5.722 ± 0.78
1.001GlyTrp: 1.001 ± 0.256
2.789GlyTyr: 2.789 ± 0.43
0.0GlyXaa: 0.0 ± 0.0
His
1.502HisAla: 1.502 ± 0.362
0.572HisCys: 0.572 ± 0.188
0.93HisAsp: 0.93 ± 0.246
0.93HisGlu: 0.93 ± 0.292
0.858HisPhe: 0.858 ± 0.253
1.001HisGly: 1.001 ± 0.274
0.644HisHis: 0.644 ± 0.239
1.073HisIle: 1.073 ± 0.305
1.216HisLys: 1.216 ± 0.348
1.573HisLeu: 1.573 ± 0.408
0.358HisMet: 0.358 ± 0.146
0.93HisAsn: 0.93 ± 0.285
0.715HisPro: 0.715 ± 0.239
0.787HisGln: 0.787 ± 0.238
1.287HisArg: 1.287 ± 0.259
0.644HisSer: 0.644 ± 0.224
0.572HisThr: 0.572 ± 0.28
1.073HisVal: 1.073 ± 0.302
0.143HisTrp: 0.143 ± 0.099
0.715HisTyr: 0.715 ± 0.19
0.0HisXaa: 0.0 ± 0.0
Ile
5.507IleAla: 5.507 ± 0.705
0.715IleCys: 0.715 ± 0.22
3.862IleAsp: 3.862 ± 0.578
3.004IleGlu: 3.004 ± 0.491
1.287IlePhe: 1.287 ± 0.323
3.576IleGly: 3.576 ± 0.513
0.572IleHis: 0.572 ± 0.188
2.718IleIle: 2.718 ± 0.41
3.361IleLys: 3.361 ± 0.55
3.361IleLeu: 3.361 ± 0.583
0.93IleMet: 0.93 ± 0.335
3.147IleAsn: 3.147 ± 0.559
3.004IlePro: 3.004 ± 0.434
1.931IleGln: 1.931 ± 0.446
1.931IleArg: 1.931 ± 0.357
3.004IleSer: 3.004 ± 0.439
4.649IleThr: 4.649 ± 0.684
4.077IleVal: 4.077 ± 0.415
1.001IleTrp: 1.001 ± 0.275
1.573IleTyr: 1.573 ± 0.316
0.0IleXaa: 0.0 ± 0.0
Lys
6.222LysAla: 6.222 ± 0.943
0.501LysCys: 0.501 ± 0.175
3.29LysAsp: 3.29 ± 0.559
3.361LysGlu: 3.361 ± 0.757
2.36LysPhe: 2.36 ± 0.418
3.361LysGly: 3.361 ± 0.497
1.43LysHis: 1.43 ± 0.284
2.289LysIle: 2.289 ± 0.456
3.29LysLys: 3.29 ± 0.548
4.291LysLeu: 4.291 ± 0.533
2.646LysMet: 2.646 ± 0.581
2.36LysAsn: 2.36 ± 0.525
2.789LysPro: 2.789 ± 0.427
2.503LysGln: 2.503 ± 0.459
3.791LysArg: 3.791 ± 0.593
3.29LysSer: 3.29 ± 0.597
3.147LysThr: 3.147 ± 0.432
3.791LysVal: 3.791 ± 0.535
0.715LysTrp: 0.715 ± 0.228
2.432LysTyr: 2.432 ± 0.427
0.0LysXaa: 0.0 ± 0.0
Leu
7.581LeuAla: 7.581 ± 0.744
0.644LeuCys: 0.644 ± 0.218
3.791LeuAsp: 3.791 ± 0.557
4.935LeuGlu: 4.935 ± 0.773
2.074LeuPhe: 2.074 ± 0.433
5.149LeuGly: 5.149 ± 0.554
1.645LeuHis: 1.645 ± 0.36
4.434LeuIle: 4.434 ± 0.436
4.434LeuLys: 4.434 ± 0.63
4.935LeuLeu: 4.935 ± 0.67
1.645LeuMet: 1.645 ± 0.301
4.434LeuAsn: 4.434 ± 0.722
3.576LeuPro: 3.576 ± 0.493
3.218LeuGln: 3.218 ± 0.468
5.364LeuArg: 5.364 ± 0.676
4.22LeuSer: 4.22 ± 0.522
4.863LeuThr: 4.863 ± 0.714
4.792LeuVal: 4.792 ± 0.631
1.359LeuTrp: 1.359 ± 0.382
2.217LeuTyr: 2.217 ± 0.368
0.0LeuXaa: 0.0 ± 0.0
Met
2.789MetAla: 2.789 ± 0.456
0.501MetCys: 0.501 ± 0.142
0.93MetAsp: 0.93 ± 0.27
1.144MetGlu: 1.144 ± 0.321
1.144MetPhe: 1.144 ± 0.256
1.788MetGly: 1.788 ± 0.361
0.501MetHis: 0.501 ± 0.165
1.716MetIle: 1.716 ± 0.428
1.86MetLys: 1.86 ± 0.36
1.86MetLeu: 1.86 ± 0.395
0.644MetMet: 0.644 ± 0.226
1.073MetAsn: 1.073 ± 0.258
1.216MetPro: 1.216 ± 0.295
0.93MetGln: 0.93 ± 0.24
1.359MetArg: 1.359 ± 0.271
2.217MetSer: 2.217 ± 0.358
2.074MetThr: 2.074 ± 0.37
1.502MetVal: 1.502 ± 0.291
0.286MetTrp: 0.286 ± 0.121
0.358MetTyr: 0.358 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
4.22AsnAla: 4.22 ± 0.563
0.429AsnCys: 0.429 ± 0.219
2.861AsnAsp: 2.861 ± 0.396
2.646AsnGlu: 2.646 ± 0.394
1.287AsnPhe: 1.287 ± 0.276
4.434AsnGly: 4.434 ± 0.464
0.358AsnHis: 0.358 ± 0.157
2.575AsnIle: 2.575 ± 0.46
2.432AsnLys: 2.432 ± 0.353
3.433AsnLeu: 3.433 ± 0.503
0.787AsnMet: 0.787 ± 0.244
2.575AsnAsn: 2.575 ± 0.409
1.931AsnPro: 1.931 ± 0.424
1.43AsnGln: 1.43 ± 0.307
2.36AsnArg: 2.36 ± 0.5
2.503AsnSer: 2.503 ± 0.354
2.503AsnThr: 2.503 ± 0.498
3.433AsnVal: 3.433 ± 0.408
0.644AsnTrp: 0.644 ± 0.203
1.359AsnTyr: 1.359 ± 0.346
0.0AsnXaa: 0.0 ± 0.0
Pro
3.862ProAla: 3.862 ± 0.48
0.572ProCys: 0.572 ± 0.182
2.646ProAsp: 2.646 ± 0.489
3.576ProGlu: 3.576 ± 0.476
1.716ProPhe: 1.716 ± 0.369
2.718ProGly: 2.718 ± 0.461
0.858ProHis: 0.858 ± 0.259
2.146ProIle: 2.146 ± 0.411
1.502ProLys: 1.502 ± 0.318
3.004ProLeu: 3.004 ± 0.538
0.858ProMet: 0.858 ± 0.314
1.359ProAsn: 1.359 ± 0.34
1.144ProPro: 1.144 ± 0.279
1.216ProGln: 1.216 ± 0.262
1.716ProArg: 1.716 ± 0.365
2.217ProSer: 2.217 ± 0.457
2.289ProThr: 2.289 ± 0.422
4.148ProVal: 4.148 ± 0.467
0.429ProTrp: 0.429 ± 0.175
1.359ProTyr: 1.359 ± 0.34
0.0ProXaa: 0.0 ± 0.0
Gln
3.934GlnAla: 3.934 ± 0.595
0.358GlnCys: 0.358 ± 0.147
2.074GlnAsp: 2.074 ± 0.449
2.289GlnGlu: 2.289 ± 0.492
1.645GlnPhe: 1.645 ± 0.306
2.003GlnGly: 2.003 ± 0.395
1.073GlnHis: 1.073 ± 0.301
1.931GlnIle: 1.931 ± 0.409
2.718GlnLys: 2.718 ± 0.513
3.648GlnLeu: 3.648 ± 0.497
1.073GlnMet: 1.073 ± 0.317
1.359GlnAsn: 1.359 ± 0.371
1.502GlnPro: 1.502 ± 0.31
1.931GlnGln: 1.931 ± 0.576
2.146GlnArg: 2.146 ± 0.35
1.86GlnSer: 1.86 ± 0.366
2.289GlnThr: 2.289 ± 0.43
2.432GlnVal: 2.432 ± 0.415
0.644GlnTrp: 0.644 ± 0.203
1.573GlnTyr: 1.573 ± 0.386
0.0GlnXaa: 0.0 ± 0.0
Arg
4.077ArgAla: 4.077 ± 0.595
0.286ArgCys: 0.286 ± 0.134
3.218ArgAsp: 3.218 ± 0.457
3.648ArgGlu: 3.648 ± 0.612
1.86ArgPhe: 1.86 ± 0.322
3.862ArgGly: 3.862 ± 0.457
1.144ArgHis: 1.144 ± 0.27
2.575ArgIle: 2.575 ± 0.454
4.077ArgLys: 4.077 ± 0.589
4.434ArgLeu: 4.434 ± 0.567
2.003ArgMet: 2.003 ± 0.404
2.432ArgAsn: 2.432 ± 0.339
1.502ArgPro: 1.502 ± 0.324
3.218ArgGln: 3.218 ± 0.471
5.006ArgArg: 5.006 ± 0.747
2.503ArgSer: 2.503 ± 0.326
2.289ArgThr: 2.289 ± 0.414
3.433ArgVal: 3.433 ± 0.446
0.787ArgTrp: 0.787 ± 0.257
1.86ArgTyr: 1.86 ± 0.323
0.0ArgXaa: 0.0 ± 0.0
Ser
5.078SerAla: 5.078 ± 1.023
0.286SerCys: 0.286 ± 0.131
3.576SerAsp: 3.576 ± 0.481
3.433SerGlu: 3.433 ± 0.469
2.789SerPhe: 2.789 ± 0.429
6.937SerGly: 6.937 ± 1.399
1.073SerHis: 1.073 ± 0.24
3.791SerIle: 3.791 ± 0.546
3.147SerLys: 3.147 ± 0.516
4.077SerLeu: 4.077 ± 0.545
1.287SerMet: 1.287 ± 0.324
3.075SerAsn: 3.075 ± 0.457
2.36SerPro: 2.36 ± 0.467
1.502SerGln: 1.502 ± 0.332
2.932SerArg: 2.932 ± 0.493
2.575SerSer: 2.575 ± 0.426
4.005SerThr: 4.005 ± 0.746
4.005SerVal: 4.005 ± 0.635
0.572SerTrp: 0.572 ± 0.185
1.716SerTyr: 1.716 ± 0.324
0.0SerXaa: 0.0 ± 0.0
Thr
6.365ThrAla: 6.365 ± 0.925
0.572ThrCys: 0.572 ± 0.227
3.576ThrAsp: 3.576 ± 0.583
4.22ThrGlu: 4.22 ± 0.466
3.075ThrPhe: 3.075 ± 0.494
5.507ThrGly: 5.507 ± 0.697
0.93ThrHis: 0.93 ± 0.312
3.004ThrIle: 3.004 ± 0.366
3.29ThrLys: 3.29 ± 0.562
4.935ThrLeu: 4.935 ± 0.576
1.073ThrMet: 1.073 ± 0.256
1.716ThrAsn: 1.716 ± 0.357
4.148ThrPro: 4.148 ± 0.513
2.003ThrGln: 2.003 ± 0.398
2.861ThrArg: 2.861 ± 0.474
3.361ThrSer: 3.361 ± 0.57
4.363ThrThr: 4.363 ± 0.667
4.649ThrVal: 4.649 ± 0.773
0.858ThrTrp: 0.858 ± 0.278
2.217ThrTyr: 2.217 ± 0.452
0.0ThrXaa: 0.0 ± 0.0
Val
7.581ValAla: 7.581 ± 0.801
0.858ValCys: 0.858 ± 0.292
4.077ValAsp: 4.077 ± 0.467
5.221ValGlu: 5.221 ± 0.67
2.289ValPhe: 2.289 ± 0.385
3.934ValGly: 3.934 ± 0.46
0.93ValHis: 0.93 ± 0.252
3.862ValIle: 3.862 ± 0.522
3.29ValLys: 3.29 ± 0.528
4.935ValLeu: 4.935 ± 0.757
1.573ValMet: 1.573 ± 0.399
2.646ValAsn: 2.646 ± 0.478
2.289ValPro: 2.289 ± 0.364
3.147ValGln: 3.147 ± 0.604
3.433ValArg: 3.433 ± 0.45
5.293ValSer: 5.293 ± 0.783
5.149ValThr: 5.149 ± 0.6
5.65ValVal: 5.65 ± 0.744
1.287ValTrp: 1.287 ± 0.305
3.075ValTyr: 3.075 ± 0.528
0.0ValXaa: 0.0 ± 0.0
Trp
1.216TrpAla: 1.216 ± 0.347
0.072TrpCys: 0.072 ± 0.076
0.644TrpAsp: 0.644 ± 0.187
0.572TrpGlu: 0.572 ± 0.213
1.001TrpPhe: 1.001 ± 0.285
1.216TrpGly: 1.216 ± 0.249
0.429TrpHis: 0.429 ± 0.205
0.429TrpIle: 0.429 ± 0.168
1.001TrpLys: 1.001 ± 0.293
1.86TrpLeu: 1.86 ± 0.35
0.858TrpMet: 0.858 ± 0.225
0.715TrpAsn: 0.715 ± 0.218
0.358TrpPro: 0.358 ± 0.191
0.644TrpGln: 0.644 ± 0.223
1.001TrpArg: 1.001 ± 0.258
0.858TrpSer: 0.858 ± 0.269
0.644TrpThr: 0.644 ± 0.175
0.858TrpVal: 0.858 ± 0.245
0.358TrpTrp: 0.358 ± 0.171
0.501TrpTyr: 0.501 ± 0.217
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.433TyrAla: 3.433 ± 0.504
0.429TyrCys: 0.429 ± 0.174
2.432TyrAsp: 2.432 ± 0.482
2.146TyrGlu: 2.146 ± 0.389
1.144TyrPhe: 1.144 ± 0.293
2.575TyrGly: 2.575 ± 0.363
0.787TyrHis: 0.787 ± 0.22
1.931TyrIle: 1.931 ± 0.357
1.931TyrLys: 1.931 ± 0.453
2.646TyrLeu: 2.646 ± 0.504
0.358TyrMet: 0.358 ± 0.157
1.43TyrAsn: 1.43 ± 0.269
1.216TyrPro: 1.216 ± 0.332
1.788TyrGln: 1.788 ± 0.35
1.788TyrArg: 1.788 ± 0.387
2.146TyrSer: 2.146 ± 0.398
2.289TyrThr: 2.289 ± 0.478
1.86TyrVal: 1.86 ± 0.342
0.358TyrTrp: 0.358 ± 0.173
1.073TyrTyr: 1.073 ± 0.202
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (13983 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski