Amino acid dipeptide frequency for Escherichia phage P2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
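As an illustration of what such a calculation might look like, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs of adjacent residues within each protein (never across two proteins) and that frequencies are normalised by the total number of counted pairs; the exact procedure used by the database is not stated here, so both choices are assumptions. The function name `dipeptide_frequencies` and the toy sequences (one-letter codes) are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping pairs are counted within each protein and
    normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: two short, made-up sequences in one-letter code.
freqs = dipeptide_frequencies(["MKVLA", "AAK"])
```

With this toy input six pairs are counted (MK, KV, VL, LA, AA, AK), each exactly once, so every pair gets the same frequency and the values sum to 1000‰.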

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
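The unit conversion and the comparison with the uniform expectation can be sketched in a few lines of Python. This is only an illustration: the page does not state the exact rule used for the red/blue marking, so the simple "above or below 1/400" comparison below is an assumption, and the names `EXPECTED_PER_MILLE`, `per_mille_to_fraction` and `classify` are hypothetical.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation.

    Assumption: a plain comparison, ignoring the bootstrap error.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Two values taken from the table: AlaLeu and CysHis.
ala_leu = classify(11.135)   # above 2.5 per mille
cys_his = classify(0.103)    # below 2.5 per mille
```

For example, the AlaLeu value of 11.135‰ corresponds to a fraction of about 0.0111 and lies above the 2.5‰ expectation, while CysHis at 0.103‰ lies below it.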

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.413 ± 2.258
AlaCys: 0.722 ± 0.288
AlaAsp: 5.774 ± 0.784
AlaGlu: 4.743 ± 0.737
AlaPhe: 3.402 ± 0.671
AlaGly: 9.176 ± 1.053
AlaHis: 1.856 ± 0.316
AlaIle: 4.743 ± 0.806
AlaLys: 4.433 ± 0.762
AlaLeu: 11.135 ± 1.287
AlaMet: 2.784 ± 0.544
AlaAsn: 2.371 ± 0.418
AlaPro: 5.774 ± 0.709
AlaGln: 4.124 ± 0.786
AlaArg: 4.743 ± 0.982
AlaSer: 7.836 ± 0.938
AlaThr: 6.496 ± 0.928
AlaVal: 8.042 ± 1.085
AlaTrp: 1.65 ± 0.402
AlaTyr: 2.371 ± 0.44
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.928 ± 0.318
CysCys: 0.0 ± 0.0
CysAsp: 0.722 ± 0.297
CysGlu: 0.516 ± 0.246
CysPhe: 0.206 ± 0.134
CysGly: 0.516 ± 0.219
CysHis: 0.103 ± 0.098
CysIle: 0.516 ± 0.227
CysLys: 0.309 ± 0.174
CysLeu: 0.722 ± 0.261
CysMet: 0.309 ± 0.178
CysAsn: 0.309 ± 0.14
CysPro: 0.516 ± 0.238
CysGln: 0.928 ± 0.301
CysArg: 0.928 ± 0.278
CysSer: 0.722 ± 0.254
CysThr: 0.722 ± 0.266
CysVal: 0.516 ± 0.23
CysTrp: 0.206 ± 0.166
CysTyr: 0.412 ± 0.2
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.011 ± 0.841
AspCys: 0.309 ± 0.156
AspAsp: 3.402 ± 0.636
AspGlu: 3.506 ± 0.721
AspPhe: 3.093 ± 0.693
AspGly: 4.846 ± 0.534
AspHis: 0.516 ± 0.19
AspIle: 3.609 ± 0.733
AspLys: 2.474 ± 0.541
AspLeu: 4.433 ± 0.641
AspMet: 0.619 ± 0.237
AspAsn: 1.547 ± 0.337
AspPro: 1.959 ± 0.631
AspGln: 1.65 ± 0.474
AspArg: 2.578 ± 0.682
AspSer: 2.578 ± 0.447
AspThr: 3.815 ± 0.647
AspVal: 3.093 ± 0.57
AspTrp: 0.516 ± 0.232
AspTyr: 2.681 ± 0.524
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.258 ± 0.8
GluCys: 0.412 ± 0.2
GluAsp: 1.959 ± 0.477
GluGlu: 3.402 ± 0.572
GluPhe: 2.165 ± 0.426
GluGly: 2.578 ± 0.496
GluHis: 1.237 ± 0.456
GluIle: 3.299 ± 0.437
GluLys: 3.609 ± 0.617
GluLeu: 7.836 ± 0.845
GluMet: 2.268 ± 0.398
GluAsn: 2.99 ± 0.684
GluPro: 3.299 ± 0.67
GluGln: 2.578 ± 0.527
GluArg: 4.537 ± 0.975
GluSer: 3.918 ± 0.585
GluThr: 3.402 ± 0.626
GluVal: 3.918 ± 0.685
GluTrp: 0.825 ± 0.287
GluTyr: 2.268 ± 0.518
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.578 ± 0.437
PheCys: 0.516 ± 0.201
PheAsp: 1.443 ± 0.383
PheGlu: 2.062 ± 0.45
PhePhe: 1.443 ± 0.47
PheGly: 1.547 ± 0.38
PheHis: 0.722 ± 0.293
PheIle: 1.547 ± 0.412
PheLys: 2.165 ± 0.462
PheLeu: 3.196 ± 0.559
PheMet: 0.825 ± 0.301
PheAsn: 1.65 ± 0.403
PhePro: 1.856 ± 0.529
PheGln: 1.34 ± 0.338
PheArg: 2.062 ± 0.461
PheSer: 2.268 ± 0.468
PheThr: 2.784 ± 0.491
PheVal: 1.443 ± 0.414
PheTrp: 0.722 ± 0.314
PheTyr: 1.134 ± 0.379
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.98 ± 0.854
GlyCys: 1.134 ± 0.482
GlyAsp: 4.537 ± 0.51
GlyGlu: 4.949 ± 0.761
GlyPhe: 2.165 ± 0.526
GlyGly: 5.877 ± 0.98
GlyHis: 0.516 ± 0.234
GlyIle: 3.918 ± 0.639
GlyLys: 5.568 ± 0.671
GlyLeu: 4.949 ± 0.661
GlyMet: 1.959 ± 0.643
GlyAsn: 2.371 ± 0.334
GlyPro: 0.722 ± 0.408
GlyGln: 1.959 ± 0.494
GlyArg: 5.361 ± 0.722
GlySer: 3.402 ± 0.565
GlyThr: 3.918 ± 0.782
GlyVal: 5.464 ± 0.82
GlyTrp: 1.443 ± 0.349
GlyTyr: 1.753 ± 0.4
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.856 ± 0.506
HisCys: 0.412 ± 0.206
HisAsp: 0.825 ± 0.321
HisGlu: 0.825 ± 0.341
HisPhe: 0.309 ± 0.225
HisGly: 1.443 ± 0.379
HisHis: 0.516 ± 0.203
HisIle: 1.237 ± 0.347
HisLys: 0.825 ± 0.236
HisLeu: 1.856 ± 0.41
HisMet: 0.619 ± 0.302
HisAsn: 0.928 ± 0.374
HisPro: 0.928 ± 0.32
HisGln: 0.825 ± 0.283
HisArg: 1.134 ± 0.311
HisSer: 0.619 ± 0.258
HisThr: 0.722 ± 0.293
HisVal: 0.619 ± 0.251
HisTrp: 0.309 ± 0.18
HisTyr: 0.516 ± 0.198
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.155 ± 0.874
IleCys: 0.412 ± 0.214
IleAsp: 3.402 ± 0.628
IleGlu: 3.196 ± 0.554
IlePhe: 1.856 ± 0.516
IleGly: 3.609 ± 0.701
IleHis: 0.516 ± 0.231
IleIle: 3.402 ± 0.518
IleLys: 1.856 ± 0.542
IleLeu: 2.99 ± 0.569
IleMet: 1.031 ± 0.309
IleAsn: 2.784 ± 0.62
IlePro: 2.887 ± 0.592
IleGln: 1.856 ± 0.339
IleArg: 5.052 ± 0.617
IleSer: 4.846 ± 0.541
IleThr: 4.227 ± 0.513
IleVal: 3.506 ± 0.528
IleTrp: 0.825 ± 0.245
IleTyr: 1.237 ± 0.357
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.155 ± 0.62
LysCys: 0.412 ± 0.156
LysAsp: 1.237 ± 0.372
LysGlu: 2.784 ± 0.6
LysPhe: 1.65 ± 0.354
LysGly: 2.784 ± 0.5
LysHis: 1.34 ± 0.352
LysIle: 2.474 ± 0.53
LysLys: 3.815 ± 0.818
LysLeu: 6.496 ± 0.927
LysMet: 0.825 ± 0.268
LysAsn: 2.887 ± 0.754
LysPro: 2.99 ± 0.591
LysGln: 1.547 ± 0.359
LysArg: 3.712 ± 0.654
LysSer: 2.578 ± 0.476
LysThr: 3.299 ± 0.599
LysVal: 3.299 ± 0.563
LysTrp: 1.134 ± 0.394
LysTyr: 2.784 ± 0.541
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.692 ± 1.036
LeuCys: 0.928 ± 0.285
LeuAsp: 5.568 ± 0.667
LeuGlu: 6.083 ± 0.731
LeuPhe: 3.196 ± 0.682
LeuGly: 4.124 ± 0.682
LeuHis: 1.65 ± 0.335
LeuIle: 5.464 ± 0.744
LeuLys: 5.98 ± 0.91
LeuLeu: 5.98 ± 0.686
LeuMet: 3.506 ± 0.527
LeuAsn: 4.743 ± 0.595
LeuPro: 4.743 ± 0.84
LeuGln: 3.196 ± 0.54
LeuArg: 5.774 ± 0.641
LeuSer: 6.805 ± 0.851
LeuThr: 7.32 ± 0.907
LeuVal: 4.433 ± 0.815
LeuTrp: 1.134 ± 0.306
LeuTyr: 2.268 ± 0.598
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.196 ± 0.506
MetCys: 0.206 ± 0.141
MetAsp: 0.825 ± 0.293
MetGlu: 1.134 ± 0.273
MetPhe: 0.722 ± 0.292
MetGly: 0.928 ± 0.317
MetHis: 0.722 ± 0.273
MetIle: 1.34 ± 0.357
MetLys: 1.443 ± 0.38
MetLeu: 3.299 ± 0.791
MetMet: 1.031 ± 0.299
MetAsn: 2.062 ± 0.502
MetPro: 1.34 ± 0.317
MetGln: 0.309 ± 0.207
MetArg: 2.165 ± 0.537
MetSer: 1.547 ± 0.312
MetThr: 3.093 ± 0.572
MetVal: 1.237 ± 0.376
MetTrp: 0.412 ± 0.259
MetTyr: 0.516 ± 0.214
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.021 ± 0.624
AsnCys: 0.516 ± 0.185
AsnAsp: 2.268 ± 0.517
AsnGlu: 2.681 ± 0.623
AsnPhe: 1.237 ± 0.354
AsnGly: 3.506 ± 0.676
AsnHis: 0.722 ± 0.251
AsnIle: 2.165 ± 0.396
AsnLys: 1.856 ± 0.417
AsnLeu: 3.402 ± 0.553
AsnMet: 0.825 ± 0.268
AsnAsn: 1.753 ± 0.459
AsnPro: 1.753 ± 0.432
AsnGln: 1.65 ± 0.416
AsnArg: 2.784 ± 0.657
AsnSer: 2.371 ± 0.526
AsnThr: 2.474 ± 0.461
AsnVal: 1.959 ± 0.293
AsnTrp: 0.206 ± 0.141
AsnTyr: 1.134 ± 0.285
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.052 ± 0.828
ProCys: 0.206 ± 0.148
ProAsp: 3.609 ± 0.628
ProGlu: 3.712 ± 0.685
ProPhe: 1.443 ± 0.566
ProGly: 2.887 ± 0.638
ProHis: 1.134 ± 0.394
ProIle: 2.165 ± 0.652
ProLys: 2.578 ± 0.607
ProLeu: 4.64 ± 0.635
ProMet: 0.722 ± 0.285
ProAsn: 1.237 ± 0.329
ProPro: 1.959 ± 0.526
ProGln: 1.856 ± 0.429
ProArg: 2.474 ± 0.678
ProSer: 2.887 ± 0.665
ProThr: 2.681 ± 0.63
ProVal: 5.361 ± 0.805
ProTrp: 0.619 ± 0.235
ProTyr: 1.031 ± 0.296
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.743 ± 1.229
GlnCys: 0.309 ± 0.189
GlnAsp: 1.753 ± 0.445
GlnGlu: 2.371 ± 0.474
GlnPhe: 0.825 ± 0.301
GlnGly: 1.753 ± 0.342
GlnHis: 0.619 ± 0.262
GlnIle: 2.681 ± 0.565
GlnLys: 2.062 ± 0.492
GlnLeu: 3.918 ± 0.619
GlnMet: 1.134 ± 0.335
GlnAsn: 0.928 ± 0.313
GlnPro: 1.547 ± 0.422
GlnGln: 2.062 ± 0.471
GlnArg: 4.124 ± 0.614
GlnSer: 2.681 ± 0.59
GlnThr: 1.959 ± 0.398
GlnVal: 1.856 ± 0.415
GlnTrp: 0.516 ± 0.255
GlnTyr: 0.722 ± 0.264
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.98 ± 0.733
ArgCys: 1.031 ± 0.292
ArgAsp: 3.093 ± 0.507
ArgGlu: 4.846 ± 0.642
ArgPhe: 1.65 ± 0.384
ArgGly: 4.021 ± 0.832
ArgHis: 1.34 ± 0.351
ArgIle: 3.815 ± 0.621
ArgLys: 3.299 ± 0.551
ArgLeu: 5.774 ± 0.735
ArgMet: 1.547 ± 0.446
ArgAsn: 3.093 ± 0.707
ArgPro: 2.784 ± 0.575
ArgGln: 3.918 ± 0.758
ArgArg: 4.227 ± 0.766
ArgSer: 2.887 ± 0.845
ArgThr: 3.712 ± 0.546
ArgVal: 5.155 ± 0.94
ArgTrp: 0.928 ± 0.287
ArgTyr: 2.474 ± 0.594
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.702 ± 0.906
SerCys: 0.516 ± 0.191
SerAsp: 3.402 ± 0.644
SerGlu: 4.021 ± 0.559
SerPhe: 2.062 ± 0.564
SerGly: 4.64 ± 0.723
SerHis: 1.134 ± 0.588
SerIle: 2.165 ± 0.414
SerLys: 3.196 ± 0.454
SerLeu: 6.392 ± 1.065
SerMet: 1.753 ± 0.472
SerAsn: 1.856 ± 0.44
SerPro: 3.093 ± 0.742
SerGln: 2.268 ± 0.382
SerArg: 3.712 ± 0.722
SerSer: 2.474 ± 0.544
SerThr: 4.433 ± 0.682
SerVal: 4.949 ± 0.808
SerTrp: 0.619 ± 0.224
SerTyr: 0.516 ± 0.304
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.527 ± 1.236
ThrCys: 0.825 ± 0.314
ThrAsp: 4.227 ± 0.62
ThrGlu: 3.506 ± 0.619
ThrPhe: 2.474 ± 0.461
ThrGly: 7.011 ± 0.984
ThrHis: 0.825 ± 0.309
ThrIle: 3.402 ± 0.534
ThrLys: 2.578 ± 0.589
ThrLeu: 6.496 ± 0.902
ThrMet: 2.474 ± 0.435
ThrAsn: 1.65 ± 0.402
ThrPro: 4.124 ± 0.521
ThrGln: 2.165 ± 0.559
ThrArg: 4.33 ± 0.462
ThrSer: 3.402 ± 0.417
ThrThr: 3.712 ± 0.706
ThrVal: 3.815 ± 0.677
ThrTrp: 0.412 ± 0.207
ThrTyr: 0.825 ± 0.289
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.805 ± 1.076
ValCys: 1.134 ± 0.385
ValAsp: 4.227 ± 0.694
ValGlu: 4.433 ± 0.79
ValPhe: 2.165 ± 0.525
ValGly: 4.537 ± 0.736
ValHis: 0.619 ± 0.262
ValIle: 3.712 ± 0.644
ValLys: 4.021 ± 0.567
ValLeu: 5.671 ± 0.763
ValMet: 1.65 ± 0.395
ValAsn: 2.681 ± 0.52
ValPro: 2.784 ± 0.534
ValGln: 2.165 ± 0.468
ValArg: 3.093 ± 0.619
ValSer: 4.124 ± 0.537
ValThr: 5.155 ± 0.982
ValVal: 4.846 ± 0.693
ValTrp: 0.516 ± 0.334
ValTyr: 1.443 ± 0.483
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.237 ± 0.287
TrpCys: 0.0 ± 0.0
TrpAsp: 1.031 ± 0.342
TrpGlu: 0.825 ± 0.279
TrpPhe: 0.206 ± 0.139
TrpGly: 0.412 ± 0.256
TrpHis: 0.516 ± 0.239
TrpIle: 0.619 ± 0.249
TrpLys: 0.722 ± 0.256
TrpLeu: 1.753 ± 0.405
TrpMet: 0.619 ± 0.275
TrpAsn: 0.722 ± 0.369
TrpPro: 1.443 ± 0.402
TrpGln: 0.516 ± 0.214
TrpArg: 1.134 ± 0.352
TrpSer: 0.619 ± 0.28
TrpThr: 0.309 ± 0.162
TrpVal: 0.619 ± 0.216
TrpTrp: 0.619 ± 0.243
TrpTyr: 0.412 ± 0.222
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.299 ± 0.581
TyrCys: 0.103 ± 0.099
TyrAsp: 0.928 ± 0.303
TyrGlu: 2.165 ± 0.447
TyrPhe: 1.031 ± 0.366
TyrGly: 1.856 ± 0.439
TyrHis: 0.722 ± 0.22
TyrIle: 2.371 ± 0.426
TyrLys: 0.412 ± 0.186
TyrLeu: 1.65 ± 0.323
TyrMet: 0.928 ± 0.319
TyrAsn: 0.722 ± 0.24
TyrPro: 1.753 ± 0.407
TyrGln: 1.65 ± 0.463
TyrArg: 1.65 ± 0.403
TyrSer: 1.34 ± 0.349
TyrThr: 1.547 ± 0.436
TyrVal: 1.65 ± 0.391
TyrTrp: 0.722 ± 0.287
TyrTyr: 0.825 ± 0.246
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (9700 amino acids)
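To work with the listing above programmatically, entries of the form "AlaLeu: 11.135 ± 1.287" can be picked out with a regular expression. The sketch below is one possible way to do this and is not part of the database tooling; the function name `parse_dipeptide_lines` is hypothetical. It assumes three-letter residue codes and the "mean ± error" layout shown in the table, and it ignores anything on the line before the dipeptide name.

```python
import re

# Two three-letter residue codes, a colon, then "mean ± error".
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)"
)


def parse_dipeptide_lines(lines):
    """Map (first, second) residue codes to (mean, error) in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)
        if match is None:
            continue  # row labels such as "Ala" or free text are skipped
        first, second, mean, error = match.groups()
        table[(first, second)] = (float(mean), float(error))
    return table


# Three lines in the layout of the table, including a row label.
parsed = parse_dipeptide_lines([
    "Ala",
    "AlaLeu: 11.135 ± 1.287",
    "CysHis: 0.103 ± 0.098",
])
```

The resulting dictionary can then be filtered or sorted, for instance to list the most frequent dipeptides.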

Note: The error has been estimated by bootstrapping (x100) at the protein level.
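Bootstrapping at the protein level can be illustrated as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resampled set, and the spread of the resulting estimates serves as the error. The Python sketch below is a minimal illustration under these assumptions and is not the database's actual code. The helper names, the use of the standard deviation as the error measure, the fixed random seed and the toy sequences are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide (overlapping pairs, assumed)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome: three made-up sequences in one-letter code.
toy = ["MKVLA", "AAK", "GGAAL"]
error_aa = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.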

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski