Amino acid dipeptide frequency for Enterobacteria phage fiAA91-ss

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
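The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: it assumes overlapping pairs are counted within each protein, that sequences use one-letter codes (the table uses three-letter codes), and that "expected" means the uniform 1/400 value. The names `dipeptide_per_mille` and `classify` are hypothetical.

```python
from collections import Counter

# Uniform expectation: 1/400 of all dipeptides, i.e. 2.5 per mille.
UNIFORM_PER_MILLE = 1000 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the expected value (assumed uniform)."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "as expected"


# Toy input (made up for illustration, not phage data).
toy = ["AAGL", "GLA"]
freqs = dipeptide_per_mille(toy)
```

Applied to values from the table, `classify(9.701)` (AlaAla) gives "over" and `classify(0.792)` (AlaCys) gives "under" with respect to the 2.5‰ uniform expectation.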

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
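As a small worked example of the unit conversion, the snippet below turns one table value (AlaAla, 9.701‰) into a fraction and compares it with the uniform 1/400 expectation. The helper name `per_mille_to_fraction` is hypothetical and only serves the illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = per_mille_to_fraction(9.701)  # AlaAla entry from the table
uniform = 1 / 400                       # 0.25% expressed as a fraction
ratio = ala_ala / uniform               # how many times the uniform value
```

Here `ratio` comes out at roughly 3.9, i.e. AlaAla is observed almost four times as often as a uniform distribution over 400 dipeptides would suggest.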

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.701 ± 1.688
AlaCys: 0.792 ± 0.31
AlaAsp: 6.335 ± 0.839
AlaGlu: 5.049 ± 0.755
AlaPhe: 3.069 ± 0.565
AlaGly: 8.018 ± 0.957
AlaHis: 1.782 ± 0.344
AlaIle: 3.564 ± 0.62
AlaLys: 4.851 ± 0.702
AlaLeu: 9.998 ± 1.048
AlaMet: 1.98 ± 0.395
AlaAsn: 2.376 ± 0.493
AlaPro: 4.455 ± 0.563
AlaGln: 3.564 ± 0.795
AlaArg: 4.95 ± 0.854
AlaSer: 7.523 ± 0.83
AlaThr: 6.731 ± 1.052
AlaVal: 7.523 ± 0.935
AlaTrp: 1.287 ± 0.335
AlaTyr: 3.069 ± 0.541
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.594 ± 0.229
CysCys: 0.0 ± 0.0
CysAsp: 0.495 ± 0.18
CysGlu: 0.297 ± 0.136
CysPhe: 0.198 ± 0.149
CysGly: 0.693 ± 0.247
CysHis: 0.099 ± 0.101
CysIle: 0.396 ± 0.185
CysLys: 0.396 ± 0.182
CysLeu: 0.495 ± 0.194
CysMet: 0.594 ± 0.257
CysAsn: 0.198 ± 0.125
CysPro: 0.495 ± 0.209
CysGln: 0.792 ± 0.268
CysArg: 1.089 ± 0.282
CysSer: 0.891 ± 0.259
CysThr: 0.693 ± 0.29
CysVal: 0.891 ± 0.325
CysTrp: 0.099 ± 0.091
CysTyr: 0.297 ± 0.179
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.236 ± 0.8
AspCys: 0.396 ± 0.188
AspAsp: 2.871 ± 0.526
AspGlu: 4.257 ± 0.834
AspPhe: 3.465 ± 0.756
AspGly: 4.554 ± 0.619
AspHis: 0.693 ± 0.214
AspIle: 4.554 ± 0.879
AspLys: 2.079 ± 0.407
AspLeu: 3.762 ± 0.551
AspMet: 0.594 ± 0.191
AspAsn: 1.782 ± 0.395
AspPro: 1.485 ± 0.357
AspGln: 1.485 ± 0.506
AspArg: 2.376 ± 0.606
AspSer: 3.465 ± 0.709
AspThr: 3.762 ± 0.568
AspVal: 3.366 ± 0.737
AspTrp: 0.99 ± 0.364
AspTyr: 2.574 ± 0.558
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.95 ± 0.57
GluCys: 0.297 ± 0.182
GluAsp: 1.98 ± 0.447
GluGlu: 3.564 ± 0.649
GluPhe: 1.881 ± 0.306
GluGly: 2.277 ± 0.371
GluHis: 1.683 ± 0.474
GluIle: 2.97 ± 0.628
GluLys: 4.059 ± 0.6
GluLeu: 7.721 ± 0.701
GluMet: 2.178 ± 0.429
GluAsn: 3.366 ± 0.685
GluPro: 2.871 ± 0.661
GluGln: 3.168 ± 0.567
GluArg: 4.851 ± 0.776
GluSer: 4.455 ± 0.682
GluThr: 3.069 ± 0.522
GluVal: 4.455 ± 0.651
GluTrp: 0.891 ± 0.265
GluTyr: 1.881 ± 0.406
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.772 ± 0.519
PheCys: 0.495 ± 0.18
PheAsp: 1.881 ± 0.407
PheGlu: 2.376 ± 0.492
PhePhe: 1.485 ± 0.396
PheGly: 1.485 ± 0.337
PheHis: 0.792 ± 0.301
PheIle: 1.386 ± 0.402
PheLys: 2.97 ± 0.491
PheLeu: 3.861 ± 0.569
PheMet: 1.089 ± 0.27
PheAsn: 1.881 ± 0.496
PhePro: 1.386 ± 0.423
PheGln: 1.089 ± 0.31
PheArg: 2.079 ± 0.441
PheSer: 2.277 ± 0.481
PheThr: 3.366 ± 0.496
PheVal: 1.287 ± 0.336
PheTrp: 0.693 ± 0.309
PheTyr: 1.683 ± 0.551
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.642 ± 1.117
GlyCys: 1.188 ± 0.329
GlyAsp: 3.861 ± 0.497
GlyGlu: 4.257 ± 0.591
GlyPhe: 2.871 ± 0.566
GlyGly: 5.741 ± 0.909
GlyHis: 0.792 ± 0.228
GlyIle: 3.762 ± 0.555
GlyLys: 5.444 ± 0.698
GlyLeu: 4.851 ± 0.533
GlyMet: 2.178 ± 0.444
GlyAsn: 2.079 ± 0.379
GlyPro: 0.495 ± 0.31
GlyGln: 2.574 ± 0.498
GlyArg: 4.554 ± 0.622
GlySer: 3.564 ± 0.651
GlyThr: 3.861 ± 0.696
GlyVal: 5.345 ± 0.718
GlyTrp: 1.386 ± 0.275
GlyTyr: 1.98 ± 0.376
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.079 ± 0.512
HisCys: 0.297 ± 0.164
HisAsp: 0.891 ± 0.29
HisGlu: 1.089 ± 0.393
HisPhe: 0.396 ± 0.256
HisGly: 1.485 ± 0.454
HisHis: 0.693 ± 0.223
HisIle: 1.485 ± 0.466
HisLys: 0.99 ± 0.306
HisLeu: 2.079 ± 0.416
HisMet: 0.495 ± 0.19
HisAsn: 0.891 ± 0.385
HisPro: 0.99 ± 0.327
HisGln: 1.089 ± 0.335
HisArg: 1.188 ± 0.306
HisSer: 0.594 ± 0.215
HisThr: 0.792 ± 0.279
HisVal: 0.99 ± 0.394
HisTrp: 0.198 ± 0.116
HisTyr: 0.594 ± 0.203
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.554 ± 0.539
IleCys: 0.198 ± 0.149
IleAsp: 3.564 ± 0.651
IleGlu: 3.168 ± 0.561
IlePhe: 1.683 ± 0.416
IleGly: 3.762 ± 0.669
IleHis: 0.693 ± 0.277
IleIle: 3.366 ± 0.572
IleLys: 2.277 ± 0.497
IleLeu: 3.267 ± 0.616
IleMet: 1.089 ± 0.306
IleAsn: 2.574 ± 0.458
IlePro: 2.97 ± 0.551
IleGln: 1.683 ± 0.419
IleArg: 4.95 ± 0.552
IleSer: 4.752 ± 0.582
IleThr: 4.257 ± 0.582
IleVal: 3.168 ± 0.481
IleTrp: 0.891 ± 0.256
IleTyr: 1.485 ± 0.377
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.345 ± 0.682
LysCys: 0.396 ± 0.167
LysAsp: 1.98 ± 0.569
LysGlu: 2.97 ± 0.542
LysPhe: 1.584 ± 0.325
LysGly: 2.97 ± 0.429
LysHis: 1.188 ± 0.355
LysIle: 2.376 ± 0.476
LysLys: 3.366 ± 0.705
LysLeu: 5.543 ± 0.876
LysMet: 0.891 ± 0.285
LysAsn: 3.663 ± 0.776
LysPro: 2.871 ± 0.538
LysGln: 2.079 ± 0.427
LysArg: 3.861 ± 0.557
LysSer: 2.673 ± 0.647
LysThr: 3.861 ± 0.605
LysVal: 3.465 ± 0.662
LysTrp: 0.891 ± 0.291
LysTyr: 2.772 ± 0.446
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.8 ± 0.977
LeuCys: 0.891 ± 0.248
LeuAsp: 5.049 ± 0.864
LeuGlu: 6.038 ± 0.68
LeuPhe: 3.564 ± 0.707
LeuGly: 4.257 ± 0.608
LeuHis: 2.178 ± 0.541
LeuIle: 5.246 ± 0.752
LeuLys: 6.137 ± 1.0
LeuLeu: 5.84 ± 0.936
LeuMet: 3.267 ± 0.512
LeuAsn: 4.851 ± 0.691
LeuPro: 4.455 ± 0.788
LeuGln: 3.366 ± 0.542
LeuArg: 4.851 ± 0.578
LeuSer: 6.632 ± 0.806
LeuThr: 6.83 ± 0.872
LeuVal: 4.059 ± 0.567
LeuTrp: 1.089 ± 0.298
LeuTyr: 2.079 ± 0.534
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.475 ± 0.418
MetCys: 0.297 ± 0.16
MetAsp: 0.792 ± 0.277
MetGlu: 1.386 ± 0.412
MetPhe: 0.891 ± 0.262
MetGly: 0.792 ± 0.287
MetHis: 0.693 ± 0.247
MetIle: 1.089 ± 0.28
MetLys: 1.188 ± 0.329
MetLeu: 3.069 ± 0.573
MetMet: 0.99 ± 0.279
MetAsn: 1.782 ± 0.417
MetPro: 0.891 ± 0.292
MetGln: 0.792 ± 0.257
MetArg: 2.178 ± 0.519
MetSer: 1.782 ± 0.302
MetThr: 3.168 ± 0.606
MetVal: 1.782 ± 0.34
MetTrp: 0.495 ± 0.217
MetTyr: 0.495 ± 0.208
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.97 ± 0.63
AsnCys: 0.396 ± 0.172
AsnAsp: 2.178 ± 0.525
AsnGlu: 1.881 ± 0.405
AsnPhe: 1.683 ± 0.424
AsnGly: 3.861 ± 0.634
AsnHis: 0.792 ± 0.305
AsnIle: 2.673 ± 0.429
AsnLys: 1.98 ± 0.515
AsnLeu: 2.871 ± 0.64
AsnMet: 0.99 ± 0.238
AsnAsn: 2.079 ± 0.379
AsnPro: 1.98 ± 0.452
AsnGln: 1.485 ± 0.392
AsnArg: 3.267 ± 0.604
AsnSer: 2.97 ± 0.585
AsnThr: 2.475 ± 0.425
AsnVal: 2.277 ± 0.292
AsnTrp: 0.693 ± 0.261
AsnTyr: 1.089 ± 0.241
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.455 ± 0.817
ProCys: 0.198 ± 0.149
ProAsp: 3.465 ± 0.535
ProGlu: 3.267 ± 0.656
ProPhe: 1.485 ± 0.458
ProGly: 2.871 ± 0.617
ProHis: 0.99 ± 0.421
ProIle: 2.079 ± 0.576
ProLys: 2.277 ± 0.49
ProLeu: 4.356 ± 0.606
ProMet: 0.495 ± 0.224
ProAsn: 0.891 ± 0.291
ProPro: 1.683 ± 0.387
ProGln: 1.683 ± 0.362
ProArg: 2.376 ± 0.635
ProSer: 2.772 ± 0.577
ProThr: 1.584 ± 0.336
ProVal: 5.246 ± 0.863
ProTrp: 0.594 ± 0.261
ProTyr: 0.792 ± 0.256
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.554 ± 1.096
GlnCys: 0.297 ± 0.152
GlnAsp: 2.277 ± 0.664
GlnGlu: 2.574 ± 0.548
GlnPhe: 1.386 ± 0.388
GlnGly: 1.584 ± 0.354
GlnHis: 0.792 ± 0.275
GlnIle: 2.376 ± 0.487
GlnLys: 2.376 ± 0.476
GlnLeu: 4.158 ± 0.593
GlnMet: 1.188 ± 0.293
GlnAsn: 0.99 ± 0.281
GlnPro: 1.782 ± 0.512
GlnGln: 2.475 ± 0.567
GlnArg: 3.96 ± 0.738
GlnSer: 2.079 ± 0.535
GlnThr: 1.98 ± 0.382
GlnVal: 1.98 ± 0.382
GlnTrp: 0.495 ± 0.216
GlnTyr: 0.396 ± 0.234
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.444 ± 0.589
ArgCys: 0.891 ± 0.279
ArgAsp: 3.069 ± 0.486
ArgGlu: 4.752 ± 0.626
ArgPhe: 2.079 ± 0.46
ArgGly: 3.267 ± 0.752
ArgHis: 1.584 ± 0.353
ArgIle: 3.267 ± 0.577
ArgLys: 3.762 ± 0.495
ArgLeu: 6.533 ± 0.992
ArgMet: 1.287 ± 0.391
ArgAsn: 2.97 ± 0.62
ArgPro: 2.871 ± 0.542
ArgGln: 3.564 ± 0.63
ArgArg: 5.049 ± 0.607
ArgSer: 2.772 ± 0.478
ArgThr: 3.267 ± 0.504
ArgVal: 5.246 ± 0.938
ArgTrp: 0.891 ± 0.28
ArgTyr: 3.267 ± 0.656
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.533 ± 0.883
SerCys: 0.594 ± 0.2
SerAsp: 4.356 ± 0.721
SerGlu: 4.059 ± 0.484
SerPhe: 2.178 ± 0.542
SerGly: 5.049 ± 0.874
SerHis: 1.188 ± 0.512
SerIle: 2.772 ± 0.508
SerLys: 2.871 ± 0.507
SerLeu: 6.236 ± 0.922
SerMet: 1.881 ± 0.474
SerAsn: 2.178 ± 0.45
SerPro: 2.574 ± 0.733
SerGln: 2.178 ± 0.36
SerArg: 4.158 ± 0.565
SerSer: 2.871 ± 0.566
SerThr: 3.663 ± 0.569
SerVal: 4.752 ± 0.964
SerTrp: 0.495 ± 0.166
SerTyr: 1.485 ± 0.469
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.523 ± 1.244
ThrCys: 0.891 ± 0.299
ThrAsp: 3.762 ± 0.528
ThrGlu: 3.069 ± 0.704
ThrPhe: 2.079 ± 0.419
ThrGly: 7.325 ± 0.981
ThrHis: 0.99 ± 0.294
ThrIle: 3.267 ± 0.58
ThrLys: 2.376 ± 0.518
ThrLeu: 6.533 ± 1.005
ThrMet: 2.079 ± 0.372
ThrAsn: 1.584 ± 0.355
ThrPro: 4.158 ± 0.549
ThrGln: 1.98 ± 0.458
ThrArg: 3.663 ± 0.595
ThrSer: 3.663 ± 0.381
ThrThr: 3.762 ± 1.053
ThrVal: 4.653 ± 0.743
ThrTrp: 0.594 ± 0.261
ThrTyr: 1.089 ± 0.32
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.632 ± 0.96
ValCys: 0.99 ± 0.365
ValAsp: 3.762 ± 0.578
ValGlu: 5.049 ± 0.88
ValPhe: 2.475 ± 0.476
ValGly: 4.059 ± 0.662
ValHis: 0.594 ± 0.239
ValIle: 4.257 ± 0.621
ValLys: 4.455 ± 0.589
ValLeu: 5.543 ± 0.822
ValMet: 2.079 ± 0.426
ValAsn: 2.673 ± 0.484
ValPro: 2.772 ± 0.41
ValGln: 2.376 ± 0.443
ValArg: 3.069 ± 0.624
ValSer: 4.257 ± 0.523
ValThr: 5.444 ± 1.065
ValVal: 4.158 ± 0.643
ValTrp: 0.495 ± 0.196
ValTyr: 1.386 ± 0.439
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.287 ± 0.267
TrpCys: 0.0 ± 0.0
TrpAsp: 0.891 ± 0.245
TrpGlu: 0.99 ± 0.251
TrpPhe: 0.396 ± 0.162
TrpGly: 0.396 ± 0.232
TrpHis: 0.396 ± 0.218
TrpIle: 0.594 ± 0.203
TrpLys: 0.495 ± 0.192
TrpLeu: 1.584 ± 0.427
TrpMet: 0.693 ± 0.289
TrpAsn: 0.99 ± 0.38
TrpPro: 0.99 ± 0.292
TrpGln: 0.594 ± 0.194
TrpArg: 1.287 ± 0.423
TrpSer: 0.792 ± 0.268
TrpThr: 0.495 ± 0.262
TrpVal: 0.495 ± 0.18
TrpTrp: 0.495 ± 0.197
TrpTyr: 0.594 ± 0.23
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.069 ± 0.55
TyrCys: 0.198 ± 0.142
TyrAsp: 1.287 ± 0.47
TyrGlu: 2.475 ± 0.496
TyrPhe: 1.485 ± 0.391
TyrGly: 1.98 ± 0.47
TyrHis: 0.693 ± 0.218
TyrIle: 2.871 ± 0.536
TyrLys: 0.396 ± 0.322
TyrLeu: 2.178 ± 0.63
TyrMet: 0.891 ± 0.256
TyrAsn: 0.792 ± 0.24
TyrPro: 1.386 ± 0.379
TyrGln: 1.782 ± 0.503
TyrArg: 2.079 ± 0.455
TyrSer: 1.287 ± 0.311
TyrThr: 1.98 ± 0.547
TyrVal: 1.386 ± 0.415
TyrTrp: 0.792 ± 0.263
TyrTyr: 0.99 ± 0.257
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 40 proteins (10103 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
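A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the procedure actually used by Proteome-pI: it assumes that whole proteins are resampled with replacement 100 times, that the dipeptide frequency is recomputed for each resample, and that the reported error is the standard deviation of those estimates. The function names, the fixed seed, and the one-letter toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread (population std. dev.) of the per mille frequency of `pair`
    over resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Resample proteins, not individual residues or dipeptides.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy proteins (made up for illustration, not phage data).
toy = ["AAGL", "GLA", "LLLL"]
err = bootstrap_error(toy, "GL")
```

Resampling whole proteins keeps the dipeptides of one protein together, so the error reflects variation between proteins rather than between individual positions.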

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski