Amino acid dipepetide frequency for Bacillus phage AP631

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.595AlaAla: 4.595 ± 1.013
0.347AlaCys: 0.347 ± 0.167
3.381AlaAsp: 3.381 ± 0.545
5.809AlaGlu: 5.809 ± 0.896
2.948AlaPhe: 2.948 ± 0.442
4.335AlaGly: 4.335 ± 0.833
0.867AlaHis: 0.867 ± 0.275
5.029AlaIle: 5.029 ± 0.603
4.508AlaLys: 4.508 ± 0.647
5.029AlaLeu: 5.029 ± 0.703
1.387AlaMet: 1.387 ± 0.448
3.295AlaAsn: 3.295 ± 0.605
1.127AlaPro: 1.127 ± 0.299
2.254AlaGln: 2.254 ± 0.415
2.688AlaArg: 2.688 ± 0.615
3.815AlaSer: 3.815 ± 0.766
3.641AlaThr: 3.641 ± 0.753
3.988AlaVal: 3.988 ± 0.8
0.867AlaTrp: 0.867 ± 0.251
1.907AlaTyr: 1.907 ± 0.407
0.0AlaXaa: 0.0 ± 0.0
Cys
0.173CysAla: 0.173 ± 0.134
0.173CysCys: 0.173 ± 0.119
0.694CysAsp: 0.694 ± 0.282
0.694CysGlu: 0.694 ± 0.36
0.26CysPhe: 0.26 ± 0.199
0.607CysGly: 0.607 ± 0.3
0.26CysHis: 0.26 ± 0.13
0.607CysIle: 0.607 ± 0.219
0.867CysLys: 0.867 ± 0.378
0.78CysLeu: 0.78 ± 0.376
0.173CysMet: 0.173 ± 0.122
0.347CysAsn: 0.347 ± 0.166
0.694CysPro: 0.694 ± 0.306
0.347CysGln: 0.347 ± 0.156
0.694CysArg: 0.694 ± 0.343
0.78CysSer: 0.78 ± 0.332
0.78CysThr: 0.78 ± 0.465
0.78CysVal: 0.78 ± 0.314
0.0CysTrp: 0.0 ± 0.0
0.78CysTyr: 0.78 ± 0.322
0.0CysXaa: 0.0 ± 0.0
Asp
3.381AspAla: 3.381 ± 0.59
0.607AspCys: 0.607 ± 0.315
3.208AspAsp: 3.208 ± 0.491
5.202AspGlu: 5.202 ± 0.723
2.341AspPhe: 2.341 ± 0.449
4.075AspGly: 4.075 ± 0.545
0.607AspHis: 0.607 ± 0.196
4.075AspIle: 4.075 ± 0.475
6.329AspLys: 6.329 ± 0.751
4.942AspLeu: 4.942 ± 0.751
1.734AspMet: 1.734 ± 0.345
2.514AspAsn: 2.514 ± 0.605
1.474AspPro: 1.474 ± 0.411
2.081AspGln: 2.081 ± 0.455
2.688AspArg: 2.688 ± 0.452
1.907AspSer: 1.907 ± 0.359
2.861AspThr: 2.861 ± 0.522
3.728AspVal: 3.728 ± 0.487
0.607AspTrp: 0.607 ± 0.204
1.994AspTyr: 1.994 ± 0.455
0.0AspXaa: 0.0 ± 0.0
Glu
4.942GluAla: 4.942 ± 0.658
1.04GluCys: 1.04 ± 0.374
4.508GluAsp: 4.508 ± 0.604
7.976GluGlu: 7.976 ± 1.055
3.208GluPhe: 3.208 ± 0.614
4.248GluGly: 4.248 ± 0.594
1.214GluHis: 1.214 ± 0.342
8.41GluIle: 8.41 ± 0.712
8.757GluLys: 8.757 ± 0.76
8.843GluLeu: 8.843 ± 0.869
3.468GluMet: 3.468 ± 0.64
4.769GluAsn: 4.769 ± 0.664
1.214GluPro: 1.214 ± 0.412
4.335GluGln: 4.335 ± 0.709
4.248GluArg: 4.248 ± 0.767
4.508GluSer: 4.508 ± 0.818
4.335GluThr: 4.335 ± 0.559
4.855GluVal: 4.855 ± 0.749
0.867GluTrp: 0.867 ± 0.251
2.601GluTyr: 2.601 ± 0.44
0.0GluXaa: 0.0 ± 0.0
Phe
2.168PheAla: 2.168 ± 0.469
0.52PheCys: 0.52 ± 0.222
2.688PheAsp: 2.688 ± 0.586
2.948PheGlu: 2.948 ± 0.548
1.214PhePhe: 1.214 ± 0.357
2.341PheGly: 2.341 ± 0.415
0.607PheHis: 0.607 ± 0.224
2.514PheIle: 2.514 ± 0.539
3.902PheLys: 3.902 ± 0.382
4.075PheLeu: 4.075 ± 0.603
1.301PheMet: 1.301 ± 0.311
2.254PheAsn: 2.254 ± 0.413
0.867PhePro: 0.867 ± 0.31
1.214PheGln: 1.214 ± 0.339
1.994PheArg: 1.994 ± 0.443
1.994PheSer: 1.994 ± 0.432
1.907PheThr: 1.907 ± 0.517
2.428PheVal: 2.428 ± 0.555
0.607PheTrp: 0.607 ± 0.272
1.734PheTyr: 1.734 ± 0.44
0.0PheXaa: 0.0 ± 0.0
Gly
3.295GlyAla: 3.295 ± 0.63
0.434GlyCys: 0.434 ± 0.247
3.121GlyAsp: 3.121 ± 0.511
5.375GlyGlu: 5.375 ± 0.92
3.641GlyPhe: 3.641 ± 0.433
3.381GlyGly: 3.381 ± 0.618
1.04GlyHis: 1.04 ± 0.237
4.248GlyIle: 4.248 ± 0.653
5.722GlyLys: 5.722 ± 0.572
5.202GlyLeu: 5.202 ± 0.833
2.341GlyMet: 2.341 ± 0.582
2.341GlyAsn: 2.341 ± 0.53
0.607GlyPro: 0.607 ± 0.197
2.081GlyGln: 2.081 ± 0.32
2.428GlyArg: 2.428 ± 0.446
3.381GlySer: 3.381 ± 0.449
2.341GlyThr: 2.341 ± 0.369
3.988GlyVal: 3.988 ± 0.839
0.954GlyTrp: 0.954 ± 0.339
2.254GlyTyr: 2.254 ± 0.493
0.0GlyXaa: 0.0 ± 0.0
His
1.127HisAla: 1.127 ± 0.261
0.347HisCys: 0.347 ± 0.193
0.52HisAsp: 0.52 ± 0.218
0.954HisGlu: 0.954 ± 0.251
0.954HisPhe: 0.954 ± 0.265
0.78HisGly: 0.78 ± 0.212
0.52HisHis: 0.52 ± 0.305
0.78HisIle: 0.78 ± 0.251
1.04HisLys: 1.04 ± 0.277
1.821HisLeu: 1.821 ± 0.382
0.434HisMet: 0.434 ± 0.164
0.867HisAsn: 0.867 ± 0.26
0.347HisPro: 0.347 ± 0.168
0.52HisGln: 0.52 ± 0.216
1.04HisArg: 1.04 ± 0.254
0.607HisSer: 0.607 ± 0.229
1.04HisThr: 1.04 ± 0.339
0.434HisVal: 0.434 ± 0.19
0.26HisTrp: 0.26 ± 0.146
1.127HisTyr: 1.127 ± 0.362
0.0HisXaa: 0.0 ± 0.0
Ile
4.248IleAla: 4.248 ± 0.491
0.607IleCys: 0.607 ± 0.186
5.029IleAsp: 5.029 ± 0.72
6.503IleGlu: 6.503 ± 0.717
1.734IlePhe: 1.734 ± 0.444
3.902IleGly: 3.902 ± 0.591
1.127IleHis: 1.127 ± 0.303
3.208IleIle: 3.208 ± 0.591
7.196IleLys: 7.196 ± 0.706
5.722IleLeu: 5.722 ± 0.73
1.647IleMet: 1.647 ± 0.453
3.988IleAsn: 3.988 ± 0.572
2.861IlePro: 2.861 ± 0.537
3.035IleGln: 3.035 ± 0.624
4.248IleArg: 4.248 ± 0.555
4.855IleSer: 4.855 ± 0.687
3.208IleThr: 3.208 ± 0.46
3.902IleVal: 3.902 ± 0.598
0.694IleTrp: 0.694 ± 0.235
1.821IleTyr: 1.821 ± 0.397
0.0IleXaa: 0.0 ± 0.0
Lys
5.636LysAla: 5.636 ± 0.976
1.647LysCys: 1.647 ± 0.838
3.902LysAsp: 3.902 ± 0.56
9.71LysGlu: 9.71 ± 1.063
3.208LysPhe: 3.208 ± 0.556
5.115LysGly: 5.115 ± 0.677
1.821LysHis: 1.821 ± 0.446
7.37LysIle: 7.37 ± 0.771
10.231LysLys: 10.231 ± 1.235
8.41LysLeu: 8.41 ± 1.099
3.208LysMet: 3.208 ± 0.46
5.636LysAsn: 5.636 ± 0.672
2.774LysPro: 2.774 ± 0.599
3.208LysGln: 3.208 ± 0.539
4.682LysArg: 4.682 ± 0.756
5.202LysSer: 5.202 ± 0.474
5.202LysThr: 5.202 ± 0.704
6.416LysVal: 6.416 ± 0.556
0.954LysTrp: 0.954 ± 0.275
3.555LysTyr: 3.555 ± 0.559
0.0LysXaa: 0.0 ± 0.0
Leu
5.115LeuAla: 5.115 ± 0.884
0.694LeuCys: 0.694 ± 0.282
4.595LeuAsp: 4.595 ± 0.544
7.109LeuGlu: 7.109 ± 0.839
3.121LeuPhe: 3.121 ± 0.546
4.855LeuGly: 4.855 ± 0.559
1.127LeuHis: 1.127 ± 0.262
4.335LeuIle: 4.335 ± 0.662
9.537LeuLys: 9.537 ± 0.691
6.416LeuLeu: 6.416 ± 0.882
1.647LeuMet: 1.647 ± 0.36
5.375LeuAsn: 5.375 ± 0.787
1.994LeuPro: 1.994 ± 0.445
3.641LeuGln: 3.641 ± 0.66
3.641LeuArg: 3.641 ± 0.748
5.636LeuSer: 5.636 ± 0.732
5.722LeuThr: 5.722 ± 0.601
4.595LeuVal: 4.595 ± 0.693
0.954LeuTrp: 0.954 ± 0.348
3.035LeuTyr: 3.035 ± 0.601
0.0LeuXaa: 0.0 ± 0.0
Met
1.127MetAla: 1.127 ± 0.321
0.0MetCys: 0.0 ± 0.0
1.474MetAsp: 1.474 ± 0.414
2.514MetGlu: 2.514 ± 0.43
0.694MetPhe: 0.694 ± 0.363
1.561MetGly: 1.561 ± 0.38
0.52MetHis: 0.52 ± 0.245
2.081MetIle: 2.081 ± 0.315
3.728MetLys: 3.728 ± 0.63
1.561MetLeu: 1.561 ± 0.357
0.434MetMet: 0.434 ± 0.206
1.301MetAsn: 1.301 ± 0.354
1.04MetPro: 1.04 ± 0.236
0.867MetGln: 0.867 ± 0.244
1.647MetArg: 1.647 ± 0.297
2.081MetSer: 2.081 ± 0.402
2.168MetThr: 2.168 ± 0.363
0.867MetVal: 0.867 ± 0.291
0.694MetTrp: 0.694 ± 0.227
0.78MetTyr: 0.78 ± 0.322
0.0MetXaa: 0.0 ± 0.0
Asn
3.641AsnAla: 3.641 ± 0.601
0.26AsnCys: 0.26 ± 0.162
2.428AsnAsp: 2.428 ± 0.426
5.462AsnGlu: 5.462 ± 0.764
1.734AsnPhe: 1.734 ± 0.398
3.815AsnGly: 3.815 ± 0.478
1.04AsnHis: 1.04 ± 0.318
3.468AsnIle: 3.468 ± 0.634
4.508AsnLys: 4.508 ± 0.65
3.208AsnLeu: 3.208 ± 0.479
1.821AsnMet: 1.821 ± 0.312
2.428AsnAsn: 2.428 ± 0.434
1.647AsnPro: 1.647 ± 0.414
2.688AsnGln: 2.688 ± 0.479
3.641AsnArg: 3.641 ± 0.542
2.948AsnSer: 2.948 ± 0.461
2.341AsnThr: 2.341 ± 0.409
3.902AsnVal: 3.902 ± 0.692
0.694AsnTrp: 0.694 ± 0.311
1.474AsnTyr: 1.474 ± 0.372
0.0AsnXaa: 0.0 ± 0.0
Pro
1.647ProAla: 1.647 ± 0.398
0.434ProCys: 0.434 ± 0.263
1.387ProAsp: 1.387 ± 0.393
2.514ProGlu: 2.514 ± 0.545
0.78ProPhe: 0.78 ± 0.225
1.04ProGly: 1.04 ± 0.237
0.52ProHis: 0.52 ± 0.231
1.994ProIle: 1.994 ± 0.475
1.561ProLys: 1.561 ± 0.399
1.821ProLeu: 1.821 ± 0.475
0.52ProMet: 0.52 ± 0.227
1.214ProAsn: 1.214 ± 0.299
0.607ProPro: 0.607 ± 0.241
0.954ProGln: 0.954 ± 0.294
0.954ProArg: 0.954 ± 0.422
2.428ProSer: 2.428 ± 0.39
1.821ProThr: 1.821 ± 0.476
1.994ProVal: 1.994 ± 0.393
0.087ProTrp: 0.087 ± 0.077
1.301ProTyr: 1.301 ± 0.364
0.0ProXaa: 0.0 ± 0.0
Gln
3.468GlnAla: 3.468 ± 0.697
0.347GlnCys: 0.347 ± 0.21
1.821GlnAsp: 1.821 ± 0.397
2.861GlnGlu: 2.861 ± 0.489
1.734GlnPhe: 1.734 ± 0.477
1.907GlnGly: 1.907 ± 0.416
0.867GlnHis: 0.867 ± 0.29
1.821GlnIle: 1.821 ± 0.326
4.248GlnLys: 4.248 ± 0.611
2.774GlnLeu: 2.774 ± 0.495
1.127GlnMet: 1.127 ± 0.331
2.254GlnAsn: 2.254 ± 0.475
1.561GlnPro: 1.561 ± 0.346
2.601GlnGln: 2.601 ± 0.586
2.168GlnArg: 2.168 ± 0.444
3.035GlnSer: 3.035 ± 0.594
1.821GlnThr: 1.821 ± 0.413
2.341GlnVal: 2.341 ± 0.409
0.173GlnTrp: 0.173 ± 0.111
1.734GlnTyr: 1.734 ± 0.434
0.0GlnXaa: 0.0 ± 0.0
Arg
2.861ArgAla: 2.861 ± 0.512
0.607ArgCys: 0.607 ± 0.325
3.381ArgAsp: 3.381 ± 0.471
4.942ArgGlu: 4.942 ± 0.622
1.994ArgPhe: 1.994 ± 0.421
2.861ArgGly: 2.861 ± 0.564
0.434ArgHis: 0.434 ± 0.195
3.902ArgIle: 3.902 ± 0.878
4.162ArgLys: 4.162 ± 0.748
4.595ArgLeu: 4.595 ± 0.595
0.954ArgMet: 0.954 ± 0.238
2.774ArgAsn: 2.774 ± 0.646
0.954ArgPro: 0.954 ± 0.298
2.428ArgGln: 2.428 ± 0.588
1.474ArgArg: 1.474 ± 0.366
2.514ArgSer: 2.514 ± 0.776
2.254ArgThr: 2.254 ± 0.426
2.774ArgVal: 2.774 ± 0.547
0.607ArgTrp: 0.607 ± 0.201
1.994ArgTyr: 1.994 ± 0.411
0.0ArgXaa: 0.0 ± 0.0
Ser
3.381SerAla: 3.381 ± 0.823
0.607SerCys: 0.607 ± 0.282
4.248SerAsp: 4.248 ± 0.525
3.815SerGlu: 3.815 ± 0.601
2.254SerPhe: 2.254 ± 0.491
3.815SerGly: 3.815 ± 0.479
1.127SerHis: 1.127 ± 0.326
3.728SerIle: 3.728 ± 0.637
6.936SerLys: 6.936 ± 0.859
5.115SerLeu: 5.115 ± 0.591
1.907SerMet: 1.907 ± 0.536
3.381SerAsn: 3.381 ± 0.554
1.474SerPro: 1.474 ± 0.314
2.168SerGln: 2.168 ± 0.411
2.688SerArg: 2.688 ± 0.462
3.468SerSer: 3.468 ± 0.688
3.555SerThr: 3.555 ± 0.476
4.162SerVal: 4.162 ± 0.56
0.694SerTrp: 0.694 ± 0.265
1.734SerTyr: 1.734 ± 0.408
0.0SerXaa: 0.0 ± 0.0
Thr
3.815ThrAla: 3.815 ± 0.672
0.173ThrCys: 0.173 ± 0.125
2.601ThrAsp: 2.601 ± 0.481
4.682ThrGlu: 4.682 ± 0.767
2.861ThrPhe: 2.861 ± 0.59
3.902ThrGly: 3.902 ± 0.619
0.434ThrHis: 0.434 ± 0.186
4.422ThrIle: 4.422 ± 0.7
4.942ThrLys: 4.942 ± 0.716
4.422ThrLeu: 4.422 ± 0.655
1.301ThrMet: 1.301 ± 0.321
2.601ThrAsn: 2.601 ± 0.481
1.994ThrPro: 1.994 ± 0.569
2.168ThrGln: 2.168 ± 0.32
2.168ThrArg: 2.168 ± 0.517
3.468ThrSer: 3.468 ± 0.563
3.815ThrThr: 3.815 ± 0.688
3.815ThrVal: 3.815 ± 0.566
0.087ThrTrp: 0.087 ± 0.099
1.734ThrTyr: 1.734 ± 0.353
0.0ThrXaa: 0.0 ± 0.0
Val
4.335ValAla: 4.335 ± 0.675
0.434ValCys: 0.434 ± 0.226
4.508ValAsp: 4.508 ± 0.508
5.549ValGlu: 5.549 ± 0.671
1.994ValPhe: 1.994 ± 0.478
2.948ValGly: 2.948 ± 0.529
0.607ValHis: 0.607 ± 0.236
4.595ValIle: 4.595 ± 0.583
5.375ValLys: 5.375 ± 0.699
4.422ValLeu: 4.422 ± 0.811
0.867ValMet: 0.867 ± 0.263
3.208ValAsn: 3.208 ± 0.595
1.474ValPro: 1.474 ± 0.34
2.601ValGln: 2.601 ± 0.649
3.035ValArg: 3.035 ± 0.484
4.855ValSer: 4.855 ± 0.574
3.641ValThr: 3.641 ± 0.638
3.988ValVal: 3.988 ± 0.487
0.694ValTrp: 0.694 ± 0.32
2.688ValTyr: 2.688 ± 0.513
0.0ValXaa: 0.0 ± 0.0
Trp
0.434TrpAla: 0.434 ± 0.286
0.26TrpCys: 0.26 ± 0.145
0.867TrpAsp: 0.867 ± 0.237
0.694TrpGlu: 0.694 ± 0.252
0.52TrpPhe: 0.52 ± 0.229
0.52TrpGly: 0.52 ± 0.194
0.173TrpHis: 0.173 ± 0.134
0.867TrpIle: 0.867 ± 0.35
0.607TrpLys: 0.607 ± 0.234
1.214TrpLeu: 1.214 ± 0.353
0.0TrpMet: 0.0 ± 0.0
0.694TrpAsn: 0.694 ± 0.232
0.173TrpPro: 0.173 ± 0.118
0.434TrpGln: 0.434 ± 0.171
0.52TrpArg: 0.52 ± 0.213
1.301TrpSer: 1.301 ± 0.408
0.78TrpThr: 0.78 ± 0.25
0.52TrpVal: 0.52 ± 0.216
0.26TrpTrp: 0.26 ± 0.144
0.52TrpTyr: 0.52 ± 0.22
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.428TyrAla: 2.428 ± 0.483
0.954TyrCys: 0.954 ± 0.291
1.994TyrAsp: 1.994 ± 0.424
3.035TyrGlu: 3.035 ± 0.482
2.081TyrPhe: 2.081 ± 0.423
2.168TyrGly: 2.168 ± 0.454
0.607TyrHis: 0.607 ± 0.187
2.081TyrIle: 2.081 ± 0.375
3.728TyrLys: 3.728 ± 0.609
2.774TyrLeu: 2.774 ± 0.594
0.78TyrMet: 0.78 ± 0.234
1.994TyrAsn: 1.994 ± 0.415
0.694TyrPro: 0.694 ± 0.26
1.04TyrGln: 1.04 ± 0.348
1.821TyrArg: 1.821 ± 0.439
1.474TyrSer: 1.474 ± 0.243
2.168TyrThr: 2.168 ± 0.33
2.341TyrVal: 2.341 ± 0.401
0.607TyrTrp: 0.607 ± 0.205
0.867TyrTyr: 0.867 ± 0.294
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (11535 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski