Amino acid dipepetide frequency for Microbacterium phage Ciel

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
22.472AlaAla: 22.472 ± 2.373
0.184AlaCys: 0.184 ± 0.16
8.289AlaAsp: 8.289 ± 1.371
8.289AlaGlu: 8.289 ± 1.247
5.157AlaPhe: 5.157 ± 0.635
17.13AlaGly: 17.13 ± 1.81
1.658AlaHis: 1.658 ± 0.66
5.71AlaIle: 5.71 ± 0.806
4.052AlaLys: 4.052 ± 0.862
12.71AlaLeu: 12.71 ± 1.575
3.5AlaMet: 3.5 ± 1.008
2.21AlaAsn: 2.21 ± 0.822
8.841AlaPro: 8.841 ± 1.322
4.789AlaGln: 4.789 ± 1.028
7.92AlaArg: 7.92 ± 0.973
5.71AlaSer: 5.71 ± 1.096
6.447AlaThr: 6.447 ± 1.188
8.473AlaVal: 8.473 ± 1.532
2.026AlaTrp: 2.026 ± 0.497
2.763AlaTyr: 2.763 ± 0.771
0.0AlaXaa: 0.0 ± 0.0
Cys
0.184CysAla: 0.184 ± 0.185
0.0CysCys: 0.0 ± 0.0
0.184CysAsp: 0.184 ± 0.151
0.368CysGlu: 0.368 ± 0.262
0.184CysPhe: 0.184 ± 0.151
0.184CysGly: 0.184 ± 0.187
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.184CysLys: 0.184 ± 0.183
0.368CysLeu: 0.368 ± 0.275
0.0CysMet: 0.0 ± 0.0
0.368CysAsn: 0.368 ± 0.234
0.0CysPro: 0.0 ± 0.0
0.184CysGln: 0.184 ± 0.187
0.368CysArg: 0.368 ± 0.265
0.0CysSer: 0.0 ± 0.0
0.553CysThr: 0.553 ± 0.453
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.368CysTyr: 0.368 ± 0.223
0.0CysXaa: 0.0 ± 0.0
Asp
9.947AspAla: 9.947 ± 1.306
0.184AspCys: 0.184 ± 0.178
3.131AspAsp: 3.131 ± 0.893
3.5AspGlu: 3.5 ± 1.159
0.737AspPhe: 0.737 ± 0.469
5.342AspGly: 5.342 ± 1.118
1.105AspHis: 1.105 ± 0.431
0.368AspIle: 0.368 ± 0.223
0.184AspLys: 0.184 ± 0.198
7.92AspLeu: 7.92 ± 1.582
0.553AspMet: 0.553 ± 0.365
1.105AspAsn: 1.105 ± 0.397
2.947AspPro: 2.947 ± 0.694
1.474AspGln: 1.474 ± 0.613
3.5AspArg: 3.5 ± 0.989
2.395AspSer: 2.395 ± 0.459
2.947AspThr: 2.947 ± 0.691
4.973AspVal: 4.973 ± 1.185
1.105AspTrp: 1.105 ± 0.44
1.474AspTyr: 1.474 ± 0.485
0.0AspXaa: 0.0 ± 0.0
Glu
6.815GluAla: 6.815 ± 1.244
0.184GluCys: 0.184 ± 0.187
2.947GluAsp: 2.947 ± 0.669
0.737GluGlu: 0.737 ± 0.479
0.737GluPhe: 0.737 ± 0.353
3.868GluGly: 3.868 ± 0.702
0.368GluHis: 0.368 ± 0.231
3.868GluIle: 3.868 ± 0.675
0.368GluLys: 0.368 ± 0.219
8.473GluLeu: 8.473 ± 1.165
0.368GluMet: 0.368 ± 0.25
3.684GluAsn: 3.684 ± 1.036
1.658GluPro: 1.658 ± 0.597
3.316GluGln: 3.316 ± 0.832
4.973GluArg: 4.973 ± 1.458
3.316GluSer: 3.316 ± 0.644
3.5GluThr: 3.5 ± 0.694
2.395GluVal: 2.395 ± 0.498
1.105GluTrp: 1.105 ± 0.5
1.474GluTyr: 1.474 ± 0.411
0.0GluXaa: 0.0 ± 0.0
Phe
4.237PheAla: 4.237 ± 0.953
0.0PheCys: 0.0 ± 0.0
2.395PheAsp: 2.395 ± 0.671
1.289PheGlu: 1.289 ± 0.572
0.184PhePhe: 0.184 ± 0.16
4.052PheGly: 4.052 ± 0.836
0.184PheHis: 0.184 ± 0.185
1.105PheIle: 1.105 ± 0.327
0.553PheLys: 0.553 ± 0.254
2.763PheLeu: 2.763 ± 0.72
0.921PheMet: 0.921 ± 0.287
1.474PheAsn: 1.474 ± 0.571
0.737PhePro: 0.737 ± 0.294
1.105PheGln: 1.105 ± 0.53
1.658PheArg: 1.658 ± 0.543
0.921PheSer: 0.921 ± 0.443
3.5PheThr: 3.5 ± 0.852
2.395PheVal: 2.395 ± 0.58
0.184PheTrp: 0.184 ± 0.198
0.553PheTyr: 0.553 ± 0.366
0.0PheXaa: 0.0 ± 0.0
Gly
12.157GlyAla: 12.157 ± 2.325
0.184GlyCys: 0.184 ± 0.183
4.789GlyAsp: 4.789 ± 1.301
3.5GlyGlu: 3.5 ± 0.751
2.763GlyPhe: 2.763 ± 0.461
6.631GlyGly: 6.631 ± 1.074
1.105GlyHis: 1.105 ± 0.568
5.526GlyIle: 5.526 ± 0.839
2.395GlyLys: 2.395 ± 0.549
8.105GlyLeu: 8.105 ± 2.048
1.658GlyMet: 1.658 ± 0.416
2.395GlyAsn: 2.395 ± 0.68
2.579GlyPro: 2.579 ± 1.068
3.868GlyGln: 3.868 ± 0.855
6.447GlyArg: 6.447 ± 0.771
7.736GlySer: 7.736 ± 0.975
4.237GlyThr: 4.237 ± 0.776
9.947GlyVal: 9.947 ± 1.082
2.395GlyTrp: 2.395 ± 0.622
2.026GlyTyr: 2.026 ± 0.493
0.0GlyXaa: 0.0 ± 0.0
His
1.658HisAla: 1.658 ± 0.446
0.0HisCys: 0.0 ± 0.0
1.474HisAsp: 1.474 ± 0.558
0.0HisGlu: 0.0 ± 0.0
0.553HisPhe: 0.553 ± 0.365
1.658HisGly: 1.658 ± 0.606
0.0HisHis: 0.0 ± 0.0
0.184HisIle: 0.184 ± 0.151
0.368HisLys: 0.368 ± 0.29
1.105HisLeu: 1.105 ± 0.394
0.184HisMet: 0.184 ± 0.16
0.368HisAsn: 0.368 ± 0.309
0.737HisPro: 0.737 ± 0.267
0.184HisGln: 0.184 ± 0.151
1.289HisArg: 1.289 ± 0.538
0.737HisSer: 0.737 ± 0.269
1.105HisThr: 1.105 ± 0.677
2.21HisVal: 2.21 ± 0.506
0.368HisTrp: 0.368 ± 0.232
0.921HisTyr: 0.921 ± 0.322
0.0HisXaa: 0.0 ± 0.0
Ile
4.421IleAla: 4.421 ± 0.679
0.184IleCys: 0.184 ± 0.196
3.868IleAsp: 3.868 ± 0.697
5.71IleGlu: 5.71 ± 1.048
0.737IlePhe: 0.737 ± 0.286
3.684IleGly: 3.684 ± 0.768
1.474IleHis: 1.474 ± 0.431
0.921IleIle: 0.921 ± 0.458
0.553IleLys: 0.553 ± 0.284
3.5IleLeu: 3.5 ± 1.242
0.184IleMet: 0.184 ± 0.198
0.737IleAsn: 0.737 ± 0.339
2.947IlePro: 2.947 ± 1.018
1.105IleGln: 1.105 ± 0.393
3.684IleArg: 3.684 ± 0.757
4.052IleSer: 4.052 ± 0.811
2.947IleThr: 2.947 ± 0.597
3.684IleVal: 3.684 ± 0.754
0.368IleTrp: 0.368 ± 0.227
0.368IleTyr: 0.368 ± 0.213
0.0IleXaa: 0.0 ± 0.0
Lys
1.658LysAla: 1.658 ± 0.441
0.0LysCys: 0.0 ± 0.0
1.289LysAsp: 1.289 ± 0.574
0.553LysGlu: 0.553 ± 0.283
0.737LysPhe: 0.737 ± 0.505
1.105LysGly: 1.105 ± 0.47
0.368LysHis: 0.368 ± 0.257
1.289LysIle: 1.289 ± 0.503
0.368LysLys: 0.368 ± 0.253
1.658LysLeu: 1.658 ± 0.632
0.553LysMet: 0.553 ± 0.25
0.921LysAsn: 0.921 ± 0.446
0.553LysPro: 0.553 ± 0.417
0.184LysGln: 0.184 ± 0.157
2.21LysArg: 2.21 ± 0.706
2.21LysSer: 2.21 ± 0.547
2.026LysThr: 2.026 ± 0.55
0.553LysVal: 0.553 ± 0.437
0.368LysTrp: 0.368 ± 0.237
0.368LysTyr: 0.368 ± 0.232
0.0LysXaa: 0.0 ± 0.0
Leu
9.578LeuAla: 9.578 ± 1.078
0.553LeuCys: 0.553 ± 0.298
6.078LeuAsp: 6.078 ± 0.907
2.21LeuGlu: 2.21 ± 0.486
3.868LeuPhe: 3.868 ± 0.852
8.289LeuGly: 8.289 ± 1.079
1.658LeuHis: 1.658 ± 0.522
4.605LeuIle: 4.605 ± 1.218
1.105LeuLys: 1.105 ± 0.401
7.368LeuLeu: 7.368 ± 1.923
2.579LeuMet: 2.579 ± 0.654
3.684LeuAsn: 3.684 ± 0.96
8.105LeuPro: 8.105 ± 1.568
5.71LeuGln: 5.71 ± 1.034
6.631LeuArg: 6.631 ± 1.025
5.71LeuSer: 5.71 ± 0.977
8.289LeuThr: 8.289 ± 1.277
6.263LeuVal: 6.263 ± 1.264
0.553LeuTrp: 0.553 ± 0.273
1.842LeuTyr: 1.842 ± 0.565
0.0LeuXaa: 0.0 ± 0.0
Met
2.947MetAla: 2.947 ± 0.643
0.0MetCys: 0.0 ± 0.0
1.289MetAsp: 1.289 ± 0.404
0.737MetGlu: 0.737 ± 0.326
0.553MetPhe: 0.553 ± 0.267
2.21MetGly: 2.21 ± 0.573
0.368MetHis: 0.368 ± 0.24
1.474MetIle: 1.474 ± 0.748
0.184MetLys: 0.184 ± 0.151
0.553MetLeu: 0.553 ± 0.303
0.0MetMet: 0.0 ± 0.0
1.105MetAsn: 1.105 ± 0.352
1.105MetPro: 1.105 ± 0.44
0.368MetGln: 0.368 ± 0.294
1.658MetArg: 1.658 ± 0.823
2.395MetSer: 2.395 ± 0.871
1.658MetThr: 1.658 ± 0.577
1.474MetVal: 1.474 ± 0.412
0.184MetTrp: 0.184 ± 0.151
0.184MetTyr: 0.184 ± 0.178
0.0MetXaa: 0.0 ± 0.0
Asn
7.368AsnAla: 7.368 ± 0.962
0.184AsnCys: 0.184 ± 0.151
0.184AsnAsp: 0.184 ± 0.183
1.105AsnGlu: 1.105 ± 0.298
0.368AsnPhe: 0.368 ± 0.214
3.868AsnGly: 3.868 ± 0.566
0.553AsnHis: 0.553 ± 0.273
0.921AsnIle: 0.921 ± 0.454
0.737AsnLys: 0.737 ± 0.314
2.763AsnLeu: 2.763 ± 0.696
0.737AsnMet: 0.737 ± 0.43
1.105AsnAsn: 1.105 ± 0.429
1.289AsnPro: 1.289 ± 0.406
0.0AsnGln: 0.0 ± 0.0
2.395AsnArg: 2.395 ± 0.601
2.21AsnSer: 2.21 ± 0.65
1.289AsnThr: 1.289 ± 0.3
2.395AsnVal: 2.395 ± 0.673
0.0AsnTrp: 0.0 ± 0.0
0.553AsnTyr: 0.553 ± 0.308
0.0AsnXaa: 0.0 ± 0.0
Pro
9.394ProAla: 9.394 ± 1.41
0.368ProCys: 0.368 ± 0.301
2.763ProAsp: 2.763 ± 0.731
2.947ProGlu: 2.947 ± 0.824
1.105ProPhe: 1.105 ± 0.407
3.868ProGly: 3.868 ± 0.842
0.737ProHis: 0.737 ± 0.511
2.21ProIle: 2.21 ± 0.362
2.026ProLys: 2.026 ± 0.654
4.237ProLeu: 4.237 ± 0.939
1.105ProMet: 1.105 ± 0.456
1.474ProAsn: 1.474 ± 0.507
1.658ProPro: 1.658 ± 0.754
1.842ProGln: 1.842 ± 0.505
2.947ProArg: 2.947 ± 0.813
3.5ProSer: 3.5 ± 0.427
3.868ProThr: 3.868 ± 0.73
3.684ProVal: 3.684 ± 0.761
0.368ProTrp: 0.368 ± 0.221
1.474ProTyr: 1.474 ± 0.48
0.0ProXaa: 0.0 ± 0.0
Gln
4.237GlnAla: 4.237 ± 1.043
0.0GlnCys: 0.0 ± 0.0
1.474GlnAsp: 1.474 ± 0.424
1.289GlnGlu: 1.289 ± 0.411
1.842GlnPhe: 1.842 ± 0.53
2.763GlnGly: 2.763 ± 0.653
1.289GlnHis: 1.289 ± 0.407
1.658GlnIle: 1.658 ± 0.58
0.553GlnLys: 0.553 ± 0.283
9.762GlnLeu: 9.762 ± 1.449
0.737GlnMet: 0.737 ± 0.469
1.474GlnAsn: 1.474 ± 0.511
2.947GlnPro: 2.947 ± 0.755
2.579GlnGln: 2.579 ± 0.628
3.5GlnArg: 3.5 ± 0.702
1.658GlnSer: 1.658 ± 0.434
1.105GlnThr: 1.105 ± 0.393
1.658GlnVal: 1.658 ± 0.411
0.184GlnTrp: 0.184 ± 0.147
0.737GlnTyr: 0.737 ± 0.376
0.0GlnXaa: 0.0 ± 0.0
Arg
8.473ArgAla: 8.473 ± 1.664
0.737ArgCys: 0.737 ± 0.344
5.342ArgAsp: 5.342 ± 0.867
6.631ArgGlu: 6.631 ± 1.559
2.579ArgPhe: 2.579 ± 0.637
4.605ArgGly: 4.605 ± 0.841
0.921ArgHis: 0.921 ± 0.463
3.316ArgIle: 3.316 ± 0.74
1.474ArgLys: 1.474 ± 0.541
6.999ArgLeu: 6.999 ± 1.085
1.105ArgMet: 1.105 ± 0.361
1.842ArgAsn: 1.842 ± 0.479
2.579ArgPro: 2.579 ± 0.612
4.237ArgGln: 4.237 ± 1.008
8.473ArgArg: 8.473 ± 1.784
2.21ArgSer: 2.21 ± 0.589
2.579ArgThr: 2.579 ± 0.585
6.078ArgVal: 6.078 ± 1.101
2.21ArgTrp: 2.21 ± 0.667
2.026ArgTyr: 2.026 ± 0.459
0.0ArgXaa: 0.0 ± 0.0
Ser
9.21SerAla: 9.21 ± 1.215
0.368SerCys: 0.368 ± 0.392
2.21SerAsp: 2.21 ± 0.518
3.868SerGlu: 3.868 ± 0.768
1.105SerPhe: 1.105 ± 0.389
4.421SerGly: 4.421 ± 0.681
1.105SerHis: 1.105 ± 0.411
3.5SerIle: 3.5 ± 0.802
1.289SerLys: 1.289 ± 0.511
4.052SerLeu: 4.052 ± 1.029
2.21SerMet: 2.21 ± 0.624
1.474SerAsn: 1.474 ± 0.622
2.21SerPro: 2.21 ± 0.598
2.763SerGln: 2.763 ± 0.727
3.684SerArg: 3.684 ± 0.801
4.605SerSer: 4.605 ± 0.901
4.052SerThr: 4.052 ± 0.762
4.605SerVal: 4.605 ± 0.824
1.105SerTrp: 1.105 ± 0.404
1.474SerTyr: 1.474 ± 0.554
0.0SerXaa: 0.0 ± 0.0
Thr
9.026ThrAla: 9.026 ± 1.36
0.0ThrCys: 0.0 ± 0.0
2.395ThrAsp: 2.395 ± 0.766
3.5ThrGlu: 3.5 ± 0.975
4.052ThrPhe: 4.052 ± 0.939
6.999ThrGly: 6.999 ± 1.508
0.737ThrHis: 0.737 ± 0.453
3.316ThrIle: 3.316 ± 1.079
0.737ThrLys: 0.737 ± 0.345
4.605ThrLeu: 4.605 ± 1.022
1.658ThrMet: 1.658 ± 0.646
1.658ThrAsn: 1.658 ± 0.519
4.605ThrPro: 4.605 ± 0.845
1.105ThrGln: 1.105 ± 0.473
4.052ThrArg: 4.052 ± 1.006
2.947ThrSer: 2.947 ± 0.815
6.078ThrThr: 6.078 ± 1.39
5.342ThrVal: 5.342 ± 1.076
1.105ThrTrp: 1.105 ± 0.422
1.105ThrTyr: 1.105 ± 0.539
0.0ThrXaa: 0.0 ± 0.0
Val
10.683ValAla: 10.683 ± 1.518
0.0ValCys: 0.0 ± 0.0
3.316ValAsp: 3.316 ± 0.851
5.526ValGlu: 5.526 ± 1.22
1.289ValPhe: 1.289 ± 0.604
5.894ValGly: 5.894 ± 0.958
0.553ValHis: 0.553 ± 0.28
3.5ValIle: 3.5 ± 0.918
1.289ValLys: 1.289 ± 0.494
4.789ValLeu: 4.789 ± 1.1
1.289ValMet: 1.289 ± 0.591
2.21ValAsn: 2.21 ± 0.921
3.868ValPro: 3.868 ± 0.72
4.605ValGln: 4.605 ± 1.257
6.263ValArg: 6.263 ± 1.239
5.526ValSer: 5.526 ± 0.978
6.815ValThr: 6.815 ± 1.286
6.078ValVal: 6.078 ± 1.384
0.553ValTrp: 0.553 ± 0.297
1.474ValTyr: 1.474 ± 0.411
0.0ValXaa: 0.0 ± 0.0
Trp
1.658TrpAla: 1.658 ± 0.529
0.0TrpCys: 0.0 ± 0.0
0.553TrpAsp: 0.553 ± 0.262
0.737TrpGlu: 0.737 ± 0.428
1.105TrpPhe: 1.105 ± 0.401
0.737TrpGly: 0.737 ± 0.224
0.921TrpHis: 0.921 ± 0.449
1.289TrpIle: 1.289 ± 0.427
0.0TrpLys: 0.0 ± 0.0
2.395TrpLeu: 2.395 ± 0.683
0.184TrpMet: 0.184 ± 0.178
0.368TrpAsn: 0.368 ± 0.232
0.368TrpPro: 0.368 ± 0.301
0.737TrpGln: 0.737 ± 0.294
0.553TrpArg: 0.553 ± 0.313
0.737TrpSer: 0.737 ± 0.362
1.474TrpThr: 1.474 ± 0.51
0.368TrpVal: 0.368 ± 0.233
0.0TrpTrp: 0.0 ± 0.0
0.368TrpTyr: 0.368 ± 0.223
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.395TyrAla: 2.395 ± 0.634
0.368TyrCys: 0.368 ± 0.262
0.921TyrAsp: 0.921 ± 0.398
1.658TyrGlu: 1.658 ± 0.409
0.553TyrPhe: 0.553 ± 0.288
3.131TyrGly: 3.131 ± 0.555
0.0TyrHis: 0.0 ± 0.0
0.368TyrIle: 0.368 ± 0.319
0.553TyrLys: 0.553 ± 0.27
0.553TyrLeu: 0.553 ± 0.342
0.737TyrMet: 0.737 ± 0.382
0.368TyrAsn: 0.368 ± 0.267
1.842TyrPro: 1.842 ± 0.515
1.289TyrGln: 1.289 ± 0.381
2.21TyrArg: 2.21 ± 0.871
0.737TyrSer: 0.737 ± 0.389
0.553TyrThr: 0.553 ± 0.313
2.947TyrVal: 2.947 ± 0.807
0.368TyrTrp: 0.368 ± 0.319
0.737TyrTyr: 0.737 ± 0.444
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (5430 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski