Amino acid dipepetide frequency for Cyanophage KBS-S-1A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.392AlaAla: 10.392 ± 1.631
1.123AlaCys: 1.123 ± 0.466
5.617AlaAsp: 5.617 ± 1.0
5.617AlaGlu: 5.617 ± 0.78
3.651AlaPhe: 3.651 ± 0.518
5.336AlaGly: 5.336 ± 1.001
1.404AlaHis: 1.404 ± 0.426
4.494AlaIle: 4.494 ± 0.75
5.336AlaLys: 5.336 ± 1.115
7.162AlaLeu: 7.162 ± 1.011
3.23AlaMet: 3.23 ± 0.754
5.196AlaAsn: 5.196 ± 1.125
3.089AlaPro: 3.089 ± 0.75
2.528AlaGln: 2.528 ± 0.544
5.477AlaArg: 5.477 ± 1.173
6.179AlaSer: 6.179 ± 1.161
3.932AlaThr: 3.932 ± 0.826
6.179AlaVal: 6.179 ± 1.134
0.702AlaTrp: 0.702 ± 0.279
3.089AlaTyr: 3.089 ± 0.691
0.0AlaXaa: 0.0 ± 0.0
Cys
0.983CysAla: 0.983 ± 0.449
0.0CysCys: 0.0 ± 0.0
0.702CysAsp: 0.702 ± 0.309
0.843CysGlu: 0.843 ± 0.454
0.281CysPhe: 0.281 ± 0.212
0.421CysGly: 0.421 ± 0.239
0.281CysHis: 0.281 ± 0.184
0.562CysIle: 0.562 ± 0.26
0.843CysLys: 0.843 ± 0.381
0.843CysLeu: 0.843 ± 0.467
0.281CysMet: 0.281 ± 0.272
0.562CysAsn: 0.562 ± 0.247
0.702CysPro: 0.702 ± 0.344
0.421CysGln: 0.421 ± 0.233
0.702CysArg: 0.702 ± 0.344
1.404CysSer: 1.404 ± 0.539
0.702CysThr: 0.702 ± 0.332
0.421CysVal: 0.421 ± 0.306
0.14CysTrp: 0.14 ± 0.16
0.14CysTyr: 0.14 ± 0.167
0.0CysXaa: 0.0 ± 0.0
Asp
5.758AspAla: 5.758 ± 1.103
0.562AspCys: 0.562 ± 0.276
4.353AspAsp: 4.353 ± 1.033
3.651AspGlu: 3.651 ± 0.925
2.106AspPhe: 2.106 ± 0.404
4.634AspGly: 4.634 ± 1.02
0.983AspHis: 0.983 ± 0.532
5.617AspIle: 5.617 ± 0.506
3.37AspLys: 3.37 ± 0.828
5.898AspLeu: 5.898 ± 0.956
1.123AspMet: 1.123 ± 0.625
3.932AspAsn: 3.932 ± 0.605
2.809AspPro: 2.809 ± 0.853
2.528AspGln: 2.528 ± 0.551
3.089AspArg: 3.089 ± 0.633
3.511AspSer: 3.511 ± 0.902
3.932AspThr: 3.932 ± 0.923
4.353AspVal: 4.353 ± 0.702
0.702AspTrp: 0.702 ± 0.348
2.387AspTyr: 2.387 ± 0.634
0.0AspXaa: 0.0 ± 0.0
Glu
7.162GluAla: 7.162 ± 1.132
0.281GluCys: 0.281 ± 0.199
3.932GluAsp: 3.932 ± 0.681
5.055GluGlu: 5.055 ± 1.052
2.809GluPhe: 2.809 ± 0.747
4.634GluGly: 4.634 ± 0.883
1.545GluHis: 1.545 ± 0.533
2.387GluIle: 2.387 ± 0.575
2.528GluLys: 2.528 ± 0.594
5.617GluLeu: 5.617 ± 0.851
1.404GluMet: 1.404 ± 0.44
2.668GluAsn: 2.668 ± 0.862
1.826GluPro: 1.826 ± 0.598
1.966GluGln: 1.966 ± 0.387
3.089GluArg: 3.089 ± 0.798
3.37GluSer: 3.37 ± 0.539
3.511GluThr: 3.511 ± 0.73
3.792GluVal: 3.792 ± 0.832
1.545GluTrp: 1.545 ± 0.464
1.966GluTyr: 1.966 ± 0.554
0.0GluXaa: 0.0 ± 0.0
Phe
1.826PheAla: 1.826 ± 0.532
0.421PheCys: 0.421 ± 0.223
1.685PheAsp: 1.685 ± 0.504
2.106PheGlu: 2.106 ± 0.578
1.404PhePhe: 1.404 ± 0.589
2.809PheGly: 2.809 ± 0.751
0.702PheHis: 0.702 ± 0.477
1.685PheIle: 1.685 ± 0.671
3.089PheLys: 3.089 ± 0.714
2.387PheLeu: 2.387 ± 0.683
1.404PheMet: 1.404 ± 0.511
2.809PheAsn: 2.809 ± 0.54
1.264PhePro: 1.264 ± 0.532
0.983PheGln: 0.983 ± 0.407
1.545PheArg: 1.545 ± 0.425
2.247PheSer: 2.247 ± 0.632
1.966PheThr: 1.966 ± 0.504
3.511PheVal: 3.511 ± 0.822
0.281PheTrp: 0.281 ± 0.175
0.983PheTyr: 0.983 ± 0.416
0.0PheXaa: 0.0 ± 0.0
Gly
5.477GlyAla: 5.477 ± 1.233
0.843GlyCys: 0.843 ± 0.282
5.336GlyAsp: 5.336 ± 0.598
2.809GlyGlu: 2.809 ± 0.676
2.949GlyPhe: 2.949 ± 0.681
4.353GlyGly: 4.353 ± 1.102
0.562GlyHis: 0.562 ± 0.263
3.932GlyIle: 3.932 ± 0.972
3.651GlyLys: 3.651 ± 0.758
6.319GlyLeu: 6.319 ± 1.268
1.545GlyMet: 1.545 ± 0.417
3.37GlyAsn: 3.37 ± 0.725
1.264GlyPro: 1.264 ± 0.381
2.247GlyGln: 2.247 ± 0.758
4.915GlyArg: 4.915 ± 0.893
4.634GlySer: 4.634 ± 0.855
4.915GlyThr: 4.915 ± 1.129
4.072GlyVal: 4.072 ± 0.699
0.843GlyTrp: 0.843 ± 0.305
2.528GlyTyr: 2.528 ± 0.565
0.0GlyXaa: 0.0 ± 0.0
His
1.545HisAla: 1.545 ± 0.458
0.281HisCys: 0.281 ± 0.237
0.562HisAsp: 0.562 ± 0.389
1.545HisGlu: 1.545 ± 0.472
0.702HisPhe: 0.702 ± 0.281
1.123HisGly: 1.123 ± 0.542
0.562HisHis: 0.562 ± 0.32
0.702HisIle: 0.702 ± 0.322
0.983HisLys: 0.983 ± 0.407
1.123HisLeu: 1.123 ± 0.519
0.421HisMet: 0.421 ± 0.267
1.264HisAsn: 1.264 ± 0.486
0.843HisPro: 0.843 ± 0.433
0.843HisGln: 0.843 ± 0.384
0.983HisArg: 0.983 ± 0.409
0.562HisSer: 0.562 ± 0.313
1.404HisThr: 1.404 ± 0.446
0.843HisVal: 0.843 ± 0.409
0.421HisTrp: 0.421 ± 0.375
0.843HisTyr: 0.843 ± 0.334
0.0HisXaa: 0.0 ± 0.0
Ile
4.775IleAla: 4.775 ± 0.842
0.562IleCys: 0.562 ± 0.23
3.932IleAsp: 3.932 ± 0.621
3.792IleGlu: 3.792 ± 0.884
0.983IlePhe: 0.983 ± 0.279
2.949IleGly: 2.949 ± 0.791
1.123IleHis: 1.123 ± 0.585
2.387IleIle: 2.387 ± 0.499
4.213IleLys: 4.213 ± 0.9
4.213IleLeu: 4.213 ± 0.555
1.123IleMet: 1.123 ± 0.374
3.089IleAsn: 3.089 ± 0.726
2.528IlePro: 2.528 ± 0.736
1.826IleGln: 1.826 ± 0.671
2.809IleArg: 2.809 ± 0.756
4.213IleSer: 4.213 ± 0.973
5.336IleThr: 5.336 ± 1.16
1.826IleVal: 1.826 ± 0.542
0.562IleTrp: 0.562 ± 0.233
1.826IleTyr: 1.826 ± 0.611
0.0IleXaa: 0.0 ± 0.0
Lys
4.915LysAla: 4.915 ± 0.713
0.702LysCys: 0.702 ± 0.355
4.353LysAsp: 4.353 ± 0.837
3.37LysGlu: 3.37 ± 0.891
1.826LysPhe: 1.826 ± 0.446
3.792LysGly: 3.792 ± 0.826
1.123LysHis: 1.123 ± 0.461
2.106LysIle: 2.106 ± 0.786
3.089LysLys: 3.089 ± 0.747
5.617LysLeu: 5.617 ± 0.933
1.966LysMet: 1.966 ± 0.519
3.089LysAsn: 3.089 ± 0.653
3.932LysPro: 3.932 ± 1.066
1.123LysGln: 1.123 ± 0.447
3.37LysArg: 3.37 ± 0.57
3.23LysSer: 3.23 ± 0.628
2.949LysThr: 2.949 ± 0.967
2.809LysVal: 2.809 ± 0.464
0.421LysTrp: 0.421 ± 0.226
2.949LysTyr: 2.949 ± 0.958
0.0LysXaa: 0.0 ± 0.0
Leu
7.724LeuAla: 7.724 ± 1.042
1.264LeuCys: 1.264 ± 0.532
6.46LeuAsp: 6.46 ± 0.898
4.072LeuGlu: 4.072 ± 0.616
2.247LeuPhe: 2.247 ± 0.6
6.038LeuGly: 6.038 ± 1.205
1.404LeuHis: 1.404 ± 0.494
4.775LeuIle: 4.775 ± 0.854
3.37LeuLys: 3.37 ± 0.75
5.477LeuLeu: 5.477 ± 1.02
1.404LeuMet: 1.404 ± 0.566
4.072LeuAsn: 4.072 ± 0.674
3.792LeuPro: 3.792 ± 0.658
4.775LeuGln: 4.775 ± 0.828
4.634LeuArg: 4.634 ± 0.873
4.915LeuSer: 4.915 ± 0.934
5.477LeuThr: 5.477 ± 1.03
5.055LeuVal: 5.055 ± 0.961
0.421LeuTrp: 0.421 ± 0.241
2.528LeuTyr: 2.528 ± 0.519
0.0LeuXaa: 0.0 ± 0.0
Met
2.387MetAla: 2.387 ± 0.566
0.281MetCys: 0.281 ± 0.195
0.983MetAsp: 0.983 ± 0.414
1.685MetGlu: 1.685 ± 0.615
0.562MetPhe: 0.562 ± 0.216
1.685MetGly: 1.685 ± 0.563
0.421MetHis: 0.421 ± 0.283
2.106MetIle: 2.106 ± 0.636
1.123MetLys: 1.123 ± 0.481
2.247MetLeu: 2.247 ± 0.55
0.281MetMet: 0.281 ± 0.218
0.843MetAsn: 0.843 ± 0.4
0.983MetPro: 0.983 ± 0.399
0.702MetGln: 0.702 ± 0.348
2.247MetArg: 2.247 ± 0.659
1.123MetSer: 1.123 ± 0.35
2.247MetThr: 2.247 ± 0.538
1.264MetVal: 1.264 ± 0.393
0.702MetTrp: 0.702 ± 0.387
0.421MetTyr: 0.421 ± 0.231
0.0MetXaa: 0.0 ± 0.0
Asn
4.072AsnAla: 4.072 ± 0.703
0.421AsnCys: 0.421 ± 0.288
3.651AsnAsp: 3.651 ± 0.899
2.528AsnGlu: 2.528 ± 0.86
2.106AsnPhe: 2.106 ± 0.671
5.055AsnGly: 5.055 ± 0.992
0.983AsnHis: 0.983 ± 0.378
2.949AsnIle: 2.949 ± 0.686
2.106AsnLys: 2.106 ± 0.604
4.213AsnLeu: 4.213 ± 0.886
0.702AsnMet: 0.702 ± 0.304
2.387AsnAsn: 2.387 ± 0.733
3.651AsnPro: 3.651 ± 0.67
1.826AsnGln: 1.826 ± 0.471
3.651AsnArg: 3.651 ± 0.834
3.089AsnSer: 3.089 ± 0.659
3.37AsnThr: 3.37 ± 0.947
2.949AsnVal: 2.949 ± 0.506
1.123AsnTrp: 1.123 ± 0.307
2.106AsnTyr: 2.106 ± 0.555
0.0AsnXaa: 0.0 ± 0.0
Pro
2.949ProAla: 2.949 ± 0.716
0.14ProCys: 0.14 ± 0.151
2.949ProAsp: 2.949 ± 0.694
3.792ProGlu: 3.792 ± 0.737
1.966ProPhe: 1.966 ± 0.484
2.668ProGly: 2.668 ± 0.547
1.264ProHis: 1.264 ± 0.41
1.264ProIle: 1.264 ± 0.372
3.089ProLys: 3.089 ± 0.813
2.387ProLeu: 2.387 ± 0.629
0.702ProMet: 0.702 ± 0.298
2.247ProAsn: 2.247 ± 0.515
1.404ProPro: 1.404 ± 0.331
2.106ProGln: 2.106 ± 0.83
1.264ProArg: 1.264 ± 0.505
2.809ProSer: 2.809 ± 0.44
2.528ProThr: 2.528 ± 0.563
1.826ProVal: 1.826 ± 0.571
1.545ProTrp: 1.545 ± 0.425
1.545ProTyr: 1.545 ± 0.437
0.0ProXaa: 0.0 ± 0.0
Gln
4.353GlnAla: 4.353 ± 0.854
0.421GlnCys: 0.421 ± 0.25
1.545GlnAsp: 1.545 ± 0.41
1.966GlnGlu: 1.966 ± 0.639
0.562GlnPhe: 0.562 ± 0.309
2.106GlnGly: 2.106 ± 0.561
1.123GlnHis: 1.123 ± 0.433
1.545GlnIle: 1.545 ± 0.327
1.685GlnLys: 1.685 ± 0.511
4.494GlnLeu: 4.494 ± 0.706
0.983GlnMet: 0.983 ± 0.297
1.966GlnAsn: 1.966 ± 0.596
1.404GlnPro: 1.404 ± 0.35
0.983GlnGln: 0.983 ± 0.354
1.966GlnArg: 1.966 ± 0.634
3.511GlnSer: 3.511 ± 1.103
2.809GlnThr: 2.809 ± 0.581
2.809GlnVal: 2.809 ± 0.806
0.281GlnTrp: 0.281 ± 0.186
1.264GlnTyr: 1.264 ± 0.374
0.0GlnXaa: 0.0 ± 0.0
Arg
5.336ArgAla: 5.336 ± 0.754
0.562ArgCys: 0.562 ± 0.346
1.966ArgAsp: 1.966 ± 0.505
3.23ArgGlu: 3.23 ± 0.698
0.983ArgPhe: 0.983 ± 0.347
3.23ArgGly: 3.23 ± 0.73
0.702ArgHis: 0.702 ± 0.398
4.353ArgIle: 4.353 ± 0.82
4.072ArgLys: 4.072 ± 1.185
5.196ArgLeu: 5.196 ± 1.315
2.247ArgMet: 2.247 ± 0.554
1.966ArgAsn: 1.966 ± 0.467
1.966ArgPro: 1.966 ± 0.479
2.668ArgGln: 2.668 ± 0.521
3.511ArgArg: 3.511 ± 1.059
4.634ArgSer: 4.634 ± 0.917
3.792ArgThr: 3.792 ± 1.125
3.37ArgVal: 3.37 ± 0.808
0.421ArgTrp: 0.421 ± 0.244
3.511ArgTyr: 3.511 ± 0.649
0.0ArgXaa: 0.0 ± 0.0
Ser
5.196SerAla: 5.196 ± 1.041
0.421SerCys: 0.421 ± 0.281
4.634SerAsp: 4.634 ± 0.961
5.055SerGlu: 5.055 ± 0.856
2.668SerPhe: 2.668 ± 0.925
5.055SerGly: 5.055 ± 1.087
0.843SerHis: 0.843 ± 0.366
4.213SerIle: 4.213 ± 0.938
4.494SerLys: 4.494 ± 0.779
3.932SerLeu: 3.932 ± 0.59
1.826SerMet: 1.826 ± 0.473
4.494SerAsn: 4.494 ± 0.991
3.651SerPro: 3.651 ± 0.61
3.792SerGln: 3.792 ± 0.737
4.213SerArg: 4.213 ± 1.493
6.881SerSer: 6.881 ± 2.295
5.758SerThr: 5.758 ± 1.641
3.37SerVal: 3.37 ± 0.665
1.264SerTrp: 1.264 ± 0.459
1.545SerTyr: 1.545 ± 0.462
0.0SerXaa: 0.0 ± 0.0
Thr
6.881ThrAla: 6.881 ± 0.801
1.264ThrCys: 1.264 ± 0.557
4.494ThrAsp: 4.494 ± 0.804
3.37ThrGlu: 3.37 ± 0.706
3.37ThrPhe: 3.37 ± 0.999
4.634ThrGly: 4.634 ± 0.84
0.983ThrHis: 0.983 ± 0.497
3.511ThrIle: 3.511 ± 0.628
2.668ThrLys: 2.668 ± 0.556
4.634ThrLeu: 4.634 ± 0.723
1.123ThrMet: 1.123 ± 0.387
2.528ThrAsn: 2.528 ± 0.678
2.809ThrPro: 2.809 ± 0.37
2.387ThrGln: 2.387 ± 0.49
3.23ThrArg: 3.23 ± 0.894
7.443ThrSer: 7.443 ± 1.434
4.072ThrThr: 4.072 ± 0.892
4.072ThrVal: 4.072 ± 0.972
0.281ThrTrp: 0.281 ± 0.2
2.809ThrTyr: 2.809 ± 0.6
0.0ThrXaa: 0.0 ± 0.0
Val
4.072ValAla: 4.072 ± 0.794
0.983ValCys: 0.983 ± 0.461
4.072ValAsp: 4.072 ± 0.842
3.511ValGlu: 3.511 ± 0.627
1.966ValPhe: 1.966 ± 0.608
3.511ValGly: 3.511 ± 0.74
0.843ValHis: 0.843 ± 0.308
2.668ValIle: 2.668 ± 0.574
4.634ValLys: 4.634 ± 0.842
3.511ValLeu: 3.511 ± 0.815
1.123ValMet: 1.123 ± 0.364
4.072ValAsn: 4.072 ± 0.8
1.545ValPro: 1.545 ± 0.453
1.826ValGln: 1.826 ± 0.538
3.37ValArg: 3.37 ± 0.618
6.46ValSer: 6.46 ± 1.026
4.634ValThr: 4.634 ± 0.714
5.336ValVal: 5.336 ± 1.118
0.843ValTrp: 0.843 ± 0.416
1.966ValTyr: 1.966 ± 0.592
0.0ValXaa: 0.0 ± 0.0
Trp
1.264TrpAla: 1.264 ± 0.497
0.421TrpCys: 0.421 ± 0.234
1.123TrpAsp: 1.123 ± 0.416
0.702TrpGlu: 0.702 ± 0.313
0.562TrpPhe: 0.562 ± 0.252
0.281TrpGly: 0.281 ± 0.181
0.14TrpHis: 0.14 ± 0.164
0.702TrpIle: 0.702 ± 0.332
0.843TrpLys: 0.843 ± 0.316
2.106TrpLeu: 2.106 ± 0.573
0.562TrpMet: 0.562 ± 0.318
0.14TrpAsn: 0.14 ± 0.136
0.281TrpPro: 0.281 ± 0.204
0.702TrpGln: 0.702 ± 0.291
0.702TrpArg: 0.702 ± 0.29
1.123TrpSer: 1.123 ± 0.349
0.281TrpThr: 0.281 ± 0.282
0.421TrpVal: 0.421 ± 0.263
0.281TrpTrp: 0.281 ± 0.201
0.281TrpTyr: 0.281 ± 0.173
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.668TyrAla: 2.668 ± 0.642
0.281TyrCys: 0.281 ± 0.226
2.949TyrAsp: 2.949 ± 0.801
2.387TyrGlu: 2.387 ± 0.605
1.404TyrPhe: 1.404 ± 0.435
2.387TyrGly: 2.387 ± 0.589
0.562TyrHis: 0.562 ± 0.264
2.106TyrIle: 2.106 ± 0.62
1.545TyrLys: 1.545 ± 0.514
2.528TyrLeu: 2.528 ± 0.646
0.702TyrMet: 0.702 ± 0.237
2.247TyrAsn: 2.247 ± 0.458
0.702TyrPro: 0.702 ± 0.34
1.545TyrGln: 1.545 ± 0.577
2.949TyrArg: 2.949 ± 0.747
1.966TyrSer: 1.966 ± 0.587
2.949TyrThr: 2.949 ± 0.639
2.668TyrVal: 2.668 ± 0.597
0.14TyrTrp: 0.14 ± 0.16
1.123TyrTyr: 1.123 ± 0.395
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 37 proteins (7122 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski