Amino acid dipeptide frequency for Flavobacterium phage 11b

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
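As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It assumes overlapping residue pairs are counted within each protein and normalised by the total number of pairs; the function name `dipeptide_per_mille`, the toy sequences, and the exact normalisation are assumptions for illustration and may differ from what the database itself does.

```python
from collections import Counter

# The 20 standard residues in one-letter code; anything else is treated as Xaa ('X').
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein strings."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MKKLA", "MKXA"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
uniform_expectation = 1000.0 / 400
```

With real data, the list of proteins would come from the proteome's FASTA file, and the resulting values can be compared against `uniform_expectation`.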

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
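The unit conversion and the comparison with the random expectation can be sketched as follows. The helper names are hypothetical, and the simple threshold of 2.5‰ (1/400) is an assumption based on the text above; whether the page's colouring also takes the error estimate into account is not stated.

```python
# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PER_MILLE):
    """Classify a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Three values taken from the table: LeuLys, CysCys and AlaPhe.
examples = {"LeuLys": 9.391, "CysCys": 0.092, "AlaPhe": 2.486}
labels = {pair: representation(v) for pair, v in examples.items()}
```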

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.867 ± 0.745
AlaCys: 0.737 ± 0.288
AlaAsp: 3.038 ± 0.51
AlaGlu: 3.498 ± 0.598
AlaPhe: 2.486 ± 0.59
AlaGly: 3.775 ± 0.892
AlaHis: 0.921 ± 0.25
AlaIle: 5.34 ± 0.879
AlaLys: 4.695 ± 0.793
AlaLeu: 6.444 ± 0.876
AlaMet: 1.197 ± 0.274
AlaAsn: 3.498 ± 0.697
AlaPro: 1.749 ± 0.495
AlaGln: 2.67 ± 0.515
AlaArg: 2.025 ± 0.429
AlaSer: 3.867 ± 0.768
AlaThr: 5.064 ± 1.191
AlaVal: 5.064 ± 0.671
AlaTrp: 0.552 ± 0.25
AlaTyr: 2.394 ± 0.416
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.46 ± 0.204
CysCys: 0.092 ± 0.078
CysAsp: 0.368 ± 0.181
CysGlu: 0.46 ± 0.204
CysPhe: 0.644 ± 0.253
CysGly: 0.644 ± 0.359
CysHis: 0.276 ± 0.161
CysIle: 0.644 ± 0.263
CysLys: 0.644 ± 0.349
CysLeu: 1.197 ± 0.328
CysMet: 0.184 ± 0.148
CysAsn: 0.737 ± 0.287
CysPro: 0.0 ± 0.0
CysGln: 0.092 ± 0.104
CysArg: 0.368 ± 0.214
CysSer: 0.276 ± 0.176
CysThr: 0.276 ± 0.159
CysVal: 0.552 ± 0.254
CysTrp: 0.092 ± 0.094
CysTyr: 0.644 ± 0.242
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.222 ± 0.494
AspCys: 0.46 ± 0.186
AspAsp: 2.578 ± 0.532
AspGlu: 2.762 ± 0.693
AspPhe: 2.486 ± 0.546
AspGly: 3.406 ± 0.56
AspHis: 0.46 ± 0.2
AspIle: 5.064 ± 0.698
AspLys: 5.156 ± 0.92
AspLeu: 6.352 ± 0.829
AspMet: 0.921 ± 0.294
AspAsn: 4.879 ± 0.719
AspPro: 1.657 ± 0.383
AspGln: 1.105 ± 0.297
AspArg: 1.749 ± 0.443
AspSer: 3.222 ± 0.522
AspThr: 2.946 ± 0.436
AspVal: 3.406 ± 0.45
AspTrp: 0.552 ± 0.236
AspTyr: 3.498 ± 0.76
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.578 ± 0.573
GluCys: 0.46 ± 0.197
GluAsp: 2.946 ± 0.509
GluGlu: 3.683 ± 0.731
GluPhe: 3.038 ± 0.549
GluGly: 3.038 ± 0.732
GluHis: 0.921 ± 0.263
GluIle: 5.708 ± 0.795
GluLys: 5.064 ± 0.912
GluLeu: 7.549 ± 1.038
GluMet: 1.289 ± 0.442
GluAsn: 4.419 ± 0.696
GluPro: 1.289 ± 0.335
GluGln: 2.117 ± 0.506
GluArg: 1.657 ± 0.39
GluSer: 3.775 ± 0.526
GluThr: 3.314 ± 0.599
GluVal: 4.143 ± 0.61
GluTrp: 0.921 ± 0.374
GluTyr: 2.762 ± 0.569
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.302 ± 0.649
PheCys: 0.921 ± 0.328
PheAsp: 3.314 ± 0.522
PheGlu: 2.117 ± 0.517
PhePhe: 1.657 ± 0.425
PheGly: 3.222 ± 0.524
PheHis: 0.46 ± 0.196
PheIle: 4.787 ± 0.703
PheLys: 4.235 ± 0.627
PheLeu: 3.038 ± 0.511
PheMet: 1.289 ± 0.333
PheAsn: 3.683 ± 0.555
PhePro: 0.921 ± 0.265
PheGln: 1.013 ± 0.307
PheArg: 1.841 ± 0.377
PheSer: 3.038 ± 0.58
PheThr: 3.775 ± 0.485
PheVal: 2.025 ± 0.55
PheTrp: 0.368 ± 0.172
PheTyr: 1.841 ± 0.479
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.051 ± 0.843
GlyCys: 0.737 ± 0.306
GlyAsp: 2.762 ± 0.594
GlyGlu: 4.419 ± 0.635
GlyPhe: 3.222 ± 0.54
GlyGly: 5.8 ± 1.465
GlyHis: 0.368 ± 0.154
GlyIle: 5.708 ± 0.714
GlyLys: 4.143 ± 0.623
GlyLeu: 6.352 ± 0.766
GlyMet: 1.289 ± 0.368
GlyAsn: 4.235 ± 0.958
GlyPro: 0.368 ± 0.169
GlyGln: 2.025 ± 0.465
GlyArg: 1.565 ± 0.412
GlySer: 5.248 ± 0.907
GlyThr: 5.34 ± 1.088
GlyVal: 4.143 ± 0.504
GlyTrp: 0.829 ± 0.296
GlyTyr: 2.578 ± 0.488
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.644 ± 0.237
HisCys: 0.0 ± 0.0
HisAsp: 0.737 ± 0.302
HisGlu: 0.737 ± 0.234
HisPhe: 0.368 ± 0.172
HisGly: 0.552 ± 0.206
HisHis: 0.184 ± 0.119
HisIle: 0.737 ± 0.313
HisLys: 0.46 ± 0.221
HisLeu: 0.921 ± 0.295
HisMet: 0.092 ± 0.097
HisAsn: 0.644 ± 0.23
HisPro: 0.368 ± 0.164
HisGln: 0.368 ± 0.167
HisArg: 0.276 ± 0.166
HisSer: 1.105 ± 0.26
HisThr: 1.105 ± 0.382
HisVal: 0.552 ± 0.189
HisTrp: 0.092 ± 0.103
HisTyr: 0.368 ± 0.184
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.8 ± 0.793
IleCys: 1.013 ± 0.353
IleAsp: 5.708 ± 0.729
IleGlu: 6.997 ± 0.849
IlePhe: 2.946 ± 0.538
IleGly: 5.34 ± 0.617
IleHis: 1.013 ± 0.324
IleIle: 5.524 ± 0.973
IleLys: 9.114 ± 1.292
IleLeu: 5.432 ± 0.848
IleMet: 1.473 ± 0.399
IleAsn: 6.905 ± 0.929
IlePro: 2.21 ± 0.422
IleGln: 3.406 ± 0.627
IleArg: 2.67 ± 0.497
IleSer: 5.984 ± 0.606
IleThr: 5.984 ± 0.786
IleVal: 4.235 ± 0.44
IleTrp: 0.368 ± 0.205
IleTyr: 3.13 ± 0.619
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.34 ± 0.682
LysCys: 0.829 ± 0.359
LysAsp: 5.984 ± 0.919
LysGlu: 5.892 ± 0.842
LysPhe: 2.946 ± 0.68
LysGly: 4.879 ± 0.637
LysHis: 1.289 ± 0.358
LysIle: 6.905 ± 0.989
LysLys: 7.457 ± 1.263
LysLeu: 7.641 ± 0.857
LysMet: 2.854 ± 0.51
LysAsn: 5.616 ± 0.877
LysPro: 2.394 ± 0.429
LysGln: 3.314 ± 0.685
LysArg: 2.762 ± 0.446
LysSer: 4.879 ± 0.896
LysThr: 4.419 ± 0.743
LysVal: 4.327 ± 0.652
LysTrp: 1.289 ± 0.441
LysTyr: 4.419 ± 0.766
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.168 ± 1.122
LeuCys: 0.737 ± 0.248
LeuAsp: 5.8 ± 0.7
LeuGlu: 6.352 ± 0.889
LeuPhe: 3.13 ± 0.566
LeuGly: 4.971 ± 0.706
LeuHis: 1.197 ± 0.34
LeuIle: 7.457 ± 0.99
LeuLys: 9.391 ± 1.187
LeuLeu: 5.616 ± 0.694
LeuMet: 2.486 ± 0.527
LeuAsn: 7.365 ± 0.761
LeuPro: 3.406 ± 0.603
LeuGln: 3.867 ± 0.662
LeuArg: 2.946 ± 0.603
LeuSer: 6.629 ± 0.998
LeuThr: 5.708 ± 0.756
LeuVal: 4.235 ± 0.569
LeuTrp: 0.368 ± 0.171
LeuTyr: 2.946 ± 0.523
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.67 ± 0.466
MetCys: 0.0 ± 0.0
MetAsp: 0.737 ± 0.264
MetGlu: 1.289 ± 0.39
MetPhe: 0.829 ± 0.31
MetGly: 0.829 ± 0.257
MetHis: 0.276 ± 0.167
MetIle: 1.381 ± 0.375
MetLys: 2.025 ± 0.429
MetLeu: 1.289 ± 0.391
MetMet: 0.368 ± 0.186
MetAsn: 1.013 ± 0.272
MetPro: 0.552 ± 0.206
MetGln: 1.197 ± 0.339
MetArg: 0.737 ± 0.291
MetSer: 1.197 ± 0.406
MetThr: 1.473 ± 0.437
MetVal: 0.829 ± 0.307
MetTrp: 0.184 ± 0.133
MetTyr: 0.737 ± 0.298
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.959 ± 0.549
AsnCys: 0.644 ± 0.255
AsnAsp: 3.13 ± 0.576
AsnGlu: 4.235 ± 0.604
AsnPhe: 3.867 ± 0.799
AsnGly: 5.984 ± 0.925
AsnHis: 0.921 ± 0.36
AsnIle: 5.616 ± 1.012
AsnLys: 5.616 ± 0.636
AsnLeu: 6.26 ± 0.734
AsnMet: 1.381 ± 0.341
AsnAsn: 6.352 ± 0.75
AsnPro: 2.486 ± 0.403
AsnGln: 2.762 ± 0.542
AsnArg: 1.933 ± 0.406
AsnSer: 3.867 ± 0.491
AsnThr: 5.432 ± 0.705
AsnVal: 4.235 ± 0.77
AsnTrp: 0.921 ± 0.398
AsnTyr: 3.314 ± 0.584
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.117 ± 0.599
ProCys: 0.184 ± 0.134
ProAsp: 1.841 ± 0.533
ProGlu: 1.105 ± 0.281
ProPhe: 1.657 ± 0.306
ProGly: 0.921 ± 0.311
ProHis: 0.276 ± 0.134
ProIle: 2.67 ± 0.549
ProLys: 2.025 ± 0.527
ProLeu: 3.13 ± 0.506
ProMet: 0.552 ± 0.254
ProAsn: 2.117 ± 0.4
ProPro: 1.013 ± 0.386
ProGln: 1.013 ± 0.288
ProArg: 0.552 ± 0.224
ProSer: 1.565 ± 0.478
ProThr: 2.946 ± 0.618
ProVal: 1.841 ± 0.364
ProTrp: 0.092 ± 0.088
ProTyr: 0.552 ± 0.188
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.841 ± 0.494
GlnCys: 0.368 ± 0.235
GlnAsp: 1.565 ± 0.368
GlnGlu: 1.841 ± 0.414
GlnPhe: 2.025 ± 0.424
GlnGly: 2.394 ± 0.494
GlnHis: 0.184 ± 0.113
GlnIle: 3.59 ± 0.477
GlnLys: 2.946 ± 0.49
GlnLeu: 3.038 ± 0.527
GlnMet: 0.644 ± 0.299
GlnAsn: 2.854 ± 0.548
GlnPro: 1.565 ± 0.342
GlnGln: 1.473 ± 0.335
GlnArg: 1.197 ± 0.315
GlnSer: 2.117 ± 0.42
GlnThr: 2.025 ± 0.618
GlnVal: 2.578 ± 0.465
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.657 ± 0.429
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.749 ± 0.429
ArgCys: 0.276 ± 0.185
ArgAsp: 1.289 ± 0.38
ArgGlu: 1.473 ± 0.452
ArgPhe: 1.105 ± 0.338
ArgGly: 1.565 ± 0.479
ArgHis: 0.092 ± 0.084
ArgIle: 3.683 ± 0.544
ArgLys: 2.762 ± 0.461
ArgLeu: 3.683 ± 0.621
ArgMet: 0.644 ± 0.242
ArgAsn: 1.933 ± 0.364
ArgPro: 0.737 ± 0.24
ArgGln: 0.552 ± 0.245
ArgArg: 1.289 ± 0.291
ArgSer: 1.841 ± 0.376
ArgThr: 1.473 ± 0.329
ArgVal: 2.21 ± 0.432
ArgTrp: 0.829 ± 0.291
ArgTyr: 1.565 ± 0.353
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.787 ± 0.967
SerCys: 0.737 ± 0.242
SerAsp: 3.406 ± 0.552
SerGlu: 3.683 ± 0.483
SerPhe: 4.419 ± 0.597
SerGly: 4.603 ± 0.582
SerHis: 0.644 ± 0.244
SerIle: 5.708 ± 0.766
SerLys: 3.775 ± 0.75
SerLeu: 5.432 ± 0.814
SerMet: 0.644 ± 0.216
SerAsn: 3.867 ± 0.689
SerPro: 1.473 ± 0.514
SerGln: 2.486 ± 0.446
SerArg: 2.025 ± 0.445
SerSer: 3.959 ± 1.05
SerThr: 4.695 ± 0.852
SerVal: 3.498 ± 0.718
SerTrp: 0.829 ± 0.305
SerTyr: 2.302 ± 0.373
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.984 ± 1.369
ThrCys: 0.092 ± 0.104
ThrAsp: 3.498 ± 0.516
ThrGlu: 3.775 ± 0.527
ThrPhe: 2.946 ± 0.468
ThrGly: 6.076 ± 0.886
ThrHis: 0.092 ± 0.081
ThrIle: 5.34 ± 0.914
ThrLys: 5.156 ± 0.787
ThrLeu: 6.629 ± 0.902
ThrMet: 0.829 ± 0.315
ThrAsn: 3.775 ± 0.527
ThrPro: 2.486 ± 0.57
ThrGln: 2.67 ± 0.378
ThrArg: 1.841 ± 0.549
ThrSer: 3.775 ± 0.678
ThrThr: 5.616 ± 1.473
ThrVal: 4.051 ± 0.47
ThrTrp: 0.552 ± 0.184
ThrTyr: 2.946 ± 0.445
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.486 ± 0.437
ValCys: 0.092 ± 0.082
ValAsp: 3.314 ± 0.493
ValGlu: 3.13 ± 0.546
ValPhe: 2.854 ± 0.47
ValGly: 3.959 ± 0.763
ValHis: 0.46 ± 0.178
ValIle: 4.787 ± 0.725
ValLys: 6.352 ± 0.676
ValLeu: 6.168 ± 0.694
ValMet: 0.46 ± 0.237
ValAsn: 5.432 ± 0.847
ValPro: 1.841 ± 0.369
ValGln: 2.486 ± 0.488
ValArg: 1.105 ± 0.273
ValSer: 3.498 ± 0.607
ValThr: 3.498 ± 0.68
ValVal: 3.775 ± 0.699
ValTrp: 0.46 ± 0.197
ValTyr: 2.302 ± 0.471
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.552 ± 0.202
TrpCys: 0.184 ± 0.136
TrpAsp: 0.276 ± 0.141
TrpGlu: 0.921 ± 0.284
TrpPhe: 0.644 ± 0.323
TrpGly: 0.368 ± 0.187
TrpHis: 0.0 ± 0.0
TrpIle: 1.105 ± 0.252
TrpLys: 0.737 ± 0.268
TrpLeu: 0.737 ± 0.282
TrpMet: 0.092 ± 0.112
TrpAsn: 0.829 ± 0.295
TrpPro: 0.0 ± 0.0
TrpGln: 0.368 ± 0.186
TrpArg: 0.552 ± 0.191
TrpSer: 1.105 ± 0.262
TrpThr: 0.46 ± 0.19
TrpVal: 0.552 ± 0.22
TrpTrp: 0.092 ± 0.1
TrpTyr: 0.46 ± 0.204
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.302 ± 0.493
TyrCys: 0.092 ± 0.102
TyrAsp: 3.59 ± 0.593
TyrGlu: 2.302 ± 0.481
TyrPhe: 2.578 ± 0.569
TyrGly: 2.854 ± 0.542
TyrHis: 0.276 ± 0.145
TyrIle: 3.59 ± 0.588
TyrLys: 3.683 ± 0.658
TyrLeu: 3.867 ± 0.543
TyrMet: 0.829 ± 0.278
TyrAsn: 2.762 ± 0.652
TyrPro: 1.657 ± 0.361
TyrGln: 0.921 ± 0.272
TyrArg: 1.657 ± 0.408
TyrSer: 2.025 ± 0.422
TyrThr: 2.578 ± 0.786
TyrVal: 2.21 ± 0.404
TyrTrp: 0.644 ± 0.284
TyrTyr: 1.749 ± 0.521
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (10863 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
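A protein-level bootstrap of this kind can be sketched in Python as below. This assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resulting estimates; the function names, the fixed seed, and the choice of standard deviation are assumptions, since the note does not spell out the exact procedure.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the phage data.
toy = ["MKKLA", "AKKA", "GGGG"]
error_kk = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects variation between proteins.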

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski