Amino acid dipepetide frequency for Thermus phage phi OH2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.5AlaAla: 6.5 ± 1.195
0.257AlaCys: 0.257 ± 0.146
4.533AlaAsp: 4.533 ± 0.744
4.789AlaGlu: 4.789 ± 0.673
3.421AlaPhe: 3.421 ± 0.728
5.388AlaGly: 5.388 ± 0.72
1.881AlaHis: 1.881 ± 0.368
5.046AlaIle: 5.046 ± 0.577
6.5AlaLys: 6.5 ± 0.791
5.986AlaLeu: 5.986 ± 0.879
2.566AlaMet: 2.566 ± 0.506
4.362AlaAsn: 4.362 ± 0.82
1.881AlaPro: 1.881 ± 0.38
3.079AlaGln: 3.079 ± 0.548
2.48AlaArg: 2.48 ± 0.449
4.704AlaSer: 4.704 ± 1.014
5.217AlaThr: 5.217 ± 0.778
5.131AlaVal: 5.131 ± 0.705
0.599AlaTrp: 0.599 ± 0.208
3.079AlaTyr: 3.079 ± 0.567
0.0AlaXaa: 0.0 ± 0.0
Cys
0.513CysAla: 0.513 ± 0.269
0.0CysCys: 0.0 ± 0.0
0.513CysAsp: 0.513 ± 0.216
0.77CysGlu: 0.77 ± 0.258
0.428CysPhe: 0.428 ± 0.212
0.342CysGly: 0.342 ± 0.168
0.0CysHis: 0.0 ± 0.0
0.342CysIle: 0.342 ± 0.183
0.599CysLys: 0.599 ± 0.186
0.171CysLeu: 0.171 ± 0.108
0.086CysMet: 0.086 ± 0.083
0.257CysAsn: 0.257 ± 0.145
0.342CysPro: 0.342 ± 0.192
0.257CysGln: 0.257 ± 0.153
0.599CysArg: 0.599 ± 0.319
0.257CysSer: 0.257 ± 0.162
0.086CysThr: 0.086 ± 0.087
0.428CysVal: 0.428 ± 0.219
0.0CysTrp: 0.0 ± 0.0
0.086CysTyr: 0.086 ± 0.093
0.0CysXaa: 0.0 ± 0.0
Asp
4.618AspAla: 4.618 ± 0.799
0.428AspCys: 0.428 ± 0.231
3.848AspAsp: 3.848 ± 0.549
6.927AspGlu: 6.927 ± 0.775
2.651AspPhe: 2.651 ± 0.484
4.105AspGly: 4.105 ± 0.686
0.428AspHis: 0.428 ± 0.165
4.704AspIle: 4.704 ± 0.662
3.335AspLys: 3.335 ± 0.654
3.848AspLeu: 3.848 ± 0.529
1.796AspMet: 1.796 ± 0.29
1.881AspAsn: 1.881 ± 0.352
1.71AspPro: 1.71 ± 0.373
1.71AspGln: 1.71 ± 0.386
2.993AspArg: 2.993 ± 0.481
2.138AspSer: 2.138 ± 0.49
2.822AspThr: 2.822 ± 0.414
4.362AspVal: 4.362 ± 0.706
0.599AspTrp: 0.599 ± 0.203
2.566AspTyr: 2.566 ± 0.433
0.0AspXaa: 0.0 ± 0.0
Glu
6.072GluAla: 6.072 ± 0.924
0.855GluCys: 0.855 ± 0.314
4.019GluAsp: 4.019 ± 0.604
7.355GluGlu: 7.355 ± 0.993
3.763GluPhe: 3.763 ± 0.513
3.592GluGly: 3.592 ± 0.537
2.138GluHis: 2.138 ± 0.527
5.473GluIle: 5.473 ± 0.698
5.901GluLys: 5.901 ± 0.684
8.381GluLeu: 8.381 ± 1.033
2.48GluMet: 2.48 ± 0.305
3.677GluAsn: 3.677 ± 0.563
2.651GluPro: 2.651 ± 0.436
4.362GluGln: 4.362 ± 0.855
5.046GluArg: 5.046 ± 0.799
2.651GluSer: 2.651 ± 0.485
4.019GluThr: 4.019 ± 0.786
6.243GluVal: 6.243 ± 0.674
1.368GluTrp: 1.368 ± 0.339
3.506GluTyr: 3.506 ± 0.479
0.0GluXaa: 0.0 ± 0.0
Phe
2.48PheAla: 2.48 ± 0.483
0.171PheCys: 0.171 ± 0.104
2.822PheAsp: 2.822 ± 0.57
3.677PheGlu: 3.677 ± 0.433
1.283PhePhe: 1.283 ± 0.395
1.967PheGly: 1.967 ± 0.461
0.684PheHis: 0.684 ± 0.283
2.822PheIle: 2.822 ± 0.489
3.934PheLys: 3.934 ± 0.657
3.25PheLeu: 3.25 ± 0.444
1.368PheMet: 1.368 ± 0.319
1.283PheAsn: 1.283 ± 0.333
0.77PhePro: 0.77 ± 0.231
1.197PheGln: 1.197 ± 0.369
2.566PheArg: 2.566 ± 0.405
3.25PheSer: 3.25 ± 0.556
2.053PheThr: 2.053 ± 0.439
2.993PheVal: 2.993 ± 0.527
0.257PheTrp: 0.257 ± 0.133
1.539PheTyr: 1.539 ± 0.298
0.0PheXaa: 0.0 ± 0.0
Gly
4.447GlyAla: 4.447 ± 0.737
0.342GlyCys: 0.342 ± 0.178
2.993GlyAsp: 2.993 ± 0.445
5.131GlyGlu: 5.131 ± 0.701
2.224GlyPhe: 2.224 ± 0.5
3.421GlyGly: 3.421 ± 0.628
1.539GlyHis: 1.539 ± 0.309
4.704GlyIle: 4.704 ± 0.563
5.473GlyLys: 5.473 ± 0.859
5.046GlyLeu: 5.046 ± 0.784
1.368GlyMet: 1.368 ± 0.359
3.592GlyAsn: 3.592 ± 0.662
1.026GlyPro: 1.026 ± 0.308
1.881GlyGln: 1.881 ± 0.364
3.164GlyArg: 3.164 ± 0.468
2.993GlySer: 2.993 ± 0.654
3.848GlyThr: 3.848 ± 0.659
5.302GlyVal: 5.302 ± 0.715
0.513GlyTrp: 0.513 ± 0.213
2.822GlyTyr: 2.822 ± 0.492
0.0GlyXaa: 0.0 ± 0.0
His
1.368HisAla: 1.368 ± 0.3
0.342HisCys: 0.342 ± 0.184
0.599HisAsp: 0.599 ± 0.195
1.967HisGlu: 1.967 ± 0.402
0.599HisPhe: 0.599 ± 0.22
1.283HisGly: 1.283 ± 0.33
0.513HisHis: 0.513 ± 0.2
0.855HisIle: 0.855 ± 0.246
1.283HisLys: 1.283 ± 0.324
1.283HisLeu: 1.283 ± 0.302
0.513HisMet: 0.513 ± 0.258
0.77HisAsn: 0.77 ± 0.326
1.283HisPro: 1.283 ± 0.363
0.684HisGln: 0.684 ± 0.195
0.855HisArg: 0.855 ± 0.314
1.026HisSer: 1.026 ± 0.279
1.026HisThr: 1.026 ± 0.349
1.026HisVal: 1.026 ± 0.317
0.342HisTrp: 0.342 ± 0.144
0.77HisTyr: 0.77 ± 0.286
0.0HisXaa: 0.0 ± 0.0
Ile
5.559IleAla: 5.559 ± 0.655
0.342IleCys: 0.342 ± 0.149
4.618IleAsp: 4.618 ± 0.652
5.986IleGlu: 5.986 ± 0.735
1.283IlePhe: 1.283 ± 0.377
4.362IleGly: 4.362 ± 0.689
1.283IleHis: 1.283 ± 0.269
3.421IleIle: 3.421 ± 0.435
6.072IleLys: 6.072 ± 0.736
4.019IleLeu: 4.019 ± 0.559
1.026IleMet: 1.026 ± 0.301
3.677IleAsn: 3.677 ± 0.515
2.737IlePro: 2.737 ± 0.41
2.651IleGln: 2.651 ± 0.51
4.875IleArg: 4.875 ± 0.666
2.737IleSer: 2.737 ± 0.682
3.592IleThr: 3.592 ± 0.565
4.276IleVal: 4.276 ± 0.711
0.599IleTrp: 0.599 ± 0.198
2.395IleTyr: 2.395 ± 0.391
0.0IleXaa: 0.0 ± 0.0
Lys
5.559LysAla: 5.559 ± 0.797
0.342LysCys: 0.342 ± 0.17
4.704LysAsp: 4.704 ± 0.732
7.013LysGlu: 7.013 ± 1.09
3.164LysPhe: 3.164 ± 0.497
5.302LysGly: 5.302 ± 0.559
1.539LysHis: 1.539 ± 0.439
5.131LysIle: 5.131 ± 0.698
8.296LysLys: 8.296 ± 1.1
7.526LysLeu: 7.526 ± 0.955
2.651LysMet: 2.651 ± 0.484
4.704LysAsn: 4.704 ± 0.712
2.309LysPro: 2.309 ± 0.46
4.618LysGln: 4.618 ± 0.652
4.533LysArg: 4.533 ± 0.749
3.592LysSer: 3.592 ± 0.625
4.276LysThr: 4.276 ± 0.633
4.96LysVal: 4.96 ± 0.655
0.941LysTrp: 0.941 ± 0.287
2.651LysTyr: 2.651 ± 0.545
0.0LysXaa: 0.0 ± 0.0
Leu
7.868LeuAla: 7.868 ± 1.136
0.257LeuCys: 0.257 ± 0.185
5.644LeuAsp: 5.644 ± 0.715
7.013LeuGlu: 7.013 ± 0.847
2.737LeuPhe: 2.737 ± 0.449
5.559LeuGly: 5.559 ± 0.607
1.283LeuHis: 1.283 ± 0.35
4.96LeuIle: 4.96 ± 0.632
8.381LeuLys: 8.381 ± 0.81
7.184LeuLeu: 7.184 ± 0.921
1.796LeuMet: 1.796 ± 0.449
2.566LeuAsn: 2.566 ± 0.494
2.737LeuPro: 2.737 ± 0.396
3.934LeuGln: 3.934 ± 0.69
4.362LeuArg: 4.362 ± 0.826
4.447LeuSer: 4.447 ± 0.694
4.019LeuThr: 4.019 ± 0.593
4.704LeuVal: 4.704 ± 0.564
0.599LeuTrp: 0.599 ± 0.258
2.395LeuTyr: 2.395 ± 0.479
0.0LeuXaa: 0.0 ± 0.0
Met
2.822MetAla: 2.822 ± 0.622
0.171MetCys: 0.171 ± 0.119
1.881MetAsp: 1.881 ± 0.4
1.71MetGlu: 1.71 ± 0.367
1.368MetPhe: 1.368 ± 0.351
1.197MetGly: 1.197 ± 0.315
0.0MetHis: 0.0 ± 0.0
1.539MetIle: 1.539 ± 0.411
2.138MetLys: 2.138 ± 0.499
1.625MetLeu: 1.625 ± 0.307
1.026MetMet: 1.026 ± 0.32
1.368MetAsn: 1.368 ± 0.294
1.112MetPro: 1.112 ± 0.317
1.454MetGln: 1.454 ± 0.34
1.71MetArg: 1.71 ± 0.482
1.625MetSer: 1.625 ± 0.341
1.454MetThr: 1.454 ± 0.318
1.026MetVal: 1.026 ± 0.262
0.428MetTrp: 0.428 ± 0.173
1.026MetTyr: 1.026 ± 0.198
0.0MetXaa: 0.0 ± 0.0
Asn
2.993AsnAla: 2.993 ± 0.558
0.257AsnCys: 0.257 ± 0.162
2.395AsnAsp: 2.395 ± 0.499
4.447AsnGlu: 4.447 ± 0.608
1.71AsnPhe: 1.71 ± 0.426
4.019AsnGly: 4.019 ± 0.622
0.77AsnHis: 0.77 ± 0.222
2.908AsnIle: 2.908 ± 0.525
3.079AsnLys: 3.079 ± 0.522
3.335AsnLeu: 3.335 ± 0.529
1.283AsnMet: 1.283 ± 0.357
1.796AsnAsn: 1.796 ± 0.362
2.651AsnPro: 2.651 ± 0.479
1.796AsnGln: 1.796 ± 0.382
2.737AsnArg: 2.737 ± 0.53
2.822AsnSer: 2.822 ± 0.536
2.651AsnThr: 2.651 ± 0.577
2.822AsnVal: 2.822 ± 0.442
0.684AsnTrp: 0.684 ± 0.292
1.026AsnTyr: 1.026 ± 0.333
0.0AsnXaa: 0.0 ± 0.0
Pro
2.737ProAla: 2.737 ± 0.624
0.171ProCys: 0.171 ± 0.118
1.881ProAsp: 1.881 ± 0.357
2.48ProGlu: 2.48 ± 0.51
1.368ProPhe: 1.368 ± 0.327
2.053ProGly: 2.053 ± 0.535
0.599ProHis: 0.599 ± 0.218
2.822ProIle: 2.822 ± 0.618
2.908ProLys: 2.908 ± 0.706
2.309ProLeu: 2.309 ± 0.473
0.684ProMet: 0.684 ± 0.264
1.881ProAsn: 1.881 ± 0.516
1.368ProPro: 1.368 ± 0.342
0.941ProGln: 0.941 ± 0.375
1.112ProArg: 1.112 ± 0.321
1.881ProSer: 1.881 ± 0.395
2.566ProThr: 2.566 ± 0.511
1.881ProVal: 1.881 ± 0.327
0.342ProTrp: 0.342 ± 0.211
1.283ProTyr: 1.283 ± 0.322
0.0ProXaa: 0.0 ± 0.0
Gln
3.934GlnAla: 3.934 ± 0.7
0.342GlnCys: 0.342 ± 0.166
2.053GlnAsp: 2.053 ± 0.542
3.335GlnGlu: 3.335 ± 0.521
1.881GlnPhe: 1.881 ± 0.383
1.625GlnGly: 1.625 ± 0.344
0.428GlnHis: 0.428 ± 0.266
2.908GlnIle: 2.908 ± 0.629
3.421GlnLys: 3.421 ± 0.703
4.704GlnLeu: 4.704 ± 0.65
1.283GlnMet: 1.283 ± 0.301
1.454GlnAsn: 1.454 ± 0.309
0.855GlnPro: 0.855 ± 0.28
2.053GlnGln: 2.053 ± 0.447
2.48GlnArg: 2.48 ± 0.549
1.881GlnSer: 1.881 ± 0.445
2.908GlnThr: 2.908 ± 0.572
2.309GlnVal: 2.309 ± 0.495
0.171GlnTrp: 0.171 ± 0.113
1.283GlnTyr: 1.283 ± 0.337
0.0GlnXaa: 0.0 ± 0.0
Arg
3.592ArgAla: 3.592 ± 0.521
0.599ArgCys: 0.599 ± 0.218
2.309ArgAsp: 2.309 ± 0.512
4.704ArgGlu: 4.704 ± 0.862
2.053ArgPhe: 2.053 ± 0.411
2.48ArgGly: 2.48 ± 0.44
1.026ArgHis: 1.026 ± 0.367
3.592ArgIle: 3.592 ± 0.67
5.046ArgLys: 5.046 ± 0.654
4.704ArgLeu: 4.704 ± 0.638
1.625ArgMet: 1.625 ± 0.436
2.138ArgAsn: 2.138 ± 0.453
1.71ArgPro: 1.71 ± 0.385
2.737ArgGln: 2.737 ± 0.693
3.25ArgArg: 3.25 ± 0.693
2.053ArgSer: 2.053 ± 0.473
2.138ArgThr: 2.138 ± 0.53
3.848ArgVal: 3.848 ± 0.635
0.684ArgTrp: 0.684 ± 0.212
2.908ArgTyr: 2.908 ± 0.605
0.0ArgXaa: 0.0 ± 0.0
Ser
4.447SerAla: 4.447 ± 0.929
0.257SerCys: 0.257 ± 0.136
2.566SerAsp: 2.566 ± 0.463
3.848SerGlu: 3.848 ± 0.627
2.908SerPhe: 2.908 ± 0.515
4.618SerGly: 4.618 ± 0.945
0.684SerHis: 0.684 ± 0.245
2.993SerIle: 2.993 ± 0.455
3.592SerLys: 3.592 ± 0.612
4.618SerLeu: 4.618 ± 0.71
1.283SerMet: 1.283 ± 0.402
1.881SerAsn: 1.881 ± 0.374
1.026SerPro: 1.026 ± 0.273
1.539SerGln: 1.539 ± 0.326
2.138SerArg: 2.138 ± 0.516
2.053SerSer: 2.053 ± 0.406
2.651SerThr: 2.651 ± 0.56
3.25SerVal: 3.25 ± 0.517
0.513SerTrp: 0.513 ± 0.237
1.881SerTyr: 1.881 ± 0.506
0.0SerXaa: 0.0 ± 0.0
Thr
3.848ThrAla: 3.848 ± 0.551
0.257ThrCys: 0.257 ± 0.157
2.309ThrAsp: 2.309 ± 0.484
3.421ThrGlu: 3.421 ± 0.575
2.48ThrPhe: 2.48 ± 0.553
4.276ThrGly: 4.276 ± 0.739
1.283ThrHis: 1.283 ± 0.31
4.191ThrIle: 4.191 ± 0.473
4.618ThrLys: 4.618 ± 0.599
4.96ThrLeu: 4.96 ± 0.751
1.368ThrMet: 1.368 ± 0.31
2.651ThrAsn: 2.651 ± 0.496
2.309ThrPro: 2.309 ± 0.449
1.881ThrGln: 1.881 ± 0.528
1.539ThrArg: 1.539 ± 0.41
2.651ThrSer: 2.651 ± 0.56
2.908ThrThr: 2.908 ± 0.565
5.131ThrVal: 5.131 ± 0.73
0.77ThrTrp: 0.77 ± 0.344
2.566ThrTyr: 2.566 ± 0.431
0.0ThrXaa: 0.0 ± 0.0
Val
4.105ValAla: 4.105 ± 0.518
0.257ValCys: 0.257 ± 0.156
4.704ValAsp: 4.704 ± 0.479
5.046ValGlu: 5.046 ± 0.694
3.592ValPhe: 3.592 ± 0.667
3.506ValGly: 3.506 ± 0.559
0.941ValHis: 0.941 ± 0.267
3.506ValIle: 3.506 ± 0.449
5.473ValLys: 5.473 ± 0.785
5.388ValLeu: 5.388 ± 0.654
1.283ValMet: 1.283 ± 0.326
4.105ValAsn: 4.105 ± 0.836
3.335ValPro: 3.335 ± 0.558
2.822ValGln: 2.822 ± 0.486
3.592ValArg: 3.592 ± 0.718
3.677ValSer: 3.677 ± 0.69
4.362ValThr: 4.362 ± 0.837
4.533ValVal: 4.533 ± 0.746
0.941ValTrp: 0.941 ± 0.272
3.335ValTyr: 3.335 ± 0.408
0.0ValXaa: 0.0 ± 0.0
Trp
0.855TrpAla: 0.855 ± 0.362
0.086TrpCys: 0.086 ± 0.087
0.342TrpAsp: 0.342 ± 0.169
0.77TrpGlu: 0.77 ± 0.277
0.086TrpPhe: 0.086 ± 0.082
0.599TrpGly: 0.599 ± 0.223
0.257TrpHis: 0.257 ± 0.14
0.77TrpIle: 0.77 ± 0.218
1.454TrpLys: 1.454 ± 0.354
1.112TrpLeu: 1.112 ± 0.298
0.257TrpMet: 0.257 ± 0.137
0.599TrpAsn: 0.599 ± 0.209
0.257TrpPro: 0.257 ± 0.154
0.257TrpGln: 0.257 ± 0.121
0.599TrpArg: 0.599 ± 0.244
0.257TrpSer: 0.257 ± 0.147
0.513TrpThr: 0.513 ± 0.25
1.026TrpVal: 1.026 ± 0.279
0.171TrpTrp: 0.171 ± 0.106
0.513TrpTyr: 0.513 ± 0.183
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.079TyrAla: 3.079 ± 0.499
0.342TyrCys: 0.342 ± 0.217
2.737TyrAsp: 2.737 ± 0.438
3.164TyrGlu: 3.164 ± 0.628
1.368TyrPhe: 1.368 ± 0.31
2.138TyrGly: 2.138 ± 0.353
1.112TyrHis: 1.112 ± 0.261
2.822TyrIle: 2.822 ± 0.504
2.309TyrLys: 2.309 ± 0.462
3.079TyrLeu: 3.079 ± 0.631
0.855TyrMet: 0.855 ± 0.314
1.539TyrAsn: 1.539 ± 0.389
1.197TyrPro: 1.197 ± 0.32
1.454TyrGln: 1.454 ± 0.38
2.566TyrArg: 2.566 ± 0.509
2.053TyrSer: 2.053 ± 0.334
2.309TyrThr: 2.309 ± 0.478
3.25TyrVal: 3.25 ± 0.448
0.257TyrTrp: 0.257 ± 0.132
1.454TyrTyr: 1.454 ± 0.392
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (11694 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski