Amino acid dipepetide frequency for Pseudoalteromonas phage H105/1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.367AlaAla: 7.367 ± 1.4
0.947AlaCys: 0.947 ± 0.252
4.631AlaAsp: 4.631 ± 0.622
7.156AlaGlu: 7.156 ± 1.1
2.526AlaPhe: 2.526 ± 0.541
6.735AlaGly: 6.735 ± 1.072
0.316AlaHis: 0.316 ± 0.176
4.841AlaIle: 4.841 ± 0.76
6.104AlaLys: 6.104 ± 0.934
8.314AlaLeu: 8.314 ± 1.585
1.473AlaMet: 1.473 ± 0.365
4.315AlaAsn: 4.315 ± 0.66
2.736AlaPro: 2.736 ± 0.568
4.104AlaGln: 4.104 ± 0.967
4.42AlaArg: 4.42 ± 0.774
6.314AlaSer: 6.314 ± 0.944
5.578AlaThr: 5.578 ± 0.924
4.42AlaVal: 4.42 ± 0.694
0.526AlaTrp: 0.526 ± 0.221
2.315AlaTyr: 2.315 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
0.737CysAla: 0.737 ± 0.253
0.21CysCys: 0.21 ± 0.156
0.737CysAsp: 0.737 ± 0.356
1.052CysGlu: 1.052 ± 0.315
0.421CysPhe: 0.421 ± 0.23
0.316CysGly: 0.316 ± 0.184
0.105CysHis: 0.105 ± 0.121
1.052CysIle: 1.052 ± 0.322
0.842CysLys: 0.842 ± 0.293
0.737CysLeu: 0.737 ± 0.262
0.526CysMet: 0.526 ± 0.243
0.737CysAsn: 0.737 ± 0.302
0.105CysPro: 0.105 ± 0.096
0.316CysGln: 0.316 ± 0.167
0.421CysArg: 0.421 ± 0.182
0.947CysSer: 0.947 ± 0.281
0.421CysThr: 0.421 ± 0.221
0.737CysVal: 0.737 ± 0.267
0.0CysTrp: 0.0 ± 0.0
0.421CysTyr: 0.421 ± 0.236
0.0CysXaa: 0.0 ± 0.0
Asp
4.42AspAla: 4.42 ± 0.585
0.842AspCys: 0.842 ± 0.311
4.736AspAsp: 4.736 ± 0.965
5.683AspGlu: 5.683 ± 0.893
3.683AspPhe: 3.683 ± 0.689
4.946AspGly: 4.946 ± 0.733
0.316AspHis: 0.316 ± 0.152
4.841AspIle: 4.841 ± 0.755
3.052AspLys: 3.052 ± 0.621
6.209AspLeu: 6.209 ± 0.819
1.684AspMet: 1.684 ± 0.402
2.842AspAsn: 2.842 ± 0.539
2.315AspPro: 2.315 ± 0.445
1.789AspGln: 1.789 ± 0.408
2.21AspArg: 2.21 ± 0.521
5.473AspSer: 5.473 ± 0.693
2.947AspThr: 2.947 ± 0.435
3.368AspVal: 3.368 ± 0.616
0.842AspTrp: 0.842 ± 0.339
3.578AspTyr: 3.578 ± 0.688
0.0AspXaa: 0.0 ± 0.0
Glu
5.367GluAla: 5.367 ± 1.022
1.368GluCys: 1.368 ± 0.393
2.631GluAsp: 2.631 ± 0.545
3.789GluGlu: 3.789 ± 0.63
3.368GluPhe: 3.368 ± 0.52
3.894GluGly: 3.894 ± 0.537
0.737GluHis: 0.737 ± 0.294
5.367GluIle: 5.367 ± 0.719
4.525GluLys: 4.525 ± 0.953
7.683GluLeu: 7.683 ± 1.013
1.894GluMet: 1.894 ± 0.457
3.473GluAsn: 3.473 ± 0.727
0.737GluPro: 0.737 ± 0.331
3.052GluGln: 3.052 ± 0.722
3.052GluArg: 3.052 ± 0.777
5.052GluSer: 5.052 ± 0.481
2.526GluThr: 2.526 ± 0.385
3.473GluVal: 3.473 ± 0.616
0.631GluTrp: 0.631 ± 0.263
3.157GluTyr: 3.157 ± 0.543
0.0GluXaa: 0.0 ± 0.0
Phe
2.421PheAla: 2.421 ± 0.678
0.316PheCys: 0.316 ± 0.172
3.052PheAsp: 3.052 ± 0.631
3.578PheGlu: 3.578 ± 0.885
1.789PhePhe: 1.789 ± 0.38
3.157PheGly: 3.157 ± 0.631
0.21PheHis: 0.21 ± 0.158
2.736PheIle: 2.736 ± 0.458
2.947PheLys: 2.947 ± 0.558
2.21PheLeu: 2.21 ± 0.466
1.052PheMet: 1.052 ± 0.292
3.052PheAsn: 3.052 ± 0.486
0.842PhePro: 0.842 ± 0.28
0.947PheGln: 0.947 ± 0.284
1.052PheArg: 1.052 ± 0.337
2.526PheSer: 2.526 ± 0.49
3.473PheThr: 3.473 ± 0.505
3.052PheVal: 3.052 ± 0.754
0.21PheTrp: 0.21 ± 0.133
1.894PheTyr: 1.894 ± 0.419
0.0PheXaa: 0.0 ± 0.0
Gly
6.104GlyAla: 6.104 ± 1.336
0.842GlyCys: 0.842 ± 0.255
5.788GlyAsp: 5.788 ± 1.039
3.789GlyGlu: 3.789 ± 0.649
3.683GlyPhe: 3.683 ± 0.659
8.209GlyGly: 8.209 ± 1.415
0.316GlyHis: 0.316 ± 0.202
4.315GlyIle: 4.315 ± 0.722
5.893GlyLys: 5.893 ± 0.783
5.157GlyLeu: 5.157 ± 0.688
0.947GlyMet: 0.947 ± 0.284
3.368GlyAsn: 3.368 ± 0.707
0.316GlyPro: 0.316 ± 0.189
3.262GlyGln: 3.262 ± 0.52
2.631GlyArg: 2.631 ± 0.626
5.788GlySer: 5.788 ± 0.967
4.104GlyThr: 4.104 ± 0.742
5.367GlyVal: 5.367 ± 0.629
0.421GlyTrp: 0.421 ± 0.272
3.157GlyTyr: 3.157 ± 0.517
0.0GlyXaa: 0.0 ± 0.0
His
0.947HisAla: 0.947 ± 0.334
0.316HisCys: 0.316 ± 0.17
0.631HisAsp: 0.631 ± 0.263
0.526HisGlu: 0.526 ± 0.25
0.316HisPhe: 0.316 ± 0.169
1.052HisGly: 1.052 ± 0.322
0.21HisHis: 0.21 ± 0.135
0.631HisIle: 0.631 ± 0.291
0.947HisLys: 0.947 ± 0.287
0.737HisLeu: 0.737 ± 0.293
0.105HisMet: 0.105 ± 0.113
0.316HisAsn: 0.316 ± 0.163
0.21HisPro: 0.21 ± 0.123
0.421HisGln: 0.421 ± 0.199
0.526HisArg: 0.526 ± 0.19
0.316HisSer: 0.316 ± 0.205
0.316HisThr: 0.316 ± 0.196
0.631HisVal: 0.631 ± 0.26
0.0HisTrp: 0.0 ± 0.0
0.421HisTyr: 0.421 ± 0.272
0.0HisXaa: 0.0 ± 0.0
Ile
4.736IleAla: 4.736 ± 0.789
0.316IleCys: 0.316 ± 0.174
5.578IleAsp: 5.578 ± 0.707
5.367IleGlu: 5.367 ± 0.86
2.526IlePhe: 2.526 ± 0.479
3.052IleGly: 3.052 ± 0.542
0.631IleHis: 0.631 ± 0.255
3.789IleIle: 3.789 ± 0.779
6.209IleLys: 6.209 ± 0.797
5.157IleLeu: 5.157 ± 0.717
1.263IleMet: 1.263 ± 0.397
4.525IleAsn: 4.525 ± 0.676
1.579IlePro: 1.579 ± 0.356
2.736IleGln: 2.736 ± 0.55
2.631IleArg: 2.631 ± 0.604
5.473IleSer: 5.473 ± 0.666
4.841IleThr: 4.841 ± 1.063
3.262IleVal: 3.262 ± 0.583
0.421IleTrp: 0.421 ± 0.234
2.0IleTyr: 2.0 ± 0.433
0.0IleXaa: 0.0 ± 0.0
Lys
6.841LysAla: 6.841 ± 0.926
0.737LysCys: 0.737 ± 0.281
4.104LysAsp: 4.104 ± 0.584
4.525LysGlu: 4.525 ± 0.924
2.315LysPhe: 2.315 ± 0.494
4.841LysGly: 4.841 ± 0.805
0.526LysHis: 0.526 ± 0.243
3.473LysIle: 3.473 ± 0.685
4.42LysLys: 4.42 ± 0.791
6.42LysLeu: 6.42 ± 1.091
1.684LysMet: 1.684 ± 0.465
3.683LysAsn: 3.683 ± 0.555
1.473LysPro: 1.473 ± 0.342
3.683LysGln: 3.683 ± 0.985
2.631LysArg: 2.631 ± 0.527
4.631LysSer: 4.631 ± 0.675
3.894LysThr: 3.894 ± 0.476
3.052LysVal: 3.052 ± 0.59
0.842LysTrp: 0.842 ± 0.346
2.631LysTyr: 2.631 ± 0.568
0.0LysXaa: 0.0 ± 0.0
Leu
8.314LeuAla: 8.314 ± 1.182
0.842LeuCys: 0.842 ± 0.344
5.683LeuAsp: 5.683 ± 0.938
5.262LeuGlu: 5.262 ± 1.189
2.842LeuPhe: 2.842 ± 0.521
5.052LeuGly: 5.052 ± 0.692
0.842LeuHis: 0.842 ± 0.458
6.314LeuIle: 6.314 ± 0.91
5.999LeuLys: 5.999 ± 0.827
5.157LeuLeu: 5.157 ± 1.066
1.473LeuMet: 1.473 ± 0.397
4.315LeuAsn: 4.315 ± 0.651
2.105LeuPro: 2.105 ± 0.52
3.262LeuGln: 3.262 ± 0.644
3.578LeuArg: 3.578 ± 0.508
7.367LeuSer: 7.367 ± 0.864
6.42LeuThr: 6.42 ± 0.635
4.736LeuVal: 4.736 ± 0.762
0.737LeuTrp: 0.737 ± 0.259
1.684LeuTyr: 1.684 ± 0.453
0.0LeuXaa: 0.0 ± 0.0
Met
1.894MetAla: 1.894 ± 0.423
0.316MetCys: 0.316 ± 0.175
0.842MetAsp: 0.842 ± 0.343
1.158MetGlu: 1.158 ± 0.333
0.737MetPhe: 0.737 ± 0.296
1.473MetGly: 1.473 ± 0.415
0.21MetHis: 0.21 ± 0.137
1.684MetIle: 1.684 ± 0.399
1.684MetLys: 1.684 ± 0.503
1.368MetLeu: 1.368 ± 0.348
0.316MetMet: 0.316 ± 0.149
1.158MetAsn: 1.158 ± 0.353
0.842MetPro: 0.842 ± 0.271
0.842MetGln: 0.842 ± 0.305
1.158MetArg: 1.158 ± 0.344
2.736MetSer: 2.736 ± 0.588
1.579MetThr: 1.579 ± 0.542
1.368MetVal: 1.368 ± 0.356
0.21MetTrp: 0.21 ± 0.17
0.947MetTyr: 0.947 ± 0.393
0.0MetXaa: 0.0 ± 0.0
Asn
4.525AsnAla: 4.525 ± 0.549
0.316AsnCys: 0.316 ± 0.145
3.578AsnAsp: 3.578 ± 0.475
4.21AsnGlu: 4.21 ± 0.621
2.315AsnPhe: 2.315 ± 0.436
5.367AsnGly: 5.367 ± 0.595
0.526AsnHis: 0.526 ± 0.238
3.683AsnIle: 3.683 ± 0.728
4.104AsnLys: 4.104 ± 0.653
3.368AsnLeu: 3.368 ± 0.633
0.947AsnMet: 0.947 ± 0.284
3.999AsnAsn: 3.999 ± 0.565
2.0AsnPro: 2.0 ± 0.518
2.421AsnGln: 2.421 ± 0.492
2.105AsnArg: 2.105 ± 0.653
3.789AsnSer: 3.789 ± 0.881
3.262AsnThr: 3.262 ± 0.631
3.473AsnVal: 3.473 ± 0.504
0.526AsnTrp: 0.526 ± 0.254
1.894AsnTyr: 1.894 ± 0.559
0.0AsnXaa: 0.0 ± 0.0
Pro
2.947ProAla: 2.947 ± 0.402
0.0ProCys: 0.0 ± 0.0
1.579ProAsp: 1.579 ± 0.406
1.263ProGlu: 1.263 ± 0.376
1.368ProPhe: 1.368 ± 0.412
0.316ProGly: 0.316 ± 0.165
0.21ProHis: 0.21 ± 0.145
1.263ProIle: 1.263 ± 0.347
1.684ProLys: 1.684 ± 0.438
2.631ProLeu: 2.631 ± 0.609
0.947ProMet: 0.947 ± 0.262
2.105ProAsn: 2.105 ± 0.671
0.526ProPro: 0.526 ± 0.208
1.473ProGln: 1.473 ± 0.39
1.052ProArg: 1.052 ± 0.394
1.789ProSer: 1.789 ± 0.399
2.0ProThr: 2.0 ± 0.56
2.315ProVal: 2.315 ± 0.512
0.105ProTrp: 0.105 ± 0.121
1.158ProTyr: 1.158 ± 0.295
0.0ProXaa: 0.0 ± 0.0
Gln
5.262GlnAla: 5.262 ± 1.187
0.21GlnCys: 0.21 ± 0.157
1.789GlnAsp: 1.789 ± 0.371
2.631GlnGlu: 2.631 ± 0.548
1.263GlnPhe: 1.263 ± 0.319
3.157GlnGly: 3.157 ± 0.64
0.842GlnHis: 0.842 ± 0.274
3.157GlnIle: 3.157 ± 0.515
1.579GlnLys: 1.579 ± 0.357
4.21GlnLeu: 4.21 ± 0.745
0.947GlnMet: 0.947 ± 0.317
1.263GlnAsn: 1.263 ± 0.304
1.473GlnPro: 1.473 ± 0.296
3.578GlnGln: 3.578 ± 1.352
2.0GlnArg: 2.0 ± 0.413
3.368GlnSer: 3.368 ± 0.662
2.105GlnThr: 2.105 ± 0.538
2.526GlnVal: 2.526 ± 0.496
0.105GlnTrp: 0.105 ± 0.103
1.894GlnTyr: 1.894 ± 0.397
0.0GlnXaa: 0.0 ± 0.0
Arg
2.947ArgAla: 2.947 ± 0.533
0.631ArgCys: 0.631 ± 0.295
3.578ArgAsp: 3.578 ± 0.701
3.368ArgGlu: 3.368 ± 0.805
1.579ArgPhe: 1.579 ± 0.412
1.789ArgGly: 1.789 ± 0.496
0.526ArgHis: 0.526 ± 0.217
3.368ArgIle: 3.368 ± 0.509
3.473ArgLys: 3.473 ± 0.679
3.578ArgLeu: 3.578 ± 0.609
1.052ArgMet: 1.052 ± 0.308
1.789ArgAsn: 1.789 ± 0.368
1.158ArgPro: 1.158 ± 0.326
2.947ArgGln: 2.947 ± 0.555
1.894ArgArg: 1.894 ± 0.472
2.421ArgSer: 2.421 ± 0.487
1.894ArgThr: 1.894 ± 0.484
2.526ArgVal: 2.526 ± 0.559
0.21ArgTrp: 0.21 ± 0.147
0.631ArgTyr: 0.631 ± 0.277
0.0ArgXaa: 0.0 ± 0.0
Ser
6.314SerAla: 6.314 ± 0.826
0.947SerCys: 0.947 ± 0.265
5.052SerAsp: 5.052 ± 0.66
3.789SerGlu: 3.789 ± 0.753
3.262SerPhe: 3.262 ± 0.456
7.788SerGly: 7.788 ± 1.109
0.947SerHis: 0.947 ± 0.271
5.052SerIle: 5.052 ± 0.832
4.21SerLys: 4.21 ± 0.797
5.262SerLeu: 5.262 ± 0.686
2.526SerMet: 2.526 ± 0.55
4.315SerAsn: 4.315 ± 0.795
2.947SerPro: 2.947 ± 0.509
2.631SerGln: 2.631 ± 0.475
3.473SerArg: 3.473 ± 0.534
4.946SerSer: 4.946 ± 0.942
5.262SerThr: 5.262 ± 1.142
5.157SerVal: 5.157 ± 0.711
1.052SerTrp: 1.052 ± 0.276
2.736SerTyr: 2.736 ± 0.524
0.0SerXaa: 0.0 ± 0.0
Thr
6.735ThrAla: 6.735 ± 1.25
0.421ThrCys: 0.421 ± 0.199
4.946ThrAsp: 4.946 ± 0.607
2.736ThrGlu: 2.736 ± 0.56
2.421ThrPhe: 2.421 ± 0.582
4.631ThrGly: 4.631 ± 0.913
0.631ThrHis: 0.631 ± 0.234
4.21ThrIle: 4.21 ± 0.745
3.368ThrLys: 3.368 ± 0.565
5.788ThrLeu: 5.788 ± 0.732
0.421ThrMet: 0.421 ± 0.227
3.368ThrAsn: 3.368 ± 0.661
1.789ThrPro: 1.789 ± 0.443
2.842ThrGln: 2.842 ± 0.486
2.105ThrArg: 2.105 ± 0.339
5.578ThrSer: 5.578 ± 1.105
3.473ThrThr: 3.473 ± 0.623
4.21ThrVal: 4.21 ± 0.808
0.421ThrTrp: 0.421 ± 0.238
1.894ThrTyr: 1.894 ± 0.434
0.0ThrXaa: 0.0 ± 0.0
Val
4.946ValAla: 4.946 ± 0.827
0.526ValCys: 0.526 ± 0.222
4.525ValAsp: 4.525 ± 0.794
3.473ValGlu: 3.473 ± 0.452
2.0ValPhe: 2.0 ± 0.527
5.157ValGly: 5.157 ± 0.727
0.737ValHis: 0.737 ± 0.234
3.473ValIle: 3.473 ± 0.586
3.262ValLys: 3.262 ± 0.709
3.683ValLeu: 3.683 ± 0.522
1.894ValMet: 1.894 ± 0.625
5.367ValAsn: 5.367 ± 0.62
1.684ValPro: 1.684 ± 0.407
1.368ValGln: 1.368 ± 0.283
2.315ValArg: 2.315 ± 0.552
4.946ValSer: 4.946 ± 0.927
5.157ValThr: 5.157 ± 0.95
3.578ValVal: 3.578 ± 0.716
0.526ValTrp: 0.526 ± 0.213
2.105ValTyr: 2.105 ± 0.455
0.0ValXaa: 0.0 ± 0.0
Trp
0.631TrpAla: 0.631 ± 0.225
0.0TrpCys: 0.0 ± 0.0
0.737TrpAsp: 0.737 ± 0.311
0.526TrpGlu: 0.526 ± 0.285
0.316TrpPhe: 0.316 ± 0.19
0.631TrpGly: 0.631 ± 0.27
0.21TrpHis: 0.21 ± 0.134
0.421TrpIle: 0.421 ± 0.203
0.526TrpLys: 0.526 ± 0.254
0.737TrpLeu: 0.737 ± 0.318
0.0TrpMet: 0.0 ± 0.0
0.526TrpAsn: 0.526 ± 0.271
0.0TrpPro: 0.0 ± 0.0
0.21TrpGln: 0.21 ± 0.169
0.21TrpArg: 0.21 ± 0.176
0.737TrpSer: 0.737 ± 0.308
0.421TrpThr: 0.421 ± 0.197
0.842TrpVal: 0.842 ± 0.325
0.0TrpTrp: 0.0 ± 0.0
0.316TrpTyr: 0.316 ± 0.196
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.789TyrAla: 1.789 ± 0.428
0.737TyrCys: 0.737 ± 0.345
1.579TyrAsp: 1.579 ± 0.386
1.894TyrGlu: 1.894 ± 0.442
1.789TyrPhe: 1.789 ± 0.431
1.894TyrGly: 1.894 ± 0.429
0.526TyrHis: 0.526 ± 0.232
2.421TyrIle: 2.421 ± 0.506
1.579TyrLys: 1.579 ± 0.455
3.578TyrLeu: 3.578 ± 0.667
1.263TyrMet: 1.263 ± 0.393
1.894TyrAsn: 1.894 ± 0.463
1.894TyrPro: 1.894 ± 0.572
1.473TyrGln: 1.473 ± 0.337
1.894TyrArg: 1.894 ± 0.444
3.473TyrSer: 3.473 ± 0.608
2.315TyrThr: 2.315 ± 0.444
2.631TyrVal: 2.631 ± 0.526
0.21TyrTrp: 0.21 ± 0.156
0.842TyrTyr: 0.842 ± 0.239
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (9503 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski