Amino acid dipepetide frequency for Moraxella phage Mcat14

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.0AlaAla: 9.0 ± 2.56
1.34AlaCys: 1.34 ± 0.568
5.936AlaAsp: 5.936 ± 1.022
7.277AlaGlu: 7.277 ± 1.296
4.404AlaPhe: 4.404 ± 1.538
5.553AlaGly: 5.553 ± 1.204
1.532AlaHis: 1.532 ± 0.407
7.468AlaIle: 7.468 ± 1.62
8.617AlaLys: 8.617 ± 1.435
8.234AlaLeu: 8.234 ± 1.38
3.255AlaMet: 3.255 ± 1.074
4.596AlaAsn: 4.596 ± 0.694
1.34AlaPro: 1.34 ± 0.577
4.979AlaGln: 4.979 ± 1.257
2.489AlaArg: 2.489 ± 0.834
6.319AlaSer: 6.319 ± 1.093
6.319AlaThr: 6.319 ± 1.364
6.894AlaVal: 6.894 ± 1.014
0.574AlaTrp: 0.574 ± 0.452
2.681AlaTyr: 2.681 ± 0.756
0.0AlaXaa: 0.0 ± 0.0
Cys
0.383CysAla: 0.383 ± 0.301
0.191CysCys: 0.191 ± 0.151
0.766CysAsp: 0.766 ± 0.375
0.191CysGlu: 0.191 ± 0.219
0.383CysPhe: 0.383 ± 0.218
0.383CysGly: 0.383 ± 0.289
0.383CysHis: 0.383 ± 0.265
0.383CysIle: 0.383 ± 0.27
0.191CysLys: 0.191 ± 0.22
1.149CysLeu: 1.149 ± 0.574
0.191CysMet: 0.191 ± 0.239
0.191CysAsn: 0.191 ± 0.169
0.191CysPro: 0.191 ± 0.151
0.574CysGln: 0.574 ± 0.369
0.957CysArg: 0.957 ± 0.391
0.191CysSer: 0.191 ± 0.22
0.574CysThr: 0.574 ± 0.402
0.574CysVal: 0.574 ± 0.279
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.787AspAla: 4.787 ± 0.754
0.383AspCys: 0.383 ± 0.25
5.745AspAsp: 5.745 ± 1.162
4.596AspGlu: 4.596 ± 0.667
1.915AspPhe: 1.915 ± 0.387
5.17AspGly: 5.17 ± 0.82
1.149AspHis: 1.149 ± 0.496
6.128AspIle: 6.128 ± 1.186
4.021AspLys: 4.021 ± 0.615
4.787AspLeu: 4.787 ± 1.074
1.532AspMet: 1.532 ± 0.36
3.83AspAsn: 3.83 ± 1.06
2.298AspPro: 2.298 ± 0.746
1.723AspGln: 1.723 ± 0.841
1.915AspArg: 1.915 ± 0.812
3.638AspSer: 3.638 ± 0.688
2.872AspThr: 2.872 ± 0.76
3.83AspVal: 3.83 ± 0.568
0.574AspTrp: 0.574 ± 0.312
2.489AspTyr: 2.489 ± 0.63
0.0AspXaa: 0.0 ± 0.0
Glu
5.936GluAla: 5.936 ± 1.145
0.191GluCys: 0.191 ± 0.167
3.064GluAsp: 3.064 ± 0.707
3.064GluGlu: 3.064 ± 0.669
2.489GluPhe: 2.489 ± 0.687
2.872GluGly: 2.872 ± 0.47
0.957GluHis: 0.957 ± 0.333
3.83GluIle: 3.83 ± 0.92
4.213GluLys: 4.213 ± 0.869
7.085GluLeu: 7.085 ± 0.917
1.149GluMet: 1.149 ± 0.392
3.064GluAsn: 3.064 ± 0.815
2.298GluPro: 2.298 ± 0.816
4.021GluGln: 4.021 ± 0.912
3.064GluArg: 3.064 ± 0.576
3.638GluSer: 3.638 ± 0.736
4.404GluThr: 4.404 ± 0.685
4.404GluVal: 4.404 ± 0.852
0.574GluTrp: 0.574 ± 0.356
3.064GluTyr: 3.064 ± 0.715
0.0GluXaa: 0.0 ± 0.0
Phe
2.106PheAla: 2.106 ± 0.703
0.766PheCys: 0.766 ± 0.378
3.255PheAsp: 3.255 ± 0.834
3.064PheGlu: 3.064 ± 0.806
0.957PhePhe: 0.957 ± 0.387
4.596PheGly: 4.596 ± 1.232
0.191PheHis: 0.191 ± 0.218
1.34PheIle: 1.34 ± 0.429
1.915PheLys: 1.915 ± 0.609
2.681PheLeu: 2.681 ± 0.568
0.766PheMet: 0.766 ± 0.567
2.681PheAsn: 2.681 ± 0.607
0.766PhePro: 0.766 ± 0.249
0.383PheGln: 0.383 ± 0.289
0.957PheArg: 0.957 ± 0.557
2.489PheSer: 2.489 ± 0.683
1.723PheThr: 1.723 ± 0.529
1.149PheVal: 1.149 ± 0.456
0.574PheTrp: 0.574 ± 0.317
1.723PheTyr: 1.723 ± 0.562
0.0PheXaa: 0.0 ± 0.0
Gly
5.936GlyAla: 5.936 ± 0.925
0.0GlyCys: 0.0 ± 0.0
5.17GlyAsp: 5.17 ± 0.961
4.787GlyGlu: 4.787 ± 0.78
3.064GlyPhe: 3.064 ± 0.6
4.787GlyGly: 4.787 ± 1.32
1.34GlyHis: 1.34 ± 0.542
5.17GlyIle: 5.17 ± 1.377
5.553GlyLys: 5.553 ± 1.159
8.234GlyLeu: 8.234 ± 1.476
2.872GlyMet: 2.872 ± 0.648
3.255GlyAsn: 3.255 ± 0.686
0.574GlyPro: 0.574 ± 0.332
1.532GlyGln: 1.532 ± 0.465
3.447GlyArg: 3.447 ± 0.929
3.064GlySer: 3.064 ± 0.774
2.872GlyThr: 2.872 ± 0.814
7.66GlyVal: 7.66 ± 1.34
0.574GlyTrp: 0.574 ± 0.332
1.532GlyTyr: 1.532 ± 0.595
0.0GlyXaa: 0.0 ± 0.0
His
1.915HisAla: 1.915 ± 0.585
0.0HisCys: 0.0 ± 0.0
1.915HisAsp: 1.915 ± 0.618
0.957HisGlu: 0.957 ± 0.402
0.0HisPhe: 0.0 ± 0.0
1.34HisGly: 1.34 ± 0.495
0.383HisHis: 0.383 ± 0.244
1.723HisIle: 1.723 ± 0.721
1.532HisLys: 1.532 ± 0.566
2.298HisLeu: 2.298 ± 0.667
0.766HisMet: 0.766 ± 0.295
0.957HisAsn: 0.957 ± 0.529
0.0HisPro: 0.0 ± 0.0
0.383HisGln: 0.383 ± 0.262
0.383HisArg: 0.383 ± 0.249
0.0HisSer: 0.0 ± 0.0
1.34HisThr: 1.34 ± 0.544
0.191HisVal: 0.191 ± 0.151
0.191HisTrp: 0.191 ± 0.196
0.574HisTyr: 0.574 ± 0.357
0.0HisXaa: 0.0 ± 0.0
Ile
7.277IleAla: 7.277 ± 1.369
0.191IleCys: 0.191 ± 0.185
5.17IleAsp: 5.17 ± 0.884
3.638IleGlu: 3.638 ± 0.639
2.489IlePhe: 2.489 ± 0.571
5.745IleGly: 5.745 ± 1.031
0.766IleHis: 0.766 ± 0.322
5.17IleIle: 5.17 ± 1.093
7.851IleLys: 7.851 ± 1.6
3.638IleLeu: 3.638 ± 0.632
1.723IleMet: 1.723 ± 0.542
4.404IleAsn: 4.404 ± 1.153
1.532IlePro: 1.532 ± 0.347
1.915IleGln: 1.915 ± 0.622
2.106IleArg: 2.106 ± 0.604
3.255IleSer: 3.255 ± 1.303
3.447IleThr: 3.447 ± 0.672
2.298IleVal: 2.298 ± 0.457
0.383IleTrp: 0.383 ± 0.347
1.532IleTyr: 1.532 ± 0.726
0.0IleXaa: 0.0 ± 0.0
Lys
7.66LysAla: 7.66 ± 0.949
0.191LysCys: 0.191 ± 0.167
4.787LysAsp: 4.787 ± 0.95
4.404LysGlu: 4.404 ± 1.117
3.064LysPhe: 3.064 ± 0.63
5.17LysGly: 5.17 ± 0.9
0.957LysHis: 0.957 ± 0.415
3.83LysIle: 3.83 ± 0.785
6.511LysLys: 6.511 ± 1.374
8.043LysLeu: 8.043 ± 1.397
2.489LysMet: 2.489 ± 0.472
3.638LysAsn: 3.638 ± 0.928
1.723LysPro: 1.723 ± 0.589
3.447LysGln: 3.447 ± 0.908
3.83LysArg: 3.83 ± 1.042
6.894LysSer: 6.894 ± 1.235
4.213LysThr: 4.213 ± 1.059
1.723LysVal: 1.723 ± 0.393
0.957LysTrp: 0.957 ± 0.483
2.489LysTyr: 2.489 ± 0.721
0.0LysXaa: 0.0 ± 0.0
Leu
9.192LeuAla: 9.192 ± 1.458
0.957LeuCys: 0.957 ± 0.557
6.702LeuAsp: 6.702 ± 0.926
6.702LeuGlu: 6.702 ± 1.345
2.489LeuPhe: 2.489 ± 0.798
7.468LeuGly: 7.468 ± 1.319
1.532LeuHis: 1.532 ± 0.643
4.787LeuIle: 4.787 ± 0.893
8.617LeuLys: 8.617 ± 1.008
8.426LeuLeu: 8.426 ± 1.392
1.149LeuMet: 1.149 ± 0.471
2.681LeuAsn: 2.681 ± 0.842
4.213LeuPro: 4.213 ± 0.57
3.255LeuGln: 3.255 ± 0.601
4.404LeuArg: 4.404 ± 0.755
5.553LeuSer: 5.553 ± 0.742
6.128LeuThr: 6.128 ± 1.106
4.787LeuVal: 4.787 ± 1.143
1.149LeuTrp: 1.149 ± 0.391
3.064LeuTyr: 3.064 ± 0.541
0.0LeuXaa: 0.0 ± 0.0
Met
2.681MetAla: 2.681 ± 0.84
0.191MetCys: 0.191 ± 0.22
1.34MetAsp: 1.34 ± 0.507
1.34MetGlu: 1.34 ± 0.463
0.957MetPhe: 0.957 ± 0.492
1.915MetGly: 1.915 ± 0.523
0.766MetHis: 0.766 ± 0.475
1.532MetIle: 1.532 ± 0.692
2.872MetLys: 2.872 ± 0.801
2.298MetLeu: 2.298 ± 0.48
1.34MetMet: 1.34 ± 0.536
1.34MetAsn: 1.34 ± 0.687
0.766MetPro: 0.766 ± 0.324
1.149MetGln: 1.149 ± 0.559
0.957MetArg: 0.957 ± 0.442
2.489MetSer: 2.489 ± 0.526
2.489MetThr: 2.489 ± 0.79
2.298MetVal: 2.298 ± 1.051
0.191MetTrp: 0.191 ± 0.239
0.766MetTyr: 0.766 ± 0.548
0.0MetXaa: 0.0 ± 0.0
Asn
5.745AsnAla: 5.745 ± 1.669
0.191AsnCys: 0.191 ± 0.196
2.298AsnAsp: 2.298 ± 0.614
2.872AsnGlu: 2.872 ± 0.765
1.149AsnPhe: 1.149 ± 0.394
3.064AsnGly: 3.064 ± 0.638
0.191AsnHis: 0.191 ± 0.265
4.596AsnIle: 4.596 ± 1.139
2.681AsnLys: 2.681 ± 0.839
4.979AsnLeu: 4.979 ± 1.051
2.106AsnMet: 2.106 ± 0.748
1.915AsnAsn: 1.915 ± 0.523
2.681AsnPro: 2.681 ± 0.714
2.106AsnGln: 2.106 ± 0.712
1.34AsnArg: 1.34 ± 0.703
2.872AsnSer: 2.872 ± 0.542
1.723AsnThr: 1.723 ± 0.728
2.298AsnVal: 2.298 ± 0.772
0.766AsnTrp: 0.766 ± 0.322
1.34AsnTyr: 1.34 ± 0.498
0.0AsnXaa: 0.0 ± 0.0
Pro
2.489ProAla: 2.489 ± 0.508
0.191ProCys: 0.191 ± 0.151
2.106ProAsp: 2.106 ± 0.65
2.106ProGlu: 2.106 ± 0.574
1.723ProPhe: 1.723 ± 0.439
0.766ProGly: 0.766 ± 0.418
0.383ProHis: 0.383 ± 0.218
2.298ProIle: 2.298 ± 0.525
1.915ProLys: 1.915 ± 0.657
2.489ProLeu: 2.489 ± 0.539
1.34ProMet: 1.34 ± 0.426
2.298ProAsn: 2.298 ± 0.683
1.915ProPro: 1.915 ± 0.526
1.723ProGln: 1.723 ± 0.703
0.574ProArg: 0.574 ± 0.404
2.298ProSer: 2.298 ± 0.729
1.723ProThr: 1.723 ± 0.383
1.532ProVal: 1.532 ± 0.439
0.191ProTrp: 0.191 ± 0.151
1.149ProTyr: 1.149 ± 0.412
0.0ProXaa: 0.0 ± 0.0
Gln
6.511GlnAla: 6.511 ± 1.085
0.191GlnCys: 0.191 ± 0.196
0.957GlnAsp: 0.957 ± 0.583
2.298GlnGlu: 2.298 ± 0.46
0.957GlnPhe: 0.957 ± 0.391
1.532GlnGly: 1.532 ± 0.53
1.149GlnHis: 1.149 ± 0.454
3.255GlnIle: 3.255 ± 0.638
3.255GlnLys: 3.255 ± 0.55
5.553GlnLeu: 5.553 ± 1.001
1.149GlnMet: 1.149 ± 0.422
2.489GlnAsn: 2.489 ± 0.65
1.149GlnPro: 1.149 ± 0.504
1.915GlnGln: 1.915 ± 0.752
2.298GlnArg: 2.298 ± 0.715
2.872GlnSer: 2.872 ± 0.723
1.915GlnThr: 1.915 ± 0.594
2.298GlnVal: 2.298 ± 0.725
0.191GlnTrp: 0.191 ± 0.22
0.766GlnTyr: 0.766 ± 0.331
0.0GlnXaa: 0.0 ± 0.0
Arg
3.447ArgAla: 3.447 ± 0.968
0.574ArgCys: 0.574 ± 0.328
1.723ArgAsp: 1.723 ± 0.523
3.638ArgGlu: 3.638 ± 0.66
1.532ArgPhe: 1.532 ± 0.517
2.681ArgGly: 2.681 ± 0.713
1.149ArgHis: 1.149 ± 0.371
1.532ArgIle: 1.532 ± 0.454
1.34ArgLys: 1.34 ± 0.47
6.319ArgLeu: 6.319 ± 1.854
0.383ArgMet: 0.383 ± 0.287
1.34ArgAsn: 1.34 ± 0.49
1.532ArgPro: 1.532 ± 0.566
2.872ArgGln: 2.872 ± 0.992
2.489ArgArg: 2.489 ± 1.223
2.106ArgSer: 2.106 ± 0.565
2.681ArgThr: 2.681 ± 0.58
2.489ArgVal: 2.489 ± 0.731
0.574ArgTrp: 0.574 ± 0.336
2.298ArgTyr: 2.298 ± 0.764
0.0ArgXaa: 0.0 ± 0.0
Ser
6.319SerAla: 6.319 ± 0.918
0.766SerCys: 0.766 ± 0.377
3.064SerAsp: 3.064 ± 0.755
3.255SerGlu: 3.255 ± 0.934
2.298SerPhe: 2.298 ± 0.963
5.17SerGly: 5.17 ± 0.669
0.957SerHis: 0.957 ± 0.469
3.447SerIle: 3.447 ± 0.984
2.681SerLys: 2.681 ± 0.737
4.404SerLeu: 4.404 ± 0.981
1.915SerMet: 1.915 ± 0.653
3.064SerAsn: 3.064 ± 1.063
2.106SerPro: 2.106 ± 0.697
3.064SerGln: 3.064 ± 0.889
3.638SerArg: 3.638 ± 0.854
2.872SerSer: 2.872 ± 0.819
3.064SerThr: 3.064 ± 0.632
5.17SerVal: 5.17 ± 1.045
0.574SerTrp: 0.574 ± 0.241
1.723SerTyr: 1.723 ± 0.434
0.0SerXaa: 0.0 ± 0.0
Thr
7.851ThrAla: 7.851 ± 1.322
0.191ThrCys: 0.191 ± 0.265
3.83ThrAsp: 3.83 ± 0.916
3.064ThrGlu: 3.064 ± 0.854
0.766ThrPhe: 0.766 ± 0.38
5.362ThrGly: 5.362 ± 1.033
0.574ThrHis: 0.574 ± 0.337
3.638ThrIle: 3.638 ± 1.294
5.17ThrLys: 5.17 ± 0.96
5.362ThrLeu: 5.362 ± 1.14
2.106ThrMet: 2.106 ± 0.584
1.915ThrAsn: 1.915 ± 0.589
1.915ThrPro: 1.915 ± 0.602
2.681ThrGln: 2.681 ± 0.56
1.34ThrArg: 1.34 ± 0.595
2.298ThrSer: 2.298 ± 0.789
2.298ThrThr: 2.298 ± 0.78
2.681ThrVal: 2.681 ± 0.815
0.383ThrTrp: 0.383 ± 0.283
0.957ThrTyr: 0.957 ± 0.399
0.0ThrXaa: 0.0 ± 0.0
Val
6.128ValAla: 6.128 ± 1.037
0.766ValCys: 0.766 ± 0.352
2.489ValAsp: 2.489 ± 0.485
3.83ValGlu: 3.83 ± 0.632
1.723ValPhe: 1.723 ± 0.531
3.255ValGly: 3.255 ± 0.807
1.34ValHis: 1.34 ± 0.542
2.489ValIle: 2.489 ± 0.646
4.596ValLys: 4.596 ± 0.914
5.745ValLeu: 5.745 ± 0.709
1.915ValMet: 1.915 ± 0.513
2.872ValAsn: 2.872 ± 0.836
2.681ValPro: 2.681 ± 0.689
1.915ValGln: 1.915 ± 0.516
3.447ValArg: 3.447 ± 1.101
3.83ValSer: 3.83 ± 0.758
2.489ValThr: 2.489 ± 0.642
4.787ValVal: 4.787 ± 0.837
1.149ValTrp: 1.149 ± 0.547
2.298ValTyr: 2.298 ± 0.857
0.0ValXaa: 0.0 ± 0.0
Trp
1.34TrpAla: 1.34 ± 0.545
0.191TrpCys: 0.191 ± 0.239
0.766TrpAsp: 0.766 ± 0.289
0.383TrpGlu: 0.383 ± 0.223
0.574TrpPhe: 0.574 ± 0.418
0.766TrpGly: 0.766 ± 0.444
0.191TrpHis: 0.191 ± 0.144
0.383TrpIle: 0.383 ± 0.18
0.383TrpLys: 0.383 ± 0.294
0.574TrpLeu: 0.574 ± 0.267
0.383TrpMet: 0.383 ± 0.341
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.723TrpGln: 1.723 ± 0.658
0.191TrpArg: 0.191 ± 0.239
0.574TrpSer: 0.574 ± 0.369
0.383TrpThr: 0.383 ± 0.22
0.766TrpVal: 0.766 ± 0.444
0.383TrpTrp: 0.383 ± 0.246
0.574TrpTyr: 0.574 ± 0.386
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.106TyrAla: 2.106 ± 0.618
0.383TyrCys: 0.383 ± 0.259
2.298TyrAsp: 2.298 ± 0.708
1.915TyrGlu: 1.915 ± 0.658
1.149TyrPhe: 1.149 ± 0.446
3.83TyrGly: 3.83 ± 0.819
0.957TyrHis: 0.957 ± 0.45
1.532TyrIle: 1.532 ± 0.53
2.106TyrLys: 2.106 ± 0.542
0.957TyrLeu: 0.957 ± 0.407
0.957TyrMet: 0.957 ± 0.4
0.383TyrAsn: 0.383 ± 0.286
1.532TyrPro: 1.532 ± 0.595
1.34TyrGln: 1.34 ± 0.363
2.872TyrArg: 2.872 ± 0.63
2.298TyrSer: 2.298 ± 0.628
1.723TyrThr: 1.723 ± 0.408
2.106TyrVal: 2.106 ± 0.958
0.574TyrTrp: 0.574 ± 0.279
1.915TyrTyr: 1.915 ± 0.553
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (5223 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski