Amino acid dipepetide frequency for Moraxella phage Mcat5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.667AlaAla: 2.667 ± 0.552
0.684AlaCys: 0.684 ± 0.244
6.564AlaAsp: 6.564 ± 0.743
3.829AlaGlu: 3.829 ± 0.641
3.829AlaPhe: 3.829 ± 0.524
4.376AlaGly: 4.376 ± 0.631
2.256AlaHis: 2.256 ± 0.385
6.017AlaIle: 6.017 ± 0.646
8.41AlaLys: 8.41 ± 1.032
7.179AlaLeu: 7.179 ± 0.807
2.803AlaMet: 2.803 ± 0.613
5.265AlaAsn: 5.265 ± 0.711
2.256AlaPro: 2.256 ± 0.507
3.556AlaGln: 3.556 ± 0.608
3.214AlaArg: 3.214 ± 0.466
5.06AlaSer: 5.06 ± 0.905
5.197AlaThr: 5.197 ± 0.794
5.675AlaVal: 5.675 ± 0.646
1.504AlaTrp: 1.504 ± 0.3
2.94AlaTyr: 2.94 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
0.752CysAla: 0.752 ± 0.33
0.205CysCys: 0.205 ± 0.126
0.684CysAsp: 0.684 ± 0.195
0.821CysGlu: 0.821 ± 0.227
0.205CysPhe: 0.205 ± 0.115
1.094CysGly: 1.094 ± 0.356
0.205CysHis: 0.205 ± 0.121
0.342CysIle: 0.342 ± 0.16
0.342CysLys: 0.342 ± 0.137
0.821CysLeu: 0.821 ± 0.26
0.137CysMet: 0.137 ± 0.104
0.137CysAsn: 0.137 ± 0.099
0.479CysPro: 0.479 ± 0.174
0.274CysGln: 0.274 ± 0.139
0.479CysArg: 0.479 ± 0.182
0.479CysSer: 0.479 ± 0.167
0.342CysThr: 0.342 ± 0.142
0.615CysVal: 0.615 ± 0.208
0.068CysTrp: 0.068 ± 0.062
0.821CysTyr: 0.821 ± 0.257
0.0CysXaa: 0.0 ± 0.0
Asp
3.761AspAla: 3.761 ± 0.5
0.752AspCys: 0.752 ± 0.287
6.359AspAsp: 6.359 ± 0.89
5.744AspGlu: 5.744 ± 0.715
2.53AspPhe: 2.53 ± 0.46
5.812AspGly: 5.812 ± 0.672
0.615AspHis: 0.615 ± 0.209
4.034AspIle: 4.034 ± 0.411
5.88AspLys: 5.88 ± 0.651
5.197AspLeu: 5.197 ± 0.561
1.709AspMet: 1.709 ± 0.404
4.376AspAsn: 4.376 ± 0.58
1.778AspPro: 1.778 ± 0.363
0.957AspGln: 0.957 ± 0.261
1.915AspArg: 1.915 ± 0.372
2.462AspSer: 2.462 ± 0.444
4.034AspThr: 4.034 ± 0.509
2.667AspVal: 2.667 ± 0.43
0.957AspTrp: 0.957 ± 0.292
2.872AspTyr: 2.872 ± 0.404
0.0AspXaa: 0.0 ± 0.0
Glu
3.145GluAla: 3.145 ± 0.503
0.752GluCys: 0.752 ± 0.261
1.436GluAsp: 1.436 ± 0.437
1.231GluGlu: 1.231 ± 0.343
2.53GluPhe: 2.53 ± 0.408
2.188GluGly: 2.188 ± 0.391
1.573GluHis: 1.573 ± 0.348
5.744GluIle: 5.744 ± 0.777
3.145GluLys: 3.145 ± 0.583
6.427GluLeu: 6.427 ± 0.659
1.915GluMet: 1.915 ± 0.37
3.077GluAsn: 3.077 ± 0.415
2.53GluPro: 2.53 ± 0.502
4.239GluGln: 4.239 ± 0.659
3.829GluArg: 3.829 ± 0.549
2.53GluSer: 2.53 ± 0.363
2.735GluThr: 2.735 ± 0.522
2.53GluVal: 2.53 ± 0.515
1.026GluTrp: 1.026 ± 0.309
2.462GluTyr: 2.462 ± 0.476
0.0GluXaa: 0.0 ± 0.0
Phe
4.103PheAla: 4.103 ± 0.534
0.547PheCys: 0.547 ± 0.201
2.872PheAsp: 2.872 ± 0.458
2.598PheGlu: 2.598 ± 0.402
1.026PhePhe: 1.026 ± 0.236
3.077PheGly: 3.077 ± 0.308
0.889PheHis: 0.889 ± 0.274
2.872PheIle: 2.872 ± 0.479
1.778PheLys: 1.778 ± 0.323
2.393PheLeu: 2.393 ± 0.553
1.231PheMet: 1.231 ± 0.255
1.846PheAsn: 1.846 ± 0.298
0.547PhePro: 0.547 ± 0.163
0.342PheGln: 0.342 ± 0.147
1.573PheArg: 1.573 ± 0.378
2.051PheSer: 2.051 ± 0.329
1.709PheThr: 1.709 ± 0.339
1.983PheVal: 1.983 ± 0.401
0.479PheTrp: 0.479 ± 0.175
1.368PheTyr: 1.368 ± 0.314
0.0PheXaa: 0.0 ± 0.0
Gly
4.581GlyAla: 4.581 ± 0.828
0.479GlyCys: 0.479 ± 0.163
3.966GlyAsp: 3.966 ± 0.575
3.556GlyGlu: 3.556 ± 0.537
3.009GlyPhe: 3.009 ± 0.494
5.265GlyGly: 5.265 ± 0.651
1.094GlyHis: 1.094 ± 0.26
4.444GlyIle: 4.444 ± 0.606
4.923GlyLys: 4.923 ± 0.552
5.812GlyLeu: 5.812 ± 0.633
1.915GlyMet: 1.915 ± 0.323
3.009GlyAsn: 3.009 ± 0.536
0.068GlyPro: 0.068 ± 0.063
3.487GlyGln: 3.487 ± 0.626
3.487GlyArg: 3.487 ± 0.567
3.282GlySer: 3.282 ± 0.426
3.487GlyThr: 3.487 ± 0.581
4.786GlyVal: 4.786 ± 0.729
1.094GlyTrp: 1.094 ± 0.331
2.598GlyTyr: 2.598 ± 0.525
0.0GlyXaa: 0.0 ± 0.0
His
2.667HisAla: 2.667 ± 0.403
0.068HisCys: 0.068 ± 0.055
1.436HisAsp: 1.436 ± 0.376
1.709HisGlu: 1.709 ± 0.332
1.026HisPhe: 1.026 ± 0.249
1.983HisGly: 1.983 ± 0.421
1.094HisHis: 1.094 ± 0.377
1.436HisIle: 1.436 ± 0.383
0.957HisLys: 0.957 ± 0.231
1.709HisLeu: 1.709 ± 0.356
0.274HisMet: 0.274 ± 0.1
0.889HisAsn: 0.889 ± 0.28
1.162HisPro: 1.162 ± 0.32
0.821HisGln: 0.821 ± 0.218
1.094HisArg: 1.094 ± 0.292
1.709HisSer: 1.709 ± 0.302
2.803HisThr: 2.803 ± 0.411
0.752HisVal: 0.752 ± 0.25
0.342HisTrp: 0.342 ± 0.177
1.026HisTyr: 1.026 ± 0.229
0.0HisXaa: 0.0 ± 0.0
Ile
6.564IleAla: 6.564 ± 0.759
0.889IleCys: 0.889 ± 0.27
5.538IleAsp: 5.538 ± 0.519
4.376IleGlu: 4.376 ± 0.588
1.573IlePhe: 1.573 ± 0.355
4.034IleGly: 4.034 ± 0.616
1.162IleHis: 1.162 ± 0.29
3.966IleIle: 3.966 ± 0.638
5.265IleLys: 5.265 ± 0.895
4.65IleLeu: 4.65 ± 0.65
1.573IleMet: 1.573 ± 0.257
4.171IleAsn: 4.171 ± 0.729
2.735IlePro: 2.735 ± 0.518
2.94IleGln: 2.94 ± 0.393
2.94IleArg: 2.94 ± 0.507
5.47IleSer: 5.47 ± 0.8
5.949IleThr: 5.949 ± 0.624
3.077IleVal: 3.077 ± 0.557
0.957IleTrp: 0.957 ± 0.279
2.462IleTyr: 2.462 ± 0.512
0.0IleXaa: 0.0 ± 0.0
Lys
6.838LysAla: 6.838 ± 0.646
0.547LysCys: 0.547 ± 0.26
3.897LysAsp: 3.897 ± 0.507
4.513LysGlu: 4.513 ± 0.675
2.051LysPhe: 2.051 ± 0.333
3.419LysGly: 3.419 ± 0.442
2.256LysHis: 2.256 ± 0.428
5.06LysIle: 5.06 ± 0.693
4.513LysLys: 4.513 ± 0.67
5.402LysLeu: 5.402 ± 0.579
1.573LysMet: 1.573 ± 0.23
2.803LysAsn: 2.803 ± 0.533
3.077LysPro: 3.077 ± 0.672
3.966LysGln: 3.966 ± 0.512
3.419LysArg: 3.419 ± 0.545
4.513LysSer: 4.513 ± 0.624
5.128LysThr: 5.128 ± 0.676
4.171LysVal: 4.171 ± 0.601
0.752LysTrp: 0.752 ± 0.235
2.393LysTyr: 2.393 ± 0.323
0.0LysXaa: 0.0 ± 0.0
Leu
7.521LeuAla: 7.521 ± 1.172
0.957LeuCys: 0.957 ± 0.305
6.222LeuAsp: 6.222 ± 0.671
4.171LeuGlu: 4.171 ± 0.604
2.735LeuPhe: 2.735 ± 0.43
5.88LeuGly: 5.88 ± 0.728
1.846LeuHis: 1.846 ± 0.483
6.838LeuIle: 6.838 ± 0.957
5.744LeuLys: 5.744 ± 0.674
5.47LeuLeu: 5.47 ± 0.705
1.846LeuMet: 1.846 ± 0.377
4.923LeuAsn: 4.923 ± 0.65
4.444LeuPro: 4.444 ± 0.646
4.923LeuGln: 4.923 ± 0.416
3.487LeuArg: 3.487 ± 0.471
7.179LeuSer: 7.179 ± 0.612
5.675LeuThr: 5.675 ± 0.652
4.103LeuVal: 4.103 ± 0.555
0.547LeuTrp: 0.547 ± 0.169
2.12LeuTyr: 2.12 ± 0.414
0.0LeuXaa: 0.0 ± 0.0
Met
2.393MetAla: 2.393 ± 0.59
0.068MetCys: 0.068 ± 0.061
1.573MetAsp: 1.573 ± 0.32
0.274MetGlu: 0.274 ± 0.149
0.752MetPhe: 0.752 ± 0.235
1.778MetGly: 1.778 ± 0.301
0.41MetHis: 0.41 ± 0.196
1.573MetIle: 1.573 ± 0.299
1.299MetLys: 1.299 ± 0.259
1.504MetLeu: 1.504 ± 0.377
0.615MetMet: 0.615 ± 0.19
1.162MetAsn: 1.162 ± 0.293
1.094MetPro: 1.094 ± 0.289
1.299MetGln: 1.299 ± 0.315
1.299MetArg: 1.299 ± 0.255
2.325MetSer: 2.325 ± 0.442
2.12MetThr: 2.12 ± 0.37
1.709MetVal: 1.709 ± 0.324
0.205MetTrp: 0.205 ± 0.112
0.547MetTyr: 0.547 ± 0.225
0.0MetXaa: 0.0 ± 0.0
Asn
5.402AsnAla: 5.402 ± 0.706
0.274AsnCys: 0.274 ± 0.128
1.983AsnAsp: 1.983 ± 0.428
2.803AsnGlu: 2.803 ± 0.419
1.778AsnPhe: 1.778 ± 0.296
3.282AsnGly: 3.282 ± 0.865
2.051AsnHis: 2.051 ± 0.368
2.393AsnIle: 2.393 ± 0.5
4.308AsnLys: 4.308 ± 0.732
4.308AsnLeu: 4.308 ± 0.414
0.957AsnMet: 0.957 ± 0.222
2.462AsnAsn: 2.462 ± 0.447
2.667AsnPro: 2.667 ± 0.495
3.009AsnGln: 3.009 ± 0.623
1.436AsnArg: 1.436 ± 0.305
3.624AsnSer: 3.624 ± 0.556
3.009AsnThr: 3.009 ± 0.68
2.735AsnVal: 2.735 ± 0.417
0.615AsnTrp: 0.615 ± 0.163
2.188AsnTyr: 2.188 ± 0.371
0.0AsnXaa: 0.0 ± 0.0
Pro
2.94ProAla: 2.94 ± 0.523
0.342ProCys: 0.342 ± 0.192
1.983ProAsp: 1.983 ± 0.423
1.709ProGlu: 1.709 ± 0.445
1.231ProPhe: 1.231 ± 0.192
0.068ProGly: 0.068 ± 0.064
0.821ProHis: 0.821 ± 0.196
2.94ProIle: 2.94 ± 0.554
3.556ProLys: 3.556 ± 0.701
3.487ProLeu: 3.487 ± 0.532
0.889ProMet: 0.889 ± 0.198
2.12ProAsn: 2.12 ± 0.367
1.436ProPro: 1.436 ± 0.476
1.504ProGln: 1.504 ± 0.341
1.299ProArg: 1.299 ± 0.265
3.009ProSer: 3.009 ± 0.566
3.145ProThr: 3.145 ± 0.444
1.846ProVal: 1.846 ± 0.347
0.205ProTrp: 0.205 ± 0.118
1.231ProTyr: 1.231 ± 0.322
0.0ProXaa: 0.0 ± 0.0
Gln
4.991GlnAla: 4.991 ± 0.758
0.274GlnCys: 0.274 ± 0.128
2.667GlnAsp: 2.667 ± 0.485
1.915GlnGlu: 1.915 ± 0.456
1.846GlnPhe: 1.846 ± 0.347
3.624GlnGly: 3.624 ± 0.524
0.479GlnHis: 0.479 ± 0.202
4.786GlnIle: 4.786 ± 0.544
3.419GlnLys: 3.419 ± 0.505
4.308GlnLeu: 4.308 ± 0.574
1.299GlnMet: 1.299 ± 0.331
2.803GlnAsn: 2.803 ± 0.501
1.299GlnPro: 1.299 ± 0.308
1.846GlnGln: 1.846 ± 0.375
1.915GlnArg: 1.915 ± 0.47
4.171GlnSer: 4.171 ± 0.801
3.145GlnThr: 3.145 ± 0.431
2.051GlnVal: 2.051 ± 0.392
0.342GlnTrp: 0.342 ± 0.134
1.846GlnTyr: 1.846 ± 0.307
0.0GlnXaa: 0.0 ± 0.0
Arg
4.65ArgAla: 4.65 ± 0.507
0.342ArgCys: 0.342 ± 0.13
2.256ArgAsp: 2.256 ± 0.387
2.462ArgGlu: 2.462 ± 0.451
1.299ArgPhe: 1.299 ± 0.258
1.709ArgGly: 1.709 ± 0.353
1.368ArgHis: 1.368 ± 0.296
2.872ArgIle: 2.872 ± 0.553
2.53ArgLys: 2.53 ± 0.471
5.197ArgLeu: 5.197 ± 0.763
0.889ArgMet: 0.889 ± 0.229
1.299ArgAsn: 1.299 ± 0.279
1.641ArgPro: 1.641 ± 0.298
2.667ArgGln: 2.667 ± 0.467
2.051ArgArg: 2.051 ± 0.346
2.12ArgSer: 2.12 ± 0.402
3.009ArgThr: 3.009 ± 0.411
2.598ArgVal: 2.598 ± 0.407
0.342ArgTrp: 0.342 ± 0.142
2.051ArgTyr: 2.051 ± 0.382
0.0ArgXaa: 0.0 ± 0.0
Ser
4.923SerAla: 4.923 ± 0.665
0.342SerCys: 0.342 ± 0.18
4.513SerAsp: 4.513 ± 0.72
4.855SerGlu: 4.855 ± 0.618
2.12SerPhe: 2.12 ± 0.332
4.923SerGly: 4.923 ± 0.669
2.325SerHis: 2.325 ± 0.563
3.624SerIle: 3.624 ± 0.614
4.034SerLys: 4.034 ± 0.578
5.333SerLeu: 5.333 ± 0.676
1.368SerMet: 1.368 ± 0.303
2.872SerAsn: 2.872 ± 0.714
1.504SerPro: 1.504 ± 0.4
4.718SerGln: 4.718 ± 0.781
2.667SerArg: 2.667 ± 0.476
3.487SerSer: 3.487 ± 1.066
3.214SerThr: 3.214 ± 0.511
4.786SerVal: 4.786 ± 0.862
0.547SerTrp: 0.547 ± 0.178
1.368SerTyr: 1.368 ± 0.346
0.0SerXaa: 0.0 ± 0.0
Thr
6.838ThrAla: 6.838 ± 0.812
0.205ThrCys: 0.205 ± 0.125
4.718ThrAsp: 4.718 ± 0.623
3.419ThrGlu: 3.419 ± 0.458
1.573ThrPhe: 1.573 ± 0.37
4.991ThrGly: 4.991 ± 0.59
1.983ThrHis: 1.983 ± 0.37
3.829ThrIle: 3.829 ± 0.501
4.855ThrLys: 4.855 ± 0.705
8.137ThrLeu: 8.137 ± 0.665
1.026ThrMet: 1.026 ± 0.196
3.145ThrAsn: 3.145 ± 0.495
2.94ThrPro: 2.94 ± 0.507
2.256ThrGln: 2.256 ± 0.395
2.051ThrArg: 2.051 ± 0.398
3.624ThrSer: 3.624 ± 0.662
3.897ThrThr: 3.897 ± 0.583
3.761ThrVal: 3.761 ± 0.642
0.41ThrTrp: 0.41 ± 0.152
1.504ThrTyr: 1.504 ± 0.343
0.0ThrXaa: 0.0 ± 0.0
Val
4.444ValAla: 4.444 ± 0.688
0.615ValCys: 0.615 ± 0.196
3.077ValAsp: 3.077 ± 0.441
2.12ValGlu: 2.12 ± 0.398
2.393ValPhe: 2.393 ± 0.454
4.103ValGly: 4.103 ± 0.562
1.094ValHis: 1.094 ± 0.281
4.718ValIle: 4.718 ± 0.723
3.282ValLys: 3.282 ± 0.541
4.786ValLeu: 4.786 ± 0.495
1.026ValMet: 1.026 ± 0.297
2.53ValAsn: 2.53 ± 0.323
2.12ValPro: 2.12 ± 0.456
2.872ValGln: 2.872 ± 0.441
2.803ValArg: 2.803 ± 0.538
3.966ValSer: 3.966 ± 0.541
3.897ValThr: 3.897 ± 0.589
3.145ValVal: 3.145 ± 0.485
0.821ValTrp: 0.821 ± 0.175
2.256ValTyr: 2.256 ± 0.372
0.0ValXaa: 0.0 ± 0.0
Trp
1.231TrpAla: 1.231 ± 0.23
0.274TrpCys: 0.274 ± 0.15
0.615TrpAsp: 0.615 ± 0.202
0.752TrpGlu: 0.752 ± 0.225
0.41TrpPhe: 0.41 ± 0.159
0.889TrpGly: 0.889 ± 0.218
0.274TrpHis: 0.274 ± 0.176
0.274TrpIle: 0.274 ± 0.119
0.479TrpLys: 0.479 ± 0.161
0.821TrpLeu: 0.821 ± 0.241
0.068TrpMet: 0.068 ± 0.063
0.479TrpAsn: 0.479 ± 0.164
0.0TrpPro: 0.0 ± 0.0
1.299TrpGln: 1.299 ± 0.373
0.821TrpArg: 0.821 ± 0.227
0.479TrpSer: 0.479 ± 0.208
0.752TrpThr: 0.752 ± 0.226
1.368TrpVal: 1.368 ± 0.335
0.41TrpTrp: 0.41 ± 0.173
0.41TrpTyr: 0.41 ± 0.144
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.872TyrAla: 2.872 ± 0.459
0.684TyrCys: 0.684 ± 0.233
2.051TyrAsp: 2.051 ± 0.314
2.188TyrGlu: 2.188 ± 0.422
1.368TyrPhe: 1.368 ± 0.455
1.915TyrGly: 1.915 ± 0.378
1.162TyrHis: 1.162 ± 0.331
2.53TyrIle: 2.53 ± 0.442
1.573TyrLys: 1.573 ± 0.306
3.692TyrLeu: 3.692 ± 0.69
0.821TyrMet: 0.821 ± 0.265
1.915TyrAsn: 1.915 ± 0.342
1.983TyrPro: 1.983 ± 0.396
2.051TyrGln: 2.051 ± 0.439
1.573TyrArg: 1.573 ± 0.352
1.983TyrSer: 1.983 ± 0.346
1.915TyrThr: 1.915 ± 0.355
1.709TyrVal: 1.709 ± 0.312
0.479TyrTrp: 0.479 ± 0.189
1.436TyrTyr: 1.436 ± 0.491
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (14626 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski