Amino acid dipepetide frequency for Arthrobacter phage Noely

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
29.118AlaAla: 29.118 ± 3.222
0.869AlaCys: 0.869 ± 0.419
9.126AlaAsp: 9.126 ± 1.329
5.215AlaGlu: 5.215 ± 1.192
4.998AlaPhe: 4.998 ± 0.921
14.342AlaGly: 14.342 ± 1.597
2.39AlaHis: 2.39 ± 0.609
2.173AlaIle: 2.173 ± 0.617
5.432AlaLys: 5.432 ± 1.423
12.603AlaLeu: 12.603 ± 1.528
3.042AlaMet: 3.042 ± 0.546
4.998AlaAsn: 4.998 ± 1.387
6.953AlaPro: 6.953 ± 1.411
8.04AlaGln: 8.04 ± 1.509
9.778AlaArg: 9.778 ± 1.555
6.736AlaSer: 6.736 ± 1.045
7.823AlaThr: 7.823 ± 1.216
12.386AlaVal: 12.386 ± 1.449
1.304AlaTrp: 1.304 ± 0.576
1.086AlaTyr: 1.086 ± 0.587
0.0AlaXaa: 0.0 ± 0.0
Cys
0.435CysAla: 0.435 ± 0.331
0.0CysCys: 0.0 ± 0.0
0.435CysAsp: 0.435 ± 0.285
0.217CysGlu: 0.217 ± 0.205
0.0CysPhe: 0.0 ± 0.0
0.652CysGly: 0.652 ± 0.364
0.652CysHis: 0.652 ± 0.344
0.217CysIle: 0.217 ± 0.194
0.435CysLys: 0.435 ± 0.295
0.652CysLeu: 0.652 ± 0.382
0.0CysMet: 0.0 ± 0.0
0.217CysAsn: 0.217 ± 0.21
0.652CysPro: 0.652 ± 0.296
0.217CysGln: 0.217 ± 0.236
0.435CysArg: 0.435 ± 0.421
0.0CysSer: 0.0 ± 0.0
0.217CysThr: 0.217 ± 0.194
0.435CysVal: 0.435 ± 0.269
0.217CysTrp: 0.217 ± 0.21
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
10.648AspAla: 10.648 ± 1.44
0.652AspCys: 0.652 ± 0.368
2.39AspAsp: 2.39 ± 0.872
1.956AspGlu: 1.956 ± 0.668
0.217AspPhe: 0.217 ± 0.2
7.605AspGly: 7.605 ± 1.28
0.435AspHis: 0.435 ± 0.248
2.39AspIle: 2.39 ± 0.532
1.304AspLys: 1.304 ± 0.441
8.475AspLeu: 8.475 ± 1.26
1.086AspMet: 1.086 ± 0.519
1.738AspAsn: 1.738 ± 0.442
3.477AspPro: 3.477 ± 0.663
3.911AspGln: 3.911 ± 0.804
3.042AspArg: 3.042 ± 0.905
3.259AspSer: 3.259 ± 0.589
5.215AspThr: 5.215 ± 0.918
5.432AspVal: 5.432 ± 1.271
1.738AspTrp: 1.738 ± 0.58
1.956AspTyr: 1.956 ± 0.889
0.0AspXaa: 0.0 ± 0.0
Glu
8.475GluAla: 8.475 ± 2.151
0.652GluCys: 0.652 ± 0.407
5.432GluAsp: 5.432 ± 1.036
3.477GluGlu: 3.477 ± 0.844
1.521GluPhe: 1.521 ± 0.418
3.694GluGly: 3.694 ± 0.927
1.086GluHis: 1.086 ± 0.384
2.608GluIle: 2.608 ± 0.585
1.521GluLys: 1.521 ± 0.604
4.563GluLeu: 4.563 ± 0.8
0.0GluMet: 0.0 ± 0.0
1.738GluAsn: 1.738 ± 0.682
2.39GluPro: 2.39 ± 0.725
1.304GluGln: 1.304 ± 0.515
3.259GluArg: 3.259 ± 0.669
2.608GluSer: 2.608 ± 0.985
1.956GluThr: 1.956 ± 0.705
4.998GluVal: 4.998 ± 0.912
0.869GluTrp: 0.869 ± 0.279
0.869GluTyr: 0.869 ± 0.607
0.0GluXaa: 0.0 ± 0.0
Phe
3.259PheAla: 3.259 ± 0.74
0.217PheCys: 0.217 ± 0.197
1.956PheAsp: 1.956 ± 0.542
1.086PheGlu: 1.086 ± 0.55
0.0PhePhe: 0.0 ± 0.0
2.173PheGly: 2.173 ± 0.604
0.435PheHis: 0.435 ± 0.274
0.435PheIle: 0.435 ± 0.254
1.521PheLys: 1.521 ± 0.434
1.738PheLeu: 1.738 ± 0.552
0.435PheMet: 0.435 ± 0.275
0.217PheAsn: 0.217 ± 0.197
0.869PhePro: 0.869 ± 0.355
0.869PheGln: 0.869 ± 0.31
1.086PheArg: 1.086 ± 0.455
0.435PheSer: 0.435 ± 0.284
1.521PheThr: 1.521 ± 0.446
0.869PheVal: 0.869 ± 0.677
0.435PheTrp: 0.435 ± 0.308
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
10.865GlyAla: 10.865 ± 1.169
0.652GlyCys: 0.652 ± 0.375
4.129GlyAsp: 4.129 ± 0.98
5.432GlyGlu: 5.432 ± 1.202
1.956GlyPhe: 1.956 ± 0.487
6.953GlyGly: 6.953 ± 1.239
1.956GlyHis: 1.956 ± 0.833
1.956GlyIle: 1.956 ± 0.681
3.477GlyLys: 3.477 ± 0.722
9.126GlyLeu: 9.126 ± 1.347
2.608GlyMet: 2.608 ± 0.721
2.173GlyAsn: 2.173 ± 0.568
4.998GlyPro: 4.998 ± 1.033
3.477GlyGln: 3.477 ± 1.093
6.084GlyArg: 6.084 ± 1.237
5.432GlySer: 5.432 ± 1.156
8.257GlyThr: 8.257 ± 1.358
7.171GlyVal: 7.171 ± 0.979
2.39GlyTrp: 2.39 ± 0.651
3.477GlyTyr: 3.477 ± 0.663
0.0GlyXaa: 0.0 ± 0.0
His
1.304HisAla: 1.304 ± 0.538
0.0HisCys: 0.0 ± 0.0
0.217HisAsp: 0.217 ± 0.169
0.652HisGlu: 0.652 ± 0.463
0.0HisPhe: 0.0 ± 0.0
2.39HisGly: 2.39 ± 0.737
0.0HisHis: 0.0 ± 0.0
0.652HisIle: 0.652 ± 0.379
0.0HisLys: 0.0 ± 0.0
1.304HisLeu: 1.304 ± 0.739
0.435HisMet: 0.435 ± 0.336
0.435HisAsn: 0.435 ± 0.263
0.869HisPro: 0.869 ± 0.365
1.521HisGln: 1.521 ± 0.436
1.086HisArg: 1.086 ± 0.569
0.652HisSer: 0.652 ± 0.293
0.435HisThr: 0.435 ± 0.305
1.304HisVal: 1.304 ± 0.53
0.217HisTrp: 0.217 ± 0.226
0.217HisTyr: 0.217 ± 0.236
0.0HisXaa: 0.0 ± 0.0
Ile
3.042IleAla: 3.042 ± 0.908
0.0IleCys: 0.0 ± 0.0
2.825IleAsp: 2.825 ± 0.799
1.086IleGlu: 1.086 ± 0.352
0.0IlePhe: 0.0 ± 0.0
3.042IleGly: 3.042 ± 0.732
0.652IleHis: 0.652 ± 0.318
1.086IleIle: 1.086 ± 0.4
1.956IleLys: 1.956 ± 0.448
2.39IleLeu: 2.39 ± 0.894
0.217IleMet: 0.217 ± 0.211
1.086IleAsn: 1.086 ± 0.322
1.738IlePro: 1.738 ± 0.496
1.738IleGln: 1.738 ± 0.663
2.173IleArg: 2.173 ± 0.452
0.652IleSer: 0.652 ± 0.263
2.173IleThr: 2.173 ± 0.794
1.738IleVal: 1.738 ± 0.535
0.652IleTrp: 0.652 ± 0.311
1.304IleTyr: 1.304 ± 0.442
0.0IleXaa: 0.0 ± 0.0
Lys
7.171LysAla: 7.171 ± 1.234
0.0LysCys: 0.0 ± 0.0
3.477LysAsp: 3.477 ± 0.771
3.259LysGlu: 3.259 ± 0.897
1.521LysPhe: 1.521 ± 0.513
3.694LysGly: 3.694 ± 1.388
0.217LysHis: 0.217 ± 0.222
0.869LysIle: 0.869 ± 0.322
1.521LysLys: 1.521 ± 0.618
2.825LysLeu: 2.825 ± 0.95
0.869LysMet: 0.869 ± 0.331
1.521LysAsn: 1.521 ± 0.543
1.956LysPro: 1.956 ± 0.441
0.869LysGln: 0.869 ± 0.477
3.694LysArg: 3.694 ± 0.798
2.825LysSer: 2.825 ± 0.748
3.259LysThr: 3.259 ± 0.693
1.956LysVal: 1.956 ± 0.694
0.869LysTrp: 0.869 ± 0.489
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
13.472LeuAla: 13.472 ± 1.54
0.435LeuCys: 0.435 ± 0.308
7.171LeuAsp: 7.171 ± 1.093
8.04LeuGlu: 8.04 ± 1.024
2.173LeuPhe: 2.173 ± 0.757
7.388LeuGly: 7.388 ± 1.524
0.217LeuHis: 0.217 ± 0.169
3.042LeuIle: 3.042 ± 1.036
5.215LeuLys: 5.215 ± 0.818
4.563LeuLeu: 4.563 ± 1.081
0.869LeuMet: 0.869 ± 0.511
1.956LeuAsn: 1.956 ± 0.871
5.65LeuPro: 5.65 ± 1.07
3.259LeuGln: 3.259 ± 0.981
4.781LeuArg: 4.781 ± 0.977
4.563LeuSer: 4.563 ± 0.686
3.694LeuThr: 3.694 ± 0.734
8.04LeuVal: 8.04 ± 1.075
2.39LeuTrp: 2.39 ± 0.664
1.086LeuTyr: 1.086 ± 0.567
0.0LeuXaa: 0.0 ± 0.0
Met
2.825MetAla: 2.825 ± 0.753
0.0MetCys: 0.0 ± 0.0
1.738MetAsp: 1.738 ± 0.644
1.304MetGlu: 1.304 ± 0.638
0.0MetPhe: 0.0 ± 0.0
1.738MetGly: 1.738 ± 0.784
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.738MetLys: 1.738 ± 0.549
1.304MetLeu: 1.304 ± 0.507
0.652MetMet: 0.652 ± 0.519
0.652MetAsn: 0.652 ± 0.377
1.304MetPro: 1.304 ± 0.813
0.0MetGln: 0.0 ± 0.0
1.738MetArg: 1.738 ± 0.563
1.086MetSer: 1.086 ± 0.337
2.173MetThr: 2.173 ± 0.746
2.39MetVal: 2.39 ± 0.743
0.217MetTrp: 0.217 ± 0.238
0.652MetTyr: 0.652 ± 0.338
0.0MetXaa: 0.0 ± 0.0
Asn
3.042AsnAla: 3.042 ± 0.628
0.0AsnCys: 0.0 ± 0.0
1.738AsnAsp: 1.738 ± 0.497
0.869AsnGlu: 0.869 ± 0.393
0.217AsnPhe: 0.217 ± 0.187
3.911AsnGly: 3.911 ± 1.077
0.652AsnHis: 0.652 ± 0.339
1.956AsnIle: 1.956 ± 0.518
0.869AsnLys: 0.869 ± 0.408
2.825AsnLeu: 2.825 ± 0.58
0.869AsnMet: 0.869 ± 0.474
0.869AsnAsn: 0.869 ± 0.427
1.738AsnPro: 1.738 ± 0.836
0.217AsnGln: 0.217 ± 0.187
2.39AsnArg: 2.39 ± 0.814
0.0AsnSer: 0.0 ± 0.0
2.39AsnThr: 2.39 ± 1.063
3.259AsnVal: 3.259 ± 0.834
0.0AsnTrp: 0.0 ± 0.0
1.304AsnTyr: 1.304 ± 0.463
0.0AsnXaa: 0.0 ± 0.0
Pro
6.302ProAla: 6.302 ± 1.165
0.435ProCys: 0.435 ± 0.327
4.998ProAsp: 4.998 ± 1.206
3.694ProGlu: 3.694 ± 0.738
0.435ProPhe: 0.435 ± 0.27
7.171ProGly: 7.171 ± 1.115
0.652ProHis: 0.652 ± 0.363
1.956ProIle: 1.956 ± 0.56
1.304ProLys: 1.304 ± 0.353
4.346ProLeu: 4.346 ± 1.234
2.39ProMet: 2.39 ± 0.847
1.738ProAsn: 1.738 ± 0.77
4.346ProPro: 4.346 ± 1.245
0.869ProGln: 0.869 ± 0.331
3.694ProArg: 3.694 ± 0.729
3.259ProSer: 3.259 ± 0.856
1.521ProThr: 1.521 ± 0.72
2.825ProVal: 2.825 ± 0.554
0.435ProTrp: 0.435 ± 0.303
0.435ProTyr: 0.435 ± 0.263
0.0ProXaa: 0.0 ± 0.0
Gln
5.65GlnAla: 5.65 ± 1.055
0.435GlnCys: 0.435 ± 0.394
3.477GlnAsp: 3.477 ± 0.836
5.215GlnGlu: 5.215 ± 1.314
0.869GlnPhe: 0.869 ± 0.323
3.911GlnGly: 3.911 ± 1.019
0.217GlnHis: 0.217 ± 0.169
0.217GlnIle: 0.217 ± 0.187
2.825GlnLys: 2.825 ± 0.749
3.911GlnLeu: 3.911 ± 0.715
0.435GlnMet: 0.435 ± 0.291
0.217GlnAsn: 0.217 ± 0.197
0.435GlnPro: 0.435 ± 0.258
0.0GlnGln: 0.0 ± 0.0
1.738GlnArg: 1.738 ± 0.51
1.086GlnSer: 1.086 ± 0.424
1.956GlnThr: 1.956 ± 0.592
5.432GlnVal: 5.432 ± 0.842
0.435GlnTrp: 0.435 ± 0.353
0.652GlnTyr: 0.652 ± 0.38
0.0GlnXaa: 0.0 ± 0.0
Arg
8.257ArgAla: 8.257 ± 2.262
0.869ArgCys: 0.869 ± 0.49
3.259ArgAsp: 3.259 ± 0.885
2.608ArgGlu: 2.608 ± 0.787
0.652ArgPhe: 0.652 ± 0.263
7.171ArgGly: 7.171 ± 0.917
1.304ArgHis: 1.304 ± 0.588
2.39ArgIle: 2.39 ± 0.683
2.825ArgLys: 2.825 ± 0.64
4.998ArgLeu: 4.998 ± 1.008
1.738ArgMet: 1.738 ± 0.665
2.608ArgAsn: 2.608 ± 0.927
4.129ArgPro: 4.129 ± 0.936
3.042ArgGln: 3.042 ± 0.791
5.215ArgArg: 5.215 ± 1.277
1.738ArgSer: 1.738 ± 0.578
5.215ArgThr: 5.215 ± 0.924
4.129ArgVal: 4.129 ± 0.857
1.304ArgTrp: 1.304 ± 0.551
2.608ArgTyr: 2.608 ± 0.697
0.0ArgXaa: 0.0 ± 0.0
Ser
5.867SerAla: 5.867 ± 1.516
0.217SerCys: 0.217 ± 0.191
1.521SerAsp: 1.521 ± 0.488
1.521SerGlu: 1.521 ± 0.451
1.086SerPhe: 1.086 ± 0.567
5.65SerGly: 5.65 ± 0.908
0.435SerHis: 0.435 ± 0.291
1.304SerIle: 1.304 ± 0.375
1.086SerLys: 1.086 ± 0.387
5.432SerLeu: 5.432 ± 1.125
1.521SerMet: 1.521 ± 0.37
1.086SerAsn: 1.086 ± 0.411
2.39SerPro: 2.39 ± 0.718
1.738SerGln: 1.738 ± 0.41
3.259SerArg: 3.259 ± 0.829
2.173SerSer: 2.173 ± 0.565
3.042SerThr: 3.042 ± 0.494
4.129SerVal: 4.129 ± 1.012
0.869SerTrp: 0.869 ± 0.387
1.086SerTyr: 1.086 ± 0.408
0.0SerXaa: 0.0 ± 0.0
Thr
10.213ThrAla: 10.213 ± 1.25
0.217ThrCys: 0.217 ± 0.21
4.998ThrAsp: 4.998 ± 0.798
2.39ThrGlu: 2.39 ± 0.699
1.521ThrPhe: 1.521 ± 0.586
4.346ThrGly: 4.346 ± 1.231
0.435ThrHis: 0.435 ± 0.291
2.39ThrIle: 2.39 ± 0.694
3.259ThrLys: 3.259 ± 0.747
5.215ThrLeu: 5.215 ± 0.965
1.956ThrMet: 1.956 ± 0.642
1.086ThrAsn: 1.086 ± 0.581
3.477ThrPro: 3.477 ± 0.809
0.869ThrGln: 0.869 ± 0.352
4.346ThrArg: 4.346 ± 0.931
2.608ThrSer: 2.608 ± 0.62
4.129ThrThr: 4.129 ± 0.928
5.65ThrVal: 5.65 ± 1.578
2.825ThrTrp: 2.825 ± 0.837
3.259ThrTyr: 3.259 ± 0.783
0.0ThrXaa: 0.0 ± 0.0
Val
11.951ValAla: 11.951 ± 1.698
0.0ValCys: 0.0 ± 0.0
6.084ValAsp: 6.084 ± 0.95
3.694ValGlu: 3.694 ± 1.22
1.521ValPhe: 1.521 ± 0.56
4.129ValGly: 4.129 ± 0.585
1.304ValHis: 1.304 ± 0.515
2.825ValIle: 2.825 ± 0.533
3.477ValLys: 3.477 ± 1.016
7.388ValLeu: 7.388 ± 1.648
1.521ValMet: 1.521 ± 0.612
3.477ValAsn: 3.477 ± 1.139
4.129ValPro: 4.129 ± 0.832
4.998ValGln: 4.998 ± 0.923
4.998ValArg: 4.998 ± 0.869
5.215ValSer: 5.215 ± 1.118
7.605ValThr: 7.605 ± 1.428
4.998ValVal: 4.998 ± 1.126
0.217ValTrp: 0.217 ± 0.205
0.869ValTyr: 0.869 ± 0.399
0.0ValXaa: 0.0 ± 0.0
Trp
1.956TrpAla: 1.956 ± 0.601
0.217TrpCys: 0.217 ± 0.21
0.217TrpAsp: 0.217 ± 0.205
0.435TrpGlu: 0.435 ± 0.263
0.217TrpPhe: 0.217 ± 0.205
1.521TrpGly: 1.521 ± 0.59
0.435TrpHis: 0.435 ± 0.274
0.652TrpIle: 0.652 ± 0.328
1.521TrpLys: 1.521 ± 0.549
2.173TrpLeu: 2.173 ± 0.519
0.0TrpMet: 0.0 ± 0.0
0.435TrpAsn: 0.435 ± 0.263
1.086TrpPro: 1.086 ± 0.524
0.869TrpGln: 0.869 ± 0.589
2.173TrpArg: 2.173 ± 0.688
0.652TrpSer: 0.652 ± 0.321
1.521TrpThr: 1.521 ± 0.625
0.869TrpVal: 0.869 ± 0.348
0.652TrpTrp: 0.652 ± 0.263
0.652TrpTyr: 0.652 ± 0.289
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.781TyrAla: 4.781 ± 0.949
0.217TyrCys: 0.217 ± 0.197
1.738TyrAsp: 1.738 ± 0.651
0.869TyrGlu: 0.869 ± 0.368
0.652TyrPhe: 0.652 ± 0.31
1.086TyrGly: 1.086 ± 0.378
0.217TyrHis: 0.217 ± 0.169
0.435TyrIle: 0.435 ± 0.296
1.086TyrLys: 1.086 ± 0.467
2.173TyrLeu: 2.173 ± 0.63
0.652TyrMet: 0.652 ± 0.313
0.869TyrAsn: 0.869 ± 0.378
0.217TyrPro: 0.217 ± 0.226
1.304TyrGln: 1.304 ± 0.563
1.086TyrArg: 1.086 ± 0.43
0.435TyrSer: 0.435 ± 0.258
1.086TyrThr: 1.086 ± 0.411
2.39TyrVal: 2.39 ± 0.651
0.217TyrTrp: 0.217 ± 0.226
0.435TyrTyr: 0.435 ± 0.279
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (4603 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski