Amino acid dipepetide frequency for Brevibacterium phage AGM6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.11AlaAla: 12.11 ± 2.052
0.773AlaCys: 0.773 ± 0.254
5.84AlaAsp: 5.84 ± 0.864
6.871AlaGlu: 6.871 ± 0.793
2.748AlaPhe: 2.748 ± 0.387
9.534AlaGly: 9.534 ± 1.296
1.117AlaHis: 1.117 ± 0.314
4.294AlaIle: 4.294 ± 0.584
4.724AlaLys: 4.724 ± 0.897
9.362AlaLeu: 9.362 ± 0.921
2.748AlaMet: 2.748 ± 0.567
3.607AlaAsn: 3.607 ± 0.588
3.865AlaPro: 3.865 ± 0.674
4.294AlaGln: 4.294 ± 0.544
7.301AlaArg: 7.301 ± 0.745
6.012AlaSer: 6.012 ± 0.647
6.442AlaThr: 6.442 ± 0.647
6.27AlaVal: 6.27 ± 0.813
1.546AlaTrp: 1.546 ± 0.293
2.491AlaTyr: 2.491 ± 0.483
0.0AlaXaa: 0.0 ± 0.0
Cys
1.031CysAla: 1.031 ± 0.303
0.258CysCys: 0.258 ± 0.193
0.945CysAsp: 0.945 ± 0.33
0.344CysGlu: 0.344 ± 0.186
0.172CysPhe: 0.172 ± 0.119
0.773CysGly: 0.773 ± 0.277
0.429CysHis: 0.429 ± 0.2
0.0CysIle: 0.0 ± 0.0
0.859CysLys: 0.859 ± 0.349
0.601CysLeu: 0.601 ± 0.295
0.344CysMet: 0.344 ± 0.233
0.172CysAsn: 0.172 ± 0.119
0.687CysPro: 0.687 ± 0.343
0.344CysGln: 0.344 ± 0.221
0.773CysArg: 0.773 ± 0.395
0.258CysSer: 0.258 ± 0.178
0.258CysThr: 0.258 ± 0.15
0.687CysVal: 0.687 ± 0.326
0.172CysTrp: 0.172 ± 0.116
0.344CysTyr: 0.344 ± 0.198
0.0CysXaa: 0.0 ± 0.0
Asp
6.098AspAla: 6.098 ± 0.73
0.945AspCys: 0.945 ± 0.388
3.951AspAsp: 3.951 ± 0.719
3.178AspGlu: 3.178 ± 0.585
1.632AspPhe: 1.632 ± 0.332
7.043AspGly: 7.043 ± 0.99
1.117AspHis: 1.117 ± 0.3
2.061AspIle: 2.061 ± 0.387
2.233AspLys: 2.233 ± 0.55
5.239AspLeu: 5.239 ± 0.674
1.031AspMet: 1.031 ± 0.273
2.061AspAsn: 2.061 ± 0.518
3.865AspPro: 3.865 ± 0.746
2.577AspGln: 2.577 ± 0.401
5.153AspArg: 5.153 ± 0.661
4.123AspSer: 4.123 ± 0.571
5.669AspThr: 5.669 ± 1.038
4.466AspVal: 4.466 ± 0.601
0.859AspTrp: 0.859 ± 0.251
2.061AspTyr: 2.061 ± 0.421
0.0AspXaa: 0.0 ± 0.0
Glu
8.331GluAla: 8.331 ± 0.945
0.258GluCys: 0.258 ± 0.146
3.006GluAsp: 3.006 ± 0.534
4.638GluGlu: 4.638 ± 0.875
1.202GluPhe: 1.202 ± 0.419
4.209GluGly: 4.209 ± 0.717
1.46GluHis: 1.46 ± 0.354
3.092GluIle: 3.092 ± 0.368
2.405GluLys: 2.405 ± 0.52
4.037GluLeu: 4.037 ± 0.566
1.031GluMet: 1.031 ± 0.377
2.319GluAsn: 2.319 ± 0.517
2.748GluPro: 2.748 ± 0.612
3.264GluGln: 3.264 ± 0.633
4.896GluArg: 4.896 ± 0.638
3.521GluSer: 3.521 ± 0.692
3.951GluThr: 3.951 ± 0.566
3.521GluVal: 3.521 ± 0.606
0.773GluTrp: 0.773 ± 0.277
1.975GluTyr: 1.975 ± 0.4
0.0GluXaa: 0.0 ± 0.0
Phe
2.491PheAla: 2.491 ± 0.469
0.172PheCys: 0.172 ± 0.136
2.319PheAsp: 2.319 ± 0.392
1.718PheGlu: 1.718 ± 0.448
0.429PhePhe: 0.429 ± 0.195
2.92PheGly: 2.92 ± 0.484
0.687PheHis: 0.687 ± 0.268
1.46PheIle: 1.46 ± 0.4
1.374PheLys: 1.374 ± 0.287
1.46PheLeu: 1.46 ± 0.383
0.515PheMet: 0.515 ± 0.203
0.773PheAsn: 0.773 ± 0.244
1.89PhePro: 1.89 ± 0.38
0.601PheGln: 0.601 ± 0.239
1.202PheArg: 1.202 ± 0.344
1.632PheSer: 1.632 ± 0.437
2.147PheThr: 2.147 ± 0.427
1.718PheVal: 1.718 ± 0.334
0.172PheTrp: 0.172 ± 0.14
0.773PheTyr: 0.773 ± 0.211
0.0PheXaa: 0.0 ± 0.0
Gly
7.73GlyAla: 7.73 ± 1.039
0.687GlyCys: 0.687 ± 0.299
5.325GlyAsp: 5.325 ± 0.734
4.037GlyGlu: 4.037 ± 0.703
3.35GlyPhe: 3.35 ± 0.681
7.386GlyGly: 7.386 ± 1.703
1.117GlyHis: 1.117 ± 0.243
4.037GlyIle: 4.037 ± 0.935
3.693GlyLys: 3.693 ± 0.679
7.73GlyLeu: 7.73 ± 0.898
2.834GlyMet: 2.834 ± 0.588
3.779GlyAsn: 3.779 ± 0.846
4.466GlyPro: 4.466 ± 1.212
2.663GlyGln: 2.663 ± 0.434
5.411GlyArg: 5.411 ± 0.743
6.528GlySer: 6.528 ± 0.987
5.325GlyThr: 5.325 ± 0.868
5.239GlyVal: 5.239 ± 0.754
2.491GlyTrp: 2.491 ± 0.609
3.264GlyTyr: 3.264 ± 0.507
0.0GlyXaa: 0.0 ± 0.0
His
1.718HisAla: 1.718 ± 0.33
0.086HisCys: 0.086 ± 0.088
1.202HisAsp: 1.202 ± 0.334
0.687HisGlu: 0.687 ± 0.316
0.859HisPhe: 0.859 ± 0.298
1.202HisGly: 1.202 ± 0.375
0.429HisHis: 0.429 ± 0.145
0.859HisIle: 0.859 ± 0.223
1.031HisLys: 1.031 ± 0.288
1.46HisLeu: 1.46 ± 0.34
0.344HisMet: 0.344 ± 0.159
0.258HisAsn: 0.258 ± 0.164
2.663HisPro: 2.663 ± 0.857
0.945HisGln: 0.945 ± 0.289
0.945HisArg: 0.945 ± 0.454
0.773HisSer: 0.773 ± 0.229
1.46HisThr: 1.46 ± 0.43
1.031HisVal: 1.031 ± 0.376
0.429HisTrp: 0.429 ± 0.194
0.773HisTyr: 0.773 ± 0.237
0.0HisXaa: 0.0 ± 0.0
Ile
4.724IleAla: 4.724 ± 0.669
0.601IleCys: 0.601 ± 0.311
3.178IleAsp: 3.178 ± 0.427
3.178IleGlu: 3.178 ± 0.498
1.031IlePhe: 1.031 ± 0.365
3.779IleGly: 3.779 ± 0.636
0.859IleHis: 0.859 ± 0.209
1.975IleIle: 1.975 ± 0.445
1.546IleLys: 1.546 ± 0.309
3.092IleLeu: 3.092 ± 0.528
0.773IleMet: 0.773 ± 0.215
1.546IleAsn: 1.546 ± 0.395
2.748IlePro: 2.748 ± 0.492
1.632IleGln: 1.632 ± 0.29
3.35IleArg: 3.35 ± 0.54
2.663IleSer: 2.663 ± 0.456
3.779IleThr: 3.779 ± 0.544
3.178IleVal: 3.178 ± 0.555
0.601IleTrp: 0.601 ± 0.199
0.773IleTyr: 0.773 ± 0.249
0.0IleXaa: 0.0 ± 0.0
Lys
4.209LysAla: 4.209 ± 1.082
0.601LysCys: 0.601 ± 0.284
3.092LysAsp: 3.092 ± 0.456
2.319LysGlu: 2.319 ± 0.424
1.546LysPhe: 1.546 ± 0.44
3.779LysGly: 3.779 ± 0.714
1.202LysHis: 1.202 ± 0.373
2.405LysIle: 2.405 ± 0.418
2.748LysLys: 2.748 ± 0.634
3.178LysLeu: 3.178 ± 0.585
1.632LysMet: 1.632 ± 0.326
1.46LysAsn: 1.46 ± 0.37
2.491LysPro: 2.491 ± 0.551
1.975LysGln: 1.975 ± 0.483
3.006LysArg: 3.006 ± 0.636
2.405LysSer: 2.405 ± 0.416
2.92LysThr: 2.92 ± 0.954
3.092LysVal: 3.092 ± 0.703
0.687LysTrp: 0.687 ± 0.21
1.117LysTyr: 1.117 ± 0.3
0.0LysXaa: 0.0 ± 0.0
Leu
7.73LeuAla: 7.73 ± 1.014
0.945LeuCys: 0.945 ± 0.412
4.466LeuAsp: 4.466 ± 0.579
4.123LeuGlu: 4.123 ± 0.576
1.202LeuPhe: 1.202 ± 0.333
6.957LeuGly: 6.957 ± 0.89
1.632LeuHis: 1.632 ± 0.445
2.748LeuIle: 2.748 ± 0.671
3.607LeuLys: 3.607 ± 0.673
4.724LeuLeu: 4.724 ± 0.554
1.202LeuMet: 1.202 ± 0.248
2.663LeuAsn: 2.663 ± 0.417
3.178LeuPro: 3.178 ± 0.581
2.491LeuGln: 2.491 ± 0.419
6.442LeuArg: 6.442 ± 0.878
5.926LeuSer: 5.926 ± 0.624
5.755LeuThr: 5.755 ± 0.609
4.982LeuVal: 4.982 ± 0.604
0.945LeuTrp: 0.945 ± 0.242
1.632LeuTyr: 1.632 ± 0.327
0.0LeuXaa: 0.0 ± 0.0
Met
2.663MetAla: 2.663 ± 0.467
0.086MetCys: 0.086 ± 0.091
1.288MetAsp: 1.288 ± 0.34
1.202MetGlu: 1.202 ± 0.341
0.344MetPhe: 0.344 ± 0.173
2.061MetGly: 2.061 ± 0.532
0.172MetHis: 0.172 ± 0.127
1.031MetIle: 1.031 ± 0.287
1.46MetLys: 1.46 ± 0.349
1.374MetLeu: 1.374 ± 0.46
0.344MetMet: 0.344 ± 0.183
0.945MetAsn: 0.945 ± 0.221
0.859MetPro: 0.859 ± 0.296
0.859MetGln: 0.859 ± 0.322
2.061MetArg: 2.061 ± 0.485
2.834MetSer: 2.834 ± 0.726
2.061MetThr: 2.061 ± 0.443
1.546MetVal: 1.546 ± 0.368
0.859MetTrp: 0.859 ± 0.289
0.344MetTyr: 0.344 ± 0.163
0.0MetXaa: 0.0 ± 0.0
Asn
3.006AsnAla: 3.006 ± 0.617
0.258AsnCys: 0.258 ± 0.193
1.718AsnAsp: 1.718 ± 0.52
2.92AsnGlu: 2.92 ± 0.661
1.031AsnPhe: 1.031 ± 0.326
3.264AsnGly: 3.264 ± 0.461
0.773AsnHis: 0.773 ± 0.308
1.546AsnIle: 1.546 ± 0.393
1.546AsnLys: 1.546 ± 0.32
2.748AsnLeu: 2.748 ± 0.521
0.515AsnMet: 0.515 ± 0.206
1.031AsnAsn: 1.031 ± 0.253
1.975AsnPro: 1.975 ± 0.437
1.46AsnGln: 1.46 ± 0.386
2.233AsnArg: 2.233 ± 0.435
1.89AsnSer: 1.89 ± 0.377
3.264AsnThr: 3.264 ± 0.488
1.546AsnVal: 1.546 ± 0.359
0.515AsnTrp: 0.515 ± 0.224
0.945AsnTyr: 0.945 ± 0.301
0.0AsnXaa: 0.0 ± 0.0
Pro
6.356ProAla: 6.356 ± 1.6
0.515ProCys: 0.515 ± 0.249
3.693ProAsp: 3.693 ± 0.842
3.178ProGlu: 3.178 ± 0.549
0.687ProPhe: 0.687 ± 0.229
4.037ProGly: 4.037 ± 0.538
0.945ProHis: 0.945 ± 0.266
1.89ProIle: 1.89 ± 0.39
2.92ProLys: 2.92 ± 0.868
2.834ProLeu: 2.834 ± 0.516
1.975ProMet: 1.975 ± 0.427
2.233ProAsn: 2.233 ± 0.554
3.178ProPro: 3.178 ± 0.688
1.632ProGln: 1.632 ± 0.3
2.663ProArg: 2.663 ± 0.648
3.521ProSer: 3.521 ± 0.789
3.35ProThr: 3.35 ± 0.642
5.153ProVal: 5.153 ± 1.459
0.601ProTrp: 0.601 ± 0.235
1.117ProTyr: 1.117 ± 0.298
0.0ProXaa: 0.0 ± 0.0
Gln
4.294GlnAla: 4.294 ± 0.775
0.344GlnCys: 0.344 ± 0.208
2.319GlnAsp: 2.319 ± 0.386
1.718GlnGlu: 1.718 ± 0.369
1.288GlnPhe: 1.288 ± 0.342
2.405GlnGly: 2.405 ± 0.435
0.945GlnHis: 0.945 ± 0.29
2.147GlnIle: 2.147 ± 0.444
1.89GlnLys: 1.89 ± 0.376
2.92GlnLeu: 2.92 ± 0.451
0.945GlnMet: 0.945 ± 0.213
1.202GlnAsn: 1.202 ± 0.32
1.031GlnPro: 1.031 ± 0.292
1.374GlnGln: 1.374 ± 0.386
2.748GlnArg: 2.748 ± 0.495
3.521GlnSer: 3.521 ± 0.885
1.89GlnThr: 1.89 ± 0.455
2.233GlnVal: 2.233 ± 0.384
0.859GlnTrp: 0.859 ± 0.303
1.031GlnTyr: 1.031 ± 0.356
0.0GlnXaa: 0.0 ± 0.0
Arg
6.098ArgAla: 6.098 ± 0.732
0.945ArgCys: 0.945 ± 0.367
4.81ArgAsp: 4.81 ± 0.96
4.294ArgGlu: 4.294 ± 0.561
1.89ArgPhe: 1.89 ± 0.379
5.067ArgGly: 5.067 ± 0.629
1.804ArgHis: 1.804 ± 0.448
4.123ArgIle: 4.123 ± 0.665
3.521ArgLys: 3.521 ± 0.564
5.239ArgLeu: 5.239 ± 0.797
1.804ArgMet: 1.804 ± 0.393
2.319ArgAsn: 2.319 ± 0.388
2.663ArgPro: 2.663 ± 0.588
2.405ArgGln: 2.405 ± 0.38
6.528ArgArg: 6.528 ± 1.236
5.325ArgSer: 5.325 ± 0.476
4.209ArgThr: 4.209 ± 0.647
4.896ArgVal: 4.896 ± 0.659
0.773ArgTrp: 0.773 ± 0.311
1.718ArgTyr: 1.718 ± 0.38
0.0ArgXaa: 0.0 ± 0.0
Ser
6.871SerAla: 6.871 ± 0.765
0.515SerCys: 0.515 ± 0.217
3.521SerAsp: 3.521 ± 0.595
3.865SerGlu: 3.865 ± 0.619
1.718SerPhe: 1.718 ± 0.352
7.816SerGly: 7.816 ± 1.222
0.515SerHis: 0.515 ± 0.204
3.006SerIle: 3.006 ± 0.523
3.35SerLys: 3.35 ± 0.502
3.779SerLeu: 3.779 ± 0.75
2.405SerMet: 2.405 ± 0.358
2.233SerAsn: 2.233 ± 0.392
3.865SerPro: 3.865 ± 0.852
2.405SerGln: 2.405 ± 0.61
4.037SerArg: 4.037 ± 0.613
5.239SerSer: 5.239 ± 0.891
6.098SerThr: 6.098 ± 0.711
4.638SerVal: 4.638 ± 0.578
0.773SerTrp: 0.773 ± 0.236
1.374SerTyr: 1.374 ± 0.348
0.0SerXaa: 0.0 ± 0.0
Thr
7.043ThrAla: 7.043 ± 1.042
0.515ThrCys: 0.515 ± 0.246
5.239ThrAsp: 5.239 ± 0.795
3.865ThrGlu: 3.865 ± 0.566
2.147ThrPhe: 2.147 ± 0.531
6.699ThrGly: 6.699 ± 0.98
1.117ThrHis: 1.117 ± 0.324
3.436ThrIle: 3.436 ± 0.549
2.491ThrLys: 2.491 ± 0.387
5.84ThrLeu: 5.84 ± 0.661
1.718ThrMet: 1.718 ± 0.534
2.061ThrAsn: 2.061 ± 0.512
4.466ThrPro: 4.466 ± 0.704
2.663ThrGln: 2.663 ± 0.717
3.006ThrArg: 3.006 ± 0.808
4.982ThrSer: 4.982 ± 0.6
5.325ThrThr: 5.325 ± 0.665
4.982ThrVal: 4.982 ± 0.732
1.031ThrTrp: 1.031 ± 0.251
1.804ThrTyr: 1.804 ± 0.524
0.0ThrXaa: 0.0 ± 0.0
Val
5.669ValAla: 5.669 ± 0.636
0.601ValCys: 0.601 ± 0.271
5.926ValAsp: 5.926 ± 0.744
5.067ValGlu: 5.067 ± 0.779
1.804ValPhe: 1.804 ± 0.542
4.724ValGly: 4.724 ± 0.64
2.147ValHis: 2.147 ± 0.737
3.35ValIle: 3.35 ± 0.574
2.319ValLys: 2.319 ± 0.608
5.067ValLeu: 5.067 ± 0.663
1.632ValMet: 1.632 ± 0.401
2.061ValAsn: 2.061 ± 0.396
3.951ValPro: 3.951 ± 0.649
2.061ValGln: 2.061 ± 0.457
5.325ValArg: 5.325 ± 0.816
4.209ValSer: 4.209 ± 0.653
3.607ValThr: 3.607 ± 0.727
4.552ValVal: 4.552 ± 0.87
1.718ValTrp: 1.718 ± 0.482
1.288ValTyr: 1.288 ± 0.266
0.0ValXaa: 0.0 ± 0.0
Trp
1.718TrpAla: 1.718 ± 0.504
0.172TrpCys: 0.172 ± 0.118
1.374TrpAsp: 1.374 ± 0.394
1.546TrpGlu: 1.546 ± 0.371
0.344TrpPhe: 0.344 ± 0.228
0.773TrpGly: 0.773 ± 0.226
0.086TrpHis: 0.086 ± 0.104
0.859TrpIle: 0.859 ± 0.223
1.202TrpLys: 1.202 ± 0.309
1.202TrpLeu: 1.202 ± 0.336
0.172TrpMet: 0.172 ± 0.097
0.601TrpAsn: 0.601 ± 0.217
0.429TrpPro: 0.429 ± 0.185
0.601TrpGln: 0.601 ± 0.18
1.46TrpArg: 1.46 ± 0.433
1.202TrpSer: 1.202 ± 0.399
1.202TrpThr: 1.202 ± 0.348
1.117TrpVal: 1.117 ± 0.269
0.172TrpTrp: 0.172 ± 0.115
0.086TrpTyr: 0.086 ± 0.104
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.061TyrAla: 2.061 ± 0.329
0.172TyrCys: 0.172 ± 0.126
2.147TyrAsp: 2.147 ± 0.571
1.89TyrGlu: 1.89 ± 0.457
1.031TyrPhe: 1.031 ± 0.317
2.748TyrGly: 2.748 ± 0.486
0.601TyrHis: 0.601 ± 0.25
0.687TyrIle: 0.687 ± 0.218
0.945TyrLys: 0.945 ± 0.289
1.46TyrLeu: 1.46 ± 0.5
0.344TyrMet: 0.344 ± 0.146
0.859TyrAsn: 0.859 ± 0.346
1.46TyrPro: 1.46 ± 0.401
0.859TyrGln: 0.859 ± 0.239
1.89TyrArg: 1.89 ± 0.468
1.46TyrSer: 1.46 ± 0.596
1.718TyrThr: 1.718 ± 0.47
2.233TyrVal: 2.233 ± 0.444
0.344TyrTrp: 0.344 ± 0.158
0.344TyrTyr: 0.344 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (11644 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski