Amino acid dipeptide frequency for Streptococcus satellite phage Javan212

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
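As a rough illustration of what such a frequency is, the sketch below counts overlapping adjacent residue pairs and reports them in per mille. It assumes one-letter sequence strings, pairs that never span two proteins, and normalization by the total number of pairs; the page does not state how the database itself counts or normalizes, and the function name dipeptide_permille and the toy sequences are made up for this example.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKLLE", "MEEK"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(f"{pair}: {freqs[pair]:.3f}")
```

The two toy sequences contain seven distinct pairs in total, so each pair ends up with the same share of 1000/7 per mille.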

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
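The expected value and the over/under comparison can be sketched as follows. This is only an illustration: the plain greater-than/less-than rule is an assumption, since the page does not say whether the colouring takes the error estimate into account, and the names expected_permille and classify are hypothetical.

```python
def expected_permille(alphabet_size=20):
    """Frequency (in per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a value as over- or underrepresented relative to the uniform expectation."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


print(expected_permille())      # 20 amino acids, 400 dipeptides
print(expected_permille(21))    # 21 symbols including Xaa, 441 dipeptides
print(classify(11.925))         # GluLeu value from the table
print(classify(0.341))          # AlaAla value from the table
```

With 20 amino acids the expectation is 2.5‰, so GluLeu (11.925‰) falls above it and AlaAla (0.341‰) below it.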

All values are given in per mille (‰); multiply by 10^-3 to obtain the corresponding fraction.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.341 ± 0.327
AlaCys: 0.0 ± 0.0
AlaAsp: 5.111 ± 1.293
AlaGlu: 3.748 ± 1.475
AlaPhe: 3.066 ± 1.025
AlaGly: 2.044 ± 0.898
AlaHis: 0.681 ± 0.423
AlaIle: 4.429 ± 0.989
AlaLys: 3.407 ± 1.49
AlaLeu: 3.407 ± 1.075
AlaMet: 2.044 ± 0.895
AlaAsn: 2.385 ± 1.192
AlaPro: 0.341 ± 0.342
AlaGln: 0.681 ± 0.396
AlaArg: 1.704 ± 0.742
AlaSer: 4.429 ± 0.861
AlaThr: 5.451 ± 1.528
AlaVal: 2.726 ± 1.131
AlaTrp: 0.681 ± 0.543
AlaTyr: 2.044 ± 0.781
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.681 ± 0.494
CysGlu: 0.341 ± 0.327
CysPhe: 0.0 ± 0.0
CysGly: 0.681 ± 0.432
CysHis: 0.0 ± 0.0
CysIle: 1.022 ± 0.793
CysLys: 1.022 ± 0.57
CysLeu: 0.681 ± 0.361
CysMet: 0.341 ± 0.401
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.341 ± 0.342
CysSer: 0.341 ± 0.321
CysThr: 0.0 ± 0.0
CysVal: 0.341 ± 0.413
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.022 ± 0.605
AspCys: 0.341 ± 0.394
AspAsp: 4.429 ± 1.353
AspGlu: 4.77 ± 2.25
AspPhe: 3.407 ± 1.806
AspGly: 3.407 ± 1.07
AspHis: 1.022 ± 0.63
AspIle: 4.429 ± 1.1
AspLys: 5.451 ± 0.936
AspLeu: 5.111 ± 0.684
AspMet: 3.407 ± 1.261
AspAsn: 5.111 ± 1.512
AspPro: 1.363 ± 0.599
AspGln: 1.022 ± 0.569
AspArg: 2.726 ± 0.953
AspSer: 4.089 ± 0.857
AspThr: 2.726 ± 0.748
AspVal: 4.77 ± 1.207
AspTrp: 0.0 ± 0.0
AspTyr: 4.089 ± 0.929
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.407 ± 1.291
GluCys: 1.022 ± 0.558
GluAsp: 3.407 ± 1.001
GluGlu: 6.133 ± 1.561
GluPhe: 2.385 ± 0.68
GluGly: 2.044 ± 0.786
GluHis: 2.044 ± 0.626
GluIle: 6.474 ± 1.257
GluLys: 8.177 ± 1.723
GluLeu: 11.925 ± 3.071
GluMet: 3.066 ± 1.108
GluAsn: 4.77 ± 1.362
GluPro: 1.363 ± 0.649
GluGln: 4.77 ± 1.495
GluArg: 4.77 ± 1.275
GluSer: 2.726 ± 1.16
GluThr: 4.77 ± 1.485
GluVal: 6.474 ± 1.499
GluTrp: 2.044 ± 0.659
GluTyr: 2.044 ± 1.107
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.022 ± 0.668
PheCys: 0.341 ± 0.342
PheAsp: 3.066 ± 1.01
PheGlu: 3.748 ± 1.199
PhePhe: 2.044 ± 0.748
PheGly: 1.363 ± 0.588
PheHis: 0.341 ± 0.306
PheIle: 3.066 ± 1.642
PheLys: 4.429 ± 1.14
PheLeu: 3.407 ± 0.854
PheMet: 0.341 ± 0.282
PheAsn: 2.726 ± 0.964
PhePro: 0.341 ± 0.356
PheGln: 1.022 ± 0.852
PheArg: 1.022 ± 0.63
PheSer: 3.748 ± 0.861
PheThr: 1.363 ± 0.433
PheVal: 1.704 ± 0.666
PheTrp: 0.0 ± 0.0
PheTyr: 2.726 ± 0.788
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.066 ± 0.84
GlyCys: 0.341 ± 0.321
GlyAsp: 1.363 ± 0.719
GlyGlu: 3.066 ± 1.108
GlyPhe: 4.089 ± 1.366
GlyGly: 1.363 ± 0.616
GlyHis: 1.704 ± 0.696
GlyIle: 4.089 ± 0.82
GlyLys: 4.77 ± 1.065
GlyLeu: 6.133 ± 1.216
GlyMet: 0.681 ± 0.462
GlyAsn: 2.044 ± 0.628
GlyPro: 1.022 ± 0.726
GlyGln: 0.341 ± 0.282
GlyArg: 1.022 ± 0.579
GlySer: 1.363 ± 0.752
GlyThr: 2.385 ± 0.803
GlyVal: 3.748 ± 1.138
GlyTrp: 0.341 ± 0.356
GlyTyr: 3.066 ± 0.845
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.726 ± 0.985
HisCys: 0.0 ± 0.0
HisAsp: 0.681 ± 0.459
HisGlu: 0.681 ± 0.482
HisPhe: 0.681 ± 0.446
HisGly: 0.681 ± 0.423
HisHis: 0.0 ± 0.0
HisIle: 0.341 ± 0.482
HisLys: 1.022 ± 0.406
HisLeu: 1.022 ± 0.441
HisMet: 0.0 ± 0.0
HisAsn: 1.022 ± 0.547
HisPro: 0.341 ± 0.321
HisGln: 0.341 ± 0.415
HisArg: 0.341 ± 0.306
HisSer: 1.022 ± 0.579
HisThr: 1.022 ± 0.468
HisVal: 1.022 ± 0.643
HisTrp: 0.0 ± 0.0
HisTyr: 0.341 ± 0.342
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.089 ± 1.15
IleCys: 1.022 ± 0.543
IleAsp: 4.089 ± 0.916
IleGlu: 5.792 ± 1.207
IlePhe: 2.385 ± 0.694
IleGly: 4.77 ± 1.705
IleHis: 0.341 ± 0.327
IleIle: 3.407 ± 1.199
IleLys: 8.518 ± 1.723
IleLeu: 3.748 ± 1.27
IleMet: 2.726 ± 0.871
IleAsn: 5.792 ± 1.199
IlePro: 1.363 ± 0.77
IleGln: 3.066 ± 0.903
IleArg: 3.407 ± 1.558
IleSer: 5.451 ± 1.523
IleThr: 4.77 ± 1.299
IleVal: 4.089 ± 1.216
IleTrp: 1.704 ± 0.635
IleTyr: 4.089 ± 1.092
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.133 ± 1.524
LysCys: 0.0 ± 0.0
LysAsp: 8.177 ± 1.565
LysGlu: 9.199 ± 1.813
LysPhe: 0.681 ± 0.407
LysGly: 3.407 ± 1.487
LysHis: 1.022 ± 0.525
LysIle: 7.155 ± 1.26
LysLys: 8.518 ± 1.786
LysLeu: 9.199 ± 1.591
LysMet: 2.044 ± 0.965
LysAsn: 7.155 ± 1.388
LysPro: 2.726 ± 1.158
LysGln: 7.496 ± 1.977
LysArg: 3.066 ± 0.968
LysSer: 4.77 ± 0.796
LysThr: 5.451 ± 1.826
LysVal: 5.792 ± 1.042
LysTrp: 0.681 ± 0.423
LysTyr: 5.111 ± 0.908
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.429 ± 1.325
LeuCys: 0.0 ± 0.0
LeuAsp: 8.177 ± 1.75
LeuGlu: 9.54 ± 1.893
LeuPhe: 2.726 ± 0.568
LeuGly: 4.429 ± 1.071
LeuHis: 0.341 ± 0.339
LeuIle: 7.496 ± 1.796
LeuLys: 8.859 ± 1.433
LeuLeu: 6.474 ± 1.353
LeuMet: 2.726 ± 0.927
LeuAsn: 7.155 ± 1.462
LeuPro: 2.726 ± 0.806
LeuGln: 4.089 ± 1.041
LeuArg: 4.77 ± 0.904
LeuSer: 8.177 ± 1.699
LeuThr: 4.77 ± 1.278
LeuVal: 5.451 ± 1.731
LeuTrp: 1.363 ± 0.399
LeuTyr: 4.429 ± 0.825
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.407 ± 1.287
MetCys: 0.0 ± 0.0
MetAsp: 1.363 ± 0.549
MetGlu: 2.385 ± 0.91
MetPhe: 0.341 ± 0.282
MetGly: 1.022 ± 0.536
MetHis: 0.0 ± 0.0
MetIle: 2.044 ± 0.922
MetLys: 1.363 ± 0.57
MetLeu: 3.748 ± 1.575
MetMet: 0.341 ± 0.415
MetAsn: 2.044 ± 1.068
MetPro: 0.0 ± 0.0
MetGln: 1.022 ± 0.729
MetArg: 0.341 ± 0.425
MetSer: 1.363 ± 0.467
MetThr: 2.385 ± 1.022
MetVal: 2.726 ± 0.977
MetTrp: 0.0 ± 0.0
MetTyr: 0.681 ± 0.443
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.089 ± 1.237
AsnCys: 1.022 ± 1.027
AsnAsp: 3.407 ± 1.199
AsnGlu: 5.792 ± 1.372
AsnPhe: 1.704 ± 0.718
AsnGly: 6.133 ± 1.237
AsnHis: 1.022 ± 0.866
AsnIle: 3.407 ± 1.231
AsnLys: 7.836 ± 1.969
AsnLeu: 4.089 ± 1.121
AsnMet: 1.363 ± 0.974
AsnAsn: 4.77 ± 0.969
AsnPro: 1.022 ± 0.438
AsnGln: 2.385 ± 0.981
AsnArg: 1.704 ± 0.582
AsnSer: 5.451 ± 1.29
AsnThr: 5.792 ± 1.389
AsnVal: 2.044 ± 0.888
AsnTrp: 1.022 ± 0.541
AsnTyr: 2.726 ± 0.882
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.363 ± 0.665
ProCys: 0.0 ± 0.0
ProAsp: 0.341 ± 0.282
ProGlu: 2.044 ± 0.688
ProPhe: 0.681 ± 0.444
ProGly: 0.341 ± 0.342
ProHis: 0.0 ± 0.0
ProIle: 1.363 ± 0.653
ProLys: 2.726 ± 1.349
ProLeu: 1.363 ± 0.689
ProMet: 0.681 ± 0.495
ProAsn: 2.044 ± 0.926
ProPro: 0.341 ± 0.356
ProGln: 0.341 ± 0.387
ProArg: 1.022 ± 0.52
ProSer: 0.681 ± 0.522
ProThr: 1.363 ± 0.59
ProVal: 1.704 ± 0.57
ProTrp: 0.0 ± 0.0
ProTyr: 1.704 ± 0.638
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.066 ± 1.271
GlnCys: 0.341 ± 0.342
GlnAsp: 1.022 ± 0.59
GlnGlu: 3.407 ± 1.197
GlnPhe: 1.022 ± 0.388
GlnGly: 1.022 ± 0.502
GlnHis: 1.363 ± 0.695
GlnIle: 3.407 ± 0.988
GlnLys: 3.748 ± 1.08
GlnLeu: 6.814 ± 1.06
GlnMet: 1.704 ± 0.579
GlnAsn: 0.681 ± 0.447
GlnPro: 0.681 ± 0.421
GlnGln: 2.385 ± 0.944
GlnArg: 0.681 ± 0.479
GlnSer: 3.066 ± 0.825
GlnThr: 0.681 ± 0.475
GlnVal: 1.363 ± 0.71
GlnTrp: 0.681 ± 0.453
GlnTyr: 2.044 ± 0.508
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.681 ± 0.603
ArgCys: 0.0 ± 0.0
ArgAsp: 3.066 ± 1.097
ArgGlu: 2.726 ± 0.797
ArgPhe: 2.385 ± 0.885
ArgGly: 1.022 ± 0.714
ArgHis: 0.681 ± 0.408
ArgIle: 3.407 ± 1.198
ArgLys: 4.429 ± 1.041
ArgLeu: 4.77 ± 0.99
ArgMet: 0.341 ± 0.415
ArgAsn: 2.044 ± 1.138
ArgPro: 0.341 ± 0.282
ArgGln: 3.748 ± 1.184
ArgArg: 1.363 ± 0.762
ArgSer: 1.363 ± 0.75
ArgThr: 4.429 ± 0.963
ArgVal: 0.681 ± 0.564
ArgTrp: 0.341 ± 0.342
ArgTyr: 2.385 ± 0.913
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.044 ± 1.285
SerCys: 0.341 ± 0.321
SerAsp: 6.474 ± 2.024
SerGlu: 5.111 ± 1.053
SerPhe: 2.044 ± 0.716
SerGly: 1.704 ± 0.738
SerHis: 0.341 ± 0.321
SerIle: 3.066 ± 0.982
SerLys: 5.792 ± 1.64
SerLeu: 5.451 ± 1.456
SerMet: 1.363 ± 0.741
SerAsn: 6.133 ± 1.268
SerPro: 1.704 ± 0.675
SerGln: 2.044 ± 0.689
SerArg: 3.066 ± 0.896
SerSer: 3.407 ± 1.731
SerThr: 2.726 ± 0.723
SerVal: 2.726 ± 0.685
SerTrp: 0.681 ± 0.361
SerTyr: 4.429 ± 1.355
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.066 ± 1.043
ThrCys: 0.341 ± 0.415
ThrAsp: 4.089 ± 1.204
ThrGlu: 7.496 ± 1.917
ThrPhe: 2.385 ± 0.856
ThrGly: 2.726 ± 0.917
ThrHis: 0.681 ± 0.42
ThrIle: 5.451 ± 1.18
ThrLys: 5.111 ± 1.155
ThrLeu: 6.814 ± 1.366
ThrMet: 0.341 ± 0.306
ThrAsn: 2.726 ± 0.819
ThrPro: 2.044 ± 0.744
ThrGln: 1.363 ± 0.79
ThrArg: 3.066 ± 0.839
ThrSer: 1.704 ± 0.935
ThrThr: 6.474 ± 2.65
ThrVal: 4.77 ± 1.249
ThrTrp: 0.341 ± 0.339
ThrTyr: 1.363 ± 0.614
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.066 ± 0.745
ValCys: 0.341 ± 0.342
ValAsp: 1.363 ± 0.541
ValGlu: 3.407 ± 1.196
ValPhe: 3.407 ± 0.993
ValGly: 2.385 ± 0.849
ValHis: 0.681 ± 0.423
ValIle: 7.155 ± 1.346
ValLys: 7.155 ± 2.179
ValLeu: 7.155 ± 1.855
ValMet: 1.363 ± 0.535
ValAsn: 3.407 ± 1.268
ValPro: 1.704 ± 0.472
ValGln: 1.704 ± 0.688
ValArg: 1.704 ± 0.816
ValSer: 4.089 ± 1.17
ValThr: 3.066 ± 0.906
ValVal: 1.022 ± 0.53
ValTrp: 0.341 ± 0.415
ValTyr: 2.385 ± 0.869
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.363 ± 0.526
TrpCys: 0.0 ± 0.0
TrpAsp: 0.681 ± 0.448
TrpGlu: 1.363 ± 0.643
TrpPhe: 0.0 ± 0.0
TrpGly: 0.341 ± 0.282
TrpHis: 0.0 ± 0.0
TrpIle: 0.341 ± 0.321
TrpLys: 0.341 ± 0.342
TrpLeu: 1.704 ± 0.857
TrpMet: 0.341 ± 0.366
TrpAsn: 0.681 ± 0.542
TrpPro: 0.0 ± 0.0
TrpGln: 0.341 ± 0.356
TrpArg: 0.341 ± 0.34
TrpSer: 0.341 ± 0.306
TrpThr: 0.0 ± 0.0
TrpVal: 2.044 ± 0.731
TrpTrp: 0.681 ± 0.493
TrpTyr: 0.341 ± 0.282
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.341 ± 0.482
TyrCys: 0.341 ± 0.282
TyrAsp: 1.704 ± 0.717
TyrGlu: 3.066 ± 1.066
TyrPhe: 2.385 ± 0.613
TyrGly: 5.111 ± 1.487
TyrHis: 1.022 ± 0.406
TyrIle: 3.066 ± 1.23
TyrLys: 5.111 ± 1.401
TyrLeu: 5.451 ± 1.241
TyrMet: 1.022 ± 0.816
TyrAsn: 4.089 ± 1.143
TyrPro: 0.681 ± 0.494
TyrGln: 1.022 ± 0.554
TyrArg: 3.748 ± 1.185
TyrSer: 3.066 ± 1.056
TyrThr: 2.726 ± 0.531
TyrVal: 1.704 ± 0.547
TyrTrp: 0.341 ± 0.342
TyrTyr: 1.363 ± 0.718
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 19 proteins (2936 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
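A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. Taking the error to be the sample standard deviation over 100 resamples is an assumption, as are the counting rules; the helper names pair_permille and bootstrap_error and the toy sequences are hypothetical and not part of Proteome-pI.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over adjacent pairs in all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKLLE", "MEEK", "AAKL"]
print(f"KL: {pair_permille(toy, 'KL'):.3f} ± {bootstrap_error(toy, 'KL'):.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.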

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski