Amino acid dipeptide frequency for Streptococcus phage Javan383

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins (with all amino acids equally frequent), each of the 400 dipeptides would be present with a frequency of 1/400 = 0.25%. As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. On this scale the uniform expectation of 0.25% corresponds to 2.5‰.
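The calculation described above can be sketched in a few lines of Python. This is only a minimal illustration, not the code used to build the table: the function name `dipeptide_permille`, the use of one-letter codes, the mapping of every non-standard residue to `X` (Xaa), and the choice to normalize by the total number of overlapping dipeptides are all assumptions. The page does not state how the database counts or normalizes dipeptides.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: counting and normalization are assumptions.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
EXPECTED_PERMILLE = 1000.0 / 400

# Toy input, not real phage proteins.
freqs = dipeptide_permille(["MKKLAE", "MKXLA"])
over = {p for p, f in freqs.items() if f > EXPECTED_PERMILLE}   # would be red
under = {p for p, f in freqs.items() if f < EXPECTED_PERMILLE}  # would be blue
```

With only two toy sequences every observed dipeptide is far above 2.5‰; the comparison against the uniform expectation only becomes meaningful for a whole proteome.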

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.284 ± 0.197
AlaCys: 0.284 ± 0.215
AlaAsp: 4.076 ± 0.698
AlaGlu: 5.119 ± 0.958
AlaPhe: 3.602 ± 0.542
AlaGly: 4.55 ± 0.734
AlaHis: 0.664 ± 0.271
AlaIle: 5.877 ± 1.029
AlaLys: 4.835 ± 0.695
AlaLeu: 4.835 ± 0.794
AlaMet: 2.559 ± 0.624
AlaAsn: 4.076 ± 0.744
AlaPro: 1.896 ± 0.584
AlaGln: 1.612 ± 0.418
AlaArg: 2.465 ± 0.622
AlaSer: 4.266 ± 0.558
AlaThr: 3.697 ± 0.639
AlaVal: 4.171 ± 0.642
AlaTrp: 0.948 ± 0.32
AlaTyr: 3.507 ± 0.52
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.19 ± 0.141
CysCys: 0.0 ± 0.0
CysAsp: 0.284 ± 0.159
CysGlu: 0.474 ± 0.222
CysPhe: 0.284 ± 0.173
CysGly: 0.474 ± 0.239
CysHis: 0.0 ± 0.0
CysIle: 0.19 ± 0.134
CysLys: 0.284 ± 0.173
CysLeu: 0.569 ± 0.252
CysMet: 0.0 ± 0.0
CysAsn: 0.379 ± 0.197
CysPro: 0.19 ± 0.137
CysGln: 0.095 ± 0.091
CysArg: 0.19 ± 0.133
CysSer: 0.284 ± 0.173
CysThr: 0.19 ± 0.144
CysVal: 0.284 ± 0.163
CysTrp: 0.095 ± 0.101
CysTyr: 0.379 ± 0.246
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.749 ± 0.429
AspCys: 0.284 ± 0.185
AspAsp: 4.076 ± 0.649
AspGlu: 4.55 ± 0.725
AspPhe: 3.128 ± 0.558
AspGly: 5.403 ± 0.854
AspHis: 0.664 ± 0.234
AspIle: 5.403 ± 0.778
AspLys: 5.403 ± 0.99
AspLeu: 5.403 ± 0.696
AspMet: 1.327 ± 0.392
AspAsn: 4.929 ± 0.864
AspPro: 1.232 ± 0.332
AspGln: 1.138 ± 0.342
AspArg: 2.749 ± 0.459
AspSer: 3.223 ± 0.524
AspThr: 3.318 ± 0.655
AspVal: 3.128 ± 0.49
AspTrp: 1.138 ± 0.403
AspTyr: 3.887 ± 0.882
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.403 ± 0.884
GluCys: 0.19 ± 0.127
GluAsp: 3.413 ± 0.709
GluGlu: 5.214 ± 0.963
GluPhe: 2.465 ± 0.436
GluGly: 2.749 ± 0.515
GluHis: 1.138 ± 0.343
GluIle: 6.351 ± 0.826
GluLys: 6.636 ± 0.949
GluLeu: 8.058 ± 0.851
GluMet: 1.896 ± 0.403
GluAsn: 4.266 ± 0.645
GluPro: 1.896 ± 0.434
GluGln: 2.844 ± 0.56
GluArg: 3.318 ± 0.56
GluSer: 3.697 ± 0.665
GluThr: 4.171 ± 0.831
GluVal: 5.119 ± 0.893
GluTrp: 0.569 ± 0.228
GluTyr: 2.465 ± 0.471
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.844 ± 0.679
PheCys: 0.19 ± 0.141
PheAsp: 4.361 ± 0.75
PheGlu: 3.033 ± 0.607
PhePhe: 1.232 ± 0.357
PheGly: 3.413 ± 0.633
PheHis: 0.095 ± 0.091
PheIle: 3.792 ± 0.628
PheLys: 3.792 ± 0.474
PheLeu: 2.749 ± 0.491
PheMet: 1.043 ± 0.279
PheAsn: 2.844 ± 0.472
PhePro: 1.138 ± 0.322
PheGln: 0.758 ± 0.288
PheArg: 1.517 ± 0.304
PheSer: 2.559 ± 0.444
PheThr: 2.559 ± 0.386
PheVal: 1.896 ± 0.495
PheTrp: 0.569 ± 0.268
PheTyr: 1.991 ± 0.543
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.981 ± 0.652
GlyCys: 0.379 ± 0.226
GlyAsp: 4.171 ± 0.557
GlyGlu: 3.033 ± 0.609
GlyPhe: 2.939 ± 0.508
GlyGly: 5.783 ± 1.169
GlyHis: 1.327 ± 0.415
GlyIle: 5.309 ± 1.326
GlyLys: 4.455 ± 0.717
GlyLeu: 5.688 ± 0.704
GlyMet: 2.275 ± 0.527
GlyAsn: 3.033 ± 0.537
GlyPro: 1.043 ± 0.305
GlyGln: 2.939 ± 0.531
GlyArg: 1.612 ± 0.414
GlySer: 4.645 ± 1.106
GlyThr: 5.214 ± 0.885
GlyVal: 5.403 ± 0.643
GlyTrp: 0.948 ± 0.281
GlyTyr: 3.318 ± 0.66
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.043 ± 0.409
HisCys: 0.284 ± 0.173
HisAsp: 0.758 ± 0.245
HisGlu: 1.138 ± 0.332
HisPhe: 0.284 ± 0.149
HisGly: 0.948 ± 0.283
HisHis: 0.095 ± 0.088
HisIle: 0.853 ± 0.248
HisLys: 0.853 ± 0.279
HisLeu: 1.043 ± 0.272
HisMet: 0.095 ± 0.116
HisAsn: 0.569 ± 0.227
HisPro: 0.474 ± 0.213
HisGln: 0.569 ± 0.214
HisArg: 0.379 ± 0.191
HisSer: 0.664 ± 0.287
HisThr: 0.853 ± 0.316
HisVal: 1.232 ± 0.416
HisTrp: 0.095 ± 0.088
HisTyr: 0.379 ± 0.212
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.593 ± 0.799
IleCys: 0.474 ± 0.243
IleAsp: 4.645 ± 0.657
IleGlu: 6.73 ± 0.922
IlePhe: 4.171 ± 0.671
IleGly: 4.74 ± 0.759
IleHis: 0.948 ± 0.292
IleIle: 4.455 ± 0.808
IleLys: 7.584 ± 0.983
IleLeu: 4.455 ± 0.626
IleMet: 2.18 ± 0.43
IleAsn: 5.783 ± 0.858
IlePro: 2.465 ± 0.509
IleGln: 2.654 ± 0.446
IleArg: 1.612 ± 0.447
IleSer: 6.351 ± 0.758
IleThr: 5.877 ± 0.957
IleVal: 4.455 ± 0.77
IleTrp: 0.664 ± 0.22
IleTyr: 2.37 ± 0.428
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.351 ± 0.821
LysCys: 0.19 ± 0.143
LysAsp: 4.835 ± 0.862
LysGlu: 7.584 ± 0.975
LysPhe: 2.939 ± 0.557
LysGly: 5.119 ± 0.733
LysHis: 0.948 ± 0.305
LysIle: 6.73 ± 0.989
LysLys: 6.446 ± 1.252
LysLeu: 6.162 ± 0.74
LysMet: 2.939 ± 0.591
LysAsn: 4.266 ± 0.642
LysPro: 2.37 ± 0.451
LysGln: 3.697 ± 0.644
LysArg: 4.74 ± 0.739
LysSer: 4.076 ± 0.525
LysThr: 5.309 ± 0.689
LysVal: 5.877 ± 0.798
LysTrp: 1.422 ± 0.376
LysTyr: 4.076 ± 0.524
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.929 ± 0.806
LeuCys: 0.758 ± 0.248
LeuAsp: 4.171 ± 0.636
LeuGlu: 6.351 ± 0.738
LeuPhe: 3.318 ± 0.616
LeuGly: 5.309 ± 0.838
LeuHis: 1.043 ± 0.266
LeuIle: 5.309 ± 0.745
LeuLys: 6.825 ± 0.942
LeuLeu: 6.351 ± 0.677
LeuMet: 1.801 ± 0.534
LeuAsn: 6.541 ± 0.921
LeuPro: 2.37 ± 0.406
LeuGln: 2.844 ± 0.585
LeuArg: 2.844 ± 0.605
LeuSer: 5.688 ± 0.607
LeuThr: 4.929 ± 0.757
LeuVal: 5.498 ± 0.647
LeuTrp: 0.758 ± 0.233
LeuTyr: 2.939 ± 0.563
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.18 ± 0.392
MetCys: 0.19 ± 0.146
MetAsp: 1.517 ± 0.316
MetGlu: 1.706 ± 0.378
MetPhe: 0.948 ± 0.319
MetGly: 0.948 ± 0.317
MetHis: 0.19 ± 0.14
MetIle: 1.612 ± 0.391
MetLys: 2.844 ± 0.581
MetLeu: 2.465 ± 0.435
MetMet: 0.474 ± 0.208
MetAsn: 2.37 ± 0.477
MetPro: 0.664 ± 0.273
MetGln: 1.517 ± 0.423
MetArg: 0.758 ± 0.288
MetSer: 1.517 ± 0.36
MetThr: 1.517 ± 0.421
MetVal: 1.422 ± 0.451
MetTrp: 0.19 ± 0.14
MetTyr: 0.664 ± 0.267
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.981 ± 0.681
AsnCys: 0.19 ± 0.123
AsnAsp: 4.076 ± 0.588
AsnGlu: 3.981 ± 0.718
AsnPhe: 1.896 ± 0.51
AsnGly: 4.74 ± 0.873
AsnHis: 1.327 ± 0.363
AsnIle: 4.361 ± 0.605
AsnLys: 4.929 ± 0.577
AsnLeu: 4.266 ± 0.645
AsnMet: 1.517 ± 0.36
AsnAsn: 4.076 ± 0.71
AsnPro: 2.275 ± 0.537
AsnGln: 2.844 ± 0.405
AsnArg: 2.559 ± 0.591
AsnSer: 3.792 ± 0.564
AsnThr: 3.033 ± 0.798
AsnVal: 3.318 ± 0.709
AsnTrp: 1.043 ± 0.386
AsnTyr: 2.844 ± 0.744
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.327 ± 0.394
ProCys: 0.0 ± 0.0
ProAsp: 2.275 ± 0.599
ProGlu: 1.517 ± 0.481
ProPhe: 1.232 ± 0.379
ProGly: 0.664 ± 0.27
ProHis: 0.569 ± 0.219
ProIle: 2.275 ± 0.523
ProLys: 2.654 ± 0.584
ProLeu: 1.896 ± 0.399
ProMet: 0.853 ± 0.304
ProAsn: 1.327 ± 0.518
ProPro: 0.474 ± 0.19
ProGln: 0.948 ± 0.438
ProArg: 1.422 ± 0.526
ProSer: 2.465 ± 0.681
ProThr: 2.37 ± 0.401
ProVal: 1.896 ± 0.344
ProTrp: 0.19 ± 0.102
ProTyr: 0.853 ± 0.239
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.939 ± 0.55
GlnCys: 0.284 ± 0.142
GlnAsp: 0.948 ± 0.263
GlnGlu: 2.749 ± 0.569
GlnPhe: 1.043 ± 0.32
GlnGly: 2.939 ± 0.417
GlnHis: 0.284 ± 0.167
GlnIle: 2.654 ± 0.551
GlnLys: 3.792 ± 0.762
GlnLeu: 3.223 ± 0.604
GlnMet: 0.379 ± 0.188
GlnAsn: 1.706 ± 0.429
GlnPro: 0.569 ± 0.223
GlnGln: 2.654 ± 0.842
GlnArg: 1.138 ± 0.376
GlnSer: 2.18 ± 0.438
GlnThr: 2.939 ± 0.679
GlnVal: 2.465 ± 0.551
GlnTrp: 0.379 ± 0.213
GlnTyr: 2.086 ± 0.471
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.801 ± 0.413
ArgCys: 0.19 ± 0.101
ArgAsp: 2.37 ± 0.424
ArgGlu: 1.706 ± 0.424
ArgPhe: 1.232 ± 0.373
ArgGly: 1.991 ± 0.509
ArgHis: 0.569 ± 0.21
ArgIle: 2.559 ± 0.506
ArgLys: 3.507 ± 0.753
ArgLeu: 4.455 ± 0.692
ArgMet: 1.896 ± 0.447
ArgAsn: 1.801 ± 0.518
ArgPro: 1.138 ± 0.331
ArgGln: 1.612 ± 0.422
ArgArg: 0.948 ± 0.269
ArgSer: 1.327 ± 0.319
ArgThr: 1.991 ± 0.455
ArgVal: 1.896 ± 0.494
ArgTrp: 0.19 ± 0.142
ArgTyr: 1.801 ± 0.452
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.024 ± 0.891
SerCys: 0.284 ± 0.163
SerAsp: 5.119 ± 0.93
SerGlu: 4.361 ± 0.561
SerPhe: 3.033 ± 0.643
SerGly: 5.024 ± 0.763
SerHis: 0.758 ± 0.218
SerIle: 5.593 ± 0.724
SerLys: 5.309 ± 0.613
SerLeu: 5.024 ± 0.643
SerMet: 1.043 ± 0.41
SerAsn: 3.413 ± 0.583
SerPro: 1.232 ± 0.376
SerGln: 1.612 ± 0.47
SerArg: 1.327 ± 0.442
SerSer: 3.318 ± 0.595
SerThr: 3.602 ± 0.583
SerVal: 3.128 ± 0.486
SerTrp: 1.138 ± 0.405
SerTyr: 2.086 ± 0.488
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.645 ± 0.528
ThrCys: 0.0 ± 0.0
ThrAsp: 3.981 ± 0.741
ThrGlu: 3.792 ± 0.624
ThrPhe: 3.697 ± 0.54
ThrGly: 5.214 ± 0.816
ThrHis: 0.758 ± 0.294
ThrIle: 4.929 ± 0.747
ThrLys: 5.593 ± 0.695
ThrLeu: 4.455 ± 0.688
ThrMet: 1.138 ± 0.345
ThrAsn: 2.275 ± 0.46
ThrPro: 2.086 ± 0.426
ThrGln: 2.465 ± 0.524
ThrArg: 1.801 ± 0.554
ThrSer: 3.697 ± 0.567
ThrThr: 4.171 ± 0.83
ThrVal: 4.929 ± 0.872
ThrTrp: 0.758 ± 0.275
ThrTyr: 3.223 ± 0.696
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.929 ± 0.848
ValCys: 0.19 ± 0.168
ValAsp: 4.55 ± 0.624
ValGlu: 5.403 ± 0.784
ValPhe: 2.37 ± 0.434
ValGly: 3.507 ± 0.725
ValHis: 0.569 ± 0.255
ValIle: 4.361 ± 0.585
ValLys: 4.74 ± 0.714
ValLeu: 5.119 ± 0.57
ValMet: 1.232 ± 0.36
ValAsn: 4.645 ± 0.562
ValPro: 2.275 ± 0.493
ValGln: 1.896 ± 0.382
ValArg: 1.896 ± 0.453
ValSer: 4.645 ± 0.585
ValThr: 4.076 ± 0.773
ValVal: 3.697 ± 0.56
ValTrp: 0.758 ± 0.224
ValTyr: 1.706 ± 0.392
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.379 ± 0.18
TrpCys: 0.095 ± 0.088
TrpAsp: 0.569 ± 0.231
TrpGlu: 0.948 ± 0.361
TrpPhe: 0.569 ± 0.185
TrpGly: 0.948 ± 0.3
TrpHis: 0.19 ± 0.139
TrpIle: 0.948 ± 0.29
TrpLys: 1.517 ± 0.393
TrpLeu: 1.232 ± 0.493
TrpMet: 0.0 ± 0.0
TrpAsn: 0.948 ± 0.348
TrpPro: 0.0 ± 0.0
TrpGln: 0.569 ± 0.299
TrpArg: 0.284 ± 0.149
TrpSer: 0.853 ± 0.247
TrpThr: 1.043 ± 0.345
TrpVal: 0.474 ± 0.199
TrpTrp: 0.284 ± 0.144
TrpTyr: 0.569 ± 0.308
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.749 ± 0.563
TyrCys: 0.379 ± 0.207
TyrAsp: 2.939 ± 0.605
TyrGlu: 2.465 ± 0.496
TyrPhe: 1.991 ± 0.5
TyrGly: 3.033 ± 0.511
TyrHis: 0.474 ± 0.202
TyrIle: 4.645 ± 0.896
TyrLys: 4.171 ± 0.581
TyrLeu: 3.128 ± 0.467
TyrMet: 0.948 ± 0.291
TyrAsn: 1.801 ± 0.403
TyrPro: 1.422 ± 0.404
TyrGln: 2.086 ± 0.473
TyrArg: 1.422 ± 0.412
TyrSer: 2.18 ± 0.473
TyrThr: 2.749 ± 0.569
TyrVal: 2.275 ± 0.407
TyrTrp: 0.284 ± 0.189
TyrTyr: 2.086 ± 0.425
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (10550 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
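A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, frequencies are recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates. The helper names `pair_permille` and `bootstrap_error`, the fixed seed, and the normalization by the number of dipeptides are all hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins):
    """Overlapping dipeptide frequencies in per mille (assumed normalization)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}

def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Standard deviation of each dipeptide frequency over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so resampling happens at the protein level, not the residue level.
    Requires n_rounds >= 2.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(resampled))
    # A dipeptide missing from a replicate has frequency 0 there.
    pairs = set().union(*replicates)
    return {
        p: statistics.stdev([rep.get(p, 0.0) for rep in replicates])
        for p in pairs
    }

# Toy input, not real phage proteins.
errors = bootstrap_error(["MKKLAE", "MKLAE", "AEAEK"])
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the estimate reflects protein-to-protein variation in composition.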

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski