Amino acid dipepetide frequency for Streptococcus satellite phage Javan43

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
1.107AlaCys: 1.107 ± 0.544
4.428AlaAsp: 4.428 ± 1.094
4.982AlaGlu: 4.982 ± 1.353
1.661AlaPhe: 1.661 ± 0.524
1.107AlaGly: 1.107 ± 0.584
0.83AlaHis: 0.83 ± 0.474
6.643AlaIle: 6.643 ± 1.207
3.875AlaLys: 3.875 ± 0.997
4.705AlaLeu: 4.705 ± 1.072
1.107AlaMet: 1.107 ± 0.623
4.428AlaAsn: 4.428 ± 1.11
1.384AlaPro: 1.384 ± 0.518
1.937AlaGln: 1.937 ± 0.793
1.937AlaArg: 1.937 ± 0.563
1.937AlaSer: 1.937 ± 0.685
4.428AlaThr: 4.428 ± 0.826
2.768AlaVal: 2.768 ± 0.882
0.83AlaTrp: 0.83 ± 0.407
3.045AlaTyr: 3.045 ± 0.87
0.0AlaXaa: 0.0 ± 0.0
Cys
0.277CysAla: 0.277 ± 0.3
0.0CysCys: 0.0 ± 0.0
0.83CysAsp: 0.83 ± 0.469
0.277CysGlu: 0.277 ± 0.323
0.0CysPhe: 0.0 ± 0.0
0.277CysGly: 0.277 ± 0.323
0.554CysHis: 0.554 ± 0.451
0.277CysIle: 0.277 ± 0.269
0.277CysLys: 0.277 ± 0.217
0.83CysLeu: 0.83 ± 0.449
0.554CysMet: 0.554 ± 0.396
1.384CysAsn: 1.384 ± 0.596
0.277CysPro: 0.277 ± 0.323
0.554CysGln: 0.554 ± 0.646
0.554CysArg: 0.554 ± 0.34
0.554CysSer: 0.554 ± 0.364
0.277CysThr: 0.277 ± 0.266
0.554CysVal: 0.554 ± 0.359
0.277CysTrp: 0.277 ± 0.294
0.277CysTyr: 0.277 ± 0.3
0.0CysXaa: 0.0 ± 0.0
Asp
1.107AspAla: 1.107 ± 0.583
0.83AspCys: 0.83 ± 0.467
3.045AspAsp: 3.045 ± 1.223
4.982AspGlu: 4.982 ± 0.975
3.875AspPhe: 3.875 ± 1.056
1.384AspGly: 1.384 ± 0.498
0.277AspHis: 0.277 ± 0.266
6.089AspIle: 6.089 ± 1.497
6.919AspLys: 6.919 ± 1.263
6.366AspLeu: 6.366 ± 1.137
1.384AspMet: 1.384 ± 0.763
3.045AspAsn: 3.045 ± 0.998
1.107AspPro: 1.107 ± 0.654
1.107AspGln: 1.107 ± 0.776
3.045AspArg: 3.045 ± 0.809
3.045AspSer: 3.045 ± 0.869
3.598AspThr: 3.598 ± 1.33
1.937AspVal: 1.937 ± 0.768
0.554AspTrp: 0.554 ± 0.315
3.875AspTyr: 3.875 ± 1.146
0.0AspXaa: 0.0 ± 0.0
Glu
5.259GluAla: 5.259 ± 1.004
0.83GluCys: 0.83 ± 0.745
4.428GluAsp: 4.428 ± 1.283
4.982GluGlu: 4.982 ± 1.405
2.768GluPhe: 2.768 ± 0.813
3.321GluGly: 3.321 ± 0.974
1.661GluHis: 1.661 ± 0.954
6.089GluIle: 6.089 ± 1.274
6.919GluLys: 6.919 ± 1.027
10.518GluLeu: 10.518 ± 1.667
1.937GluMet: 1.937 ± 0.701
2.491GluAsn: 2.491 ± 0.685
1.107GluPro: 1.107 ± 0.452
3.875GluGln: 3.875 ± 1.271
3.321GluArg: 3.321 ± 0.732
3.875GluSer: 3.875 ± 1.085
4.428GluThr: 4.428 ± 0.931
3.045GluVal: 3.045 ± 1.093
1.107GluTrp: 1.107 ± 0.479
5.259GluTyr: 5.259 ± 1.133
0.0GluXaa: 0.0 ± 0.0
Phe
1.661PheAla: 1.661 ± 0.635
0.83PheCys: 0.83 ± 0.447
2.768PheAsp: 2.768 ± 0.719
3.321PheGlu: 3.321 ± 0.814
1.937PhePhe: 1.937 ± 0.633
1.384PheGly: 1.384 ± 0.543
0.83PheHis: 0.83 ± 0.384
5.536PheIle: 5.536 ± 1.538
4.705PheLys: 4.705 ± 0.906
4.152PheLeu: 4.152 ± 0.954
0.554PheMet: 0.554 ± 0.49
3.875PheAsn: 3.875 ± 0.863
1.661PhePro: 1.661 ± 0.914
1.937PheGln: 1.937 ± 0.656
2.214PheArg: 2.214 ± 0.806
3.875PheSer: 3.875 ± 0.749
3.045PheThr: 3.045 ± 1.053
1.384PheVal: 1.384 ± 0.7
0.277PheTrp: 0.277 ± 0.217
2.214PheTyr: 2.214 ± 0.66
0.0PheXaa: 0.0 ± 0.0
Gly
2.768GlyAla: 2.768 ± 1.105
0.554GlyCys: 0.554 ± 0.338
1.661GlyAsp: 1.661 ± 0.898
1.661GlyGlu: 1.661 ± 0.728
3.598GlyPhe: 3.598 ± 0.933
1.661GlyGly: 1.661 ± 0.576
0.554GlyHis: 0.554 ± 0.345
4.152GlyIle: 4.152 ± 0.884
3.045GlyLys: 3.045 ± 1.025
4.982GlyLeu: 4.982 ± 1.302
0.277GlyMet: 0.277 ± 0.265
2.491GlyAsn: 2.491 ± 0.884
0.0GlyPro: 0.0 ± 0.0
0.277GlyGln: 0.277 ± 0.264
1.937GlyArg: 1.937 ± 0.714
2.491GlySer: 2.491 ± 1.17
3.321GlyThr: 3.321 ± 1.054
2.768GlyVal: 2.768 ± 1.212
0.554GlyTrp: 0.554 ± 0.435
1.937GlyTyr: 1.937 ± 0.751
0.0GlyXaa: 0.0 ± 0.0
His
1.937HisAla: 1.937 ± 0.781
0.0HisCys: 0.0 ± 0.0
0.277HisAsp: 0.277 ± 0.323
0.83HisGlu: 0.83 ± 0.439
1.661HisPhe: 1.661 ± 0.574
0.554HisGly: 0.554 ± 0.354
0.0HisHis: 0.0 ± 0.0
0.83HisIle: 0.83 ± 0.431
1.384HisLys: 1.384 ± 0.681
0.83HisLeu: 0.83 ± 0.413
0.277HisMet: 0.277 ± 0.29
1.107HisAsn: 1.107 ± 0.739
0.277HisPro: 0.277 ± 0.299
1.107HisGln: 1.107 ± 0.801
0.554HisArg: 0.554 ± 0.349
0.83HisSer: 0.83 ± 0.425
1.384HisThr: 1.384 ± 0.661
0.554HisVal: 0.554 ± 0.421
0.83HisTrp: 0.83 ± 0.658
2.491HisTyr: 2.491 ± 0.776
0.0HisXaa: 0.0 ± 0.0
Ile
5.536IleAla: 5.536 ± 1.215
1.107IleCys: 1.107 ± 0.669
5.812IleAsp: 5.812 ± 1.061
6.089IleGlu: 6.089 ± 1.446
4.705IlePhe: 4.705 ± 0.958
2.768IleGly: 2.768 ± 0.785
0.83IleHis: 0.83 ± 0.491
4.705IleIle: 4.705 ± 0.841
9.134IleLys: 9.134 ± 1.599
5.812IleLeu: 5.812 ± 1.158
1.937IleMet: 1.937 ± 0.722
5.536IleAsn: 5.536 ± 1.413
3.045IlePro: 3.045 ± 1.032
1.937IleGln: 1.937 ± 0.891
2.768IleArg: 2.768 ± 0.91
5.259IleSer: 5.259 ± 1.731
5.812IleThr: 5.812 ± 1.091
3.321IleVal: 3.321 ± 0.839
0.0IleTrp: 0.0 ± 0.0
4.428IleTyr: 4.428 ± 1.151
0.0IleXaa: 0.0 ± 0.0
Lys
8.303LysAla: 8.303 ± 1.763
0.554LysCys: 0.554 ± 0.439
4.428LysAsp: 4.428 ± 1.309
8.303LysGlu: 8.303 ± 1.506
4.152LysPhe: 4.152 ± 1.036
3.321LysGly: 3.321 ± 0.861
2.768LysHis: 2.768 ± 0.843
5.259LysIle: 5.259 ± 1.167
9.134LysLys: 9.134 ± 1.643
6.919LysLeu: 6.919 ± 1.375
1.661LysMet: 1.661 ± 0.708
4.982LysAsn: 4.982 ± 1.176
4.152LysPro: 4.152 ± 1.276
3.321LysGln: 3.321 ± 1.204
4.428LysArg: 4.428 ± 1.097
4.982LysSer: 4.982 ± 1.753
4.982LysThr: 4.982 ± 1.229
5.812LysVal: 5.812 ± 1.278
0.554LysTrp: 0.554 ± 0.425
6.089LysTyr: 6.089 ± 1.551
0.0LysXaa: 0.0 ± 0.0
Leu
5.536LeuAla: 5.536 ± 1.602
0.83LeuCys: 0.83 ± 0.481
6.366LeuAsp: 6.366 ± 1.184
10.518LeuGlu: 10.518 ± 1.996
4.428LeuPhe: 4.428 ± 1.151
5.259LeuGly: 5.259 ± 1.367
1.384LeuHis: 1.384 ± 0.567
7.473LeuIle: 7.473 ± 1.624
8.027LeuLys: 8.027 ± 1.729
7.75LeuLeu: 7.75 ± 1.435
3.045LeuMet: 3.045 ± 0.756
6.366LeuAsn: 6.366 ± 1.802
3.598LeuPro: 3.598 ± 1.283
2.491LeuGln: 2.491 ± 0.856
2.768LeuArg: 2.768 ± 1.155
7.473LeuSer: 7.473 ± 1.166
3.045LeuThr: 3.045 ± 0.953
4.152LeuVal: 4.152 ± 1.016
0.83LeuTrp: 0.83 ± 0.5
3.598LeuTyr: 3.598 ± 0.585
0.0LeuXaa: 0.0 ± 0.0
Met
1.384MetAla: 1.384 ± 0.636
0.277MetCys: 0.277 ± 0.32
1.937MetAsp: 1.937 ± 1.177
1.384MetGlu: 1.384 ± 0.558
0.554MetPhe: 0.554 ± 0.414
0.83MetGly: 0.83 ± 0.378
0.0MetHis: 0.0 ± 0.0
1.107MetIle: 1.107 ± 0.514
2.768MetLys: 2.768 ± 0.77
2.491MetLeu: 2.491 ± 0.798
0.0MetMet: 0.0 ± 0.0
1.661MetAsn: 1.661 ± 0.747
0.554MetPro: 0.554 ± 0.432
0.554MetGln: 0.554 ± 0.337
1.937MetArg: 1.937 ± 0.655
0.83MetSer: 0.83 ± 0.46
2.768MetThr: 2.768 ± 0.984
0.554MetVal: 0.554 ± 0.287
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.045AsnAla: 3.045 ± 0.809
0.0AsnCys: 0.0 ± 0.0
3.321AsnAsp: 3.321 ± 0.951
4.705AsnGlu: 4.705 ± 0.948
1.384AsnPhe: 1.384 ± 0.632
4.428AsnGly: 4.428 ± 1.109
1.384AsnHis: 1.384 ± 0.589
3.875AsnIle: 3.875 ± 1.086
4.705AsnLys: 4.705 ± 0.889
4.152AsnLeu: 4.152 ± 0.897
1.937AsnMet: 1.937 ± 0.83
2.214AsnAsn: 2.214 ± 0.812
2.768AsnPro: 2.768 ± 0.662
5.536AsnGln: 5.536 ± 1.557
3.321AsnArg: 3.321 ± 0.608
2.768AsnSer: 2.768 ± 0.647
3.045AsnThr: 3.045 ± 1.245
3.045AsnVal: 3.045 ± 0.819
0.554AsnTrp: 0.554 ± 0.357
3.045AsnTyr: 3.045 ± 0.883
0.0AsnXaa: 0.0 ± 0.0
Pro
1.661ProAla: 1.661 ± 0.519
0.277ProCys: 0.277 ± 0.278
1.661ProAsp: 1.661 ± 0.583
1.937ProGlu: 1.937 ± 0.923
1.937ProPhe: 1.937 ± 0.674
0.554ProGly: 0.554 ± 0.452
0.0ProHis: 0.0 ± 0.0
2.768ProIle: 2.768 ± 0.926
5.259ProLys: 5.259 ± 1.635
2.768ProLeu: 2.768 ± 0.724
0.277ProMet: 0.277 ± 0.323
2.768ProAsn: 2.768 ± 0.838
1.107ProPro: 1.107 ± 0.525
1.384ProGln: 1.384 ± 0.699
1.661ProArg: 1.661 ± 0.72
1.661ProSer: 1.661 ± 0.585
1.937ProThr: 1.937 ± 0.598
1.384ProVal: 1.384 ± 0.686
0.277ProTrp: 0.277 ± 0.217
0.83ProTyr: 0.83 ± 0.578
0.0ProXaa: 0.0 ± 0.0
Gln
2.768GlnAla: 2.768 ± 0.648
0.0GlnCys: 0.0 ± 0.0
1.107GlnAsp: 1.107 ± 0.539
3.875GlnGlu: 3.875 ± 1.019
1.107GlnPhe: 1.107 ± 0.551
1.937GlnGly: 1.937 ± 1.107
0.83GlnHis: 0.83 ± 0.415
2.491GlnIle: 2.491 ± 0.655
2.214GlnLys: 2.214 ± 0.732
4.428GlnLeu: 4.428 ± 1.082
0.277GlnMet: 0.277 ± 0.294
3.045GlnAsn: 3.045 ± 1.103
1.937GlnPro: 1.937 ± 0.915
1.661GlnGln: 1.661 ± 0.526
2.214GlnArg: 2.214 ± 0.705
1.937GlnSer: 1.937 ± 0.692
1.384GlnThr: 1.384 ± 0.671
2.491GlnVal: 2.491 ± 0.6
0.277GlnTrp: 0.277 ± 0.217
1.937GlnTyr: 1.937 ± 0.711
0.0GlnXaa: 0.0 ± 0.0
Arg
1.384ArgAla: 1.384 ± 0.623
0.277ArgCys: 0.277 ± 0.323
3.321ArgAsp: 3.321 ± 0.932
4.428ArgGlu: 4.428 ± 0.999
1.937ArgPhe: 1.937 ± 0.561
1.661ArgGly: 1.661 ± 0.842
1.384ArgHis: 1.384 ± 0.494
4.152ArgIle: 4.152 ± 1.081
3.045ArgLys: 3.045 ± 0.774
4.982ArgLeu: 4.982 ± 1.083
0.83ArgMet: 0.83 ± 0.551
2.768ArgAsn: 2.768 ± 0.725
0.83ArgPro: 0.83 ± 0.518
1.384ArgGln: 1.384 ± 0.713
2.768ArgArg: 2.768 ± 1.074
3.045ArgSer: 3.045 ± 0.987
4.428ArgThr: 4.428 ± 1.184
3.045ArgVal: 3.045 ± 0.862
0.554ArgTrp: 0.554 ± 0.401
2.214ArgTyr: 2.214 ± 0.676
0.0ArgXaa: 0.0 ± 0.0
Ser
1.384SerAla: 1.384 ± 0.691
0.554SerCys: 0.554 ± 0.439
4.428SerAsp: 4.428 ± 0.974
6.089SerGlu: 6.089 ± 1.089
1.937SerPhe: 1.937 ± 0.531
1.384SerGly: 1.384 ± 0.492
0.554SerHis: 0.554 ± 0.34
5.812SerIle: 5.812 ± 1.12
5.812SerLys: 5.812 ± 1.287
6.919SerLeu: 6.919 ± 1.055
0.83SerMet: 0.83 ± 0.431
2.214SerAsn: 2.214 ± 0.709
1.107SerPro: 1.107 ± 0.36
1.937SerGln: 1.937 ± 0.977
2.491SerArg: 2.491 ± 0.635
1.937SerSer: 1.937 ± 0.727
4.152SerThr: 4.152 ± 0.942
2.214SerVal: 2.214 ± 0.845
1.107SerTrp: 1.107 ± 0.434
2.768SerTyr: 2.768 ± 1.045
0.0SerXaa: 0.0 ± 0.0
Thr
3.045ThrAla: 3.045 ± 0.845
0.0ThrCys: 0.0 ± 0.0
1.937ThrAsp: 1.937 ± 0.592
3.321ThrGlu: 3.321 ± 0.849
5.259ThrPhe: 5.259 ± 1.623
4.152ThrGly: 4.152 ± 0.912
1.661ThrHis: 1.661 ± 0.722
4.705ThrIle: 4.705 ± 1.499
7.196ThrLys: 7.196 ± 1.132
6.643ThrLeu: 6.643 ± 1.283
1.937ThrMet: 1.937 ± 0.647
1.107ThrAsn: 1.107 ± 0.536
2.491ThrPro: 2.491 ± 0.876
3.045ThrGln: 3.045 ± 1.101
4.152ThrArg: 4.152 ± 0.836
1.937ThrSer: 1.937 ± 0.689
3.875ThrThr: 3.875 ± 1.548
3.598ThrVal: 3.598 ± 1.304
0.277ThrTrp: 0.277 ± 0.264
2.768ThrTyr: 2.768 ± 0.982
0.0ThrXaa: 0.0 ± 0.0
Val
2.491ValAla: 2.491 ± 0.567
0.277ValCys: 0.277 ± 0.217
3.321ValAsp: 3.321 ± 1.127
1.661ValGlu: 1.661 ± 0.663
2.768ValPhe: 2.768 ± 0.859
1.937ValGly: 1.937 ± 0.608
0.83ValHis: 0.83 ± 0.491
5.259ValIle: 5.259 ± 1.159
4.428ValLys: 4.428 ± 1.059
4.705ValLeu: 4.705 ± 1.305
1.107ValMet: 1.107 ± 0.525
3.321ValAsn: 3.321 ± 0.838
2.491ValPro: 2.491 ± 0.942
0.83ValGln: 0.83 ± 0.481
1.107ValArg: 1.107 ± 0.547
3.875ValSer: 3.875 ± 1.035
4.428ValThr: 4.428 ± 1.138
3.045ValVal: 3.045 ± 0.877
0.0ValTrp: 0.0 ± 0.0
0.83ValTyr: 0.83 ± 0.395
0.0ValXaa: 0.0 ± 0.0
Trp
0.554TrpAla: 0.554 ± 0.39
0.0TrpCys: 0.0 ± 0.0
0.83TrpAsp: 0.83 ± 0.429
1.107TrpGlu: 1.107 ± 0.509
0.0TrpPhe: 0.0 ± 0.0
0.277TrpGly: 0.277 ± 0.278
0.277TrpHis: 0.277 ± 0.217
0.554TrpIle: 0.554 ± 0.343
0.554TrpLys: 0.554 ± 0.378
0.83TrpLeu: 0.83 ± 0.46
0.0TrpMet: 0.0 ± 0.0
0.277TrpAsn: 0.277 ± 0.309
0.277TrpPro: 0.277 ± 0.217
0.83TrpGln: 0.83 ± 0.519
0.554TrpArg: 0.554 ± 0.373
0.554TrpSer: 0.554 ± 0.354
0.277TrpThr: 0.277 ± 0.294
0.83TrpVal: 0.83 ± 0.432
0.554TrpTrp: 0.554 ± 0.421
0.554TrpTyr: 0.554 ± 0.365
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.045TyrAla: 3.045 ± 0.912
0.554TyrCys: 0.554 ± 0.349
1.661TyrAsp: 1.661 ± 0.622
2.768TyrGlu: 2.768 ± 0.841
2.768TyrPhe: 2.768 ± 0.81
2.491TyrGly: 2.491 ± 0.836
1.107TyrHis: 1.107 ± 0.574
2.768TyrIle: 2.768 ± 0.836
4.705TyrLys: 4.705 ± 1.519
4.152TyrLeu: 4.152 ± 1.119
1.384TyrMet: 1.384 ± 0.757
4.152TyrAsn: 4.152 ± 1.141
1.937TyrPro: 1.937 ± 0.84
2.214TyrGln: 2.214 ± 0.634
4.705TyrArg: 4.705 ± 1.385
2.768TyrSer: 2.768 ± 1.342
2.491TyrThr: 2.491 ± 0.702
2.214TyrVal: 2.214 ± 0.796
0.277TyrTrp: 0.277 ± 0.323
2.491TyrTyr: 2.491 ± 1.372
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (3614 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski