Amino acid dipeptide frequency for Streptococcus satellite phage Javan591

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
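As a minimal sketch of how such frequencies can be computed, the Python snippet below counts overlapping dipeptides within each protein and compares each frequency with the uniform expectation of 2.5‰. The function name, the toy sequences and the choice to normalise by the total number of dipeptides are assumptions made for illustration; they are not taken from the Proteome-pI pipeline.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; pairs never span two proteins.
        for a, b in zip(seq, seq[1:]):
            # Anything outside the 20 standard letters is pooled as X (Xaa).
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the Javan591 proteome.
toy = ["MKLLEEK", "MLLKE"]
freqs = dipeptide_per_mille(toy)

expected = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely
for pair, value in sorted(freqs.items()):
    label = "over" if value > expected else "under"
    print(f"{pair}: {value:.1f} per mille ({label}-represented)")
```

With only a handful of dipeptides, as in this toy input, every observed pair exceeds 2.5‰; the comparison becomes informative only for realistically sized proteomes.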

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
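To make the unit conversion concrete, the short sketch below turns one per mille value from the table into a fraction and a percentage. The helper name is hypothetical; the LeuLeu value is the one listed in the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


leu_leu = 12.085  # LeuLeu entry from the table, in per mille
fraction = per_mille_to_fraction(leu_leu)
percent = fraction * 100  # the same value expressed in percent
print(f"{leu_leu} per mille = {fraction:.6f} = {percent:.4f}%")
```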

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.863 ± 0.731
AlaAsp: 5.179 ± 1.412
AlaGlu: 6.905 ± 1.811
AlaPhe: 2.158 ± 0.905
AlaGly: 2.59 ± 0.801
AlaHis: 0.863 ± 0.731
AlaIle: 5.611 ± 1.806
AlaLys: 3.021 ± 0.909
AlaLeu: 3.453 ± 1.461
AlaMet: 1.295 ± 0.687
AlaAsn: 3.453 ± 1.318
AlaPro: 2.158 ± 0.859
AlaGln: 4.316 ± 1.297
AlaArg: 3.884 ± 1.376
AlaSer: 5.179 ± 1.457
AlaThr: 2.158 ± 0.914
AlaVal: 3.453 ± 0.873
AlaTrp: 0.432 ± 0.426
AlaTyr: 3.021 ± 0.694
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.432 ± 0.426
CysPhe: 0.0 ± 0.0
CysGly: 0.432 ± 0.43
CysHis: 0.0 ± 0.0
CysIle: 1.726 ± 0.692
CysLys: 0.0 ± 0.0
CysLeu: 0.432 ± 0.467
CysMet: 0.432 ± 0.467
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.432 ± 0.366
CysArg: 0.863 ± 0.548
CysSer: 0.863 ± 0.609
CysThr: 0.0 ± 0.0
CysVal: 0.863 ± 0.564
CysTrp: 0.0 ± 0.0
CysTyr: 0.432 ± 0.366
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.863 ± 0.511
AspCys: 0.863 ± 0.436
AspAsp: 3.021 ± 1.273
AspGlu: 6.474 ± 1.772
AspPhe: 3.884 ± 1.03
AspGly: 2.158 ± 0.69
AspHis: 2.158 ± 1.206
AspIle: 5.611 ± 1.744
AspLys: 5.179 ± 1.457
AspLeu: 9.063 ± 1.705
AspMet: 1.726 ± 1.069
AspAsn: 3.021 ± 1.239
AspPro: 1.295 ± 0.728
AspGln: 1.295 ± 0.558
AspArg: 1.726 ± 0.726
AspSer: 0.432 ± 0.467
AspThr: 3.453 ± 1.425
AspVal: 3.453 ± 1.385
AspTrp: 2.158 ± 1.073
AspTyr: 4.316 ± 1.276
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.474 ± 2.321
GluCys: 1.726 ± 1.124
GluAsp: 4.748 ± 1.39
GluGlu: 9.063 ± 3.235
GluPhe: 3.884 ± 1.43
GluGly: 4.316 ± 1.681
GluHis: 1.295 ± 0.719
GluIle: 4.748 ± 1.74
GluLys: 10.358 ± 2.216
GluLeu: 9.063 ± 2.238
GluMet: 1.295 ± 1.073
GluAsn: 4.316 ± 0.97
GluPro: 2.158 ± 1.131
GluGln: 3.021 ± 1.33
GluArg: 6.474 ± 2.568
GluSer: 3.021 ± 0.826
GluThr: 3.453 ± 1.307
GluVal: 4.748 ± 1.536
GluTrp: 1.295 ± 0.833
GluTyr: 3.453 ± 1.232
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.59 ± 0.775
PheCys: 0.432 ± 0.426
PheAsp: 3.021 ± 0.828
PheGlu: 2.158 ± 1.022
PhePhe: 1.726 ± 1.014
PheGly: 3.021 ± 1.333
PheHis: 0.863 ± 0.548
PheIle: 5.179 ± 1.522
PheLys: 3.453 ± 1.119
PheLeu: 4.316 ± 1.024
PheMet: 0.0 ± 0.0
PheAsn: 2.158 ± 0.897
PhePro: 2.158 ± 0.726
PheGln: 2.158 ± 0.742
PheArg: 1.726 ± 0.814
PheSer: 2.59 ± 1.305
PheThr: 3.453 ± 1.72
PheVal: 0.432 ± 0.343
PheTrp: 0.432 ± 0.343
PheTyr: 2.59 ± 0.957
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.453 ± 0.863
GlyCys: 0.0 ± 0.0
GlyAsp: 3.021 ± 1.405
GlyGlu: 3.453 ± 0.806
GlyPhe: 3.453 ± 1.235
GlyGly: 1.295 ± 0.745
GlyHis: 1.295 ± 0.72
GlyIle: 5.179 ± 1.136
GlyLys: 6.042 ± 1.558
GlyLeu: 4.748 ± 1.583
GlyMet: 1.295 ± 1.097
GlyAsn: 3.453 ± 1.344
GlyPro: 0.0 ± 0.0
GlyGln: 0.863 ± 0.594
GlyArg: 0.863 ± 0.512
GlySer: 0.432 ± 0.366
GlyThr: 2.158 ± 1.071
GlyVal: 2.158 ± 1.252
GlyTrp: 1.726 ± 0.729
GlyTyr: 4.748 ± 0.956
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.726 ± 0.894
HisCys: 0.432 ± 0.366
HisAsp: 0.863 ± 0.519
HisGlu: 0.863 ± 0.533
HisPhe: 1.295 ± 0.73
HisGly: 0.863 ± 0.507
HisHis: 0.432 ± 0.453
HisIle: 2.158 ± 0.783
HisLys: 2.59 ± 1.356
HisLeu: 0.863 ± 0.686
HisMet: 0.0 ± 0.0
HisAsn: 1.726 ± 1.006
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 1.295 ± 0.664
HisSer: 1.726 ± 0.66
HisThr: 1.726 ± 0.594
HisVal: 0.863 ± 0.473
HisTrp: 0.0 ± 0.0
HisTyr: 3.021 ± 1.112
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.179 ± 1.7
IleCys: 0.432 ± 0.467
IleAsp: 2.158 ± 1.063
IleGlu: 6.905 ± 1.834
IlePhe: 1.295 ± 0.644
IleGly: 4.316 ± 1.853
IleHis: 0.863 ± 0.473
IleIle: 5.611 ± 1.405
IleLys: 7.769 ± 1.575
IleLeu: 5.179 ± 1.304
IleMet: 1.295 ± 0.758
IleAsn: 3.884 ± 0.946
IlePro: 3.453 ± 0.955
IleGln: 4.316 ± 1.738
IleArg: 3.453 ± 1.453
IleSer: 4.316 ± 1.049
IleThr: 7.769 ± 1.594
IleVal: 2.59 ± 0.925
IleTrp: 0.863 ± 0.696
IleTyr: 3.453 ± 0.823
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.179 ± 1.042
LysCys: 0.0 ± 0.0
LysAsp: 7.769 ± 1.688
LysGlu: 11.221 ± 2.486
LysPhe: 2.158 ± 1.177
LysGly: 6.042 ± 2.173
LysHis: 1.295 ± 0.877
LysIle: 5.179 ± 1.468
LysLys: 9.495 ± 3.468
LysLeu: 5.179 ± 1.322
LysMet: 3.021 ± 0.967
LysAsn: 3.021 ± 1.401
LysPro: 4.316 ± 1.265
LysGln: 6.474 ± 1.227
LysArg: 5.611 ± 1.622
LysSer: 4.316 ± 1.543
LysThr: 6.474 ± 1.879
LysVal: 2.158 ± 1.028
LysTrp: 0.863 ± 0.564
LysTyr: 3.884 ± 1.191
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.042 ± 1.389
LeuCys: 0.0 ± 0.0
LeuAsp: 10.79 ± 1.772
LeuGlu: 9.495 ± 2.903
LeuPhe: 4.316 ± 1.647
LeuGly: 4.316 ± 1.287
LeuHis: 2.158 ± 0.639
LeuIle: 5.179 ± 2.122
LeuLys: 10.358 ± 1.505
LeuLeu: 12.085 ± 2.087
LeuMet: 3.021 ± 1.392
LeuAsn: 3.021 ± 0.968
LeuPro: 4.316 ± 1.804
LeuGln: 4.316 ± 1.213
LeuArg: 3.884 ± 1.38
LeuSer: 4.748 ± 1.958
LeuThr: 3.884 ± 1.347
LeuVal: 3.453 ± 0.901
LeuTrp: 1.295 ± 0.874
LeuTyr: 3.884 ± 1.247
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.748 ± 1.574
MetCys: 0.432 ± 0.366
MetAsp: 1.295 ± 1.04
MetGlu: 0.863 ± 0.732
MetPhe: 0.432 ± 0.343
MetGly: 0.432 ± 0.426
MetHis: 0.0 ± 0.0
MetIle: 0.863 ± 0.731
MetLys: 2.59 ± 1.019
MetLeu: 2.158 ± 1.142
MetMet: 1.295 ± 0.722
MetAsn: 1.295 ± 0.616
MetPro: 0.0 ± 0.0
MetGln: 0.432 ± 0.484
MetArg: 0.863 ± 0.545
MetSer: 0.432 ± 0.426
MetThr: 2.158 ± 1.021
MetVal: 0.863 ± 0.473
MetTrp: 0.0 ± 0.0
MetTyr: 0.863 ± 0.968
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.453 ± 1.102
AsnCys: 0.0 ± 0.0
AsnAsp: 2.59 ± 1.121
AsnGlu: 3.021 ± 1.525
AsnPhe: 3.884 ± 0.813
AsnGly: 3.453 ± 0.954
AsnHis: 1.295 ± 0.932
AsnIle: 3.021 ± 1.246
AsnLys: 3.453 ± 1.297
AsnLeu: 5.611 ± 1.683
AsnMet: 0.432 ± 0.426
AsnAsn: 3.021 ± 1.14
AsnPro: 1.726 ± 0.675
AsnGln: 0.863 ± 0.623
AsnArg: 2.59 ± 1.121
AsnSer: 1.726 ± 0.882
AsnThr: 2.158 ± 0.737
AsnVal: 1.726 ± 0.904
AsnTrp: 0.432 ± 0.425
AsnTyr: 2.59 ± 0.713
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.158 ± 0.874
ProCys: 0.0 ± 0.0
ProAsp: 1.726 ± 0.641
ProGlu: 1.726 ± 1.026
ProPhe: 2.158 ± 1.125
ProGly: 0.0 ± 0.0
ProHis: 2.158 ± 1.33
ProIle: 2.59 ± 0.746
ProLys: 2.59 ± 0.89
ProLeu: 2.59 ± 0.957
ProMet: 0.863 ± 0.731
ProAsn: 1.726 ± 0.929
ProPro: 0.0 ± 0.0
ProGln: 1.295 ± 0.605
ProArg: 1.726 ± 0.787
ProSer: 0.863 ± 0.686
ProThr: 1.726 ± 0.61
ProVal: 0.863 ± 0.686
ProTrp: 0.0 ± 0.0
ProTyr: 0.863 ± 0.473
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.179 ± 1.233
GlnCys: 0.0 ± 0.0
GlnAsp: 2.158 ± 1.047
GlnGlu: 4.316 ± 1.193
GlnPhe: 1.726 ± 0.851
GlnGly: 2.59 ± 0.964
GlnHis: 0.432 ± 0.343
GlnIle: 1.726 ± 0.945
GlnLys: 5.179 ± 1.792
GlnLeu: 3.884 ± 1.021
GlnMet: 0.432 ± 0.484
GlnAsn: 1.726 ± 0.847
GlnPro: 0.863 ± 0.473
GlnGln: 3.453 ± 1.705
GlnArg: 2.158 ± 0.59
GlnSer: 1.295 ± 0.771
GlnThr: 2.158 ± 0.783
GlnVal: 2.158 ± 1.048
GlnTrp: 0.863 ± 0.639
GlnTyr: 1.295 ± 0.553
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.316 ± 1.413
ArgCys: 0.432 ± 0.366
ArgAsp: 2.59 ± 1.043
ArgGlu: 4.316 ± 1.6
ArgPhe: 2.59 ± 1.186
ArgGly: 3.453 ± 1.463
ArgHis: 2.158 ± 0.793
ArgIle: 3.021 ± 0.929
ArgLys: 6.042 ± 1.596
ArgLeu: 4.748 ± 1.607
ArgMet: 0.432 ± 0.343
ArgAsn: 1.295 ± 0.984
ArgPro: 0.432 ± 0.343
ArgGln: 3.453 ± 0.96
ArgArg: 3.453 ± 1.19
ArgSer: 1.726 ± 0.806
ArgThr: 3.453 ± 0.948
ArgVal: 2.59 ± 1.034
ArgTrp: 0.863 ± 0.473
ArgTyr: 0.863 ± 0.519
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.295 ± 0.886
SerCys: 0.432 ± 0.43
SerAsp: 3.884 ± 1.477
SerGlu: 3.884 ± 1.096
SerPhe: 1.295 ± 0.584
SerGly: 0.863 ± 0.507
SerHis: 0.432 ± 0.467
SerIle: 3.021 ± 1.178
SerLys: 2.59 ± 0.778
SerLeu: 6.905 ± 1.359
SerMet: 0.863 ± 0.699
SerAsn: 0.863 ± 0.577
SerPro: 1.295 ± 0.66
SerGln: 1.726 ± 0.872
SerArg: 2.59 ± 1.03
SerSer: 1.726 ± 0.596
SerThr: 3.021 ± 0.989
SerVal: 3.021 ± 1.039
SerTrp: 0.432 ± 0.453
SerTyr: 2.158 ± 0.639
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.453 ± 0.976
ThrCys: 0.0 ± 0.0
ThrAsp: 1.726 ± 0.711
ThrGlu: 4.316 ± 1.324
ThrPhe: 1.726 ± 0.751
ThrGly: 4.748 ± 1.051
ThrHis: 1.726 ± 0.751
ThrIle: 4.316 ± 1.39
ThrLys: 5.611 ± 1.548
ThrLeu: 6.905 ± 1.599
ThrMet: 0.863 ± 0.545
ThrAsn: 3.453 ± 1.088
ThrPro: 1.726 ± 0.816
ThrGln: 1.295 ± 0.584
ThrArg: 3.884 ± 1.111
ThrSer: 1.726 ± 0.568
ThrThr: 2.59 ± 1.562
ThrVal: 3.884 ± 1.663
ThrTrp: 0.432 ± 0.467
ThrTyr: 0.863 ± 0.564
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.726 ± 0.8
ValCys: 0.432 ± 0.343
ValAsp: 2.158 ± 0.844
ValGlu: 5.179 ± 1.618
ValPhe: 1.726 ± 0.534
ValGly: 3.884 ± 0.867
ValHis: 0.0 ± 0.0
ValIle: 3.453 ± 1.548
ValLys: 2.59 ± 1.145
ValLeu: 4.748 ± 1.201
ValMet: 2.158 ± 1.237
ValAsn: 2.158 ± 0.677
ValPro: 0.432 ± 0.343
ValGln: 1.726 ± 0.662
ValArg: 3.884 ± 1.405
ValSer: 2.59 ± 1.069
ValThr: 1.726 ± 0.574
ValVal: 1.726 ± 1.053
ValTrp: 0.432 ± 0.343
ValTyr: 3.021 ± 0.97
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 3.021 ± 0.8
TrpPhe: 0.863 ± 0.436
TrpGly: 0.0 ± 0.0
TrpHis: 1.295 ± 0.936
TrpIle: 0.432 ± 0.453
TrpLys: 0.432 ± 0.366
TrpLeu: 2.158 ± 0.776
TrpMet: 0.0 ± 0.0
TrpAsn: 0.432 ± 0.453
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.432 ± 0.383
TrpSer: 0.0 ± 0.0
TrpThr: 0.863 ± 0.6
TrpVal: 2.59 ± 1.284
TrpTrp: 0.432 ± 0.343
TrpTyr: 0.432 ± 0.343
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.158 ± 1.108
TyrCys: 0.432 ± 0.467
TyrAsp: 3.453 ± 1.063
TyrGlu: 1.726 ± 0.732
TyrPhe: 3.884 ± 1.407
TyrGly: 0.863 ± 0.492
TyrHis: 1.726 ± 1.069
TyrIle: 6.474 ± 1.271
TyrLys: 4.316 ± 1.244
TyrLeu: 6.474 ± 1.999
TyrMet: 0.863 ± 0.645
TyrAsn: 3.021 ± 0.9
TyrPro: 1.295 ± 0.745
TyrGln: 2.158 ± 0.879
TyrArg: 0.863 ± 0.568
TyrSer: 2.59 ± 0.856
TyrThr: 0.863 ± 0.587
TyrVal: 2.158 ± 0.895
TyrTrp: 0.432 ± 0.453
TyrTyr: 1.295 ± 0.742
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (2318 amino acids)
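For readers who want to work with the listing above programmatically, the following sketch parses entries consisting of a dipeptide name, a value and an error into a dictionary. The regular expression and the function name are assumptions for illustration; the pattern also tolerates a repeated value printed directly before the dipeptide name, and it skips row labels and other lines.

```python
import re

# Optional leading number (some renderings repeat the value before the name),
# then two three-letter residue codes, the value and the bootstrap error.
ENTRY = re.compile(
    r"^(?:\d+(?:\.\d+)?)?([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*"
    r"(\d+(?:\.\d+)?)\s*±\s*(\d+(?:\.\d+)?)$"
)


def parse_entries(lines):
    """Return {(first, second): (value, error)} in per mille; skip other lines."""
    table = {}
    for line in lines:
        match = ENTRY.match(line.strip())
        if match:
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table


sample = [
    "Ala",  # row label, ignored by the parser
    "AlaCys: 0.863 ± 0.731",
    "5.179AlaAsp: 5.179 ± 1.412",
]
parsed = parse_entries(sample)
print(parsed[("Ala", "Cys")])
```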

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
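A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates serves as the error estimate. This is an illustrative assumption about the procedure; the function names, the fixed seed, the use of the standard deviation as the error measure and the toy sequences are hypothetical and not taken from Proteome-pI.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            hits += (a + b == pair)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement; return the std. dev. of the frequency."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_per_mille(sample, pair))
    return statistics.stdev(values)


# Hypothetical toy sequences, not taken from the Javan591 proteome.
toy = ["MKLLEEK", "MLLKE", "MEEKKLA"]
estimate = pair_per_mille(toy, "LL")
error = bootstrap_error(toy, "LL")
print(f"LL: {estimate:.1f} ± {error:.1f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that co-occur within one protein stay together in every replicate.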

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski