Amino acid dipepetide frequency for Stenotrophomonas phage PSH1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.458AlaAla: 15.458 ± 3.16
2.973AlaCys: 2.973 ± 0.73
5.945AlaAsp: 5.945 ± 1.84
3.567AlaGlu: 3.567 ± 0.755
2.973AlaPhe: 2.973 ± 0.755
10.702AlaGly: 10.702 ± 2.062
2.378AlaHis: 2.378 ± 1.023
2.973AlaIle: 2.973 ± 1.315
5.351AlaLys: 5.351 ± 1.345
4.756AlaLeu: 4.756 ± 2.705
2.973AlaMet: 2.973 ± 1.672
3.567AlaAsn: 3.567 ± 1.693
4.756AlaPro: 4.756 ± 1.43
7.729AlaGln: 7.729 ± 1.564
5.945AlaArg: 5.945 ± 1.254
7.134AlaSer: 7.134 ± 1.973
8.918AlaThr: 8.918 ± 1.958
7.729AlaVal: 7.729 ± 1.814
1.784AlaTrp: 1.784 ± 0.898
4.162AlaTyr: 4.162 ± 1.255
0.0AlaXaa: 0.0 ± 0.0
Cys
5.945CysAla: 5.945 ± 2.4
0.0CysCys: 0.0 ± 0.0
2.378CysAsp: 2.378 ± 2.103
1.784CysGlu: 1.784 ± 0.731
0.0CysPhe: 0.0 ± 0.0
2.378CysGly: 2.378 ± 1.501
0.0CysHis: 0.0 ± 0.0
1.784CysIle: 1.784 ± 0.44
2.378CysLys: 2.378 ± 1.129
2.378CysLeu: 2.378 ± 1.264
0.0CysMet: 0.0 ± 0.0
0.595CysAsn: 0.595 ± 0.526
2.973CysPro: 2.973 ± 1.931
0.595CysGln: 0.595 ± 0.651
2.973CysArg: 2.973 ± 0.902
2.378CysSer: 2.378 ± 1.647
2.378CysThr: 2.378 ± 0.729
0.595CysVal: 0.595 ± 0.526
0.595CysTrp: 0.595 ± 0.55
0.595CysTyr: 0.595 ± 0.482
0.0CysXaa: 0.0 ± 0.0
Asp
3.567AspAla: 3.567 ± 1.651
1.189AspCys: 1.189 ± 1.052
2.973AspAsp: 2.973 ± 0.902
4.756AspGlu: 4.756 ± 1.154
2.973AspPhe: 2.973 ± 1.165
9.512AspGly: 9.512 ± 3.523
0.595AspHis: 0.595 ± 0.526
1.189AspIle: 1.189 ± 0.822
2.378AspLys: 2.378 ± 1.051
3.567AspLeu: 3.567 ± 1.63
0.595AspMet: 0.595 ± 0.55
2.973AspAsn: 2.973 ± 1.588
3.567AspPro: 3.567 ± 1.159
1.189AspGln: 1.189 ± 0.516
2.378AspArg: 2.378 ± 1.088
3.567AspSer: 3.567 ± 1.18
1.189AspThr: 1.189 ± 0.516
3.567AspVal: 3.567 ± 1.581
1.784AspTrp: 1.784 ± 0.766
2.378AspTyr: 2.378 ± 1.278
0.0AspXaa: 0.0 ± 0.0
Glu
3.567GluAla: 3.567 ± 1.902
2.378GluCys: 2.378 ± 1.033
1.189GluAsp: 1.189 ± 0.758
2.378GluGlu: 2.378 ± 1.322
1.784GluPhe: 1.784 ± 1.182
4.162GluGly: 4.162 ± 1.009
0.595GluHis: 0.595 ± 0.482
1.784GluIle: 1.784 ± 1.532
4.756GluLys: 4.756 ± 1.879
5.945GluLeu: 5.945 ± 3.601
0.595GluMet: 0.595 ± 0.631
0.595GluAsn: 0.595 ± 0.526
3.567GluPro: 3.567 ± 2.062
1.189GluGln: 1.189 ± 0.516
2.378GluArg: 2.378 ± 1.557
2.973GluSer: 2.973 ± 0.771
2.378GluThr: 2.378 ± 1.42
2.378GluVal: 2.378 ± 1.501
1.189GluTrp: 1.189 ± 0.758
2.378GluTyr: 2.378 ± 1.42
0.0GluXaa: 0.0 ± 0.0
Phe
5.351PheAla: 5.351 ± 1.389
1.189PheCys: 1.189 ± 0.574
4.162PheAsp: 4.162 ± 1.66
1.784PheGlu: 1.784 ± 0.766
1.784PhePhe: 1.784 ± 0.651
1.189PheGly: 1.189 ± 0.516
0.0PheHis: 0.0 ± 0.0
1.189PheIle: 1.189 ± 0.516
2.973PheLys: 2.973 ± 1.71
1.784PheLeu: 1.784 ± 0.815
0.595PheMet: 0.595 ± 0.918
0.0PheAsn: 0.0 ± 0.0
0.595PhePro: 0.595 ± 0.482
1.189PheGln: 1.189 ± 0.516
2.378PheArg: 2.378 ± 2.201
1.784PheSer: 1.784 ± 0.713
0.595PheThr: 0.595 ± 0.765
2.378PheVal: 2.378 ± 1.592
0.595PheTrp: 0.595 ± 0.631
0.595PheTyr: 0.595 ± 0.918
0.0PheXaa: 0.0 ± 0.0
Gly
6.54GlyAla: 6.54 ± 1.817
2.378GlyCys: 2.378 ± 1.429
8.918GlyAsp: 8.918 ± 2.346
4.162GlyGlu: 4.162 ± 1.133
5.945GlyPhe: 5.945 ± 1.549
13.08GlyGly: 13.08 ± 4.962
3.567GlyHis: 3.567 ± 1.753
4.162GlyIle: 4.162 ± 1.145
3.567GlyLys: 3.567 ± 1.305
5.945GlyLeu: 5.945 ± 0.982
3.567GlyMet: 3.567 ± 1.944
1.784GlyAsn: 1.784 ± 0.924
4.162GlyPro: 4.162 ± 2.226
5.351GlyGln: 5.351 ± 2.034
4.756GlyArg: 4.756 ± 0.541
7.729GlySer: 7.729 ± 3.319
4.756GlyThr: 4.756 ± 1.457
4.756GlyVal: 4.756 ± 1.046
1.189GlyTrp: 1.189 ± 0.516
1.189GlyTyr: 1.189 ± 0.822
0.0GlyXaa: 0.0 ± 0.0
His
1.784HisAla: 1.784 ± 0.44
0.595HisCys: 0.595 ± 0.918
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.595HisPhe: 0.595 ± 0.55
1.189HisGly: 1.189 ± 0.964
1.189HisHis: 1.189 ± 1.303
0.595HisIle: 0.595 ± 0.482
1.189HisLys: 1.189 ± 1.034
0.595HisLeu: 0.595 ± 0.482
0.595HisMet: 0.595 ± 0.526
0.595HisAsn: 0.595 ± 0.482
1.784HisPro: 1.784 ± 1.407
0.0HisGln: 0.0 ± 0.0
1.784HisArg: 1.784 ± 1.12
0.0HisSer: 0.0 ± 0.0
0.595HisThr: 0.595 ± 0.482
1.189HisVal: 1.189 ± 0.964
0.0HisTrp: 0.0 ± 0.0
0.595HisTyr: 0.595 ± 0.631
0.0HisXaa: 0.0 ± 0.0
Ile
5.351IleAla: 5.351 ± 1.097
0.595IleCys: 0.595 ± 0.526
4.162IleAsp: 4.162 ± 1.824
3.567IleGlu: 3.567 ± 1.328
0.0IlePhe: 0.0 ± 0.0
2.973IleGly: 2.973 ± 1.431
0.0IleHis: 0.0 ± 0.0
2.378IleIle: 2.378 ± 0.729
0.0IleLys: 0.0 ± 0.0
5.351IleLeu: 5.351 ± 1.914
1.189IleMet: 1.189 ± 0.779
0.595IleAsn: 0.595 ± 0.55
0.595IlePro: 0.595 ± 0.482
4.162IleGln: 4.162 ± 1.047
2.973IleArg: 2.973 ± 1.247
2.378IleSer: 2.378 ± 1.828
1.784IleThr: 1.784 ± 0.866
0.0IleVal: 0.0 ± 0.0
0.0IleTrp: 0.0 ± 0.0
1.189IleTyr: 1.189 ± 0.964
0.0IleXaa: 0.0 ± 0.0
Lys
5.351LysAla: 5.351 ± 2.292
1.189LysCys: 1.189 ± 1.052
2.378LysAsp: 2.378 ± 1.007
2.378LysGlu: 2.378 ± 0.729
2.378LysPhe: 2.378 ± 0.663
3.567LysGly: 3.567 ± 1.349
0.0LysHis: 0.0 ± 0.0
0.595LysIle: 0.595 ± 0.482
3.567LysLys: 3.567 ± 1.907
4.162LysLeu: 4.162 ± 0.978
0.0LysMet: 0.0 ± 0.0
2.973LysAsn: 2.973 ± 1.926
1.189LysPro: 1.189 ± 0.753
2.378LysGln: 2.378 ± 1.19
3.567LysArg: 3.567 ± 1.189
2.378LysSer: 2.378 ± 2.201
2.973LysThr: 2.973 ± 0.785
3.567LysVal: 3.567 ± 1.431
1.189LysTrp: 1.189 ± 1.101
1.189LysTyr: 1.189 ± 0.639
0.0LysXaa: 0.0 ± 0.0
Leu
5.351LeuAla: 5.351 ± 1.732
2.378LeuCys: 2.378 ± 1.264
7.134LeuAsp: 7.134 ± 2.697
2.378LeuGlu: 2.378 ± 1.784
0.595LeuPhe: 0.595 ± 0.482
8.323LeuGly: 8.323 ± 1.49
1.189LeuHis: 1.189 ± 1.272
2.973LeuIle: 2.973 ± 1.067
1.784LeuLys: 1.784 ± 0.745
5.945LeuLeu: 5.945 ± 1.37
2.378LeuMet: 2.378 ± 0.706
0.595LeuAsn: 0.595 ± 0.55
5.945LeuPro: 5.945 ± 1.678
1.784LeuGln: 1.784 ± 0.917
7.134LeuArg: 7.134 ± 3.044
3.567LeuSer: 3.567 ± 1.393
4.756LeuThr: 4.756 ± 1.039
2.973LeuVal: 2.973 ± 1.647
0.595LeuTrp: 0.595 ± 0.631
2.973LeuTyr: 2.973 ± 1.465
0.0LeuXaa: 0.0 ± 0.0
Met
3.567MetAla: 3.567 ± 1.413
0.0MetCys: 0.0 ± 0.0
0.595MetAsp: 0.595 ± 0.631
2.378MetGlu: 2.378 ± 1.418
1.189MetPhe: 1.189 ± 1.101
2.378MetGly: 2.378 ± 0.514
0.0MetHis: 0.0 ± 0.0
1.784MetIle: 1.784 ± 1.301
0.595MetLys: 0.595 ± 0.631
1.189MetLeu: 1.189 ± 1.034
1.189MetMet: 1.189 ± 1.25
0.0MetAsn: 0.0 ± 0.0
3.567MetPro: 3.567 ± 1.329
0.595MetGln: 0.595 ± 0.526
0.595MetArg: 0.595 ± 0.482
1.189MetSer: 1.189 ± 0.822
0.595MetThr: 0.595 ± 0.482
1.189MetVal: 1.189 ± 0.639
1.784MetTrp: 1.784 ± 0.865
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.973AsnAla: 2.973 ± 1.481
1.784AsnCys: 1.784 ± 0.954
1.784AsnAsp: 1.784 ± 1.025
1.784AsnGlu: 1.784 ± 0.828
0.595AsnPhe: 0.595 ± 0.55
3.567AsnGly: 3.567 ± 1.549
0.0AsnHis: 0.0 ± 0.0
0.595AsnIle: 0.595 ± 0.526
0.595AsnLys: 0.595 ± 0.765
0.595AsnLeu: 0.595 ± 0.651
1.189AsnMet: 1.189 ± 0.986
1.784AsnAsn: 1.784 ± 1.578
1.784AsnPro: 1.784 ± 1.445
1.189AsnGln: 1.189 ± 0.516
3.567AsnArg: 3.567 ± 1.325
2.378AsnSer: 2.378 ± 2.103
0.595AsnThr: 0.595 ± 0.526
1.784AsnVal: 1.784 ± 1.058
1.189AsnTrp: 1.189 ± 1.101
0.595AsnTyr: 0.595 ± 0.526
0.0AsnXaa: 0.0 ± 0.0
Pro
6.54ProAla: 6.54 ± 2.31
0.595ProCys: 0.595 ± 0.526
5.945ProAsp: 5.945 ± 3.144
7.134ProGlu: 7.134 ± 2.378
1.784ProPhe: 1.784 ± 1.259
4.162ProGly: 4.162 ± 1.742
1.784ProHis: 1.784 ± 0.961
2.973ProIle: 2.973 ± 1.943
2.378ProLys: 2.378 ± 1.606
2.378ProLeu: 2.378 ± 1.342
2.378ProMet: 2.378 ± 1.286
1.189ProAsn: 1.189 ± 0.516
5.945ProPro: 5.945 ± 2.052
2.378ProGln: 2.378 ± 1.264
2.378ProArg: 2.378 ± 1.287
1.784ProSer: 1.784 ± 0.713
4.162ProThr: 4.162 ± 1.042
2.973ProVal: 2.973 ± 0.785
2.973ProTrp: 2.973 ± 0.73
0.595ProTyr: 0.595 ± 0.482
0.0ProXaa: 0.0 ± 0.0
Gln
5.351GlnAla: 5.351 ± 2.104
2.378GlnCys: 2.378 ± 1.647
0.0GlnAsp: 0.0 ± 0.0
0.595GlnGlu: 0.595 ± 0.651
0.595GlnPhe: 0.595 ± 0.55
3.567GlnGly: 3.567 ± 1.397
0.0GlnHis: 0.0 ± 0.0
0.0GlnIle: 0.0 ± 0.0
1.784GlnLys: 1.784 ± 0.917
4.162GlnLeu: 4.162 ± 1.094
1.189GlnMet: 1.189 ± 0.639
0.595GlnAsn: 0.595 ± 0.526
5.945GlnPro: 5.945 ± 1.555
1.784GlnGln: 1.784 ± 0.924
2.973GlnArg: 2.973 ± 1.521
2.378GlnSer: 2.378 ± 1.007
0.595GlnThr: 0.595 ± 0.482
1.784GlnVal: 1.784 ± 1.025
2.378GlnTrp: 2.378 ± 1.007
2.378GlnTyr: 2.378 ± 0.722
0.0GlnXaa: 0.0 ± 0.0
Arg
7.729ArgAla: 7.729 ± 1.915
1.784ArgCys: 1.784 ± 0.994
1.784ArgAsp: 1.784 ± 1.182
1.189ArgGlu: 1.189 ± 0.78
0.595ArgPhe: 0.595 ± 0.55
4.162ArgGly: 4.162 ± 1.255
2.973ArgHis: 2.973 ± 1.473
5.351ArgIle: 5.351 ± 1.469
2.378ArgLys: 2.378 ± 0.949
3.567ArgLeu: 3.567 ± 1.15
1.784ArgMet: 1.784 ± 1.406
5.351ArgAsn: 5.351 ± 1.198
2.973ArgPro: 2.973 ± 1.242
2.973ArgGln: 2.973 ± 1.399
3.567ArgArg: 3.567 ± 1.662
2.378ArgSer: 2.378 ± 1.269
5.351ArgThr: 5.351 ± 1.648
4.162ArgVal: 4.162 ± 2.069
1.784ArgTrp: 1.784 ± 0.989
1.784ArgTyr: 1.784 ± 0.989
0.0ArgXaa: 0.0 ± 0.0
Ser
6.54SerAla: 6.54 ± 2.569
4.162SerCys: 4.162 ± 1.43
1.189SerAsp: 1.189 ± 0.955
1.189SerGlu: 1.189 ± 1.101
2.973SerPhe: 2.973 ± 0.778
8.323SerGly: 8.323 ± 3.729
0.0SerHis: 0.0 ± 0.0
2.973SerIle: 2.973 ± 1.022
2.973SerLys: 2.973 ± 1.109
3.567SerLeu: 3.567 ± 1.82
1.189SerMet: 1.189 ± 0.822
2.378SerAsn: 2.378 ± 1.152
5.945SerPro: 5.945 ± 1.407
1.189SerGln: 1.189 ± 0.516
2.973SerArg: 2.973 ± 1.521
4.162SerSer: 4.162 ± 0.637
2.378SerThr: 2.378 ± 0.729
3.567SerVal: 3.567 ± 1.675
0.0SerTrp: 0.0 ± 0.0
2.378SerTyr: 2.378 ± 1.007
0.0SerXaa: 0.0 ± 0.0
Thr
7.134ThrAla: 7.134 ± 3.083
4.162ThrCys: 4.162 ± 1.083
1.189ThrAsp: 1.189 ± 0.753
1.189ThrGlu: 1.189 ± 0.964
1.189ThrPhe: 1.189 ± 0.71
7.134ThrGly: 7.134 ± 2.262
0.0ThrHis: 0.0 ± 0.0
1.784ThrIle: 1.784 ± 0.849
5.351ThrLys: 5.351 ± 2.011
2.378ThrLeu: 2.378 ± 1.083
1.784ThrMet: 1.784 ± 1.007
0.595ThrAsn: 0.595 ± 0.526
1.784ThrPro: 1.784 ± 1.005
2.378ThrGln: 2.378 ± 1.28
2.973ThrArg: 2.973 ± 0.771
5.945ThrSer: 5.945 ± 1.765
2.973ThrThr: 2.973 ± 1.965
5.351ThrVal: 5.351 ± 1.712
1.189ThrTrp: 1.189 ± 1.052
1.784ThrTyr: 1.784 ± 0.954
0.0ThrXaa: 0.0 ± 0.0
Val
8.323ValAla: 8.323 ± 3.005
2.378ValCys: 2.378 ± 0.729
1.784ValAsp: 1.784 ± 1.091
2.378ValGlu: 2.378 ± 1.259
1.784ValPhe: 1.784 ± 1.091
4.756ValGly: 4.756 ± 1.894
0.595ValHis: 0.595 ± 0.482
2.378ValIle: 2.378 ± 0.781
1.784ValLys: 1.784 ± 0.745
6.54ValLeu: 6.54 ± 2.173
0.0ValMet: 0.0 ± 0.0
1.784ValAsn: 1.784 ± 1.09
2.973ValPro: 2.973 ± 1.052
1.189ValGln: 1.189 ± 0.964
4.162ValArg: 4.162 ± 0.824
2.973ValSer: 2.973 ± 1.23
4.756ValThr: 4.756 ± 1.346
5.351ValVal: 5.351 ± 1.975
0.595ValTrp: 0.595 ± 0.631
1.784ValTyr: 1.784 ± 0.766
0.0ValXaa: 0.0 ± 0.0
Trp
2.378TrpAla: 2.378 ± 1.152
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.595TrpGlu: 0.595 ± 0.482
1.784TrpPhe: 1.784 ± 0.994
0.595TrpGly: 0.595 ± 0.918
0.0TrpHis: 0.0 ± 0.0
0.595TrpIle: 0.595 ± 0.55
0.595TrpLys: 0.595 ± 0.631
3.567TrpLeu: 3.567 ± 1.069
0.0TrpMet: 0.0 ± 0.0
1.784TrpAsn: 1.784 ± 0.731
0.0TrpPro: 0.0 ± 0.0
0.595TrpGln: 0.595 ± 0.631
1.784TrpArg: 1.784 ± 1.13
1.784TrpSer: 1.784 ± 1.578
2.378TrpThr: 2.378 ± 1.412
1.784TrpVal: 1.784 ± 0.766
0.0TrpTrp: 0.0 ± 0.0
1.189TrpTyr: 1.189 ± 0.639
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.567TyrAla: 3.567 ± 1.698
0.595TyrCys: 0.595 ± 0.482
2.378TyrAsp: 2.378 ± 1.278
2.973TyrGlu: 2.973 ± 1.383
0.595TyrPhe: 0.595 ± 0.482
1.784TyrGly: 1.784 ± 1.016
0.0TyrHis: 0.0 ± 0.0
1.189TyrIle: 1.189 ± 1.052
0.595TyrLys: 0.595 ± 0.526
2.378TyrLeu: 2.378 ± 1.419
0.595TyrMet: 0.595 ± 0.482
0.595TyrAsn: 0.595 ± 0.482
2.378TyrPro: 2.378 ± 1.839
0.595TyrGln: 0.595 ± 0.482
2.378TyrArg: 2.378 ± 1.264
1.189TyrSer: 1.189 ± 0.71
4.162TyrThr: 4.162 ± 0.769
1.189TyrVal: 1.189 ± 0.639
0.595TyrTrp: 0.595 ± 0.526
1.189TyrTyr: 1.189 ± 0.516
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1683 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski