Amino acid dipeptide frequency for Ralstonia phage p12J

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.
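The calculation described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the code used by the database: the function names, the handling of protein boundaries (pairs are not counted across them), the mapping of non-standard residues to X, and the toy sequences are all hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
# Uniform expectation: 1 / (20 * 20) = 0.25% = 2.5 per mille
EXPECTED_PERMILLE = 1000 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not confirmed by the source): overlapping pairs are counted
    inside each protein, never across protein boundaries, and any
    non-standard residue is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def representation(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


toy_proteins = ["MAAGL", "MAAK"]  # made-up sequences, not phage p12J data
freqs = dipeptide_permille(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), representation(value))
```

With such a helper, a value like 16.802‰ for AlaAla would be labelled "over" and 1.084‰ for AlaHis "under", which corresponds to the red and blue marking described above.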

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain a fraction (for example, 16.802‰ = 0.016802, or about 1.68%).

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid of the dipeptide; columns: second amino acid. Each cell gives frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| Ala | 16.802 ± 4.777 | 2.168 ± 0.876 | 5.42 ± 1.344 | 2.71 ± 1.459 | 3.252 ± 1.414 | 4.878 ± 1.264 | 1.084 ± 0.769 | 7.588 ± 2.808 | 2.71 ± 0.972 | 6.504 ± 2.221 | 1.626 ± 1.358 | 2.168 ± 0.827 | 5.42 ± 2.61 | 5.42 ± 2.213 | 7.046 ± 2.209 | 6.504 ± 1.788 | 4.336 ± 2.156 | 7.588 ± 3.847 | 1.084 ± 0.906 | 4.336 ± 1.265 | 0.0 ± 0.0 |
| Cys | 3.794 ± 2.046 | 0.0 ± 0.0 | 2.168 ± 1.211 | 1.084 ± 0.523 | 1.084 ± 0.906 | 1.084 ± 0.88 | 0.0 ± 0.0 | 0.542 ± 0.44 | 1.084 ± 0.88 | 0.542 ± 0.44 | 0.542 ± 0.719 | 0.542 ± 0.44 | 0.542 ± 0.507 | 1.084 ± 0.88 | 1.084 ± 0.914 | 3.252 ± 2.641 | 2.168 ± 0.803 | 1.626 ± 1.009 | 0.542 ± 0.507 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 4.878 ± 1.222 | 1.084 ± 0.88 | 5.42 ± 2.603 | 3.794 ± 1.873 | 1.084 ± 0.754 | 5.962 ± 2.866 | 0.542 ± 0.471 | 2.168 ± 1.06 | 0.0 ± 0.0 | 4.336 ± 1.92 | 1.084 ± 0.88 | 1.084 ± 0.769 | 6.504 ± 1.744 | 2.168 ± 0.798 | 1.084 ± 0.645 | 3.794 ± 1.665 | 2.168 ± 1.268 | 4.336 ± 1.333 | 0.542 ± 0.471 | 1.084 ± 0.88 | 0.0 ± 0.0 |
| Glu | 3.794 ± 1.886 | 2.168 ± 1.237 | 1.084 ± 0.906 | 4.336 ± 2.345 | 1.084 ± 1.464 | 3.252 ± 2.356 | 1.626 ± 0.869 | 0.542 ± 0.471 | 2.168 ± 0.902 | 6.504 ± 3.14 | 1.084 ± 0.833 | 0.542 ± 0.471 | 1.084 ± 0.58 | 2.168 ± 1.7 | 3.252 ± 1.508 | 2.168 ± 1.35 | 2.168 ± 1.063 | 4.878 ± 1.626 | 0.542 ± 0.589 | 1.084 ± 0.787 | 0.0 ± 0.0 |
| Phe | 1.626 ± 1.094 | 0.542 ± 0.44 | 3.252 ± 1.959 | 3.794 ± 1.61 | 2.71 ± 1.401 | 2.71 ± 1.285 | 0.542 ± 0.471 | 0.0 ± 0.0 | 1.084 ± 0.997 | 2.71 ± 2.201 | 2.168 ± 0.83 | 0.0 ± 0.0 | 1.626 ± 1.162 | 0.542 ± 0.704 | 2.71 ± 1.005 | 2.71 ± 1.543 | 0.542 ± 0.598 | 2.71 ± 1.195 | 0.542 ± 0.471 | 1.626 ± 0.483 | 0.0 ± 0.0 |
| Gly | 7.588 ± 2.041 | 1.084 ± 0.523 | 1.626 ± 0.641 | 2.71 ± 1.304 | 4.336 ± 1.483 | 11.382 ± 4.898 | 1.084 ± 0.88 | 4.336 ± 2.275 | 5.42 ± 1.809 | 4.336 ± 2.452 | 4.336 ± 1.384 | 1.626 ± 0.483 | 2.168 ± 1.016 | 7.588 ± 1.587 | 4.878 ± 1.217 | 9.756 ± 3.05 | 4.878 ± 1.529 | 7.588 ± 2.806 | 2.168 ± 0.827 | 4.336 ± 2.517 | 0.0 ± 0.0 |
| His | 2.168 ± 1.156 | 0.542 ± 0.732 | 0.542 ± 0.471 | 1.084 ± 0.769 | 0.542 ± 0.471 | 0.542 ± 0.471 | 1.084 ± 0.941 | 1.626 ± 0.483 | 1.626 ± 0.983 | 2.168 ± 1.063 | 1.084 ± 0.748 | 0.542 ± 0.589 | 0.542 ± 0.507 | 1.084 ± 0.906 | 1.626 ± 0.927 | 1.084 ± 0.769 | 0.542 ± 0.507 | 1.626 ± 0.483 | 0.542 ± 0.44 | 0.542 ± 0.471 | 0.0 ± 0.0 |
| Ile | 2.168 ± 0.719 | 1.084 ± 0.523 | 2.168 ± 1.211 | 2.168 ± 1.882 | 0.0 ± 0.0 | 5.42 ± 2.128 | 2.71 ± 1.068 | 1.084 ± 0.58 | 1.084 ± 0.855 | 2.168 ± 1.403 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.168 ± 0.827 | 0.542 ± 0.471 | 2.71 ± 1.35 | 1.626 ± 0.686 | 2.168 ± 0.634 | 3.794 ± 1.608 | 0.542 ± 0.44 | 1.084 ± 1.199 | 0.0 ± 0.0 |
| Lys | 5.42 ± 2.309 | 0.542 ± 0.471 | 1.084 ± 0.544 | 1.626 ± 1.162 | 1.084 ± 0.835 | 4.336 ± 1.624 | 1.084 ± 0.916 | 0.542 ± 0.507 | 3.794 ± 2.276 | 4.336 ± 0.848 | 2.168 ± 1.412 | 1.626 ± 1.4 | 2.71 ± 1.705 | 0.0 ± 0.0 | 4.336 ± 2.131 | 4.336 ± 1.192 | 1.084 ± 0.715 | 5.42 ± 2.187 | 0.542 ± 0.507 | 1.626 ± 0.483 | 0.0 ± 0.0 |
| Leu | 8.13 ± 1.945 | 1.084 ± 0.88 | 3.794 ± 2.238 | 3.252 ± 1.877 | 2.71 ± 1.652 | 7.046 ± 2.66 | 3.252 ± 1.012 | 2.71 ± 1.627 | 3.252 ± 1.492 | 4.878 ± 2.229 | 1.626 ± 0.871 | 1.626 ± 0.787 | 3.794 ± 1.229 | 1.626 ± 0.822 | 4.336 ± 1.875 | 6.504 ± 2.315 | 5.42 ± 1.703 | 3.794 ± 1.215 | 1.084 ± 0.78 | 0.542 ± 0.507 | 0.0 ± 0.0 |
| Met | 2.71 ± 1.377 | 0.542 ± 0.471 | 0.542 ± 0.471 | 1.084 ± 1.014 | 1.084 ± 0.952 | 1.626 ± 1.419 | 0.0 ± 0.0 | 0.542 ± 0.598 | 0.542 ± 0.598 | 3.794 ± 1.22 | 1.084 ± 0.733 | 1.626 ± 0.996 | 2.168 ± 0.876 | 2.168 ± 0.978 | 2.168 ± 1.242 | 0.542 ± 0.44 | 2.71 ± 1.062 | 2.168 ± 1.068 | 0.542 ± 0.44 | 0.542 ± 0.507 | 0.0 ± 0.0 |
| Asn | 2.168 ± 0.922 | 1.626 ± 1.321 | 1.626 ± 0.871 | 1.084 ± 0.645 | 1.084 ± 1.227 | 4.878 ± 1.727 | 0.542 ± 0.507 | 1.084 ± 0.88 | 0.542 ± 0.44 | 1.626 ± 0.822 | 1.084 ± 0.685 | 0.542 ± 0.44 | 2.168 ± 0.964 | 0.542 ± 0.44 | 0.542 ± 0.719 | 1.084 ± 0.855 | 1.626 ± 1.321 | 2.168 ± 0.896 | 1.626 ± 0.869 | 0.542 ± 0.471 | 0.0 ± 0.0 |
| Pro | 7.046 ± 3.033 | 0.0 ± 0.0 | 3.252 ± 1.164 | 3.794 ± 1.549 | 3.252 ± 1.645 | 6.504 ± 2.275 | 0.0 ± 0.0 | 0.542 ± 0.883 | 3.252 ± 0.828 | 3.252 ± 0.875 | 1.626 ± 0.702 | 2.71 ± 2.201 | 5.42 ± 2.743 | 2.71 ± 1.422 | 2.71 ± 1.505 | 3.794 ± 1.811 | 2.168 ± 1.045 | 5.42 ± 2.472 | 1.084 ± 0.523 | 1.084 ± 0.787 | 0.0 ± 0.0 |
| Gln | 2.168 ± 1.238 | 1.626 ± 1.033 | 1.626 ± 0.931 | 2.168 ± 1.452 | 2.71 ± 1.263 | 3.252 ± 1.131 | 0.542 ± 0.589 | 2.168 ± 0.985 | 3.794 ± 1.478 | 4.336 ± 1.043 | 0.542 ± 0.507 | 2.168 ± 1.088 | 3.252 ± 1.228 | 3.794 ± 1.953 | 2.71 ± 1.446 | 1.626 ± 0.787 | 0.542 ± 0.44 | 2.71 ± 1.422 | 1.084 ± 0.523 | 1.626 ± 0.686 | 0.0 ± 0.0 |
| Arg | 8.13 ± 2.669 | 1.084 ± 0.544 | 3.252 ± 2.263 | 5.962 ± 3.17 | 1.084 ± 0.88 | 4.336 ± 1.383 | 1.626 ± 0.927 | 1.626 ± 0.787 | 5.42 ± 1.971 | 3.252 ± 1.131 | 0.542 ± 0.732 | 0.0 ± 0.0 | 3.252 ± 1.904 | 2.168 ± 1.065 | 8.13 ± 3.487 | 2.71 ± 1.126 | 2.168 ± 0.579 | 7.046 ± 2.136 | 1.084 ± 0.906 | 1.084 ± 0.941 | 0.0 ± 0.0 |
| Ser | 8.13 ± 2.812 | 3.794 ± 2.178 | 4.336 ± 1.654 | 1.626 ± 0.869 | 3.252 ± 1.236 | 10.298 ± 3.233 | 1.084 ± 0.58 | 1.626 ± 1.238 | 3.794 ± 1.387 | 4.878 ± 1.394 | 1.084 ± 0.715 | 3.252 ± 2.102 | 2.168 ± 1.344 | 1.084 ± 0.523 | 2.71 ± 1.532 | 7.588 ± 2.295 | 1.626 ± 1.321 | 5.962 ± 2.402 | 0.542 ± 0.732 | 1.084 ± 0.769 | 0.0 ± 0.0 |
| Thr | 3.252 ± 1.206 | 1.084 ± 0.88 | 1.626 ± 0.981 | 1.084 ± 1.439 | 0.0 ± 0.0 | 5.42 ± 1.408 | 1.626 ± 0.983 | 2.168 ± 0.943 | 3.794 ± 1.909 | 6.504 ± 1.598 | 1.626 ± 0.81 | 2.71 ± 1.249 | 2.71 ± 1.627 | 1.626 ± 0.931 | 1.626 ± 0.726 | 3.794 ± 1.702 | 8.672 ± 3.156 | 5.42 ± 2.445 | 0.0 ± 0.0 | 1.626 ± 0.917 | 0.0 ± 0.0 |
| Val | 7.588 ± 3.395 | 2.168 ± 1.36 | 8.672 ± 1.678 | 2.168 ± 1.829 | 2.71 ± 1.39 | 7.588 ± 3.269 | 1.084 ± 0.544 | 2.168 ± 0.827 | 1.626 ± 0.726 | 1.626 ± 1.208 | 2.168 ± 0.971 | 3.794 ± 1.363 | 7.588 ± 2.771 | 5.42 ± 2.117 | 7.588 ± 4.398 | 5.962 ± 1.793 | 8.13 ± 4.285 | 9.214 ± 2.899 | 0.542 ± 0.732 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Trp | 1.084 ± 1.105 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.542 ± 0.471 | 0.542 ± 0.507 | 1.084 ± 0.58 | 0.542 ± 0.44 | 1.626 ± 0.983 | 0.0 ± 0.0 | 1.084 ± 1.345 | 1.084 ± 0.88 | 2.168 ± 0.89 | 0.542 ± 0.471 | 1.084 ± 0.906 | 1.084 ± 0.544 | 1.084 ± 1.014 | 1.084 ± 0.855 | 0.0 ± 0.0 | 0.542 ± 0.507 | 0.0 ± 0.0 |
| Tyr | 0.542 ± 0.507 | 0.542 ± 0.44 | 1.626 ± 0.726 | 0.0 ± 0.0 | 0.542 ± 0.507 | 2.168 ± 1.36 | 0.542 ± 0.719 | 1.084 ± 0.544 | 1.084 ± 0.769 | 2.71 ± 1.367 | 1.084 ± 0.523 | 0.542 ± 0.883 | 2.168 ± 0.719 | 2.168 ± 1.086 | 1.626 ± 0.917 | 0.0 ± 0.0 | 2.168 ± 1.045 | 3.252 ± 0.915 | 0.542 ± 0.471 | 0.542 ± 0.507 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 9 proteins (1846 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
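One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is taken as the error. The source does not specify the exact procedure, so the function names, the use of the sample standard deviation, the fixed seed, and the toy sequences are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a set of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over resampled protein sets.

    Assumption: each round draws len(proteins) proteins with replacement;
    the spread statistic actually used by the database is not stated.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


toy_proteins = ["MAAGL", "MAAK", "MKVLAA"]  # made-up, not phage p12J data
print(round(pair_permille(toy_proteins, "AA"), 3),
      "±", round(bootstrap_error(toy_proteins, "AA"), 3))
```

Under this scheme a dipeptide that never occurs in any protein has an estimate of 0 in every replicate and therefore an error of 0, which agrees with the 0.0 ± 0.0 entries in the table.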

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski