Amino acid dipeptide frequency for Enterobacteria phage SP (Bacteriophage SP)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
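As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein and normalized by the total number of pairs; the exact counting and normalization used by the database are not stated on this page. The function name `dipeptide_frequencies` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the same order as the table columns.
# "X" stands for Xaa (any non-standard or unknown residue).
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of all 441 dipeptides for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        residues = [aa if aa in ONE_TO_THREE else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs; pairs never span two proteins (assumption).
        counts.update(zip(residues, residues[1:]))
    total = sum(counts.values())
    freqs = {}
    for first, second in product(ONE_TO_THREE, repeat=2):
        label = ONE_TO_THREE[first] + ONE_TO_THREE[second]
        freqs[label] = 1000.0 * counts[(first, second)] / total if total else 0.0
    return freqs


# Toy sequences (hypothetical, not taken from the phage SP proteome).
freqs = dipeptide_frequencies(["MAAL", "AALK"])
print(round(freqs["AlaAla"], 3))  # 2 of the 6 pairs are AlaAla -> 333.333
```

The result is keyed by the same labels used in the table (for example `AlaAla`), so the output can be compared with the values listed here once real sequences are supplied.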

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue, in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
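The arithmetic behind the uniform expectation and the per mille unit can be sketched as follows. This is only an illustration: the page does not state the exact rule used for the red and blue marking, so the simple comparison against the uniform expectation in `classify` is an assumption, and the function names are hypothetical.

```python
STANDARD_DIPEPTIDES = 20 * 20   # 400 ordered pairs of standard amino acids
WITH_XAA_DIPEPTIDES = 21 * 21   # 441 when Xaa is counted as a 21st residue

# Uniform expectation: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / STANDARD_DIPEPTIDES


def per_mille_to_fraction(value_per_mille):
    """Convert a table value in per mille to a plain fraction."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


print(EXPECTED_PER_MILLE)                       # 2.5
print(round(per_mille_to_fraction(8.737), 6))   # AlaAla -> 0.008737
print(classify(11.425))                         # LeuLeu -> over
print(classify(0.672))                          # AlaHis -> under
```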

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.737 ± 1.563
AlaCys: 4.704 ± 2.134
AlaAsp: 4.032 ± 1.93
AlaGlu: 3.36 ± 1.039
AlaPhe: 6.72 ± 0.206
AlaGly: 2.016 ± 1.043
AlaHis: 0.672 ± 0.59
AlaIle: 2.688 ± 1.121
AlaLys: 2.688 ± 1.34
AlaLeu: 10.081 ± 2.342
AlaMet: 0.672 ± 0.576
AlaAsn: 2.016 ± 1.084
AlaPro: 1.344 ± 0.972
AlaGln: 2.016 ± 1.043
AlaArg: 4.032 ± 1.265
AlaSer: 5.376 ± 0.42
AlaThr: 4.032 ± 1.371
AlaVal: 4.032 ± 0.591
AlaTrp: 2.688 ± 0.498
AlaTyr: 2.688 ± 1.229
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 2.016 ± 0.965
CysGlu: 0.672 ± 0.486
CysPhe: 0.0 ± 0.0
CysGly: 1.344 ± 0.972
CysHis: 0.0 ± 0.0
CysIle: 1.344 ± 0.972
CysLys: 0.672 ± 0.59
CysLeu: 0.672 ± 0.486
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 2.016 ± 1.084
CysGln: 0.0 ± 0.0
CysArg: 2.016 ± 0.942
CysSer: 0.0 ± 0.0
CysThr: 1.344 ± 1.038
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.672 ± 0.576
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.704 ± 2.548
AspCys: 0.0 ± 0.0
AspAsp: 2.016 ± 0.942
AspGlu: 3.36 ± 1.282
AspPhe: 3.36 ± 1.825
AspGly: 7.392 ± 2.148
AspHis: 0.0 ± 0.0
AspIle: 5.376 ± 0.846
AspLys: 0.672 ± 0.59
AspLeu: 8.737 ± 1.504
AspMet: 1.344 ± 0.422
AspAsn: 2.688 ± 1.229
AspPro: 4.032 ± 2.192
AspGln: 4.032 ± 1.498
AspArg: 3.36 ± 1.581
AspSer: 3.36 ± 0.91
AspThr: 3.36 ± 1.039
AspVal: 8.737 ± 0.959
AspTrp: 1.344 ± 1.18
AspTyr: 2.016 ± 1.152
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.032 ± 0.92
GluCys: 0.672 ± 0.486
GluAsp: 2.016 ± 0.375
GluGlu: 3.36 ± 1.433
GluPhe: 1.344 ± 1.18
GluGly: 5.376 ± 1.367
GluHis: 0.0 ± 0.0
GluIle: 2.688 ± 1.119
GluLys: 3.36 ± 1.433
GluLeu: 3.36 ± 1.433
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 1.344 ± 0.422
GluGln: 0.672 ± 0.59
GluArg: 4.032 ± 0.816
GluSer: 2.016 ± 1.457
GluThr: 2.016 ± 0.903
GluVal: 7.392 ± 1.831
GluTrp: 0.672 ± 0.486
GluTyr: 2.016 ± 0.693
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.032 ± 1.033
PheCys: 0.672 ± 0.486
PheAsp: 3.36 ± 0.553
PheGlu: 4.704 ± 1.426
PhePhe: 0.672 ± 0.486
PheGly: 2.016 ± 0.375
PheHis: 0.0 ± 0.0
PheIle: 0.672 ± 0.486
PheLys: 2.688 ± 1.121
PheLeu: 2.016 ± 0.942
PheMet: 0.672 ± 0.59
PheAsn: 1.344 ± 0.711
PhePro: 3.36 ± 1.328
PheGln: 2.016 ± 0.693
PheArg: 2.016 ± 0.693
PheSer: 4.704 ± 3.214
PheThr: 2.688 ± 1.229
PheVal: 3.36 ± 1.328
PheTrp: 0.672 ± 0.576
PheTyr: 2.016 ± 0.903
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.032 ± 1.498
GlyCys: 0.672 ± 0.486
GlyAsp: 8.065 ± 0.99
GlyGlu: 2.688 ± 0.843
GlyPhe: 3.36 ± 0.553
GlyGly: 2.688 ± 0.876
GlyHis: 2.688 ± 0.843
GlyIle: 3.36 ± 2.429
GlyLys: 5.376 ± 0.895
GlyLeu: 2.016 ± 0.903
GlyMet: 0.672 ± 0.576
GlyAsn: 2.688 ± 0.876
GlyPro: 1.344 ± 0.422
GlyGln: 0.672 ± 0.59
GlyArg: 4.032 ± 0.751
GlySer: 5.376 ± 1.375
GlyThr: 2.688 ± 0.777
GlyVal: 7.392 ± 1.305
GlyTrp: 0.672 ± 0.486
GlyTyr: 4.704 ± 1.584
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.672 ± 0.486
HisCys: 0.0 ± 0.0
HisAsp: 0.672 ± 0.486
HisGlu: 0.0 ± 0.0
HisPhe: 0.672 ± 0.59
HisGly: 1.344 ± 0.422
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.672 ± 0.59
HisLeu: 1.344 ± 0.422
HisMet: 0.672 ± 0.486
HisAsn: 0.0 ± 0.0
HisPro: 1.344 ± 0.972
HisGln: 0.0 ± 0.0
HisArg: 0.672 ± 0.59
HisSer: 0.672 ± 0.59
HisThr: 2.016 ± 0.903
HisVal: 2.688 ± 2.36
HisTrp: 0.672 ± 0.486
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.032 ± 0.692
IleCys: 0.0 ± 0.0
IleAsp: 6.048 ± 1.635
IleGlu: 0.672 ± 0.486
IlePhe: 1.344 ± 0.422
IleGly: 3.36 ± 0.97
IleHis: 1.344 ± 0.972
IleIle: 2.688 ± 1.222
IleLys: 3.36 ± 1.49
IleLeu: 3.36 ± 2.429
IleMet: 0.0 ± 0.0
IleAsn: 1.344 ± 0.972
IlePro: 3.36 ± 0.91
IleGln: 2.688 ± 0.777
IleArg: 4.032 ± 0.92
IleSer: 3.36 ± 1.28
IleThr: 2.016 ± 0.693
IleVal: 2.016 ± 0.965
IleTrp: 0.672 ± 0.486
IleTyr: 2.688 ± 0.876
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.672 ± 0.486
LysCys: 0.672 ± 0.576
LysAsp: 1.344 ± 1.18
LysGlu: 0.672 ± 0.576
LysPhe: 2.688 ± 0.498
LysGly: 1.344 ± 1.18
LysHis: 2.016 ± 0.693
LysIle: 4.704 ± 0.578
LysLys: 1.344 ± 0.422
LysLeu: 9.409 ± 1.248
LysMet: 0.0 ± 0.0
LysAsn: 4.032 ± 1.93
LysPro: 0.672 ± 0.576
LysGln: 0.0 ± 0.0
LysArg: 5.376 ± 0.72
LysSer: 1.344 ± 0.711
LysThr: 3.36 ± 1.581
LysVal: 4.032 ± 0.816
LysTrp: 0.672 ± 0.59
LysTyr: 4.032 ± 0.692
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.065 ± 2.711
LeuCys: 0.0 ± 0.0
LeuAsp: 4.032 ± 0.751
LeuGlu: 6.048 ± 1.383
LeuPhe: 1.344 ± 0.422
LeuGly: 4.704 ± 0.737
LeuHis: 1.344 ± 0.422
LeuIle: 4.704 ± 2.035
LeuLys: 4.032 ± 1.265
LeuLeu: 11.425 ± 0.898
LeuMet: 2.016 ± 0.55
LeuAsn: 6.72 ± 1.425
LeuPro: 3.36 ± 0.553
LeuGln: 2.688 ± 0.732
LeuArg: 8.065 ± 3.358
LeuSer: 9.409 ± 3.273
LeuThr: 7.392 ± 2.233
LeuVal: 6.048 ± 1.333
LeuTrp: 2.016 ± 0.903
LeuTyr: 2.016 ± 0.693
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.344 ± 1.038
MetCys: 0.0 ± 0.0
MetAsp: 0.672 ± 0.486
MetGlu: 0.672 ± 0.59
MetPhe: 0.672 ± 0.486
MetGly: 1.344 ± 0.422
MetHis: 0.0 ± 0.0
MetIle: 0.672 ± 0.486
MetLys: 0.0 ± 0.0
MetLeu: 0.672 ± 0.59
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 2.688 ± 0.843
MetGln: 0.672 ± 0.576
MetArg: 1.344 ± 1.18
MetSer: 0.672 ± 0.486
MetThr: 0.0 ± 0.0
MetVal: 0.672 ± 0.576
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.0 ± 0.0
AsnCys: 0.672 ± 0.576
AsnAsp: 2.688 ± 0.843
AsnGlu: 1.344 ± 0.972
AsnPhe: 2.688 ± 0.732
AsnGly: 6.72 ± 2.313
AsnHis: 1.344 ± 0.422
AsnIle: 0.672 ± 0.576
AsnLys: 2.016 ± 0.903
AsnLeu: 2.016 ± 1.043
AsnMet: 0.672 ± 0.486
AsnAsn: 0.672 ± 0.59
AsnPro: 6.048 ± 3.002
AsnGln: 2.016 ± 0.965
AsnArg: 4.032 ± 1.498
AsnSer: 3.36 ± 0.91
AsnThr: 1.344 ± 0.611
AsnVal: 1.344 ± 1.18
AsnTrp: 2.016 ± 1.084
AsnTyr: 0.672 ± 0.486
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.36 ± 1.996
ProCys: 0.0 ± 0.0
ProAsp: 5.376 ± 1.752
ProGlu: 2.016 ± 0.693
ProPhe: 4.032 ± 2.293
ProGly: 2.016 ± 0.375
ProHis: 0.0 ± 0.0
ProIle: 0.672 ± 0.486
ProLys: 2.016 ± 1.457
ProLeu: 4.032 ± 0.92
ProMet: 0.0 ± 0.0
ProAsn: 1.344 ± 0.422
ProPro: 2.016 ± 0.942
ProGln: 2.688 ± 0.777
ProArg: 4.032 ± 0.591
ProSer: 6.72 ± 1.94
ProThr: 4.032 ± 2.242
ProVal: 4.032 ± 1.429
ProTrp: 0.0 ± 0.0
ProTyr: 2.016 ± 0.903
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.344 ± 0.422
GlnCys: 0.0 ± 0.0
GlnAsp: 2.016 ± 0.693
GlnGlu: 0.0 ± 0.0
GlnPhe: 0.672 ± 0.576
GlnGly: 2.688 ± 0.498
GlnHis: 0.0 ± 0.0
GlnIle: 2.016 ± 1.043
GlnLys: 0.0 ± 0.0
GlnLeu: 4.032 ± 1.029
GlnMet: 0.672 ± 0.59
GlnAsn: 1.344 ± 1.038
GlnPro: 1.344 ± 1.038
GlnGln: 0.0 ± 0.0
GlnArg: 4.032 ± 1.029
GlnSer: 3.36 ± 0.553
GlnThr: 3.36 ± 0.97
GlnVal: 1.344 ± 1.038
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.72 ± 1.045
ArgCys: 2.688 ± 1.368
ArgAsp: 5.376 ± 1.745
ArgGlu: 3.36 ± 1.039
ArgPhe: 4.032 ± 2.628
ArgGly: 4.032 ± 1.819
ArgHis: 2.016 ± 1.77
ArgIle: 0.672 ± 0.486
ArgLys: 4.704 ± 0.929
ArgLeu: 7.392 ± 1.505
ArgMet: 1.344 ± 1.034
ArgAsn: 4.032 ± 0.816
ArgPro: 2.688 ± 0.843
ArgGln: 1.344 ± 0.422
ArgArg: 7.392 ± 1.505
ArgSer: 5.376 ± 0.42
ArgThr: 3.36 ± 0.675
ArgVal: 6.72 ± 1.021
ArgTrp: 1.344 ± 0.422
ArgTyr: 3.36 ± 0.825
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.72 ± 0.944
SerCys: 0.672 ± 0.486
SerAsp: 4.704 ± 1.219
SerGlu: 3.36 ± 0.97
SerPhe: 3.36 ± 1.788
SerGly: 2.016 ± 1.084
SerHis: 0.0 ± 0.0
SerIle: 4.704 ± 1.387
SerLys: 3.36 ± 1.282
SerLeu: 7.392 ± 1.543
SerMet: 1.344 ± 0.905
SerAsn: 2.016 ± 0.903
SerPro: 2.688 ± 0.876
SerGln: 0.672 ± 0.59
SerArg: 4.032 ± 0.591
SerSer: 3.36 ± 1.328
SerThr: 5.376 ± 1.492
SerVal: 10.081 ± 1.775
SerTrp: 1.344 ± 0.972
SerTyr: 3.36 ± 0.97
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.72 ± 0.719
ThrCys: 0.672 ± 0.59
ThrAsp: 4.032 ± 1.641
ThrGlu: 2.688 ± 0.732
ThrPhe: 4.032 ± 0.692
ThrGly: 4.704 ± 0.909
ThrHis: 0.0 ± 0.0
ThrIle: 1.344 ± 0.422
ThrLys: 1.344 ± 0.422
ThrLeu: 6.72 ± 3.992
ThrMet: 0.0 ± 0.0
ThrAsn: 6.048 ± 0.431
ThrPro: 4.032 ± 0.591
ThrGln: 2.016 ± 0.375
ThrArg: 4.032 ± 1.93
ThrSer: 3.36 ± 1.433
ThrThr: 2.016 ± 0.693
ThrVal: 8.065 ± 1.156
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.016 ± 0.903
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.392 ± 1.864
ValCys: 0.672 ± 0.486
ValAsp: 8.065 ± 1.387
ValGlu: 3.36 ± 1.039
ValPhe: 0.672 ± 0.486
ValGly: 3.36 ± 0.91
ValHis: 2.016 ± 1.77
ValIle: 6.048 ± 2.095
ValLys: 6.72 ± 1.106
ValLeu: 5.376 ± 1.367
ValMet: 0.672 ± 0.59
ValAsn: 4.704 ± 0.969
ValPro: 5.376 ± 1.042
ValGln: 2.688 ± 1.229
ValArg: 4.704 ± 2.484
ValSer: 4.032 ± 1.033
ValThr: 12.097 ± 3.238
ValVal: 4.032 ± 1.029
ValTrp: 0.0 ± 0.0
ValTyr: 1.344 ± 0.972
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.672 ± 0.576
TrpCys: 0.0 ± 0.0
TrpAsp: 1.344 ± 0.611
TrpGlu: 1.344 ± 0.422
TrpPhe: 2.016 ± 0.903
TrpGly: 2.016 ± 0.693
TrpHis: 0.0 ± 0.0
TrpIle: 0.672 ± 0.486
TrpLys: 2.016 ± 0.693
TrpLeu: 1.344 ± 0.711
TrpMet: 0.0 ± 0.0
TrpAsn: 1.344 ± 0.422
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.344 ± 0.611
TrpSer: 0.0 ± 0.0
TrpThr: 0.672 ± 0.576
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.672 ± 0.486
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.344 ± 0.611
TyrCys: 0.0 ± 0.0
TyrAsp: 2.016 ± 0.375
TyrGlu: 2.688 ± 1.222
TyrPhe: 0.0 ± 0.0
TyrGly: 4.704 ± 0.737
TyrHis: 0.672 ± 0.486
TyrIle: 2.688 ± 0.777
TyrLys: 1.344 ± 0.611
TyrLeu: 4.704 ± 1.844
TyrMet: 0.672 ± 0.76
TyrAsn: 0.0 ± 0.0
TyrPro: 1.344 ± 0.422
TyrGln: 1.344 ± 1.18
TyrArg: 5.376 ± 2.47
TyrSer: 4.704 ± 0.969
TyrThr: 0.672 ± 0.486
TyrVal: 1.344 ± 0.422
TyrTrp: 0.672 ± 0.576
TyrTyr: 0.672 ± 0.59
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1489 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
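A protein-level bootstrap of this kind can be sketched as follows. The page only states that 100 bootstrap rounds were run at the protein level; the choice of the sample standard deviation as the error measure, the fixed seed, and the helper names `pair_frequency` and `bootstrap_error` are assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter codes, e.g. "AA")."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy sequences (hypothetical, not the four phage SP proteins).
toy_proteome = ["MAAL", "AALK", "GGGG", "AAAA"]
print(pair_frequency(toy_proteome, "AA"))
print(bootstrap_error(toy_proteome, "AA"))
```

With only four proteins, as in this proteome, each resample can differ substantially from the original set, which is consistent with the relatively large errors shown next to many values.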

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski