Amino acid dipeptide frequency for Vibrio phage pre-CTX

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
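A minimal sketch of how such dipeptide frequencies could be computed is shown below. It is not the code used by Proteome-pI: the function name `dipeptide_per_mille` is hypothetical, and it assumes that overlapping adjacent pairs are counted within each protein, that counts are pooled over the proteome, and that any residue outside the 20 standard ones is mapped to X (Xaa). It uses one-letter codes, whereas the table uses three-letter codes.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption (not taken from the source): overlapping adjacent pairs are
    counted inside each protein, and pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Slide a window of width 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real phage proteins.
freqs = dipeptide_per_mille(["MKKLA", "AAB"])
```

In this toy example the six counted pairs are MK, KK, KL, LA, AA and AX, so each one has a frequency of 1000/6 ‰.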

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
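The arithmetic behind this baseline can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and the sketch assumes the uniform 1/400 baseline described above is the threshold; the exact rule used for colouring the table is not stated here. Values are expressed in per mille, the unit used in the table, so 0.25% becomes 2.5‰.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille.

    20 residue types give 400 dipeptides; 21 (with Xaa) give 441.
    """
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, n_residue_types=20):
    """Label a table value relative to the uniform expectation.

    "over" would correspond to red and "under" to blue in the table
    (assuming the uniform baseline is the threshold).
    """
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


baseline_400 = expected_per_mille(20)   # 2.5 per mille, i.e. 0.25%
baseline_441 = expected_per_mille(21)   # about 2.27 per mille
ala_leu = classify(10.823)              # AlaLeu value from the table
ala_trp = classify(0.722)               # AlaTrp value from the table
```

With the 400-dipeptide baseline, AlaLeu (10.823‰) comes out as overrepresented and AlaTrp (0.722‰) as underrepresented.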

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.215 ± 1.224
AlaCys: 0.0 ± 0.0
AlaAsp: 2.886 ± 1.501
AlaGlu: 3.608 ± 1.12
AlaPhe: 3.608 ± 1.079
AlaGly: 3.608 ± 0.991
AlaHis: 1.443 ± 1.123
AlaIle: 7.215 ± 4.541
AlaLys: 4.329 ± 1.331
AlaLeu: 10.823 ± 3.552
AlaMet: 1.443 ± 1.073
AlaAsn: 2.886 ± 1.316
AlaPro: 4.329 ± 2.701
AlaGln: 1.443 ± 0.611
AlaArg: 2.886 ± 1.468
AlaSer: 4.329 ± 1.832
AlaThr: 4.329 ± 1.833
AlaVal: 3.608 ± 1.61
AlaTrp: 0.722 ± 0.705
AlaTyr: 2.886 ± 1.815
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.722 ± 0.995
CysCys: 0.0 ± 0.0
CysAsp: 2.165 ± 0.989
CysGlu: 0.0 ± 0.0
CysPhe: 0.722 ± 0.536
CysGly: 1.443 ± 0.736
CysHis: 0.0 ± 0.0
CysIle: 1.443 ± 0.611
CysLys: 0.722 ± 0.536
CysLeu: 1.443 ± 0.611
CysMet: 0.722 ± 0.536
CysAsn: 0.0 ± 0.0
CysPro: 0.722 ± 0.536
CysGln: 0.722 ± 0.536
CysArg: 0.0 ± 0.0
CysSer: 1.443 ± 1.073
CysThr: 0.0 ± 0.0
CysVal: 2.886 ± 1.055
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.886 ± 1.461
AspCys: 0.0 ± 0.0
AspAsp: 2.165 ± 1.268
AspGlu: 7.215 ± 3.585
AspPhe: 1.443 ± 0.611
AspGly: 5.051 ± 0.942
AspHis: 1.443 ± 0.736
AspIle: 4.329 ± 1.458
AspLys: 2.165 ± 1.011
AspLeu: 2.165 ± 0.866
AspMet: 2.886 ± 1.826
AspAsn: 1.443 ± 1.009
AspPro: 0.722 ± 0.536
AspGln: 1.443 ± 0.984
AspArg: 3.608 ± 1.202
AspSer: 4.329 ± 1.05
AspThr: 7.937 ± 1.546
AspVal: 2.886 ± 0.826
AspTrp: 0.722 ± 1.015
AspTyr: 1.443 ± 1.073
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.886 ± 1.055
GluCys: 1.443 ± 0.611
GluAsp: 0.722 ± 0.504
GluGlu: 2.886 ± 1.443
GluPhe: 0.722 ± 0.504
GluGly: 1.443 ± 1.073
GluHis: 0.722 ± 0.705
GluIle: 2.165 ± 0.569
GluLys: 2.886 ± 1.038
GluLeu: 4.329 ± 1.768
GluMet: 2.165 ± 1.454
GluAsn: 2.165 ± 1.033
GluPro: 1.443 ± 0.736
GluGln: 2.165 ± 1.351
GluArg: 4.329 ± 1.423
GluSer: 4.329 ± 1.536
GluThr: 2.886 ± 1.529
GluVal: 4.329 ± 0.921
GluTrp: 0.0 ± 0.0
GluTyr: 2.886 ± 1.345
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.886 ± 1.371
PheCys: 1.443 ± 0.611
PheAsp: 3.608 ± 0.947
PheGlu: 1.443 ± 1.009
PhePhe: 4.329 ± 2.032
PheGly: 5.772 ± 1.369
PheHis: 0.722 ± 0.536
PheIle: 2.886 ± 1.468
PheLys: 3.608 ± 1.126
PheLeu: 7.937 ± 1.455
PheMet: 1.443 ± 0.608
PheAsn: 2.886 ± 0.841
PhePro: 1.443 ± 0.671
PheGln: 0.722 ± 0.995
PheArg: 0.722 ± 0.536
PheSer: 6.494 ± 1.658
PheThr: 3.608 ± 2.371
PheVal: 4.329 ± 1.177
PheTrp: 1.443 ± 1.027
PheTyr: 0.722 ± 0.504
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.772 ± 0.843
GlyCys: 1.443 ± 0.671
GlyAsp: 1.443 ± 0.611
GlyGlu: 1.443 ± 1.073
GlyPhe: 5.772 ± 1.594
GlyGly: 3.608 ± 1.137
GlyHis: 2.165 ± 0.569
GlyIle: 1.443 ± 1.179
GlyLys: 1.443 ± 0.982
GlyLeu: 10.823 ± 1.798
GlyMet: 0.722 ± 0.504
GlyAsn: 2.165 ± 1.046
GlyPro: 0.722 ± 0.739
GlyGln: 3.608 ± 1.137
GlyArg: 3.608 ± 1.918
GlySer: 5.051 ± 1.583
GlyThr: 4.329 ± 1.361
GlyVal: 4.329 ± 2.165
GlyTrp: 1.443 ± 0.736
GlyTyr: 4.329 ± 2.48
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.608 ± 1.092
HisCys: 0.0 ± 0.0
HisAsp: 2.165 ± 1.046
HisGlu: 0.0 ± 0.0
HisPhe: 2.165 ± 1.046
HisGly: 2.165 ± 1.202
HisHis: 3.608 ± 1.265
HisIle: 2.886 ± 1.031
HisLys: 0.0 ± 0.0
HisLeu: 0.722 ± 0.705
HisMet: 0.0 ± 0.0
HisAsn: 2.165 ± 1.011
HisPro: 1.443 ± 0.736
HisGln: 1.443 ± 1.073
HisArg: 2.165 ± 1.351
HisSer: 1.443 ± 0.671
HisThr: 0.722 ± 0.995
HisVal: 0.722 ± 0.536
HisTrp: 0.722 ± 0.504
HisTyr: 0.722 ± 0.536
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.329 ± 2.328
IleCys: 0.722 ± 0.504
IleAsp: 6.494 ± 2.696
IleGlu: 3.608 ± 1.33
IlePhe: 7.215 ± 1.817
IleGly: 2.165 ± 1.047
IleHis: 2.165 ± 1.011
IleIle: 2.886 ± 0.688
IleLys: 4.329 ± 1.708
IleLeu: 2.886 ± 0.826
IleMet: 1.443 ± 1.338
IleAsn: 2.165 ± 0.989
IlePro: 2.165 ± 1.046
IleGln: 1.443 ± 2.031
IleArg: 3.608 ± 1.429
IleSer: 3.608 ± 0.707
IleThr: 2.886 ± 1.763
IleVal: 3.608 ± 1.812
IleTrp: 0.722 ± 0.504
IleTyr: 3.608 ± 1.201
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.051 ± 1.693
LysCys: 0.0 ± 0.0
LysAsp: 6.494 ± 0.879
LysGlu: 0.0 ± 0.0
LysPhe: 2.886 ± 0.688
LysGly: 2.165 ± 1.033
LysHis: 0.0 ± 0.0
LysIle: 4.329 ± 1.069
LysLys: 2.886 ± 1.031
LysLeu: 4.329 ± 2.452
LysMet: 2.165 ± 1.085
LysAsn: 2.165 ± 1.24
LysPro: 0.722 ± 0.536
LysGln: 1.443 ± 0.671
LysArg: 2.165 ± 1.011
LysSer: 2.165 ± 0.694
LysThr: 3.608 ± 1.33
LysVal: 3.608 ± 1.487
LysTrp: 0.0 ± 0.0
LysTyr: 0.722 ± 0.504
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.329 ± 1.583
LeuCys: 3.608 ± 1.772
LeuAsp: 5.772 ± 1.464
LeuGlu: 3.608 ± 2.045
LeuPhe: 8.658 ± 2.642
LeuGly: 7.937 ± 3.157
LeuHis: 2.886 ± 0.826
LeuIle: 5.051 ± 2.975
LeuLys: 4.329 ± 1.084
LeuLeu: 4.329 ± 1.415
LeuMet: 4.329 ± 2.99
LeuAsn: 7.215 ± 2.03
LeuPro: 4.329 ± 1.855
LeuGln: 0.722 ± 1.015
LeuArg: 4.329 ± 0.973
LeuSer: 7.215 ± 2.598
LeuThr: 5.772 ± 2.72
LeuVal: 3.608 ± 1.403
LeuTrp: 3.608 ± 1.918
LeuTyr: 4.329 ± 1.506
LeuXaa: 0.0 ± 0.0
Met
MetAla: 5.772 ± 1.907
MetCys: 0.722 ± 0.536
MetAsp: 2.165 ± 1.26
MetGlu: 0.722 ± 0.536
MetPhe: 2.886 ± 1.371
MetGly: 0.722 ± 0.739
MetHis: 0.722 ± 0.504
MetIle: 2.886 ± 1.055
MetLys: 0.722 ± 0.995
MetLeu: 2.886 ± 1.968
MetMet: 1.443 ± 2.031
MetAsn: 1.443 ± 0.671
MetPro: 0.722 ± 0.705
MetGln: 0.0 ± 0.0
MetArg: 0.722 ± 0.536
MetSer: 2.165 ± 1.097
MetThr: 0.722 ± 1.015
MetVal: 2.165 ± 0.569
MetTrp: 0.722 ± 1.015
MetTyr: 0.722 ± 0.504
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.443 ± 0.736
AsnCys: 0.722 ± 0.536
AsnAsp: 3.608 ± 1.462
AsnGlu: 1.443 ± 1.073
AsnPhe: 0.0 ± 0.0
AsnGly: 1.443 ± 0.973
AsnHis: 1.443 ± 0.611
AsnIle: 2.165 ± 1.011
AsnLys: 1.443 ± 1.336
AsnLeu: 6.494 ± 1.038
AsnMet: 2.165 ± 0.977
AsnAsn: 1.443 ± 0.973
AsnPro: 2.886 ± 1.529
AsnGln: 2.165 ± 1.137
AsnArg: 2.886 ± 1.127
AsnSer: 1.443 ± 1.009
AsnThr: 5.772 ± 2.248
AsnVal: 2.165 ± 0.569
AsnTrp: 0.722 ± 0.536
AsnTyr: 0.722 ± 0.536
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.165 ± 1.046
ProCys: 0.0 ± 0.0
ProAsp: 5.051 ± 1.345
ProGlu: 2.886 ± 0.826
ProPhe: 0.722 ± 0.536
ProGly: 2.165 ± 1.011
ProHis: 1.443 ± 1.123
ProIle: 3.608 ± 1.557
ProLys: 0.0 ± 0.0
ProLeu: 2.886 ± 1.071
ProMet: 0.722 ± 0.536
ProAsn: 5.051 ± 2.283
ProPro: 7.215 ± 1.485
ProGln: 2.165 ± 1.5
ProArg: 2.886 ± 1.127
ProSer: 3.608 ± 2.045
ProThr: 1.443 ± 0.611
ProVal: 0.722 ± 0.536
ProTrp: 0.722 ± 0.705
ProTyr: 2.165 ± 0.569
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.608 ± 3.13
GlnCys: 0.0 ± 0.0
GlnAsp: 1.443 ± 0.611
GlnGlu: 0.722 ± 0.705
GlnPhe: 2.165 ± 1.217
GlnGly: 0.722 ± 0.536
GlnHis: 0.722 ± 0.995
GlnIle: 2.165 ± 1.589
GlnLys: 1.443 ± 1.584
GlnLeu: 2.886 ± 1.443
GlnMet: 2.886 ± 0.826
GlnAsn: 1.443 ± 0.973
GlnPro: 0.722 ± 0.536
GlnGln: 2.165 ± 1.217
GlnArg: 0.722 ± 0.536
GlnSer: 7.215 ± 2.809
GlnThr: 1.443 ± 1.411
GlnVal: 2.886 ± 1.055
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.722 ± 1.015
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.608 ± 1.201
ArgCys: 2.165 ± 1.609
ArgAsp: 1.443 ± 0.736
ArgGlu: 4.329 ± 1.05
ArgPhe: 2.165 ± 0.907
ArgGly: 5.051 ± 2.78
ArgHis: 2.886 ± 1.468
ArgIle: 2.886 ± 0.688
ArgLys: 2.886 ± 0.918
ArgLeu: 5.051 ± 2.412
ArgMet: 2.165 ± 1.042
ArgAsn: 0.722 ± 0.705
ArgPro: 1.443 ± 0.736
ArgGln: 1.443 ± 0.736
ArgArg: 0.0 ± 0.0
ArgSer: 2.165 ± 1.268
ArgThr: 0.0 ± 0.0
ArgVal: 2.165 ± 0.989
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.886 ± 0.841
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.494 ± 2.188
SerCys: 0.722 ± 0.536
SerAsp: 2.165 ± 0.569
SerGlu: 4.329 ± 1.289
SerPhe: 4.329 ± 1.272
SerGly: 8.658 ± 1.476
SerHis: 2.165 ± 1.033
SerIle: 2.886 ± 1.468
SerLys: 4.329 ± 1.014
SerLeu: 5.772 ± 2.043
SerMet: 1.443 ± 1.123
SerAsn: 2.886 ± 1.341
SerPro: 6.494 ± 3.333
SerGln: 5.051 ± 2.1
SerArg: 1.443 ± 0.671
SerSer: 6.494 ± 1.43
SerThr: 3.608 ± 1.297
SerVal: 2.886 ± 1.529
SerTrp: 0.0 ± 0.0
SerTyr: 5.051 ± 0.934
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.329 ± 1.768
ThrCys: 0.722 ± 0.705
ThrAsp: 1.443 ± 0.982
ThrGlu: 5.051 ± 1.953
ThrPhe: 2.886 ± 2.112
ThrGly: 3.608 ± 1.265
ThrHis: 1.443 ± 0.736
ThrIle: 3.608 ± 0.802
ThrLys: 1.443 ± 0.794
ThrLeu: 7.215 ± 2.32
ThrMet: 2.165 ± 0.983
ThrAsn: 1.443 ± 1.009
ThrPro: 3.608 ± 1.772
ThrGln: 4.329 ± 1.805
ThrArg: 1.443 ± 1.027
ThrSer: 2.886 ± 0.765
ThrThr: 4.329 ± 1.966
ThrVal: 6.494 ± 2.232
ThrTrp: 0.722 ± 1.015
ThrTyr: 0.722 ± 0.705
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.886 ± 1.438
ValCys: 0.0 ± 0.0
ValAsp: 1.443 ± 0.611
ValGlu: 2.886 ± 1.763
ValPhe: 4.329 ± 1.521
ValGly: 5.051 ± 1.6
ValHis: 1.443 ± 0.736
ValIle: 4.329 ± 3.623
ValLys: 3.608 ± 2.058
ValLeu: 7.937 ± 1.652
ValMet: 0.0 ± 0.0
ValAsn: 1.443 ± 0.671
ValPro: 3.608 ± 1.432
ValGln: 2.165 ± 1.033
ValArg: 2.165 ± 0.569
ValSer: 5.051 ± 1.751
ValThr: 5.051 ± 1.945
ValVal: 2.165 ± 0.694
ValTrp: 2.165 ± 1.216
ValTyr: 0.722 ± 0.536
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.443 ± 1.009
TrpCys: 0.0 ± 0.0
TrpAsp: 1.443 ± 0.611
TrpGlu: 0.722 ± 1.015
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.722 ± 0.504
TrpIle: 1.443 ± 1.179
TrpLys: 0.722 ± 0.504
TrpLeu: 2.886 ± 1.826
TrpMet: 0.0 ± 0.0
TrpAsn: 0.722 ± 0.995
TrpPro: 1.443 ± 0.611
TrpGln: 0.0 ± 0.0
TrpArg: 0.722 ± 0.705
TrpSer: 0.722 ± 0.536
TrpThr: 0.0 ± 0.0
TrpVal: 0.722 ± 0.504
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.722 ± 0.705
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.165 ± 1.011
TyrCys: 1.443 ± 1.073
TyrAsp: 2.165 ± 0.995
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.443 ± 0.671
TyrGly: 2.886 ± 1.127
TyrHis: 0.722 ± 0.995
TyrIle: 1.443 ± 0.671
TyrLys: 3.608 ± 1.557
TyrLeu: 2.165 ± 0.866
TyrMet: 0.0 ± 0.0
TyrAsn: 0.722 ± 0.536
TyrPro: 1.443 ± 0.984
TyrGln: 1.443 ± 1.411
TyrArg: 5.772 ± 2.151
TyrSer: 5.051 ± 1.607
TyrThr: 1.443 ± 0.973
TyrVal: 2.165 ± 1.268
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.443 ± 1.411
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1387 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
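A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the Proteome-pI implementation: the names `pair_frequency` and `bootstrap_error` are hypothetical, whole proteins are resampled with replacement 100 times, and the reported error is assumed to be the standard deviation of the resampled frequencies (the source does not say which spread measure is used).

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille, pooled over the proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the population standard deviation of the
    frequency over n_rounds resampled proteomes.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_frequency(sample, pair))
    return statistics.pstdev(replicates)


# Toy proteome, not real phage proteins.
toy = ["AAAA", "CCCC", "ACAC"]
err_aa = bootstrap_error(toy, "AA")
err_absent = bootstrap_error(toy, "WW")
```

With only a handful of proteins, as in the six-protein proteome here, each resampled proteome can differ considerably from the original, which is consistent with the relatively large errors in the table.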

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski