Amino acid dipeptide frequency for Shewanella phage SppYZU01

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (for example, 15.098‰ corresponds to 0.015098).
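The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build this page: the function names, the one-letter input alphabet, the toy sequences, and the choice to count overlapping pairs within each protein and normalize by the total number of pairs are all assumptions, and the database's exact normalization may differ.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 400 equally likely dipeptides -> 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all 441 pairs.

    Overlapping pairs are counted inside each protein only, never across
    protein boundaries. Any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the phage proteome.
toy_proteins = ["MAAAK", "MKAA"]
freqs = dipeptide_per_mille(toy_proteins)
print(round(freqs["AA"], 3), classify(freqs["AA"]))
```

On real data, the "over" and "under" labels would correspond to the red and blue markings mentioned above, under the assumption that the page colors cells by comparison with the uniform 2.5‰ expectation.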

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.098 ± 1.609
AlaCys: 0.736 ± 0.201
AlaAsp: 6.186 ± 0.695
AlaGlu: 7.586 ± 0.934
AlaPhe: 2.946 ± 0.429
AlaGly: 8.838 ± 0.906
AlaHis: 1.62 ± 0.438
AlaIle: 6.555 ± 0.612
AlaLys: 5.303 ± 0.65
AlaLeu: 8.47 ± 0.801
AlaMet: 3.093 ± 0.509
AlaAsn: 3.461 ± 0.582
AlaPro: 5.966 ± 1.348
AlaGln: 4.787 ± 0.655
AlaArg: 6.334 ± 0.672
AlaSer: 5.892 ± 1.149
AlaThr: 8.101 ± 1.273
AlaVal: 8.47 ± 0.723
AlaTrp: 1.473 ± 0.31
AlaTyr: 3.241 ± 0.483
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.884 ± 0.218
CysCys: 0.074 ± 0.061
CysAsp: 0.81 ± 0.226
CysGlu: 0.516 ± 0.178
CysPhe: 0.295 ± 0.127
CysGly: 1.547 ± 0.503
CysHis: 0.221 ± 0.123
CysIle: 0.368 ± 0.14
CysLys: 0.884 ± 0.274
CysLeu: 0.516 ± 0.144
CysMet: 0.147 ± 0.119
CysAsn: 0.147 ± 0.093
CysPro: 0.736 ± 0.248
CysGln: 0.368 ± 0.15
CysArg: 0.663 ± 0.303
CysSer: 0.957 ± 0.218
CysThr: 0.516 ± 0.173
CysVal: 0.884 ± 0.261
CysTrp: 0.0 ± 0.0
CysTyr: 0.516 ± 0.19
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.218 ± 0.707
AspCys: 1.105 ± 0.239
AspAsp: 2.578 ± 0.505
AspGlu: 3.461 ± 0.486
AspPhe: 1.768 ± 0.423
AspGly: 5.892 ± 1.045
AspHis: 0.957 ± 0.249
AspIle: 4.124 ± 0.483
AspLys: 2.725 ± 0.527
AspLeu: 4.934 ± 0.605
AspMet: 1.841 ± 0.331
AspAsn: 2.872 ± 0.473
AspPro: 3.83 ± 0.536
AspGln: 2.062 ± 0.455
AspArg: 3.02 ± 0.337
AspSer: 2.651 ± 0.491
AspThr: 3.388 ± 0.558
AspVal: 4.272 ± 0.488
AspTrp: 1.252 ± 0.386
AspTyr: 2.946 ± 0.349
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.175 ± 0.959
GluCys: 0.442 ± 0.287
GluAsp: 3.977 ± 0.46
GluGlu: 3.609 ± 0.661
GluPhe: 2.504 ± 0.442
GluGly: 3.093 ± 0.419
GluHis: 1.031 ± 0.242
GluIle: 3.388 ± 0.547
GluLys: 3.682 ± 0.714
GluLeu: 6.407 ± 0.757
GluMet: 1.252 ± 0.246
GluAsn: 1.841 ± 0.381
GluPro: 2.283 ± 0.565
GluGln: 2.872 ± 0.374
GluArg: 3.461 ± 0.621
GluSer: 3.093 ± 0.474
GluThr: 4.787 ± 0.56
GluVal: 3.682 ± 0.456
GluTrp: 1.399 ± 0.253
GluTyr: 1.694 ± 0.383
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.241 ± 0.436
PheCys: 0.368 ± 0.139
PheAsp: 2.43 ± 0.362
PheGlu: 2.578 ± 0.464
PhePhe: 1.105 ± 0.279
PheGly: 4.787 ± 0.612
PheHis: 0.368 ± 0.167
PheIle: 1.326 ± 0.37
PheLys: 1.105 ± 0.244
PheLeu: 2.578 ± 0.389
PheMet: 0.663 ± 0.244
PheAsn: 1.989 ± 0.34
PhePro: 1.547 ± 0.266
PheGln: 1.473 ± 0.343
PheArg: 1.547 ± 0.389
PheSer: 1.547 ± 0.311
PheThr: 1.547 ± 0.366
PheVal: 2.136 ± 0.323
PheTrp: 0.81 ± 0.253
PheTyr: 0.884 ± 0.22
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.175 ± 0.944
GlyCys: 0.736 ± 0.242
GlyAsp: 4.787 ± 0.596
GlyGlu: 5.229 ± 0.736
GlyPhe: 2.872 ± 0.39
GlyGly: 6.997 ± 0.767
GlyHis: 1.547 ± 0.305
GlyIle: 3.756 ± 0.654
GlyLys: 4.493 ± 0.711
GlyLeu: 6.186 ± 0.577
GlyMet: 1.326 ± 0.275
GlyAsn: 3.83 ± 0.612
GlyPro: 4.198 ± 0.787
GlyGln: 2.43 ± 0.353
GlyArg: 6.039 ± 0.607
GlySer: 5.008 ± 0.766
GlyThr: 5.082 ± 0.71
GlyVal: 7.439 ± 0.656
GlyTrp: 1.399 ± 0.408
GlyTyr: 2.872 ± 0.492
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.105 ± 0.265
HisCys: 0.368 ± 0.164
HisAsp: 0.81 ± 0.237
HisGlu: 0.663 ± 0.253
HisPhe: 0.589 ± 0.184
HisGly: 0.957 ± 0.382
HisHis: 0.295 ± 0.116
HisIle: 0.957 ± 0.288
HisLys: 0.884 ± 0.245
HisLeu: 1.326 ± 0.283
HisMet: 0.221 ± 0.161
HisAsn: 0.295 ± 0.116
HisPro: 1.62 ± 0.366
HisGln: 0.147 ± 0.103
HisArg: 1.031 ± 0.329
HisSer: 0.663 ± 0.218
HisThr: 0.516 ± 0.149
HisVal: 1.252 ± 0.318
HisTrp: 0.295 ± 0.125
HisTyr: 0.589 ± 0.189
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.566 ± 0.78
IleCys: 0.516 ± 0.206
IleAsp: 3.682 ± 0.387
IleGlu: 3.461 ± 0.479
IlePhe: 1.915 ± 0.323
IleGly: 3.02 ± 0.506
IleHis: 1.031 ± 0.259
IleIle: 2.872 ± 0.468
IleLys: 2.504 ± 0.436
IleLeu: 3.535 ± 0.479
IleMet: 0.884 ± 0.273
IleAsn: 2.725 ± 0.441
IlePro: 3.02 ± 0.539
IleGln: 1.694 ± 0.316
IleArg: 3.461 ± 0.474
IleSer: 3.241 ± 0.558
IleThr: 3.903 ± 0.586
IleVal: 3.535 ± 0.697
IleTrp: 0.663 ± 0.204
IleTyr: 1.399 ± 0.339
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.997 ± 0.921
LysCys: 0.81 ± 0.251
LysAsp: 3.756 ± 0.736
LysGlu: 2.799 ± 0.558
LysPhe: 1.915 ± 0.363
LysGly: 3.756 ± 0.466
LysHis: 0.589 ± 0.179
LysIle: 3.093 ± 0.592
LysLys: 2.872 ± 0.582
LysLeu: 4.493 ± 0.607
LysMet: 0.884 ± 0.281
LysAsn: 1.473 ± 0.325
LysPro: 2.209 ± 0.361
LysGln: 1.915 ± 0.339
LysArg: 3.83 ± 0.534
LysSer: 1.915 ± 0.457
LysThr: 3.388 ± 0.58
LysVal: 2.651 ± 0.45
LysTrp: 0.663 ± 0.2
LysTyr: 1.252 ± 0.323
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.943 ± 1.099
LeuCys: 1.031 ± 0.26
LeuAsp: 5.008 ± 0.63
LeuGlu: 4.934 ± 0.561
LeuPhe: 2.872 ± 0.475
LeuGly: 5.229 ± 0.667
LeuHis: 0.663 ± 0.168
LeuIle: 3.609 ± 0.526
LeuLys: 4.051 ± 0.559
LeuLeu: 4.051 ± 0.513
LeuMet: 0.957 ± 0.287
LeuAsn: 3.241 ± 0.593
LeuPro: 4.124 ± 0.672
LeuGln: 3.609 ± 0.492
LeuArg: 4.861 ± 0.627
LeuSer: 3.756 ± 0.523
LeuThr: 4.64 ± 0.57
LeuVal: 5.966 ± 0.772
LeuTrp: 0.663 ± 0.193
LeuTyr: 2.946 ± 0.502
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.651 ± 0.575
MetCys: 0.147 ± 0.098
MetAsp: 1.105 ± 0.303
MetGlu: 0.368 ± 0.144
MetPhe: 0.736 ± 0.204
MetGly: 1.915 ± 0.381
MetHis: 0.368 ± 0.181
MetIle: 0.516 ± 0.185
MetLys: 0.957 ± 0.366
MetLeu: 1.399 ± 0.297
MetMet: 0.147 ± 0.085
MetAsn: 0.736 ± 0.28
MetPro: 1.178 ± 0.272
MetGln: 0.663 ± 0.193
MetArg: 1.326 ± 0.345
MetSer: 0.884 ± 0.223
MetThr: 1.105 ± 0.255
MetVal: 1.326 ± 0.342
MetTrp: 0.147 ± 0.091
MetTyr: 0.295 ± 0.125
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.64 ± 0.538
AsnCys: 0.957 ± 0.225
AsnAsp: 1.989 ± 0.411
AsnGlu: 2.725 ± 0.517
AsnPhe: 1.399 ± 0.283
AsnGly: 4.714 ± 0.654
AsnHis: 0.442 ± 0.203
AsnIle: 1.915 ± 0.434
AsnLys: 2.504 ± 0.518
AsnLeu: 2.872 ± 0.348
AsnMet: 0.663 ± 0.206
AsnAsn: 1.62 ± 0.349
AsnPro: 2.725 ± 0.408
AsnGln: 1.326 ± 0.336
AsnArg: 2.209 ± 0.42
AsnSer: 1.915 ± 0.428
AsnThr: 1.62 ± 0.403
AsnVal: 3.02 ± 0.468
AsnTrp: 0.442 ± 0.169
AsnTyr: 0.81 ± 0.227
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.776 ± 1.705
ProCys: 0.368 ± 0.241
ProAsp: 4.566 ± 0.512
ProGlu: 3.241 ± 0.479
ProPhe: 1.768 ± 0.425
ProGly: 5.303 ± 0.896
ProHis: 0.589 ± 0.217
ProIle: 2.357 ± 0.409
ProLys: 2.872 ± 0.484
ProLeu: 3.241 ± 0.576
ProMet: 0.957 ± 0.228
ProAsn: 2.062 ± 0.403
ProPro: 1.547 ± 0.338
ProGln: 1.547 ± 0.288
ProArg: 2.357 ± 0.385
ProSer: 2.872 ± 0.373
ProThr: 2.578 ± 0.44
ProVal: 3.02 ± 0.451
ProTrp: 0.884 ± 0.204
ProTyr: 0.957 ± 0.272
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.756 ± 0.592
GlnCys: 0.736 ± 0.304
GlnAsp: 1.768 ± 0.371
GlnGlu: 2.062 ± 0.414
GlnPhe: 1.768 ± 0.369
GlnGly: 2.209 ± 0.369
GlnHis: 0.736 ± 0.193
GlnIle: 1.694 ± 0.36
GlnLys: 1.841 ± 0.347
GlnLeu: 3.903 ± 0.582
GlnMet: 0.736 ± 0.247
GlnAsn: 1.473 ± 0.238
GlnPro: 1.768 ± 0.365
GlnGln: 1.768 ± 0.391
GlnArg: 2.651 ± 0.388
GlnSer: 2.578 ± 0.466
GlnThr: 1.989 ± 0.372
GlnVal: 2.209 ± 0.299
GlnTrp: 0.589 ± 0.19
GlnTyr: 0.663 ± 0.203
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.966 ± 0.698
ArgCys: 0.589 ± 0.215
ArgAsp: 3.977 ± 0.52
ArgGlu: 5.082 ± 0.744
ArgPhe: 2.209 ± 0.334
ArgGly: 3.83 ± 0.565
ArgHis: 0.957 ± 0.346
ArgIle: 3.977 ± 0.505
ArgLys: 3.609 ± 0.435
ArgLeu: 4.566 ± 0.538
ArgMet: 1.105 ± 0.351
ArgAsn: 2.799 ± 0.492
ArgPro: 2.283 ± 0.516
ArgGln: 2.283 ± 0.394
ArgArg: 4.714 ± 0.59
ArgSer: 2.872 ± 0.482
ArgThr: 1.989 ± 0.36
ArgVal: 4.934 ± 0.604
ArgTrp: 0.442 ± 0.178
ArgTyr: 1.915 ± 0.385
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.524 ± 0.667
SerCys: 0.074 ± 0.07
SerAsp: 3.682 ± 0.481
SerGlu: 3.167 ± 0.435
SerPhe: 1.694 ± 0.335
SerGly: 6.186 ± 0.556
SerHis: 0.663 ± 0.262
SerIle: 2.504 ± 0.415
SerLys: 2.504 ± 0.599
SerLeu: 4.345 ± 0.507
SerMet: 0.589 ± 0.263
SerAsn: 2.062 ± 0.432
SerPro: 2.651 ± 0.407
SerGln: 2.062 ± 0.417
SerArg: 2.946 ± 0.496
SerSer: 2.946 ± 0.43
SerThr: 3.388 ± 0.453
SerVal: 4.787 ± 0.644
SerTrp: 0.589 ± 0.245
SerTyr: 1.178 ± 0.28
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.807 ± 1.073
ThrCys: 0.368 ± 0.14
ThrAsp: 3.314 ± 0.531
ThrGlu: 3.461 ± 0.514
ThrPhe: 2.136 ± 0.442
ThrGly: 7.439 ± 0.767
ThrHis: 0.516 ± 0.198
ThrIle: 3.241 ± 0.45
ThrLys: 3.02 ± 0.455
ThrLeu: 4.566 ± 0.668
ThrMet: 0.442 ± 0.158
ThrAsn: 1.915 ± 0.371
ThrPro: 3.682 ± 0.585
ThrGln: 1.989 ± 0.407
ThrArg: 2.43 ± 0.387
ThrSer: 2.946 ± 0.559
ThrThr: 4.198 ± 0.772
ThrVal: 5.229 ± 0.863
ThrTrp: 0.516 ± 0.165
ThrTyr: 2.062 ± 0.357
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.659 ± 1.091
ValCys: 0.81 ± 0.234
ValAsp: 5.892 ± 0.616
ValGlu: 5.376 ± 0.475
ValPhe: 2.062 ± 0.343
ValGly: 4.64 ± 0.47
ValHis: 0.884 ± 0.321
ValIle: 3.83 ± 0.602
ValLys: 4.051 ± 0.703
ValLeu: 4.714 ± 0.644
ValMet: 1.547 ± 0.385
ValAsn: 4.198 ± 0.496
ValPro: 2.283 ± 0.464
ValGln: 2.504 ± 0.341
ValArg: 3.977 ± 0.309
ValSer: 4.787 ± 0.646
ValThr: 5.597 ± 0.66
ValVal: 5.155 ± 0.732
ValTrp: 1.105 ± 0.323
ValTyr: 2.136 ± 0.364
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.178 ± 0.225
TrpCys: 0.221 ± 0.137
TrpAsp: 0.884 ± 0.184
TrpGlu: 1.105 ± 0.274
TrpPhe: 0.295 ± 0.144
TrpGly: 1.252 ± 0.261
TrpHis: 0.589 ± 0.192
TrpIle: 0.295 ± 0.121
TrpLys: 0.589 ± 0.174
TrpLeu: 1.915 ± 0.388
TrpMet: 0.221 ± 0.123
TrpAsn: 0.589 ± 0.175
TrpPro: 0.516 ± 0.204
TrpGln: 0.295 ± 0.117
TrpArg: 0.884 ± 0.224
TrpSer: 0.81 ± 0.291
TrpThr: 0.589 ± 0.232
TrpVal: 0.884 ± 0.214
TrpTrp: 0.295 ± 0.125
TrpTyr: 0.663 ± 0.211
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.799 ± 0.371
TyrCys: 0.589 ± 0.225
TyrAsp: 1.694 ± 0.226
TyrGlu: 1.252 ± 0.326
TyrPhe: 1.178 ± 0.274
TyrGly: 2.799 ± 0.493
TyrHis: 0.516 ± 0.172
TyrIle: 1.252 ± 0.313
TyrLys: 0.81 ± 0.214
TyrLeu: 2.136 ± 0.407
TyrMet: 0.221 ± 0.109
TyrAsn: 1.178 ± 0.303
TyrPro: 1.694 ± 0.332
TyrGln: 1.105 ± 0.283
TyrArg: 2.357 ± 0.325
TyrSer: 2.209 ± 0.376
TyrThr: 2.357 ± 0.444
TyrVal: 2.43 ± 0.504
TyrTrp: 0.442 ± 0.203
TyrTyr: 0.736 ± 0.239
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (13579 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
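A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread is reported as the sample standard deviation. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the error measure are hypothetical and may not match how the values after ± were actually produced.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) proteins with replacement, so whole
    proteins (not individual residues) are the resampling unit.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the phage data.
toy_proteins = ["MAAAK", "MKAA", "MGGS"]
print(pair_per_mille(toy_proteins, "AA"), bootstrap_error(toy_proteins, "AA"))
```

Resampling at the protein level keeps each sequence intact, so the estimate reflects variation between proteins rather than treating neighbouring residues as independent observations.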

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski