Amino acid dipepetide frequency for Shigella phage Sd1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.172AlaAla: 8.172 ± 1.121
0.554AlaCys: 0.554 ± 0.185
4.294AlaAsp: 4.294 ± 0.605
5.471AlaGlu: 5.471 ± 1.093
2.909AlaPhe: 2.909 ± 0.451
7.41AlaGly: 7.41 ± 1.1
1.039AlaHis: 1.039 ± 0.289
5.956AlaIle: 5.956 ± 0.501
5.609AlaLys: 5.609 ± 0.889
7.064AlaLeu: 7.064 ± 0.908
2.77AlaMet: 2.77 ± 0.439
4.155AlaAsn: 4.155 ± 0.749
1.731AlaPro: 1.731 ± 0.331
2.562AlaGln: 2.562 ± 0.548
4.224AlaArg: 4.224 ± 0.541
4.501AlaSer: 4.501 ± 0.506
5.471AlaThr: 5.471 ± 0.718
5.332AlaVal: 5.332 ± 0.861
1.108AlaTrp: 1.108 ± 0.265
2.978AlaTyr: 2.978 ± 0.482
0.0AlaXaa: 0.0 ± 0.0
Cys
0.9CysAla: 0.9 ± 0.319
0.139CysCys: 0.139 ± 0.089
0.831CysAsp: 0.831 ± 0.206
1.108CysGlu: 1.108 ± 0.289
0.623CysPhe: 0.623 ± 0.243
1.039CysGly: 1.039 ± 0.289
0.277CysHis: 0.277 ± 0.135
0.485CysIle: 0.485 ± 0.198
1.039CysLys: 1.039 ± 0.284
0.693CysLeu: 0.693 ± 0.198
0.346CysMet: 0.346 ± 0.155
0.416CysAsn: 0.416 ± 0.178
0.346CysPro: 0.346 ± 0.157
0.485CysGln: 0.485 ± 0.182
0.693CysArg: 0.693 ± 0.24
0.97CysSer: 0.97 ± 0.28
0.762CysThr: 0.762 ± 0.259
0.623CysVal: 0.623 ± 0.18
0.139CysTrp: 0.139 ± 0.095
0.485CysTyr: 0.485 ± 0.174
0.0CysXaa: 0.0 ± 0.0
Asp
4.155AspAla: 4.155 ± 0.545
0.693AspCys: 0.693 ± 0.238
4.432AspAsp: 4.432 ± 0.649
3.878AspGlu: 3.878 ± 0.544
2.562AspPhe: 2.562 ± 0.402
7.964AspGly: 7.964 ± 1.05
1.039AspHis: 1.039 ± 0.327
4.155AspIle: 4.155 ± 0.446
4.432AspLys: 4.432 ± 0.493
4.363AspLeu: 4.363 ± 0.691
2.008AspMet: 2.008 ± 0.421
2.77AspAsn: 2.77 ± 0.531
1.524AspPro: 1.524 ± 0.345
1.454AspGln: 1.454 ± 0.343
1.662AspArg: 1.662 ± 0.459
3.947AspSer: 3.947 ± 0.511
3.047AspThr: 3.047 ± 0.497
3.809AspVal: 3.809 ± 0.46
0.831AspTrp: 0.831 ± 0.261
2.701AspTyr: 2.701 ± 0.527
0.0AspXaa: 0.0 ± 0.0
Glu
5.54GluAla: 5.54 ± 0.73
0.9GluCys: 0.9 ± 0.266
3.047GluAsp: 3.047 ± 0.496
4.848GluGlu: 4.848 ± 0.785
3.116GluPhe: 3.116 ± 0.485
3.463GluGly: 3.463 ± 0.475
0.9GluHis: 0.9 ± 0.32
5.54GluIle: 5.54 ± 0.551
3.601GluLys: 3.601 ± 0.623
5.679GluLeu: 5.679 ± 0.681
2.77GluMet: 2.77 ± 0.546
3.601GluAsn: 3.601 ± 0.531
1.177GluPro: 1.177 ± 0.283
2.77GluGln: 2.77 ± 0.477
2.978GluArg: 2.978 ± 0.549
3.601GluSer: 3.601 ± 0.621
3.601GluThr: 3.601 ± 0.403
4.986GluVal: 4.986 ± 0.849
0.762GluTrp: 0.762 ± 0.235
2.078GluTyr: 2.078 ± 0.382
0.0GluXaa: 0.0 ± 0.0
Phe
3.047PheAla: 3.047 ± 0.511
0.97PheCys: 0.97 ± 0.258
3.393PheAsp: 3.393 ± 0.464
2.701PheGlu: 2.701 ± 0.488
0.831PhePhe: 0.831 ± 0.267
4.017PheGly: 4.017 ± 0.555
0.485PheHis: 0.485 ± 0.174
2.147PheIle: 2.147 ± 0.359
2.839PheLys: 2.839 ± 0.435
2.285PheLeu: 2.285 ± 0.471
0.762PheMet: 0.762 ± 0.182
2.216PheAsn: 2.216 ± 0.354
1.108PhePro: 1.108 ± 0.27
1.87PheGln: 1.87 ± 0.351
1.731PheArg: 1.731 ± 0.422
2.355PheSer: 2.355 ± 0.369
2.493PheThr: 2.493 ± 0.464
2.632PheVal: 2.632 ± 0.32
0.554PheTrp: 0.554 ± 0.181
1.177PheTyr: 1.177 ± 0.245
0.0PheXaa: 0.0 ± 0.0
Gly
5.125GlyAla: 5.125 ± 0.759
1.316GlyCys: 1.316 ± 0.309
4.778GlyAsp: 4.778 ± 0.593
6.094GlyGlu: 6.094 ± 0.671
3.255GlyPhe: 3.255 ± 0.575
5.679GlyGly: 5.679 ± 0.834
1.039GlyHis: 1.039 ± 0.324
5.125GlyIle: 5.125 ± 0.539
5.263GlyLys: 5.263 ± 0.668
6.717GlyLeu: 6.717 ± 0.777
2.493GlyMet: 2.493 ± 0.503
4.294GlyAsn: 4.294 ± 0.525
3.047GlyPro: 3.047 ± 1.984
2.147GlyGln: 2.147 ± 0.522
2.839GlyArg: 2.839 ± 0.434
6.302GlySer: 6.302 ± 0.832
4.155GlyThr: 4.155 ± 0.699
6.44GlyVal: 6.44 ± 0.76
1.108GlyTrp: 1.108 ± 0.276
3.809GlyTyr: 3.809 ± 0.427
0.0GlyXaa: 0.0 ± 0.0
His
0.9HisAla: 0.9 ± 0.287
0.139HisCys: 0.139 ± 0.102
1.039HisAsp: 1.039 ± 0.281
0.9HisGlu: 0.9 ± 0.261
0.831HisPhe: 0.831 ± 0.227
0.9HisGly: 0.9 ± 0.266
0.346HisHis: 0.346 ± 0.157
0.97HisIle: 0.97 ± 0.251
1.593HisLys: 1.593 ± 0.421
1.454HisLeu: 1.454 ± 0.363
0.277HisMet: 0.277 ± 0.174
0.554HisAsn: 0.554 ± 0.252
0.277HisPro: 0.277 ± 0.153
0.208HisGln: 0.208 ± 0.12
0.831HisArg: 0.831 ± 0.225
0.9HisSer: 0.9 ± 0.246
1.039HisThr: 1.039 ± 0.283
0.623HisVal: 0.623 ± 0.211
0.208HisTrp: 0.208 ± 0.128
0.554HisTyr: 0.554 ± 0.209
0.0HisXaa: 0.0 ± 0.0
Ile
4.363IleAla: 4.363 ± 0.605
0.9IleCys: 0.9 ± 0.208
6.717IleAsp: 6.717 ± 0.877
4.363IleGlu: 4.363 ± 0.582
2.285IlePhe: 2.285 ± 0.38
4.501IleGly: 4.501 ± 0.579
1.108IleHis: 1.108 ± 0.31
3.601IleIle: 3.601 ± 0.678
3.878IleLys: 3.878 ± 0.574
2.424IleLeu: 2.424 ± 0.405
1.524IleMet: 1.524 ± 0.348
4.017IleAsn: 4.017 ± 0.598
2.493IlePro: 2.493 ± 0.441
2.147IleGln: 2.147 ± 0.347
3.186IleArg: 3.186 ± 0.352
4.432IleSer: 4.432 ± 0.592
4.432IleThr: 4.432 ± 0.558
3.601IleVal: 3.601 ± 0.459
0.762IleTrp: 0.762 ± 0.236
2.701IleTyr: 2.701 ± 0.493
0.0IleXaa: 0.0 ± 0.0
Lys
6.51LysAla: 6.51 ± 0.909
0.693LysCys: 0.693 ± 0.211
3.878LysAsp: 3.878 ± 0.577
3.67LysGlu: 3.67 ± 0.672
3.047LysPhe: 3.047 ± 0.566
3.324LysGly: 3.324 ± 0.568
0.97LysHis: 0.97 ± 0.257
3.047LysIle: 3.047 ± 0.412
4.086LysLys: 4.086 ± 0.657
6.371LysLeu: 6.371 ± 0.744
2.978LysMet: 2.978 ± 0.56
2.424LysAsn: 2.424 ± 0.445
1.801LysPro: 1.801 ± 0.464
2.285LysGln: 2.285 ± 0.51
3.255LysArg: 3.255 ± 0.703
4.64LysSer: 4.64 ± 0.764
3.324LysThr: 3.324 ± 0.433
4.432LysVal: 4.432 ± 0.624
0.831LysTrp: 0.831 ± 0.248
2.77LysTyr: 2.77 ± 0.418
0.0LysXaa: 0.0 ± 0.0
Leu
6.787LeuAla: 6.787 ± 0.742
0.9LeuCys: 0.9 ± 0.267
4.155LeuAsp: 4.155 ± 0.468
3.67LeuGlu: 3.67 ± 0.533
2.008LeuPhe: 2.008 ± 0.299
4.709LeuGly: 4.709 ± 0.72
0.762LeuHis: 0.762 ± 0.258
3.532LeuIle: 3.532 ± 0.421
4.709LeuLys: 4.709 ± 0.679
4.848LeuLeu: 4.848 ± 0.656
1.593LeuMet: 1.593 ± 0.382
3.463LeuAsn: 3.463 ± 0.459
3.255LeuPro: 3.255 ± 0.459
2.77LeuGln: 2.77 ± 0.694
4.432LeuArg: 4.432 ± 0.575
6.025LeuSer: 6.025 ± 0.625
5.194LeuThr: 5.194 ± 0.599
5.125LeuVal: 5.125 ± 0.531
0.693LeuTrp: 0.693 ± 0.222
2.285LeuTyr: 2.285 ± 0.321
0.0LeuXaa: 0.0 ± 0.0
Met
4.017MetAla: 4.017 ± 0.463
0.277MetCys: 0.277 ± 0.133
0.693MetAsp: 0.693 ± 0.226
1.177MetGlu: 1.177 ± 0.321
0.762MetPhe: 0.762 ± 0.235
1.316MetGly: 1.316 ± 0.263
0.554MetHis: 0.554 ± 0.188
1.662MetIle: 1.662 ± 0.309
2.701MetLys: 2.701 ± 0.424
2.078MetLeu: 2.078 ± 0.393
1.039MetMet: 1.039 ± 0.325
1.385MetAsn: 1.385 ± 0.328
0.623MetPro: 0.623 ± 0.195
1.108MetGln: 1.108 ± 0.245
1.177MetArg: 1.177 ± 0.289
1.454MetSer: 1.454 ± 0.253
2.078MetThr: 2.078 ± 0.45
2.078MetVal: 2.078 ± 0.513
0.554MetTrp: 0.554 ± 0.182
0.346MetTyr: 0.346 ± 0.137
0.0MetXaa: 0.0 ± 0.0
Asn
4.571AsnAla: 4.571 ± 0.501
0.554AsnCys: 0.554 ± 0.239
3.116AsnAsp: 3.116 ± 0.47
3.393AsnGlu: 3.393 ± 0.543
1.385AsnPhe: 1.385 ± 0.44
5.402AsnGly: 5.402 ± 0.884
1.039AsnHis: 1.039 ± 0.3
2.562AsnIle: 2.562 ± 0.379
2.701AsnLys: 2.701 ± 0.411
3.324AsnLeu: 3.324 ± 0.429
0.831AsnMet: 0.831 ± 0.255
2.909AsnAsn: 2.909 ± 0.468
2.078AsnPro: 2.078 ± 0.38
1.662AsnGln: 1.662 ± 0.427
2.008AsnArg: 2.008 ± 0.358
4.017AsnSer: 4.017 ± 0.657
2.493AsnThr: 2.493 ± 0.444
3.532AsnVal: 3.532 ± 0.453
0.831AsnTrp: 0.831 ± 0.205
1.87AsnTyr: 1.87 ± 0.301
0.0AsnXaa: 0.0 ± 0.0
Pro
3.393ProAla: 3.393 ± 0.748
0.416ProCys: 0.416 ± 0.18
1.87ProAsp: 1.87 ± 0.494
2.77ProGlu: 2.77 ± 0.472
1.87ProPhe: 1.87 ± 0.426
2.355ProGly: 2.355 ± 0.456
0.277ProHis: 0.277 ± 0.113
1.662ProIle: 1.662 ± 0.319
0.9ProLys: 0.9 ± 0.238
1.662ProLeu: 1.662 ± 0.268
0.485ProMet: 0.485 ± 0.181
1.524ProAsn: 1.524 ± 0.325
0.9ProPro: 0.9 ± 0.276
2.285ProGln: 2.285 ± 0.847
1.385ProArg: 1.385 ± 0.27
1.454ProSer: 1.454 ± 0.258
1.454ProThr: 1.454 ± 0.371
3.393ProVal: 3.393 ± 0.449
0.623ProTrp: 0.623 ± 0.203
1.177ProTyr: 1.177 ± 0.279
0.0ProXaa: 0.0 ± 0.0
Gln
3.393GlnAla: 3.393 ± 0.842
0.485GlnCys: 0.485 ± 0.178
1.316GlnAsp: 1.316 ± 0.275
2.701GlnGlu: 2.701 ± 0.477
1.108GlnPhe: 1.108 ± 0.307
3.463GlnGly: 3.463 ± 1.359
0.485GlnHis: 0.485 ± 0.196
3.601GlnIle: 3.601 ± 0.709
2.285GlnLys: 2.285 ± 0.492
3.047GlnLeu: 3.047 ± 0.499
0.9GlnMet: 0.9 ± 0.238
1.87GlnAsn: 1.87 ± 0.435
1.454GlnPro: 1.454 ± 0.305
2.216GlnGln: 2.216 ± 0.774
1.662GlnArg: 1.662 ± 0.46
2.632GlnSer: 2.632 ± 0.41
1.801GlnThr: 1.801 ± 0.468
2.008GlnVal: 2.008 ± 0.335
0.416GlnTrp: 0.416 ± 0.147
1.454GlnTyr: 1.454 ± 0.343
0.0GlnXaa: 0.0 ± 0.0
Arg
3.186ArgAla: 3.186 ± 0.4
0.831ArgCys: 0.831 ± 0.338
2.285ArgAsp: 2.285 ± 0.419
3.878ArgGlu: 3.878 ± 0.538
2.632ArgPhe: 2.632 ± 0.296
2.839ArgGly: 2.839 ± 0.303
0.623ArgHis: 0.623 ± 0.193
3.186ArgIle: 3.186 ± 0.516
3.047ArgLys: 3.047 ± 0.444
3.463ArgLeu: 3.463 ± 0.541
0.762ArgMet: 0.762 ± 0.26
2.285ArgAsn: 2.285 ± 0.366
1.385ArgPro: 1.385 ± 0.336
2.078ArgGln: 2.078 ± 0.402
2.355ArgArg: 2.355 ± 0.355
2.216ArgSer: 2.216 ± 0.386
1.939ArgThr: 1.939 ± 0.407
3.947ArgVal: 3.947 ± 0.44
0.693ArgTrp: 0.693 ± 0.183
1.87ArgTyr: 1.87 ± 0.268
0.0ArgXaa: 0.0 ± 0.0
Ser
5.609SerAla: 5.609 ± 0.709
0.623SerCys: 0.623 ± 0.215
4.709SerAsp: 4.709 ± 0.574
4.294SerGlu: 4.294 ± 0.49
2.909SerPhe: 2.909 ± 0.382
8.102SerGly: 8.102 ± 1.019
0.623SerHis: 0.623 ± 0.206
3.74SerIle: 3.74 ± 0.42
3.947SerLys: 3.947 ± 0.569
4.709SerLeu: 4.709 ± 0.709
1.385SerMet: 1.385 ± 0.266
3.324SerAsn: 3.324 ± 0.708
2.147SerPro: 2.147 ± 0.316
3.324SerGln: 3.324 ± 0.556
2.147SerArg: 2.147 ± 0.487
5.194SerSer: 5.194 ± 1.133
3.67SerThr: 3.67 ± 0.506
5.471SerVal: 5.471 ± 0.695
0.693SerTrp: 0.693 ± 0.209
3.186SerTyr: 3.186 ± 0.474
0.0SerXaa: 0.0 ± 0.0
Thr
5.609ThrAla: 5.609 ± 0.744
0.762ThrCys: 0.762 ± 0.192
2.632ThrAsp: 2.632 ± 0.313
3.186ThrGlu: 3.186 ± 0.4
2.285ThrPhe: 2.285 ± 0.407
6.44ThrGly: 6.44 ± 0.78
0.693ThrHis: 0.693 ± 0.226
4.501ThrIle: 4.501 ± 0.43
3.255ThrLys: 3.255 ± 0.69
3.74ThrLeu: 3.74 ± 0.509
1.177ThrMet: 1.177 ± 0.256
2.978ThrAsn: 2.978 ± 0.491
2.701ThrPro: 2.701 ± 0.445
2.285ThrGln: 2.285 ± 0.443
1.87ThrArg: 1.87 ± 0.34
4.432ThrSer: 4.432 ± 0.609
3.532ThrThr: 3.532 ± 0.667
4.294ThrVal: 4.294 ± 0.536
0.762ThrTrp: 0.762 ± 0.254
2.078ThrTyr: 2.078 ± 0.43
0.0ThrXaa: 0.0 ± 0.0
Val
5.055ValAla: 5.055 ± 0.761
0.623ValCys: 0.623 ± 0.197
4.64ValAsp: 4.64 ± 0.496
3.67ValGlu: 3.67 ± 0.658
2.77ValPhe: 2.77 ± 0.47
4.086ValGly: 4.086 ± 0.657
1.177ValHis: 1.177 ± 0.307
4.778ValIle: 4.778 ± 0.493
5.125ValLys: 5.125 ± 0.553
3.67ValLeu: 3.67 ± 0.46
1.593ValMet: 1.593 ± 0.284
3.463ValAsn: 3.463 ± 0.48
2.285ValPro: 2.285 ± 0.448
2.424ValGln: 2.424 ± 0.515
4.086ValArg: 4.086 ± 0.624
7.133ValSer: 7.133 ± 0.702
4.778ValThr: 4.778 ± 0.539
4.64ValVal: 4.64 ± 0.647
0.831ValTrp: 0.831 ± 0.224
3.393ValTyr: 3.393 ± 0.53
0.0ValXaa: 0.0 ± 0.0
Trp
1.039TrpAla: 1.039 ± 0.275
0.139TrpCys: 0.139 ± 0.102
0.693TrpAsp: 0.693 ± 0.188
0.623TrpGlu: 0.623 ± 0.189
0.831TrpPhe: 0.831 ± 0.252
1.108TrpGly: 1.108 ± 0.271
0.346TrpHis: 0.346 ± 0.151
0.9TrpIle: 0.9 ± 0.214
0.9TrpLys: 0.9 ± 0.222
1.039TrpLeu: 1.039 ± 0.27
0.346TrpMet: 0.346 ± 0.162
0.346TrpAsn: 0.346 ± 0.153
0.346TrpPro: 0.346 ± 0.125
0.208TrpGln: 0.208 ± 0.13
0.97TrpArg: 0.97 ± 0.264
0.762TrpSer: 0.762 ± 0.283
0.9TrpThr: 0.9 ± 0.205
0.97TrpVal: 0.97 ± 0.237
0.208TrpTrp: 0.208 ± 0.114
0.416TrpTyr: 0.416 ± 0.156
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.939TyrAla: 1.939 ± 0.336
0.485TyrCys: 0.485 ± 0.234
2.909TyrAsp: 2.909 ± 0.528
2.562TyrGlu: 2.562 ± 0.422
1.593TyrPhe: 1.593 ± 0.323
2.909TyrGly: 2.909 ± 0.4
0.762TyrHis: 0.762 ± 0.255
2.285TyrIle: 2.285 ± 0.424
2.424TyrLys: 2.424 ± 0.571
2.355TyrLeu: 2.355 ± 0.412
0.97TyrMet: 0.97 ± 0.209
2.285TyrAsn: 2.285 ± 0.395
1.177TyrPro: 1.177 ± 0.266
2.008TyrGln: 2.008 ± 0.284
2.008TyrArg: 2.008 ± 0.381
2.839TyrSer: 2.839 ± 0.552
3.047TyrThr: 3.047 ± 0.467
2.285TyrVal: 2.285 ± 0.328
0.485TyrTrp: 0.485 ± 0.205
1.385TyrTyr: 1.385 ± 0.337
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (14441 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski