Amino acid dipepetide frequency for Shigella phage pSb-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.939AlaAla: 7.939 ± 1.101
0.639AlaCys: 0.639 ± 0.212
4.654AlaAsp: 4.654 ± 0.483
5.201AlaGlu: 5.201 ± 0.499
2.373AlaPhe: 2.373 ± 0.373
6.57AlaGly: 6.57 ± 0.737
1.049AlaHis: 1.049 ± 0.167
4.928AlaIle: 4.928 ± 0.446
5.703AlaLys: 5.703 ± 0.637
7.528AlaLeu: 7.528 ± 0.813
2.418AlaMet: 2.418 ± 0.425
5.156AlaAsn: 5.156 ± 0.484
2.236AlaPro: 2.236 ± 0.348
3.787AlaGln: 3.787 ± 0.636
3.97AlaArg: 3.97 ± 0.531
4.836AlaSer: 4.836 ± 0.493
5.977AlaThr: 5.977 ± 0.753
5.521AlaVal: 5.521 ± 0.522
0.867AlaTrp: 0.867 ± 0.194
3.559AlaTyr: 3.559 ± 0.356
0.0AlaXaa: 0.0 ± 0.0
Cys
0.548CysAla: 0.548 ± 0.158
0.091CysCys: 0.091 ± 0.063
0.456CysAsp: 0.456 ± 0.141
0.502CysGlu: 0.502 ± 0.143
0.502CysPhe: 0.502 ± 0.142
0.456CysGly: 0.456 ± 0.137
0.319CysHis: 0.319 ± 0.115
0.73CysIle: 0.73 ± 0.191
0.776CysLys: 0.776 ± 0.163
0.776CysLeu: 0.776 ± 0.221
0.365CysMet: 0.365 ± 0.132
0.639CysAsn: 0.639 ± 0.153
0.319CysPro: 0.319 ± 0.158
0.137CysGln: 0.137 ± 0.075
0.274CysArg: 0.274 ± 0.121
0.365CysSer: 0.365 ± 0.142
0.639CysThr: 0.639 ± 0.244
0.776CysVal: 0.776 ± 0.249
0.228CysTrp: 0.228 ± 0.104
0.137CysTyr: 0.137 ± 0.087
0.0CysXaa: 0.0 ± 0.0
Asp
4.791AspAla: 4.791 ± 0.563
0.821AspCys: 0.821 ± 0.203
3.376AspAsp: 3.376 ± 0.458
3.422AspGlu: 3.422 ± 0.47
2.281AspPhe: 2.281 ± 0.293
3.65AspGly: 3.65 ± 0.568
0.821AspHis: 0.821 ± 0.197
4.882AspIle: 4.882 ± 0.574
3.468AspLys: 3.468 ± 0.363
4.928AspLeu: 4.928 ± 0.582
1.779AspMet: 1.779 ± 0.297
2.509AspAsn: 2.509 ± 0.252
2.692AspPro: 2.692 ± 0.357
2.144AspGln: 2.144 ± 0.331
2.738AspArg: 2.738 ± 0.338
4.106AspSer: 4.106 ± 0.397
3.924AspThr: 3.924 ± 0.415
3.513AspVal: 3.513 ± 0.424
0.593AspTrp: 0.593 ± 0.174
2.327AspTyr: 2.327 ± 0.316
0.0AspXaa: 0.0 ± 0.0
Glu
6.844GluAla: 6.844 ± 0.734
0.411GluCys: 0.411 ± 0.157
4.106GluAsp: 4.106 ± 0.503
5.338GluGlu: 5.338 ± 0.83
2.738GluPhe: 2.738 ± 0.363
3.376GluGly: 3.376 ± 0.349
0.73GluHis: 0.73 ± 0.175
3.057GluIle: 3.057 ± 0.39
3.833GluLys: 3.833 ± 0.487
5.658GluLeu: 5.658 ± 0.497
1.916GluMet: 1.916 ± 0.266
2.829GluAsn: 2.829 ± 0.337
3.239GluPro: 3.239 ± 0.536
2.646GluGln: 2.646 ± 0.455
2.053GluArg: 2.053 ± 0.351
3.011GluSer: 3.011 ± 0.374
3.331GluThr: 3.331 ± 0.332
4.563GluVal: 4.563 ± 0.573
0.73GluTrp: 0.73 ± 0.166
2.692GluTyr: 2.692 ± 0.371
0.0GluXaa: 0.0 ± 0.0
Phe
2.464PheAla: 2.464 ± 0.393
0.456PheCys: 0.456 ± 0.157
2.418PheAsp: 2.418 ± 0.331
2.008PheGlu: 2.008 ± 0.327
1.186PhePhe: 1.186 ± 0.24
2.418PheGly: 2.418 ± 0.269
0.548PheHis: 0.548 ± 0.142
2.418PheIle: 2.418 ± 0.355
1.962PheLys: 1.962 ± 0.325
2.373PheLeu: 2.373 ± 0.305
1.597PheMet: 1.597 ± 0.291
2.19PheAsn: 2.19 ± 0.331
1.095PhePro: 1.095 ± 0.234
1.551PheGln: 1.551 ± 0.241
1.871PheArg: 1.871 ± 0.247
2.236PheSer: 2.236 ± 0.328
3.011PheThr: 3.011 ± 0.289
2.053PheVal: 2.053 ± 0.368
0.319PheTrp: 0.319 ± 0.106
1.323PheTyr: 1.323 ± 0.262
0.0PheXaa: 0.0 ± 0.0
Gly
4.973GlyAla: 4.973 ± 0.632
0.776GlyCys: 0.776 ± 0.218
2.92GlyAsp: 2.92 ± 0.368
3.97GlyGlu: 3.97 ± 0.39
2.783GlyPhe: 2.783 ± 0.37
4.198GlyGly: 4.198 ± 0.536
1.141GlyHis: 1.141 ± 0.245
3.878GlyIle: 3.878 ± 0.453
5.703GlyLys: 5.703 ± 0.523
4.517GlyLeu: 4.517 ± 0.556
2.327GlyMet: 2.327 ± 0.276
4.882GlyAsn: 4.882 ± 0.477
1.004GlyPro: 1.004 ± 0.208
2.144GlyGln: 2.144 ± 0.361
2.783GlyArg: 2.783 ± 0.44
4.928GlySer: 4.928 ± 0.449
4.836GlyThr: 4.836 ± 0.43
4.243GlyVal: 4.243 ± 0.472
1.004GlyTrp: 1.004 ± 0.197
2.738GlyTyr: 2.738 ± 0.271
0.0GlyXaa: 0.0 ± 0.0
His
1.46HisAla: 1.46 ± 0.227
0.137HisCys: 0.137 ± 0.105
1.095HisAsp: 1.095 ± 0.276
0.821HisGlu: 0.821 ± 0.208
0.456HisPhe: 0.456 ± 0.151
0.73HisGly: 0.73 ± 0.21
0.319HisHis: 0.319 ± 0.118
1.141HisIle: 1.141 ± 0.268
1.506HisLys: 1.506 ± 0.256
1.962HisLeu: 1.962 ± 0.32
0.365HisMet: 0.365 ± 0.132
0.867HisAsn: 0.867 ± 0.189
0.821HisPro: 0.821 ± 0.212
0.548HisGln: 0.548 ± 0.165
0.684HisArg: 0.684 ± 0.192
1.597HisSer: 1.597 ± 0.291
0.776HisThr: 0.776 ± 0.184
0.776HisVal: 0.776 ± 0.173
0.502HisTrp: 0.502 ± 0.138
0.958HisTyr: 0.958 ± 0.189
0.0HisXaa: 0.0 ± 0.0
Ile
4.882IleAla: 4.882 ± 0.501
0.411IleCys: 0.411 ± 0.121
4.106IleAsp: 4.106 ± 0.429
3.833IleGlu: 3.833 ± 0.424
1.597IlePhe: 1.597 ± 0.303
3.331IleGly: 3.331 ± 0.426
1.232IleHis: 1.232 ± 0.253
3.239IleIle: 3.239 ± 0.469
3.787IleLys: 3.787 ± 0.482
4.426IleLeu: 4.426 ± 0.512
1.323IleMet: 1.323 ± 0.208
3.331IleAsn: 3.331 ± 0.47
3.513IlePro: 3.513 ± 0.386
2.783IleGln: 2.783 ± 0.335
2.738IleArg: 2.738 ± 0.391
3.468IleSer: 3.468 ± 0.398
4.426IleThr: 4.426 ± 0.449
3.696IleVal: 3.696 ± 0.467
0.456IleTrp: 0.456 ± 0.158
2.281IleTyr: 2.281 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
6.89LysAla: 6.89 ± 0.631
0.228LysCys: 0.228 ± 0.106
3.833LysAsp: 3.833 ± 0.471
4.335LysGlu: 4.335 ± 0.645
2.144LysPhe: 2.144 ± 0.343
3.878LysGly: 3.878 ± 0.451
1.232LysHis: 1.232 ± 0.302
2.966LysIle: 2.966 ± 0.336
3.103LysLys: 3.103 ± 0.442
6.388LysLeu: 6.388 ± 0.576
2.099LysMet: 2.099 ± 0.375
3.468LysAsn: 3.468 ± 0.427
3.285LysPro: 3.285 ± 0.55
2.829LysGln: 2.829 ± 0.328
3.011LysArg: 3.011 ± 0.364
3.741LysSer: 3.741 ± 0.473
4.198LysThr: 4.198 ± 0.477
3.696LysVal: 3.696 ± 0.306
0.73LysTrp: 0.73 ± 0.185
2.19LysTyr: 2.19 ± 0.315
0.0LysXaa: 0.0 ± 0.0
Leu
7.574LeuAla: 7.574 ± 0.627
0.913LeuCys: 0.913 ± 0.199
4.791LeuAsp: 4.791 ± 0.41
4.608LeuGlu: 4.608 ± 0.483
3.011LeuPhe: 3.011 ± 0.403
6.068LeuGly: 6.068 ± 0.569
1.551LeuHis: 1.551 ± 0.324
4.517LeuIle: 4.517 ± 0.575
5.019LeuLys: 5.019 ± 0.439
6.251LeuLeu: 6.251 ± 0.641
2.692LeuMet: 2.692 ± 0.43
5.019LeuAsn: 5.019 ± 0.389
3.97LeuPro: 3.97 ± 0.39
2.966LeuGln: 2.966 ± 0.372
3.741LeuArg: 3.741 ± 0.502
4.928LeuSer: 4.928 ± 0.583
5.475LeuThr: 5.475 ± 0.499
6.251LeuVal: 6.251 ± 0.53
0.821LeuTrp: 0.821 ± 0.209
2.418LeuTyr: 2.418 ± 0.267
0.0LeuXaa: 0.0 ± 0.0
Met
2.464MetAla: 2.464 ± 0.406
0.183MetCys: 0.183 ± 0.098
1.369MetAsp: 1.369 ± 0.268
2.099MetGlu: 2.099 ± 0.32
0.867MetPhe: 0.867 ± 0.227
1.551MetGly: 1.551 ± 0.263
0.365MetHis: 0.365 ± 0.123
2.144MetIle: 2.144 ± 0.35
2.327MetLys: 2.327 ± 0.346
2.646MetLeu: 2.646 ± 0.368
0.684MetMet: 0.684 ± 0.203
2.144MetAsn: 2.144 ± 0.301
1.004MetPro: 1.004 ± 0.275
1.551MetGln: 1.551 ± 0.271
1.186MetArg: 1.186 ± 0.173
2.555MetSer: 2.555 ± 0.344
1.825MetThr: 1.825 ± 0.297
1.643MetVal: 1.643 ± 0.292
0.319MetTrp: 0.319 ± 0.105
0.684MetTyr: 0.684 ± 0.169
0.0MetXaa: 0.0 ± 0.0
Asn
4.517AsnAla: 4.517 ± 0.615
0.274AsnCys: 0.274 ± 0.111
2.646AsnAsp: 2.646 ± 0.282
2.92AsnGlu: 2.92 ± 0.382
2.008AsnPhe: 2.008 ± 0.278
4.335AsnGly: 4.335 ± 0.656
1.414AsnHis: 1.414 ± 0.242
3.696AsnIle: 3.696 ± 0.403
3.833AsnLys: 3.833 ± 0.373
4.015AsnLeu: 4.015 ± 0.484
1.323AsnMet: 1.323 ± 0.182
3.376AsnAsn: 3.376 ± 0.404
3.376AsnPro: 3.376 ± 0.404
3.194AsnGln: 3.194 ± 0.408
3.422AsnArg: 3.422 ± 0.275
3.696AsnSer: 3.696 ± 0.474
3.239AsnThr: 3.239 ± 0.406
3.194AsnVal: 3.194 ± 0.392
0.821AsnTrp: 0.821 ± 0.191
1.871AsnTyr: 1.871 ± 0.385
0.0AsnXaa: 0.0 ± 0.0
Pro
3.239ProAla: 3.239 ± 0.383
0.319ProCys: 0.319 ± 0.134
2.327ProAsp: 2.327 ± 0.452
3.741ProGlu: 3.741 ± 0.446
1.871ProPhe: 1.871 ± 0.268
2.829ProGly: 2.829 ± 0.329
0.411ProHis: 0.411 ± 0.15
2.281ProIle: 2.281 ± 0.325
1.962ProLys: 1.962 ± 0.338
2.966ProLeu: 2.966 ± 0.338
1.369ProMet: 1.369 ± 0.291
2.053ProAsn: 2.053 ± 0.288
1.232ProPro: 1.232 ± 0.317
1.369ProGln: 1.369 ± 0.224
1.232ProArg: 1.232 ± 0.26
2.92ProSer: 2.92 ± 0.393
3.194ProThr: 3.194 ± 0.368
4.152ProVal: 4.152 ± 0.435
0.548ProTrp: 0.548 ± 0.184
1.46ProTyr: 1.46 ± 0.267
0.0ProXaa: 0.0 ± 0.0
Gln
4.517GlnAla: 4.517 ± 0.65
0.274GlnCys: 0.274 ± 0.111
2.008GlnAsp: 2.008 ± 0.321
2.601GlnGlu: 2.601 ± 0.319
1.46GlnPhe: 1.46 ± 0.226
2.327GlnGly: 2.327 ± 0.48
0.411GlnHis: 0.411 ± 0.14
2.236GlnIle: 2.236 ± 0.296
2.601GlnLys: 2.601 ± 0.395
3.468GlnLeu: 3.468 ± 0.452
1.278GlnMet: 1.278 ± 0.203
2.053GlnAsn: 2.053 ± 0.368
1.141GlnPro: 1.141 ± 0.203
1.643GlnGln: 1.643 ± 0.377
1.46GlnArg: 1.46 ± 0.23
2.874GlnSer: 2.874 ± 0.339
2.373GlnThr: 2.373 ± 0.267
3.468GlnVal: 3.468 ± 0.39
0.502GlnTrp: 0.502 ± 0.175
1.643GlnTyr: 1.643 ± 0.285
0.0GlnXaa: 0.0 ± 0.0
Arg
3.103ArgAla: 3.103 ± 0.592
0.502ArgCys: 0.502 ± 0.174
2.692ArgAsp: 2.692 ± 0.267
3.011ArgGlu: 3.011 ± 0.445
1.551ArgPhe: 1.551 ± 0.286
2.829ArgGly: 2.829 ± 0.368
0.73ArgHis: 0.73 ± 0.209
3.011ArgIle: 3.011 ± 0.367
3.513ArgLys: 3.513 ± 0.537
4.198ArgLeu: 4.198 ± 0.492
1.414ArgMet: 1.414 ± 0.272
3.057ArgAsn: 3.057 ± 0.475
1.551ArgPro: 1.551 ± 0.257
1.551ArgGln: 1.551 ± 0.295
2.601ArgArg: 2.601 ± 0.375
3.148ArgSer: 3.148 ± 0.452
2.509ArgThr: 2.509 ± 0.346
2.555ArgVal: 2.555 ± 0.347
0.548ArgTrp: 0.548 ± 0.148
1.734ArgTyr: 1.734 ± 0.251
0.0ArgXaa: 0.0 ± 0.0
Ser
4.517SerAla: 4.517 ± 0.519
0.821SerCys: 0.821 ± 0.199
3.741SerAsp: 3.741 ± 0.39
4.106SerGlu: 4.106 ± 0.394
1.779SerPhe: 1.779 ± 0.249
4.7SerGly: 4.7 ± 0.587
1.186SerHis: 1.186 ± 0.278
3.605SerIle: 3.605 ± 0.479
3.878SerLys: 3.878 ± 0.426
6.57SerLeu: 6.57 ± 0.61
1.414SerMet: 1.414 ± 0.244
3.011SerAsn: 3.011 ± 0.328
2.92SerPro: 2.92 ± 0.529
1.871SerGln: 1.871 ± 0.341
2.783SerArg: 2.783 ± 0.435
3.741SerSer: 3.741 ± 0.471
4.608SerThr: 4.608 ± 0.572
4.471SerVal: 4.471 ± 0.452
1.095SerTrp: 1.095 ± 0.241
2.281SerTyr: 2.281 ± 0.328
0.0SerXaa: 0.0 ± 0.0
Thr
4.243ThrAla: 4.243 ± 0.497
0.365ThrCys: 0.365 ± 0.131
3.833ThrAsp: 3.833 ± 0.319
3.787ThrGlu: 3.787 ± 0.501
2.92ThrPhe: 2.92 ± 0.262
4.654ThrGly: 4.654 ± 0.477
1.278ThrHis: 1.278 ± 0.263
4.289ThrIle: 4.289 ± 0.553
3.787ThrLys: 3.787 ± 0.412
5.658ThrLeu: 5.658 ± 0.512
1.688ThrMet: 1.688 ± 0.305
3.97ThrAsn: 3.97 ± 0.507
3.331ThrPro: 3.331 ± 0.385
2.418ThrGln: 2.418 ± 0.331
2.829ThrArg: 2.829 ± 0.382
4.015ThrSer: 4.015 ± 0.475
3.696ThrThr: 3.696 ± 0.64
5.156ThrVal: 5.156 ± 0.538
0.593ThrTrp: 0.593 ± 0.186
2.19ThrTyr: 2.19 ± 0.332
0.0ThrXaa: 0.0 ± 0.0
Val
5.977ValAla: 5.977 ± 0.601
0.821ValCys: 0.821 ± 0.199
4.654ValAsp: 4.654 ± 0.472
4.198ValGlu: 4.198 ± 0.437
2.19ValPhe: 2.19 ± 0.336
4.608ValGly: 4.608 ± 0.533
1.734ValHis: 1.734 ± 0.276
3.376ValIle: 3.376 ± 0.489
4.015ValLys: 4.015 ± 0.458
4.563ValLeu: 4.563 ± 0.502
2.281ValMet: 2.281 ± 0.339
3.787ValAsn: 3.787 ± 0.464
2.874ValPro: 2.874 ± 0.425
3.376ValGln: 3.376 ± 0.372
4.152ValArg: 4.152 ± 0.348
3.878ValSer: 3.878 ± 0.377
4.745ValThr: 4.745 ± 0.495
5.019ValVal: 5.019 ± 0.531
0.548ValTrp: 0.548 ± 0.162
2.327ValTyr: 2.327 ± 0.301
0.0ValXaa: 0.0 ± 0.0
Trp
0.867TrpAla: 0.867 ± 0.183
0.274TrpCys: 0.274 ± 0.128
1.186TrpAsp: 1.186 ± 0.262
0.548TrpGlu: 0.548 ± 0.167
0.502TrpPhe: 0.502 ± 0.144
0.639TrpGly: 0.639 ± 0.154
0.228TrpHis: 0.228 ± 0.113
0.548TrpIle: 0.548 ± 0.156
0.821TrpLys: 0.821 ± 0.223
1.141TrpLeu: 1.141 ± 0.235
0.274TrpMet: 0.274 ± 0.106
0.593TrpAsn: 0.593 ± 0.221
0.365TrpPro: 0.365 ± 0.135
0.365TrpGln: 0.365 ± 0.104
0.684TrpArg: 0.684 ± 0.155
0.593TrpSer: 0.593 ± 0.163
0.274TrpThr: 0.274 ± 0.111
1.141TrpVal: 1.141 ± 0.224
0.091TrpTrp: 0.091 ± 0.073
0.639TrpTyr: 0.639 ± 0.178
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.103TyrAla: 3.103 ± 0.341
0.319TyrCys: 0.319 ± 0.143
2.555TyrAsp: 2.555 ± 0.35
2.099TyrGlu: 2.099 ± 0.326
1.278TyrPhe: 1.278 ± 0.242
2.601TyrGly: 2.601 ± 0.41
0.821TyrHis: 0.821 ± 0.187
1.916TyrIle: 1.916 ± 0.305
2.646TyrLys: 2.646 ± 0.408
2.555TyrLeu: 2.555 ± 0.342
0.958TyrMet: 0.958 ± 0.242
2.236TyrAsn: 2.236 ± 0.278
1.506TyrPro: 1.506 ± 0.35
1.46TyrGln: 1.46 ± 0.259
1.597TyrArg: 1.597 ± 0.27
2.464TyrSer: 2.464 ± 0.414
1.643TyrThr: 1.643 ± 0.285
3.239TyrVal: 3.239 ± 0.359
0.456TyrTrp: 0.456 ± 0.13
1.186TyrTyr: 1.186 ± 0.284
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 103 proteins (21918 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski