Amino acid dipepetide frequency for Shigella phage SFN6B

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.647AlaAla: 15.647 ± 1.506
0.816AlaCys: 0.816 ± 0.251
6.081AlaAsp: 6.081 ± 0.643
5.488AlaGlu: 5.488 ± 0.59
2.892AlaPhe: 2.892 ± 0.558
8.157AlaGly: 8.157 ± 1.12
1.854AlaHis: 1.854 ± 0.448
4.968AlaIle: 4.968 ± 0.748
4.82AlaLys: 4.82 ± 0.694
9.27AlaLeu: 9.27 ± 0.838
3.337AlaMet: 3.337 ± 0.328
3.115AlaAsn: 3.115 ± 0.444
4.153AlaPro: 4.153 ± 1.047
4.301AlaGln: 4.301 ± 0.891
5.191AlaArg: 5.191 ± 0.642
6.155AlaSer: 6.155 ± 0.805
5.933AlaThr: 5.933 ± 0.8
6.6AlaVal: 6.6 ± 0.849
1.335AlaTrp: 1.335 ± 0.309
4.449AlaTyr: 4.449 ± 0.596
0.0AlaXaa: 0.0 ± 0.0
Cys
0.89CysAla: 0.89 ± 0.27
0.148CysCys: 0.148 ± 0.115
0.519CysAsp: 0.519 ± 0.219
0.445CysGlu: 0.445 ± 0.168
0.371CysPhe: 0.371 ± 0.184
0.816CysGly: 0.816 ± 0.23
0.445CysHis: 0.445 ± 0.204
0.519CysIle: 0.519 ± 0.226
0.222CysLys: 0.222 ± 0.142
0.89CysLeu: 0.89 ± 0.272
0.593CysMet: 0.593 ± 0.195
0.667CysAsn: 0.667 ± 0.239
0.519CysPro: 0.519 ± 0.273
0.371CysGln: 0.371 ± 0.141
1.187CysArg: 1.187 ± 0.325
0.89CysSer: 0.89 ± 0.271
0.964CysThr: 0.964 ± 0.233
0.667CysVal: 0.667 ± 0.213
0.519CysTrp: 0.519 ± 0.235
0.742CysTyr: 0.742 ± 0.184
0.0CysXaa: 0.0 ± 0.0
Asp
7.49AspAla: 7.49 ± 0.898
1.187AspCys: 1.187 ± 0.32
3.189AspAsp: 3.189 ± 0.548
3.411AspGlu: 3.411 ± 0.382
2.151AspPhe: 2.151 ± 0.35
4.746AspGly: 4.746 ± 0.592
0.667AspHis: 0.667 ± 0.229
3.337AspIle: 3.337 ± 0.541
2.67AspLys: 2.67 ± 0.452
5.71AspLeu: 5.71 ± 0.51
2.299AspMet: 2.299 ± 0.433
2.892AspAsn: 2.892 ± 0.422
2.299AspPro: 2.299 ± 0.373
1.483AspGln: 1.483 ± 0.244
2.521AspArg: 2.521 ± 0.532
4.968AspSer: 4.968 ± 0.584
4.82AspThr: 4.82 ± 0.683
3.56AspVal: 3.56 ± 0.57
0.964AspTrp: 0.964 ± 0.199
2.595AspTyr: 2.595 ± 0.454
0.0AspXaa: 0.0 ± 0.0
Glu
5.413GluAla: 5.413 ± 0.91
0.519GluCys: 0.519 ± 0.209
2.892GluAsp: 2.892 ± 0.391
4.153GluGlu: 4.153 ± 0.831
2.151GluPhe: 2.151 ± 0.449
4.079GluGly: 4.079 ± 0.398
2.076GluHis: 2.076 ± 0.327
2.447GluIle: 2.447 ± 0.361
2.151GluLys: 2.151 ± 0.428
5.562GluLeu: 5.562 ± 0.628
1.854GluMet: 1.854 ± 0.374
1.706GluAsn: 1.706 ± 0.356
1.854GluPro: 1.854 ± 0.463
3.263GluGln: 3.263 ± 0.612
3.115GluArg: 3.115 ± 0.5
2.892GluSer: 2.892 ± 0.473
3.337GluThr: 3.337 ± 0.547
4.598GluVal: 4.598 ± 0.55
1.038GluTrp: 1.038 ± 0.21
2.744GluTyr: 2.744 ± 0.426
0.0GluXaa: 0.0 ± 0.0
Phe
2.373PheAla: 2.373 ± 0.364
0.297PheCys: 0.297 ± 0.261
2.299PheAsp: 2.299 ± 0.411
2.002PheGlu: 2.002 ± 0.378
1.261PhePhe: 1.261 ± 0.283
2.076PheGly: 2.076 ± 0.384
0.593PheHis: 0.593 ± 0.272
1.483PheIle: 1.483 ± 0.358
2.299PheLys: 2.299 ± 0.487
2.299PheLeu: 2.299 ± 0.473
0.519PheMet: 0.519 ± 0.218
1.557PheAsn: 1.557 ± 0.32
1.483PhePro: 1.483 ± 0.228
1.187PheGln: 1.187 ± 0.31
1.261PheArg: 1.261 ± 0.405
1.483PheSer: 1.483 ± 0.389
1.631PheThr: 1.631 ± 0.306
2.076PheVal: 2.076 ± 0.377
0.371PheTrp: 0.371 ± 0.171
1.112PheTyr: 1.112 ± 0.234
0.0PheXaa: 0.0 ± 0.0
Gly
6.303GlyAla: 6.303 ± 0.657
1.409GlyCys: 1.409 ± 0.337
4.449GlyAsp: 4.449 ± 0.765
4.079GlyGlu: 4.079 ± 0.568
3.115GlyPhe: 3.115 ± 0.538
4.894GlyGly: 4.894 ± 0.706
1.112GlyHis: 1.112 ± 0.324
4.894GlyIle: 4.894 ± 0.68
4.524GlyLys: 4.524 ± 0.68
6.452GlyLeu: 6.452 ± 0.602
1.706GlyMet: 1.706 ± 0.362
3.634GlyAsn: 3.634 ± 0.546
1.706GlyPro: 1.706 ± 0.314
3.189GlyGln: 3.189 ± 0.503
4.153GlyArg: 4.153 ± 0.497
4.82GlySer: 4.82 ± 0.494
4.449GlyThr: 4.449 ± 0.637
5.71GlyVal: 5.71 ± 0.646
0.816GlyTrp: 0.816 ± 0.229
3.634GlyTyr: 3.634 ± 0.592
0.0GlyXaa: 0.0 ± 0.0
His
1.854HisAla: 1.854 ± 0.405
0.371HisCys: 0.371 ± 0.138
1.187HisAsp: 1.187 ± 0.301
1.112HisGlu: 1.112 ± 0.335
0.297HisPhe: 0.297 ± 0.12
1.928HisGly: 1.928 ± 0.404
0.222HisHis: 0.222 ± 0.108
0.964HisIle: 0.964 ± 0.303
1.112HisLys: 1.112 ± 0.298
2.595HisLeu: 2.595 ± 0.511
0.371HisMet: 0.371 ± 0.141
1.038HisAsn: 1.038 ± 0.251
0.667HisPro: 0.667 ± 0.266
0.742HisGln: 0.742 ± 0.248
1.112HisArg: 1.112 ± 0.376
0.742HisSer: 0.742 ± 0.242
0.667HisThr: 0.667 ± 0.175
0.742HisVal: 0.742 ± 0.227
0.371HisTrp: 0.371 ± 0.213
0.964HisTyr: 0.964 ± 0.326
0.0HisXaa: 0.0 ± 0.0
Ile
3.782IleAla: 3.782 ± 0.579
0.445IleCys: 0.445 ± 0.174
2.67IleAsp: 2.67 ± 0.397
2.892IleGlu: 2.892 ± 0.456
0.519IlePhe: 0.519 ± 0.189
3.115IleGly: 3.115 ± 0.441
0.816IleHis: 0.816 ± 0.238
2.151IleIle: 2.151 ± 0.359
3.04IleLys: 3.04 ± 0.566
4.301IleLeu: 4.301 ± 0.61
1.187IleMet: 1.187 ± 0.232
2.002IleAsn: 2.002 ± 0.381
2.225IlePro: 2.225 ± 0.407
3.04IleGln: 3.04 ± 0.505
2.67IleArg: 2.67 ± 0.455
3.782IleSer: 3.782 ± 0.447
3.263IleThr: 3.263 ± 0.558
2.373IleVal: 2.373 ± 0.392
0.148IleTrp: 0.148 ± 0.086
1.78IleTyr: 1.78 ± 0.392
0.0IleXaa: 0.0 ± 0.0
Lys
6.377LysAla: 6.377 ± 1.005
0.593LysCys: 0.593 ± 0.22
2.744LysAsp: 2.744 ± 0.415
3.411LysGlu: 3.411 ± 0.459
1.038LysPhe: 1.038 ± 0.346
3.115LysGly: 3.115 ± 0.459
1.038LysHis: 1.038 ± 0.297
1.631LysIle: 1.631 ± 0.271
1.928LysLys: 1.928 ± 0.483
4.079LysLeu: 4.079 ± 0.577
0.964LysMet: 0.964 ± 0.275
1.335LysAsn: 1.335 ± 0.26
1.78LysPro: 1.78 ± 0.427
3.263LysGln: 3.263 ± 0.595
2.966LysArg: 2.966 ± 0.468
3.337LysSer: 3.337 ± 0.546
2.966LysThr: 2.966 ± 0.378
3.856LysVal: 3.856 ± 0.602
0.89LysTrp: 0.89 ± 0.248
2.002LysTyr: 2.002 ± 0.569
0.0LysXaa: 0.0 ± 0.0
Leu
7.638LeuAla: 7.638 ± 0.728
1.409LeuCys: 1.409 ± 0.419
7.712LeuAsp: 7.712 ± 0.643
5.488LeuGlu: 5.488 ± 0.654
2.225LeuPhe: 2.225 ± 0.369
6.377LeuGly: 6.377 ± 0.638
1.78LeuHis: 1.78 ± 0.364
3.411LeuIle: 3.411 ± 0.651
3.856LeuLys: 3.856 ± 0.653
6.526LeuLeu: 6.526 ± 0.647
2.002LeuMet: 2.002 ± 0.304
3.708LeuAsn: 3.708 ± 0.456
3.263LeuPro: 3.263 ± 0.462
4.301LeuGln: 4.301 ± 0.465
6.897LeuArg: 6.897 ± 0.693
5.043LeuSer: 5.043 ± 0.555
4.672LeuThr: 4.672 ± 0.546
5.636LeuVal: 5.636 ± 0.685
1.187LeuTrp: 1.187 ± 0.292
3.56LeuTyr: 3.56 ± 0.54
0.0LeuXaa: 0.0 ± 0.0
Met
2.818MetAla: 2.818 ± 0.439
0.222MetCys: 0.222 ± 0.106
2.225MetAsp: 2.225 ± 0.488
0.89MetGlu: 0.89 ± 0.261
0.964MetPhe: 0.964 ± 0.277
1.483MetGly: 1.483 ± 0.272
0.667MetHis: 0.667 ± 0.263
1.112MetIle: 1.112 ± 0.321
0.89MetLys: 0.89 ± 0.353
3.337MetLeu: 3.337 ± 0.447
0.593MetMet: 0.593 ± 0.259
0.964MetAsn: 0.964 ± 0.29
0.964MetPro: 0.964 ± 0.221
1.928MetGln: 1.928 ± 0.366
1.78MetArg: 1.78 ± 0.387
2.225MetSer: 2.225 ± 0.477
1.261MetThr: 1.261 ± 0.387
1.631MetVal: 1.631 ± 0.317
0.593MetTrp: 0.593 ± 0.216
1.187MetTyr: 1.187 ± 0.301
0.0MetXaa: 0.0 ± 0.0
Asn
3.856AsnAla: 3.856 ± 0.501
0.148AsnCys: 0.148 ± 0.098
1.78AsnAsp: 1.78 ± 0.387
1.78AsnGlu: 1.78 ± 0.414
1.409AsnPhe: 1.409 ± 0.264
3.708AsnGly: 3.708 ± 0.49
0.371AsnHis: 0.371 ± 0.152
2.521AsnIle: 2.521 ± 0.409
2.299AsnLys: 2.299 ± 0.4
4.153AsnLeu: 4.153 ± 0.602
0.89AsnMet: 0.89 ± 0.276
1.631AsnAsn: 1.631 ± 0.307
2.299AsnPro: 2.299 ± 0.366
1.409AsnGln: 1.409 ± 0.367
2.744AsnArg: 2.744 ± 0.45
3.411AsnSer: 3.411 ± 0.603
2.744AsnThr: 2.744 ± 0.451
3.04AsnVal: 3.04 ± 0.43
0.593AsnTrp: 0.593 ± 0.213
1.409AsnTyr: 1.409 ± 0.325
0.0AsnXaa: 0.0 ± 0.0
Pro
4.746ProAla: 4.746 ± 1.004
0.297ProCys: 0.297 ± 0.155
2.892ProAsp: 2.892 ± 0.496
3.485ProGlu: 3.485 ± 0.395
0.816ProPhe: 0.816 ± 0.235
2.521ProGly: 2.521 ± 0.485
0.445ProHis: 0.445 ± 0.182
1.409ProIle: 1.409 ± 0.399
1.928ProLys: 1.928 ± 0.456
2.818ProLeu: 2.818 ± 0.517
1.038ProMet: 1.038 ± 0.277
1.78ProAsn: 1.78 ± 0.427
0.519ProPro: 0.519 ± 0.245
1.112ProGln: 1.112 ± 0.248
1.706ProArg: 1.706 ± 0.322
2.225ProSer: 2.225 ± 0.458
2.744ProThr: 2.744 ± 0.468
3.189ProVal: 3.189 ± 0.567
0.593ProTrp: 0.593 ± 0.206
1.261ProTyr: 1.261 ± 0.321
0.0ProXaa: 0.0 ± 0.0
Gln
5.043GlnAla: 5.043 ± 0.78
0.519GlnCys: 0.519 ± 0.225
3.04GlnAsp: 3.04 ± 0.446
3.189GlnGlu: 3.189 ± 0.614
1.261GlnPhe: 1.261 ± 0.298
3.115GlnGly: 3.115 ± 0.612
1.112GlnHis: 1.112 ± 0.369
1.557GlnIle: 1.557 ± 0.397
1.928GlnLys: 1.928 ± 0.419
4.598GlnLeu: 4.598 ± 0.666
1.483GlnMet: 1.483 ± 0.288
2.076GlnAsn: 2.076 ± 0.34
1.409GlnPro: 1.409 ± 0.358
3.337GlnGln: 3.337 ± 0.679
2.818GlnArg: 2.818 ± 0.443
3.04GlnSer: 3.04 ± 0.38
1.557GlnThr: 1.557 ± 0.46
2.373GlnVal: 2.373 ± 0.488
0.667GlnTrp: 0.667 ± 0.193
2.151GlnTyr: 2.151 ± 0.557
0.0GlnXaa: 0.0 ± 0.0
Arg
6.971ArgAla: 6.971 ± 0.978
0.519ArgCys: 0.519 ± 0.22
2.595ArgAsp: 2.595 ± 0.322
3.56ArgGlu: 3.56 ± 0.485
2.225ArgPhe: 2.225 ± 0.346
4.301ArgGly: 4.301 ± 0.637
0.89ArgHis: 0.89 ± 0.216
3.189ArgIle: 3.189 ± 0.669
3.485ArgLys: 3.485 ± 0.58
4.449ArgLeu: 4.449 ± 0.565
1.854ArgMet: 1.854 ± 0.355
2.966ArgAsn: 2.966 ± 0.44
1.557ArgPro: 1.557 ± 0.362
2.595ArgGln: 2.595 ± 0.395
4.153ArgArg: 4.153 ± 0.758
2.966ArgSer: 2.966 ± 0.646
3.189ArgThr: 3.189 ± 0.471
3.856ArgVal: 3.856 ± 0.391
0.89ArgTrp: 0.89 ± 0.244
2.076ArgTyr: 2.076 ± 0.339
0.0ArgXaa: 0.0 ± 0.0
Ser
7.416SerAla: 7.416 ± 1.012
1.038SerCys: 1.038 ± 0.341
4.004SerAsp: 4.004 ± 0.561
2.818SerGlu: 2.818 ± 0.419
1.854SerPhe: 1.854 ± 0.428
6.377SerGly: 6.377 ± 0.778
0.89SerHis: 0.89 ± 0.254
2.892SerIle: 2.892 ± 0.574
3.856SerLys: 3.856 ± 0.631
5.043SerLeu: 5.043 ± 0.542
2.595SerMet: 2.595 ± 0.436
2.151SerAsn: 2.151 ± 0.373
2.966SerPro: 2.966 ± 0.425
2.002SerGln: 2.002 ± 0.488
3.56SerArg: 3.56 ± 0.671
3.634SerSer: 3.634 ± 0.547
3.93SerThr: 3.93 ± 0.519
4.004SerVal: 4.004 ± 0.564
1.409SerTrp: 1.409 ± 0.288
1.483SerTyr: 1.483 ± 0.356
0.0SerXaa: 0.0 ± 0.0
Thr
5.71ThrAla: 5.71 ± 0.823
0.593ThrCys: 0.593 ± 0.242
3.856ThrAsp: 3.856 ± 0.523
2.892ThrGlu: 2.892 ± 0.593
1.631ThrPhe: 1.631 ± 0.338
5.265ThrGly: 5.265 ± 0.682
0.964ThrHis: 0.964 ± 0.32
2.076ThrIle: 2.076 ± 0.357
2.595ThrLys: 2.595 ± 0.489
4.672ThrLeu: 4.672 ± 0.707
1.557ThrMet: 1.557 ± 0.373
2.447ThrAsn: 2.447 ± 0.449
2.818ThrPro: 2.818 ± 0.372
2.595ThrGln: 2.595 ± 0.531
2.892ThrArg: 2.892 ± 0.524
4.301ThrSer: 4.301 ± 0.638
3.189ThrThr: 3.189 ± 0.557
4.301ThrVal: 4.301 ± 0.536
0.816ThrTrp: 0.816 ± 0.227
2.225ThrTyr: 2.225 ± 0.47
0.0ThrXaa: 0.0 ± 0.0
Val
6.377ValAla: 6.377 ± 0.767
0.519ValCys: 0.519 ± 0.2
5.71ValAsp: 5.71 ± 0.777
3.634ValGlu: 3.634 ± 0.473
1.409ValPhe: 1.409 ± 0.269
5.413ValGly: 5.413 ± 0.602
1.854ValHis: 1.854 ± 0.377
2.373ValIle: 2.373 ± 0.488
3.04ValLys: 3.04 ± 0.59
5.339ValLeu: 5.339 ± 0.647
1.78ValMet: 1.78 ± 0.296
3.337ValAsn: 3.337 ± 0.609
3.263ValPro: 3.263 ± 0.593
3.411ValGln: 3.411 ± 0.716
3.856ValArg: 3.856 ± 0.473
4.079ValSer: 4.079 ± 0.691
2.67ValThr: 2.67 ± 0.49
6.303ValVal: 6.303 ± 0.537
0.89ValTrp: 0.89 ± 0.304
2.447ValTyr: 2.447 ± 0.342
0.0ValXaa: 0.0 ± 0.0
Trp
0.964TrpAla: 0.964 ± 0.215
0.667TrpCys: 0.667 ± 0.201
0.816TrpAsp: 0.816 ± 0.187
1.187TrpGlu: 1.187 ± 0.373
0.816TrpPhe: 0.816 ± 0.341
0.593TrpGly: 0.593 ± 0.195
0.445TrpHis: 0.445 ± 0.161
0.816TrpIle: 0.816 ± 0.225
0.667TrpLys: 0.667 ± 0.265
1.112TrpLeu: 1.112 ± 0.248
0.222TrpMet: 0.222 ± 0.145
0.742TrpAsn: 0.742 ± 0.261
0.445TrpPro: 0.445 ± 0.189
0.593TrpGln: 0.593 ± 0.242
0.89TrpArg: 0.89 ± 0.204
0.816TrpSer: 0.816 ± 0.258
0.964TrpThr: 0.964 ± 0.216
1.483TrpVal: 1.483 ± 0.386
0.371TrpTrp: 0.371 ± 0.175
0.667TrpTyr: 0.667 ± 0.24
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.892TyrAla: 2.892 ± 0.442
0.667TyrCys: 0.667 ± 0.262
2.299TyrAsp: 2.299 ± 0.465
1.928TyrGlu: 1.928 ± 0.445
1.261TyrPhe: 1.261 ± 0.322
2.966TyrGly: 2.966 ± 0.549
0.816TyrHis: 0.816 ± 0.247
2.447TyrIle: 2.447 ± 0.516
2.002TyrLys: 2.002 ± 0.347
3.485TyrLeu: 3.485 ± 0.405
0.816TyrMet: 0.816 ± 0.271
2.447TyrAsn: 2.447 ± 0.474
1.335TyrPro: 1.335 ± 0.243
2.151TyrGln: 2.151 ± 0.435
2.818TyrArg: 2.818 ± 0.549
3.115TyrSer: 3.115 ± 0.504
2.447TyrThr: 2.447 ± 0.472
1.78TyrVal: 1.78 ± 0.44
0.816TyrTrp: 0.816 ± 0.295
1.483TyrTyr: 1.483 ± 0.427
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (13486 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski