Amino acid dipepetide frequency for Streptococcus phage IPP32

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.959AlaAla: 2.959 ± 0.913
0.379AlaCys: 0.379 ± 0.159
5.765AlaAsp: 5.765 ± 0.507
6.827AlaGlu: 6.827 ± 0.657
2.503AlaPhe: 2.503 ± 0.637
5.083AlaGly: 5.083 ± 0.93
0.683AlaHis: 0.683 ± 0.251
4.4AlaIle: 4.4 ± 0.861
6.145AlaLys: 6.145 ± 0.734
6.903AlaLeu: 6.903 ± 0.797
2.276AlaMet: 2.276 ± 0.39
4.096AlaAsn: 4.096 ± 0.675
1.745AlaPro: 1.745 ± 0.373
2.655AlaGln: 2.655 ± 0.432
2.731AlaArg: 2.731 ± 0.427
2.807AlaSer: 2.807 ± 1.003
4.628AlaThr: 4.628 ± 0.607
5.462AlaVal: 5.462 ± 0.797
1.29AlaTrp: 1.29 ± 0.363
1.365AlaTyr: 1.365 ± 0.278
0.0AlaXaa: 0.0 ± 0.0
Cys
0.228CysAla: 0.228 ± 0.134
0.152CysCys: 0.152 ± 0.14
0.683CysAsp: 0.683 ± 0.239
0.531CysGlu: 0.531 ± 0.222
0.455CysPhe: 0.455 ± 0.217
0.152CysGly: 0.152 ± 0.138
0.076CysHis: 0.076 ± 0.063
0.531CysIle: 0.531 ± 0.291
0.607CysLys: 0.607 ± 0.179
0.607CysLeu: 0.607 ± 0.204
0.076CysMet: 0.076 ± 0.07
0.152CysAsn: 0.152 ± 0.17
0.303CysPro: 0.303 ± 0.181
0.303CysGln: 0.303 ± 0.159
0.379CysArg: 0.379 ± 0.18
0.379CysSer: 0.379 ± 0.217
0.152CysThr: 0.152 ± 0.104
0.228CysVal: 0.228 ± 0.15
0.303CysTrp: 0.303 ± 0.123
0.303CysTyr: 0.303 ± 0.131
0.0CysXaa: 0.0 ± 0.0
Asp
3.641AspAla: 3.641 ± 0.618
0.683AspCys: 0.683 ± 0.256
3.186AspAsp: 3.186 ± 0.679
4.324AspGlu: 4.324 ± 1.047
3.338AspPhe: 3.338 ± 0.566
4.703AspGly: 4.703 ± 0.599
0.759AspHis: 0.759 ± 0.29
5.69AspIle: 5.69 ± 0.478
4.931AspLys: 4.931 ± 0.647
5.083AspLeu: 5.083 ± 0.757
1.593AspMet: 1.593 ± 0.359
2.428AspAsn: 2.428 ± 0.461
1.745AspPro: 1.745 ± 0.434
1.821AspGln: 1.821 ± 0.439
2.655AspArg: 2.655 ± 0.477
3.49AspSer: 3.49 ± 0.561
3.717AspThr: 3.717 ± 0.5
3.565AspVal: 3.565 ± 0.445
1.593AspTrp: 1.593 ± 0.336
2.959AspTyr: 2.959 ± 0.392
0.0AspXaa: 0.0 ± 0.0
Glu
6.069GluAla: 6.069 ± 0.96
0.303GluCys: 0.303 ± 0.152
4.552GluAsp: 4.552 ± 0.759
5.841GluGlu: 5.841 ± 1.041
3.414GluPhe: 3.414 ± 0.621
3.717GluGly: 3.717 ± 0.551
0.834GluHis: 0.834 ± 0.258
5.841GluIle: 5.841 ± 0.677
7.207GluLys: 7.207 ± 1.231
8.724GluLeu: 8.724 ± 0.893
2.2GluMet: 2.2 ± 0.517
4.324GluAsn: 4.324 ± 0.499
1.441GluPro: 1.441 ± 0.359
2.959GluGln: 2.959 ± 0.484
4.476GluArg: 4.476 ± 0.616
5.234GluSer: 5.234 ± 0.669
4.021GluThr: 4.021 ± 0.585
4.628GluVal: 4.628 ± 0.508
0.986GluTrp: 0.986 ± 0.235
3.262GluTyr: 3.262 ± 0.53
0.0GluXaa: 0.0 ± 0.0
Phe
2.2PheAla: 2.2 ± 0.661
0.303PheCys: 0.303 ± 0.176
4.4PheAsp: 4.4 ± 0.571
3.793PheGlu: 3.793 ± 0.465
1.441PhePhe: 1.441 ± 0.369
2.048PheGly: 2.048 ± 0.521
0.228PheHis: 0.228 ± 0.122
2.048PheIle: 2.048 ± 0.364
3.49PheLys: 3.49 ± 0.522
2.276PheLeu: 2.276 ± 0.333
1.062PheMet: 1.062 ± 0.319
2.731PheAsn: 2.731 ± 0.648
0.303PhePro: 0.303 ± 0.131
0.834PheGln: 0.834 ± 0.246
1.517PheArg: 1.517 ± 0.323
3.034PheSer: 3.034 ± 0.55
2.428PheThr: 2.428 ± 0.46
1.745PheVal: 1.745 ± 0.382
0.607PheTrp: 0.607 ± 0.209
2.048PheTyr: 2.048 ± 0.386
0.0PheXaa: 0.0 ± 0.0
Gly
3.186GlyAla: 3.186 ± 0.553
0.152GlyCys: 0.152 ± 0.113
3.717GlyAsp: 3.717 ± 0.57
4.628GlyGlu: 4.628 ± 0.768
2.428GlyPhe: 2.428 ± 0.535
4.248GlyGly: 4.248 ± 1.06
0.986GlyHis: 0.986 ± 0.287
3.565GlyIle: 3.565 ± 0.555
4.931GlyLys: 4.931 ± 0.545
5.841GlyLeu: 5.841 ± 0.848
1.669GlyMet: 1.669 ± 0.29
3.793GlyAsn: 3.793 ± 0.549
1.138GlyPro: 1.138 ± 0.317
3.414GlyGln: 3.414 ± 0.43
3.793GlyArg: 3.793 ± 0.714
3.641GlySer: 3.641 ± 0.758
2.503GlyThr: 2.503 ± 0.444
4.172GlyVal: 4.172 ± 0.672
1.062GlyTrp: 1.062 ± 0.435
2.731GlyTyr: 2.731 ± 0.42
0.0GlyXaa: 0.0 ± 0.0
His
0.91HisAla: 0.91 ± 0.276
0.076HisCys: 0.076 ± 0.095
0.531HisAsp: 0.531 ± 0.257
1.365HisGlu: 1.365 ± 0.31
0.531HisPhe: 0.531 ± 0.202
0.986HisGly: 0.986 ± 0.249
0.303HisHis: 0.303 ± 0.18
0.91HisIle: 0.91 ± 0.329
1.062HisLys: 1.062 ± 0.284
1.214HisLeu: 1.214 ± 0.357
0.076HisMet: 0.076 ± 0.078
1.138HisAsn: 1.138 ± 0.293
0.759HisPro: 0.759 ± 0.232
0.379HisGln: 0.379 ± 0.161
0.986HisArg: 0.986 ± 0.342
1.214HisSer: 1.214 ± 0.397
0.683HisThr: 0.683 ± 0.233
0.759HisVal: 0.759 ± 0.228
0.152HisTrp: 0.152 ± 0.112
0.379HisTyr: 0.379 ± 0.188
0.0HisXaa: 0.0 ± 0.0
Ile
5.841IleAla: 5.841 ± 0.694
0.986IleCys: 0.986 ± 0.207
3.49IleAsp: 3.49 ± 0.614
5.917IleGlu: 5.917 ± 0.673
2.352IlePhe: 2.352 ± 0.533
3.869IleGly: 3.869 ± 0.696
0.379IleHis: 0.379 ± 0.191
2.883IleIle: 2.883 ± 0.343
6.524IleLys: 6.524 ± 0.652
4.248IleLeu: 4.248 ± 0.78
1.062IleMet: 1.062 ± 0.252
3.186IleAsn: 3.186 ± 0.48
1.593IlePro: 1.593 ± 0.343
2.352IleGln: 2.352 ± 0.326
2.807IleArg: 2.807 ± 0.639
4.855IleSer: 4.855 ± 0.688
4.628IleThr: 4.628 ± 0.485
2.883IleVal: 2.883 ± 0.483
0.759IleTrp: 0.759 ± 0.266
2.124IleTyr: 2.124 ± 0.539
0.0IleXaa: 0.0 ± 0.0
Lys
5.917LysAla: 5.917 ± 0.694
0.228LysCys: 0.228 ± 0.16
5.765LysAsp: 5.765 ± 0.708
7.89LysGlu: 7.89 ± 1.074
2.807LysPhe: 2.807 ± 0.521
4.703LysGly: 4.703 ± 0.612
1.897LysHis: 1.897 ± 0.382
5.993LysIle: 5.993 ± 0.768
8.041LysLys: 8.041 ± 1.224
6.903LysLeu: 6.903 ± 0.641
3.11LysMet: 3.11 ± 0.455
4.4LysAsn: 4.4 ± 0.514
2.883LysPro: 2.883 ± 0.598
3.717LysGln: 3.717 ± 0.606
3.641LysArg: 3.641 ± 0.463
4.552LysSer: 4.552 ± 0.573
6.448LysThr: 6.448 ± 0.558
5.614LysVal: 5.614 ± 0.749
1.138LysTrp: 1.138 ± 0.307
3.186LysTyr: 3.186 ± 0.409
0.0LysXaa: 0.0 ± 0.0
Leu
7.207LeuAla: 7.207 ± 0.826
0.759LeuCys: 0.759 ± 0.345
5.841LeuAsp: 5.841 ± 0.664
6.903LeuGlu: 6.903 ± 0.854
3.338LeuPhe: 3.338 ± 0.437
5.993LeuGly: 5.993 ± 1.087
1.214LeuHis: 1.214 ± 0.316
3.49LeuIle: 3.49 ± 0.575
7.434LeuLys: 7.434 ± 0.767
6.676LeuLeu: 6.676 ± 0.952
1.972LeuMet: 1.972 ± 0.385
3.565LeuAsn: 3.565 ± 0.78
2.807LeuPro: 2.807 ± 0.59
3.11LeuGln: 3.11 ± 0.653
3.49LeuArg: 3.49 ± 0.522
5.993LeuSer: 5.993 ± 0.865
5.159LeuThr: 5.159 ± 0.983
4.552LeuVal: 4.552 ± 0.68
0.607LeuTrp: 0.607 ± 0.196
2.352LeuTyr: 2.352 ± 0.324
0.0LeuXaa: 0.0 ± 0.0
Met
1.821MetAla: 1.821 ± 0.454
0.0MetCys: 0.0 ± 0.0
1.29MetAsp: 1.29 ± 0.249
2.124MetGlu: 2.124 ± 0.478
0.91MetPhe: 0.91 ± 0.214
0.986MetGly: 0.986 ± 0.391
0.228MetHis: 0.228 ± 0.145
1.745MetIle: 1.745 ± 0.509
2.352MetLys: 2.352 ± 0.546
1.897MetLeu: 1.897 ± 0.375
0.303MetMet: 0.303 ± 0.155
1.593MetAsn: 1.593 ± 0.484
1.062MetPro: 1.062 ± 0.282
0.683MetGln: 0.683 ± 0.273
1.29MetArg: 1.29 ± 0.321
1.365MetSer: 1.365 ± 0.332
2.048MetThr: 2.048 ± 0.397
1.593MetVal: 1.593 ± 0.346
0.152MetTrp: 0.152 ± 0.108
0.986MetTyr: 0.986 ± 0.263
0.0MetXaa: 0.0 ± 0.0
Asn
4.552AsnAla: 4.552 ± 0.725
0.303AsnCys: 0.303 ± 0.135
2.731AsnAsp: 2.731 ± 0.465
2.655AsnGlu: 2.655 ± 0.495
2.048AsnPhe: 2.048 ± 0.442
4.021AsnGly: 4.021 ± 0.552
1.062AsnHis: 1.062 ± 0.302
2.883AsnIle: 2.883 ± 0.41
4.628AsnLys: 4.628 ± 0.564
5.083AsnLeu: 5.083 ± 0.703
1.138AsnMet: 1.138 ± 0.328
2.655AsnAsn: 2.655 ± 0.489
1.897AsnPro: 1.897 ± 0.37
2.959AsnGln: 2.959 ± 0.715
2.428AsnArg: 2.428 ± 0.484
3.717AsnSer: 3.717 ± 0.601
3.262AsnThr: 3.262 ± 0.542
3.641AsnVal: 3.641 ± 0.451
0.986AsnTrp: 0.986 ± 0.24
1.745AsnTyr: 1.745 ± 0.459
0.0AsnXaa: 0.0 ± 0.0
Pro
2.124ProAla: 2.124 ± 0.401
0.228ProCys: 0.228 ± 0.184
1.593ProAsp: 1.593 ± 0.338
2.959ProGlu: 2.959 ± 0.338
0.607ProPhe: 0.607 ± 0.238
1.062ProGly: 1.062 ± 0.246
0.379ProHis: 0.379 ± 0.159
1.517ProIle: 1.517 ± 0.463
3.338ProLys: 3.338 ± 0.508
1.29ProLeu: 1.29 ± 0.344
0.455ProMet: 0.455 ± 0.182
1.745ProAsn: 1.745 ± 0.46
0.455ProPro: 0.455 ± 0.189
0.91ProGln: 0.91 ± 0.267
1.062ProArg: 1.062 ± 0.268
1.972ProSer: 1.972 ± 0.612
0.759ProThr: 0.759 ± 0.216
2.048ProVal: 2.048 ± 0.389
0.379ProTrp: 0.379 ± 0.223
1.669ProTyr: 1.669 ± 0.445
0.0ProXaa: 0.0 ± 0.0
Gln
3.565GlnAla: 3.565 ± 0.535
0.228GlnCys: 0.228 ± 0.142
1.593GlnAsp: 1.593 ± 0.323
3.565GlnGlu: 3.565 ± 0.781
1.441GlnPhe: 1.441 ± 0.347
1.897GlnGly: 1.897 ± 0.294
0.455GlnHis: 0.455 ± 0.242
2.959GlnIle: 2.959 ± 0.511
3.793GlnLys: 3.793 ± 0.448
2.959GlnLeu: 2.959 ± 0.46
0.834GlnMet: 0.834 ± 0.206
1.745GlnAsn: 1.745 ± 0.338
0.986GlnPro: 0.986 ± 0.348
1.517GlnGln: 1.517 ± 0.369
1.669GlnArg: 1.669 ± 0.416
2.2GlnSer: 2.2 ± 0.311
2.807GlnThr: 2.807 ± 0.527
3.869GlnVal: 3.869 ± 0.509
0.531GlnTrp: 0.531 ± 0.159
0.91GlnTyr: 0.91 ± 0.31
0.0GlnXaa: 0.0 ± 0.0
Arg
2.807ArgAla: 2.807 ± 0.552
0.379ArgCys: 0.379 ± 0.14
2.124ArgAsp: 2.124 ± 0.427
3.034ArgGlu: 3.034 ± 0.427
1.517ArgPhe: 1.517 ± 0.394
1.669ArgGly: 1.669 ± 0.426
0.607ArgHis: 0.607 ± 0.222
2.959ArgIle: 2.959 ± 0.477
3.717ArgLys: 3.717 ± 0.705
5.538ArgLeu: 5.538 ± 0.755
2.276ArgMet: 2.276 ± 0.515
2.655ArgAsn: 2.655 ± 0.486
0.91ArgPro: 0.91 ± 0.205
2.352ArgGln: 2.352 ± 0.548
2.731ArgArg: 2.731 ± 0.708
2.428ArgSer: 2.428 ± 0.418
2.883ArgThr: 2.883 ± 0.751
2.655ArgVal: 2.655 ± 0.479
0.455ArgTrp: 0.455 ± 0.174
2.124ArgTyr: 2.124 ± 0.475
0.0ArgXaa: 0.0 ± 0.0
Ser
4.476SerAla: 4.476 ± 0.874
0.303SerCys: 0.303 ± 0.142
3.641SerAsp: 3.641 ± 0.52
4.552SerGlu: 4.552 ± 0.641
2.124SerPhe: 2.124 ± 0.365
4.931SerGly: 4.931 ± 0.7
1.365SerHis: 1.365 ± 0.351
4.021SerIle: 4.021 ± 0.62
5.083SerLys: 5.083 ± 0.737
5.083SerLeu: 5.083 ± 0.613
1.214SerMet: 1.214 ± 0.389
3.641SerAsn: 3.641 ± 0.545
1.365SerPro: 1.365 ± 0.266
2.276SerGln: 2.276 ± 0.476
3.262SerArg: 3.262 ± 0.707
3.565SerSer: 3.565 ± 0.538
4.4SerThr: 4.4 ± 0.824
3.11SerVal: 3.11 ± 0.82
0.91SerTrp: 0.91 ± 0.352
2.883SerTyr: 2.883 ± 0.437
0.0SerXaa: 0.0 ± 0.0
Thr
4.628ThrAla: 4.628 ± 0.89
0.152ThrCys: 0.152 ± 0.126
4.172ThrAsp: 4.172 ± 0.486
4.779ThrGlu: 4.779 ± 0.523
2.731ThrPhe: 2.731 ± 0.604
4.096ThrGly: 4.096 ± 0.702
1.214ThrHis: 1.214 ± 0.418
4.476ThrIle: 4.476 ± 0.599
5.007ThrLys: 5.007 ± 0.786
4.021ThrLeu: 4.021 ± 0.653
0.834ThrMet: 0.834 ± 0.272
3.869ThrAsn: 3.869 ± 0.51
1.517ThrPro: 1.517 ± 0.455
2.655ThrGln: 2.655 ± 0.673
1.821ThrArg: 1.821 ± 0.342
4.248ThrSer: 4.248 ± 0.566
4.931ThrThr: 4.931 ± 0.845
4.552ThrVal: 4.552 ± 0.698
0.683ThrTrp: 0.683 ± 0.274
3.186ThrTyr: 3.186 ± 0.531
0.0ThrXaa: 0.0 ± 0.0
Val
5.614ValAla: 5.614 ± 0.702
0.379ValCys: 0.379 ± 0.224
3.945ValAsp: 3.945 ± 0.55
5.386ValGlu: 5.386 ± 0.51
1.669ValPhe: 1.669 ± 0.364
4.628ValGly: 4.628 ± 0.618
0.759ValHis: 0.759 ± 0.258
3.945ValIle: 3.945 ± 0.546
5.31ValLys: 5.31 ± 0.742
4.172ValLeu: 4.172 ± 0.73
0.986ValMet: 0.986 ± 0.34
3.945ValAsn: 3.945 ± 0.678
1.669ValPro: 1.669 ± 0.271
1.517ValGln: 1.517 ± 0.359
2.579ValArg: 2.579 ± 0.343
4.552ValSer: 4.552 ± 0.617
4.703ValThr: 4.703 ± 0.854
4.324ValVal: 4.324 ± 0.815
0.759ValTrp: 0.759 ± 0.269
2.276ValTyr: 2.276 ± 0.512
0.0ValXaa: 0.0 ± 0.0
Trp
1.138TrpAla: 1.138 ± 0.294
0.152TrpCys: 0.152 ± 0.099
0.834TrpAsp: 0.834 ± 0.358
0.986TrpGlu: 0.986 ± 0.336
1.214TrpPhe: 1.214 ± 0.475
0.683TrpGly: 0.683 ± 0.206
0.076TrpHis: 0.076 ± 0.073
0.607TrpIle: 0.607 ± 0.274
1.214TrpLys: 1.214 ± 0.362
0.683TrpLeu: 0.683 ± 0.291
0.607TrpMet: 0.607 ± 0.233
0.834TrpAsn: 0.834 ± 0.264
0.076TrpPro: 0.076 ± 0.083
0.91TrpGln: 0.91 ± 0.303
0.303TrpArg: 0.303 ± 0.144
0.531TrpSer: 0.531 ± 0.182
0.91TrpThr: 0.91 ± 0.279
1.214TrpVal: 1.214 ± 0.277
0.152TrpTrp: 0.152 ± 0.093
0.683TrpTyr: 0.683 ± 0.454
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.821TyrAla: 1.821 ± 0.327
0.455TyrCys: 0.455 ± 0.167
2.048TyrAsp: 2.048 ± 0.367
1.972TyrGlu: 1.972 ± 0.388
1.669TyrPhe: 1.669 ± 0.302
2.124TyrGly: 2.124 ± 0.419
0.91TyrHis: 0.91 ± 0.264
2.276TyrIle: 2.276 ± 0.395
4.021TyrLys: 4.021 ± 0.636
3.262TyrLeu: 3.262 ± 0.567
0.607TyrMet: 0.607 ± 0.247
1.897TyrAsn: 1.897 ± 0.39
1.897TyrPro: 1.897 ± 0.415
2.2TyrGln: 2.2 ± 0.407
2.276TyrArg: 2.276 ± 0.51
2.428TyrSer: 2.428 ± 0.508
2.503TyrThr: 2.503 ± 0.505
2.503TyrVal: 2.503 ± 0.484
0.303TyrTrp: 0.303 ± 0.163
1.593TyrTyr: 1.593 ± 0.443
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (13183 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski