Amino acid dipeptide frequency for Streptococcus phage SM1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
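The definitions above can be illustrated with a minimal Python sketch. It is not the code used to build this table: the function names, the use of one-letter codes (with X for Xaa), and the choice to count pairs within each protein and normalise by the total number of counted pairs are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)
UNIFORM_PER_MILLE = 1000.0 / 400    # 0.25% = 2.5 per mille, as in the text


def dipeptide_per_mille(proteins):
    """Return the frequency of all 441 dipeptides in per mille.

    Assumption: adjacent pairs are counted within each protein and the
    counts are divided by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        residues = [r if r in STANDARD else "X" for r in seq.upper()]
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def representation(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Classify a dipeptide relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"       # would be marked red
    if freq_per_mille < expected:
        return "under"      # would be marked blue
    return "as expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["MKKLAV", "MKB"])  # toy sequences, not real data
    print(round(freqs["MK"], 3), representation(freqs["MK"]))
```

To turn a per mille value into a fraction, multiply by 10⁻³; for example, 2.5‰ corresponds to 0.0025.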

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.152 ± 1.321 | 0.944 ± 0.309 | 3.963 ± 0.612 | 5.473 ± 0.557 | 2.642 ± 0.469 | 3.397 ± 0.733 | 1.227 ± 0.337 | 5.284 ± 0.62 | 4.812 ± 0.606 | 5.945 ± 0.722 | 2.265 ± 0.417 | 3.586 ± 0.497 | 1.038 ± 0.366 | 2.359 ± 0.508 | 2.831 ± 0.529 | 3.397 ± 0.873 | 4.435 ± 0.493 | 4.246 ± 0.799 | 1.698 ± 0.623 | 2.359 ± 0.477 | 0.0 ± 0.0
Cys | 0.472 ± 0.221 | 0.094 ± 0.087 | 0.755 ± 0.24 | 0.566 ± 0.258 | 0.283 ± 0.16 | 1.321 ± 0.407 | 0.283 ± 0.166 | 0.472 ± 0.209 | 0.377 ± 0.224 | 0.944 ± 0.316 | 0.0 ± 0.0 | 0.566 ± 0.233 | 0.283 ± 0.156 | 0.189 ± 0.136 | 0.377 ± 0.218 | 0.472 ± 0.221 | 0.189 ± 0.132 | 0.283 ± 0.142 | 0.094 ± 0.105 | 0.566 ± 0.242 | 0.0 ± 0.0
Asp | 3.491 ± 0.642 | 0.661 ± 0.245 | 3.774 ± 0.689 | 4.435 ± 0.681 | 3.208 ± 0.588 | 4.624 ± 0.757 | 1.227 ± 0.328 | 5.001 ± 0.563 | 5.001 ± 0.725 | 4.812 ± 0.689 | 2.17 ± 0.368 | 3.68 ± 0.725 | 1.321 ± 0.473 | 1.51 ± 0.288 | 3.397 ± 0.537 | 3.68 ± 0.664 | 1.793 ± 0.379 | 3.869 ± 0.643 | 1.132 ± 0.332 | 3.019 ± 0.632 | 0.0 ± 0.0
Glu | 6.416 ± 0.783 | 0.189 ± 0.111 | 4.435 ± 0.672 | 5.756 ± 0.846 | 2.642 ± 0.528 | 3.586 ± 0.774 | 0.944 ± 0.238 | 6.511 ± 0.816 | 6.699 ± 0.818 | 8.209 ± 1.394 | 2.17 ± 0.452 | 4.435 ± 0.474 | 1.604 ± 0.512 | 3.208 ± 0.514 | 3.303 ± 0.572 | 3.869 ± 0.474 | 3.68 ± 0.488 | 3.68 ± 0.679 | 0.944 ± 0.337 | 1.982 ± 0.418 | 0.0 ± 0.0
Phe | 2.548 ± 0.522 | 0.377 ± 0.229 | 3.397 ± 0.471 | 4.435 ± 0.482 | 1.698 ± 0.4 | 2.265 ± 0.526 | 0.377 ± 0.174 | 2.831 ± 0.601 | 2.359 ± 0.485 | 2.453 ± 0.5 | 1.227 ± 0.3 | 2.359 ± 0.39 | 1.038 ± 0.318 | 1.132 ± 0.435 | 2.359 ± 0.491 | 3.114 ± 0.685 | 2.076 ± 0.4 | 3.114 ± 0.61 | 0.566 ± 0.223 | 1.793 ± 0.422 | 0.0 ± 0.0
Gly | 3.963 ± 0.703 | 0.377 ± 0.189 | 3.68 ± 0.523 | 4.435 ± 0.822 | 3.208 ± 0.57 | 4.529 ± 0.648 | 1.132 ± 0.316 | 6.039 ± 1.409 | 5.756 ± 0.697 | 6.228 ± 0.938 | 1.982 ± 0.399 | 2.548 ± 0.508 | 0.944 ± 0.284 | 2.831 ± 0.625 | 2.736 ± 0.679 | 3.68 ± 0.84 | 2.453 ± 0.45 | 3.774 ± 0.643 | 0.944 ± 0.375 | 4.152 ± 0.647 | 0.0 ± 0.0
His | 1.038 ± 0.267 | 0.849 ± 0.281 | 1.227 ± 0.354 | 1.038 ± 0.305 | 1.132 ± 0.279 | 1.415 ± 0.365 | 0.377 ± 0.18 | 0.849 ± 0.292 | 0.849 ± 0.289 | 1.415 ± 0.364 | 0.377 ± 0.242 | 0.849 ± 0.248 | 0.849 ± 0.33 | 0.377 ± 0.194 | 0.849 ± 0.279 | 1.227 ± 0.37 | 0.944 ± 0.256 | 0.849 ± 0.318 | 0.189 ± 0.139 | 1.227 ± 0.498 | 0.0 ± 0.0
Ile | 3.963 ± 0.677 | 1.038 ± 0.347 | 5.19 ± 0.676 | 6.228 ± 0.735 | 3.774 ± 0.512 | 4.435 ± 0.723 | 1.227 ± 0.278 | 5.095 ± 0.792 | 7.36 ± 0.985 | 6.228 ± 0.741 | 1.698 ± 0.289 | 3.397 ± 0.564 | 2.265 ± 0.399 | 3.303 ± 0.486 | 3.114 ± 0.457 | 5.095 ± 0.576 | 4.34 ± 0.645 | 3.68 ± 0.574 | 1.227 ± 0.555 | 2.265 ± 0.398 | 0.0 ± 0.0
Lys | 6.511 ± 0.615 | 0.377 ± 0.188 | 4.529 ± 0.529 | 6.133 ± 0.841 | 2.359 ± 0.408 | 4.718 ± 0.602 | 1.793 ± 0.492 | 5.85 ± 0.881 | 8.209 ± 1.56 | 5.567 ± 0.592 | 1.982 ± 0.46 | 4.624 ± 0.699 | 2.831 ± 0.571 | 4.057 ± 0.64 | 4.34 ± 0.821 | 5.567 ± 0.637 | 5.85 ± 0.651 | 4.718 ± 0.805 | 1.887 ± 0.394 | 3.963 ± 0.526 | 0.0 ± 0.0
Leu | 6.133 ± 0.633 | 0.472 ± 0.238 | 6.039 ± 0.705 | 5.85 ± 0.554 | 2.736 ± 0.5 | 6.133 ± 1.051 | 1.51 ± 0.371 | 5.001 ± 0.633 | 9.436 ± 0.97 | 6.888 ± 0.84 | 1.227 ± 0.361 | 4.529 ± 0.481 | 3.019 ± 0.61 | 2.548 ± 0.472 | 3.208 ± 0.593 | 5.945 ± 0.723 | 4.246 ± 0.681 | 4.624 ± 0.568 | 1.038 ± 0.448 | 2.548 ± 0.581 | 0.0 ± 0.0
Met | 1.793 ± 0.376 | 0.189 ± 0.138 | 1.415 ± 0.28 | 2.548 ± 0.433 | 0.283 ± 0.147 | 1.132 ± 0.354 | 0.283 ± 0.149 | 1.698 ± 0.424 | 2.076 ± 0.533 | 1.51 ± 0.35 | 0.661 ± 0.218 | 1.793 ± 0.342 | 0.661 ± 0.3 | 1.132 ± 0.299 | 1.132 ± 0.372 | 1.793 ± 0.354 | 2.265 ± 0.511 | 1.604 ± 0.342 | 0.189 ± 0.112 | 0.661 ± 0.228 | 0.0 ± 0.0
Asn | 4.34 ± 0.588 | 0.377 ± 0.224 | 2.642 ± 0.413 | 3.397 ± 0.593 | 1.982 ± 0.479 | 5.378 ± 0.818 | 1.227 ± 0.369 | 4.812 ± 0.786 | 3.586 ± 0.623 | 4.435 ± 0.737 | 0.944 ± 0.379 | 2.453 ± 0.576 | 2.642 ± 0.494 | 2.831 ± 0.487 | 2.265 ± 0.675 | 2.548 ± 0.388 | 3.019 ± 0.64 | 2.736 ± 0.443 | 0.849 ± 0.265 | 1.887 ± 0.41 | 0.0 ± 0.0
Pro | 1.038 ± 0.268 | 0.094 ± 0.096 | 1.887 ± 0.48 | 2.17 ± 0.452 | 1.415 ± 0.384 | 1.698 ± 0.468 | 0.661 ± 0.323 | 2.925 ± 0.447 | 2.736 ± 0.614 | 2.265 ± 0.494 | 0.755 ± 0.227 | 1.604 ± 0.473 | 0.377 ± 0.199 | 0.849 ± 0.251 | 0.472 ± 0.222 | 2.548 ± 0.412 | 2.076 ± 0.544 | 1.415 ± 0.414 | 0.189 ± 0.121 | 1.132 ± 0.348 | 0.0 ± 0.0
Gln | 3.019 ± 0.423 | 0.377 ± 0.189 | 2.453 ± 0.541 | 3.114 ± 0.514 | 1.793 ± 0.659 | 1.887 ± 0.394 | 0.849 ± 0.224 | 2.076 ± 0.419 | 4.529 ± 0.647 | 3.397 ± 0.435 | 1.038 ± 0.297 | 2.736 ± 0.452 | 1.227 ± 0.23 | 1.604 ± 0.349 | 1.982 ± 0.342 | 2.736 ± 0.53 | 1.415 ± 0.333 | 2.548 ± 0.556 | 0.094 ± 0.077 | 0.661 ± 0.245 | 0.0 ± 0.0
Arg | 2.17 ± 0.497 | 0.472 ± 0.206 | 1.793 ± 0.381 | 2.359 ± 0.524 | 1.698 ± 0.48 | 1.698 ± 0.418 | 0.661 ± 0.238 | 3.303 ± 0.572 | 4.057 ± 0.579 | 5.001 ± 0.861 | 0.944 ± 0.315 | 3.397 ± 0.65 | 1.51 ± 0.483 | 1.793 ± 0.546 | 1.982 ± 0.547 | 1.698 ± 0.41 | 3.303 ± 0.563 | 3.208 ± 0.559 | 0.566 ± 0.209 | 2.17 ± 0.486 | 0.0 ± 0.0
Ser | 4.246 ± 1.101 | 0.377 ± 0.154 | 4.34 ± 0.528 | 4.624 ± 0.585 | 3.019 ± 0.358 | 3.963 ± 0.781 | 1.227 ± 0.4 | 5.378 ± 0.614 | 4.812 ± 0.676 | 5.095 ± 0.948 | 1.415 ± 0.335 | 2.831 ± 0.467 | 1.887 ± 0.392 | 2.736 ± 0.443 | 2.17 ± 0.454 | 3.963 ± 0.65 | 3.963 ± 0.55 | 2.831 ± 0.656 | 1.038 ± 0.338 | 2.831 ± 0.689 | 0.0 ± 0.0
Thr | 4.718 ± 0.99 | 0.472 ± 0.202 | 3.869 ± 0.735 | 3.491 ± 0.599 | 2.736 ± 0.596 | 5.095 ± 0.787 | 0.849 ± 0.236 | 4.246 ± 0.724 | 4.34 ± 0.419 | 4.907 ± 0.912 | 1.321 ± 0.375 | 3.491 ± 0.475 | 0.944 ± 0.273 | 2.076 ± 0.391 | 1.698 ± 0.43 | 2.453 ± 0.459 | 3.869 ± 0.789 | 4.057 ± 0.525 | 0.189 ± 0.113 | 2.076 ± 0.488 | 0.0 ± 0.0
Val | 3.303 ± 0.658 | 0.283 ± 0.165 | 3.774 ± 0.478 | 3.491 ± 0.667 | 2.076 ± 0.363 | 3.869 ± 0.619 | 0.944 ± 0.371 | 4.057 ± 0.555 | 5.001 ± 0.691 | 3.774 ± 0.673 | 1.132 ± 0.341 | 2.265 ± 0.472 | 2.642 ± 0.567 | 1.887 ± 0.434 | 2.831 ± 0.499 | 4.812 ± 0.548 | 4.812 ± 0.855 | 3.303 ± 0.613 | 0.755 ± 0.263 | 1.887 ± 0.421 | 0.0 ± 0.0
Trp | 0.566 ± 0.232 | 0.283 ± 0.164 | 0.377 ± 0.169 | 1.793 ± 0.798 | 0.849 ± 0.292 | 1.415 ± 0.408 | 0.189 ± 0.133 | 1.132 ± 0.341 | 1.132 ± 0.302 | 0.755 ± 0.233 | 0.566 ± 0.209 | 0.944 ± 0.406 | 0.189 ± 0.125 | 0.944 ± 0.214 | 0.377 ± 0.192 | 1.132 ± 0.312 | 0.283 ± 0.132 | 0.849 ± 0.274 | 0.0 ± 0.0 | 0.472 ± 0.297 | 0.0 ± 0.0
Tyr | 2.265 ± 0.39 | 0.283 ± 0.177 | 2.265 ± 0.38 | 2.925 ± 0.538 | 1.982 ± 0.364 | 2.831 ± 0.572 | 0.944 ± 0.319 | 2.359 ± 0.475 | 2.736 ± 0.499 | 3.303 ± 0.495 | 0.849 ± 0.251 | 2.265 ± 0.423 | 1.038 ± 0.358 | 2.17 ± 0.412 | 2.359 ± 0.524 | 3.019 ± 0.481 | 1.887 ± 0.429 | 1.51 ± 0.362 | 0.661 ± 0.3 | 1.321 ± 0.652 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 56 proteins (10599 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
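A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration under assumptions, not the procedure used for the table: the page only states that bootstrapping was done 100 times at the protein level, so resampling whole proteins with replacement and reporting the standard deviation of the resampled frequencies are assumed here, and the helper names are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: each resample draws len(proteins) whole proteins with
    replacement, and the error is the sample standard deviation of the
    frequencies over all resamples. Requires a non-empty protein list.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLAV", "MKKAAG", "AAKKLL"]  # toy sequences, not real data
    print(pair_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps each dipeptide inside its original sequence context, which is presumably why the error is described as estimated at the protein level.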

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski