Amino acid dipepetide frequency for Streptococcus phage CHPC926

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.95AlaAla: 3.95 ± 0.736
0.451AlaCys: 0.451 ± 0.225
4.176AlaAsp: 4.176 ± 0.781
4.289AlaGlu: 4.289 ± 0.854
2.596AlaPhe: 2.596 ± 0.61
3.612AlaGly: 3.612 ± 0.918
0.677AlaHis: 0.677 ± 0.283
5.192AlaIle: 5.192 ± 1.14
4.853AlaLys: 4.853 ± 0.683
6.208AlaLeu: 6.208 ± 0.971
1.806AlaMet: 1.806 ± 0.412
5.079AlaAsn: 5.079 ± 0.765
2.257AlaPro: 2.257 ± 0.427
2.37AlaGln: 2.37 ± 0.597
2.483AlaArg: 2.483 ± 0.681
4.063AlaSer: 4.063 ± 0.696
3.612AlaThr: 3.612 ± 0.724
3.725AlaVal: 3.725 ± 0.822
1.242AlaTrp: 1.242 ± 0.358
2.257AlaTyr: 2.257 ± 0.592
0.0AlaXaa: 0.0 ± 0.0
Cys
0.113CysAla: 0.113 ± 0.131
0.0CysCys: 0.0 ± 0.0
1.016CysAsp: 1.016 ± 0.42
0.677CysGlu: 0.677 ± 0.291
0.113CysPhe: 0.113 ± 0.144
0.564CysGly: 0.564 ± 0.198
0.339CysHis: 0.339 ± 0.221
0.226CysIle: 0.226 ± 0.126
0.564CysLys: 0.564 ± 0.229
0.339CysLeu: 0.339 ± 0.169
0.0CysMet: 0.0 ± 0.0
0.339CysAsn: 0.339 ± 0.224
0.451CysPro: 0.451 ± 0.262
0.113CysGln: 0.113 ± 0.082
0.339CysArg: 0.339 ± 0.166
1.354CysSer: 1.354 ± 0.493
0.113CysThr: 0.113 ± 0.082
0.451CysVal: 0.451 ± 0.218
0.113CysTrp: 0.113 ± 0.125
0.339CysTyr: 0.339 ± 0.212
0.0CysXaa: 0.0 ± 0.0
Asp
3.612AspAla: 3.612 ± 0.576
0.451AspCys: 0.451 ± 0.212
5.305AspAsp: 5.305 ± 0.829
4.515AspGlu: 4.515 ± 1.081
2.709AspPhe: 2.709 ± 0.512
4.628AspGly: 4.628 ± 0.866
0.226AspHis: 0.226 ± 0.139
5.079AspIle: 5.079 ± 0.741
4.74AspLys: 4.74 ± 0.804
5.305AspLeu: 5.305 ± 0.849
1.58AspMet: 1.58 ± 0.38
3.95AspAsn: 3.95 ± 0.734
1.354AspPro: 1.354 ± 0.46
1.016AspGln: 1.016 ± 0.381
1.919AspArg: 1.919 ± 0.465
2.822AspSer: 2.822 ± 0.501
3.837AspThr: 3.837 ± 0.512
4.402AspVal: 4.402 ± 0.634
1.016AspTrp: 1.016 ± 0.366
2.37AspTyr: 2.37 ± 0.485
0.0AspXaa: 0.0 ± 0.0
Glu
4.402GluAla: 4.402 ± 0.597
0.451GluCys: 0.451 ± 0.253
3.273GluAsp: 3.273 ± 0.593
6.095GluGlu: 6.095 ± 1.082
2.709GluPhe: 2.709 ± 0.422
3.273GluGly: 3.273 ± 0.722
1.693GluHis: 1.693 ± 0.481
4.853GluIle: 4.853 ± 0.996
6.998GluLys: 6.998 ± 1.42
7.788GluLeu: 7.788 ± 0.855
2.483GluMet: 2.483 ± 0.673
3.16GluAsn: 3.16 ± 0.74
1.919GluPro: 1.919 ± 0.563
3.837GluGln: 3.837 ± 0.871
3.273GluArg: 3.273 ± 0.484
3.499GluSer: 3.499 ± 0.627
4.402GluThr: 4.402 ± 0.787
5.079GluVal: 5.079 ± 0.824
1.354GluTrp: 1.354 ± 0.354
4.063GluTyr: 4.063 ± 0.797
0.0GluXaa: 0.0 ± 0.0
Phe
3.386PheAla: 3.386 ± 0.634
0.451PheCys: 0.451 ± 0.218
4.628PheAsp: 4.628 ± 0.44
3.612PheGlu: 3.612 ± 0.642
1.467PhePhe: 1.467 ± 0.448
2.596PheGly: 2.596 ± 0.581
0.226PheHis: 0.226 ± 0.128
2.37PheIle: 2.37 ± 0.516
4.966PheLys: 4.966 ± 0.726
2.37PheLeu: 2.37 ± 0.463
1.354PheMet: 1.354 ± 0.346
3.047PheAsn: 3.047 ± 0.72
0.451PhePro: 0.451 ± 0.302
1.919PheGln: 1.919 ± 0.575
1.129PheArg: 1.129 ± 0.389
3.386PheSer: 3.386 ± 0.516
3.273PheThr: 3.273 ± 0.8
2.144PheVal: 2.144 ± 0.623
0.451PheTrp: 0.451 ± 0.237
1.129PheTyr: 1.129 ± 0.382
0.0PheXaa: 0.0 ± 0.0
Gly
3.837GlyAla: 3.837 ± 1.004
0.564GlyCys: 0.564 ± 0.241
3.499GlyAsp: 3.499 ± 0.588
2.144GlyGlu: 2.144 ± 0.493
2.935GlyPhe: 2.935 ± 0.826
3.95GlyGly: 3.95 ± 0.964
0.564GlyHis: 0.564 ± 0.212
7.223GlyIle: 7.223 ± 1.318
6.998GlyLys: 6.998 ± 0.998
4.966GlyLeu: 4.966 ± 0.973
1.467GlyMet: 1.467 ± 0.485
3.16GlyAsn: 3.16 ± 0.687
1.129GlyPro: 1.129 ± 0.394
2.935GlyGln: 2.935 ± 0.58
2.032GlyArg: 2.032 ± 0.602
2.709GlySer: 2.709 ± 0.684
3.95GlyThr: 3.95 ± 0.716
3.837GlyVal: 3.837 ± 0.877
0.339GlyTrp: 0.339 ± 0.19
3.725GlyTyr: 3.725 ± 0.716
0.0GlyXaa: 0.0 ± 0.0
His
0.451HisAla: 0.451 ± 0.182
0.113HisCys: 0.113 ± 0.106
0.677HisAsp: 0.677 ± 0.25
0.564HisGlu: 0.564 ± 0.284
0.113HisPhe: 0.113 ± 0.106
1.242HisGly: 1.242 ± 0.321
0.113HisHis: 0.113 ± 0.114
0.677HisIle: 0.677 ± 0.296
1.129HisLys: 1.129 ± 0.394
1.016HisLeu: 1.016 ± 0.314
0.0HisMet: 0.0 ± 0.0
0.677HisAsn: 0.677 ± 0.257
0.451HisPro: 0.451 ± 0.21
0.451HisGln: 0.451 ± 0.205
0.79HisArg: 0.79 ± 0.271
0.79HisSer: 0.79 ± 0.341
0.564HisThr: 0.564 ± 0.309
1.242HisVal: 1.242 ± 0.337
0.113HisTrp: 0.113 ± 0.112
0.564HisTyr: 0.564 ± 0.244
0.0HisXaa: 0.0 ± 0.0
Ile
4.063IleAla: 4.063 ± 0.836
0.451IleCys: 0.451 ± 0.228
3.95IleAsp: 3.95 ± 0.702
6.208IleGlu: 6.208 ± 1.025
2.483IlePhe: 2.483 ± 0.603
4.289IleGly: 4.289 ± 0.667
0.564IleHis: 0.564 ± 0.269
5.756IleIle: 5.756 ± 1.055
6.208IleLys: 6.208 ± 0.806
4.063IleLeu: 4.063 ± 0.503
1.354IleMet: 1.354 ± 0.341
5.643IleAsn: 5.643 ± 0.746
1.693IlePro: 1.693 ± 0.491
4.289IleGln: 4.289 ± 0.668
1.919IleArg: 1.919 ± 0.437
4.515IleSer: 4.515 ± 0.763
4.853IleThr: 4.853 ± 0.81
4.063IleVal: 4.063 ± 0.616
0.677IleTrp: 0.677 ± 0.28
1.467IleTyr: 1.467 ± 0.392
0.0IleXaa: 0.0 ± 0.0
Lys
7.223LysAla: 7.223 ± 1.213
0.564LysCys: 0.564 ± 0.301
4.515LysAsp: 4.515 ± 0.772
7.901LysGlu: 7.901 ± 1.538
2.935LysPhe: 2.935 ± 0.74
4.628LysGly: 4.628 ± 0.754
1.242LysHis: 1.242 ± 0.406
5.756LysIle: 5.756 ± 0.735
7.901LysLys: 7.901 ± 1.631
7.449LysLeu: 7.449 ± 1.149
1.919LysMet: 1.919 ± 0.418
5.756LysAsn: 5.756 ± 0.973
3.16LysPro: 3.16 ± 0.608
4.289LysGln: 4.289 ± 0.627
3.95LysArg: 3.95 ± 0.914
5.079LysSer: 5.079 ± 0.758
6.208LysThr: 6.208 ± 1.026
4.063LysVal: 4.063 ± 0.604
0.564LysTrp: 0.564 ± 0.315
4.515LysTyr: 4.515 ± 0.833
0.0LysXaa: 0.0 ± 0.0
Leu
4.966LeuAla: 4.966 ± 0.738
0.677LeuCys: 0.677 ± 0.307
5.079LeuAsp: 5.079 ± 0.567
6.546LeuGlu: 6.546 ± 1.292
3.386LeuPhe: 3.386 ± 0.46
4.628LeuGly: 4.628 ± 0.701
0.677LeuHis: 0.677 ± 0.25
4.176LeuIle: 4.176 ± 0.771
7.675LeuLys: 7.675 ± 1.084
5.643LeuLeu: 5.643 ± 0.832
2.032LeuMet: 2.032 ± 0.379
4.966LeuAsn: 4.966 ± 0.851
2.822LeuPro: 2.822 ± 0.438
3.273LeuGln: 3.273 ± 0.638
3.047LeuArg: 3.047 ± 0.67
6.433LeuSer: 6.433 ± 0.623
5.643LeuThr: 5.643 ± 0.996
3.837LeuVal: 3.837 ± 0.638
0.564LeuTrp: 0.564 ± 0.262
2.596LeuTyr: 2.596 ± 0.428
0.0LeuXaa: 0.0 ± 0.0
Met
1.467MetAla: 1.467 ± 0.376
0.113MetCys: 0.113 ± 0.115
1.242MetAsp: 1.242 ± 0.412
1.58MetGlu: 1.58 ± 0.488
0.677MetPhe: 0.677 ± 0.252
1.129MetGly: 1.129 ± 0.446
0.339MetHis: 0.339 ± 0.222
1.016MetIle: 1.016 ± 0.298
2.483MetLys: 2.483 ± 0.55
1.693MetLeu: 1.693 ± 0.389
0.451MetMet: 0.451 ± 0.252
1.806MetAsn: 1.806 ± 0.512
0.677MetPro: 0.677 ± 0.313
1.242MetGln: 1.242 ± 0.384
1.242MetArg: 1.242 ± 0.418
2.144MetSer: 2.144 ± 0.493
3.16MetThr: 3.16 ± 0.746
1.242MetVal: 1.242 ± 0.368
0.113MetTrp: 0.113 ± 0.118
0.903MetTyr: 0.903 ± 0.4
0.0MetXaa: 0.0 ± 0.0
Asn
4.515AsnAla: 4.515 ± 0.678
0.564AsnCys: 0.564 ± 0.236
2.822AsnAsp: 2.822 ± 0.393
2.935AsnGlu: 2.935 ± 0.739
3.273AsnPhe: 3.273 ± 0.598
6.772AsnGly: 6.772 ± 1.216
0.677AsnHis: 0.677 ± 0.231
4.853AsnIle: 4.853 ± 0.647
5.53AsnLys: 5.53 ± 0.546
5.418AsnLeu: 5.418 ± 0.755
1.693AsnMet: 1.693 ± 0.514
3.499AsnAsn: 3.499 ± 0.795
1.693AsnPro: 1.693 ± 0.358
2.935AsnGln: 2.935 ± 0.604
2.596AsnArg: 2.596 ± 0.382
4.063AsnSer: 4.063 ± 0.786
2.596AsnThr: 2.596 ± 0.514
4.628AsnVal: 4.628 ± 0.766
1.016AsnTrp: 1.016 ± 0.307
2.822AsnTyr: 2.822 ± 0.46
0.0AsnXaa: 0.0 ± 0.0
Pro
0.903ProAla: 0.903 ± 0.338
0.0ProCys: 0.0 ± 0.0
2.257ProAsp: 2.257 ± 0.522
1.58ProGlu: 1.58 ± 0.398
1.467ProPhe: 1.467 ± 0.359
0.451ProGly: 0.451 ± 0.242
0.451ProHis: 0.451 ± 0.237
1.354ProIle: 1.354 ± 0.384
3.499ProLys: 3.499 ± 0.612
1.806ProLeu: 1.806 ± 0.375
0.564ProMet: 0.564 ± 0.274
1.693ProAsn: 1.693 ± 0.389
0.677ProPro: 0.677 ± 0.226
1.129ProGln: 1.129 ± 0.455
1.129ProArg: 1.129 ± 0.421
1.58ProSer: 1.58 ± 0.41
2.144ProThr: 2.144 ± 0.456
1.016ProVal: 1.016 ± 0.363
0.451ProTrp: 0.451 ± 0.218
1.467ProTyr: 1.467 ± 0.418
0.0ProXaa: 0.0 ± 0.0
Gln
4.966GlnAla: 4.966 ± 0.782
0.451GlnCys: 0.451 ± 0.266
1.806GlnAsp: 1.806 ± 0.461
4.515GlnGlu: 4.515 ± 0.697
1.806GlnPhe: 1.806 ± 0.628
2.822GlnGly: 2.822 ± 0.665
0.339GlnHis: 0.339 ± 0.195
2.144GlnIle: 2.144 ± 0.424
2.709GlnLys: 2.709 ± 0.666
3.047GlnLeu: 3.047 ± 0.507
1.919GlnMet: 1.919 ± 0.478
1.693GlnAsn: 1.693 ± 0.385
1.016GlnPro: 1.016 ± 0.382
2.144GlnGln: 2.144 ± 0.626
1.467GlnArg: 1.467 ± 0.484
2.257GlnSer: 2.257 ± 0.494
3.047GlnThr: 3.047 ± 0.424
2.822GlnVal: 2.822 ± 0.62
0.79GlnTrp: 0.79 ± 0.342
1.58GlnTyr: 1.58 ± 0.268
0.0GlnXaa: 0.0 ± 0.0
Arg
2.257ArgAla: 2.257 ± 0.491
0.226ArgCys: 0.226 ± 0.147
2.144ArgAsp: 2.144 ± 0.459
3.499ArgGlu: 3.499 ± 0.753
2.483ArgPhe: 2.483 ± 0.502
1.693ArgGly: 1.693 ± 0.458
0.339ArgHis: 0.339 ± 0.176
2.257ArgIle: 2.257 ± 0.57
3.499ArgLys: 3.499 ± 0.558
4.515ArgLeu: 4.515 ± 0.933
1.129ArgMet: 1.129 ± 0.441
3.047ArgAsn: 3.047 ± 0.579
0.451ArgPro: 0.451 ± 0.343
1.016ArgGln: 1.016 ± 0.314
1.354ArgArg: 1.354 ± 0.508
2.935ArgSer: 2.935 ± 0.483
1.919ArgThr: 1.919 ± 0.498
2.032ArgVal: 2.032 ± 0.419
0.339ArgTrp: 0.339 ± 0.149
1.016ArgTyr: 1.016 ± 0.455
0.0ArgXaa: 0.0 ± 0.0
Ser
3.612SerAla: 3.612 ± 0.577
0.339SerCys: 0.339 ± 0.188
5.305SerAsp: 5.305 ± 0.655
4.966SerGlu: 4.966 ± 0.904
3.273SerPhe: 3.273 ± 0.586
3.725SerGly: 3.725 ± 0.874
0.79SerHis: 0.79 ± 0.344
5.756SerIle: 5.756 ± 0.808
5.079SerLys: 5.079 ± 0.798
4.853SerLeu: 4.853 ± 0.659
0.903SerMet: 0.903 ± 0.358
4.853SerAsn: 4.853 ± 0.775
0.903SerPro: 0.903 ± 0.348
3.273SerGln: 3.273 ± 0.575
1.919SerArg: 1.919 ± 0.444
3.16SerSer: 3.16 ± 0.596
3.95SerThr: 3.95 ± 0.663
3.95SerVal: 3.95 ± 0.661
0.903SerTrp: 0.903 ± 0.316
2.257SerTyr: 2.257 ± 0.536
0.0SerXaa: 0.0 ± 0.0
Thr
4.063ThrAla: 4.063 ± 0.639
0.339ThrCys: 0.339 ± 0.221
3.273ThrAsp: 3.273 ± 0.755
5.192ThrGlu: 5.192 ± 0.577
3.612ThrPhe: 3.612 ± 0.615
4.966ThrGly: 4.966 ± 0.772
0.564ThrHis: 0.564 ± 0.244
3.725ThrIle: 3.725 ± 0.661
5.982ThrLys: 5.982 ± 1.071
4.74ThrLeu: 4.74 ± 0.656
1.467ThrMet: 1.467 ± 0.499
4.515ThrAsn: 4.515 ± 0.677
1.242ThrPro: 1.242 ± 0.438
2.37ThrGln: 2.37 ± 0.472
2.822ThrArg: 2.822 ± 0.518
3.837ThrSer: 3.837 ± 0.483
4.515ThrThr: 4.515 ± 0.721
4.966ThrVal: 4.966 ± 0.967
0.339ThrTrp: 0.339 ± 0.208
2.483ThrTyr: 2.483 ± 0.537
0.0ThrXaa: 0.0 ± 0.0
Val
3.95ValAla: 3.95 ± 0.808
0.451ValCys: 0.451 ± 0.286
3.386ValAsp: 3.386 ± 0.675
4.402ValGlu: 4.402 ± 0.757
3.16ValPhe: 3.16 ± 0.603
3.047ValGly: 3.047 ± 0.699
1.129ValHis: 1.129 ± 0.499
3.273ValIle: 3.273 ± 0.669
5.305ValLys: 5.305 ± 0.769
4.628ValLeu: 4.628 ± 0.731
1.016ValMet: 1.016 ± 0.383
4.74ValAsn: 4.74 ± 0.798
1.919ValPro: 1.919 ± 0.485
2.032ValGln: 2.032 ± 0.544
1.806ValArg: 1.806 ± 0.531
4.74ValSer: 4.74 ± 0.671
4.176ValThr: 4.176 ± 0.823
3.725ValVal: 3.725 ± 0.753
1.242ValTrp: 1.242 ± 0.296
1.919ValTyr: 1.919 ± 0.501
0.0ValXaa: 0.0 ± 0.0
Trp
1.242TrpAla: 1.242 ± 0.326
0.0TrpCys: 0.0 ± 0.0
0.79TrpAsp: 0.79 ± 0.283
0.677TrpGlu: 0.677 ± 0.273
0.677TrpPhe: 0.677 ± 0.249
0.677TrpGly: 0.677 ± 0.233
0.226TrpHis: 0.226 ± 0.193
0.677TrpIle: 0.677 ± 0.241
0.677TrpLys: 0.677 ± 0.323
0.79TrpLeu: 0.79 ± 0.293
0.113TrpMet: 0.113 ± 0.104
0.79TrpAsn: 0.79 ± 0.239
0.113TrpPro: 0.113 ± 0.109
0.903TrpGln: 0.903 ± 0.415
0.79TrpArg: 0.79 ± 0.297
0.79TrpSer: 0.79 ± 0.296
1.016TrpThr: 1.016 ± 0.375
0.79TrpVal: 0.79 ± 0.254
0.113TrpTrp: 0.113 ± 0.082
0.339TrpTyr: 0.339 ± 0.189
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.919TyrAla: 1.919 ± 0.509
0.903TyrCys: 0.903 ± 0.322
1.693TyrAsp: 1.693 ± 0.533
2.709TyrGlu: 2.709 ± 0.729
2.483TyrPhe: 2.483 ± 0.568
3.273TyrGly: 3.273 ± 0.746
0.564TyrHis: 0.564 ± 0.319
2.144TyrIle: 2.144 ± 0.403
2.822TyrLys: 2.822 ± 0.662
2.144TyrLeu: 2.144 ± 0.564
1.354TyrMet: 1.354 ± 0.405
2.483TyrAsn: 2.483 ± 0.561
1.242TyrPro: 1.242 ± 0.413
1.806TyrGln: 1.806 ± 0.509
2.144TyrArg: 2.144 ± 0.516
3.612TyrSer: 3.612 ± 0.59
2.032TyrThr: 2.032 ± 0.422
2.032TyrVal: 2.032 ± 0.411
0.451TyrTrp: 0.451 ± 0.158
1.693TyrTyr: 1.693 ± 0.655
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (8861 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski