Amino acid dipepetide frequency for Streptococcus phage CHPC1230

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.565AlaAla: 5.565 ± 2.288
0.337AlaCys: 0.337 ± 0.167
4.722AlaAsp: 4.722 ± 0.648
4.469AlaGlu: 4.469 ± 0.838
2.951AlaPhe: 2.951 ± 1.124
5.228AlaGly: 5.228 ± 1.278
0.843AlaHis: 0.843 ± 0.244
5.565AlaIle: 5.565 ± 1.35
5.397AlaLys: 5.397 ± 0.606
6.577AlaLeu: 6.577 ± 0.774
2.361AlaMet: 2.361 ± 1.058
4.469AlaAsn: 4.469 ± 0.609
2.108AlaPro: 2.108 ± 0.42
3.036AlaGln: 3.036 ± 0.972
3.542AlaArg: 3.542 ± 0.559
5.987AlaSer: 5.987 ± 1.62
3.795AlaThr: 3.795 ± 0.755
4.216AlaVal: 4.216 ± 0.911
0.59AlaTrp: 0.59 ± 0.167
2.192AlaTyr: 2.192 ± 0.513
0.0AlaXaa: 0.0 ± 0.0
Cys
0.337CysAla: 0.337 ± 0.239
0.0CysCys: 0.0 ± 0.0
0.506CysAsp: 0.506 ± 0.227
0.843CysGlu: 0.843 ± 0.304
0.169CysPhe: 0.169 ± 0.137
0.422CysGly: 0.422 ± 0.242
0.337CysHis: 0.337 ± 0.185
0.506CysIle: 0.506 ± 0.223
0.337CysLys: 0.337 ± 0.188
0.59CysLeu: 0.59 ± 0.207
0.084CysMet: 0.084 ± 0.087
0.084CysAsn: 0.084 ± 0.076
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.253CysArg: 0.253 ± 0.127
0.843CysSer: 0.843 ± 0.297
0.169CysThr: 0.169 ± 0.111
0.337CysVal: 0.337 ± 0.162
0.084CysTrp: 0.084 ± 0.095
0.337CysTyr: 0.337 ± 0.159
0.0CysXaa: 0.0 ± 0.0
Asp
2.951AspAla: 2.951 ± 0.366
0.337AspCys: 0.337 ± 0.156
3.963AspAsp: 3.963 ± 0.477
4.132AspGlu: 4.132 ± 0.811
2.867AspPhe: 2.867 ± 0.507
6.071AspGly: 6.071 ± 0.895
0.759AspHis: 0.759 ± 0.264
4.301AspIle: 4.301 ± 0.729
5.059AspLys: 5.059 ± 0.627
4.554AspLeu: 4.554 ± 0.633
1.349AspMet: 1.349 ± 0.314
3.795AspAsn: 3.795 ± 0.662
0.506AspPro: 0.506 ± 0.181
1.349AspGln: 1.349 ± 0.321
3.373AspArg: 3.373 ± 0.714
4.301AspSer: 4.301 ± 0.732
3.626AspThr: 3.626 ± 0.629
3.963AspVal: 3.963 ± 0.498
1.181AspTrp: 1.181 ± 0.337
2.445AspTyr: 2.445 ± 0.53
0.0AspXaa: 0.0 ± 0.0
Glu
5.059GluAla: 5.059 ± 0.824
0.337GluCys: 0.337 ± 0.179
2.867GluAsp: 2.867 ± 0.596
4.891GluGlu: 4.891 ± 1.151
3.289GluPhe: 3.289 ± 0.641
3.036GluGly: 3.036 ± 0.55
1.096GluHis: 1.096 ± 0.311
4.554GluIle: 4.554 ± 0.787
5.734GluLys: 5.734 ± 1.271
7.673GluLeu: 7.673 ± 1.139
2.53GluMet: 2.53 ± 0.619
4.385GluAsn: 4.385 ± 0.754
1.771GluPro: 1.771 ± 0.455
3.204GluGln: 3.204 ± 0.56
4.132GluArg: 4.132 ± 0.732
2.361GluSer: 2.361 ± 0.694
3.71GluThr: 3.71 ± 0.734
5.565GluVal: 5.565 ± 0.993
0.843GluTrp: 0.843 ± 0.338
3.373GluTyr: 3.373 ± 0.854
0.0GluXaa: 0.0 ± 0.0
Phe
2.277PheAla: 2.277 ± 0.419
0.337PheCys: 0.337 ± 0.195
2.698PheAsp: 2.698 ± 0.541
4.132PheGlu: 4.132 ± 0.744
1.096PhePhe: 1.096 ± 0.3
3.626PheGly: 3.626 ± 0.699
0.337PheHis: 0.337 ± 0.164
2.867PheIle: 2.867 ± 0.471
4.638PheLys: 4.638 ± 0.574
1.686PheLeu: 1.686 ± 0.48
0.675PheMet: 0.675 ± 0.284
2.614PheAsn: 2.614 ± 0.357
0.759PhePro: 0.759 ± 0.307
1.012PheGln: 1.012 ± 0.29
1.012PheArg: 1.012 ± 0.368
3.795PheSer: 3.795 ± 0.714
2.698PheThr: 2.698 ± 0.668
1.939PheVal: 1.939 ± 0.404
0.759PheTrp: 0.759 ± 0.231
1.349PheTyr: 1.349 ± 0.351
0.0PheXaa: 0.0 ± 0.0
Gly
4.722GlyAla: 4.722 ± 0.906
0.337GlyCys: 0.337 ± 0.208
2.951GlyAsp: 2.951 ± 0.518
3.204GlyGlu: 3.204 ± 0.508
2.698GlyPhe: 2.698 ± 0.478
2.867GlyGly: 2.867 ± 0.405
1.012GlyHis: 1.012 ± 0.399
6.24GlyIle: 6.24 ± 1.765
5.903GlyLys: 5.903 ± 0.691
6.577GlyLeu: 6.577 ± 0.92
2.108GlyMet: 2.108 ± 0.624
3.626GlyAsn: 3.626 ± 0.627
1.349GlyPro: 1.349 ± 0.664
2.867GlyGln: 2.867 ± 0.532
2.867GlyArg: 2.867 ± 0.578
4.385GlySer: 4.385 ± 0.709
5.059GlyThr: 5.059 ± 1.085
4.132GlyVal: 4.132 ± 0.633
0.675GlyTrp: 0.675 ± 0.235
3.12GlyTyr: 3.12 ± 0.503
0.0GlyXaa: 0.0 ± 0.0
His
0.843HisAla: 0.843 ± 0.249
0.084HisCys: 0.084 ± 0.086
1.012HisAsp: 1.012 ± 0.274
0.59HisGlu: 0.59 ± 0.224
0.759HisPhe: 0.759 ± 0.254
0.59HisGly: 0.59 ± 0.245
0.675HisHis: 0.675 ± 0.335
0.843HisIle: 0.843 ± 0.264
0.928HisLys: 0.928 ± 0.268
1.012HisLeu: 1.012 ± 0.376
0.337HisMet: 0.337 ± 0.16
1.096HisAsn: 1.096 ± 0.326
0.253HisPro: 0.253 ± 0.126
0.506HisGln: 0.506 ± 0.262
0.928HisArg: 0.928 ± 0.257
0.675HisSer: 0.675 ± 0.247
1.181HisThr: 1.181 ± 0.295
1.012HisVal: 1.012 ± 0.347
0.169HisTrp: 0.169 ± 0.129
0.506HisTyr: 0.506 ± 0.252
0.0HisXaa: 0.0 ± 0.0
Ile
5.65IleAla: 5.65 ± 1.186
0.59IleCys: 0.59 ± 0.252
4.469IleAsp: 4.469 ± 0.408
4.722IleGlu: 4.722 ± 0.919
1.686IlePhe: 1.686 ± 0.346
5.312IleGly: 5.312 ± 1.088
1.012IleHis: 1.012 ± 0.22
3.204IleIle: 3.204 ± 0.621
6.156IleLys: 6.156 ± 0.67
3.542IleLeu: 3.542 ± 0.461
1.518IleMet: 1.518 ± 0.305
3.71IleAsn: 3.71 ± 0.782
2.614IlePro: 2.614 ± 0.557
3.036IleGln: 3.036 ± 0.561
2.361IleArg: 2.361 ± 0.536
5.144IleSer: 5.144 ± 1.042
4.638IleThr: 4.638 ± 0.86
4.469IleVal: 4.469 ± 0.715
0.59IleTrp: 0.59 ± 0.256
2.614IleTyr: 2.614 ± 0.556
0.0IleXaa: 0.0 ± 0.0
Lys
6.83LysAla: 6.83 ± 0.799
0.506LysCys: 0.506 ± 0.231
5.312LysAsp: 5.312 ± 0.883
7.589LysGlu: 7.589 ± 1.194
2.277LysPhe: 2.277 ± 0.397
5.059LysGly: 5.059 ± 0.759
1.181LysHis: 1.181 ± 0.379
5.734LysIle: 5.734 ± 0.733
6.409LysLys: 6.409 ± 1.068
6.577LysLeu: 6.577 ± 0.876
2.445LysMet: 2.445 ± 0.517
3.12LysAsn: 3.12 ± 0.548
2.867LysPro: 2.867 ± 0.508
3.204LysGln: 3.204 ± 0.7
5.228LysArg: 5.228 ± 0.983
5.144LysSer: 5.144 ± 0.58
4.806LysThr: 4.806 ± 0.817
3.963LysVal: 3.963 ± 0.436
0.843LysTrp: 0.843 ± 0.205
4.048LysTyr: 4.048 ± 0.662
0.0LysXaa: 0.0 ± 0.0
Leu
6.071LeuAla: 6.071 ± 0.693
0.084LeuCys: 0.084 ± 0.086
4.216LeuAsp: 4.216 ± 0.657
6.324LeuGlu: 6.324 ± 0.992
2.951LeuPhe: 2.951 ± 0.438
5.144LeuGly: 5.144 ± 1.095
0.675LeuHis: 0.675 ± 0.236
3.879LeuIle: 3.879 ± 0.488
6.915LeuLys: 6.915 ± 1.155
4.554LeuLeu: 4.554 ± 0.631
2.951LeuMet: 2.951 ± 0.479
5.059LeuAsn: 5.059 ± 0.552
2.53LeuPro: 2.53 ± 0.481
2.192LeuGln: 2.192 ± 0.381
3.373LeuArg: 3.373 ± 0.684
5.65LeuSer: 5.65 ± 0.604
5.312LeuThr: 5.312 ± 0.899
4.301LeuVal: 4.301 ± 0.523
0.59LeuTrp: 0.59 ± 0.261
3.204LeuTyr: 3.204 ± 0.576
0.0LeuXaa: 0.0 ± 0.0
Met
2.698MetAla: 2.698 ± 1.023
0.253MetCys: 0.253 ± 0.15
1.518MetAsp: 1.518 ± 0.389
1.349MetGlu: 1.349 ± 0.339
1.349MetPhe: 1.349 ± 0.294
1.265MetGly: 1.265 ± 0.427
0.169MetHis: 0.169 ± 0.119
1.434MetIle: 1.434 ± 0.32
2.783MetLys: 2.783 ± 0.614
1.602MetLeu: 1.602 ± 0.321
1.096MetMet: 1.096 ± 0.429
1.265MetAsn: 1.265 ± 0.345
0.928MetPro: 0.928 ± 0.28
1.686MetGln: 1.686 ± 0.48
0.928MetArg: 0.928 ± 0.261
2.361MetSer: 2.361 ± 0.406
1.602MetThr: 1.602 ± 0.358
1.686MetVal: 1.686 ± 0.445
0.084MetTrp: 0.084 ± 0.087
0.759MetTyr: 0.759 ± 0.184
0.0MetXaa: 0.0 ± 0.0
Asn
4.048AsnAla: 4.048 ± 0.55
0.337AsnCys: 0.337 ± 0.253
3.795AsnAsp: 3.795 ± 0.663
3.289AsnGlu: 3.289 ± 0.797
3.289AsnPhe: 3.289 ± 0.492
5.228AsnGly: 5.228 ± 0.821
1.265AsnHis: 1.265 ± 0.416
3.204AsnIle: 3.204 ± 0.523
4.301AsnLys: 4.301 ± 0.624
3.795AsnLeu: 3.795 ± 0.5
1.096AsnMet: 1.096 ± 0.269
3.036AsnAsn: 3.036 ± 0.674
2.361AsnPro: 2.361 ± 0.406
2.445AsnGln: 2.445 ± 0.581
2.108AsnArg: 2.108 ± 0.444
3.289AsnSer: 3.289 ± 0.589
2.951AsnThr: 2.951 ± 0.501
3.036AsnVal: 3.036 ± 0.4
1.181AsnTrp: 1.181 ± 0.332
1.518AsnTyr: 1.518 ± 0.435
0.0AsnXaa: 0.0 ± 0.0
Pro
1.518ProAla: 1.518 ± 0.292
0.084ProCys: 0.084 ± 0.074
1.939ProAsp: 1.939 ± 0.48
1.686ProGlu: 1.686 ± 0.393
0.928ProPhe: 0.928 ± 0.246
0.928ProGly: 0.928 ± 0.296
0.422ProHis: 0.422 ± 0.16
1.939ProIle: 1.939 ± 0.384
3.373ProLys: 3.373 ± 0.557
1.771ProLeu: 1.771 ± 0.479
0.169ProMet: 0.169 ± 0.139
1.771ProAsn: 1.771 ± 0.387
1.012ProPro: 1.012 ± 0.269
2.024ProGln: 2.024 ± 0.479
1.265ProArg: 1.265 ± 0.392
1.939ProSer: 1.939 ± 0.358
1.349ProThr: 1.349 ± 0.531
1.686ProVal: 1.686 ± 0.368
0.337ProTrp: 0.337 ± 0.145
1.012ProTyr: 1.012 ± 0.356
0.0ProXaa: 0.0 ± 0.0
Gln
3.71GlnAla: 3.71 ± 0.734
0.337GlnCys: 0.337 ± 0.174
2.53GlnAsp: 2.53 ± 0.512
3.373GlnGlu: 3.373 ± 0.725
2.108GlnPhe: 2.108 ± 0.433
2.445GlnGly: 2.445 ± 0.773
0.422GlnHis: 0.422 ± 0.208
2.361GlnIle: 2.361 ± 0.418
2.53GlnLys: 2.53 ± 0.521
3.879GlnLeu: 3.879 ± 0.541
1.349GlnMet: 1.349 ± 0.333
1.602GlnAsn: 1.602 ± 0.338
1.012GlnPro: 1.012 ± 0.301
1.939GlnGln: 1.939 ± 0.565
1.265GlnArg: 1.265 ± 0.374
2.698GlnSer: 2.698 ± 0.673
2.867GlnThr: 2.867 ± 0.483
2.614GlnVal: 2.614 ± 0.342
0.759GlnTrp: 0.759 ± 0.255
1.265GlnTyr: 1.265 ± 0.391
0.0GlnXaa: 0.0 ± 0.0
Arg
3.457ArgAla: 3.457 ± 0.415
0.506ArgCys: 0.506 ± 0.22
2.783ArgAsp: 2.783 ± 0.707
3.289ArgGlu: 3.289 ± 0.663
1.265ArgPhe: 1.265 ± 0.352
2.783ArgGly: 2.783 ± 0.516
0.59ArgHis: 0.59 ± 0.195
2.698ArgIle: 2.698 ± 0.644
3.457ArgLys: 3.457 ± 0.704
4.301ArgLeu: 4.301 ± 0.626
1.518ArgMet: 1.518 ± 0.331
1.939ArgAsn: 1.939 ± 0.466
1.349ArgPro: 1.349 ± 0.439
2.108ArgGln: 2.108 ± 0.427
1.518ArgArg: 1.518 ± 0.46
2.192ArgSer: 2.192 ± 0.411
2.277ArgThr: 2.277 ± 0.548
2.783ArgVal: 2.783 ± 0.616
0.675ArgTrp: 0.675 ± 0.248
2.698ArgTyr: 2.698 ± 0.509
0.0ArgXaa: 0.0 ± 0.0
Ser
5.734SerAla: 5.734 ± 2.26
0.506SerCys: 0.506 ± 0.187
4.554SerAsp: 4.554 ± 0.749
3.795SerGlu: 3.795 ± 0.731
2.614SerPhe: 2.614 ± 0.442
4.975SerGly: 4.975 ± 0.63
0.759SerHis: 0.759 ± 0.31
5.734SerIle: 5.734 ± 0.89
4.722SerLys: 4.722 ± 0.706
4.385SerLeu: 4.385 ± 0.673
1.096SerMet: 1.096 ± 0.218
3.457SerAsn: 3.457 ± 0.448
1.349SerPro: 1.349 ± 0.357
3.879SerGln: 3.879 ± 0.936
2.445SerArg: 2.445 ± 0.4
3.879SerSer: 3.879 ± 0.989
4.891SerThr: 4.891 ± 0.741
5.312SerVal: 5.312 ± 0.976
1.096SerTrp: 1.096 ± 0.31
2.192SerTyr: 2.192 ± 0.433
0.0SerXaa: 0.0 ± 0.0
Thr
4.554ThrAla: 4.554 ± 1.28
0.253ThrCys: 0.253 ± 0.173
2.614ThrAsp: 2.614 ± 0.501
3.963ThrGlu: 3.963 ± 0.807
3.373ThrPhe: 3.373 ± 0.453
4.469ThrGly: 4.469 ± 0.724
1.434ThrHis: 1.434 ± 0.377
4.722ThrIle: 4.722 ± 0.666
5.397ThrLys: 5.397 ± 0.651
4.722ThrLeu: 4.722 ± 0.574
1.518ThrMet: 1.518 ± 0.807
3.289ThrAsn: 3.289 ± 0.464
1.518ThrPro: 1.518 ± 0.469
2.867ThrGln: 2.867 ± 0.508
2.361ThrArg: 2.361 ± 0.517
3.626ThrSer: 3.626 ± 0.883
4.216ThrThr: 4.216 ± 0.701
5.059ThrVal: 5.059 ± 0.616
0.759ThrTrp: 0.759 ± 0.297
2.277ThrTyr: 2.277 ± 0.617
0.0ThrXaa: 0.0 ± 0.0
Val
4.301ValAla: 4.301 ± 0.883
0.59ValCys: 0.59 ± 0.23
4.891ValAsp: 4.891 ± 0.712
5.397ValGlu: 5.397 ± 0.99
2.445ValPhe: 2.445 ± 0.442
4.048ValGly: 4.048 ± 0.62
0.506ValHis: 0.506 ± 0.19
4.048ValIle: 4.048 ± 0.497
5.228ValLys: 5.228 ± 0.533
4.469ValLeu: 4.469 ± 0.475
1.265ValMet: 1.265 ± 0.27
4.048ValAsn: 4.048 ± 0.72
1.686ValPro: 1.686 ± 0.304
1.855ValGln: 1.855 ± 0.601
2.445ValArg: 2.445 ± 0.379
5.228ValSer: 5.228 ± 0.663
4.132ValThr: 4.132 ± 0.541
4.554ValVal: 4.554 ± 0.742
1.012ValTrp: 1.012 ± 0.26
1.771ValTyr: 1.771 ± 0.464
0.0ValXaa: 0.0 ± 0.0
Trp
0.675TrpAla: 0.675 ± 0.238
0.0TrpCys: 0.0 ± 0.0
0.759TrpAsp: 0.759 ± 0.24
1.012TrpGlu: 1.012 ± 0.249
0.675TrpPhe: 0.675 ± 0.273
0.759TrpGly: 0.759 ± 0.202
0.253TrpHis: 0.253 ± 0.164
0.506TrpIle: 0.506 ± 0.215
0.759TrpLys: 0.759 ± 0.173
0.843TrpLeu: 0.843 ± 0.291
0.253TrpMet: 0.253 ± 0.16
0.759TrpAsn: 0.759 ± 0.406
0.084TrpPro: 0.084 ± 0.107
0.253TrpGln: 0.253 ± 0.153
0.675TrpArg: 0.675 ± 0.261
1.602TrpSer: 1.602 ± 0.602
1.096TrpThr: 1.096 ± 0.384
1.096TrpVal: 1.096 ± 0.27
0.422TrpTrp: 0.422 ± 0.22
0.506TrpTyr: 0.506 ± 0.225
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.951TyrAla: 2.951 ± 0.445
0.422TyrCys: 0.422 ± 0.153
2.783TyrAsp: 2.783 ± 0.634
2.361TyrGlu: 2.361 ± 0.564
1.518TyrPhe: 1.518 ± 0.366
2.445TyrGly: 2.445 ± 0.41
0.253TyrHis: 0.253 ± 0.163
2.698TyrIle: 2.698 ± 0.555
3.12TyrLys: 3.12 ± 0.655
2.951TyrLeu: 2.951 ± 0.598
1.012TyrMet: 1.012 ± 0.362
2.445TyrAsn: 2.445 ± 0.483
1.096TyrPro: 1.096 ± 0.31
1.602TyrGln: 1.602 ± 0.37
2.108TyrArg: 2.108 ± 0.54
2.277TyrSer: 2.277 ± 0.572
2.614TyrThr: 2.614 ± 0.628
2.192TyrVal: 2.192 ± 0.371
0.337TyrTrp: 0.337 ± 0.161
2.108TyrTyr: 2.108 ± 0.638
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (11860 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski