Amino acid dipepetide frequency for Streptococcus satellite phage Javan720

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.525AlaAla: 0.525 ± 0.373
0.0AlaCys: 0.0 ± 0.0
2.889AlaAsp: 2.889 ± 0.711
7.355AlaGlu: 7.355 ± 1.584
2.889AlaPhe: 2.889 ± 0.865
1.839AlaGly: 1.839 ± 0.568
0.0AlaHis: 0.0 ± 0.0
3.677AlaIle: 3.677 ± 0.955
4.465AlaLys: 4.465 ± 0.841
5.253AlaLeu: 5.253 ± 1.169
2.364AlaMet: 2.364 ± 1.058
2.364AlaAsn: 2.364 ± 0.816
1.051AlaPro: 1.051 ± 0.504
2.627AlaGln: 2.627 ± 0.708
4.465AlaArg: 4.465 ± 1.012
1.576AlaSer: 1.576 ± 0.601
3.415AlaThr: 3.415 ± 0.961
3.677AlaVal: 3.677 ± 0.821
0.0AlaTrp: 0.0 ± 0.0
1.576AlaTyr: 1.576 ± 0.538
0.0AlaXaa: 0.0 ± 0.0
Cys
1.051CysAla: 1.051 ± 0.399
0.0CysCys: 0.0 ± 0.0
0.788CysAsp: 0.788 ± 0.378
0.263CysGlu: 0.263 ± 0.226
0.263CysPhe: 0.263 ± 0.245
0.788CysGly: 0.788 ± 0.633
0.788CysHis: 0.788 ± 0.38
0.525CysIle: 0.525 ± 0.446
0.263CysLys: 0.263 ± 0.248
1.313CysLeu: 1.313 ± 0.486
0.525CysMet: 0.525 ± 0.404
0.263CysAsn: 0.263 ± 0.223
0.525CysPro: 0.525 ± 0.353
1.051CysGln: 1.051 ± 0.794
0.525CysArg: 0.525 ± 0.375
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.263CysTyr: 0.263 ± 0.244
0.0CysXaa: 0.0 ± 0.0
Asp
0.788AspAla: 0.788 ± 0.42
1.051AspCys: 1.051 ± 0.655
3.677AspAsp: 3.677 ± 0.877
4.991AspGlu: 4.991 ± 0.803
4.203AspPhe: 4.203 ± 0.609
3.677AspGly: 3.677 ± 0.82
0.0AspHis: 0.0 ± 0.0
7.092AspIle: 7.092 ± 1.028
4.203AspLys: 4.203 ± 0.883
6.304AspLeu: 6.304 ± 0.823
1.313AspMet: 1.313 ± 0.512
2.627AspAsn: 2.627 ± 0.748
1.576AspPro: 1.576 ± 0.672
1.051AspGln: 1.051 ± 0.531
3.677AspArg: 3.677 ± 0.994
2.364AspSer: 2.364 ± 0.691
1.839AspThr: 1.839 ± 0.647
1.576AspVal: 1.576 ± 0.472
0.0AspTrp: 0.0 ± 0.0
3.677AspTyr: 3.677 ± 1.186
0.0AspXaa: 0.0 ± 0.0
Glu
4.991GluAla: 4.991 ± 1.3
0.788GluCys: 0.788 ± 0.382
3.152GluAsp: 3.152 ± 0.865
4.728GluGlu: 4.728 ± 1.064
3.415GluPhe: 3.415 ± 1.058
2.889GluGly: 2.889 ± 0.838
1.839GluHis: 1.839 ± 0.734
6.304GluIle: 6.304 ± 0.809
7.618GluLys: 7.618 ± 1.649
11.032GluLeu: 11.032 ± 1.544
2.364GluMet: 2.364 ± 0.82
5.779GluAsn: 5.779 ± 1.225
2.889GluPro: 2.889 ± 0.941
3.94GluGln: 3.94 ± 1.18
5.779GluArg: 5.779 ± 1.086
5.253GluSer: 5.253 ± 1.546
4.203GluThr: 4.203 ± 1.312
5.253GluVal: 5.253 ± 1.559
0.263GluTrp: 0.263 ± 0.223
3.94GluTyr: 3.94 ± 1.114
0.0GluXaa: 0.0 ± 0.0
Phe
1.051PheAla: 1.051 ± 0.459
0.0PheCys: 0.0 ± 0.0
3.94PheAsp: 3.94 ± 0.914
1.839PheGlu: 1.839 ± 0.797
2.627PhePhe: 2.627 ± 1.011
3.152PheGly: 3.152 ± 0.985
0.525PheHis: 0.525 ± 0.337
3.152PheIle: 3.152 ± 1.099
4.465PheLys: 4.465 ± 1.374
3.152PheLeu: 3.152 ± 1.056
0.788PheMet: 0.788 ± 0.428
3.152PheAsn: 3.152 ± 0.971
0.788PhePro: 0.788 ± 0.414
1.839PheGln: 1.839 ± 0.582
2.364PheArg: 2.364 ± 0.794
3.677PheSer: 3.677 ± 0.914
1.839PheThr: 1.839 ± 0.682
1.313PheVal: 1.313 ± 0.499
0.525PheTrp: 0.525 ± 0.454
2.889PheTyr: 2.889 ± 0.833
0.0PheXaa: 0.0 ± 0.0
Gly
2.627GlyAla: 2.627 ± 0.831
0.525GlyCys: 0.525 ± 0.362
3.415GlyAsp: 3.415 ± 0.8
2.889GlyGlu: 2.889 ± 0.79
1.576GlyPhe: 1.576 ± 0.703
2.101GlyGly: 2.101 ± 1.178
0.788GlyHis: 0.788 ± 0.411
4.465GlyIle: 4.465 ± 1.29
4.465GlyLys: 4.465 ± 1.11
4.465GlyLeu: 4.465 ± 1.292
1.839GlyMet: 1.839 ± 0.55
1.576GlyAsn: 1.576 ± 0.646
0.0GlyPro: 0.0 ± 0.0
2.889GlyGln: 2.889 ± 0.582
3.415GlyArg: 3.415 ± 1.313
1.839GlySer: 1.839 ± 0.601
3.152GlyThr: 3.152 ± 0.747
4.203GlyVal: 4.203 ± 0.9
1.313GlyTrp: 1.313 ± 0.672
3.152GlyTyr: 3.152 ± 0.977
0.0GlyXaa: 0.0 ± 0.0
His
1.313HisAla: 1.313 ± 0.701
0.0HisCys: 0.0 ± 0.0
0.788HisAsp: 0.788 ± 0.438
0.525HisGlu: 0.525 ± 0.446
0.788HisPhe: 0.788 ± 0.386
1.051HisGly: 1.051 ± 0.48
0.263HisHis: 0.263 ± 0.257
1.051HisIle: 1.051 ± 0.675
1.313HisLys: 1.313 ± 0.643
2.364HisLeu: 2.364 ± 0.751
0.0HisMet: 0.0 ± 0.0
1.313HisAsn: 1.313 ± 0.717
1.051HisPro: 1.051 ± 0.574
0.525HisGln: 0.525 ± 0.347
1.576HisArg: 1.576 ± 0.708
0.525HisSer: 0.525 ± 0.316
0.788HisThr: 0.788 ± 0.441
0.263HisVal: 0.263 ± 0.223
0.0HisTrp: 0.0 ± 0.0
1.313HisTyr: 1.313 ± 0.574
0.0HisXaa: 0.0 ± 0.0
Ile
4.465IleAla: 4.465 ± 1.201
0.525IleCys: 0.525 ± 0.323
3.677IleAsp: 3.677 ± 1.035
6.83IleGlu: 6.83 ± 1.833
3.152IlePhe: 3.152 ± 0.695
3.677IleGly: 3.677 ± 0.996
0.788IleHis: 0.788 ± 0.345
3.677IleIle: 3.677 ± 1.029
6.567IleLys: 6.567 ± 1.311
4.728IleLeu: 4.728 ± 1.144
0.263IleMet: 0.263 ± 0.257
2.364IleAsn: 2.364 ± 0.767
3.152IlePro: 3.152 ± 0.874
2.627IleGln: 2.627 ± 0.633
2.627IleArg: 2.627 ± 0.811
5.779IleSer: 5.779 ± 1.251
4.203IleThr: 4.203 ± 0.851
3.677IleVal: 3.677 ± 0.679
0.525IleTrp: 0.525 ± 0.446
1.576IleTyr: 1.576 ± 0.61
0.0IleXaa: 0.0 ± 0.0
Lys
7.355LysAla: 7.355 ± 1.423
0.525LysCys: 0.525 ± 0.325
3.415LysAsp: 3.415 ± 0.768
8.668LysGlu: 8.668 ± 1.373
2.101LysPhe: 2.101 ± 0.671
4.991LysGly: 4.991 ± 1.078
3.677LysHis: 3.677 ± 1.071
4.465LysIle: 4.465 ± 0.956
8.406LysLys: 8.406 ± 1.759
9.194LysLeu: 9.194 ± 1.545
2.364LysMet: 2.364 ± 0.84
4.991LysAsn: 4.991 ± 1.12
3.94LysPro: 3.94 ± 0.93
2.627LysGln: 2.627 ± 0.712
6.83LysArg: 6.83 ± 1.398
5.253LysSer: 5.253 ± 0.955
4.991LysThr: 4.991 ± 1.529
4.991LysVal: 4.991 ± 0.755
1.313LysTrp: 1.313 ± 0.455
2.627LysTyr: 2.627 ± 0.871
0.0LysXaa: 0.0 ± 0.0
Leu
5.779LeuAla: 5.779 ± 1.347
1.839LeuCys: 1.839 ± 0.561
9.719LeuAsp: 9.719 ± 1.711
11.295LeuGlu: 11.295 ± 1.806
3.677LeuPhe: 3.677 ± 0.975
4.728LeuGly: 4.728 ± 1.224
1.051LeuHis: 1.051 ± 0.667
5.253LeuIle: 5.253 ± 1.181
8.931LeuLys: 8.931 ± 1.39
9.719LeuLeu: 9.719 ± 1.274
2.627LeuMet: 2.627 ± 0.706
7.618LeuAsn: 7.618 ± 1.381
3.152LeuPro: 3.152 ± 0.926
3.415LeuGln: 3.415 ± 0.788
3.677LeuArg: 3.677 ± 0.956
4.728LeuSer: 4.728 ± 0.759
7.88LeuThr: 7.88 ± 1.696
3.415LeuVal: 3.415 ± 1.04
0.525LeuTrp: 0.525 ± 0.346
4.728LeuTyr: 4.728 ± 1.319
0.0LeuXaa: 0.0 ± 0.0
Met
1.839MetAla: 1.839 ± 0.561
0.0MetCys: 0.0 ± 0.0
1.576MetAsp: 1.576 ± 0.517
3.415MetGlu: 3.415 ± 0.865
1.051MetPhe: 1.051 ± 0.422
1.839MetGly: 1.839 ± 0.68
0.263MetHis: 0.263 ± 0.272
0.788MetIle: 0.788 ± 0.377
1.576MetLys: 1.576 ± 0.808
1.576MetLeu: 1.576 ± 0.469
0.788MetMet: 0.788 ± 0.515
2.364MetAsn: 2.364 ± 0.753
1.051MetPro: 1.051 ± 0.497
1.051MetGln: 1.051 ± 0.592
1.839MetArg: 1.839 ± 0.716
1.051MetSer: 1.051 ± 0.487
3.152MetThr: 3.152 ± 1.034
0.525MetVal: 0.525 ± 0.446
0.263MetTrp: 0.263 ± 0.223
0.263MetTyr: 0.263 ± 0.305
0.0MetXaa: 0.0 ± 0.0
Asn
2.364AsnAla: 2.364 ± 0.859
0.263AsnCys: 0.263 ± 0.226
3.677AsnAsp: 3.677 ± 1.256
3.152AsnGlu: 3.152 ± 0.706
1.839AsnPhe: 1.839 ± 0.634
4.465AsnGly: 4.465 ± 1.057
1.313AsnHis: 1.313 ± 0.667
1.839AsnIle: 1.839 ± 0.824
8.143AsnLys: 8.143 ± 1.385
5.253AsnLeu: 5.253 ± 0.766
2.364AsnMet: 2.364 ± 0.812
2.627AsnAsn: 2.627 ± 0.624
2.364AsnPro: 2.364 ± 0.671
1.839AsnGln: 1.839 ± 0.652
2.364AsnArg: 2.364 ± 0.631
3.152AsnSer: 3.152 ± 1.013
2.627AsnThr: 2.627 ± 0.778
2.364AsnVal: 2.364 ± 0.901
0.263AsnTrp: 0.263 ± 0.226
2.627AsnTyr: 2.627 ± 0.747
0.0AsnXaa: 0.0 ± 0.0
Pro
1.576ProAla: 1.576 ± 0.56
0.263ProCys: 0.263 ± 0.223
2.364ProAsp: 2.364 ± 0.583
3.677ProGlu: 3.677 ± 0.844
1.839ProPhe: 1.839 ± 0.767
1.051ProGly: 1.051 ± 0.681
0.263ProHis: 0.263 ± 0.305
1.313ProIle: 1.313 ± 0.528
2.627ProLys: 2.627 ± 0.577
2.364ProLeu: 2.364 ± 0.659
0.525ProMet: 0.525 ± 0.303
2.101ProAsn: 2.101 ± 0.765
1.576ProPro: 1.576 ± 0.709
1.839ProGln: 1.839 ± 0.556
3.152ProArg: 3.152 ± 0.825
1.313ProSer: 1.313 ± 0.44
0.788ProThr: 0.788 ± 0.393
2.101ProVal: 2.101 ± 0.718
0.0ProTrp: 0.0 ± 0.0
1.313ProTyr: 1.313 ± 0.512
0.0ProXaa: 0.0 ± 0.0
Gln
3.415GlnAla: 3.415 ± 1.204
0.263GlnCys: 0.263 ± 0.257
1.839GlnAsp: 1.839 ± 0.773
2.627GlnGlu: 2.627 ± 0.57
1.839GlnPhe: 1.839 ± 0.628
2.889GlnGly: 2.889 ± 0.831
0.525GlnHis: 0.525 ± 0.324
1.051GlnIle: 1.051 ± 0.506
3.415GlnLys: 3.415 ± 0.7
3.94GlnLeu: 3.94 ± 0.917
1.051GlnMet: 1.051 ± 0.494
1.839GlnAsn: 1.839 ± 0.775
1.051GlnPro: 1.051 ± 0.537
1.051GlnGln: 1.051 ± 0.485
2.364GlnArg: 2.364 ± 0.745
0.788GlnSer: 0.788 ± 0.427
2.364GlnThr: 2.364 ± 0.731
3.677GlnVal: 3.677 ± 0.79
1.051GlnTrp: 1.051 ± 0.46
1.313GlnTyr: 1.313 ± 0.722
0.0GlnXaa: 0.0 ± 0.0
Arg
3.415ArgAla: 3.415 ± 0.841
0.525ArgCys: 0.525 ± 0.396
2.101ArgAsp: 2.101 ± 0.705
5.253ArgGlu: 5.253 ± 1.311
4.203ArgPhe: 4.203 ± 1.168
2.101ArgGly: 2.101 ± 1.02
1.051ArgHis: 1.051 ± 0.382
4.728ArgIle: 4.728 ± 0.92
5.516ArgLys: 5.516 ± 1.262
7.618ArgLeu: 7.618 ± 1.179
0.788ArgMet: 0.788 ± 0.52
2.364ArgAsn: 2.364 ± 0.805
0.788ArgPro: 0.788 ± 0.415
2.364ArgGln: 2.364 ± 1.001
2.101ArgArg: 2.101 ± 0.868
3.152ArgSer: 3.152 ± 0.741
2.889ArgThr: 2.889 ± 0.709
2.889ArgVal: 2.889 ± 0.559
0.263ArgTrp: 0.263 ± 0.223
4.203ArgTyr: 4.203 ± 1.227
0.0ArgXaa: 0.0 ± 0.0
Ser
2.364SerAla: 2.364 ± 0.705
0.788SerCys: 0.788 ± 0.438
2.364SerAsp: 2.364 ± 0.771
4.728SerGlu: 4.728 ± 1.309
1.839SerPhe: 1.839 ± 0.525
3.677SerGly: 3.677 ± 0.754
0.788SerHis: 0.788 ± 0.432
3.152SerIle: 3.152 ± 0.99
4.991SerLys: 4.991 ± 0.856
4.203SerLeu: 4.203 ± 1.042
1.313SerMet: 1.313 ± 0.536
4.203SerAsn: 4.203 ± 1.286
1.576SerPro: 1.576 ± 0.519
1.576SerGln: 1.576 ± 0.659
4.203SerArg: 4.203 ± 1.198
3.415SerSer: 3.415 ± 0.764
2.364SerThr: 2.364 ± 0.76
2.364SerVal: 2.364 ± 0.637
0.525SerTrp: 0.525 ± 0.374
2.101SerTyr: 2.101 ± 0.781
0.0SerXaa: 0.0 ± 0.0
Thr
1.313ThrAla: 1.313 ± 0.627
0.525ThrCys: 0.525 ± 0.411
2.364ThrAsp: 2.364 ± 0.805
3.152ThrGlu: 3.152 ± 0.756
2.364ThrPhe: 2.364 ± 0.821
2.101ThrGly: 2.101 ± 0.619
1.839ThrHis: 1.839 ± 0.692
3.677ThrIle: 3.677 ± 0.928
4.728ThrLys: 4.728 ± 1.577
8.143ThrLeu: 8.143 ± 1.829
1.576ThrMet: 1.576 ± 0.778
1.576ThrAsn: 1.576 ± 0.774
1.839ThrPro: 1.839 ± 0.742
2.364ThrGln: 2.364 ± 0.676
2.101ThrArg: 2.101 ± 0.488
2.364ThrSer: 2.364 ± 0.687
4.465ThrThr: 4.465 ± 1.2
4.991ThrVal: 4.991 ± 1.476
0.263ThrTrp: 0.263 ± 0.25
3.94ThrTyr: 3.94 ± 1.137
0.0ThrXaa: 0.0 ± 0.0
Val
3.677ValAla: 3.677 ± 0.988
0.263ValCys: 0.263 ± 0.223
1.839ValAsp: 1.839 ± 0.665
7.092ValGlu: 7.092 ± 1.162
1.313ValPhe: 1.313 ± 0.835
2.364ValGly: 2.364 ± 0.929
0.0ValHis: 0.0 ± 0.0
3.415ValIle: 3.415 ± 1.063
5.253ValLys: 5.253 ± 1.042
4.991ValLeu: 4.991 ± 1.186
1.576ValMet: 1.576 ± 0.623
3.152ValAsn: 3.152 ± 0.948
2.101ValPro: 2.101 ± 0.757
1.839ValGln: 1.839 ± 0.724
1.576ValArg: 1.576 ± 0.806
2.889ValSer: 2.889 ± 0.832
3.415ValThr: 3.415 ± 0.829
3.677ValVal: 3.677 ± 1.091
0.525ValTrp: 0.525 ± 0.32
2.627ValTyr: 2.627 ± 0.729
0.0ValXaa: 0.0 ± 0.0
Trp
0.525TrpAla: 0.525 ± 0.3
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.313TrpGlu: 1.313 ± 0.526
0.263TrpPhe: 0.263 ± 0.223
0.525TrpGly: 0.525 ± 0.32
0.263TrpHis: 0.263 ± 0.223
0.788TrpIle: 0.788 ± 0.462
0.263TrpLys: 0.263 ± 0.271
0.788TrpLeu: 0.788 ± 0.357
0.0TrpMet: 0.0 ± 0.0
0.525TrpAsn: 0.525 ± 0.39
0.0TrpPro: 0.0 ± 0.0
0.525TrpGln: 0.525 ± 0.384
0.263TrpArg: 0.263 ± 0.223
0.788TrpSer: 0.788 ± 0.419
0.0TrpThr: 0.0 ± 0.0
0.788TrpVal: 0.788 ± 0.444
0.0TrpTrp: 0.0 ± 0.0
0.263TrpTyr: 0.263 ± 0.223
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.576TyrAla: 1.576 ± 0.632
1.051TyrCys: 1.051 ± 0.432
2.101TyrAsp: 2.101 ± 0.562
2.627TyrGlu: 2.627 ± 0.969
1.839TyrPhe: 1.839 ± 0.9
0.788TyrGly: 0.788 ± 0.427
0.788TyrHis: 0.788 ± 0.426
4.465TyrIle: 4.465 ± 0.962
5.516TyrLys: 5.516 ± 1.69
7.355TyrLeu: 7.355 ± 1.28
1.576TyrMet: 1.576 ± 0.788
2.101TyrAsn: 2.101 ± 0.55
1.576TyrPro: 1.576 ± 0.762
1.313TyrGln: 1.313 ± 0.484
3.415TyrArg: 3.415 ± 0.747
2.627TyrSer: 2.627 ± 0.724
1.576TyrThr: 1.576 ± 0.613
1.839TyrVal: 1.839 ± 0.603
0.263TyrTrp: 0.263 ± 0.291
0.788TyrTyr: 0.788 ± 0.557
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 26 proteins (3808 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski