Amino acid dipepetide frequency for Streptococcus satellite phage Javan751

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.549AlaAla: 3.549 ± 1.026
1.774AlaCys: 1.774 ± 0.996
3.549AlaAsp: 3.549 ± 0.904
4.968AlaGlu: 4.968 ± 1.859
2.839AlaPhe: 2.839 ± 0.669
2.484AlaGly: 2.484 ± 0.935
0.0AlaHis: 0.0 ± 0.0
6.742AlaIle: 6.742 ± 1.426
6.033AlaLys: 6.033 ± 1.047
5.323AlaLeu: 5.323 ± 1.041
2.484AlaMet: 2.484 ± 0.979
2.484AlaAsn: 2.484 ± 1.206
1.419AlaPro: 1.419 ± 0.737
1.419AlaGln: 1.419 ± 0.644
2.484AlaArg: 2.484 ± 0.98
1.774AlaSer: 1.774 ± 0.624
4.258AlaThr: 4.258 ± 1.011
4.968AlaVal: 4.968 ± 1.25
0.0AlaTrp: 0.0 ± 0.0
2.839AlaTyr: 2.839 ± 1.245
0.0AlaXaa: 0.0 ± 0.0
Cys
0.355CysAla: 0.355 ± 0.331
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.355CysGlu: 0.355 ± 0.266
0.355CysPhe: 0.355 ± 0.396
0.0CysGly: 0.0 ± 0.0
0.355CysHis: 0.355 ± 0.332
0.355CysIle: 0.355 ± 0.266
0.355CysLys: 0.355 ± 0.41
0.71CysLeu: 0.71 ± 0.722
0.71CysMet: 0.71 ± 0.493
0.71CysAsn: 0.71 ± 0.487
0.355CysPro: 0.355 ± 0.266
1.065CysGln: 1.065 ± 0.513
0.355CysArg: 0.355 ± 0.295
0.0CysSer: 0.0 ± 0.0
0.355CysThr: 0.355 ± 0.396
0.355CysVal: 0.355 ± 0.413
0.0CysTrp: 0.0 ± 0.0
0.355CysTyr: 0.355 ± 0.395
0.0CysXaa: 0.0 ± 0.0
Asp
4.613AspAla: 4.613 ± 0.952
0.0AspCys: 0.0 ± 0.0
4.613AspAsp: 4.613 ± 1.013
4.258AspGlu: 4.258 ± 1.174
4.968AspPhe: 4.968 ± 1.938
4.968AspGly: 4.968 ± 1.207
0.355AspHis: 0.355 ± 0.374
5.678AspIle: 5.678 ± 1.341
4.613AspLys: 4.613 ± 0.981
5.678AspLeu: 5.678 ± 1.636
1.774AspMet: 1.774 ± 0.785
3.194AspAsn: 3.194 ± 0.703
0.0AspPro: 0.0 ± 0.0
1.774AspGln: 1.774 ± 0.603
1.774AspArg: 1.774 ± 0.741
1.419AspSer: 1.419 ± 0.485
0.71AspThr: 0.71 ± 0.665
2.839AspVal: 2.839 ± 0.923
1.065AspTrp: 1.065 ± 0.445
1.774AspTyr: 1.774 ± 0.818
0.0AspXaa: 0.0 ± 0.0
Glu
5.323GluAla: 5.323 ± 1.668
1.065GluCys: 1.065 ± 0.608
3.903GluAsp: 3.903 ± 1.048
7.097GluGlu: 7.097 ± 1.776
3.549GluPhe: 3.549 ± 1.468
1.419GluGly: 1.419 ± 0.843
0.71GluHis: 0.71 ± 0.495
7.452GluIle: 7.452 ± 1.582
9.581GluLys: 9.581 ± 1.707
12.42GluLeu: 12.42 ± 2.645
4.258GluMet: 4.258 ± 1.068
7.807GluAsn: 7.807 ± 1.608
3.194GluPro: 3.194 ± 1.148
4.613GluGln: 4.613 ± 1.262
3.549GluArg: 3.549 ± 1.029
3.194GluSer: 3.194 ± 0.933
3.549GluThr: 3.549 ± 0.809
3.194GluVal: 3.194 ± 0.768
0.71GluTrp: 0.71 ± 0.487
2.129GluTyr: 2.129 ± 0.856
0.0GluXaa: 0.0 ± 0.0
Phe
2.129PheAla: 2.129 ± 0.649
0.0PheCys: 0.0 ± 0.0
3.194PheAsp: 3.194 ± 0.906
4.613PheGlu: 4.613 ± 1.594
2.484PhePhe: 2.484 ± 1.225
2.129PheGly: 2.129 ± 0.59
0.355PheHis: 0.355 ± 0.295
2.484PheIle: 2.484 ± 1.368
3.903PheLys: 3.903 ± 1.311
2.839PheLeu: 2.839 ± 0.758
1.774PheMet: 1.774 ± 0.821
2.484PheAsn: 2.484 ± 0.815
0.71PhePro: 0.71 ± 0.595
0.0PheGln: 0.0 ± 0.0
0.71PheArg: 0.71 ± 0.423
1.774PheSer: 1.774 ± 0.646
1.065PheThr: 1.065 ± 0.444
2.129PheVal: 2.129 ± 0.885
0.355PheTrp: 0.355 ± 0.337
0.71PheTyr: 0.71 ± 0.537
0.0PheXaa: 0.0 ± 0.0
Gly
4.258GlyAla: 4.258 ± 1.143
0.355GlyCys: 0.355 ± 0.295
1.774GlyAsp: 1.774 ± 0.729
3.194GlyGlu: 3.194 ± 0.983
1.774GlyPhe: 1.774 ± 0.674
0.71GlyGly: 0.71 ± 0.434
1.065GlyHis: 1.065 ± 0.415
6.033GlyIle: 6.033 ± 1.305
4.968GlyLys: 4.968 ± 1.086
4.968GlyLeu: 4.968 ± 1.42
1.419GlyMet: 1.419 ± 0.676
1.774GlyAsn: 1.774 ± 0.888
0.0GlyPro: 0.0 ± 0.0
1.774GlyGln: 1.774 ± 0.617
1.065GlyArg: 1.065 ± 0.551
1.419GlySer: 1.419 ± 0.793
1.419GlyThr: 1.419 ± 0.802
3.194GlyVal: 3.194 ± 1.575
0.71GlyTrp: 0.71 ± 0.533
2.839GlyTyr: 2.839 ± 0.927
0.0GlyXaa: 0.0 ± 0.0
His
1.065HisAla: 1.065 ± 0.591
0.0HisCys: 0.0 ± 0.0
1.065HisAsp: 1.065 ± 0.602
2.129HisGlu: 2.129 ± 0.65
1.065HisPhe: 1.065 ± 0.491
1.065HisGly: 1.065 ± 0.498
0.355HisHis: 0.355 ± 0.331
0.0HisIle: 0.0 ± 0.0
1.419HisLys: 1.419 ± 0.656
1.065HisLeu: 1.065 ± 0.605
0.355HisMet: 0.355 ± 0.41
0.355HisAsn: 0.355 ± 0.295
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
1.065HisArg: 1.065 ± 0.618
0.355HisSer: 0.355 ± 0.295
2.129HisThr: 2.129 ± 0.882
1.065HisVal: 1.065 ± 0.452
0.0HisTrp: 0.0 ± 0.0
0.355HisTyr: 0.355 ± 0.332
0.0HisXaa: 0.0 ± 0.0
Ile
2.484IleAla: 2.484 ± 1.295
0.0IleCys: 0.0 ± 0.0
4.613IleAsp: 4.613 ± 0.931
7.452IleGlu: 7.452 ± 2.248
1.774IlePhe: 1.774 ± 0.59
2.484IleGly: 2.484 ± 0.749
1.419IleHis: 1.419 ± 0.615
2.484IleIle: 2.484 ± 0.897
11.356IleLys: 11.356 ± 1.805
7.807IleLeu: 7.807 ± 1.571
0.71IleMet: 0.71 ± 0.61
4.258IleAsn: 4.258 ± 0.885
3.549IlePro: 3.549 ± 0.959
3.903IleGln: 3.903 ± 0.861
1.065IleArg: 1.065 ± 0.698
4.613IleSer: 4.613 ± 1.371
3.903IleThr: 3.903 ± 0.949
2.484IleVal: 2.484 ± 0.888
0.0IleTrp: 0.0 ± 0.0
1.774IleTyr: 1.774 ± 0.827
0.0IleXaa: 0.0 ± 0.0
Lys
10.646LysAla: 10.646 ± 1.749
0.0LysCys: 0.0 ± 0.0
4.613LysAsp: 4.613 ± 1.217
8.872LysGlu: 8.872 ± 1.41
1.065LysPhe: 1.065 ± 0.607
6.033LysGly: 6.033 ± 1.64
2.839LysHis: 2.839 ± 0.735
4.613LysIle: 4.613 ± 1.102
9.226LysLys: 9.226 ± 2.048
7.097LysLeu: 7.097 ± 1.392
3.903LysMet: 3.903 ± 1.074
7.452LysAsn: 7.452 ± 1.282
2.839LysPro: 2.839 ± 0.717
7.097LysGln: 7.097 ± 1.048
6.033LysArg: 6.033 ± 1.748
8.872LysSer: 8.872 ± 1.899
6.388LysThr: 6.388 ± 1.517
3.194LysVal: 3.194 ± 1.186
1.419LysTrp: 1.419 ± 0.944
4.258LysTyr: 4.258 ± 0.993
0.0LysXaa: 0.0 ± 0.0
Leu
6.388LeuAla: 6.388 ± 1.354
0.71LeuCys: 0.71 ± 0.43
7.807LeuAsp: 7.807 ± 1.813
9.936LeuGlu: 9.936 ± 2.255
3.903LeuPhe: 3.903 ± 1.048
5.678LeuGly: 5.678 ± 1.093
0.71LeuHis: 0.71 ± 0.358
5.678LeuIle: 5.678 ± 1.103
12.065LeuLys: 12.065 ± 1.747
11.356LeuLeu: 11.356 ± 1.534
2.839LeuMet: 2.839 ± 0.874
5.678LeuAsn: 5.678 ± 1.081
1.774LeuPro: 1.774 ± 0.645
4.258LeuGln: 4.258 ± 1.009
2.129LeuArg: 2.129 ± 0.885
4.968LeuSer: 4.968 ± 1.103
7.097LeuThr: 7.097 ± 1.417
5.678LeuVal: 5.678 ± 1.233
1.065LeuTrp: 1.065 ± 0.619
5.323LeuTyr: 5.323 ± 1.123
0.0LeuXaa: 0.0 ± 0.0
Met
1.774MetAla: 1.774 ± 0.72
0.0MetCys: 0.0 ± 0.0
3.549MetAsp: 3.549 ± 1.27
2.129MetGlu: 2.129 ± 0.717
0.71MetPhe: 0.71 ± 0.577
1.065MetGly: 1.065 ± 0.653
0.0MetHis: 0.0 ± 0.0
1.774MetIle: 1.774 ± 0.507
2.484MetLys: 2.484 ± 0.903
2.129MetLeu: 2.129 ± 0.765
1.774MetMet: 1.774 ± 0.856
2.484MetAsn: 2.484 ± 0.809
0.355MetPro: 0.355 ± 0.41
1.774MetGln: 1.774 ± 0.689
2.484MetArg: 2.484 ± 0.986
1.419MetSer: 1.419 ± 0.669
3.903MetThr: 3.903 ± 1.748
0.355MetVal: 0.355 ± 0.332
0.0MetTrp: 0.0 ± 0.0
0.355MetTyr: 0.355 ± 0.337
0.0MetXaa: 0.0 ± 0.0
Asn
3.194AsnAla: 3.194 ± 0.84
0.0AsnCys: 0.0 ± 0.0
1.419AsnAsp: 1.419 ± 0.738
4.968AsnGlu: 4.968 ± 1.18
0.355AsnPhe: 0.355 ± 0.314
5.678AsnGly: 5.678 ± 1.245
1.419AsnHis: 1.419 ± 0.705
2.129AsnIle: 2.129 ± 0.677
8.517AsnLys: 8.517 ± 1.806
6.388AsnLeu: 6.388 ± 1.193
1.774AsnMet: 1.774 ± 0.657
1.419AsnAsn: 1.419 ± 0.607
1.774AsnPro: 1.774 ± 0.809
2.129AsnGln: 2.129 ± 0.8
2.839AsnArg: 2.839 ± 0.734
1.774AsnSer: 1.774 ± 0.596
4.258AsnThr: 4.258 ± 1.557
1.774AsnVal: 1.774 ± 0.635
0.71AsnTrp: 0.71 ± 0.476
2.839AsnTyr: 2.839 ± 1.102
0.0AsnXaa: 0.0 ± 0.0
Pro
0.71ProAla: 0.71 ± 0.466
0.355ProCys: 0.355 ± 0.395
0.71ProAsp: 0.71 ± 0.483
1.774ProGlu: 1.774 ± 0.882
0.71ProPhe: 0.71 ± 0.393
1.065ProGly: 1.065 ± 0.57
0.0ProHis: 0.0 ± 0.0
0.71ProIle: 0.71 ± 0.591
6.388ProLys: 6.388 ± 1.6
1.774ProLeu: 1.774 ± 0.601
0.71ProMet: 0.71 ± 0.509
1.065ProAsn: 1.065 ± 0.601
0.355ProPro: 0.355 ± 0.266
0.355ProGln: 0.355 ± 0.37
0.355ProArg: 0.355 ± 0.295
1.065ProSer: 1.065 ± 0.626
2.484ProThr: 2.484 ± 0.933
1.065ProVal: 1.065 ± 0.53
0.0ProTrp: 0.0 ± 0.0
1.774ProTyr: 1.774 ± 0.697
0.0ProXaa: 0.0 ± 0.0
Gln
3.549GlnAla: 3.549 ± 1.272
0.71GlnCys: 0.71 ± 0.443
3.194GlnAsp: 3.194 ± 1.291
3.549GlnGlu: 3.549 ± 1.055
1.065GlnPhe: 1.065 ± 0.523
1.419GlnGly: 1.419 ± 0.655
0.71GlnHis: 0.71 ± 0.479
3.549GlnIle: 3.549 ± 0.817
2.484GlnLys: 2.484 ± 0.623
6.388GlnLeu: 6.388 ± 1.527
1.065GlnMet: 1.065 ± 0.63
0.355GlnAsn: 0.355 ± 0.332
0.355GlnPro: 0.355 ± 0.331
3.194GlnGln: 3.194 ± 1.242
4.258GlnArg: 4.258 ± 1.049
3.194GlnSer: 3.194 ± 0.848
3.194GlnThr: 3.194 ± 1.049
2.484GlnVal: 2.484 ± 0.998
0.0GlnTrp: 0.0 ± 0.0
2.484GlnTyr: 2.484 ± 0.803
0.0GlnXaa: 0.0 ± 0.0
Arg
2.484ArgAla: 2.484 ± 1.148
0.355ArgCys: 0.355 ± 0.266
3.194ArgAsp: 3.194 ± 0.789
3.903ArgGlu: 3.903 ± 1.35
0.71ArgPhe: 0.71 ± 0.432
1.419ArgGly: 1.419 ± 0.682
0.71ArgHis: 0.71 ± 0.396
3.903ArgIle: 3.903 ± 1.285
3.549ArgLys: 3.549 ± 0.952
3.549ArgLeu: 3.549 ± 1.258
0.355ArgMet: 0.355 ± 0.314
1.774ArgAsn: 1.774 ± 0.648
0.71ArgPro: 0.71 ± 0.514
4.613ArgGln: 4.613 ± 1.092
1.065ArgArg: 1.065 ± 0.576
0.71ArgSer: 0.71 ± 0.465
1.774ArgThr: 1.774 ± 0.924
2.484ArgVal: 2.484 ± 0.693
1.065ArgTrp: 1.065 ± 0.575
2.839ArgTyr: 2.839 ± 0.661
0.0ArgXaa: 0.0 ± 0.0
Ser
2.839SerAla: 2.839 ± 0.866
0.71SerCys: 0.71 ± 0.722
3.549SerAsp: 3.549 ± 0.981
5.678SerGlu: 5.678 ± 1.496
1.419SerPhe: 1.419 ± 0.467
0.355SerGly: 0.355 ± 0.295
0.71SerHis: 0.71 ± 0.506
2.839SerIle: 2.839 ± 0.782
6.033SerLys: 6.033 ± 1.846
4.968SerLeu: 4.968 ± 1.374
1.774SerMet: 1.774 ± 0.779
3.903SerAsn: 3.903 ± 1.413
2.484SerPro: 2.484 ± 1.054
2.129SerGln: 2.129 ± 0.912
2.839SerArg: 2.839 ± 0.987
1.774SerSer: 1.774 ± 0.583
1.774SerThr: 1.774 ± 0.657
3.549SerVal: 3.549 ± 1.088
1.065SerTrp: 1.065 ± 0.535
2.129SerTyr: 2.129 ± 0.584
0.0SerXaa: 0.0 ± 0.0
Thr
1.774ThrAla: 1.774 ± 0.599
0.0ThrCys: 0.0 ± 0.0
1.774ThrAsp: 1.774 ± 0.886
4.613ThrGlu: 4.613 ± 1.004
2.484ThrPhe: 2.484 ± 0.873
5.323ThrGly: 5.323 ± 0.813
1.065ThrHis: 1.065 ± 0.561
4.258ThrIle: 4.258 ± 0.784
4.968ThrLys: 4.968 ± 1.297
7.452ThrLeu: 7.452 ± 1.415
0.71ThrMet: 0.71 ± 0.401
2.839ThrAsn: 2.839 ± 0.833
1.774ThrPro: 1.774 ± 0.775
4.258ThrGln: 4.258 ± 1.306
1.419ThrArg: 1.419 ± 0.541
2.484ThrSer: 2.484 ± 0.798
5.678ThrThr: 5.678 ± 1.374
3.903ThrVal: 3.903 ± 1.254
0.71ThrTrp: 0.71 ± 0.584
2.839ThrTyr: 2.839 ± 0.767
0.0ThrXaa: 0.0 ± 0.0
Val
1.774ValAla: 1.774 ± 0.601
0.0ValCys: 0.0 ± 0.0
2.484ValAsp: 2.484 ± 0.75
4.613ValGlu: 4.613 ± 1.974
2.129ValPhe: 2.129 ± 0.952
0.355ValGly: 0.355 ± 0.266
0.71ValHis: 0.71 ± 0.401
3.549ValIle: 3.549 ± 1.385
3.549ValLys: 3.549 ± 1.724
3.903ValLeu: 3.903 ± 0.986
0.71ValMet: 0.71 ± 0.61
1.419ValAsn: 1.419 ± 0.543
1.065ValPro: 1.065 ± 0.56
1.774ValGln: 1.774 ± 0.611
3.194ValArg: 3.194 ± 1.02
5.323ValSer: 5.323 ± 1.287
4.613ValThr: 4.613 ± 0.985
2.129ValVal: 2.129 ± 0.716
1.065ValTrp: 1.065 ± 0.616
3.549ValTyr: 3.549 ± 1.093
0.0ValXaa: 0.0 ± 0.0
Trp
0.355TrpAla: 0.355 ± 0.295
0.355TrpCys: 0.355 ± 0.383
0.71TrpAsp: 0.71 ± 0.401
1.419TrpGlu: 1.419 ± 0.519
0.355TrpPhe: 0.355 ± 0.41
0.355TrpGly: 0.355 ± 0.395
0.355TrpHis: 0.355 ± 0.295
1.419TrpIle: 1.419 ± 0.704
0.71TrpLys: 0.71 ± 0.393
1.419TrpLeu: 1.419 ± 0.73
0.0TrpMet: 0.0 ± 0.0
1.065TrpAsn: 1.065 ± 0.614
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.355TrpArg: 0.355 ± 0.315
1.065TrpSer: 1.065 ± 0.444
0.355TrpThr: 0.355 ± 0.374
0.0TrpVal: 0.0 ± 0.0
0.71TrpTrp: 0.71 ± 0.393
0.355TrpTyr: 0.355 ± 0.41
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.129TyrAla: 2.129 ± 1.093
0.71TyrCys: 0.71 ± 0.411
0.355TyrAsp: 0.355 ± 0.489
3.194TyrGlu: 3.194 ± 1.009
2.129TyrPhe: 2.129 ± 0.927
0.71TyrGly: 0.71 ± 0.396
1.065TyrHis: 1.065 ± 0.619
2.839TyrIle: 2.839 ± 0.895
4.258TyrLys: 4.258 ± 1.009
7.807TyrLeu: 7.807 ± 1.428
1.065TyrMet: 1.065 ± 0.546
2.839TyrAsn: 2.839 ± 0.933
0.71TyrPro: 0.71 ± 0.482
1.065TyrGln: 1.065 ± 0.517
2.129TyrArg: 2.129 ± 0.91
5.323TyrSer: 5.323 ± 1.296
1.419TyrThr: 1.419 ± 0.558
1.065TyrVal: 1.065 ± 0.538
0.71TyrTrp: 0.71 ± 0.393
0.355TyrTyr: 0.355 ± 0.295
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (2819 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski