Amino acid dipepetide frequency for Streptococcus satellite phage Javan634

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.293AlaAla: 0.293 ± 0.261
0.293AlaCys: 0.293 ± 0.249
3.511AlaAsp: 3.511 ± 0.889
3.511AlaGlu: 3.511 ± 0.717
1.755AlaPhe: 1.755 ± 0.804
1.463AlaGly: 1.463 ± 0.451
0.585AlaHis: 0.585 ± 0.359
4.389AlaIle: 4.389 ± 1.0
4.096AlaLys: 4.096 ± 1.25
5.266AlaLeu: 5.266 ± 0.922
1.755AlaMet: 1.755 ± 1.092
2.926AlaAsn: 2.926 ± 0.807
1.463AlaPro: 1.463 ± 0.763
2.341AlaGln: 2.341 ± 0.756
2.048AlaArg: 2.048 ± 0.559
3.803AlaSer: 3.803 ± 1.178
2.633AlaThr: 2.633 ± 0.924
2.048AlaVal: 2.048 ± 0.613
0.585AlaTrp: 0.585 ± 0.344
1.17AlaTyr: 1.17 ± 0.496
0.0AlaXaa: 0.0 ± 0.0
Cys
0.293CysAla: 0.293 ± 0.298
0.293CysCys: 0.293 ± 0.316
0.293CysAsp: 0.293 ± 0.241
0.293CysGlu: 0.293 ± 0.288
0.878CysPhe: 0.878 ± 0.557
0.878CysGly: 0.878 ± 0.674
0.293CysHis: 0.293 ± 0.249
1.17CysIle: 1.17 ± 0.61
0.0CysLys: 0.0 ± 0.0
0.585CysLeu: 0.585 ± 0.372
0.0CysMet: 0.0 ± 0.0
0.293CysAsn: 0.293 ± 0.249
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.878CysArg: 0.878 ± 0.517
0.585CysSer: 0.585 ± 0.353
0.293CysThr: 0.293 ± 0.316
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.585AspAla: 0.585 ± 0.464
2.048AspCys: 2.048 ± 1.051
3.803AspAsp: 3.803 ± 1.053
5.266AspGlu: 5.266 ± 1.491
4.974AspPhe: 4.974 ± 1.378
1.17AspGly: 1.17 ± 0.603
1.17AspHis: 1.17 ± 0.77
8.192AspIle: 8.192 ± 1.116
5.266AspLys: 5.266 ± 1.097
7.022AspLeu: 7.022 ± 1.186
2.048AspMet: 2.048 ± 0.774
2.048AspAsn: 2.048 ± 0.717
2.048AspPro: 2.048 ± 0.624
1.755AspGln: 1.755 ± 0.525
2.633AspArg: 2.633 ± 0.823
2.926AspSer: 2.926 ± 0.677
3.218AspThr: 3.218 ± 0.787
1.17AspVal: 1.17 ± 0.573
0.585AspTrp: 0.585 ± 0.404
6.729AspTyr: 6.729 ± 1.052
0.0AspXaa: 0.0 ± 0.0
Glu
4.974GluAla: 4.974 ± 1.194
0.0GluCys: 0.0 ± 0.0
5.266GluAsp: 5.266 ± 1.239
4.681GluGlu: 4.681 ± 1.445
3.218GluPhe: 3.218 ± 1.106
3.218GluGly: 3.218 ± 0.941
2.048GluHis: 2.048 ± 0.831
7.022GluIle: 7.022 ± 1.496
8.192GluLys: 8.192 ± 1.237
10.24GluLeu: 10.24 ± 2.005
2.048GluMet: 2.048 ± 0.824
4.389GluAsn: 4.389 ± 1.15
2.633GluPro: 2.633 ± 0.78
3.218GluGln: 3.218 ± 1.205
4.096GluArg: 4.096 ± 1.067
1.755GluSer: 1.755 ± 0.627
3.803GluThr: 3.803 ± 0.951
2.926GluVal: 2.926 ± 0.964
1.17GluTrp: 1.17 ± 0.609
2.048GluTyr: 2.048 ± 0.537
0.0GluXaa: 0.0 ± 0.0
Phe
0.585PheAla: 0.585 ± 0.434
0.0PheCys: 0.0 ± 0.0
3.218PheAsp: 3.218 ± 1.146
4.681PheGlu: 4.681 ± 1.262
2.048PhePhe: 2.048 ± 1.045
2.048PheGly: 2.048 ± 0.571
1.17PheHis: 1.17 ± 0.63
3.511PheIle: 3.511 ± 0.963
3.218PheLys: 3.218 ± 0.969
3.218PheLeu: 3.218 ± 0.966
0.293PheMet: 0.293 ± 0.311
3.511PheAsn: 3.511 ± 1.042
1.17PhePro: 1.17 ± 0.565
2.048PheGln: 2.048 ± 0.637
0.878PheArg: 0.878 ± 0.488
2.048PheSer: 2.048 ± 0.98
1.17PheThr: 1.17 ± 0.75
2.341PheVal: 2.341 ± 0.799
0.0PheTrp: 0.0 ± 0.0
0.878PheTyr: 0.878 ± 0.515
0.0PheXaa: 0.0 ± 0.0
Gly
2.048GlyAla: 2.048 ± 0.841
0.293GlyCys: 0.293 ± 0.282
2.926GlyAsp: 2.926 ± 0.61
1.17GlyGlu: 1.17 ± 0.421
2.633GlyPhe: 2.633 ± 0.691
2.633GlyGly: 2.633 ± 0.968
1.463GlyHis: 1.463 ± 0.561
4.096GlyIle: 4.096 ± 1.135
4.681GlyLys: 4.681 ± 1.08
3.511GlyLeu: 3.511 ± 1.522
0.293GlyMet: 0.293 ± 0.241
2.633GlyAsn: 2.633 ± 0.863
0.293GlyPro: 0.293 ± 0.345
1.463GlyGln: 1.463 ± 0.621
2.633GlyArg: 2.633 ± 0.909
2.633GlySer: 2.633 ± 0.848
2.926GlyThr: 2.926 ± 0.838
1.755GlyVal: 1.755 ± 0.911
0.878GlyTrp: 0.878 ± 0.748
2.341GlyTyr: 2.341 ± 0.75
0.0GlyXaa: 0.0 ± 0.0
His
1.755HisAla: 1.755 ± 0.509
0.0HisCys: 0.0 ± 0.0
1.463HisAsp: 1.463 ± 0.705
1.755HisGlu: 1.755 ± 0.706
0.878HisPhe: 0.878 ± 0.578
0.878HisGly: 0.878 ± 0.434
0.585HisHis: 0.585 ± 0.39
3.511HisIle: 3.511 ± 1.07
2.048HisLys: 2.048 ± 0.694
2.341HisLeu: 2.341 ± 0.544
0.0HisMet: 0.0 ± 0.0
0.878HisAsn: 0.878 ± 0.527
0.878HisPro: 0.878 ± 0.443
0.585HisGln: 0.585 ± 0.335
0.585HisArg: 0.585 ± 0.387
1.463HisSer: 1.463 ± 0.634
1.17HisThr: 1.17 ± 0.627
1.17HisVal: 1.17 ± 0.997
0.0HisTrp: 0.0 ± 0.0
0.585HisTyr: 0.585 ± 0.347
0.0HisXaa: 0.0 ± 0.0
Ile
5.266IleAla: 5.266 ± 1.335
0.878IleCys: 0.878 ± 0.588
7.899IleAsp: 7.899 ± 1.462
7.314IleGlu: 7.314 ± 1.806
2.341IlePhe: 2.341 ± 0.686
3.218IleGly: 3.218 ± 0.845
1.755IleHis: 1.755 ± 0.977
6.729IleIle: 6.729 ± 1.654
8.777IleLys: 8.777 ± 2.153
8.192IleLeu: 8.192 ± 1.3
1.17IleMet: 1.17 ± 0.454
5.851IleAsn: 5.851 ± 1.464
2.633IlePro: 2.633 ± 0.841
2.926IleGln: 2.926 ± 0.822
2.341IleArg: 2.341 ± 0.68
6.144IleSer: 6.144 ± 1.324
7.314IleThr: 7.314 ± 0.996
3.218IleVal: 3.218 ± 1.394
0.585IleTrp: 0.585 ± 0.402
3.218IleTyr: 3.218 ± 0.666
0.0IleXaa: 0.0 ± 0.0
Lys
5.851LysAla: 5.851 ± 1.666
0.585LysCys: 0.585 ± 0.632
4.389LysAsp: 4.389 ± 1.048
8.484LysGlu: 8.484 ± 2.043
2.341LysPhe: 2.341 ± 0.733
4.096LysGly: 4.096 ± 0.894
3.803LysHis: 3.803 ± 0.886
7.314LysIle: 7.314 ± 1.534
5.851LysLys: 5.851 ± 1.336
7.607LysLeu: 7.607 ± 1.867
1.755LysMet: 1.755 ± 0.581
6.729LysAsn: 6.729 ± 1.227
4.681LysPro: 4.681 ± 1.27
3.218LysGln: 3.218 ± 1.034
6.144LysArg: 6.144 ± 1.27
4.681LysSer: 4.681 ± 0.995
4.681LysThr: 4.681 ± 1.069
4.974LysVal: 4.974 ± 1.16
0.878LysTrp: 0.878 ± 0.447
4.974LysTyr: 4.974 ± 1.17
0.0LysXaa: 0.0 ± 0.0
Leu
5.266LeuAla: 5.266 ± 1.246
0.878LeuCys: 0.878 ± 0.748
9.362LeuAsp: 9.362 ± 1.524
9.07LeuGlu: 9.07 ± 2.018
3.218LeuPhe: 3.218 ± 0.928
3.511LeuGly: 3.511 ± 1.036
0.293LeuHis: 0.293 ± 0.282
7.022LeuIle: 7.022 ± 1.156
9.947LeuLys: 9.947 ± 1.636
13.166LeuLeu: 13.166 ± 2.537
2.048LeuMet: 2.048 ± 0.67
7.022LeuAsn: 7.022 ± 1.172
4.096LeuPro: 4.096 ± 1.066
3.218LeuGln: 3.218 ± 0.88
3.803LeuArg: 3.803 ± 1.371
9.07LeuSer: 9.07 ± 1.544
4.389LeuThr: 4.389 ± 1.031
1.755LeuVal: 1.755 ± 1.0
1.17LeuTrp: 1.17 ± 0.547
5.266LeuTyr: 5.266 ± 0.933
0.0LeuXaa: 0.0 ± 0.0
Met
1.463MetAla: 1.463 ± 0.684
0.0MetCys: 0.0 ± 0.0
1.755MetAsp: 1.755 ± 0.55
0.878MetGlu: 0.878 ± 0.568
0.0MetPhe: 0.0 ± 0.0
0.878MetGly: 0.878 ± 0.404
0.293MetHis: 0.293 ± 0.311
0.878MetIle: 0.878 ± 0.498
3.218MetLys: 3.218 ± 0.83
1.463MetLeu: 1.463 ± 0.688
0.293MetMet: 0.293 ± 0.311
2.048MetAsn: 2.048 ± 0.681
0.878MetPro: 0.878 ± 0.363
0.878MetGln: 0.878 ± 0.63
2.633MetArg: 2.633 ± 1.132
0.585MetSer: 0.585 ± 0.402
1.755MetThr: 1.755 ± 0.797
1.17MetVal: 1.17 ± 0.486
0.0MetTrp: 0.0 ± 0.0
0.878MetTyr: 0.878 ± 0.487
0.0MetXaa: 0.0 ± 0.0
Asn
3.511AsnAla: 3.511 ± 0.926
0.585AsnCys: 0.585 ± 0.432
3.218AsnAsp: 3.218 ± 0.843
6.144AsnGlu: 6.144 ± 0.775
2.341AsnPhe: 2.341 ± 0.862
4.096AsnGly: 4.096 ± 1.612
2.341AsnHis: 2.341 ± 0.737
5.559AsnIle: 5.559 ± 1.019
4.096AsnLys: 4.096 ± 1.274
5.851AsnLeu: 5.851 ± 1.165
2.926AsnMet: 2.926 ± 0.88
4.096AsnAsn: 4.096 ± 1.133
2.048AsnPro: 2.048 ± 0.85
2.048AsnGln: 2.048 ± 0.972
4.389AsnArg: 4.389 ± 1.175
3.218AsnSer: 3.218 ± 0.998
2.048AsnThr: 2.048 ± 0.619
1.463AsnVal: 1.463 ± 0.646
0.585AsnTrp: 0.585 ± 0.4
3.218AsnTyr: 3.218 ± 0.745
0.0AsnXaa: 0.0 ± 0.0
Pro
0.585ProAla: 0.585 ± 0.365
0.0ProCys: 0.0 ± 0.0
2.341ProAsp: 2.341 ± 0.9
4.681ProGlu: 4.681 ± 1.122
0.878ProPhe: 0.878 ± 0.379
0.585ProGly: 0.585 ± 0.521
0.585ProHis: 0.585 ± 0.396
2.341ProIle: 2.341 ± 0.662
3.803ProLys: 3.803 ± 0.98
3.803ProLeu: 3.803 ± 0.966
0.293ProMet: 0.293 ± 0.249
3.803ProAsn: 3.803 ± 0.986
0.585ProPro: 0.585 ± 0.36
0.293ProGln: 0.293 ± 0.249
2.048ProArg: 2.048 ± 0.736
1.463ProSer: 1.463 ± 0.581
1.463ProThr: 1.463 ± 0.796
0.878ProVal: 0.878 ± 0.363
0.0ProTrp: 0.0 ± 0.0
3.218ProTyr: 3.218 ± 0.74
0.0ProXaa: 0.0 ± 0.0
Gln
2.341GlnAla: 2.341 ± 0.624
0.0GlnCys: 0.0 ± 0.0
0.878GlnAsp: 0.878 ± 0.41
3.218GlnGlu: 3.218 ± 0.907
1.463GlnPhe: 1.463 ± 0.491
0.878GlnGly: 0.878 ± 0.467
0.293GlnHis: 0.293 ± 0.288
2.633GlnIle: 2.633 ± 0.584
4.096GlnLys: 4.096 ± 1.218
3.218GlnLeu: 3.218 ± 0.994
0.585GlnMet: 0.585 ± 0.448
1.17GlnAsn: 1.17 ± 0.616
0.293GlnPro: 0.293 ± 0.249
1.755GlnGln: 1.755 ± 0.896
2.633GlnArg: 2.633 ± 1.026
2.926GlnSer: 2.926 ± 0.834
2.633GlnThr: 2.633 ± 0.647
1.755GlnVal: 1.755 ± 0.647
0.293GlnTrp: 0.293 ± 0.311
1.463GlnTyr: 1.463 ± 0.556
0.0GlnXaa: 0.0 ± 0.0
Arg
3.803ArgAla: 3.803 ± 0.793
0.293ArgCys: 0.293 ± 0.313
2.926ArgAsp: 2.926 ± 0.698
2.633ArgGlu: 2.633 ± 0.684
2.341ArgPhe: 2.341 ± 0.741
2.926ArgGly: 2.926 ± 1.065
0.878ArgHis: 0.878 ± 0.663
4.096ArgIle: 4.096 ± 1.101
5.266ArgLys: 5.266 ± 1.163
6.437ArgLeu: 6.437 ± 1.2
0.585ArgMet: 0.585 ± 0.394
1.755ArgAsn: 1.755 ± 0.582
1.755ArgPro: 1.755 ± 0.583
1.463ArgGln: 1.463 ± 0.673
2.633ArgArg: 2.633 ± 0.932
0.878ArgSer: 0.878 ± 0.38
5.266ArgThr: 5.266 ± 0.918
2.341ArgVal: 2.341 ± 0.832
0.585ArgTrp: 0.585 ± 0.474
2.048ArgTyr: 2.048 ± 0.73
0.0ArgXaa: 0.0 ± 0.0
Ser
1.755SerAla: 1.755 ± 0.806
0.0SerCys: 0.0 ± 0.0
3.218SerAsp: 3.218 ± 0.981
2.926SerGlu: 2.926 ± 0.805
2.048SerPhe: 2.048 ± 0.928
3.218SerGly: 3.218 ± 0.74
1.17SerHis: 1.17 ± 0.534
4.974SerIle: 4.974 ± 1.157
5.266SerLys: 5.266 ± 1.108
5.559SerLeu: 5.559 ± 1.403
1.755SerMet: 1.755 ± 0.853
3.803SerAsn: 3.803 ± 1.091
2.048SerPro: 2.048 ± 0.576
1.463SerGln: 1.463 ± 0.628
3.218SerArg: 3.218 ± 0.672
2.341SerSer: 2.341 ± 0.602
3.218SerThr: 3.218 ± 0.948
2.633SerVal: 2.633 ± 0.775
0.293SerTrp: 0.293 ± 0.319
4.096SerTyr: 4.096 ± 1.513
0.0SerXaa: 0.0 ± 0.0
Thr
2.341ThrAla: 2.341 ± 0.679
0.0ThrCys: 0.0 ± 0.0
2.048ThrAsp: 2.048 ± 0.84
4.096ThrGlu: 4.096 ± 0.692
2.633ThrPhe: 2.633 ± 0.827
3.803ThrGly: 3.803 ± 0.972
1.17ThrHis: 1.17 ± 0.652
4.974ThrIle: 4.974 ± 1.048
4.974ThrLys: 4.974 ± 1.33
7.899ThrLeu: 7.899 ± 1.234
1.17ThrMet: 1.17 ± 0.448
4.389ThrAsn: 4.389 ± 1.148
2.048ThrPro: 2.048 ± 0.54
2.341ThrGln: 2.341 ± 0.698
2.633ThrArg: 2.633 ± 0.861
2.633ThrSer: 2.633 ± 0.973
3.803ThrThr: 3.803 ± 0.939
3.218ThrVal: 3.218 ± 0.917
1.463ThrTrp: 1.463 ± 0.708
1.755ThrTyr: 1.755 ± 1.035
0.0ThrXaa: 0.0 ± 0.0
Val
1.755ValAla: 1.755 ± 0.558
0.585ValCys: 0.585 ± 0.353
1.755ValAsp: 1.755 ± 0.994
0.878ValGlu: 0.878 ± 0.508
0.878ValPhe: 0.878 ± 0.408
1.463ValGly: 1.463 ± 0.636
0.878ValHis: 0.878 ± 0.379
3.803ValIle: 3.803 ± 0.859
4.389ValLys: 4.389 ± 0.953
2.633ValLeu: 2.633 ± 0.817
1.463ValMet: 1.463 ± 0.563
2.926ValAsn: 2.926 ± 1.096
1.463ValPro: 1.463 ± 0.633
0.878ValGln: 0.878 ± 0.418
1.463ValArg: 1.463 ± 0.692
2.633ValSer: 2.633 ± 0.709
4.974ValThr: 4.974 ± 1.059
2.048ValVal: 2.048 ± 0.96
0.293ValTrp: 0.293 ± 0.288
1.755ValTyr: 1.755 ± 0.582
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.17TrpAsp: 1.17 ± 0.57
1.17TrpGlu: 1.17 ± 0.606
0.585TrpPhe: 0.585 ± 0.371
0.585TrpGly: 0.585 ± 0.368
0.0TrpHis: 0.0 ± 0.0
0.878TrpIle: 0.878 ± 0.411
0.585TrpLys: 0.585 ± 0.344
1.17TrpLeu: 1.17 ± 0.538
0.0TrpMet: 0.0 ± 0.0
0.878TrpAsn: 0.878 ± 0.496
0.585TrpPro: 0.585 ± 0.388
0.585TrpGln: 0.585 ± 0.41
0.585TrpArg: 0.585 ± 0.351
0.585TrpSer: 0.585 ± 0.368
0.293TrpThr: 0.293 ± 0.345
0.293TrpVal: 0.293 ± 0.319
0.293TrpTrp: 0.293 ± 0.282
0.293TrpTyr: 0.293 ± 0.249
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.463TyrAla: 1.463 ± 0.65
0.293TyrCys: 0.293 ± 0.241
3.218TyrAsp: 3.218 ± 0.831
4.096TyrGlu: 4.096 ± 1.312
0.585TyrPhe: 0.585 ± 0.4
1.755TyrGly: 1.755 ± 0.552
1.755TyrHis: 1.755 ± 0.586
4.681TyrIle: 4.681 ± 1.103
5.559TyrLys: 5.559 ± 1.42
4.389TyrLeu: 4.389 ± 1.182
1.17TyrMet: 1.17 ± 0.415
2.633TyrAsn: 2.633 ± 0.569
2.048TyrPro: 2.048 ± 0.653
2.048TyrGln: 2.048 ± 0.772
2.633TyrArg: 2.633 ± 0.947
2.633TyrSer: 2.633 ± 0.617
2.341TyrThr: 2.341 ± 0.766
1.755TyrVal: 1.755 ± 0.661
0.878TyrTrp: 0.878 ± 0.692
2.633TyrTyr: 2.633 ± 0.845
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (3419 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski