Amino acid dipepetide frequency for Streptococcus phage Javan88

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.044AlaAla: 6.044 ± 1.738
0.58AlaCys: 0.58 ± 0.226
3.726AlaAsp: 3.726 ± 0.706
5.796AlaGlu: 5.796 ± 0.666
2.981AlaPhe: 2.981 ± 0.917
6.706AlaGly: 6.706 ± 1.34
0.745AlaHis: 0.745 ± 0.253
5.382AlaIle: 5.382 ± 0.889
6.872AlaLys: 6.872 ± 0.538
6.872AlaLeu: 6.872 ± 1.383
3.063AlaMet: 3.063 ± 0.84
3.56AlaAsn: 3.56 ± 0.592
1.904AlaPro: 1.904 ± 0.42
2.898AlaGln: 2.898 ± 0.762
1.987AlaArg: 1.987 ± 0.375
4.802AlaSer: 4.802 ± 1.374
4.637AlaThr: 4.637 ± 0.752
6.044AlaVal: 6.044 ± 1.256
0.497AlaTrp: 0.497 ± 0.184
2.981AlaTyr: 2.981 ± 0.547
0.0AlaXaa: 0.0 ± 0.0
Cys
0.166CysAla: 0.166 ± 0.113
0.083CysCys: 0.083 ± 0.081
0.414CysAsp: 0.414 ± 0.179
0.331CysGlu: 0.331 ± 0.154
0.331CysPhe: 0.331 ± 0.169
0.414CysGly: 0.414 ± 0.234
0.0CysHis: 0.0 ± 0.0
0.331CysIle: 0.331 ± 0.158
0.331CysLys: 0.331 ± 0.157
0.331CysLeu: 0.331 ± 0.166
0.083CysMet: 0.083 ± 0.078
0.331CysAsn: 0.331 ± 0.137
0.083CysPro: 0.083 ± 0.079
0.083CysGln: 0.083 ± 0.083
0.083CysArg: 0.083 ± 0.113
0.331CysSer: 0.331 ± 0.152
0.083CysThr: 0.083 ± 0.085
0.331CysVal: 0.331 ± 0.177
0.0CysTrp: 0.0 ± 0.0
0.166CysTyr: 0.166 ± 0.106
0.0CysXaa: 0.0 ± 0.0
Asp
4.14AspAla: 4.14 ± 0.573
0.248AspCys: 0.248 ± 0.143
4.802AspAsp: 4.802 ± 0.851
5.713AspGlu: 5.713 ± 1.141
3.063AspPhe: 3.063 ± 0.468
5.961AspGly: 5.961 ± 0.845
0.745AspHis: 0.745 ± 0.23
3.643AspIle: 3.643 ± 0.658
4.719AspLys: 4.719 ± 0.612
4.14AspLeu: 4.14 ± 0.77
1.656AspMet: 1.656 ± 0.356
3.56AspAsn: 3.56 ± 0.515
1.242AspPro: 1.242 ± 0.442
1.573AspGln: 1.573 ± 0.406
1.49AspArg: 1.49 ± 0.34
4.14AspSer: 4.14 ± 0.519
4.554AspThr: 4.554 ± 0.733
3.229AspVal: 3.229 ± 0.441
0.497AspTrp: 0.497 ± 0.282
3.643AspTyr: 3.643 ± 0.713
0.0AspXaa: 0.0 ± 0.0
Glu
4.223GluAla: 4.223 ± 0.631
0.083GluCys: 0.083 ± 0.093
3.312GluAsp: 3.312 ± 0.645
3.809GluGlu: 3.809 ± 0.703
2.567GluPhe: 2.567 ± 0.377
2.235GluGly: 2.235 ± 0.46
0.911GluHis: 0.911 ± 0.303
5.961GluIle: 5.961 ± 0.844
4.388GluLys: 4.388 ± 0.601
6.541GluLeu: 6.541 ± 1.061
2.235GluMet: 2.235 ± 0.388
4.223GluAsn: 4.223 ± 0.563
1.159GluPro: 1.159 ± 0.44
4.968GluGln: 4.968 ± 0.65
3.643GluArg: 3.643 ± 0.714
2.235GluSer: 2.235 ± 0.462
3.229GluThr: 3.229 ± 0.526
4.471GluVal: 4.471 ± 0.69
0.828GluTrp: 0.828 ± 0.249
2.898GluTyr: 2.898 ± 0.578
0.0GluXaa: 0.0 ± 0.0
Phe
2.484PheAla: 2.484 ± 0.537
0.166PheCys: 0.166 ± 0.13
5.133PheAsp: 5.133 ± 0.642
2.815PheGlu: 2.815 ± 0.603
0.994PhePhe: 0.994 ± 0.333
2.898PheGly: 2.898 ± 0.717
0.58PheHis: 0.58 ± 0.198
1.987PheIle: 1.987 ± 0.389
3.229PheLys: 3.229 ± 0.56
2.981PheLeu: 2.981 ± 0.484
0.745PheMet: 0.745 ± 0.206
2.732PheAsn: 2.732 ± 0.491
0.828PhePro: 0.828 ± 0.265
1.408PheGln: 1.408 ± 0.422
1.242PheArg: 1.242 ± 0.242
3.477PheSer: 3.477 ± 0.554
3.312PheThr: 3.312 ± 0.504
2.484PheVal: 2.484 ± 0.519
0.331PheTrp: 0.331 ± 0.133
1.49PheTyr: 1.49 ± 0.37
0.0PheXaa: 0.0 ± 0.0
Gly
5.216GlyAla: 5.216 ± 1.628
0.497GlyCys: 0.497 ± 0.199
3.395GlyAsp: 3.395 ± 0.469
2.981GlyGlu: 2.981 ± 0.501
3.312GlyPhe: 3.312 ± 0.733
4.802GlyGly: 4.802 ± 0.617
0.911GlyHis: 0.911 ± 0.29
6.375GlyIle: 6.375 ± 1.012
6.706GlyLys: 6.706 ± 0.732
6.292GlyLeu: 6.292 ± 0.966
2.07GlyMet: 2.07 ± 0.532
4.14GlyAsn: 4.14 ± 0.641
0.166GlyPro: 0.166 ± 0.103
2.484GlyGln: 2.484 ± 0.489
2.815GlyArg: 2.815 ± 0.529
4.14GlySer: 4.14 ± 0.918
4.968GlyThr: 4.968 ± 0.89
5.216GlyVal: 5.216 ± 0.991
0.828GlyTrp: 0.828 ± 0.239
2.318GlyTyr: 2.318 ± 0.522
0.0GlyXaa: 0.0 ± 0.0
His
0.745HisAla: 0.745 ± 0.343
0.166HisCys: 0.166 ± 0.118
0.828HisAsp: 0.828 ± 0.289
0.58HisGlu: 0.58 ± 0.222
0.745HisPhe: 0.745 ± 0.24
0.911HisGly: 0.911 ± 0.263
0.248HisHis: 0.248 ± 0.171
1.573HisIle: 1.573 ± 0.379
0.994HisLys: 0.994 ± 0.365
1.159HisLeu: 1.159 ± 0.286
0.248HisMet: 0.248 ± 0.187
0.911HisAsn: 0.911 ± 0.336
0.497HisPro: 0.497 ± 0.205
0.745HisGln: 0.745 ± 0.228
0.745HisArg: 0.745 ± 0.286
0.662HisSer: 0.662 ± 0.254
0.745HisThr: 0.745 ± 0.219
0.662HisVal: 0.662 ± 0.201
0.083HisTrp: 0.083 ± 0.089
0.662HisTyr: 0.662 ± 0.331
0.0HisXaa: 0.0 ± 0.0
Ile
5.299IleAla: 5.299 ± 0.85
0.083IleCys: 0.083 ± 0.084
4.968IleAsp: 4.968 ± 0.513
5.961IleGlu: 5.961 ± 1.021
2.318IlePhe: 2.318 ± 0.525
5.796IleGly: 5.796 ± 0.997
1.159IleHis: 1.159 ± 0.29
3.643IleIle: 3.643 ± 0.616
6.624IleLys: 6.624 ± 0.931
3.726IleLeu: 3.726 ± 0.516
1.325IleMet: 1.325 ± 0.307
4.14IleAsn: 4.14 ± 0.694
2.318IlePro: 2.318 ± 0.64
2.153IleGln: 2.153 ± 0.481
2.318IleArg: 2.318 ± 0.469
4.885IleSer: 4.885 ± 0.817
4.554IleThr: 4.554 ± 0.551
2.898IleVal: 2.898 ± 0.513
0.828IleTrp: 0.828 ± 0.291
2.815IleTyr: 2.815 ± 0.561
0.0IleXaa: 0.0 ± 0.0
Lys
7.783LysAla: 7.783 ± 0.947
0.662LysCys: 0.662 ± 0.266
3.809LysAsp: 3.809 ± 0.601
5.547LysGlu: 5.547 ± 0.95
2.732LysPhe: 2.732 ± 0.341
5.547LysGly: 5.547 ± 0.804
1.656LysHis: 1.656 ± 0.447
4.471LysIle: 4.471 ± 0.705
7.203LysLys: 7.203 ± 1.129
6.044LysLeu: 6.044 ± 0.862
2.732LysMet: 2.732 ± 0.58
4.388LysAsn: 4.388 ± 0.506
2.567LysPro: 2.567 ± 0.579
3.229LysGln: 3.229 ± 0.578
3.726LysArg: 3.726 ± 0.699
5.382LysSer: 5.382 ± 0.474
5.133LysThr: 5.133 ± 0.702
5.713LysVal: 5.713 ± 0.661
0.828LysTrp: 0.828 ± 0.267
1.656LysTyr: 1.656 ± 0.386
0.0LysXaa: 0.0 ± 0.0
Leu
6.624LeuAla: 6.624 ± 0.963
0.331LeuCys: 0.331 ± 0.178
6.21LeuAsp: 6.21 ± 0.879
6.21LeuGlu: 6.21 ± 1.106
2.567LeuPhe: 2.567 ± 0.487
5.547LeuGly: 5.547 ± 0.813
0.994LeuHis: 0.994 ± 0.333
3.891LeuIle: 3.891 ± 0.616
7.783LeuLys: 7.783 ± 1.013
5.464LeuLeu: 5.464 ± 0.731
1.076LeuMet: 1.076 ± 0.362
5.133LeuAsn: 5.133 ± 0.662
2.649LeuPro: 2.649 ± 0.513
3.229LeuGln: 3.229 ± 0.447
2.484LeuArg: 2.484 ± 0.514
6.706LeuSer: 6.706 ± 0.726
5.299LeuThr: 5.299 ± 0.602
4.388LeuVal: 4.388 ± 0.557
0.414LeuTrp: 0.414 ± 0.138
2.567LeuTyr: 2.567 ± 0.455
0.0LeuXaa: 0.0 ± 0.0
Met
2.153MetAla: 2.153 ± 0.555
0.083MetCys: 0.083 ± 0.081
0.994MetAsp: 0.994 ± 0.32
1.408MetGlu: 1.408 ± 0.339
1.159MetPhe: 1.159 ± 0.289
1.325MetGly: 1.325 ± 0.378
0.331MetHis: 0.331 ± 0.19
2.153MetIle: 2.153 ± 0.446
1.49MetLys: 1.49 ± 0.306
2.235MetLeu: 2.235 ± 0.392
0.745MetMet: 0.745 ± 0.282
1.242MetAsn: 1.242 ± 0.309
0.745MetPro: 0.745 ± 0.242
1.821MetGln: 1.821 ± 0.382
1.325MetArg: 1.325 ± 0.319
1.739MetSer: 1.739 ± 0.411
2.732MetThr: 2.732 ± 0.397
1.242MetVal: 1.242 ± 0.386
0.248MetTrp: 0.248 ± 0.124
0.911MetTyr: 0.911 ± 0.321
0.0MetXaa: 0.0 ± 0.0
Asn
4.223AsnAla: 4.223 ± 0.718
0.166AsnCys: 0.166 ± 0.117
4.057AsnAsp: 4.057 ± 0.663
3.395AsnGlu: 3.395 ± 0.673
1.904AsnPhe: 1.904 ± 0.398
4.554AsnGly: 4.554 ± 0.909
0.745AsnHis: 0.745 ± 0.257
3.312AsnIle: 3.312 ± 0.471
4.554AsnLys: 4.554 ± 0.595
5.051AsnLeu: 5.051 ± 0.696
1.242AsnMet: 1.242 ± 0.339
5.051AsnAsn: 5.051 ± 0.838
2.484AsnPro: 2.484 ± 0.551
2.732AsnGln: 2.732 ± 0.59
1.573AsnArg: 1.573 ± 0.336
3.643AsnSer: 3.643 ± 0.754
3.477AsnThr: 3.477 ± 0.567
2.981AsnVal: 2.981 ± 0.534
1.076AsnTrp: 1.076 ± 0.348
1.904AsnTyr: 1.904 ± 0.429
0.0AsnXaa: 0.0 ± 0.0
Pro
2.235ProAla: 2.235 ± 0.33
0.083ProCys: 0.083 ± 0.081
1.821ProAsp: 1.821 ± 0.506
1.656ProGlu: 1.656 ± 0.455
1.49ProPhe: 1.49 ± 0.344
1.076ProGly: 1.076 ± 0.364
0.745ProHis: 0.745 ± 0.259
1.904ProIle: 1.904 ± 0.462
1.325ProLys: 1.325 ± 0.385
2.153ProLeu: 2.153 ± 0.386
0.58ProMet: 0.58 ± 0.216
1.408ProAsn: 1.408 ± 0.422
0.58ProPro: 0.58 ± 0.199
0.994ProGln: 0.994 ± 0.293
0.745ProArg: 0.745 ± 0.3
2.318ProSer: 2.318 ± 0.406
2.07ProThr: 2.07 ± 0.43
2.318ProVal: 2.318 ± 0.444
0.248ProTrp: 0.248 ± 0.146
1.656ProTyr: 1.656 ± 0.368
0.0ProXaa: 0.0 ± 0.0
Gln
3.974GlnAla: 3.974 ± 0.693
0.166GlnCys: 0.166 ± 0.106
1.408GlnAsp: 1.408 ± 0.366
2.484GlnGlu: 2.484 ± 0.465
1.242GlnPhe: 1.242 ± 0.331
4.14GlnGly: 4.14 ± 0.856
0.414GlnHis: 0.414 ± 0.165
2.567GlnIle: 2.567 ± 0.532
3.229GlnLys: 3.229 ± 0.472
4.305GlnLeu: 4.305 ± 0.579
1.325GlnMet: 1.325 ± 0.337
2.401GlnAsn: 2.401 ± 0.571
1.159GlnPro: 1.159 ± 0.223
2.484GlnGln: 2.484 ± 0.618
1.076GlnArg: 1.076 ± 0.25
3.809GlnSer: 3.809 ± 0.548
1.904GlnThr: 1.904 ± 0.371
2.898GlnVal: 2.898 ± 0.529
0.662GlnTrp: 0.662 ± 0.163
1.904GlnTyr: 1.904 ± 0.392
0.0GlnXaa: 0.0 ± 0.0
Arg
2.401ArgAla: 2.401 ± 0.586
0.0ArgCys: 0.0 ± 0.0
1.656ArgAsp: 1.656 ± 0.297
1.242ArgGlu: 1.242 ± 0.348
1.987ArgPhe: 1.987 ± 0.391
2.318ArgGly: 2.318 ± 0.427
0.497ArgHis: 0.497 ± 0.218
1.821ArgIle: 1.821 ± 0.315
3.726ArgLys: 3.726 ± 0.695
3.809ArgLeu: 3.809 ± 0.736
0.745ArgMet: 0.745 ± 0.23
2.07ArgAsn: 2.07 ± 0.328
1.325ArgPro: 1.325 ± 0.391
1.49ArgGln: 1.49 ± 0.316
1.325ArgArg: 1.325 ± 0.28
1.739ArgSer: 1.739 ± 0.431
2.318ArgThr: 2.318 ± 0.564
2.484ArgVal: 2.484 ± 0.406
0.662ArgTrp: 0.662 ± 0.319
1.656ArgTyr: 1.656 ± 0.374
0.0ArgXaa: 0.0 ± 0.0
Ser
5.961SerAla: 5.961 ± 1.377
0.248SerCys: 0.248 ± 0.134
4.719SerAsp: 4.719 ± 0.623
3.395SerGlu: 3.395 ± 0.557
3.395SerPhe: 3.395 ± 0.699
4.471SerGly: 4.471 ± 1.304
0.662SerHis: 0.662 ± 0.217
3.974SerIle: 3.974 ± 0.57
3.395SerLys: 3.395 ± 0.547
5.051SerLeu: 5.051 ± 0.57
1.573SerMet: 1.573 ± 0.304
4.057SerAsn: 4.057 ± 0.714
2.235SerPro: 2.235 ± 0.566
3.809SerGln: 3.809 ± 0.769
1.821SerArg: 1.821 ± 0.431
5.382SerSer: 5.382 ± 1.123
4.968SerThr: 4.968 ± 0.599
3.643SerVal: 3.643 ± 0.549
0.497SerTrp: 0.497 ± 0.176
2.484SerTyr: 2.484 ± 0.521
0.0SerXaa: 0.0 ± 0.0
Thr
4.802ThrAla: 4.802 ± 0.882
0.083ThrCys: 0.083 ± 0.081
4.305ThrAsp: 4.305 ± 0.725
4.14ThrGlu: 4.14 ± 0.72
2.981ThrPhe: 2.981 ± 0.461
4.223ThrGly: 4.223 ± 0.644
0.911ThrHis: 0.911 ± 0.267
6.21ThrIle: 6.21 ± 0.76
5.961ThrLys: 5.961 ± 0.786
5.464ThrLeu: 5.464 ± 0.732
1.739ThrMet: 1.739 ± 0.369
2.401ThrAsn: 2.401 ± 0.572
2.567ThrPro: 2.567 ± 0.474
3.229ThrGln: 3.229 ± 0.563
1.904ThrArg: 1.904 ± 0.34
3.063ThrSer: 3.063 ± 0.72
4.968ThrThr: 4.968 ± 0.647
5.547ThrVal: 5.547 ± 0.61
0.662ThrTrp: 0.662 ± 0.215
2.484ThrTyr: 2.484 ± 0.482
0.0ThrXaa: 0.0 ± 0.0
Val
6.541ValAla: 6.541 ± 1.485
0.248ValCys: 0.248 ± 0.138
4.223ValAsp: 4.223 ± 0.578
4.471ValGlu: 4.471 ± 0.667
3.146ValPhe: 3.146 ± 0.467
4.057ValGly: 4.057 ± 0.802
0.662ValHis: 0.662 ± 0.226
4.968ValIle: 4.968 ± 0.47
3.891ValLys: 3.891 ± 0.568
3.726ValLeu: 3.726 ± 0.636
1.49ValMet: 1.49 ± 0.363
3.063ValAsn: 3.063 ± 0.562
1.49ValPro: 1.49 ± 0.381
1.821ValGln: 1.821 ± 0.348
2.318ValArg: 2.318 ± 0.485
4.388ValSer: 4.388 ± 0.524
5.051ValThr: 5.051 ± 0.686
4.885ValVal: 4.885 ± 0.748
0.828ValTrp: 0.828 ± 0.306
2.815ValTyr: 2.815 ± 0.486
0.0ValXaa: 0.0 ± 0.0
Trp
0.745TrpAla: 0.745 ± 0.228
0.0TrpCys: 0.0 ± 0.0
0.166TrpAsp: 0.166 ± 0.117
0.58TrpGlu: 0.58 ± 0.194
0.58TrpPhe: 0.58 ± 0.186
0.745TrpGly: 0.745 ± 0.332
0.166TrpHis: 0.166 ± 0.128
0.662TrpIle: 0.662 ± 0.193
1.325TrpLys: 1.325 ± 0.375
0.58TrpLeu: 0.58 ± 0.285
0.331TrpMet: 0.331 ± 0.191
0.58TrpAsn: 0.58 ± 0.167
0.0TrpPro: 0.0 ± 0.0
0.497TrpGln: 0.497 ± 0.235
0.58TrpArg: 0.58 ± 0.216
0.745TrpSer: 0.745 ± 0.282
0.828TrpThr: 0.828 ± 0.367
0.414TrpVal: 0.414 ± 0.168
0.331TrpTrp: 0.331 ± 0.2
0.745TrpTyr: 0.745 ± 0.253
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.318TyrAla: 2.318 ± 0.558
0.331TyrCys: 0.331 ± 0.2
2.815TyrAsp: 2.815 ± 0.569
1.408TyrGlu: 1.408 ± 0.327
1.821TyrPhe: 1.821 ± 0.56
1.904TyrGly: 1.904 ± 0.432
0.828TyrHis: 0.828 ± 0.299
2.981TyrIle: 2.981 ± 0.564
3.063TyrLys: 3.063 ± 0.494
3.643TyrLeu: 3.643 ± 0.688
0.994TyrMet: 0.994 ± 0.265
2.732TyrAsn: 2.732 ± 0.522
1.408TyrPro: 1.408 ± 0.325
1.987TyrGln: 1.987 ± 0.412
2.07TyrArg: 2.07 ± 0.403
2.153TyrSer: 2.153 ± 0.438
2.815TyrThr: 2.815 ± 0.468
2.235TyrVal: 2.235 ± 0.469
0.248TyrTrp: 0.248 ± 0.146
1.656TyrTyr: 1.656 ± 0.36
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (12079 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski