Amino acid dipepetide frequency for Staphylococcus phage phiSa2wa_st59

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.129AlaAla: 2.129 ± 0.473
0.076AlaCys: 0.076 ± 0.082
2.737AlaAsp: 2.737 ± 0.423
4.182AlaGlu: 4.182 ± 0.684
2.433AlaPhe: 2.433 ± 0.576
3.802AlaGly: 3.802 ± 0.746
0.912AlaHis: 0.912 ± 0.329
4.106AlaIle: 4.106 ± 0.767
4.486AlaLys: 4.486 ± 0.561
4.791AlaLeu: 4.791 ± 0.582
1.673AlaMet: 1.673 ± 0.376
4.334AlaAsn: 4.334 ± 0.545
1.445AlaPro: 1.445 ± 0.365
2.357AlaGln: 2.357 ± 0.557
2.509AlaArg: 2.509 ± 0.48
2.89AlaSer: 2.89 ± 0.545
3.27AlaThr: 3.27 ± 0.495
2.813AlaVal: 2.813 ± 0.536
0.228AlaTrp: 0.228 ± 0.135
2.966AlaTyr: 2.966 ± 0.524
0.0AlaXaa: 0.0 ± 0.0
Cys
0.304CysAla: 0.304 ± 0.179
0.0CysCys: 0.0 ± 0.0
0.152CysAsp: 0.152 ± 0.114
0.532CysGlu: 0.532 ± 0.306
0.152CysPhe: 0.152 ± 0.116
0.228CysGly: 0.228 ± 0.142
0.076CysHis: 0.076 ± 0.071
0.76CysIle: 0.76 ± 0.262
0.38CysLys: 0.38 ± 0.181
0.38CysLeu: 0.38 ± 0.184
0.0CysMet: 0.0 ± 0.0
0.228CysAsn: 0.228 ± 0.11
0.152CysPro: 0.152 ± 0.107
0.152CysGln: 0.152 ± 0.123
0.152CysArg: 0.152 ± 0.122
0.076CysSer: 0.076 ± 0.079
0.304CysThr: 0.304 ± 0.165
0.38CysVal: 0.38 ± 0.156
0.076CysTrp: 0.076 ± 0.087
0.228CysTyr: 0.228 ± 0.131
0.0CysXaa: 0.0 ± 0.0
Asp
3.27AspAla: 3.27 ± 0.448
0.532AspCys: 0.532 ± 0.221
3.954AspAsp: 3.954 ± 0.794
4.638AspGlu: 4.638 ± 0.757
3.118AspPhe: 3.118 ± 0.486
4.41AspGly: 4.41 ± 0.816
0.76AspHis: 0.76 ± 0.217
4.714AspIle: 4.714 ± 0.455
5.399AspLys: 5.399 ± 0.72
6.007AspLeu: 6.007 ± 0.75
1.977AspMet: 1.977 ± 0.341
4.258AspAsn: 4.258 ± 0.626
1.597AspPro: 1.597 ± 0.367
0.608AspGln: 0.608 ± 0.241
2.129AspArg: 2.129 ± 0.493
3.346AspSer: 3.346 ± 0.579
3.194AspThr: 3.194 ± 0.595
3.346AspVal: 3.346 ± 0.592
0.684AspTrp: 0.684 ± 0.216
3.194AspTyr: 3.194 ± 0.522
0.0AspXaa: 0.0 ± 0.0
Glu
3.954GluAla: 3.954 ± 0.619
0.76GluCys: 0.76 ± 0.224
3.878GluAsp: 3.878 ± 0.607
6.692GluGlu: 6.692 ± 1.172
3.194GluPhe: 3.194 ± 0.507
2.433GluGly: 2.433 ± 0.409
1.521GluHis: 1.521 ± 0.354
6.387GluIle: 6.387 ± 0.791
7.224GluLys: 7.224 ± 0.984
7.376GluLeu: 7.376 ± 0.807
2.89GluMet: 2.89 ± 0.578
5.171GluAsn: 5.171 ± 0.53
0.836GluPro: 0.836 ± 0.27
4.106GluGln: 4.106 ± 0.825
4.106GluArg: 4.106 ± 0.551
4.258GluSer: 4.258 ± 0.56
3.65GluThr: 3.65 ± 0.808
4.638GluVal: 4.638 ± 0.648
0.989GluTrp: 0.989 ± 0.317
4.258GluTyr: 4.258 ± 0.667
0.0GluXaa: 0.0 ± 0.0
Phe
2.661PheAla: 2.661 ± 0.51
0.076PheCys: 0.076 ± 0.077
2.357PheAsp: 2.357 ± 0.432
2.89PheGlu: 2.89 ± 0.501
1.597PhePhe: 1.597 ± 0.36
2.585PheGly: 2.585 ± 0.473
0.608PheHis: 0.608 ± 0.206
4.182PheIle: 4.182 ± 0.606
3.954PheLys: 3.954 ± 0.617
3.194PheLeu: 3.194 ± 0.55
1.521PheMet: 1.521 ± 0.368
3.802PheAsn: 3.802 ± 0.56
0.836PhePro: 0.836 ± 0.27
0.912PheGln: 0.912 ± 0.212
1.673PheArg: 1.673 ± 0.35
2.89PheSer: 2.89 ± 0.474
2.89PheThr: 2.89 ± 0.553
2.205PheVal: 2.205 ± 0.489
0.228PheTrp: 0.228 ± 0.151
1.369PheTyr: 1.369 ± 0.341
0.0PheXaa: 0.0 ± 0.0
Gly
2.661GlyAla: 2.661 ± 0.542
0.38GlyCys: 0.38 ± 0.191
3.574GlyAsp: 3.574 ± 0.587
3.574GlyGlu: 3.574 ± 0.634
2.661GlyPhe: 2.661 ± 0.505
4.638GlyGly: 4.638 ± 0.887
1.141GlyHis: 1.141 ± 0.401
3.194GlyIle: 3.194 ± 0.803
6.159GlyLys: 6.159 ± 0.984
4.562GlyLeu: 4.562 ± 0.9
0.989GlyMet: 0.989 ± 0.315
3.422GlyAsn: 3.422 ± 0.811
1.141GlyPro: 1.141 ± 0.263
2.585GlyGln: 2.585 ± 0.595
1.977GlyArg: 1.977 ± 0.398
3.27GlySer: 3.27 ± 0.546
2.661GlyThr: 2.661 ± 0.589
4.182GlyVal: 4.182 ± 0.698
0.912GlyTrp: 0.912 ± 0.264
2.89GlyTyr: 2.89 ± 0.534
0.0GlyXaa: 0.0 ± 0.0
His
1.293HisAla: 1.293 ± 0.362
0.0HisCys: 0.0 ± 0.0
0.912HisAsp: 0.912 ± 0.364
1.293HisGlu: 1.293 ± 0.396
1.141HisPhe: 1.141 ± 0.278
0.836HisGly: 0.836 ± 0.27
0.38HisHis: 0.38 ± 0.147
1.673HisIle: 1.673 ± 0.319
1.445HisLys: 1.445 ± 0.414
1.141HisLeu: 1.141 ± 0.286
0.0HisMet: 0.0 ± 0.0
1.217HisAsn: 1.217 ± 0.366
0.532HisPro: 0.532 ± 0.204
0.532HisGln: 0.532 ± 0.2
0.228HisArg: 0.228 ± 0.128
0.608HisSer: 0.608 ± 0.16
1.369HisThr: 1.369 ± 0.315
1.217HisVal: 1.217 ± 0.36
0.228HisTrp: 0.228 ± 0.146
0.989HisTyr: 0.989 ± 0.302
0.0HisXaa: 0.0 ± 0.0
Ile
5.551IleAla: 5.551 ± 0.567
0.228IleCys: 0.228 ± 0.126
5.171IleAsp: 5.171 ± 0.762
6.463IleGlu: 6.463 ± 0.685
3.954IlePhe: 3.954 ± 0.542
3.954IleGly: 3.954 ± 0.592
1.521IleHis: 1.521 ± 0.375
5.019IleIle: 5.019 ± 0.815
8.212IleLys: 8.212 ± 0.684
3.802IleLeu: 3.802 ± 0.525
1.369IleMet: 1.369 ± 0.309
6.235IleAsn: 6.235 ± 0.985
2.053IlePro: 2.053 ± 0.324
3.194IleGln: 3.194 ± 0.421
3.042IleArg: 3.042 ± 0.564
4.867IleSer: 4.867 ± 0.476
5.323IleThr: 5.323 ± 0.609
4.791IleVal: 4.791 ± 0.531
0.989IleTrp: 0.989 ± 0.434
2.585IleTyr: 2.585 ± 0.523
0.0IleXaa: 0.0 ± 0.0
Lys
5.931LysAla: 5.931 ± 0.797
0.228LysCys: 0.228 ± 0.112
6.539LysAsp: 6.539 ± 0.632
8.897LysGlu: 8.897 ± 1.036
3.194LysPhe: 3.194 ± 0.438
5.399LysGly: 5.399 ± 0.861
1.065LysHis: 1.065 ± 0.353
6.463LysIle: 6.463 ± 0.813
7.832LysLys: 7.832 ± 0.896
7.984LysLeu: 7.984 ± 0.919
3.194LysMet: 3.194 ± 0.636
6.615LysAsn: 6.615 ± 0.727
2.813LysPro: 2.813 ± 0.484
4.106LysGln: 4.106 ± 0.543
4.867LysArg: 4.867 ± 0.75
4.867LysSer: 4.867 ± 0.718
5.095LysThr: 5.095 ± 0.667
5.779LysVal: 5.779 ± 0.716
1.369LysTrp: 1.369 ± 0.31
4.638LysTyr: 4.638 ± 0.793
0.0LysXaa: 0.0 ± 0.0
Leu
3.726LeuAla: 3.726 ± 0.603
0.76LeuCys: 0.76 ± 0.209
5.095LeuAsp: 5.095 ± 0.663
6.539LeuGlu: 6.539 ± 0.819
2.966LeuPhe: 2.966 ± 0.405
3.574LeuGly: 3.574 ± 0.59
1.217LeuHis: 1.217 ± 0.278
6.007LeuIle: 6.007 ± 0.82
9.277LeuLys: 9.277 ± 0.913
6.387LeuLeu: 6.387 ± 0.957
2.053LeuMet: 2.053 ± 0.452
5.247LeuAsn: 5.247 ± 0.731
2.509LeuPro: 2.509 ± 0.495
2.661LeuGln: 2.661 ± 0.403
2.737LeuArg: 2.737 ± 0.718
5.475LeuSer: 5.475 ± 0.708
4.791LeuThr: 4.791 ± 0.842
3.726LeuVal: 3.726 ± 0.654
0.684LeuTrp: 0.684 ± 0.22
3.802LeuTyr: 3.802 ± 0.65
0.0LeuXaa: 0.0 ± 0.0
Met
0.989MetAla: 0.989 ± 0.332
0.076MetCys: 0.076 ± 0.075
1.521MetAsp: 1.521 ± 0.35
1.369MetGlu: 1.369 ± 0.324
1.065MetPhe: 1.065 ± 0.229
1.065MetGly: 1.065 ± 0.459
0.532MetHis: 0.532 ± 0.215
2.661MetIle: 2.661 ± 0.335
1.977MetLys: 1.977 ± 0.461
1.749MetLeu: 1.749 ± 0.381
0.76MetMet: 0.76 ± 0.256
2.509MetAsn: 2.509 ± 0.543
0.684MetPro: 0.684 ± 0.265
0.912MetGln: 0.912 ± 0.267
1.141MetArg: 1.141 ± 0.32
2.281MetSer: 2.281 ± 0.429
1.445MetThr: 1.445 ± 0.333
1.369MetVal: 1.369 ± 0.368
0.38MetTrp: 0.38 ± 0.16
0.76MetTyr: 0.76 ± 0.26
0.0MetXaa: 0.0 ± 0.0
Asn
3.346AsnAla: 3.346 ± 0.617
0.228AsnCys: 0.228 ± 0.132
4.03AsnAsp: 4.03 ± 0.533
5.627AsnGlu: 5.627 ± 0.827
2.053AsnPhe: 2.053 ± 0.57
5.019AsnGly: 5.019 ± 0.526
0.912AsnHis: 0.912 ± 0.299
5.247AsnIle: 5.247 ± 0.545
7.452AsnLys: 7.452 ± 0.778
4.791AsnLeu: 4.791 ± 0.684
1.749AsnMet: 1.749 ± 0.293
5.171AsnAsn: 5.171 ± 0.979
3.118AsnPro: 3.118 ± 0.417
3.422AsnGln: 3.422 ± 0.525
3.118AsnArg: 3.118 ± 0.521
3.422AsnSer: 3.422 ± 0.513
3.726AsnThr: 3.726 ± 0.552
4.182AsnVal: 4.182 ± 0.646
1.141AsnTrp: 1.141 ± 0.481
3.118AsnTyr: 3.118 ± 0.541
0.0AsnXaa: 0.0 ± 0.0
Pro
0.836ProAla: 0.836 ± 0.28
0.0ProCys: 0.0 ± 0.0
0.989ProAsp: 0.989 ± 0.245
2.205ProGlu: 2.205 ± 0.41
1.597ProPhe: 1.597 ± 0.381
1.065ProGly: 1.065 ± 0.246
0.456ProHis: 0.456 ± 0.176
2.585ProIle: 2.585 ± 0.473
2.966ProLys: 2.966 ± 0.644
2.281ProLeu: 2.281 ± 0.526
0.532ProMet: 0.532 ± 0.17
1.521ProAsn: 1.521 ± 0.335
0.912ProPro: 0.912 ± 0.258
0.608ProGln: 0.608 ± 0.181
1.217ProArg: 1.217 ± 0.295
1.673ProSer: 1.673 ± 0.344
1.521ProThr: 1.521 ± 0.446
1.445ProVal: 1.445 ± 0.312
0.228ProTrp: 0.228 ± 0.102
1.065ProTyr: 1.065 ± 0.267
0.0ProXaa: 0.0 ± 0.0
Gln
3.042GlnAla: 3.042 ± 0.477
0.456GlnCys: 0.456 ± 0.225
2.281GlnAsp: 2.281 ± 0.501
2.89GlnGlu: 2.89 ± 0.506
1.141GlnPhe: 1.141 ± 0.281
1.825GlnGly: 1.825 ± 0.377
0.912GlnHis: 0.912 ± 0.26
3.65GlnIle: 3.65 ± 0.576
3.346GlnLys: 3.346 ± 0.551
2.966GlnLeu: 2.966 ± 0.554
0.532GlnMet: 0.532 ± 0.209
2.737GlnAsn: 2.737 ± 0.455
0.912GlnPro: 0.912 ± 0.294
1.901GlnGln: 1.901 ± 0.406
2.205GlnArg: 2.205 ± 0.392
2.129GlnSer: 2.129 ± 0.468
1.901GlnThr: 1.901 ± 0.393
1.977GlnVal: 1.977 ± 0.336
0.38GlnTrp: 0.38 ± 0.167
1.369GlnTyr: 1.369 ± 0.298
0.0GlnXaa: 0.0 ± 0.0
Arg
2.281ArgAla: 2.281 ± 0.39
0.152ArgCys: 0.152 ± 0.11
2.585ArgAsp: 2.585 ± 0.482
3.498ArgGlu: 3.498 ± 0.59
1.521ArgPhe: 1.521 ± 0.346
1.825ArgGly: 1.825 ± 0.348
0.684ArgHis: 0.684 ± 0.244
4.258ArgIle: 4.258 ± 0.586
4.258ArgLys: 4.258 ± 0.533
3.118ArgLeu: 3.118 ± 0.516
0.836ArgMet: 0.836 ± 0.264
2.89ArgAsn: 2.89 ± 0.449
0.684ArgPro: 0.684 ± 0.278
1.521ArgGln: 1.521 ± 0.398
2.357ArgArg: 2.357 ± 0.451
1.673ArgSer: 1.673 ± 0.343
2.509ArgThr: 2.509 ± 0.546
2.205ArgVal: 2.205 ± 0.429
0.456ArgTrp: 0.456 ± 0.176
2.89ArgTyr: 2.89 ± 0.587
0.0ArgXaa: 0.0 ± 0.0
Ser
3.194SerAla: 3.194 ± 0.681
0.152SerCys: 0.152 ± 0.118
4.486SerAsp: 4.486 ± 0.638
5.171SerGlu: 5.171 ± 0.594
3.194SerPhe: 3.194 ± 0.51
3.498SerGly: 3.498 ± 0.601
1.065SerHis: 1.065 ± 0.253
4.562SerIle: 4.562 ± 0.645
4.714SerLys: 4.714 ± 0.619
3.65SerLeu: 3.65 ± 0.507
1.293SerMet: 1.293 ± 0.252
4.41SerAsn: 4.41 ± 0.729
0.456SerPro: 0.456 ± 0.19
2.966SerGln: 2.966 ± 0.482
1.749SerArg: 1.749 ± 0.374
3.422SerSer: 3.422 ± 0.55
2.737SerThr: 2.737 ± 0.424
2.966SerVal: 2.966 ± 0.41
0.456SerTrp: 0.456 ± 0.269
2.585SerTyr: 2.585 ± 0.566
0.0SerXaa: 0.0 ± 0.0
Thr
3.574ThrAla: 3.574 ± 0.623
0.152ThrCys: 0.152 ± 0.118
3.118ThrAsp: 3.118 ± 0.77
3.574ThrGlu: 3.574 ± 0.498
2.433ThrPhe: 2.433 ± 0.488
3.65ThrGly: 3.65 ± 0.881
1.521ThrHis: 1.521 ± 0.392
3.878ThrIle: 3.878 ± 0.464
5.247ThrLys: 5.247 ± 0.685
5.019ThrLeu: 5.019 ± 0.591
0.989ThrMet: 0.989 ± 0.321
2.813ThrAsn: 2.813 ± 0.521
2.737ThrPro: 2.737 ± 0.372
1.445ThrGln: 1.445 ± 0.336
2.357ThrArg: 2.357 ± 0.387
3.422ThrSer: 3.422 ± 0.759
3.346ThrThr: 3.346 ± 0.581
3.954ThrVal: 3.954 ± 0.642
0.76ThrTrp: 0.76 ± 0.227
2.661ThrTyr: 2.661 ± 0.441
0.0ThrXaa: 0.0 ± 0.0
Val
3.042ValAla: 3.042 ± 0.592
0.152ValCys: 0.152 ± 0.108
4.182ValAsp: 4.182 ± 0.548
3.954ValGlu: 3.954 ± 0.594
2.129ValPhe: 2.129 ± 0.484
3.042ValGly: 3.042 ± 0.605
0.836ValHis: 0.836 ± 0.274
4.334ValIle: 4.334 ± 0.63
6.692ValLys: 6.692 ± 0.676
4.714ValLeu: 4.714 ± 0.59
1.521ValMet: 1.521 ± 0.273
3.802ValAsn: 3.802 ± 0.512
1.445ValPro: 1.445 ± 0.339
2.357ValGln: 2.357 ± 0.554
2.053ValArg: 2.053 ± 0.446
3.27ValSer: 3.27 ± 0.527
4.03ValThr: 4.03 ± 0.608
4.03ValVal: 4.03 ± 0.612
0.684ValTrp: 0.684 ± 0.233
2.129ValTyr: 2.129 ± 0.394
0.0ValXaa: 0.0 ± 0.0
Trp
0.38TrpAla: 0.38 ± 0.173
0.0TrpCys: 0.0 ± 0.0
0.684TrpAsp: 0.684 ± 0.235
0.684TrpGlu: 0.684 ± 0.264
0.836TrpPhe: 0.836 ± 0.225
0.912TrpGly: 0.912 ± 0.267
0.0TrpHis: 0.0 ± 0.0
1.293TrpIle: 1.293 ± 0.279
1.217TrpLys: 1.217 ± 0.295
1.141TrpLeu: 1.141 ± 0.238
0.38TrpMet: 0.38 ± 0.174
0.912TrpAsn: 0.912 ± 0.317
0.152TrpPro: 0.152 ± 0.097
0.456TrpGln: 0.456 ± 0.15
0.38TrpArg: 0.38 ± 0.167
0.38TrpSer: 0.38 ± 0.164
0.304TrpThr: 0.304 ± 0.128
0.76TrpVal: 0.76 ± 0.23
0.152TrpTrp: 0.152 ± 0.107
0.532TrpTyr: 0.532 ± 0.174
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.977TyrAla: 1.977 ± 0.345
0.304TyrCys: 0.304 ± 0.152
2.89TyrAsp: 2.89 ± 0.593
3.65TyrGlu: 3.65 ± 0.656
2.053TyrPhe: 2.053 ± 0.448
2.89TyrGly: 2.89 ± 0.55
0.912TyrHis: 0.912 ± 0.277
3.346TyrIle: 3.346 ± 0.676
4.638TyrLys: 4.638 ± 0.617
4.182TyrLeu: 4.182 ± 0.671
0.912TyrMet: 0.912 ± 0.249
3.574TyrAsn: 3.574 ± 0.556
0.684TyrPro: 0.684 ± 0.198
1.749TyrGln: 1.749 ± 0.417
2.205TyrArg: 2.205 ± 0.397
2.509TyrSer: 2.509 ± 0.356
2.585TyrThr: 2.585 ± 0.423
2.433TyrVal: 2.433 ± 0.472
0.532TyrTrp: 0.532 ± 0.204
1.293TyrTyr: 1.293 ± 0.407
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (13152 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski