Amino acid dipeptide frequency for Staphylococcus phage 3MRA

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 400 equally likely, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
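The uniform baseline described above can be checked with a few lines of Python. This is only a sketch: the function names are hypothetical, and the simple greater-than/less-than comparison is an assumption, since the page does not state the exact rule used for the red and blue marking (for example, whether the error estimate is taken into account).

```python
# Uniform baseline: every dipeptide is assumed to be equally likely.
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa


def uniform_expectation_per_mille(n_residue_types: int = N_STANDARD) -> float:
    """Expected frequency (in per mille) of one dipeptide under uniformity."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille: float, n_residue_types: int = N_STANDARD) -> str:
    """Compare an observed per-mille value with the uniform expectation.

    Assumption: a plain comparison without any tolerance or error bars.
    """
    expected = uniform_expectation_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(uniform_expectation_per_mille())  # 1000 / 400
print(classify(6.215))                  # AlaLys value from the table
print(classify(0.404))                  # AlaCys value from the table
```

With 20 residue types the baseline is 1000/400 = 2.5‰, which is the 0.25% quoted in the text; with Xaa included it drops to 1000/441, roughly 2.27‰.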

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractional frequencies.
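As an illustration of how such per-mille values can be obtained, the following Python sketch counts overlapping dipeptides in a list of protein sequences. The function name and the toy sequences are hypothetical, one-letter codes are used instead of the three-letter codes of the table, and normalizing by the total number of counted dipeptides is an assumption; the exact normalization used for the table is not stated on this page.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    Each protein is scanned separately, so no pair spans two proteins.
    Assumption: frequencies are normalized by the total dipeptide count.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKKLA", "MKA"]
freqs = dipeptide_per_mille(toy)

# Converting a per-mille value back to a fraction: multiply by 1e-3.
mk_fraction = freqs["MK"] * 1e-3
print(freqs["MK"], mk_fraction)
```

In the toy input, MK occurs twice among six dipeptides, so its fraction is 2/6 and its per-mille value is 1000 times that.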

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.969 ± 0.303
AlaCys: 0.404 ± 0.199
AlaAsp: 2.663 ± 0.507
AlaGlu: 3.793 ± 0.526
AlaPhe: 2.663 ± 0.676
AlaGly: 3.471 ± 0.512
AlaHis: 1.372 ± 0.322
AlaIle: 5.408 ± 0.684
AlaLys: 6.215 ± 0.722
AlaLeu: 4.036 ± 0.782
AlaMet: 2.018 ± 0.469
AlaAsn: 3.955 ± 0.622
AlaPro: 1.695 ± 0.36
AlaGln: 2.341 ± 0.391
AlaArg: 2.825 ± 0.414
AlaSer: 3.551 ± 0.642
AlaThr: 4.681 ± 0.858
AlaVal: 3.148 ± 0.738
AlaTrp: 0.484 ± 0.308
AlaTyr: 2.179 ± 0.474
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.242 ± 0.139
CysCys: 0.0 ± 0.0
CysAsp: 0.404 ± 0.222
CysGlu: 0.242 ± 0.149
CysPhe: 0.161 ± 0.115
CysGly: 0.323 ± 0.15
CysHis: 0.081 ± 0.09
CysIle: 0.081 ± 0.069
CysLys: 0.484 ± 0.203
CysLeu: 0.323 ± 0.16
CysMet: 0.404 ± 0.219
CysAsn: 0.081 ± 0.084
CysPro: 0.404 ± 0.241
CysGln: 0.323 ± 0.144
CysArg: 0.323 ± 0.21
CysSer: 0.404 ± 0.186
CysThr: 0.081 ± 0.09
CysVal: 0.323 ± 0.162
CysTrp: 0.081 ± 0.079
CysTyr: 0.242 ± 0.133
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.874 ± 0.625
AspCys: 0.242 ± 0.154
AspAsp: 4.116 ± 0.765
AspGlu: 5.004 ± 0.685
AspPhe: 3.551 ± 0.688
AspGly: 4.439 ± 0.581
AspHis: 0.565 ± 0.229
AspIle: 4.197 ± 0.553
AspLys: 5.408 ± 1.075
AspLeu: 4.681 ± 0.561
AspMet: 1.453 ± 0.29
AspAsn: 2.825 ± 0.437
AspPro: 1.211 ± 0.259
AspGln: 1.291 ± 0.36
AspArg: 2.179 ± 0.416
AspSer: 4.197 ± 0.588
AspThr: 3.067 ± 0.404
AspVal: 4.681 ± 0.792
AspTrp: 0.807 ± 0.315
AspTyr: 3.551 ± 0.529
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.085 ± 0.7
GluCys: 0.646 ± 0.213
GluAsp: 4.278 ± 0.753
GluGlu: 5.811 ± 1.066
GluPhe: 3.067 ± 0.567
GluGly: 3.148 ± 0.442
GluHis: 1.533 ± 0.341
GluIle: 5.327 ± 0.654
GluLys: 6.215 ± 0.911
GluLeu: 7.264 ± 0.898
GluMet: 2.663 ± 0.509
GluAsn: 4.843 ± 0.599
GluPro: 1.776 ± 0.422
GluGln: 4.439 ± 0.757
GluArg: 2.744 ± 0.475
GluSer: 4.278 ± 0.569
GluThr: 2.986 ± 0.422
GluVal: 5.165 ± 0.65
GluTrp: 1.049 ± 0.273
GluTyr: 3.713 ± 0.646
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.583 ± 0.381
PheCys: 0.161 ± 0.105
PheAsp: 4.6 ± 0.583
PheGlu: 3.471 ± 0.581
PhePhe: 0.969 ± 0.256
PheGly: 2.825 ± 0.653
PheHis: 0.807 ± 0.263
PheIle: 3.067 ± 0.498
PheLys: 4.358 ± 0.672
PheLeu: 2.502 ± 0.402
PheMet: 1.372 ± 0.4
PheAsn: 2.986 ± 0.431
PhePro: 0.726 ± 0.248
PheGln: 1.372 ± 0.402
PheArg: 1.372 ± 0.3
PheSer: 2.26 ± 0.504
PheThr: 2.744 ± 0.491
PheVal: 1.695 ± 0.465
PheTrp: 0.323 ± 0.156
PheTyr: 2.018 ± 0.396
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.793 ± 0.523
GlyCys: 0.242 ± 0.145
GlyAsp: 3.471 ± 0.604
GlyGlu: 3.39 ± 0.56
GlyPhe: 2.341 ± 0.51
GlyGly: 2.744 ± 0.602
GlyHis: 1.372 ± 0.387
GlyIle: 5.004 ± 0.529
GlyLys: 4.762 ± 0.595
GlyLeu: 4.923 ± 0.853
GlyMet: 2.098 ± 0.524
GlyAsn: 3.713 ± 0.683
GlyPro: 0.646 ± 0.246
GlyGln: 2.341 ± 0.351
GlyArg: 1.856 ± 0.477
GlySer: 2.744 ± 0.48
GlyThr: 4.197 ± 0.487
GlyVal: 4.923 ± 0.849
GlyTrp: 1.211 ± 0.509
GlyTyr: 2.663 ± 0.529
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.211 ± 0.304
HisCys: 0.0 ± 0.0
HisAsp: 0.807 ± 0.272
HisGlu: 1.13 ± 0.3
HisPhe: 0.888 ± 0.248
HisGly: 1.291 ± 0.265
HisHis: 0.404 ± 0.243
HisIle: 1.291 ± 0.309
HisLys: 1.13 ± 0.293
HisLeu: 1.291 ± 0.366
HisMet: 0.323 ± 0.156
HisAsn: 1.211 ± 0.315
HisPro: 0.969 ± 0.34
HisGln: 1.13 ± 0.29
HisArg: 0.646 ± 0.246
HisSer: 1.533 ± 0.273
HisThr: 1.211 ± 0.281
HisVal: 0.969 ± 0.277
HisTrp: 0.161 ± 0.131
HisTyr: 0.807 ± 0.375
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.036 ± 0.661
IleCys: 0.323 ± 0.157
IleAsp: 5.408 ± 0.731
IleGlu: 7.748 ± 1.061
IlePhe: 3.471 ± 0.591
IleGly: 5.165 ± 0.856
IleHis: 0.969 ± 0.266
IleIle: 3.39 ± 0.546
IleLys: 7.022 ± 0.786
IleLeu: 4.278 ± 0.688
IleMet: 2.825 ± 0.499
IleAsn: 5.246 ± 0.912
IlePro: 2.341 ± 0.442
IleGln: 3.39 ± 0.501
IleArg: 3.793 ± 0.631
IleSer: 3.793 ± 0.645
IleThr: 5.408 ± 0.661
IleVal: 4.036 ± 0.476
IleTrp: 0.807 ± 0.338
IleTyr: 2.583 ± 0.612
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.327 ± 0.737
LysCys: 0.404 ± 0.173
LysAsp: 5.327 ± 0.809
LysGlu: 8.232 ± 0.859
LysPhe: 3.309 ± 0.492
LysGly: 5.004 ± 0.759
LysHis: 2.179 ± 0.425
LysIle: 5.892 ± 0.757
LysLys: 7.91 ± 1.036
LysLeu: 6.86 ± 0.815
LysMet: 3.148 ± 0.398
LysAsn: 4.52 ± 0.793
LysPro: 3.148 ± 0.546
LysGln: 4.116 ± 0.646
LysArg: 3.713 ± 0.551
LysSer: 4.843 ± 0.692
LysThr: 4.439 ± 0.687
LysVal: 5.085 ± 0.659
LysTrp: 0.807 ± 0.202
LysTyr: 3.793 ± 0.567
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.6 ± 0.707
LeuCys: 0.323 ± 0.16
LeuAsp: 4.843 ± 0.567
LeuGlu: 5.085 ± 0.662
LeuPhe: 3.067 ± 0.446
LeuGly: 3.309 ± 0.504
LeuHis: 1.372 ± 0.329
LeuIle: 5.569 ± 0.656
LeuLys: 7.425 ± 0.603
LeuLeu: 5.73 ± 0.896
LeuMet: 1.856 ± 0.419
LeuAsn: 4.439 ± 0.529
LeuPro: 3.228 ± 0.643
LeuGln: 3.309 ± 0.509
LeuArg: 3.067 ± 0.615
LeuSer: 5.569 ± 0.729
LeuThr: 5.085 ± 0.721
LeuVal: 3.471 ± 0.613
LeuTrp: 0.726 ± 0.264
LeuTyr: 4.358 ± 0.685
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.937 ± 0.592
MetCys: 0.242 ± 0.152
MetAsp: 0.646 ± 0.178
MetGlu: 1.614 ± 0.354
MetPhe: 1.291 ± 0.316
MetGly: 1.453 ± 0.394
MetHis: 0.242 ± 0.122
MetIle: 1.937 ± 0.376
MetLys: 2.502 ± 0.525
MetLeu: 2.502 ± 0.354
MetMet: 0.484 ± 0.195
MetAsn: 2.502 ± 0.43
MetPro: 0.807 ± 0.21
MetGln: 1.211 ± 0.338
MetArg: 1.372 ± 0.314
MetSer: 2.421 ± 0.546
MetThr: 2.986 ± 0.4
MetVal: 0.565 ± 0.179
MetTrp: 0.404 ± 0.156
MetTyr: 1.13 ± 0.309
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.874 ± 0.608
AsnCys: 0.323 ± 0.18
AsnAsp: 3.713 ± 0.625
AsnGlu: 5.892 ± 0.697
AsnPhe: 3.148 ± 0.478
AsnGly: 4.439 ± 0.59
AsnHis: 0.726 ± 0.21
AsnIle: 3.874 ± 0.567
AsnLys: 5.488 ± 0.623
AsnLeu: 4.358 ± 0.62
AsnMet: 1.372 ± 0.354
AsnAsn: 5.246 ± 0.681
AsnPro: 2.341 ± 0.513
AsnGln: 2.341 ± 0.332
AsnArg: 2.583 ± 0.432
AsnSer: 3.228 ± 0.571
AsnThr: 3.309 ± 0.5
AsnVal: 4.197 ± 0.584
AsnTrp: 0.807 ± 0.23
AsnTyr: 2.744 ± 0.5
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.13 ± 0.283
ProCys: 0.081 ± 0.085
ProAsp: 1.453 ± 0.255
ProGlu: 2.098 ± 0.401
ProPhe: 1.13 ± 0.321
ProGly: 1.937 ± 0.553
ProHis: 0.404 ± 0.174
ProIle: 3.067 ± 0.576
ProLys: 2.744 ± 0.604
ProLeu: 2.421 ± 0.555
ProMet: 0.726 ± 0.262
ProAsn: 2.421 ± 0.554
ProPro: 0.807 ± 0.237
ProGln: 1.049 ± 0.267
ProArg: 0.565 ± 0.237
ProSer: 1.614 ± 0.519
ProThr: 2.26 ± 0.383
ProVal: 1.937 ± 0.46
ProTrp: 0.081 ± 0.09
ProTyr: 1.533 ± 0.349
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.067 ± 0.553
GlnCys: 0.565 ± 0.216
GlnAsp: 1.856 ± 0.409
GlnGlu: 2.583 ± 0.531
GlnPhe: 2.26 ± 0.411
GlnGly: 2.421 ± 0.449
GlnHis: 1.211 ± 0.299
GlnIle: 3.793 ± 0.503
GlnLys: 2.825 ± 0.413
GlnLeu: 3.228 ± 0.494
GlnMet: 1.372 ± 0.341
GlnAsn: 2.421 ± 0.404
GlnPro: 1.211 ± 0.463
GlnGln: 2.018 ± 0.499
GlnArg: 1.776 ± 0.38
GlnSer: 2.502 ± 0.451
GlnThr: 2.018 ± 0.394
GlnVal: 2.583 ± 0.422
GlnTrp: 0.323 ± 0.155
GlnTyr: 1.614 ± 0.411
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.856 ± 0.38
ArgCys: 0.323 ± 0.166
ArgAsp: 2.421 ± 0.467
ArgGlu: 3.228 ± 0.605
ArgPhe: 1.776 ± 0.397
ArgGly: 2.421 ± 0.406
ArgHis: 1.291 ± 0.265
ArgIle: 4.036 ± 0.608
ArgLys: 3.955 ± 0.488
ArgLeu: 3.955 ± 0.593
ArgMet: 0.484 ± 0.225
ArgAsn: 3.309 ± 0.49
ArgPro: 1.211 ± 0.244
ArgGln: 1.211 ± 0.282
ArgArg: 1.937 ± 0.438
ArgSer: 1.533 ± 0.38
ArgThr: 1.533 ± 0.323
ArgVal: 1.695 ± 0.322
ArgTrp: 0.323 ± 0.188
ArgTyr: 2.502 ± 0.515
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.197 ± 0.719
SerCys: 0.242 ± 0.181
SerAsp: 4.036 ± 0.526
SerGlu: 3.793 ± 0.505
SerPhe: 2.421 ± 0.41
SerGly: 3.793 ± 0.603
SerHis: 1.049 ± 0.244
SerIle: 5.408 ± 0.659
SerLys: 5.73 ± 0.738
SerLeu: 3.39 ± 0.528
SerMet: 1.937 ± 0.311
SerAsn: 3.39 ± 0.575
SerPro: 1.291 ± 0.454
SerGln: 2.825 ± 0.585
SerArg: 2.341 ± 0.402
SerSer: 3.148 ± 0.553
SerThr: 3.39 ± 0.409
SerVal: 3.551 ± 0.64
SerTrp: 0.807 ± 0.256
SerTyr: 2.098 ± 0.42
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.39 ± 0.49
ThrCys: 0.0 ± 0.0
ThrAsp: 3.713 ± 0.435
ThrGlu: 3.471 ± 0.475
ThrPhe: 2.744 ± 0.517
ThrGly: 3.551 ± 0.535
ThrHis: 1.291 ± 0.312
ThrIle: 5.65 ± 0.736
ThrLys: 4.358 ± 0.539
ThrLeu: 5.73 ± 0.84
ThrMet: 1.291 ± 0.304
ThrAsn: 3.471 ± 0.615
ThrPro: 1.856 ± 0.323
ThrGln: 2.986 ± 0.506
ThrArg: 3.228 ± 0.382
ThrSer: 3.955 ± 0.846
ThrThr: 2.744 ± 0.461
ThrVal: 3.228 ± 0.6
ThrTrp: 1.049 ± 0.335
ThrTyr: 2.502 ± 0.471
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.39 ± 0.804
ValCys: 0.161 ± 0.117
ValAsp: 4.6 ± 0.555
ValGlu: 4.6 ± 0.596
ValPhe: 1.937 ± 0.364
ValGly: 3.148 ± 0.564
ValHis: 0.404 ± 0.157
ValIle: 5.246 ± 0.527
ValLys: 5.165 ± 0.593
ValLeu: 4.923 ± 0.711
ValMet: 1.291 ± 0.275
ValAsn: 3.309 ± 0.544
ValPro: 2.502 ± 0.379
ValGln: 1.533 ± 0.428
ValArg: 2.098 ± 0.402
ValSer: 4.036 ± 0.791
ValThr: 4.358 ± 0.665
ValVal: 4.278 ± 0.586
ValTrp: 0.969 ± 0.257
ValTyr: 2.341 ± 0.512
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.726 ± 0.249
TrpCys: 0.081 ± 0.079
TrpAsp: 0.404 ± 0.18
TrpGlu: 0.969 ± 0.251
TrpPhe: 0.565 ± 0.162
TrpGly: 0.888 ± 0.341
TrpHis: 0.404 ± 0.155
TrpIle: 0.565 ± 0.206
TrpLys: 0.888 ± 0.301
TrpLeu: 0.888 ± 0.306
TrpMet: 0.242 ± 0.144
TrpAsn: 0.807 ± 0.249
TrpPro: 0.081 ± 0.072
TrpGln: 0.646 ± 0.22
TrpArg: 0.323 ± 0.153
TrpSer: 0.726 ± 0.256
TrpThr: 0.888 ± 0.225
TrpVal: 1.291 ± 0.299
TrpTrp: 0.161 ± 0.111
TrpTyr: 0.484 ± 0.22
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.583 ± 0.567
TyrCys: 0.242 ± 0.13
TyrAsp: 2.583 ± 0.483
TyrGlu: 3.874 ± 0.638
TyrPhe: 1.453 ± 0.405
TyrGly: 2.663 ± 0.638
TyrHis: 0.726 ± 0.272
TyrIle: 3.309 ± 0.456
TyrLys: 3.471 ± 0.616
TyrLeu: 3.148 ± 0.492
TyrMet: 0.807 ± 0.258
TyrAsn: 3.309 ± 0.531
TyrPro: 1.211 ± 0.371
TyrGln: 1.776 ± 0.327
TyrArg: 2.26 ± 0.561
TyrSer: 2.502 ± 0.55
TyrThr: 2.825 ± 0.479
TyrVal: 3.471 ± 0.494
TyrTrp: 0.646 ± 0.21
TyrTyr: 1.614 ± 0.449
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (12391 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
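A protein-level bootstrap of this kind might look like the following Python sketch. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the error measure are all assumptions; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Per-mille frequencies of overlapping dipeptides (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation of the bootstrap estimates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKKLA", "MKA", "AAAA"]
print(bootstrap_error(toy, "MK"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects how much the statistic depends on which proteins happen to be in the proteome.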

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski