Amino acid dipepetide frequency for Myxococcus phage Mx8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.757AlaAla: 18.757 ± 1.549
2.174AlaCys: 2.174 ± 0.437
7.018AlaAsp: 7.018 ± 0.683
9.937AlaGlu: 9.937 ± 0.816
2.981AlaPhe: 2.981 ± 0.455
9.378AlaGly: 9.378 ± 0.96
2.298AlaHis: 2.298 ± 0.416
3.913AlaIle: 3.913 ± 0.499
5.59AlaLys: 5.59 ± 0.534
12.794AlaLeu: 12.794 ± 0.888
2.422AlaMet: 2.422 ± 0.42
2.919AlaAsn: 2.919 ± 0.407
7.826AlaPro: 7.826 ± 0.946
5.9AlaGln: 5.9 ± 0.898
11.428AlaArg: 11.428 ± 0.947
8.819AlaSer: 8.819 ± 0.753
6.459AlaThr: 6.459 ± 0.577
9.378AlaVal: 9.378 ± 0.642
1.677AlaTrp: 1.677 ± 0.282
3.292AlaTyr: 3.292 ± 0.411
0.0AlaXaa: 0.0 ± 0.0
Cys
1.366CysAla: 1.366 ± 0.294
0.311CysCys: 0.311 ± 0.139
1.118CysAsp: 1.118 ± 0.24
0.932CysGlu: 0.932 ± 0.26
0.186CysPhe: 0.186 ± 0.097
1.677CysGly: 1.677 ± 0.356
0.373CysHis: 0.373 ± 0.158
0.435CysIle: 0.435 ± 0.166
0.497CysLys: 0.497 ± 0.166
0.683CysLeu: 0.683 ± 0.218
0.186CysMet: 0.186 ± 0.114
0.373CysAsn: 0.373 ± 0.151
1.428CysPro: 1.428 ± 0.306
0.248CysGln: 0.248 ± 0.142
1.801CysArg: 1.801 ± 0.37
0.87CysSer: 0.87 ± 0.223
0.932CysThr: 0.932 ± 0.247
0.994CysVal: 0.994 ± 0.237
0.621CysTrp: 0.621 ± 0.202
0.621CysTyr: 0.621 ± 0.23
0.0CysXaa: 0.0 ± 0.0
Asp
8.757AspAla: 8.757 ± 0.919
0.994AspCys: 0.994 ± 0.222
2.795AspAsp: 2.795 ± 0.455
3.416AspGlu: 3.416 ± 0.373
1.553AspPhe: 1.553 ± 0.362
5.341AspGly: 5.341 ± 0.505
0.497AspHis: 0.497 ± 0.17
1.677AspIle: 1.677 ± 0.276
2.05AspLys: 2.05 ± 0.388
4.907AspLeu: 4.907 ± 0.509
1.304AspMet: 1.304 ± 0.254
0.994AspAsn: 0.994 ± 0.227
2.298AspPro: 2.298 ± 0.376
0.994AspGln: 0.994 ± 0.225
3.043AspArg: 3.043 ± 0.516
2.112AspSer: 2.112 ± 0.333
3.23AspThr: 3.23 ± 0.384
5.217AspVal: 5.217 ± 0.585
1.18AspTrp: 1.18 ± 0.337
1.242AspTyr: 1.242 ± 0.327
0.0AspXaa: 0.0 ± 0.0
Glu
10.186GluAla: 10.186 ± 1.075
0.497GluCys: 0.497 ± 0.139
2.919GluAsp: 2.919 ± 0.423
4.658GluGlu: 4.658 ± 0.637
1.366GluPhe: 1.366 ± 0.289
3.851GluGly: 3.851 ± 0.481
1.304GluHis: 1.304 ± 0.261
1.118GluIle: 1.118 ± 0.231
2.05GluLys: 2.05 ± 0.391
5.528GluLeu: 5.528 ± 0.498
1.553GluMet: 1.553 ± 0.292
1.18GluAsn: 1.18 ± 0.267
3.23GluPro: 3.23 ± 0.413
3.105GluGln: 3.105 ± 0.569
7.515GluArg: 7.515 ± 0.656
3.913GluSer: 3.913 ± 0.476
3.043GluThr: 3.043 ± 0.41
5.528GluVal: 5.528 ± 0.798
1.428GluTrp: 1.428 ± 0.256
1.553GluTyr: 1.553 ± 0.281
0.0GluXaa: 0.0 ± 0.0
Phe
2.919PheAla: 2.919 ± 0.442
0.124PheCys: 0.124 ± 0.085
1.491PheAsp: 1.491 ± 0.416
1.366PheGlu: 1.366 ± 0.289
0.683PhePhe: 0.683 ± 0.179
2.981PheGly: 2.981 ± 0.358
0.497PheHis: 0.497 ± 0.169
0.807PheIle: 0.807 ± 0.213
0.621PheLys: 0.621 ± 0.194
1.615PheLeu: 1.615 ± 0.355
0.559PheMet: 0.559 ± 0.19
0.994PheAsn: 0.994 ± 0.247
1.242PhePro: 1.242 ± 0.259
0.683PheGln: 0.683 ± 0.173
2.05PheArg: 2.05 ± 0.392
1.428PheSer: 1.428 ± 0.293
1.491PheThr: 1.491 ± 0.409
1.987PheVal: 1.987 ± 0.256
0.186PheTrp: 0.186 ± 0.116
0.559PheTyr: 0.559 ± 0.158
0.0PheXaa: 0.0 ± 0.0
Gly
8.881GlyAla: 8.881 ± 1.093
1.553GlyCys: 1.553 ± 0.427
4.099GlyAsp: 4.099 ± 0.468
4.72GlyGlu: 4.72 ± 0.597
2.236GlyPhe: 2.236 ± 0.365
7.577GlyGly: 7.577 ± 0.852
1.491GlyHis: 1.491 ± 0.33
1.677GlyIle: 1.677 ± 0.285
3.168GlyLys: 3.168 ± 0.359
7.701GlyLeu: 7.701 ± 0.669
1.677GlyMet: 1.677 ± 0.272
1.553GlyAsn: 1.553 ± 0.318
3.913GlyPro: 3.913 ± 0.484
3.354GlyGln: 3.354 ± 0.517
7.515GlyArg: 7.515 ± 0.643
3.851GlySer: 3.851 ± 0.562
5.776GlyThr: 5.776 ± 0.892
5.776GlyVal: 5.776 ± 0.754
1.801GlyTrp: 1.801 ± 0.288
2.112GlyTyr: 2.112 ± 0.385
0.0GlyXaa: 0.0 ± 0.0
His
1.677HisAla: 1.677 ± 0.302
0.311HisCys: 0.311 ± 0.137
1.118HisAsp: 1.118 ± 0.298
1.056HisGlu: 1.056 ± 0.272
0.559HisPhe: 0.559 ± 0.175
1.553HisGly: 1.553 ± 0.308
0.807HisHis: 0.807 ± 0.25
0.124HisIle: 0.124 ± 0.081
0.807HisLys: 0.807 ± 0.217
1.987HisLeu: 1.987 ± 0.363
0.497HisMet: 0.497 ± 0.161
0.311HisAsn: 0.311 ± 0.117
1.056HisPro: 1.056 ± 0.222
0.87HisGln: 0.87 ± 0.225
1.553HisArg: 1.553 ± 0.314
0.683HisSer: 0.683 ± 0.209
0.932HisThr: 0.932 ± 0.219
1.056HisVal: 1.056 ± 0.326
0.373HisTrp: 0.373 ± 0.155
0.497HisTyr: 0.497 ± 0.176
0.0HisXaa: 0.0 ± 0.0
Ile
3.168IleAla: 3.168 ± 0.398
0.248IleCys: 0.248 ± 0.115
0.745IleAsp: 0.745 ± 0.228
1.491IleGlu: 1.491 ± 0.299
0.497IlePhe: 0.497 ± 0.151
2.422IleGly: 2.422 ± 0.385
0.186IleHis: 0.186 ± 0.094
0.683IleIle: 0.683 ± 0.268
0.373IleLys: 0.373 ± 0.173
1.801IleLeu: 1.801 ± 0.328
0.435IleMet: 0.435 ± 0.172
0.932IleAsn: 0.932 ± 0.303
1.056IlePro: 1.056 ± 0.23
0.87IleGln: 0.87 ± 0.239
2.484IleArg: 2.484 ± 0.381
1.18IleSer: 1.18 ± 0.294
1.615IleThr: 1.615 ± 0.29
1.428IleVal: 1.428 ± 0.274
0.373IleTrp: 0.373 ± 0.183
0.248IleTyr: 0.248 ± 0.111
0.0IleXaa: 0.0 ± 0.0
Lys
5.714LysAla: 5.714 ± 0.555
0.373LysCys: 0.373 ± 0.142
1.863LysAsp: 1.863 ± 0.329
2.298LysGlu: 2.298 ± 0.354
0.621LysPhe: 0.621 ± 0.189
2.422LysGly: 2.422 ± 0.476
0.559LysHis: 0.559 ± 0.196
0.186LysIle: 0.186 ± 0.144
1.863LysLys: 1.863 ± 0.542
2.422LysLeu: 2.422 ± 0.391
0.559LysMet: 0.559 ± 0.16
0.248LysAsn: 0.248 ± 0.112
2.609LysPro: 2.609 ± 0.364
1.366LysGln: 1.366 ± 0.234
3.602LysArg: 3.602 ± 0.552
1.987LysSer: 1.987 ± 0.315
1.615LysThr: 1.615 ± 0.345
2.236LysVal: 2.236 ± 0.4
0.621LysTrp: 0.621 ± 0.189
0.932LysTyr: 0.932 ± 0.218
0.0LysXaa: 0.0 ± 0.0
Leu
12.608LeuAla: 12.608 ± 1.032
1.863LeuCys: 1.863 ± 0.396
4.844LeuAsp: 4.844 ± 0.549
6.956LeuGlu: 6.956 ± 0.78
1.925LeuPhe: 1.925 ± 0.395
7.018LeuGly: 7.018 ± 0.776
1.304LeuHis: 1.304 ± 0.3
1.739LeuIle: 1.739 ± 0.326
3.478LeuLys: 3.478 ± 0.435
8.447LeuLeu: 8.447 ± 0.642
1.242LeuMet: 1.242 ± 0.278
1.491LeuAsn: 1.491 ± 0.266
4.596LeuPro: 4.596 ± 0.524
3.105LeuGln: 3.105 ± 0.452
7.453LeuArg: 7.453 ± 0.71
4.969LeuSer: 4.969 ± 0.572
4.534LeuThr: 4.534 ± 0.528
6.087LeuVal: 6.087 ± 0.602
1.863LeuTrp: 1.863 ± 0.351
1.863LeuTyr: 1.863 ± 0.33
0.0LeuXaa: 0.0 ± 0.0
Met
2.981MetAla: 2.981 ± 0.42
0.311MetCys: 0.311 ± 0.187
1.242MetAsp: 1.242 ± 0.309
0.807MetGlu: 0.807 ± 0.203
0.248MetPhe: 0.248 ± 0.133
1.242MetGly: 1.242 ± 0.255
0.497MetHis: 0.497 ± 0.167
0.248MetIle: 0.248 ± 0.108
0.621MetLys: 0.621 ± 0.177
1.615MetLeu: 1.615 ± 0.278
0.186MetMet: 0.186 ± 0.109
0.559MetAsn: 0.559 ± 0.188
1.677MetPro: 1.677 ± 0.236
1.056MetGln: 1.056 ± 0.217
1.863MetArg: 1.863 ± 0.346
2.36MetSer: 2.36 ± 0.335
2.422MetThr: 2.422 ± 0.454
1.118MetVal: 1.118 ± 0.244
0.311MetTrp: 0.311 ± 0.149
0.373MetTyr: 0.373 ± 0.132
0.0MetXaa: 0.0 ± 0.0
Asn
3.292AsnAla: 3.292 ± 0.572
0.248AsnCys: 0.248 ± 0.139
1.18AsnAsp: 1.18 ± 0.267
1.553AsnGlu: 1.553 ± 0.268
0.559AsnPhe: 0.559 ± 0.186
1.739AsnGly: 1.739 ± 0.291
0.248AsnHis: 0.248 ± 0.122
0.745AsnIle: 0.745 ± 0.241
0.248AsnLys: 0.248 ± 0.147
1.739AsnLeu: 1.739 ± 0.355
0.373AsnMet: 0.373 ± 0.138
0.497AsnAsn: 0.497 ± 0.176
1.987AsnPro: 1.987 ± 0.328
0.683AsnGln: 0.683 ± 0.294
1.366AsnArg: 1.366 ± 0.319
1.366AsnSer: 1.366 ± 0.363
1.366AsnThr: 1.366 ± 0.279
1.428AsnVal: 1.428 ± 0.256
0.559AsnTrp: 0.559 ± 0.191
0.248AsnTyr: 0.248 ± 0.116
0.0AsnXaa: 0.0 ± 0.0
Pro
7.701ProAla: 7.701 ± 0.879
0.932ProCys: 0.932 ± 0.243
2.795ProAsp: 2.795 ± 0.45
4.037ProGlu: 4.037 ± 0.579
1.428ProPhe: 1.428 ± 0.306
4.782ProGly: 4.782 ± 0.538
1.366ProHis: 1.366 ± 0.284
0.932ProIle: 0.932 ± 0.23
2.546ProLys: 2.546 ± 0.419
4.782ProLeu: 4.782 ± 0.518
2.05ProMet: 2.05 ± 0.356
1.118ProAsn: 1.118 ± 0.236
5.838ProPro: 5.838 ± 0.842
2.422ProGln: 2.422 ± 0.395
4.348ProArg: 4.348 ± 0.549
4.223ProSer: 4.223 ± 0.553
3.664ProThr: 3.664 ± 0.435
4.472ProVal: 4.472 ± 0.478
1.18ProTrp: 1.18 ± 0.224
0.932ProTyr: 0.932 ± 0.261
0.0ProXaa: 0.0 ± 0.0
Gln
5.9GlnAla: 5.9 ± 0.933
0.311GlnCys: 0.311 ± 0.121
2.236GlnAsp: 2.236 ± 0.352
2.05GlnGlu: 2.05 ± 0.343
0.745GlnPhe: 0.745 ± 0.181
3.168GlnGly: 3.168 ± 0.454
0.745GlnHis: 0.745 ± 0.211
0.559GlnIle: 0.559 ± 0.2
1.366GlnLys: 1.366 ± 0.269
2.671GlnLeu: 2.671 ± 0.417
1.366GlnMet: 1.366 ± 0.283
0.621GlnAsn: 0.621 ± 0.167
2.236GlnPro: 2.236 ± 0.361
3.168GlnGln: 3.168 ± 0.664
3.913GlnArg: 3.913 ± 0.43
1.615GlnSer: 1.615 ± 0.365
2.05GlnThr: 2.05 ± 0.339
3.292GlnVal: 3.292 ± 0.464
1.118GlnTrp: 1.118 ± 0.268
0.87GlnTyr: 0.87 ± 0.216
0.0GlnXaa: 0.0 ± 0.0
Arg
10.931ArgAla: 10.931 ± 0.837
1.18ArgCys: 1.18 ± 0.262
4.41ArgAsp: 4.41 ± 0.511
6.646ArgGlu: 6.646 ± 0.642
2.795ArgPhe: 2.795 ± 0.415
5.59ArgGly: 5.59 ± 0.5
1.18ArgHis: 1.18 ± 0.312
2.422ArgIle: 2.422 ± 0.452
2.609ArgLys: 2.609 ± 0.411
9.316ArgLeu: 9.316 ± 0.814
2.484ArgMet: 2.484 ± 0.376
2.174ArgAsn: 2.174 ± 0.359
5.279ArgPro: 5.279 ± 0.695
3.913ArgGln: 3.913 ± 0.6
9.937ArgArg: 9.937 ± 1.208
4.348ArgSer: 4.348 ± 0.588
3.975ArgThr: 3.975 ± 0.606
7.142ArgVal: 7.142 ± 0.708
1.987ArgTrp: 1.987 ± 0.357
1.925ArgTyr: 1.925 ± 0.406
0.0ArgXaa: 0.0 ± 0.0
Ser
7.018SerAla: 7.018 ± 0.733
1.056SerCys: 1.056 ± 0.242
3.168SerAsp: 3.168 ± 0.43
3.292SerGlu: 3.292 ± 0.391
1.553SerPhe: 1.553 ± 0.395
5.838SerGly: 5.838 ± 0.734
0.87SerHis: 0.87 ± 0.263
1.925SerIle: 1.925 ± 0.339
1.491SerLys: 1.491 ± 0.32
4.782SerLeu: 4.782 ± 0.506
1.366SerMet: 1.366 ± 0.291
1.366SerAsn: 1.366 ± 0.284
3.54SerPro: 3.54 ± 0.508
2.05SerGln: 2.05 ± 0.376
4.658SerArg: 4.658 ± 0.447
4.099SerSer: 4.099 ± 0.664
4.161SerThr: 4.161 ± 0.497
3.726SerVal: 3.726 ± 0.496
0.745SerTrp: 0.745 ± 0.228
0.559SerTyr: 0.559 ± 0.204
0.0SerXaa: 0.0 ± 0.0
Thr
7.515ThrAla: 7.515 ± 1.0
1.366ThrCys: 1.366 ± 0.363
3.23ThrAsp: 3.23 ± 0.361
2.919ThrGlu: 2.919 ± 0.49
1.863ThrPhe: 1.863 ± 0.394
5.9ThrGly: 5.9 ± 0.677
1.118ThrHis: 1.118 ± 0.306
1.242ThrIle: 1.242 ± 0.344
1.863ThrLys: 1.863 ± 0.381
4.596ThrLeu: 4.596 ± 0.596
0.87ThrMet: 0.87 ± 0.359
1.615ThrAsn: 1.615 ± 0.261
5.403ThrPro: 5.403 ± 0.556
1.987ThrGln: 1.987 ± 0.361
4.099ThrArg: 4.099 ± 0.515
2.919ThrSer: 2.919 ± 0.468
3.726ThrThr: 3.726 ± 0.487
3.416ThrVal: 3.416 ± 0.447
1.118ThrTrp: 1.118 ± 0.273
1.428ThrTyr: 1.428 ± 0.291
0.0ThrXaa: 0.0 ± 0.0
Val
10.683ValAla: 10.683 ± 0.73
0.745ValCys: 0.745 ± 0.213
4.72ValAsp: 4.72 ± 0.534
5.031ValGlu: 5.031 ± 0.529
1.242ValPhe: 1.242 ± 0.27
4.41ValGly: 4.41 ± 0.505
1.553ValHis: 1.553 ± 0.284
1.428ValIle: 1.428 ± 0.295
2.05ValLys: 2.05 ± 0.351
6.583ValLeu: 6.583 ± 0.637
1.118ValMet: 1.118 ± 0.201
1.553ValAsn: 1.553 ± 0.315
4.161ValPro: 4.161 ± 0.606
2.05ValGln: 2.05 ± 0.368
6.894ValArg: 6.894 ± 0.724
4.41ValSer: 4.41 ± 0.554
5.093ValThr: 5.093 ± 0.654
5.776ValVal: 5.776 ± 0.549
1.553ValTrp: 1.553 ± 0.391
1.615ValTyr: 1.615 ± 0.377
0.0ValXaa: 0.0 ± 0.0
Trp
2.422TrpAla: 2.422 ± 0.376
0.621TrpCys: 0.621 ± 0.226
1.304TrpAsp: 1.304 ± 0.27
0.932TrpGlu: 0.932 ± 0.256
0.745TrpPhe: 0.745 ± 0.193
0.932TrpGly: 0.932 ± 0.228
0.745TrpHis: 0.745 ± 0.195
0.124TrpIle: 0.124 ± 0.09
0.497TrpLys: 0.497 ± 0.197
1.428TrpLeu: 1.428 ± 0.325
0.932TrpMet: 0.932 ± 0.229
0.373TrpAsn: 0.373 ± 0.15
1.242TrpPro: 1.242 ± 0.282
1.056TrpGln: 1.056 ± 0.284
1.863TrpArg: 1.863 ± 0.399
0.994TrpSer: 0.994 ± 0.301
1.242TrpThr: 1.242 ± 0.265
1.304TrpVal: 1.304 ± 0.29
0.435TrpTrp: 0.435 ± 0.156
0.311TrpTyr: 0.311 ± 0.122
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.484TyrAla: 2.484 ± 0.521
0.435TyrCys: 0.435 ± 0.206
1.304TyrAsp: 1.304 ± 0.25
1.428TyrGlu: 1.428 ± 0.287
0.497TyrPhe: 0.497 ± 0.155
2.609TyrGly: 2.609 ± 0.389
0.186TyrHis: 0.186 ± 0.097
0.248TyrIle: 0.248 ± 0.161
0.311TyrLys: 0.311 ± 0.116
1.987TyrLeu: 1.987 ± 0.329
0.497TyrMet: 0.497 ± 0.177
0.559TyrAsn: 0.559 ± 0.165
0.932TyrPro: 0.932 ± 0.25
1.118TyrGln: 1.118 ± 0.231
2.857TyrArg: 2.857 ± 0.44
1.118TyrSer: 1.118 ± 0.236
0.932TyrThr: 0.932 ± 0.222
1.304TyrVal: 1.304 ± 0.29
0.373TyrTrp: 0.373 ± 0.158
0.248TyrTyr: 0.248 ± 0.115
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (16102 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski