Amino acid dipepetide frequency for Microbacterium phage Burro

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.864AlaAla: 14.864 ± 1.382
0.117AlaCys: 0.117 ± 0.11
6.179AlaAsp: 6.179 ± 0.838
6.296AlaGlu: 6.296 ± 0.691
2.681AlaPhe: 2.681 ± 0.487
8.161AlaGly: 8.161 ± 0.801
1.341AlaHis: 1.341 ± 0.305
4.08AlaIle: 4.08 ± 0.416
3.206AlaLys: 3.206 ± 0.443
9.094AlaLeu: 9.094 ± 0.645
2.273AlaMet: 2.273 ± 0.427
4.722AlaAsn: 4.722 ± 0.484
5.246AlaPro: 5.246 ± 0.641
4.314AlaGln: 4.314 ± 0.427
5.596AlaArg: 5.596 ± 0.649
5.771AlaSer: 5.771 ± 0.743
6.878AlaThr: 6.878 ± 0.966
6.004AlaVal: 6.004 ± 0.64
2.448AlaTrp: 2.448 ± 0.466
2.273AlaTyr: 2.273 ± 0.378
0.0AlaXaa: 0.0 ± 0.0
Cys
0.175CysAla: 0.175 ± 0.104
0.0CysCys: 0.0 ± 0.0
0.233CysAsp: 0.233 ± 0.14
0.291CysGlu: 0.291 ± 0.144
0.058CysPhe: 0.058 ± 0.064
0.233CysGly: 0.233 ± 0.134
0.117CysHis: 0.117 ± 0.079
0.117CysIle: 0.117 ± 0.098
0.175CysLys: 0.175 ± 0.101
0.35CysLeu: 0.35 ± 0.215
0.058CysMet: 0.058 ± 0.056
0.175CysAsn: 0.175 ± 0.113
0.291CysPro: 0.291 ± 0.241
0.0CysGln: 0.0 ± 0.0
0.117CysArg: 0.117 ± 0.11
0.525CysSer: 0.525 ± 0.228
0.233CysThr: 0.233 ± 0.135
0.408CysVal: 0.408 ± 0.157
0.058CysTrp: 0.058 ± 0.054
0.117CysTyr: 0.117 ± 0.086
0.0CysXaa: 0.0 ± 0.0
Asp
7.345AspAla: 7.345 ± 0.79
0.408AspCys: 0.408 ± 0.225
4.255AspAsp: 4.255 ± 0.552
4.197AspGlu: 4.197 ± 0.704
2.448AspPhe: 2.448 ± 0.338
4.897AspGly: 4.897 ± 0.599
0.7AspHis: 0.7 ± 0.206
3.498AspIle: 3.498 ± 0.345
3.206AspLys: 3.206 ± 0.543
6.179AspLeu: 6.179 ± 0.652
1.166AspMet: 1.166 ± 0.196
2.565AspAsn: 2.565 ± 0.262
4.08AspPro: 4.08 ± 0.483
2.915AspGln: 2.915 ± 0.389
3.206AspArg: 3.206 ± 0.486
3.556AspSer: 3.556 ± 0.421
3.323AspThr: 3.323 ± 0.488
3.381AspVal: 3.381 ± 0.581
1.632AspTrp: 1.632 ± 0.331
2.04AspTyr: 2.04 ± 0.431
0.0AspXaa: 0.0 ± 0.0
Glu
5.188GluAla: 5.188 ± 0.62
0.175GluCys: 0.175 ± 0.109
3.206GluAsp: 3.206 ± 0.375
3.614GluGlu: 3.614 ± 0.645
1.749GluPhe: 1.749 ± 0.242
4.897GluGly: 4.897 ± 0.728
0.874GluHis: 0.874 ± 0.228
2.856GluIle: 2.856 ± 0.567
3.672GluLys: 3.672 ± 0.83
5.305GluLeu: 5.305 ± 0.527
1.865GluMet: 1.865 ± 0.321
2.565GluAsn: 2.565 ± 0.276
2.507GluPro: 2.507 ± 0.446
2.623GluGln: 2.623 ± 0.453
3.964GluArg: 3.964 ± 0.55
2.915GluSer: 2.915 ± 0.464
4.08GluThr: 4.08 ± 0.524
5.071GluVal: 5.071 ± 0.604
1.399GluTrp: 1.399 ± 0.374
1.574GluTyr: 1.574 ± 0.369
0.0GluXaa: 0.0 ± 0.0
Phe
3.381PheAla: 3.381 ± 0.472
0.058PheCys: 0.058 ± 0.057
2.973PheAsp: 2.973 ± 0.318
2.157PheGlu: 2.157 ± 0.387
1.224PhePhe: 1.224 ± 0.253
2.157PheGly: 2.157 ± 0.341
0.35PheHis: 0.35 ± 0.166
1.574PheIle: 1.574 ± 0.213
1.049PheLys: 1.049 ± 0.203
2.507PheLeu: 2.507 ± 0.404
0.874PheMet: 0.874 ± 0.283
1.516PheAsn: 1.516 ± 0.252
1.282PhePro: 1.282 ± 0.267
1.399PheGln: 1.399 ± 0.366
1.632PheArg: 1.632 ± 0.331
1.924PheSer: 1.924 ± 0.258
2.623PheThr: 2.623 ± 0.499
2.215PheVal: 2.215 ± 0.343
0.233PheTrp: 0.233 ± 0.094
1.282PheTyr: 1.282 ± 0.24
0.0PheXaa: 0.0 ± 0.0
Gly
7.869GlyAla: 7.869 ± 0.829
0.466GlyCys: 0.466 ± 0.205
4.955GlyAsp: 4.955 ± 0.493
3.906GlyGlu: 3.906 ± 0.675
1.865GlyPhe: 1.865 ± 0.292
8.336GlyGly: 8.336 ± 1.133
1.049GlyHis: 1.049 ± 0.287
2.74GlyIle: 2.74 ± 0.453
3.847GlyLys: 3.847 ± 0.752
7.228GlyLeu: 7.228 ± 0.825
1.807GlyMet: 1.807 ± 0.416
3.556GlyAsn: 3.556 ± 0.497
3.089GlyPro: 3.089 ± 0.874
3.439GlyGln: 3.439 ± 0.493
3.731GlyArg: 3.731 ± 0.728
5.13GlySer: 5.13 ± 0.64
7.811GlyThr: 7.811 ± 0.7
5.596GlyVal: 5.596 ± 0.591
2.215GlyTrp: 2.215 ± 0.319
3.031GlyTyr: 3.031 ± 0.357
0.0GlyXaa: 0.0 ± 0.0
His
0.991HisAla: 0.991 ± 0.259
0.175HisCys: 0.175 ± 0.099
0.7HisAsp: 0.7 ± 0.204
1.049HisGlu: 1.049 ± 0.349
0.641HisPhe: 0.641 ± 0.216
1.282HisGly: 1.282 ± 0.262
0.175HisHis: 0.175 ± 0.102
0.583HisIle: 0.583 ± 0.158
0.466HisLys: 0.466 ± 0.139
1.516HisLeu: 1.516 ± 0.433
0.117HisMet: 0.117 ± 0.092
0.466HisAsn: 0.466 ± 0.172
1.049HisPro: 1.049 ± 0.271
0.933HisGln: 0.933 ± 0.258
1.166HisArg: 1.166 ± 0.358
0.758HisSer: 0.758 ± 0.22
0.408HisThr: 0.408 ± 0.138
0.641HisVal: 0.641 ± 0.177
0.233HisTrp: 0.233 ± 0.113
0.641HisTyr: 0.641 ± 0.156
0.0HisXaa: 0.0 ± 0.0
Ile
4.022IleAla: 4.022 ± 0.402
0.058IleCys: 0.058 ± 0.072
2.856IleAsp: 2.856 ± 0.378
3.964IleGlu: 3.964 ± 0.504
1.224IlePhe: 1.224 ± 0.201
3.614IleGly: 3.614 ± 0.435
0.758IleHis: 0.758 ± 0.199
1.457IleIle: 1.457 ± 0.309
2.099IleLys: 2.099 ± 0.563
3.498IleLeu: 3.498 ± 0.406
1.341IleMet: 1.341 ± 0.327
1.749IleAsn: 1.749 ± 0.361
2.798IlePro: 2.798 ± 0.521
2.273IleGln: 2.273 ± 0.399
2.623IleArg: 2.623 ± 0.378
2.681IleSer: 2.681 ± 0.513
2.798IleThr: 2.798 ± 0.494
3.498IleVal: 3.498 ± 0.518
0.7IleTrp: 0.7 ± 0.187
0.933IleTyr: 0.933 ± 0.206
0.0IleXaa: 0.0 ± 0.0
Lys
3.964LysAla: 3.964 ± 0.406
0.175LysCys: 0.175 ± 0.168
2.39LysAsp: 2.39 ± 0.43
2.39LysGlu: 2.39 ± 0.454
0.758LysPhe: 0.758 ± 0.279
2.681LysGly: 2.681 ± 0.603
0.641LysHis: 0.641 ± 0.169
2.507LysIle: 2.507 ± 0.457
1.924LysLys: 1.924 ± 0.404
4.139LysLeu: 4.139 ± 0.491
1.282LysMet: 1.282 ± 0.333
2.04LysAsn: 2.04 ± 0.333
2.39LysPro: 2.39 ± 0.449
1.749LysGln: 1.749 ± 0.337
2.332LysArg: 2.332 ± 0.413
2.448LysSer: 2.448 ± 0.603
2.681LysThr: 2.681 ± 0.367
2.74LysVal: 2.74 ± 0.425
0.7LysTrp: 0.7 ± 0.213
1.69LysTyr: 1.69 ± 0.275
0.0LysXaa: 0.0 ± 0.0
Leu
7.053LeuAla: 7.053 ± 0.507
0.117LeuCys: 0.117 ± 0.093
4.955LeuAsp: 4.955 ± 0.555
4.897LeuGlu: 4.897 ± 0.618
3.148LeuPhe: 3.148 ± 0.335
6.878LeuGly: 6.878 ± 0.691
1.282LeuHis: 1.282 ± 0.274
3.614LeuIle: 3.614 ± 0.423
3.672LeuLys: 3.672 ± 0.373
5.479LeuLeu: 5.479 ± 0.712
2.273LeuMet: 2.273 ± 0.379
5.13LeuAsn: 5.13 ± 0.561
4.139LeuPro: 4.139 ± 0.449
3.789LeuGln: 3.789 ± 0.632
5.363LeuArg: 5.363 ± 0.504
5.887LeuSer: 5.887 ± 0.685
6.179LeuThr: 6.179 ± 0.482
5.946LeuVal: 5.946 ± 0.826
1.341LeuTrp: 1.341 ± 0.227
2.157LeuTyr: 2.157 ± 0.4
0.0LeuXaa: 0.0 ± 0.0
Met
3.031MetAla: 3.031 ± 0.51
0.175MetCys: 0.175 ± 0.107
1.516MetAsp: 1.516 ± 0.24
1.69MetGlu: 1.69 ± 0.244
1.341MetPhe: 1.341 ± 0.242
1.865MetGly: 1.865 ± 0.304
0.233MetHis: 0.233 ± 0.138
1.166MetIle: 1.166 ± 0.274
1.049MetLys: 1.049 ± 0.235
2.099MetLeu: 2.099 ± 0.427
0.641MetMet: 0.641 ± 0.228
0.641MetAsn: 0.641 ± 0.166
1.749MetPro: 1.749 ± 0.364
0.933MetGln: 0.933 ± 0.239
1.108MetArg: 1.108 ± 0.311
1.982MetSer: 1.982 ± 0.256
1.924MetThr: 1.924 ± 0.33
1.516MetVal: 1.516 ± 0.308
0.35MetTrp: 0.35 ± 0.145
0.35MetTyr: 0.35 ± 0.177
0.0MetXaa: 0.0 ± 0.0
Asn
4.605AsnAla: 4.605 ± 0.43
0.35AsnCys: 0.35 ± 0.152
2.448AsnAsp: 2.448 ± 0.402
1.924AsnGlu: 1.924 ± 0.353
1.166AsnPhe: 1.166 ± 0.271
4.255AsnGly: 4.255 ± 0.515
0.816AsnHis: 0.816 ± 0.192
1.807AsnIle: 1.807 ± 0.4
1.574AsnLys: 1.574 ± 0.313
3.498AsnLeu: 3.498 ± 0.685
0.758AsnMet: 0.758 ± 0.2
1.865AsnAsn: 1.865 ± 0.319
2.915AsnPro: 2.915 ± 0.295
1.632AsnGln: 1.632 ± 0.318
2.448AsnArg: 2.448 ± 0.273
3.148AsnSer: 3.148 ± 0.429
3.614AsnThr: 3.614 ± 0.492
3.556AsnVal: 3.556 ± 0.3
1.108AsnTrp: 1.108 ± 0.187
1.574AsnTyr: 1.574 ± 0.325
0.0AsnXaa: 0.0 ± 0.0
Pro
5.829ProAla: 5.829 ± 0.791
0.058ProCys: 0.058 ± 0.079
3.381ProAsp: 3.381 ± 0.431
3.381ProGlu: 3.381 ± 0.442
1.341ProPhe: 1.341 ± 0.289
4.255ProGly: 4.255 ± 0.61
0.758ProHis: 0.758 ± 0.267
3.206ProIle: 3.206 ± 0.537
2.273ProLys: 2.273 ± 0.419
3.672ProLeu: 3.672 ± 0.49
1.516ProMet: 1.516 ± 0.232
2.565ProAsn: 2.565 ± 0.328
2.448ProPro: 2.448 ± 0.495
2.273ProGln: 2.273 ± 0.43
2.157ProArg: 2.157 ± 0.425
2.623ProSer: 2.623 ± 0.359
3.847ProThr: 3.847 ± 0.576
3.148ProVal: 3.148 ± 0.427
0.816ProTrp: 0.816 ± 0.2
1.224ProTyr: 1.224 ± 0.277
0.0ProXaa: 0.0 ± 0.0
Gln
5.13GlnAla: 5.13 ± 0.665
0.233GlnCys: 0.233 ± 0.132
2.273GlnAsp: 2.273 ± 0.297
2.856GlnGlu: 2.856 ± 0.391
1.224GlnPhe: 1.224 ± 0.246
3.264GlnGly: 3.264 ± 0.487
0.583GlnHis: 0.583 ± 0.215
2.099GlnIle: 2.099 ± 0.31
1.69GlnLys: 1.69 ± 0.291
3.731GlnLeu: 3.731 ± 0.417
1.574GlnMet: 1.574 ± 0.357
1.865GlnAsn: 1.865 ± 0.381
1.807GlnPro: 1.807 ± 0.259
3.381GlnGln: 3.381 ± 0.606
2.856GlnArg: 2.856 ± 0.408
2.39GlnSer: 2.39 ± 0.373
3.556GlnThr: 3.556 ± 0.456
2.915GlnVal: 2.915 ± 0.362
0.7GlnTrp: 0.7 ± 0.201
1.457GlnTyr: 1.457 ± 0.213
0.0GlnXaa: 0.0 ± 0.0
Arg
4.78ArgAla: 4.78 ± 0.799
0.117ArgCys: 0.117 ± 0.099
4.197ArgAsp: 4.197 ± 0.493
3.672ArgGlu: 3.672 ± 0.45
1.924ArgPhe: 1.924 ± 0.343
3.614ArgGly: 3.614 ± 0.689
0.816ArgHis: 0.816 ± 0.184
2.565ArgIle: 2.565 ± 0.321
2.507ArgLys: 2.507 ± 0.4
5.13ArgLeu: 5.13 ± 0.561
1.749ArgMet: 1.749 ± 0.364
2.623ArgAsn: 2.623 ± 0.439
2.273ArgPro: 2.273 ± 0.37
2.623ArgGln: 2.623 ± 0.387
3.148ArgArg: 3.148 ± 0.407
3.556ArgSer: 3.556 ± 0.498
3.323ArgThr: 3.323 ± 0.295
4.139ArgVal: 4.139 ± 0.408
0.933ArgTrp: 0.933 ± 0.249
2.215ArgTyr: 2.215 ± 0.281
0.0ArgXaa: 0.0 ± 0.0
Ser
5.946SerAla: 5.946 ± 0.595
0.058SerCys: 0.058 ± 0.083
4.139SerAsp: 4.139 ± 0.466
3.498SerGlu: 3.498 ± 0.449
2.681SerPhe: 2.681 ± 0.339
5.013SerGly: 5.013 ± 0.694
0.583SerHis: 0.583 ± 0.153
3.381SerIle: 3.381 ± 0.515
2.099SerLys: 2.099 ± 0.353
4.955SerLeu: 4.955 ± 0.505
1.632SerMet: 1.632 ± 0.335
2.565SerAsn: 2.565 ± 0.432
3.556SerPro: 3.556 ± 0.411
2.973SerGln: 2.973 ± 0.514
3.614SerArg: 3.614 ± 0.37
4.314SerSer: 4.314 ± 0.494
4.139SerThr: 4.139 ± 0.458
4.022SerVal: 4.022 ± 0.533
1.341SerTrp: 1.341 ± 0.27
2.04SerTyr: 2.04 ± 0.262
0.0SerXaa: 0.0 ± 0.0
Thr
7.053ThrAla: 7.053 ± 0.727
0.233ThrCys: 0.233 ± 0.148
5.188ThrAsp: 5.188 ± 0.42
3.498ThrGlu: 3.498 ± 0.384
3.206ThrPhe: 3.206 ± 0.502
6.937ThrGly: 6.937 ± 0.734
1.049ThrHis: 1.049 ± 0.301
2.856ThrIle: 2.856 ± 0.543
2.856ThrLys: 2.856 ± 0.39
5.887ThrLeu: 5.887 ± 0.528
1.457ThrMet: 1.457 ± 0.271
2.565ThrAsn: 2.565 ± 0.314
3.672ThrPro: 3.672 ± 0.438
2.157ThrGln: 2.157 ± 0.378
3.206ThrArg: 3.206 ± 0.642
4.255ThrSer: 4.255 ± 0.694
5.771ThrThr: 5.771 ± 0.878
4.605ThrVal: 4.605 ± 0.479
1.049ThrTrp: 1.049 ± 0.275
2.099ThrTyr: 2.099 ± 0.347
0.0ThrXaa: 0.0 ± 0.0
Val
6.47ValAla: 6.47 ± 0.713
0.291ValCys: 0.291 ± 0.148
4.838ValAsp: 4.838 ± 0.381
4.43ValGlu: 4.43 ± 0.664
2.099ValPhe: 2.099 ± 0.375
5.363ValGly: 5.363 ± 0.725
1.049ValHis: 1.049 ± 0.217
2.39ValIle: 2.39 ± 0.365
2.507ValLys: 2.507 ± 0.456
5.305ValLeu: 5.305 ± 0.515
1.749ValMet: 1.749 ± 0.311
3.089ValAsn: 3.089 ± 0.337
4.139ValPro: 4.139 ± 0.406
3.556ValGln: 3.556 ± 0.494
4.139ValArg: 4.139 ± 0.402
4.722ValSer: 4.722 ± 0.375
3.906ValThr: 3.906 ± 0.401
4.08ValVal: 4.08 ± 0.479
1.108ValTrp: 1.108 ± 0.395
2.273ValTyr: 2.273 ± 0.318
0.0ValXaa: 0.0 ± 0.0
Trp
1.399TrpAla: 1.399 ± 0.258
0.058TrpCys: 0.058 ± 0.064
1.807TrpAsp: 1.807 ± 0.33
0.7TrpGlu: 0.7 ± 0.234
0.641TrpPhe: 0.641 ± 0.234
1.516TrpGly: 1.516 ± 0.367
0.117TrpHis: 0.117 ± 0.095
1.341TrpIle: 1.341 ± 0.258
0.7TrpLys: 0.7 ± 0.182
1.399TrpLeu: 1.399 ± 0.296
0.466TrpMet: 0.466 ± 0.125
1.224TrpAsn: 1.224 ± 0.256
0.408TrpPro: 0.408 ± 0.204
1.224TrpGln: 1.224 ± 0.201
1.399TrpArg: 1.399 ± 0.27
1.69TrpSer: 1.69 ± 0.396
0.874TrpThr: 0.874 ± 0.27
1.341TrpVal: 1.341 ± 0.326
0.117TrpTrp: 0.117 ± 0.062
0.583TrpTyr: 0.583 ± 0.169
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.448TyrAla: 2.448 ± 0.33
0.408TyrCys: 0.408 ± 0.172
2.507TyrAsp: 2.507 ± 0.396
1.69TyrGlu: 1.69 ± 0.195
0.991TyrPhe: 0.991 ± 0.19
2.448TyrGly: 2.448 ± 0.393
0.7TyrHis: 0.7 ± 0.275
1.108TyrIle: 1.108 ± 0.387
1.049TyrLys: 1.049 ± 0.277
2.39TyrLeu: 2.39 ± 0.302
0.7TyrMet: 0.7 ± 0.175
1.574TyrAsn: 1.574 ± 0.429
0.933TyrPro: 0.933 ± 0.173
1.399TyrGln: 1.399 ± 0.243
2.099TyrArg: 2.099 ± 0.385
2.273TyrSer: 2.273 ± 0.3
1.632TyrThr: 1.632 ± 0.273
2.681TyrVal: 2.681 ± 0.348
0.525TyrTrp: 0.525 ± 0.156
0.816TyrTyr: 0.816 ± 0.259
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (17156 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski