Amino acid dipepetide frequency for Microbacterium phage BeeBee8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.661AlaAla: 10.661 ± 1.071
0.68AlaCys: 0.68 ± 0.275
5.141AlaAsp: 5.141 ± 0.819
5.973AlaGlu: 5.973 ± 0.811
3.024AlaPhe: 3.024 ± 0.561
7.334AlaGly: 7.334 ± 1.12
2.193AlaHis: 2.193 ± 0.371
5.595AlaIle: 5.595 ± 0.93
5.519AlaLys: 5.519 ± 0.739
9.224AlaLeu: 9.224 ± 1.113
1.815AlaMet: 1.815 ± 0.432
2.873AlaAsn: 2.873 ± 0.459
3.932AlaPro: 3.932 ± 0.622
3.856AlaGln: 3.856 ± 0.555
5.822AlaArg: 5.822 ± 0.72
5.897AlaSer: 5.897 ± 0.822
7.712AlaThr: 7.712 ± 0.795
7.183AlaVal: 7.183 ± 0.706
1.89AlaTrp: 1.89 ± 0.395
2.949AlaTyr: 2.949 ± 0.694
0.0AlaXaa: 0.0 ± 0.0
Cys
0.529CysAla: 0.529 ± 0.186
0.0CysCys: 0.0 ± 0.0
0.302CysAsp: 0.302 ± 0.154
0.302CysGlu: 0.302 ± 0.144
0.076CysPhe: 0.076 ± 0.071
0.68CysGly: 0.68 ± 0.243
0.378CysHis: 0.378 ± 0.172
0.076CysIle: 0.076 ± 0.076
0.454CysLys: 0.454 ± 0.21
0.605CysLeu: 0.605 ± 0.264
0.0CysMet: 0.0 ± 0.0
0.302CysAsn: 0.302 ± 0.152
0.454CysPro: 0.454 ± 0.252
0.0CysGln: 0.0 ± 0.0
0.378CysArg: 0.378 ± 0.145
0.227CysSer: 0.227 ± 0.138
0.378CysThr: 0.378 ± 0.157
0.378CysVal: 0.378 ± 0.138
0.227CysTrp: 0.227 ± 0.12
0.151CysTyr: 0.151 ± 0.107
0.0CysXaa: 0.0 ± 0.0
Asp
5.671AspAla: 5.671 ± 0.695
0.68AspCys: 0.68 ± 0.211
5.066AspAsp: 5.066 ± 0.859
5.897AspGlu: 5.897 ± 1.318
2.646AspPhe: 2.646 ± 0.474
4.158AspGly: 4.158 ± 0.623
1.21AspHis: 1.21 ± 0.282
3.629AspIle: 3.629 ± 0.524
2.722AspLys: 2.722 ± 0.444
5.368AspLeu: 5.368 ± 0.685
2.041AspMet: 2.041 ± 0.484
2.117AspAsn: 2.117 ± 0.458
4.461AspPro: 4.461 ± 0.663
2.117AspGln: 2.117 ± 0.327
3.78AspArg: 3.78 ± 0.582
2.798AspSer: 2.798 ± 0.492
2.949AspThr: 2.949 ± 0.551
4.612AspVal: 4.612 ± 0.646
1.739AspTrp: 1.739 ± 0.336
2.344AspTyr: 2.344 ± 0.479
0.0AspXaa: 0.0 ± 0.0
Glu
7.334GluAla: 7.334 ± 1.007
0.076GluCys: 0.076 ± 0.074
4.612GluAsp: 4.612 ± 1.089
5.293GluGlu: 5.293 ± 1.4
2.344GluPhe: 2.344 ± 0.42
4.007GluGly: 4.007 ± 0.665
0.68GluHis: 0.68 ± 0.173
2.722GluIle: 2.722 ± 0.495
2.571GluLys: 2.571 ± 0.536
5.217GluLeu: 5.217 ± 0.708
2.419GluMet: 2.419 ± 0.393
2.041GluAsn: 2.041 ± 0.463
2.495GluPro: 2.495 ± 0.492
3.024GluGln: 3.024 ± 0.592
3.705GluArg: 3.705 ± 0.676
2.268GluSer: 2.268 ± 0.521
3.176GluThr: 3.176 ± 0.498
4.688GluVal: 4.688 ± 0.528
1.361GluTrp: 1.361 ± 0.368
2.117GluTyr: 2.117 ± 0.314
0.0GluXaa: 0.0 ± 0.0
Phe
2.646PheAla: 2.646 ± 0.405
0.076PheCys: 0.076 ± 0.063
2.571PheAsp: 2.571 ± 0.4
1.815PheGlu: 1.815 ± 0.37
0.756PhePhe: 0.756 ± 0.234
2.949PheGly: 2.949 ± 0.456
0.907PheHis: 0.907 ± 0.257
1.512PheIle: 1.512 ± 0.342
1.815PheLys: 1.815 ± 0.299
2.117PheLeu: 2.117 ± 0.636
0.983PheMet: 0.983 ± 0.323
1.059PheAsn: 1.059 ± 0.242
1.134PhePro: 1.134 ± 0.27
1.588PheGln: 1.588 ± 0.337
2.117PheArg: 2.117 ± 0.441
2.041PheSer: 2.041 ± 0.304
2.646PheThr: 2.646 ± 0.41
1.512PheVal: 1.512 ± 0.424
0.907PheTrp: 0.907 ± 0.316
0.756PheTyr: 0.756 ± 0.207
0.0PheXaa: 0.0 ± 0.0
Gly
6.2GlyAla: 6.2 ± 0.753
0.454GlyCys: 0.454 ± 0.216
4.612GlyAsp: 4.612 ± 0.479
3.478GlyGlu: 3.478 ± 0.503
3.024GlyPhe: 3.024 ± 0.434
5.519GlyGly: 5.519 ± 0.75
1.815GlyHis: 1.815 ± 0.367
4.839GlyIle: 4.839 ± 0.839
5.671GlyLys: 5.671 ± 0.719
6.427GlyLeu: 6.427 ± 0.867
1.663GlyMet: 1.663 ± 0.237
2.193GlyAsn: 2.193 ± 0.451
2.722GlyPro: 2.722 ± 0.405
3.1GlyGln: 3.1 ± 0.508
4.537GlyArg: 4.537 ± 0.788
5.066GlySer: 5.066 ± 0.581
5.671GlyThr: 5.671 ± 0.809
6.351GlyVal: 6.351 ± 0.822
1.512GlyTrp: 1.512 ± 0.248
1.966GlyTyr: 1.966 ± 0.371
0.0GlyXaa: 0.0 ± 0.0
His
1.285HisAla: 1.285 ± 0.339
0.227HisCys: 0.227 ± 0.122
0.983HisAsp: 0.983 ± 0.294
1.361HisGlu: 1.361 ± 0.326
0.454HisPhe: 0.454 ± 0.166
2.117HisGly: 2.117 ± 0.448
0.302HisHis: 0.302 ± 0.167
1.059HisIle: 1.059 ± 0.252
1.285HisLys: 1.285 ± 0.278
1.285HisLeu: 1.285 ± 0.347
0.302HisMet: 0.302 ± 0.14
0.454HisAsn: 0.454 ± 0.233
0.832HisPro: 0.832 ± 0.251
0.529HisGln: 0.529 ± 0.166
0.983HisArg: 0.983 ± 0.246
1.361HisSer: 1.361 ± 0.348
1.437HisThr: 1.437 ± 0.321
1.285HisVal: 1.285 ± 0.302
0.529HisTrp: 0.529 ± 0.202
1.059HisTyr: 1.059 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
5.293IleAla: 5.293 ± 0.544
0.378IleCys: 0.378 ± 0.161
5.141IleAsp: 5.141 ± 0.68
3.327IleGlu: 3.327 ± 0.571
0.907IlePhe: 0.907 ± 0.301
3.629IleGly: 3.629 ± 0.52
0.983IleHis: 0.983 ± 0.228
3.024IleIle: 3.024 ± 0.841
1.89IleLys: 1.89 ± 0.437
3.1IleLeu: 3.1 ± 0.55
1.361IleMet: 1.361 ± 0.283
2.419IleAsn: 2.419 ± 0.54
3.024IlePro: 3.024 ± 0.504
2.798IleGln: 2.798 ± 0.489
2.495IleArg: 2.495 ± 0.476
2.722IleSer: 2.722 ± 0.405
3.554IleThr: 3.554 ± 0.876
3.024IleVal: 3.024 ± 0.416
0.68IleTrp: 0.68 ± 0.234
1.739IleTyr: 1.739 ± 0.389
0.0IleXaa: 0.0 ± 0.0
Lys
4.99LysAla: 4.99 ± 0.839
0.302LysCys: 0.302 ± 0.151
2.571LysAsp: 2.571 ± 0.451
3.327LysGlu: 3.327 ± 0.546
0.983LysPhe: 0.983 ± 0.257
3.629LysGly: 3.629 ± 0.484
1.059LysHis: 1.059 ± 0.328
1.739LysIle: 1.739 ± 0.393
2.495LysLys: 2.495 ± 0.717
4.612LysLeu: 4.612 ± 0.517
1.285LysMet: 1.285 ± 0.267
1.588LysAsn: 1.588 ± 0.429
3.629LysPro: 3.629 ± 0.736
1.739LysGln: 1.739 ± 0.388
2.873LysArg: 2.873 ± 0.549
2.495LysSer: 2.495 ± 0.346
2.873LysThr: 2.873 ± 0.378
4.234LysVal: 4.234 ± 0.641
0.907LysTrp: 0.907 ± 0.271
1.21LysTyr: 1.21 ± 0.258
0.0LysXaa: 0.0 ± 0.0
Leu
8.619LeuAla: 8.619 ± 0.756
0.68LeuCys: 0.68 ± 0.22
5.293LeuAsp: 5.293 ± 0.486
5.293LeuGlu: 5.293 ± 0.57
2.268LeuPhe: 2.268 ± 0.319
6.502LeuGly: 6.502 ± 0.779
1.21LeuHis: 1.21 ± 0.299
5.141LeuIle: 5.141 ± 0.962
4.385LeuLys: 4.385 ± 0.621
7.561LeuLeu: 7.561 ± 0.809
1.815LeuMet: 1.815 ± 0.361
3.478LeuAsn: 3.478 ± 0.415
3.78LeuPro: 3.78 ± 0.525
3.1LeuGln: 3.1 ± 0.51
5.066LeuArg: 5.066 ± 0.606
4.763LeuSer: 4.763 ± 0.454
5.671LeuThr: 5.671 ± 0.698
6.427LeuVal: 6.427 ± 0.731
1.512LeuTrp: 1.512 ± 0.328
2.117LeuTyr: 2.117 ± 0.361
0.0LeuXaa: 0.0 ± 0.0
Met
3.176MetAla: 3.176 ± 0.446
0.151MetCys: 0.151 ± 0.111
1.588MetAsp: 1.588 ± 0.305
1.21MetGlu: 1.21 ± 0.262
0.605MetPhe: 0.605 ± 0.148
1.361MetGly: 1.361 ± 0.383
0.227MetHis: 0.227 ± 0.132
0.832MetIle: 0.832 ± 0.267
0.605MetLys: 0.605 ± 0.235
2.571MetLeu: 2.571 ± 0.454
0.68MetMet: 0.68 ± 0.184
0.832MetAsn: 0.832 ± 0.239
1.437MetPro: 1.437 ± 0.273
0.983MetGln: 0.983 ± 0.274
1.059MetArg: 1.059 ± 0.233
2.193MetSer: 2.193 ± 0.4
2.344MetThr: 2.344 ± 0.305
1.437MetVal: 1.437 ± 0.286
0.378MetTrp: 0.378 ± 0.165
0.227MetTyr: 0.227 ± 0.117
0.0MetXaa: 0.0 ± 0.0
Asn
3.705AsnAla: 3.705 ± 0.687
0.151AsnCys: 0.151 ± 0.104
2.419AsnAsp: 2.419 ± 0.404
1.815AsnGlu: 1.815 ± 0.406
1.059AsnPhe: 1.059 ± 0.285
2.873AsnGly: 2.873 ± 0.287
0.605AsnHis: 0.605 ± 0.252
1.588AsnIle: 1.588 ± 0.428
1.437AsnLys: 1.437 ± 0.321
3.327AsnLeu: 3.327 ± 0.448
0.378AsnMet: 0.378 ± 0.161
1.285AsnAsn: 1.285 ± 0.32
1.966AsnPro: 1.966 ± 0.372
1.512AsnGln: 1.512 ± 0.326
2.117AsnArg: 2.117 ± 0.494
1.815AsnSer: 1.815 ± 0.442
1.437AsnThr: 1.437 ± 0.322
2.495AsnVal: 2.495 ± 0.369
0.605AsnTrp: 0.605 ± 0.235
1.285AsnTyr: 1.285 ± 0.316
0.0AsnXaa: 0.0 ± 0.0
Pro
6.049ProAla: 6.049 ± 0.761
0.0ProCys: 0.0 ± 0.0
3.251ProAsp: 3.251 ± 0.644
2.949ProGlu: 2.949 ± 0.581
1.512ProPhe: 1.512 ± 0.302
4.083ProGly: 4.083 ± 0.573
0.605ProHis: 0.605 ± 0.199
1.966ProIle: 1.966 ± 0.272
2.193ProLys: 2.193 ± 0.379
3.402ProLeu: 3.402 ± 0.408
0.832ProMet: 0.832 ± 0.248
1.21ProAsn: 1.21 ± 0.297
1.21ProPro: 1.21 ± 0.47
2.193ProGln: 2.193 ± 0.431
2.193ProArg: 2.193 ± 0.526
2.949ProSer: 2.949 ± 0.432
3.554ProThr: 3.554 ± 0.641
4.763ProVal: 4.763 ± 0.606
0.983ProTrp: 0.983 ± 0.273
1.134ProTyr: 1.134 ± 0.341
0.0ProXaa: 0.0 ± 0.0
Gln
4.688GlnAla: 4.688 ± 0.946
0.151GlnCys: 0.151 ± 0.106
1.89GlnAsp: 1.89 ± 0.406
2.646GlnGlu: 2.646 ± 0.559
1.361GlnPhe: 1.361 ± 0.311
3.176GlnGly: 3.176 ± 0.44
1.134GlnHis: 1.134 ± 0.298
1.739GlnIle: 1.739 ± 0.342
0.907GlnLys: 0.907 ± 0.218
3.478GlnLeu: 3.478 ± 0.653
0.68GlnMet: 0.68 ± 0.183
1.588GlnAsn: 1.588 ± 0.448
1.512GlnPro: 1.512 ± 0.336
2.117GlnGln: 2.117 ± 0.535
2.419GlnArg: 2.419 ± 0.408
1.966GlnSer: 1.966 ± 0.327
3.176GlnThr: 3.176 ± 0.463
2.949GlnVal: 2.949 ± 0.46
1.437GlnTrp: 1.437 ± 0.314
1.361GlnTyr: 1.361 ± 0.31
0.0GlnXaa: 0.0 ± 0.0
Arg
5.746ArgAla: 5.746 ± 0.716
0.302ArgCys: 0.302 ± 0.132
4.31ArgAsp: 4.31 ± 0.567
3.251ArgGlu: 3.251 ± 0.625
2.041ArgPhe: 2.041 ± 0.352
3.629ArgGly: 3.629 ± 0.466
0.907ArgHis: 0.907 ± 0.227
3.024ArgIle: 3.024 ± 0.497
4.007ArgLys: 4.007 ± 0.769
5.141ArgLeu: 5.141 ± 0.708
1.663ArgMet: 1.663 ± 0.294
1.663ArgAsn: 1.663 ± 0.412
2.646ArgPro: 2.646 ± 0.442
1.739ArgGln: 1.739 ± 0.394
3.554ArgArg: 3.554 ± 0.66
3.478ArgSer: 3.478 ± 0.447
3.327ArgThr: 3.327 ± 0.549
3.856ArgVal: 3.856 ± 0.664
0.756ArgTrp: 0.756 ± 0.233
1.739ArgTyr: 1.739 ± 0.394
0.0ArgXaa: 0.0 ± 0.0
Ser
5.217SerAla: 5.217 ± 0.707
0.076SerCys: 0.076 ± 0.093
3.478SerAsp: 3.478 ± 0.459
2.571SerGlu: 2.571 ± 0.473
2.117SerPhe: 2.117 ± 0.464
5.141SerGly: 5.141 ± 0.786
0.832SerHis: 0.832 ± 0.23
3.176SerIle: 3.176 ± 0.569
2.419SerLys: 2.419 ± 0.364
5.595SerLeu: 5.595 ± 0.494
2.117SerMet: 2.117 ± 0.478
2.646SerAsn: 2.646 ± 0.525
2.117SerPro: 2.117 ± 0.345
2.646SerGln: 2.646 ± 0.576
2.873SerArg: 2.873 ± 0.417
3.478SerSer: 3.478 ± 0.499
3.932SerThr: 3.932 ± 0.584
4.461SerVal: 4.461 ± 0.621
1.134SerTrp: 1.134 ± 0.324
1.966SerTyr: 1.966 ± 0.32
0.0SerXaa: 0.0 ± 0.0
Thr
5.973ThrAla: 5.973 ± 0.916
0.227ThrCys: 0.227 ± 0.123
4.158ThrAsp: 4.158 ± 0.57
3.251ThrGlu: 3.251 ± 0.532
2.949ThrPhe: 2.949 ± 0.564
5.897ThrGly: 5.897 ± 0.693
1.21ThrHis: 1.21 ± 0.3
3.327ThrIle: 3.327 ± 0.47
2.419ThrLys: 2.419 ± 0.456
5.671ThrLeu: 5.671 ± 0.755
1.361ThrMet: 1.361 ± 0.3
1.059ThrAsn: 1.059 ± 0.276
3.478ThrPro: 3.478 ± 0.389
2.495ThrGln: 2.495 ± 0.385
3.629ThrArg: 3.629 ± 0.475
4.31ThrSer: 4.31 ± 0.62
5.066ThrThr: 5.066 ± 0.694
5.595ThrVal: 5.595 ± 0.639
1.89ThrTrp: 1.89 ± 0.326
2.419ThrTyr: 2.419 ± 0.504
0.0ThrXaa: 0.0 ± 0.0
Val
7.939ValAla: 7.939 ± 0.711
0.605ValCys: 0.605 ± 0.233
4.385ValAsp: 4.385 ± 0.768
5.368ValGlu: 5.368 ± 0.697
1.815ValPhe: 1.815 ± 0.362
5.444ValGly: 5.444 ± 0.836
1.663ValHis: 1.663 ± 0.376
3.78ValIle: 3.78 ± 0.521
3.932ValLys: 3.932 ± 0.495
5.519ValLeu: 5.519 ± 0.663
1.437ValMet: 1.437 ± 0.352
3.024ValAsn: 3.024 ± 0.497
3.705ValPro: 3.705 ± 0.616
3.327ValGln: 3.327 ± 0.626
4.31ValArg: 4.31 ± 0.59
4.688ValSer: 4.688 ± 0.54
4.234ValThr: 4.234 ± 0.63
5.444ValVal: 5.444 ± 0.618
1.739ValTrp: 1.739 ± 0.37
2.419ValTyr: 2.419 ± 0.415
0.0ValXaa: 0.0 ± 0.0
Trp
0.983TrpAla: 0.983 ± 0.268
0.227TrpCys: 0.227 ± 0.135
1.663TrpAsp: 1.663 ± 0.317
1.134TrpGlu: 1.134 ± 0.29
1.134TrpPhe: 1.134 ± 0.252
1.739TrpGly: 1.739 ± 0.333
0.756TrpHis: 0.756 ± 0.215
0.983TrpIle: 0.983 ± 0.246
0.68TrpLys: 0.68 ± 0.217
1.966TrpLeu: 1.966 ± 0.343
0.302TrpMet: 0.302 ± 0.153
0.68TrpAsn: 0.68 ± 0.232
1.059TrpPro: 1.059 ± 0.344
0.832TrpGln: 0.832 ± 0.254
1.437TrpArg: 1.437 ± 0.411
1.059TrpSer: 1.059 ± 0.317
1.437TrpThr: 1.437 ± 0.321
1.739TrpVal: 1.739 ± 0.406
0.68TrpTrp: 0.68 ± 0.309
0.907TrpTyr: 0.907 ± 0.247
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.571TyrAla: 2.571 ± 0.468
0.454TyrCys: 0.454 ± 0.152
2.646TyrAsp: 2.646 ± 0.351
1.89TyrGlu: 1.89 ± 0.477
0.832TyrPhe: 0.832 ± 0.221
2.949TyrGly: 2.949 ± 0.448
0.454TyrHis: 0.454 ± 0.192
1.739TyrIle: 1.739 ± 0.368
1.134TyrLys: 1.134 ± 0.336
2.495TyrLeu: 2.495 ± 0.392
0.68TyrMet: 0.68 ± 0.211
1.361TyrAsn: 1.361 ± 0.451
1.437TyrPro: 1.437 ± 0.318
0.68TyrGln: 0.68 ± 0.245
1.437TyrArg: 1.437 ± 0.316
2.419TyrSer: 2.419 ± 0.405
1.815TyrThr: 1.815 ± 0.376
2.344TyrVal: 2.344 ± 0.448
0.529TyrTrp: 0.529 ± 0.206
1.739TyrTyr: 1.739 ± 0.479
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (13227 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski