Amino acid dipeptide frequency for Bacillus phage 031MP003

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
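
A minimal sketch of how such frequencies could be computed. It assumes one-letter sequences as input (the table uses three-letter codes), overlapping pairs counted within each protein only, and normalisation by the total number of pairs; the function name is hypothetical and the exact procedure used for this page is not stated.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    `proteins` is an iterable of one-letter amino acid strings.
    Pairs are counted inside each protein, never across protein
    boundaries (an assumption, not confirmed by the source page).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "MKKA" gives MK, KK, KA and "AKK" gives AK, KK.
freqs = dipeptide_frequencies(["MKKA", "AKK"])
```

For the toy input, KK accounts for two of the five pairs, so it gets a frequency of 400‰.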

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
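
The comparison against the uniform expectation can be sketched as follows. This assumes the colouring threshold is simply the uniform baseline (2.5‰ for 20 amino acids); the function names are hypothetical and the page does not say whether any tolerance is applied.

```python
STANDARD_AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def uniform_expectation(alphabet_size=len(STANDARD_AMINO_ACIDS)):
    """Expected per-mille frequency if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, expected_permille=None):
    """Label a dipeptide as over- or underrepresented (assumed rule)."""
    if expected_permille is None:
        expected_permille = uniform_expectation()
    if observed_permille > expected_permille:
        return "over"   # would be shown in red
    if observed_permille < expected_permille:
        return "under"  # would be shown in blue
    return "as expected"


# Two values taken from the table: AlaAla and GlyPro.
labels = {"AlaAla": classify(6.105), "GlyPro": classify(0.057)}
```

With 21 symbols (including Xaa), `uniform_expectation(21)` gives roughly 2.27‰ instead of 2.5‰.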

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
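
As a worked conversion, taking the AlaAla value from the table and the uniform expectation for 400 dipeptides:

```latex
f_{\mathrm{AlaAla}} = 6.105 \times 10^{-3} = 0.006105 = 0.6105\,\%
\qquad
f_{\mathrm{uniform}} = \frac{1}{400} = 2.5 \times 10^{-3} = 0.25\,\%
```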

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.105 ± 0.687
AlaCys: 0.456 ± 0.129
AlaAsp: 4.564 ± 0.622
AlaGlu: 6.333 ± 0.681
AlaPhe: 2.738 ± 0.357
AlaGly: 3.937 ± 0.496
AlaHis: 0.513 ± 0.165
AlaIle: 4.507 ± 0.601
AlaLys: 5.135 ± 0.549
AlaLeu: 6.675 ± 0.884
AlaMet: 2.111 ± 0.291
AlaAsn: 4.393 ± 0.531
AlaPro: 1.826 ± 0.226
AlaGln: 3.024 ± 0.542
AlaArg: 2.796 ± 0.377
AlaSer: 3.024 ± 0.372
AlaThr: 4.108 ± 0.538
AlaVal: 6.047 ± 0.884
AlaTrp: 0.97 ± 0.312
AlaTyr: 2.111 ± 0.388
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.342 ± 0.135
CysCys: 0.114 ± 0.09
CysAsp: 0.228 ± 0.118
CysGlu: 0.685 ± 0.221
CysPhe: 0.285 ± 0.139
CysGly: 0.513 ± 0.2
CysHis: 0.285 ± 0.122
CysIle: 0.228 ± 0.11
CysLys: 0.399 ± 0.161
CysLeu: 0.628 ± 0.212
CysMet: 0.228 ± 0.102
CysAsn: 0.342 ± 0.164
CysPro: 0.342 ± 0.13
CysGln: 0.342 ± 0.144
CysArg: 0.342 ± 0.137
CysSer: 0.114 ± 0.08
CysThr: 0.399 ± 0.234
CysVal: 0.342 ± 0.111
CysTrp: 0.057 ± 0.05
CysTyr: 0.057 ± 0.051
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.937 ± 0.383
AspCys: 0.571 ± 0.279
AspAsp: 3.366 ± 0.471
AspGlu: 5.42 ± 0.727
AspPhe: 2.396 ± 0.372
AspGly: 4.393 ± 0.49
AspHis: 1.027 ± 0.212
AspIle: 4.906 ± 0.584
AspLys: 3.651 ± 0.563
AspLeu: 4.336 ± 0.472
AspMet: 2.168 ± 0.355
AspAsn: 3.081 ± 0.387
AspPro: 2.225 ± 0.441
AspGln: 1.712 ± 0.286
AspArg: 2.624 ± 0.341
AspSer: 2.624 ± 0.317
AspThr: 3.309 ± 0.458
AspVal: 4.45 ± 0.468
AspTrp: 0.742 ± 0.212
AspTyr: 2.339 ± 0.325
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.534 ± 0.634
GluCys: 0.97 ± 0.261
GluAsp: 4.678 ± 0.683
GluGlu: 9.185 ± 1.143
GluPhe: 2.738 ± 0.465
GluGly: 6.447 ± 0.636
GluHis: 1.255 ± 0.267
GluIle: 4.849 ± 0.508
GluLys: 5.819 ± 0.883
GluLeu: 7.246 ± 0.659
GluMet: 2.738 ± 0.492
GluAsn: 4.279 ± 0.515
GluPro: 1.826 ± 0.39
GluGln: 2.738 ± 0.517
GluArg: 5.078 ± 0.58
GluSer: 4.849 ± 0.57
GluThr: 4.45 ± 0.519
GluVal: 4.963 ± 0.619
GluTrp: 1.255 ± 0.252
GluTyr: 3.252 ± 0.398
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.168 ± 0.353
PheCys: 0.228 ± 0.103
PheAsp: 2.282 ± 0.393
PheGlu: 2.681 ± 0.336
PhePhe: 1.597 ± 0.343
PheGly: 3.138 ± 0.39
PheHis: 1.198 ± 0.269
PheIle: 2.225 ± 0.289
PheLys: 2.51 ± 0.345
PheLeu: 2.681 ± 0.439
PheMet: 1.027 ± 0.25
PheAsn: 1.826 ± 0.401
PhePro: 1.54 ± 0.283
PheGln: 1.483 ± 0.34
PheArg: 1.826 ± 0.312
PheSer: 2.054 ± 0.374
PheThr: 3.366 ± 0.389
PheVal: 2.111 ± 0.292
PheTrp: 0.399 ± 0.133
PheTyr: 1.769 ± 0.327
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.051 ± 0.687
GlyCys: 0.456 ± 0.159
GlyAsp: 4.45 ± 0.52
GlyGlu: 6.105 ± 0.678
GlyPhe: 3.366 ± 0.491
GlyGly: 5.078 ± 0.713
GlyHis: 1.255 ± 0.222
GlyIle: 4.735 ± 0.627
GlyLys: 4.849 ± 0.548
GlyLeu: 5.591 ± 0.839
GlyMet: 2.396 ± 0.392
GlyAsn: 3.822 ± 0.632
GlyPro: 0.057 ± 0.05
GlyGln: 2.054 ± 0.314
GlyArg: 3.48 ± 0.411
GlySer: 4.279 ± 0.661
GlyThr: 3.994 ± 0.574
GlyVal: 5.819 ± 0.539
GlyTrp: 0.799 ± 0.233
GlyTyr: 3.195 ± 0.446
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.799 ± 0.202
HisCys: 0.114 ± 0.089
HisAsp: 0.799 ± 0.17
HisGlu: 0.913 ± 0.224
HisPhe: 0.685 ± 0.173
HisGly: 0.97 ± 0.256
HisHis: 0.571 ± 0.24
HisIle: 1.426 ± 0.256
HisLys: 1.597 ± 0.244
HisLeu: 0.97 ± 0.271
HisMet: 0.456 ± 0.152
HisAsn: 0.685 ± 0.192
HisPro: 0.799 ± 0.224
HisGln: 0.342 ± 0.15
HisArg: 0.913 ± 0.214
HisSer: 0.571 ± 0.163
HisThr: 1.141 ± 0.249
HisVal: 1.312 ± 0.234
HisTrp: 0.342 ± 0.115
HisTyr: 0.799 ± 0.188
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.765 ± 0.534
IleCys: 0.342 ± 0.138
IleAsp: 4.735 ± 0.524
IleGlu: 6.162 ± 0.768
IlePhe: 2.168 ± 0.35
IleGly: 4.051 ± 0.519
IleHis: 1.483 ± 0.267
IleIle: 4.165 ± 0.466
IleLys: 5.876 ± 0.759
IleLeu: 4.45 ± 0.487
IleMet: 1.826 ± 0.362
IleAsn: 3.081 ± 0.464
IlePro: 2.567 ± 0.318
IleGln: 2.91 ± 0.333
IleArg: 3.48 ± 0.347
IleSer: 3.651 ± 0.489
IleThr: 4.792 ± 0.563
IleVal: 3.822 ± 0.407
IleTrp: 0.571 ± 0.216
IleTyr: 2.111 ± 0.333
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.618 ± 0.621
LysCys: 0.513 ± 0.178
LysAsp: 3.48 ± 0.447
LysGlu: 6.789 ± 0.901
LysPhe: 2.453 ± 0.351
LysGly: 5.933 ± 0.682
LysHis: 1.255 ± 0.225
LysIle: 3.937 ± 0.389
LysLys: 7.759 ± 1.073
LysLeu: 4.963 ± 0.736
LysMet: 1.94 ± 0.324
LysAsn: 4.564 ± 0.486
LysPro: 2.738 ± 0.506
LysGln: 2.567 ± 0.361
LysArg: 4.051 ± 0.568
LysSer: 3.252 ± 0.618
LysThr: 5.705 ± 0.707
LysVal: 5.021 ± 0.616
LysTrp: 0.97 ± 0.249
LysTyr: 3.423 ± 0.483
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.306 ± 0.522
LeuCys: 0.285 ± 0.154
LeuAsp: 5.42 ± 0.582
LeuGlu: 6.675 ± 0.783
LeuPhe: 2.168 ± 0.264
LeuGly: 5.363 ± 0.657
LeuHis: 0.571 ± 0.179
LeuIle: 4.507 ± 0.464
LeuLys: 6.561 ± 0.764
LeuLeu: 4.621 ± 0.488
LeuMet: 2.339 ± 0.323
LeuAsn: 4.336 ± 0.49
LeuPro: 2.282 ± 0.374
LeuGln: 3.081 ± 0.394
LeuArg: 3.366 ± 0.419
LeuSer: 5.249 ± 0.588
LeuThr: 5.42 ± 0.452
LeuVal: 4.393 ± 0.518
LeuTrp: 0.685 ± 0.218
LeuTyr: 2.796 ± 0.377
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.396 ± 0.393
MetCys: 0.057 ± 0.051
MetAsp: 1.654 ± 0.227
MetGlu: 1.597 ± 0.307
MetPhe: 0.856 ± 0.255
MetGly: 1.597 ± 0.419
MetHis: 0.171 ± 0.103
MetIle: 2.339 ± 0.444
MetLys: 2.796 ± 0.424
MetLeu: 1.94 ± 0.34
MetMet: 0.799 ± 0.258
MetAsn: 1.883 ± 0.367
MetPro: 1.312 ± 0.237
MetGln: 0.742 ± 0.199
MetArg: 1.826 ± 0.324
MetSer: 1.597 ± 0.261
MetThr: 2.339 ± 0.344
MetVal: 1.141 ± 0.25
MetTrp: 0.685 ± 0.22
MetTyr: 0.628 ± 0.204
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.937 ± 0.707
AsnCys: 0.456 ± 0.204
AsnAsp: 3.195 ± 0.427
AsnGlu: 3.48 ± 0.396
AsnPhe: 1.769 ± 0.256
AsnGly: 4.507 ± 0.565
AsnHis: 1.027 ± 0.208
AsnIle: 3.423 ± 0.483
AsnLys: 3.309 ± 0.474
AsnLeu: 3.994 ± 0.46
AsnMet: 0.913 ± 0.254
AsnAsn: 2.453 ± 0.336
AsnPro: 2.339 ± 0.335
AsnGln: 1.654 ± 0.285
AsnArg: 2.453 ± 0.365
AsnSer: 3.309 ± 0.389
AsnThr: 2.91 ± 0.725
AsnVal: 3.594 ± 0.377
AsnTrp: 0.97 ± 0.198
AsnTyr: 1.94 ± 0.327
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.225 ± 0.319
ProCys: 0.342 ± 0.151
ProAsp: 1.654 ± 0.306
ProGlu: 3.252 ± 0.469
ProPhe: 1.597 ± 0.361
ProGly: 0.114 ± 0.072
ProHis: 0.399 ± 0.201
ProIle: 1.654 ± 0.253
ProLys: 2.282 ± 0.439
ProLeu: 2.624 ± 0.375
ProMet: 0.628 ± 0.153
ProAsn: 1.54 ± 0.241
ProPro: 1.54 ± 0.394
ProGln: 1.54 ± 0.334
ProArg: 0.913 ± 0.218
ProSer: 2.396 ± 0.378
ProThr: 2.339 ± 0.41
ProVal: 1.94 ± 0.327
ProTrp: 0.171 ± 0.105
ProTyr: 1.084 ± 0.236
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.423 ± 0.495
GlnCys: 0.171 ± 0.104
GlnAsp: 1.997 ± 0.31
GlnGlu: 3.765 ± 0.604
GlnPhe: 0.97 ± 0.191
GlnGly: 2.339 ± 0.394
GlnHis: 0.285 ± 0.114
GlnIle: 2.111 ± 0.278
GlnLys: 3.48 ± 0.454
GlnLeu: 2.681 ± 0.379
GlnMet: 1.198 ± 0.282
GlnAsn: 1.997 ± 0.355
GlnPro: 1.141 ± 0.228
GlnGln: 1.426 ± 0.374
GlnArg: 1.54 ± 0.298
GlnSer: 1.255 ± 0.268
GlnThr: 1.94 ± 0.292
GlnVal: 2.453 ± 0.363
GlnTrp: 0.513 ± 0.16
GlnTyr: 1.312 ± 0.249
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.537 ± 0.459
ArgCys: 0.057 ± 0.051
ArgAsp: 2.967 ± 0.405
ArgGlu: 4.222 ± 0.527
ArgPhe: 1.997 ± 0.329
ArgGly: 3.024 ± 0.428
ArgHis: 0.799 ± 0.232
ArgIle: 3.708 ± 0.398
ArgLys: 3.88 ± 0.427
ArgLeu: 3.765 ± 0.532
ArgMet: 1.597 ± 0.321
ArgAsn: 2.168 ± 0.389
ArgPro: 1.369 ± 0.219
ArgGln: 2.225 ± 0.363
ArgArg: 3.366 ± 0.634
ArgSer: 2.168 ± 0.319
ArgThr: 3.195 ± 0.405
ArgVal: 3.195 ± 0.423
ArgTrp: 0.456 ± 0.144
ArgTyr: 2.453 ± 0.406
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.537 ± 0.58
SerCys: 0.057 ± 0.052
SerAsp: 3.195 ± 0.366
SerGlu: 3.651 ± 0.394
SerPhe: 2.453 ± 0.39
SerGly: 5.534 ± 0.557
SerHis: 0.799 ± 0.216
SerIle: 3.937 ± 0.465
SerLys: 4.507 ± 0.423
SerLeu: 4.336 ± 0.383
SerMet: 1.369 ± 0.31
SerAsn: 2.225 ± 0.365
SerPro: 0.913 ± 0.254
SerGln: 1.769 ± 0.288
SerArg: 2.51 ± 0.385
SerSer: 3.651 ± 0.754
SerThr: 3.537 ± 0.589
SerVal: 3.708 ± 0.418
SerTrp: 0.628 ± 0.195
SerTyr: 2.111 ± 0.445
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.249 ± 1.007
ThrCys: 0.342 ± 0.153
ThrAsp: 3.651 ± 0.574
ThrGlu: 4.336 ± 0.547
ThrPhe: 3.081 ± 0.357
ThrGly: 5.477 ± 0.567
ThrHis: 0.856 ± 0.243
ThrIle: 5.42 ± 0.644
ThrLys: 4.051 ± 0.422
ThrLeu: 6.333 ± 0.509
ThrMet: 1.712 ± 0.259
ThrAsn: 2.51 ± 0.393
ThrPro: 2.225 ± 0.39
ThrGln: 2.396 ± 0.359
ThrArg: 2.738 ± 0.343
ThrSer: 2.853 ± 0.469
ThrThr: 4.393 ± 0.873
ThrVal: 5.192 ± 0.624
ThrTrp: 0.685 ± 0.24
ThrTyr: 2.282 ± 0.503
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.906 ± 0.581
ValCys: 0.342 ± 0.155
ValAsp: 3.423 ± 0.443
ValGlu: 4.849 ± 0.549
ValPhe: 3.081 ± 0.314
ValGly: 4.45 ± 0.458
ValHis: 1.255 ± 0.255
ValIle: 4.222 ± 0.497
ValLys: 4.963 ± 0.488
ValLeu: 4.336 ± 0.487
ValMet: 1.712 ± 0.29
ValAsn: 3.138 ± 0.443
ValPro: 2.054 ± 0.294
ValGln: 2.681 ± 0.352
ValArg: 4.165 ± 0.616
ValSer: 4.336 ± 0.458
ValThr: 4.963 ± 0.731
ValVal: 4.222 ± 0.526
ValTrp: 0.913 ± 0.22
ValTyr: 3.252 ± 0.462
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.141 ± 0.238
TrpCys: 0.114 ± 0.08
TrpAsp: 0.799 ± 0.178
TrpGlu: 0.742 ± 0.158
TrpPhe: 0.456 ± 0.163
TrpGly: 0.628 ± 0.186
TrpHis: 0.399 ± 0.147
TrpIle: 0.856 ± 0.2
TrpLys: 1.312 ± 0.241
TrpLeu: 0.742 ± 0.21
TrpMet: 0.399 ± 0.165
TrpAsn: 1.369 ± 0.496
TrpPro: 0.0 ± 0.0
TrpGln: 0.228 ± 0.116
TrpArg: 0.685 ± 0.198
TrpSer: 0.742 ± 0.225
TrpThr: 0.913 ± 0.255
TrpVal: 0.628 ± 0.2
TrpTrp: 0.114 ± 0.073
TrpTyr: 0.228 ± 0.098
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.396 ± 0.386
TyrCys: 0.285 ± 0.112
TyrAsp: 2.738 ± 0.453
TyrGlu: 3.366 ± 0.475
TyrPhe: 1.255 ± 0.264
TyrGly: 2.339 ± 0.462
TyrHis: 0.799 ± 0.225
TyrIle: 2.853 ± 0.46
TyrLys: 3.195 ± 0.491
TyrLeu: 2.51 ± 0.441
TyrMet: 0.742 ± 0.192
TyrAsn: 1.94 ± 0.292
TyrPro: 1.141 ± 0.293
TyrGln: 1.084 ± 0.185
TyrArg: 1.997 ± 0.297
TyrSer: 2.453 ± 0.314
TyrThr: 2.567 ± 0.399
TyrVal: 2.91 ± 0.496
TyrTrp: 0.571 ± 0.158
TyrTyr: 1.94 ± 0.402
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (17529 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
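
A protein-level bootstrap of this kind might look like the sketch below. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resulting estimates; the source does not specify the error statistic, and all function names are hypothetical.

```python
import random
from statistics import stdev


def dipeptide_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Each resample draws len(proteins) proteins with replacement, so the
    unit of resampling is the protein, not the individual residue.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return stdev(estimates)  # assumed error statistic


toy_proteome = ["MKKA", "AKKK", "GAVL", "KKKK"]
kk_error = bootstrap_error(toy_proteome, "KK")
```

Resampling proteins rather than residues keeps each sequence intact, so the estimate reflects variation between proteins.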

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski