Amino acid dipeptide frequency for Bacillus phage Ray17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
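The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function names are hypothetical, sequences are assumed to use one-letter codes, dipeptides are assumed to be counted as overlapping pairs within each protein, and the over/under threshold is assumed to be the uniform expectation of 2.5‰.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED = 1000 / 400


def dipeptide_permille(proteins, alphabet=STANDARD + "X"):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: pairs are counted within each protein, so no pair
    spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the standard set to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one sequence of length >= 2")
    return {a + b: 1000 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def classify(freq, expected=EXPECTED):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"    # corresponds to red in the table
    if freq < expected:
        return "under"   # corresponds to blue in the table
    return "expected"


toy = ["MKKLA", "AKK"]  # made-up sequences, not Ray17 proteins
freqs = dipeptide_permille(toy)
```

With real data, `proteins` would be the list of protein sequences of the proteome, and `classify(freqs["AA"])` would tell whether AlaAla is over- or underrepresented relative to 2.5‰.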

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain plain fractions.
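As a small worked example of the unit conversion, the sketch below turns a per-mille value into a fraction and into percent. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10


ala_ala = 6.731  # AlaAla frequency from the table, in per mille
fraction = permille_to_fraction(ala_ala)  # roughly 0.0067
percent = permille_to_percent(ala_ala)    # roughly 0.67
```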

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.731 ± 1.189
AlaCys: 0.296 ± 0.139
AlaAsp: 3.994 ± 0.607
AlaGlu: 5.991 ± 0.95
AlaPhe: 3.328 ± 0.633
AlaGly: 3.994 ± 0.448
AlaHis: 0.888 ± 0.241
AlaIle: 4.438 ± 0.668
AlaLys: 6.361 ± 0.734
AlaLeu: 5.473 ± 0.718
AlaMet: 2.293 ± 0.479
AlaAsn: 3.033 ± 0.539
AlaPro: 1.553 ± 0.324
AlaGln: 2.145 ± 0.398
AlaArg: 3.55 ± 0.506
AlaSer: 3.624 ± 0.484
AlaThr: 3.033 ± 0.771
AlaVal: 5.473 ± 0.909
AlaTrp: 1.183 ± 0.327
AlaTyr: 2.219 ± 0.379
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.444 ± 0.161
CysCys: 0.222 ± 0.173
CysAsp: 0.74 ± 0.28
CysGlu: 0.518 ± 0.19
CysPhe: 0.222 ± 0.119
CysGly: 0.814 ± 0.331
CysHis: 0.222 ± 0.159
CysIle: 0.296 ± 0.146
CysLys: 0.74 ± 0.259
CysLeu: 0.37 ± 0.149
CysMet: 0.148 ± 0.104
CysAsn: 0.592 ± 0.238
CysPro: 0.444 ± 0.179
CysGln: 0.296 ± 0.144
CysArg: 0.666 ± 0.211
CysSer: 0.148 ± 0.101
CysThr: 0.444 ± 0.188
CysVal: 0.296 ± 0.156
CysTrp: 0.0 ± 0.0
CysTyr: 0.222 ± 0.127
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.216 ± 0.409
AspCys: 0.814 ± 0.203
AspAsp: 4.956 ± 0.792
AspGlu: 5.178 ± 0.853
AspPhe: 3.033 ± 0.515
AspGly: 5.621 ± 0.6
AspHis: 1.036 ± 0.257
AspIle: 3.92 ± 0.508
AspLys: 4.512 ± 0.551
AspLeu: 5.917 ± 0.778
AspMet: 1.627 ± 0.352
AspAsn: 3.033 ± 0.566
AspPro: 1.997 ± 0.47
AspGln: 1.627 ± 0.438
AspArg: 3.402 ± 0.571
AspSer: 4.216 ± 0.563
AspThr: 3.254 ± 0.601
AspVal: 4.142 ± 0.504
AspTrp: 0.37 ± 0.159
AspTyr: 2.293 ± 0.375
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.361 ± 0.748
GluCys: 0.888 ± 0.297
GluAsp: 4.438 ± 0.808
GluGlu: 6.583 ± 0.835
GluPhe: 2.737 ± 0.425
GluGly: 4.512 ± 0.514
GluHis: 0.888 ± 0.246
GluIle: 5.399 ± 0.818
GluLys: 7.766 ± 1.002
GluLeu: 6.287 ± 0.881
GluMet: 3.033 ± 0.558
GluAsn: 4.142 ± 0.645
GluPro: 2.441 ± 0.507
GluGln: 2.959 ± 0.502
GluArg: 3.772 ± 0.561
GluSer: 3.402 ± 0.453
GluThr: 4.142 ± 0.552
GluVal: 5.325 ± 0.649
GluTrp: 1.479 ± 0.384
GluTyr: 3.033 ± 0.513
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.145 ± 0.493
PheCys: 0.37 ± 0.183
PheAsp: 2.959 ± 0.431
PheGlu: 2.663 ± 0.44
PhePhe: 1.923 ± 0.332
PheGly: 2.959 ± 0.509
PheHis: 0.518 ± 0.207
PheIle: 2.959 ± 0.49
PheLys: 3.624 ± 0.58
PheLeu: 2.441 ± 0.406
PheMet: 1.257 ± 0.324
PheAsn: 2.293 ± 0.363
PhePro: 0.888 ± 0.239
PheGln: 0.888 ± 0.253
PheArg: 0.888 ± 0.313
PheSer: 2.663 ± 0.476
PheThr: 2.959 ± 0.632
PheVal: 2.367 ± 0.452
PheTrp: 0.592 ± 0.172
PheTyr: 1.923 ± 0.355
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.438 ± 0.739
GlyCys: 0.666 ± 0.263
GlyAsp: 3.92 ± 0.589
GlyGlu: 5.547 ± 0.729
GlyPhe: 2.515 ± 0.505
GlyGly: 5.325 ± 0.86
GlyHis: 1.331 ± 0.326
GlyIle: 5.547 ± 0.988
GlyLys: 6.287 ± 0.795
GlyLeu: 4.882 ± 0.822
GlyMet: 2.589 ± 0.468
GlyAsn: 3.624 ± 0.67
GlyPro: 1.479 ± 0.337
GlyGln: 2.885 ± 0.448
GlyArg: 3.55 ± 0.57
GlySer: 3.624 ± 0.507
GlyThr: 3.698 ± 0.586
GlyVal: 5.251 ± 0.525
GlyTrp: 0.444 ± 0.173
GlyTyr: 3.624 ± 0.52
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.74 ± 0.221
HisCys: 0.074 ± 0.067
HisAsp: 1.405 ± 0.394
HisGlu: 1.257 ± 0.303
HisPhe: 0.814 ± 0.245
HisGly: 1.331 ± 0.311
HisHis: 0.222 ± 0.137
HisIle: 1.331 ± 0.322
HisLys: 1.036 ± 0.275
HisLeu: 0.888 ± 0.284
HisMet: 0.518 ± 0.214
HisAsn: 0.814 ± 0.239
HisPro: 0.518 ± 0.16
HisGln: 0.518 ± 0.191
HisArg: 1.183 ± 0.324
HisSer: 0.888 ± 0.243
HisThr: 0.666 ± 0.205
HisVal: 0.74 ± 0.26
HisTrp: 0.592 ± 0.191
HisTyr: 0.37 ± 0.17
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.624 ± 0.518
IleCys: 0.518 ± 0.219
IleAsp: 5.769 ± 0.72
IleGlu: 4.66 ± 0.71
IlePhe: 1.775 ± 0.401
IleGly: 4.068 ± 0.801
IleHis: 1.036 ± 0.324
IleIle: 4.216 ± 0.475
IleLys: 7.175 ± 0.661
IleLeu: 3.92 ± 0.598
IleMet: 2.145 ± 0.452
IleAsn: 3.328 ± 0.435
IlePro: 2.367 ± 0.582
IleGln: 2.663 ± 0.426
IleArg: 2.737 ± 0.402
IleSer: 4.734 ± 0.528
IleThr: 3.476 ± 0.536
IleVal: 5.03 ± 0.508
IleTrp: 0.962 ± 0.339
IleTyr: 2.589 ± 0.452
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.139 ± 0.712
LysCys: 0.74 ± 0.244
LysAsp: 5.473 ± 0.675
LysGlu: 7.027 ± 0.738
LysPhe: 3.033 ± 0.512
LysGly: 5.325 ± 0.539
LysHis: 1.331 ± 0.321
LysIle: 5.104 ± 0.601
LysLys: 8.284 ± 1.146
LysLeu: 6.657 ± 0.759
LysMet: 2.811 ± 0.61
LysAsn: 5.325 ± 0.625
LysPro: 3.107 ± 0.455
LysGln: 2.811 ± 0.466
LysArg: 5.178 ± 0.971
LysSer: 4.364 ± 1.182
LysThr: 5.695 ± 0.803
LysVal: 5.547 ± 0.563
LysTrp: 1.257 ± 0.357
LysTyr: 2.885 ± 0.423
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.364 ± 0.609
LeuCys: 0.37 ± 0.163
LeuAsp: 4.216 ± 0.437
LeuGlu: 6.509 ± 0.762
LeuPhe: 2.811 ± 0.348
LeuGly: 4.512 ± 0.603
LeuHis: 1.553 ± 0.243
LeuIle: 4.29 ± 0.576
LeuLys: 6.879 ± 0.552
LeuLeu: 3.92 ± 0.529
LeuMet: 1.849 ± 0.358
LeuAsn: 3.402 ± 0.469
LeuPro: 2.441 ± 0.458
LeuGln: 3.18 ± 0.485
LeuArg: 4.364 ± 0.599
LeuSer: 5.473 ± 0.494
LeuThr: 4.66 ± 0.558
LeuVal: 3.698 ± 0.575
LeuTrp: 0.814 ± 0.282
LeuTyr: 1.701 ± 0.353
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.515 ± 0.777
MetCys: 0.444 ± 0.184
MetAsp: 1.701 ± 0.339
MetGlu: 2.367 ± 0.484
MetPhe: 0.814 ± 0.23
MetGly: 2.071 ± 0.356
MetHis: 0.444 ± 0.186
MetIle: 1.775 ± 0.365
MetLys: 3.846 ± 0.565
MetLeu: 1.109 ± 0.265
MetMet: 1.627 ± 0.414
MetAsn: 2.663 ± 0.405
MetPro: 0.814 ± 0.262
MetGln: 1.183 ± 0.359
MetArg: 1.553 ± 0.342
MetSer: 2.441 ± 0.378
MetThr: 1.997 ± 0.319
MetVal: 0.888 ± 0.225
MetTrp: 0.592 ± 0.187
MetTyr: 0.592 ± 0.223
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.068 ± 0.769
AsnCys: 0.296 ± 0.168
AsnAsp: 3.55 ± 0.588
AsnGlu: 4.808 ± 0.813
AsnPhe: 1.627 ± 0.417
AsnGly: 4.364 ± 1.0
AsnHis: 1.257 ± 0.269
AsnIle: 3.18 ± 0.465
AsnLys: 4.068 ± 0.608
AsnLeu: 3.698 ± 0.404
AsnMet: 1.775 ± 0.459
AsnAsn: 2.737 ± 0.602
AsnPro: 2.071 ± 0.358
AsnGln: 1.183 ± 0.363
AsnArg: 2.293 ± 0.362
AsnSer: 2.515 ± 0.524
AsnThr: 2.589 ± 0.441
AsnVal: 2.737 ± 0.514
AsnTrp: 1.183 ± 0.245
AsnTyr: 1.701 ± 0.414
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.441 ± 0.366
ProCys: 0.074 ± 0.078
ProAsp: 2.219 ± 0.306
ProGlu: 2.367 ± 0.436
ProPhe: 1.405 ± 0.273
ProGly: 2.293 ± 0.496
ProHis: 0.814 ± 0.226
ProIle: 1.627 ± 0.275
ProLys: 2.441 ± 0.558
ProLeu: 2.071 ± 0.47
ProMet: 0.74 ± 0.194
ProAsn: 1.405 ± 0.326
ProPro: 0.962 ± 0.285
ProGln: 1.183 ± 0.282
ProArg: 1.257 ± 0.33
ProSer: 1.405 ± 0.357
ProThr: 1.405 ± 0.356
ProVal: 2.589 ± 0.333
ProTrp: 0.222 ± 0.129
ProTyr: 1.405 ± 0.34
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.849 ± 0.304
GlnCys: 0.222 ± 0.123
GlnAsp: 2.071 ± 0.351
GlnGlu: 3.18 ± 0.568
GlnPhe: 1.331 ± 0.249
GlnGly: 2.515 ± 0.425
GlnHis: 0.444 ± 0.176
GlnIle: 2.367 ± 0.337
GlnLys: 3.18 ± 0.548
GlnLeu: 2.663 ± 0.408
GlnMet: 1.109 ± 0.285
GlnAsn: 1.553 ± 0.396
GlnPro: 0.888 ± 0.266
GlnGln: 1.553 ± 0.364
GlnArg: 1.553 ± 0.439
GlnSer: 1.775 ± 0.511
GlnThr: 1.849 ± 0.436
GlnVal: 1.997 ± 0.335
GlnTrp: 0.296 ± 0.149
GlnTyr: 0.962 ± 0.338
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.589 ± 0.433
ArgCys: 0.592 ± 0.205
ArgAsp: 3.18 ± 0.462
ArgGlu: 3.772 ± 0.605
ArgPhe: 2.885 ± 0.452
ArgGly: 3.698 ± 0.596
ArgHis: 0.518 ± 0.193
ArgIle: 4.142 ± 0.586
ArgLys: 3.402 ± 0.741
ArgLeu: 4.216 ± 0.635
ArgMet: 1.479 ± 0.418
ArgAsn: 1.849 ± 0.356
ArgPro: 0.962 ± 0.255
ArgGln: 1.701 ± 0.475
ArgArg: 2.737 ± 0.551
ArgSer: 1.701 ± 0.26
ArgThr: 3.846 ± 0.503
ArgVal: 3.033 ± 0.432
ArgTrp: 0.666 ± 0.181
ArgTyr: 1.997 ± 0.416
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.29 ± 0.871
SerCys: 0.222 ± 0.132
SerAsp: 3.92 ± 0.61
SerGlu: 4.29 ± 0.467
SerPhe: 2.959 ± 0.663
SerGly: 5.251 ± 0.59
SerHis: 0.74 ± 0.211
SerIle: 3.55 ± 0.579
SerLys: 4.142 ± 0.505
SerLeu: 4.512 ± 0.687
SerMet: 1.849 ± 0.423
SerAsn: 1.923 ± 0.35
SerPro: 1.775 ± 0.337
SerGln: 1.553 ± 0.341
SerArg: 3.033 ± 0.495
SerSer: 2.885 ± 0.828
SerThr: 3.402 ± 0.468
SerVal: 3.698 ± 0.695
SerTrp: 0.888 ± 0.284
SerTyr: 1.923 ± 0.379
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.808 ± 1.054
ThrCys: 0.296 ± 0.126
ThrAsp: 3.846 ± 0.615
ThrGlu: 3.476 ± 0.624
ThrPhe: 1.997 ± 0.31
ThrGly: 4.512 ± 0.658
ThrHis: 0.666 ± 0.195
ThrIle: 3.698 ± 0.47
ThrLys: 4.734 ± 0.623
ThrLeu: 3.55 ± 0.548
ThrMet: 1.553 ± 0.337
ThrAsn: 3.254 ± 0.521
ThrPro: 2.515 ± 0.492
ThrGln: 1.405 ± 0.353
ThrArg: 2.293 ± 0.435
ThrSer: 3.18 ± 0.929
ThrThr: 3.624 ± 0.597
ThrVal: 4.808 ± 0.865
ThrTrp: 0.962 ± 0.252
ThrTyr: 2.293 ± 0.439
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.512 ± 0.617
ValCys: 0.296 ± 0.15
ValAsp: 3.698 ± 0.552
ValGlu: 5.325 ± 0.728
ValPhe: 2.367 ± 0.469
ValGly: 3.476 ± 0.47
ValHis: 1.036 ± 0.249
ValIle: 5.03 ± 0.554
ValLys: 5.769 ± 0.725
ValLeu: 4.66 ± 0.557
ValMet: 1.701 ± 0.452
ValAsn: 3.107 ± 0.526
ValPro: 1.775 ± 0.328
ValGln: 2.293 ± 0.366
ValArg: 2.811 ± 0.503
ValSer: 4.512 ± 0.548
ValThr: 3.846 ± 0.636
ValVal: 3.846 ± 0.543
ValTrp: 1.405 ± 0.708
ValTyr: 2.811 ± 0.497
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.666 ± 0.213
TrpCys: 0.148 ± 0.093
TrpAsp: 0.814 ± 0.229
TrpGlu: 0.888 ± 0.25
TrpPhe: 0.444 ± 0.161
TrpGly: 1.257 ± 0.231
TrpHis: 0.222 ± 0.115
TrpIle: 1.479 ± 0.313
TrpLys: 1.036 ± 0.287
TrpLeu: 0.814 ± 0.222
TrpMet: 0.37 ± 0.17
TrpAsn: 1.479 ± 0.65
TrpPro: 0.37 ± 0.15
TrpGln: 0.592 ± 0.221
TrpArg: 0.74 ± 0.189
TrpSer: 1.183 ± 0.309
TrpThr: 0.888 ± 0.342
TrpVal: 0.814 ± 0.248
TrpTrp: 0.074 ± 0.073
TrpTyr: 0.37 ± 0.192
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.737 ± 0.387
TyrCys: 0.296 ± 0.147
TyrAsp: 2.293 ± 0.355
TyrGlu: 3.18 ± 0.511
TyrPhe: 1.257 ± 0.314
TyrGly: 3.328 ± 0.668
TyrHis: 0.74 ± 0.243
TyrIle: 2.441 ± 0.469
TyrLys: 2.811 ± 0.552
TyrLeu: 2.959 ± 0.429
TyrMet: 0.888 ± 0.222
TyrAsn: 2.145 ± 0.426
TyrPro: 0.962 ± 0.275
TyrGln: 0.74 ± 0.212
TyrArg: 1.405 ± 0.336
TyrSer: 2.071 ± 0.327
TyrThr: 1.997 ± 0.405
TyrVal: 1.997 ± 0.377
TyrTrp: 0.666 ± 0.212
TyrTyr: 1.627 ± 0.387
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
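
For readers who want to work with the table entries directly, the following sketch parses lines of the form "AlaAla: 6.731 ± 1.189" into a nested dictionary. It is only an illustration: the function names are hypothetical, and the pattern assumes three-letter residue codes followed by a value and an error, both in per mille.

```python
import re

# Two three-letter residue codes, a colon, then "value ± error".
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)"
)


def parse_entry(line):
    """Return (first, second, value, error) or None if the line is not an entry."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


def build_table(lines):
    """Collect parsed entries into {first: {second: (value, error)}}."""
    table = {}
    for line in lines:
        parsed = parse_entry(line)
        if parsed is None:
            continue  # skip row labels such as "Ala" and other text
        first, second, value, error = parsed
        table.setdefault(first, {})[second] = (value, error)
    return table


sample = [
    "Lys",
    "LysLys: 8.284 ± 1.146",
    "LysLeu: 6.657 ± 0.759",
    "Trp",
    "TrpTrp: 0.074 ± 0.073",
]
table = build_table(sample)
```

Because the pattern is matched with `search`, any text before the dipeptide name on a line is ignored.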
Statistics based on 74 proteins (13521 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
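
A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration rather than the database's actual procedure: it assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resampled frequencies. The function names and toy sequences are made up.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs, counted within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


toy = ["MKKLAKK", "MAAAKL", "MKLKKA", "MLLAKK"]  # made-up sequences
err = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.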

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski