Amino acid dipepetide frequency for Bacillus phage Tavor_SA

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.432AlaAla: 4.432 ± 0.83
0.492AlaCys: 0.492 ± 0.276
3.119AlaAsp: 3.119 ± 0.568
5.088AlaGlu: 5.088 ± 0.898
3.037AlaPhe: 3.037 ± 0.435
4.432AlaGly: 4.432 ± 0.755
0.739AlaHis: 0.739 ± 0.208
5.17AlaIle: 5.17 ± 0.523
4.76AlaLys: 4.76 ± 0.562
5.006AlaLeu: 5.006 ± 0.751
1.313AlaMet: 1.313 ± 0.392
3.611AlaAsn: 3.611 ± 0.569
1.149AlaPro: 1.149 ± 0.303
1.97AlaGln: 1.97 ± 0.348
2.79AlaArg: 2.79 ± 0.533
4.268AlaSer: 4.268 ± 0.647
3.283AlaThr: 3.283 ± 0.704
3.693AlaVal: 3.693 ± 0.69
0.657AlaTrp: 0.657 ± 0.207
2.216AlaTyr: 2.216 ± 0.616
0.0AlaXaa: 0.0 ± 0.0
Cys
0.246CysAla: 0.246 ± 0.153
0.082CysCys: 0.082 ± 0.084
0.492CysAsp: 0.492 ± 0.16
0.985CysGlu: 0.985 ± 0.404
0.41CysPhe: 0.41 ± 0.192
0.41CysGly: 0.41 ± 0.212
0.328CysHis: 0.328 ± 0.19
0.985CysIle: 0.985 ± 0.372
0.492CysLys: 0.492 ± 0.207
0.739CysLeu: 0.739 ± 0.238
0.41CysMet: 0.41 ± 0.257
0.41CysAsn: 0.41 ± 0.216
0.657CysPro: 0.657 ± 0.238
0.41CysGln: 0.41 ± 0.163
0.328CysArg: 0.328 ± 0.169
0.739CysSer: 0.739 ± 0.278
0.246CysThr: 0.246 ± 0.134
0.492CysVal: 0.492 ± 0.172
0.0CysTrp: 0.0 ± 0.0
0.657CysTyr: 0.657 ± 0.24
0.0CysXaa: 0.0 ± 0.0
Asp
3.037AspAla: 3.037 ± 0.494
0.246AspCys: 0.246 ± 0.147
3.037AspAsp: 3.037 ± 0.467
5.17AspGlu: 5.17 ± 0.678
2.79AspPhe: 2.79 ± 0.459
3.611AspGly: 3.611 ± 0.52
0.574AspHis: 0.574 ± 0.191
4.103AspIle: 4.103 ± 0.55
6.155AspLys: 6.155 ± 0.531
4.35AspLeu: 4.35 ± 0.502
2.134AspMet: 2.134 ± 0.354
2.544AspAsn: 2.544 ± 0.534
1.313AspPro: 1.313 ± 0.362
2.134AspGln: 2.134 ± 0.46
2.872AspArg: 2.872 ± 0.486
1.97AspSer: 1.97 ± 0.487
2.708AspThr: 2.708 ± 0.457
3.529AspVal: 3.529 ± 0.498
0.903AspTrp: 0.903 ± 0.266
2.298AspTyr: 2.298 ± 0.571
0.0AspXaa: 0.0 ± 0.0
Glu
5.252GluAla: 5.252 ± 0.634
1.313GluCys: 1.313 ± 0.333
4.842GluAsp: 4.842 ± 0.518
9.93GluGlu: 9.93 ± 1.777
3.693GluPhe: 3.693 ± 0.787
4.103GluGly: 4.103 ± 0.618
1.723GluHis: 1.723 ± 0.327
7.879GluIle: 7.879 ± 0.778
10.094GluLys: 10.094 ± 0.919
8.371GluLeu: 8.371 ± 0.978
3.365GluMet: 3.365 ± 0.526
4.021GluAsn: 4.021 ± 0.605
0.985GluPro: 0.985 ± 0.396
4.596GluGln: 4.596 ± 0.59
4.021GluArg: 4.021 ± 0.665
4.185GluSer: 4.185 ± 0.718
4.678GluThr: 4.678 ± 0.689
4.432GluVal: 4.432 ± 0.71
0.903GluTrp: 0.903 ± 0.241
2.462GluTyr: 2.462 ± 0.432
0.0GluXaa: 0.0 ± 0.0
Phe
2.79PheAla: 2.79 ± 0.572
0.574PheCys: 0.574 ± 0.22
2.216PheAsp: 2.216 ± 0.472
3.447PheGlu: 3.447 ± 0.637
1.231PhePhe: 1.231 ± 0.339
2.298PheGly: 2.298 ± 0.415
0.492PheHis: 0.492 ± 0.211
2.462PheIle: 2.462 ± 0.526
3.939PheLys: 3.939 ± 0.396
4.103PheLeu: 4.103 ± 0.634
1.641PheMet: 1.641 ± 0.364
2.298PheAsn: 2.298 ± 0.425
0.985PhePro: 0.985 ± 0.275
1.559PheGln: 1.559 ± 0.458
2.052PheArg: 2.052 ± 0.392
2.052PheSer: 2.052 ± 0.633
2.298PheThr: 2.298 ± 0.507
2.216PheVal: 2.216 ± 0.336
0.41PheTrp: 0.41 ± 0.208
1.641PheTyr: 1.641 ± 0.39
0.0PheXaa: 0.0 ± 0.0
Gly
3.283GlyAla: 3.283 ± 0.675
0.657GlyCys: 0.657 ± 0.228
3.037GlyAsp: 3.037 ± 0.523
5.416GlyGlu: 5.416 ± 0.806
3.611GlyPhe: 3.611 ± 0.455
3.447GlyGly: 3.447 ± 0.537
0.821GlyHis: 0.821 ± 0.238
5.088GlyIle: 5.088 ± 0.814
5.745GlyLys: 5.745 ± 0.549
5.499GlyLeu: 5.499 ± 0.891
2.052GlyMet: 2.052 ± 0.685
2.134GlyAsn: 2.134 ± 0.39
0.821GlyPro: 0.821 ± 0.304
1.641GlyGln: 1.641 ± 0.341
2.38GlyArg: 2.38 ± 0.46
2.708GlySer: 2.708 ± 0.396
2.134GlyThr: 2.134 ± 0.55
4.185GlyVal: 4.185 ± 0.647
0.903GlyTrp: 0.903 ± 0.369
2.134GlyTyr: 2.134 ± 0.414
0.0GlyXaa: 0.0 ± 0.0
His
1.067HisAla: 1.067 ± 0.225
0.574HisCys: 0.574 ± 0.249
0.657HisAsp: 0.657 ± 0.199
0.739HisGlu: 0.739 ± 0.253
0.492HisPhe: 0.492 ± 0.208
0.574HisGly: 0.574 ± 0.195
0.657HisHis: 0.657 ± 0.283
0.985HisIle: 0.985 ± 0.294
1.313HisLys: 1.313 ± 0.305
1.805HisLeu: 1.805 ± 0.379
0.657HisMet: 0.657 ± 0.242
0.821HisAsn: 0.821 ± 0.23
0.41HisPro: 0.41 ± 0.179
0.492HisGln: 0.492 ± 0.191
1.149HisArg: 1.149 ± 0.304
0.657HisSer: 0.657 ± 0.252
0.985HisThr: 0.985 ± 0.276
0.739HisVal: 0.739 ± 0.237
0.328HisTrp: 0.328 ± 0.143
1.231HisTyr: 1.231 ± 0.281
0.0HisXaa: 0.0 ± 0.0
Ile
4.432IleAla: 4.432 ± 0.635
0.657IleCys: 0.657 ± 0.218
5.088IleAsp: 5.088 ± 0.618
6.319IleGlu: 6.319 ± 0.607
1.97IlePhe: 1.97 ± 0.472
3.857IleGly: 3.857 ± 0.606
0.985IleHis: 0.985 ± 0.3
4.514IleIle: 4.514 ± 0.864
6.483IleLys: 6.483 ± 0.605
6.073IleLeu: 6.073 ± 0.749
1.559IleMet: 1.559 ± 0.299
3.939IleAsn: 3.939 ± 0.508
2.708IlePro: 2.708 ± 0.465
3.037IleGln: 3.037 ± 0.504
4.103IleArg: 4.103 ± 0.523
5.006IleSer: 5.006 ± 0.613
3.283IleThr: 3.283 ± 0.569
4.432IleVal: 4.432 ± 0.566
0.739IleTrp: 0.739 ± 0.239
2.216IleTyr: 2.216 ± 0.572
0.0IleXaa: 0.0 ± 0.0
Lys
4.678LysAla: 4.678 ± 0.971
1.149LysCys: 1.149 ± 0.351
4.021LysAsp: 4.021 ± 0.497
9.93LysGlu: 9.93 ± 1.119
2.954LysPhe: 2.954 ± 0.456
5.663LysGly: 5.663 ± 0.741
1.888LysHis: 1.888 ± 0.397
6.648LysIle: 6.648 ± 0.642
8.453LysLys: 8.453 ± 0.765
8.207LysLeu: 8.207 ± 0.848
3.283LysMet: 3.283 ± 0.612
5.827LysAsn: 5.827 ± 0.813
2.544LysPro: 2.544 ± 0.452
3.037LysGln: 3.037 ± 0.576
5.006LysArg: 5.006 ± 0.83
4.678LysSer: 4.678 ± 0.505
5.499LysThr: 5.499 ± 0.674
7.222LysVal: 7.222 ± 0.816
0.985LysTrp: 0.985 ± 0.281
3.693LysTyr: 3.693 ± 0.664
0.0LysXaa: 0.0 ± 0.0
Leu
5.252LeuAla: 5.252 ± 0.754
0.574LeuCys: 0.574 ± 0.262
4.842LeuAsp: 4.842 ± 0.58
6.483LeuGlu: 6.483 ± 0.874
3.775LeuPhe: 3.775 ± 0.746
5.17LeuGly: 5.17 ± 0.731
1.313LeuHis: 1.313 ± 0.3
4.842LeuIle: 4.842 ± 0.705
8.453LeuLys: 8.453 ± 0.746
6.155LeuLeu: 6.155 ± 0.865
1.477LeuMet: 1.477 ± 0.348
5.499LeuAsn: 5.499 ± 0.73
1.805LeuPro: 1.805 ± 0.428
4.103LeuGln: 4.103 ± 0.563
3.939LeuArg: 3.939 ± 0.608
5.909LeuSer: 5.909 ± 0.792
5.006LeuThr: 5.006 ± 0.679
4.432LeuVal: 4.432 ± 0.697
0.821LeuTrp: 0.821 ± 0.307
3.529LeuTyr: 3.529 ± 0.603
0.0LeuXaa: 0.0 ± 0.0
Met
1.805MetAla: 1.805 ± 0.407
0.164MetCys: 0.164 ± 0.132
1.559MetAsp: 1.559 ± 0.424
2.38MetGlu: 2.38 ± 0.482
0.903MetPhe: 0.903 ± 0.321
1.477MetGly: 1.477 ± 0.435
0.246MetHis: 0.246 ± 0.133
1.477MetIle: 1.477 ± 0.236
3.283MetLys: 3.283 ± 0.552
1.641MetLeu: 1.641 ± 0.305
0.821MetMet: 0.821 ± 0.281
1.641MetAsn: 1.641 ± 0.494
0.903MetPro: 0.903 ± 0.256
0.821MetGln: 0.821 ± 0.278
1.313MetArg: 1.313 ± 0.333
2.052MetSer: 2.052 ± 0.382
1.97MetThr: 1.97 ± 0.359
1.313MetVal: 1.313 ± 0.379
0.821MetTrp: 0.821 ± 0.271
0.739MetTyr: 0.739 ± 0.234
0.0MetXaa: 0.0 ± 0.0
Asn
3.447AsnAla: 3.447 ± 0.549
0.246AsnCys: 0.246 ± 0.158
2.216AsnAsp: 2.216 ± 0.338
5.088AsnGlu: 5.088 ± 0.727
1.888AsnPhe: 1.888 ± 0.403
3.775AsnGly: 3.775 ± 0.47
1.395AsnHis: 1.395 ± 0.384
3.611AsnIle: 3.611 ± 0.612
4.678AsnLys: 4.678 ± 0.646
3.939AsnLeu: 3.939 ± 0.534
1.805AsnMet: 1.805 ± 0.339
2.544AsnAsn: 2.544 ± 0.489
2.216AsnPro: 2.216 ± 0.494
2.626AsnGln: 2.626 ± 0.525
3.365AsnArg: 3.365 ± 0.638
3.119AsnSer: 3.119 ± 0.541
2.298AsnThr: 2.298 ± 0.445
3.119AsnVal: 3.119 ± 0.514
0.574AsnTrp: 0.574 ± 0.269
1.395AsnTyr: 1.395 ± 0.379
0.0AsnXaa: 0.0 ± 0.0
Pro
1.723ProAla: 1.723 ± 0.391
0.246ProCys: 0.246 ± 0.189
1.395ProAsp: 1.395 ± 0.365
2.626ProGlu: 2.626 ± 0.508
0.903ProPhe: 0.903 ± 0.224
0.903ProGly: 0.903 ± 0.237
0.574ProHis: 0.574 ± 0.235
1.477ProIle: 1.477 ± 0.37
1.559ProLys: 1.559 ± 0.371
1.888ProLeu: 1.888 ± 0.334
0.574ProMet: 0.574 ± 0.254
1.477ProAsn: 1.477 ± 0.332
0.492ProPro: 0.492 ± 0.206
1.149ProGln: 1.149 ± 0.234
0.903ProArg: 0.903 ± 0.324
2.216ProSer: 2.216 ± 0.414
1.805ProThr: 1.805 ± 0.465
2.462ProVal: 2.462 ± 0.321
0.082ProTrp: 0.082 ± 0.093
1.067ProTyr: 1.067 ± 0.313
0.0ProXaa: 0.0 ± 0.0
Gln
3.037GlnAla: 3.037 ± 0.594
0.164GlnCys: 0.164 ± 0.172
1.723GlnAsp: 1.723 ± 0.409
3.283GlnGlu: 3.283 ± 0.513
2.052GlnPhe: 2.052 ± 0.397
1.97GlnGly: 1.97 ± 0.393
0.657GlnHis: 0.657 ± 0.239
2.134GlnIle: 2.134 ± 0.365
4.103GlnLys: 4.103 ± 0.609
2.708GlnLeu: 2.708 ± 0.449
1.477GlnMet: 1.477 ± 0.341
2.216GlnAsn: 2.216 ± 0.493
1.805GlnPro: 1.805 ± 0.38
2.626GlnGln: 2.626 ± 0.576
2.38GlnArg: 2.38 ± 0.446
2.626GlnSer: 2.626 ± 0.48
1.395GlnThr: 1.395 ± 0.281
2.462GlnVal: 2.462 ± 0.442
0.328GlnTrp: 0.328 ± 0.142
1.888GlnTyr: 1.888 ± 0.414
0.0GlnXaa: 0.0 ± 0.0
Arg
2.954ArgAla: 2.954 ± 0.59
0.41ArgCys: 0.41 ± 0.18
3.037ArgAsp: 3.037 ± 0.461
4.678ArgGlu: 4.678 ± 0.611
2.298ArgPhe: 2.298 ± 0.389
2.872ArgGly: 2.872 ± 0.603
0.657ArgHis: 0.657 ± 0.281
4.678ArgIle: 4.678 ± 0.747
4.76ArgLys: 4.76 ± 0.725
4.514ArgLeu: 4.514 ± 0.688
0.739ArgMet: 0.739 ± 0.227
2.38ArgAsn: 2.38 ± 0.562
0.985ArgPro: 0.985 ± 0.313
2.052ArgGln: 2.052 ± 0.472
1.888ArgArg: 1.888 ± 0.388
3.037ArgSer: 3.037 ± 0.661
2.216ArgThr: 2.216 ± 0.378
2.298ArgVal: 2.298 ± 0.354
0.574ArgTrp: 0.574 ± 0.185
2.38ArgTyr: 2.38 ± 0.481
0.0ArgXaa: 0.0 ± 0.0
Ser
3.775SerAla: 3.775 ± 0.758
0.328SerCys: 0.328 ± 0.18
3.775SerAsp: 3.775 ± 0.526
3.939SerGlu: 3.939 ± 0.614
2.052SerPhe: 2.052 ± 0.451
3.529SerGly: 3.529 ± 0.464
1.149SerHis: 1.149 ± 0.317
3.365SerIle: 3.365 ± 0.484
6.237SerLys: 6.237 ± 0.834
5.17SerLeu: 5.17 ± 0.598
1.231SerMet: 1.231 ± 0.355
3.283SerAsn: 3.283 ± 0.582
1.231SerPro: 1.231 ± 0.325
2.216SerGln: 2.216 ± 0.457
3.037SerArg: 3.037 ± 0.497
2.872SerSer: 2.872 ± 0.507
3.201SerThr: 3.201 ± 0.38
4.103SerVal: 4.103 ± 0.592
0.574SerTrp: 0.574 ± 0.254
2.134SerTyr: 2.134 ± 0.687
0.0SerXaa: 0.0 ± 0.0
Thr
3.529ThrAla: 3.529 ± 0.712
0.164ThrCys: 0.164 ± 0.115
2.79ThrAsp: 2.79 ± 0.445
5.088ThrGlu: 5.088 ± 0.817
2.134ThrPhe: 2.134 ± 0.467
3.693ThrGly: 3.693 ± 0.596
0.41ThrHis: 0.41 ± 0.198
4.021ThrIle: 4.021 ± 0.643
4.596ThrLys: 4.596 ± 0.725
4.596ThrLeu: 4.596 ± 0.658
0.985ThrMet: 0.985 ± 0.279
2.38ThrAsn: 2.38 ± 0.453
2.052ThrPro: 2.052 ± 0.567
2.626ThrGln: 2.626 ± 0.374
1.97ThrArg: 1.97 ± 0.326
2.626ThrSer: 2.626 ± 0.53
3.857ThrThr: 3.857 ± 0.726
3.447ThrVal: 3.447 ± 0.671
0.328ThrTrp: 0.328 ± 0.156
1.641ThrTyr: 1.641 ± 0.28
0.0ThrXaa: 0.0 ± 0.0
Val
3.939ValAla: 3.939 ± 0.752
0.41ValCys: 0.41 ± 0.199
4.103ValAsp: 4.103 ± 0.555
5.909ValGlu: 5.909 ± 0.737
2.298ValPhe: 2.298 ± 0.62
3.529ValGly: 3.529 ± 0.567
0.903ValHis: 0.903 ± 0.328
4.35ValIle: 4.35 ± 0.61
5.581ValLys: 5.581 ± 0.68
5.006ValLeu: 5.006 ± 0.827
0.903ValMet: 0.903 ± 0.325
3.201ValAsn: 3.201 ± 0.583
1.723ValPro: 1.723 ± 0.401
2.544ValGln: 2.544 ± 0.564
2.954ValArg: 2.954 ± 0.562
3.775ValSer: 3.775 ± 0.462
3.283ValThr: 3.283 ± 0.49
4.432ValVal: 4.432 ± 0.64
0.574ValTrp: 0.574 ± 0.23
3.037ValTyr: 3.037 ± 0.539
0.0ValXaa: 0.0 ± 0.0
Trp
0.41TrpAla: 0.41 ± 0.288
0.246TrpCys: 0.246 ± 0.162
0.985TrpAsp: 0.985 ± 0.255
0.985TrpGlu: 0.985 ± 0.291
0.328TrpPhe: 0.328 ± 0.14
0.492TrpGly: 0.492 ± 0.173
0.164TrpHis: 0.164 ± 0.096
0.903TrpIle: 0.903 ± 0.402
0.903TrpLys: 0.903 ± 0.267
0.903TrpLeu: 0.903 ± 0.245
0.082TrpMet: 0.082 ± 0.058
0.657TrpAsn: 0.657 ± 0.229
0.082TrpPro: 0.082 ± 0.075
0.41TrpGln: 0.41 ± 0.164
0.574TrpArg: 0.574 ± 0.196
0.903TrpSer: 0.903 ± 0.342
0.739TrpThr: 0.739 ± 0.214
0.739TrpVal: 0.739 ± 0.252
0.246TrpTrp: 0.246 ± 0.138
0.492TrpTyr: 0.492 ± 0.23
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.134TyrAla: 2.134 ± 0.452
0.821TyrCys: 0.821 ± 0.28
2.544TyrAsp: 2.544 ± 0.403
3.365TyrGlu: 3.365 ± 0.491
1.888TyrPhe: 1.888 ± 0.365
1.888TyrGly: 1.888 ± 0.33
0.657TyrHis: 0.657 ± 0.215
2.708TyrIle: 2.708 ± 0.497
3.775TyrLys: 3.775 ± 0.629
2.79TyrLeu: 2.79 ± 0.549
0.739TyrMet: 0.739 ± 0.223
2.462TyrAsn: 2.462 ± 0.591
0.657TyrPro: 0.657 ± 0.319
0.985TyrGln: 0.985 ± 0.252
2.38TyrArg: 2.38 ± 0.46
1.805TyrSer: 1.805 ± 0.396
2.052TyrThr: 2.052 ± 0.347
2.708TyrVal: 2.708 ± 0.48
0.492TyrTrp: 0.492 ± 0.193
0.739TyrTyr: 0.739 ± 0.246
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (12186 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski