Amino acid dipepetide frequency for Bacillus phage vB_BtS_BMBtp13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.545AlaAla: 4.545 ± 1.111
0.267AlaCys: 0.267 ± 0.225
2.807AlaAsp: 2.807 ± 0.522
3.075AlaGlu: 3.075 ± 0.69
3.61AlaPhe: 3.61 ± 0.585
5.214AlaGly: 5.214 ± 1.493
0.401AlaHis: 0.401 ± 0.203
5.214AlaIle: 5.214 ± 1.12
3.075AlaLys: 3.075 ± 0.819
6.551AlaLeu: 6.551 ± 1.416
2.139AlaMet: 2.139 ± 0.556
4.412AlaAsn: 4.412 ± 0.742
1.872AlaPro: 1.872 ± 0.502
2.54AlaGln: 2.54 ± 0.579
2.273AlaArg: 2.273 ± 0.656
4.011AlaSer: 4.011 ± 0.907
4.144AlaThr: 4.144 ± 1.013
4.412AlaVal: 4.412 ± 0.869
0.535AlaTrp: 0.535 ± 0.228
2.005AlaTyr: 2.005 ± 0.498
0.0AlaXaa: 0.0 ± 0.0
Cys
0.267CysAla: 0.267 ± 0.241
0.267CysCys: 0.267 ± 0.161
0.802CysAsp: 0.802 ± 0.319
0.936CysGlu: 0.936 ± 0.5
0.401CysPhe: 0.401 ± 0.164
0.802CysGly: 0.802 ± 0.395
0.267CysHis: 0.267 ± 0.172
1.07CysIle: 1.07 ± 0.441
1.203CysLys: 1.203 ± 0.56
0.134CysLeu: 0.134 ± 0.132
0.401CysMet: 0.401 ± 0.192
0.267CysAsn: 0.267 ± 0.173
0.134CysPro: 0.134 ± 0.156
0.668CysGln: 0.668 ± 0.294
0.267CysArg: 0.267 ± 0.207
0.134CysSer: 0.134 ± 0.112
0.134CysThr: 0.134 ± 0.131
0.668CysVal: 0.668 ± 0.311
0.0CysTrp: 0.0 ± 0.0
0.267CysTyr: 0.267 ± 0.175
0.0CysXaa: 0.0 ± 0.0
Asp
2.406AspAla: 2.406 ± 0.477
0.267AspCys: 0.267 ± 0.183
2.54AspAsp: 2.54 ± 0.599
4.144AspGlu: 4.144 ± 0.88
3.209AspPhe: 3.209 ± 0.465
5.08AspGly: 5.08 ± 0.62
0.401AspHis: 0.401 ± 0.219
6.551AspIle: 6.551 ± 0.96
3.209AspLys: 3.209 ± 0.658
4.813AspLeu: 4.813 ± 0.534
1.471AspMet: 1.471 ± 0.321
2.273AspAsn: 2.273 ± 0.493
1.604AspPro: 1.604 ± 0.361
2.139AspGln: 2.139 ± 0.485
2.674AspArg: 2.674 ± 0.744
3.476AspSer: 3.476 ± 0.842
2.807AspThr: 2.807 ± 0.588
4.947AspVal: 4.947 ± 0.834
1.07AspTrp: 1.07 ± 0.309
2.406AspTyr: 2.406 ± 0.501
0.0AspXaa: 0.0 ± 0.0
Glu
4.813GluAla: 4.813 ± 1.005
0.802GluCys: 0.802 ± 0.283
3.476GluAsp: 3.476 ± 0.679
7.487GluGlu: 7.487 ± 1.039
3.877GluPhe: 3.877 ± 0.695
4.947GluGly: 4.947 ± 0.81
0.535GluHis: 0.535 ± 0.234
5.08GluIle: 5.08 ± 0.62
6.684GluLys: 6.684 ± 1.281
6.818GluLeu: 6.818 ± 1.016
1.471GluMet: 1.471 ± 0.34
3.877GluAsn: 3.877 ± 0.67
1.337GluPro: 1.337 ± 0.464
4.412GluGln: 4.412 ± 1.198
4.011GluArg: 4.011 ± 0.879
3.877GluSer: 3.877 ± 0.74
2.807GluThr: 2.807 ± 0.664
3.877GluVal: 3.877 ± 0.869
0.802GluTrp: 0.802 ± 0.38
3.61GluTyr: 3.61 ± 0.792
0.0GluXaa: 0.0 ± 0.0
Phe
2.139PheAla: 2.139 ± 0.473
1.07PheCys: 1.07 ± 0.365
2.406PheAsp: 2.406 ± 0.445
2.674PheGlu: 2.674 ± 0.57
1.471PhePhe: 1.471 ± 0.471
2.674PheGly: 2.674 ± 0.701
0.668PheHis: 0.668 ± 0.338
3.342PheIle: 3.342 ± 0.677
4.011PheLys: 4.011 ± 0.448
2.674PheLeu: 2.674 ± 0.608
0.802PheMet: 0.802 ± 0.299
2.941PheAsn: 2.941 ± 0.858
1.337PhePro: 1.337 ± 0.428
2.406PheGln: 2.406 ± 0.505
1.872PheArg: 1.872 ± 0.443
1.872PheSer: 1.872 ± 0.475
2.54PheThr: 2.54 ± 0.462
2.941PheVal: 2.941 ± 0.488
0.267PheTrp: 0.267 ± 0.193
2.273PheTyr: 2.273 ± 0.517
0.0PheXaa: 0.0 ± 0.0
Gly
3.743GlyAla: 3.743 ± 0.604
0.401GlyCys: 0.401 ± 0.212
3.743GlyAsp: 3.743 ± 0.758
4.412GlyGlu: 4.412 ± 0.768
3.476GlyPhe: 3.476 ± 0.581
4.813GlyGly: 4.813 ± 1.022
0.668GlyHis: 0.668 ± 0.305
7.754GlyIle: 7.754 ± 1.513
6.551GlyLys: 6.551 ± 1.062
6.016GlyLeu: 6.016 ± 0.65
1.738GlyMet: 1.738 ± 0.35
3.877GlyAsn: 3.877 ± 0.848
1.471GlyPro: 1.471 ± 0.349
1.738GlyGln: 1.738 ± 0.425
2.406GlyArg: 2.406 ± 0.532
3.61GlySer: 3.61 ± 0.703
3.342GlyThr: 3.342 ± 0.868
6.551GlyVal: 6.551 ± 1.295
0.535GlyTrp: 0.535 ± 0.259
3.342GlyTyr: 3.342 ± 0.675
0.0GlyXaa: 0.0 ± 0.0
His
0.668HisAla: 0.668 ± 0.279
0.134HisCys: 0.134 ± 0.131
0.936HisAsp: 0.936 ± 0.375
0.668HisGlu: 0.668 ± 0.28
0.401HisPhe: 0.401 ± 0.278
0.802HisGly: 0.802 ± 0.487
0.802HisHis: 0.802 ± 0.354
0.802HisIle: 0.802 ± 0.403
0.936HisLys: 0.936 ± 0.333
0.936HisLeu: 0.936 ± 0.308
0.134HisMet: 0.134 ± 0.133
0.936HisAsn: 0.936 ± 0.33
0.668HisPro: 0.668 ± 0.241
0.0HisGln: 0.0 ± 0.0
0.267HisArg: 0.267 ± 0.187
0.802HisSer: 0.802 ± 0.271
0.936HisThr: 0.936 ± 0.385
0.802HisVal: 0.802 ± 0.298
0.0HisTrp: 0.0 ± 0.0
0.535HisTyr: 0.535 ± 0.228
0.0HisXaa: 0.0 ± 0.0
Ile
6.417IleAla: 6.417 ± 1.554
0.668IleCys: 0.668 ± 0.407
6.016IleAsp: 6.016 ± 0.766
6.684IleGlu: 6.684 ± 0.99
2.941IlePhe: 2.941 ± 0.468
5.481IleGly: 5.481 ± 0.798
0.936IleHis: 0.936 ± 0.389
4.679IleIle: 4.679 ± 1.583
6.283IleLys: 6.283 ± 0.921
5.615IleLeu: 5.615 ± 0.979
2.941IleMet: 2.941 ± 0.514
5.214IleAsn: 5.214 ± 0.881
1.471IlePro: 1.471 ± 0.609
2.941IleGln: 2.941 ± 0.6
3.209IleArg: 3.209 ± 0.761
4.144IleSer: 4.144 ± 0.704
4.545IleThr: 4.545 ± 1.013
6.15IleVal: 6.15 ± 1.206
0.802IleTrp: 0.802 ± 0.328
1.604IleTyr: 1.604 ± 0.415
0.0IleXaa: 0.0 ± 0.0
Lys
4.813LysAla: 4.813 ± 0.742
0.401LysCys: 0.401 ± 0.193
3.877LysAsp: 3.877 ± 0.763
7.888LysGlu: 7.888 ± 1.318
3.075LysPhe: 3.075 ± 0.775
5.348LysGly: 5.348 ± 0.921
1.337LysHis: 1.337 ± 0.589
6.15LysIle: 6.15 ± 1.083
8.422LysLys: 8.422 ± 1.669
8.556LysLeu: 8.556 ± 1.519
3.61LysMet: 3.61 ± 0.67
5.882LysAsn: 5.882 ± 1.186
1.471LysPro: 1.471 ± 0.465
4.545LysGln: 4.545 ± 0.713
3.877LysArg: 3.877 ± 0.871
5.214LysSer: 5.214 ± 0.905
4.679LysThr: 4.679 ± 0.799
6.417LysVal: 6.417 ± 1.043
0.802LysTrp: 0.802 ± 0.287
2.674LysTyr: 2.674 ± 0.913
0.0LysXaa: 0.0 ± 0.0
Leu
4.679LeuAla: 4.679 ± 0.877
0.535LeuCys: 0.535 ± 0.255
5.214LeuAsp: 5.214 ± 0.749
7.086LeuGlu: 7.086 ± 0.983
2.54LeuPhe: 2.54 ± 0.468
6.15LeuGly: 6.15 ± 1.044
0.668LeuHis: 0.668 ± 0.378
7.353LeuIle: 7.353 ± 2.139
7.353LeuLys: 7.353 ± 1.016
5.882LeuLeu: 5.882 ± 1.212
2.406LeuMet: 2.406 ± 0.485
5.615LeuAsn: 5.615 ± 0.976
4.144LeuPro: 4.144 ± 1.592
4.412LeuGln: 4.412 ± 1.155
4.144LeuArg: 4.144 ± 0.84
4.011LeuSer: 4.011 ± 0.562
4.144LeuThr: 4.144 ± 0.603
3.61LeuVal: 3.61 ± 0.637
0.267LeuTrp: 0.267 ± 0.191
2.139LeuTyr: 2.139 ± 0.49
0.0LeuXaa: 0.0 ± 0.0
Met
2.406MetAla: 2.406 ± 0.643
0.134MetCys: 0.134 ± 0.132
2.273MetAsp: 2.273 ± 0.568
2.406MetGlu: 2.406 ± 0.415
0.535MetPhe: 0.535 ± 0.174
1.471MetGly: 1.471 ± 0.52
0.401MetHis: 0.401 ± 0.237
1.738MetIle: 1.738 ± 0.652
3.075MetLys: 3.075 ± 0.774
2.273MetLeu: 2.273 ± 0.613
0.936MetMet: 0.936 ± 0.338
2.139MetAsn: 2.139 ± 0.402
1.203MetPro: 1.203 ± 0.408
0.535MetGln: 0.535 ± 0.249
1.604MetArg: 1.604 ± 0.453
1.203MetSer: 1.203 ± 0.403
1.604MetThr: 1.604 ± 0.462
1.872MetVal: 1.872 ± 0.408
0.134MetTrp: 0.134 ± 0.112
1.203MetTyr: 1.203 ± 0.423
0.0MetXaa: 0.0 ± 0.0
Asn
4.278AsnAla: 4.278 ± 1.061
0.535AsnCys: 0.535 ± 0.255
4.278AsnAsp: 4.278 ± 0.659
5.348AsnGlu: 5.348 ± 0.777
1.471AsnPhe: 1.471 ± 0.521
5.615AsnGly: 5.615 ± 1.379
0.668AsnHis: 0.668 ± 0.314
3.61AsnIle: 3.61 ± 0.595
5.615AsnLys: 5.615 ± 0.976
5.882AsnLeu: 5.882 ± 1.089
0.802AsnMet: 0.802 ± 0.226
4.011AsnAsn: 4.011 ± 0.957
2.406AsnPro: 2.406 ± 0.645
2.54AsnGln: 2.54 ± 0.438
2.273AsnArg: 2.273 ± 0.685
3.209AsnSer: 3.209 ± 0.705
2.273AsnThr: 2.273 ± 0.642
3.342AsnVal: 3.342 ± 0.603
0.401AsnTrp: 0.401 ± 0.218
2.54AsnTyr: 2.54 ± 0.583
0.0AsnXaa: 0.0 ± 0.0
Pro
2.273ProAla: 2.273 ± 0.598
0.267ProCys: 0.267 ± 0.171
1.604ProAsp: 1.604 ± 0.551
1.337ProGlu: 1.337 ± 0.287
1.337ProPhe: 1.337 ± 0.414
1.471ProGly: 1.471 ± 0.487
0.267ProHis: 0.267 ± 0.188
2.005ProIle: 2.005 ± 0.452
3.342ProLys: 3.342 ± 0.61
2.406ProLeu: 2.406 ± 0.626
0.936ProMet: 0.936 ± 0.401
1.604ProAsn: 1.604 ± 0.428
0.535ProPro: 0.535 ± 0.268
1.203ProGln: 1.203 ± 0.534
1.07ProArg: 1.07 ± 0.361
1.471ProSer: 1.471 ± 0.352
1.872ProThr: 1.872 ± 0.645
2.406ProVal: 2.406 ± 0.634
0.0ProTrp: 0.0 ± 0.0
0.668ProTyr: 0.668 ± 0.317
0.0ProXaa: 0.0 ± 0.0
Gln
3.209GlnAla: 3.209 ± 0.976
0.401GlnCys: 0.401 ± 0.236
2.139GlnAsp: 2.139 ± 0.577
3.476GlnGlu: 3.476 ± 0.918
1.604GlnPhe: 1.604 ± 0.392
1.738GlnGly: 1.738 ± 0.553
0.668GlnHis: 0.668 ± 0.301
2.273GlnIle: 2.273 ± 0.581
3.476GlnLys: 3.476 ± 0.784
4.813GlnLeu: 4.813 ± 1.245
1.604GlnMet: 1.604 ± 0.406
2.674GlnAsn: 2.674 ± 0.716
2.005GlnPro: 2.005 ± 0.506
2.005GlnGln: 2.005 ± 0.764
2.139GlnArg: 2.139 ± 0.498
2.005GlnSer: 2.005 ± 0.361
1.738GlnThr: 1.738 ± 0.505
1.872GlnVal: 1.872 ± 0.489
0.401GlnTrp: 0.401 ± 0.212
1.872GlnTyr: 1.872 ± 0.477
0.0GlnXaa: 0.0 ± 0.0
Arg
2.139ArgAla: 2.139 ± 0.67
0.668ArgCys: 0.668 ± 0.316
3.877ArgAsp: 3.877 ± 0.605
2.54ArgGlu: 2.54 ± 0.653
1.471ArgPhe: 1.471 ± 0.291
2.54ArgGly: 2.54 ± 0.567
0.401ArgHis: 0.401 ± 0.212
3.877ArgIle: 3.877 ± 1.101
3.61ArgLys: 3.61 ± 0.965
3.209ArgLeu: 3.209 ± 0.643
1.07ArgMet: 1.07 ± 0.287
3.342ArgAsn: 3.342 ± 0.545
1.07ArgPro: 1.07 ± 0.385
1.604ArgGln: 1.604 ± 0.583
2.005ArgArg: 2.005 ± 0.608
2.406ArgSer: 2.406 ± 0.623
1.872ArgThr: 1.872 ± 0.362
2.273ArgVal: 2.273 ± 0.908
0.802ArgTrp: 0.802 ± 0.326
2.54ArgTyr: 2.54 ± 0.655
0.0ArgXaa: 0.0 ± 0.0
Ser
4.278SerAla: 4.278 ± 0.597
0.668SerCys: 0.668 ± 0.377
2.54SerAsp: 2.54 ± 0.586
3.209SerGlu: 3.209 ± 0.778
3.075SerPhe: 3.075 ± 0.616
4.545SerGly: 4.545 ± 0.915
0.401SerHis: 0.401 ± 0.219
3.342SerIle: 3.342 ± 0.583
5.481SerLys: 5.481 ± 0.862
3.743SerLeu: 3.743 ± 0.623
2.273SerMet: 2.273 ± 0.415
3.209SerAsn: 3.209 ± 0.491
0.936SerPro: 0.936 ± 0.413
2.139SerGln: 2.139 ± 0.447
2.273SerArg: 2.273 ± 0.427
2.674SerSer: 2.674 ± 0.561
3.61SerThr: 3.61 ± 0.758
2.54SerVal: 2.54 ± 0.484
0.668SerTrp: 0.668 ± 0.272
2.139SerTyr: 2.139 ± 0.489
0.0SerXaa: 0.0 ± 0.0
Thr
4.278ThrAla: 4.278 ± 0.845
0.535ThrCys: 0.535 ± 0.386
2.273ThrAsp: 2.273 ± 0.55
3.61ThrGlu: 3.61 ± 0.669
2.139ThrPhe: 2.139 ± 0.34
3.075ThrGly: 3.075 ± 0.688
1.203ThrHis: 1.203 ± 0.326
4.011ThrIle: 4.011 ± 0.924
4.278ThrLys: 4.278 ± 0.725
4.144ThrLeu: 4.144 ± 1.255
1.337ThrMet: 1.337 ± 0.434
3.877ThrAsn: 3.877 ± 0.73
1.337ThrPro: 1.337 ± 0.573
1.604ThrGln: 1.604 ± 0.393
1.604ThrArg: 1.604 ± 0.493
3.209ThrSer: 3.209 ± 0.776
2.941ThrThr: 2.941 ± 0.647
3.209ThrVal: 3.209 ± 0.706
0.802ThrTrp: 0.802 ± 0.413
1.604ThrTyr: 1.604 ± 0.364
0.0ThrXaa: 0.0 ± 0.0
Val
3.209ValAla: 3.209 ± 0.609
0.668ValCys: 0.668 ± 0.266
4.545ValAsp: 4.545 ± 0.602
4.545ValGlu: 4.545 ± 0.72
3.075ValPhe: 3.075 ± 0.496
4.011ValGly: 4.011 ± 1.045
0.668ValHis: 0.668 ± 0.381
6.283ValIle: 6.283 ± 0.928
7.086ValLys: 7.086 ± 1.131
4.144ValLeu: 4.144 ± 0.96
1.337ValMet: 1.337 ± 0.465
2.406ValAsn: 2.406 ± 0.519
1.872ValPro: 1.872 ± 0.333
2.941ValGln: 2.941 ± 0.702
3.342ValArg: 3.342 ± 0.721
4.278ValSer: 4.278 ± 0.686
3.476ValThr: 3.476 ± 0.695
4.011ValVal: 4.011 ± 0.723
0.802ValTrp: 0.802 ± 0.342
2.54ValTyr: 2.54 ± 0.573
0.0ValXaa: 0.0 ± 0.0
Trp
0.802TrpAla: 0.802 ± 0.262
0.134TrpCys: 0.134 ± 0.121
0.401TrpAsp: 0.401 ± 0.314
0.668TrpGlu: 0.668 ± 0.32
0.668TrpPhe: 0.668 ± 0.243
0.535TrpGly: 0.535 ± 0.277
0.134TrpHis: 0.134 ± 0.119
0.802TrpIle: 0.802 ± 0.343
0.668TrpLys: 0.668 ± 0.334
0.802TrpLeu: 0.802 ± 0.249
0.535TrpMet: 0.535 ± 0.2
0.802TrpAsn: 0.802 ± 0.322
0.134TrpPro: 0.134 ± 0.133
0.668TrpGln: 0.668 ± 0.23
0.535TrpArg: 0.535 ± 0.271
0.535TrpSer: 0.535 ± 0.366
0.267TrpThr: 0.267 ± 0.197
0.401TrpVal: 0.401 ± 0.208
0.267TrpTrp: 0.267 ± 0.177
0.134TrpTyr: 0.134 ± 0.112
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.738TyrAla: 1.738 ± 0.53
0.267TyrCys: 0.267 ± 0.183
1.738TyrAsp: 1.738 ± 0.645
2.406TyrGlu: 2.406 ± 0.42
2.005TyrPhe: 2.005 ± 0.427
3.877TyrGly: 3.877 ± 0.872
0.668TyrHis: 0.668 ± 0.266
2.807TyrIle: 2.807 ± 0.562
4.679TyrLys: 4.679 ± 1.113
2.941TyrLeu: 2.941 ± 0.707
1.07TyrMet: 1.07 ± 0.407
1.738TyrAsn: 1.738 ± 0.262
1.07TyrPro: 1.07 ± 0.41
1.203TyrGln: 1.203 ± 0.303
1.471TyrArg: 1.471 ± 0.57
1.604TyrSer: 1.604 ± 0.573
1.337TyrThr: 1.337 ± 0.522
2.941TyrVal: 2.941 ± 0.646
0.535TyrTrp: 0.535 ± 0.246
1.07TyrTyr: 1.07 ± 0.328
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 30 proteins (7481 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski