Amino acid dipeptide frequency for Bacillus phage Slash

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
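As a rough illustration of how such dipeptide frequencies can be computed, the following Python sketch counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input alphabet, the mapping of non-standard residues to `X` (Xaa) and the choice of the total number of dipeptide windows as denominator are all assumptions made for this example; the page does not state which counting convention or denominator the database uses.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Hypothetical helper: normalises by the total number of dipeptide
    windows, which is an assumption rather than the database's documented method.
    """
    counts = Counter()
    for seq in proteins:
        # Normalise case and collapse non-standard residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Two toy sequences: 7 dipeptide windows in total, "MK" occurs twice.
freqs = dipeptide_frequencies(["MKKLA", "MKXA"])
print(round(freqs["MK"], 1))  # 285.7
```

The returned dictionary always has 441 keys (21 × 21), so dipeptides that never occur are reported as 0.0, matching the layout of the table on this page.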

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
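The arithmetic behind the expected value can be sketched in a few lines of Python. The helper names `expected_per_mille` and `classify` are hypothetical, and using the uniform expectation as the threshold for over- and underrepresentation is an assumption based on the paragraph above; the page does not specify the exact rule used for the colouring.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


print(expected_per_mille(20))            # 2.5  (= 0.25 %)
print(round(expected_per_mille(21), 3))  # 2.268 when Xaa is included
print(classify(8.627))                   # LysLys from the table: over
print(classify(0.084))                   # CysCys from the table: under
```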

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain a fraction (for example, 4.942‰ = 0.004942, or about 0.49%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.942 ± 0.572
AlaCys: 0.461 ± 0.147
AlaAsp: 3.937 ± 0.589
AlaGlu: 4.774 ± 0.46
AlaPhe: 2.638 ± 0.493
AlaGly: 4.146 ± 0.569
AlaHis: 1.047 ± 0.234
AlaIle: 4.062 ± 0.54
AlaLys: 5.696 ± 0.734
AlaLeu: 4.691 ± 0.433
AlaMet: 1.759 ± 0.463
AlaAsn: 4.062 ± 0.445
AlaPro: 1.424 ± 0.304
AlaGln: 2.555 ± 0.512
AlaArg: 2.052 ± 0.322
AlaSer: 3.685 ± 0.461
AlaThr: 4.188 ± 0.698
AlaVal: 4.146 ± 0.481
AlaTrp: 0.586 ± 0.12
AlaTyr: 2.848 ± 0.323
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.293 ± 0.101
CysCys: 0.084 ± 0.063
CysAsp: 0.879 ± 0.22
CysGlu: 0.586 ± 0.194
CysPhe: 0.461 ± 0.135
CysGly: 0.754 ± 0.219
CysHis: 0.168 ± 0.086
CysIle: 0.586 ± 0.149
CysLys: 0.628 ± 0.181
CysLeu: 0.586 ± 0.153
CysMet: 0.084 ± 0.073
CysAsn: 0.754 ± 0.205
CysPro: 0.419 ± 0.139
CysGln: 0.251 ± 0.099
CysArg: 0.293 ± 0.111
CysSer: 0.419 ± 0.168
CysThr: 0.544 ± 0.171
CysVal: 0.628 ± 0.17
CysTrp: 0.084 ± 0.052
CysTyr: 0.335 ± 0.132
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.806 ± 0.39
AspCys: 0.712 ± 0.166
AspAsp: 3.602 ± 0.384
AspGlu: 4.607 ± 0.596
AspPhe: 3.057 ± 0.309
AspGly: 4.691 ± 0.524
AspHis: 0.796 ± 0.215
AspIle: 4.565 ± 0.41
AspLys: 5.444 ± 0.471
AspLeu: 5.57 ± 0.515
AspMet: 2.094 ± 0.265
AspAsn: 3.099 ± 0.372
AspPro: 1.759 ± 0.253
AspGln: 2.303 ± 0.443
AspArg: 2.22 ± 0.379
AspSer: 3.602 ± 0.573
AspThr: 3.267 ± 0.429
AspVal: 4.732 ± 0.423
AspTrp: 0.67 ± 0.161
AspTyr: 2.932 ± 0.429
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.853 ± 0.48
GluCys: 0.838 ± 0.198
GluAsp: 3.602 ± 0.449
GluGlu: 7.078 ± 0.729
GluPhe: 3.434 ± 0.471
GluGly: 4.565 ± 0.492
GluHis: 1.173 ± 0.222
GluIle: 4.146 ± 0.459
GluLys: 6.491 ± 0.645
GluLeu: 6.282 ± 0.59
GluMet: 3.057 ± 0.408
GluAsn: 5.026 ± 0.495
GluPro: 1.298 ± 0.254
GluGln: 3.057 ± 0.318
GluArg: 3.141 ± 0.491
GluSer: 3.937 ± 0.334
GluThr: 4.272 ± 0.464
GluVal: 5.528 ± 0.724
GluTrp: 1.173 ± 0.237
GluTyr: 2.848 ± 0.352
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.722 ± 0.319
PheCys: 0.419 ± 0.151
PheAsp: 3.853 ± 0.363
PheGlu: 3.392 ± 0.34
PhePhe: 1.717 ± 0.303
PheGly: 3.141 ± 0.426
PheHis: 0.67 ± 0.211
PheIle: 2.555 ± 0.383
PheLys: 3.853 ± 0.283
PheLeu: 3.434 ± 0.549
PheMet: 1.005 ± 0.228
PheAsn: 3.308 ± 0.369
PhePro: 1.675 ± 0.296
PheGln: 1.382 ± 0.245
PheArg: 1.591 ± 0.231
PheSer: 2.764 ± 0.358
PheThr: 3.267 ± 0.396
PheVal: 2.89 ± 0.388
PheTrp: 0.377 ± 0.142
PheTyr: 1.633 ± 0.267
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.518 ± 0.419
GlyCys: 0.419 ± 0.155
GlyAsp: 3.183 ± 0.406
GlyGlu: 4.062 ± 0.425
GlyPhe: 3.057 ± 0.573
GlyGly: 4.188 ± 0.577
GlyHis: 0.921 ± 0.162
GlyIle: 4.607 ± 0.526
GlyLys: 5.528 ± 0.506
GlyLeu: 5.444 ± 0.528
GlyMet: 1.926 ± 0.276
GlyAsn: 3.35 ± 0.425
GlyPro: 0.377 ± 0.143
GlyGln: 3.015 ± 0.468
GlyArg: 2.094 ± 0.327
GlySer: 4.691 ± 0.47
GlyThr: 3.895 ± 0.466
GlyVal: 4.314 ± 0.383
GlyTrp: 0.796 ± 0.162
GlyTyr: 3.644 ± 0.374
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.921 ± 0.136
HisCys: 0.084 ± 0.057
HisAsp: 1.215 ± 0.241
HisGlu: 1.466 ± 0.292
HisPhe: 1.089 ± 0.231
HisGly: 0.67 ± 0.207
HisHis: 0.628 ± 0.253
HisIle: 1.215 ± 0.251
HisLys: 1.382 ± 0.282
HisLeu: 1.591 ± 0.299
HisMet: 0.251 ± 0.095
HisAsn: 0.503 ± 0.156
HisPro: 0.377 ± 0.135
HisGln: 0.712 ± 0.153
HisArg: 0.796 ± 0.216
HisSer: 0.754 ± 0.171
HisThr: 1.089 ± 0.241
HisVal: 0.879 ± 0.164
HisTrp: 0.168 ± 0.082
HisTyr: 0.879 ± 0.163
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.816 ± 0.465
IleCys: 0.712 ± 0.182
IleAsp: 5.277 ± 0.505
IleGlu: 4.942 ± 0.529
IlePhe: 2.68 ± 0.344
IleGly: 4.062 ± 0.472
IleHis: 1.131 ± 0.27
IleIle: 3.769 ± 0.391
IleLys: 5.947 ± 0.452
IleLeu: 3.853 ± 0.361
IleMet: 1.759 ± 0.339
IleAsn: 3.937 ± 0.408
IlePro: 1.885 ± 0.255
IleGln: 2.22 ± 0.317
IleArg: 2.932 ± 0.379
IleSer: 3.727 ± 0.377
IleThr: 4.314 ± 0.398
IleVal: 4.481 ± 0.494
IleTrp: 0.586 ± 0.168
IleTyr: 1.926 ± 0.29
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.114 ± 0.851
LysCys: 0.796 ± 0.214
LysAsp: 5.193 ± 0.476
LysGlu: 7.245 ± 0.658
LysPhe: 3.602 ± 0.359
LysGly: 5.361 ± 0.415
LysHis: 1.55 ± 0.299
LysIle: 5.193 ± 0.624
LysLys: 8.627 ± 0.79
LysLeu: 7.161 ± 0.627
LysMet: 2.89 ± 0.329
LysAsn: 5.486 ± 0.385
LysPro: 2.764 ± 0.418
LysGln: 2.973 ± 0.39
LysArg: 3.937 ± 0.414
LysSer: 4.146 ± 0.494
LysThr: 5.612 ± 0.516
LysVal: 6.282 ± 0.482
LysTrp: 0.921 ± 0.203
LysTyr: 3.099 ± 0.302
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.57 ± 0.701
LeuCys: 0.586 ± 0.175
LeuAsp: 5.737 ± 0.528
LeuGlu: 6.617 ± 0.732
LeuPhe: 2.638 ± 0.314
LeuGly: 4.314 ± 0.375
LeuHis: 1.173 ± 0.178
LeuIle: 4.188 ± 0.512
LeuLys: 7.245 ± 0.619
LeuLeu: 5.151 ± 0.393
LeuMet: 2.178 ± 0.306
LeuAsn: 4.732 ± 0.466
LeuPro: 2.345 ± 0.332
LeuGln: 2.89 ± 0.384
LeuArg: 3.141 ± 0.356
LeuSer: 4.9 ± 0.406
LeuThr: 4.649 ± 0.46
LeuVal: 4.565 ± 0.46
LeuTrp: 0.879 ± 0.22
LeuTyr: 2.806 ± 0.362
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.717 ± 0.272
MetCys: 0.168 ± 0.098
MetAsp: 1.801 ± 0.297
MetGlu: 2.597 ± 0.336
MetPhe: 1.466 ± 0.262
MetGly: 1.885 ± 0.504
MetHis: 0.461 ± 0.135
MetIle: 2.638 ± 0.34
MetLys: 2.68 ± 0.439
MetLeu: 2.303 ± 0.334
MetMet: 1.047 ± 0.223
MetAsn: 2.22 ± 0.268
MetPro: 1.047 ± 0.247
MetGln: 1.089 ± 0.258
MetArg: 1.005 ± 0.219
MetSer: 2.094 ± 0.253
MetThr: 1.926 ± 0.348
MetVal: 1.466 ± 0.28
MetTrp: 0.251 ± 0.092
MetTyr: 0.586 ± 0.206
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.644 ± 0.557
AsnCys: 0.503 ± 0.175
AsnAsp: 4.02 ± 0.391
AsnGlu: 4.104 ± 0.441
AsnPhe: 2.597 ± 0.351
AsnGly: 4.816 ± 0.5
AsnHis: 0.921 ± 0.212
AsnIle: 3.141 ± 0.326
AsnLys: 5.821 ± 0.601
AsnLeu: 4.355 ± 0.385
AsnMet: 1.591 ± 0.285
AsnAsn: 3.392 ± 0.395
AsnPro: 2.68 ± 0.363
AsnGln: 2.261 ± 0.42
AsnArg: 2.848 ± 0.35
AsnSer: 3.183 ± 0.426
AsnThr: 3.685 ± 0.544
AsnVal: 3.811 ± 0.493
AsnTrp: 0.461 ± 0.139
AsnTyr: 2.22 ± 0.372
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.717 ± 0.27
ProCys: 0.168 ± 0.089
ProAsp: 1.34 ± 0.298
ProGlu: 1.885 ± 0.291
ProPhe: 1.508 ± 0.255
ProGly: 1.298 ± 0.222
ProHis: 0.586 ± 0.15
ProIle: 2.303 ± 0.338
ProLys: 2.597 ± 0.363
ProLeu: 1.759 ± 0.249
ProMet: 0.796 ± 0.204
ProAsn: 1.508 ± 0.278
ProPro: 0.544 ± 0.142
ProGln: 1.089 ± 0.213
ProArg: 0.838 ± 0.183
ProSer: 2.261 ± 0.344
ProThr: 1.885 ± 0.32
ProVal: 1.466 ± 0.324
ProTrp: 0.084 ± 0.068
ProTyr: 1.424 ± 0.261
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.267 ± 0.706
GlnCys: 0.335 ± 0.139
GlnAsp: 1.759 ± 0.217
GlnGlu: 2.471 ± 0.455
GlnPhe: 1.885 ± 0.353
GlnGly: 2.136 ± 0.434
GlnHis: 0.503 ± 0.12
GlnIle: 2.68 ± 0.479
GlnLys: 3.35 ± 0.351
GlnLeu: 3.853 ± 0.53
GlnMet: 1.55 ± 0.378
GlnAsn: 2.178 ± 0.509
GlnPro: 1.089 ± 0.17
GlnGln: 2.806 ± 0.605
GlnArg: 1.843 ± 0.258
GlnSer: 2.052 ± 0.439
GlnThr: 1.717 ± 0.327
GlnVal: 2.932 ± 0.357
GlnTrp: 0.335 ± 0.098
GlnTyr: 1.173 ± 0.231
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.843 ± 0.321
ArgCys: 0.293 ± 0.119
ArgAsp: 2.261 ± 0.315
ArgGlu: 2.513 ± 0.378
ArgPhe: 2.429 ± 0.381
ArgGly: 2.094 ± 0.34
ArgHis: 0.586 ± 0.144
ArgIle: 3.434 ± 0.394
ArgLys: 4.649 ± 0.609
ArgLeu: 2.597 ± 0.391
ArgMet: 1.717 ± 0.323
ArgAsn: 2.178 ± 0.351
ArgPro: 1.089 ± 0.211
ArgGln: 1.34 ± 0.254
ArgArg: 1.633 ± 0.338
ArgSer: 1.885 ± 0.314
ArgThr: 2.806 ± 0.353
ArgVal: 2.68 ± 0.329
ArgTrp: 0.628 ± 0.182
ArgTyr: 1.256 ± 0.263
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.644 ± 0.456
SerCys: 0.544 ± 0.191
SerAsp: 3.308 ± 0.338
SerGlu: 4.188 ± 0.419
SerPhe: 2.722 ± 0.377
SerGly: 4.272 ± 0.571
SerHis: 1.047 ± 0.265
SerIle: 3.769 ± 0.447
SerLys: 4.146 ± 0.545
SerLeu: 4.565 ± 0.533
SerMet: 2.178 ± 0.339
SerAsn: 3.434 ± 0.498
SerPro: 1.466 ± 0.272
SerGln: 2.597 ± 0.558
SerArg: 1.717 ± 0.229
SerSer: 3.811 ± 0.426
SerThr: 3.685 ± 0.545
SerVal: 3.727 ± 0.341
SerTrp: 0.67 ± 0.154
SerTyr: 3.141 ± 0.381
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.774 ± 0.946
ThrCys: 0.419 ± 0.156
ThrAsp: 3.727 ± 0.409
ThrGlu: 3.769 ± 0.382
ThrPhe: 3.56 ± 0.443
ThrGly: 3.518 ± 0.412
ThrHis: 1.005 ± 0.243
ThrIle: 3.727 ± 0.494
ThrLys: 4.481 ± 0.458
ThrLeu: 4.858 ± 0.427
ThrMet: 1.466 ± 0.272
ThrAsn: 3.979 ± 0.369
ThrPro: 2.052 ± 0.265
ThrGln: 2.429 ± 0.431
ThrArg: 2.052 ± 0.267
ThrSer: 3.769 ± 0.45
ThrThr: 3.769 ± 0.48
ThrVal: 4.355 ± 0.378
ThrTrp: 0.754 ± 0.135
ThrTyr: 2.429 ± 0.303
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.811 ± 0.423
ValCys: 0.503 ± 0.175
ValAsp: 4.23 ± 0.489
ValGlu: 5.109 ± 0.536
ValPhe: 2.932 ± 0.33
ValGly: 4.104 ± 0.54
ValHis: 1.131 ± 0.194
ValIle: 4.732 ± 0.472
ValLys: 6.24 ± 0.507
ValLeu: 4.23 ± 0.361
ValMet: 1.926 ± 0.371
ValAsn: 3.895 ± 0.353
ValPro: 1.885 ± 0.206
ValGln: 3.225 ± 0.411
ValArg: 3.183 ± 0.348
ValSer: 4.062 ± 0.424
ValThr: 3.434 ± 0.309
ValVal: 4.355 ± 0.366
ValTrp: 0.628 ± 0.158
ValTyr: 3.015 ± 0.413
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.503 ± 0.148
TrpCys: 0.126 ± 0.075
TrpAsp: 0.586 ± 0.14
TrpGlu: 0.67 ± 0.196
TrpPhe: 0.503 ± 0.181
TrpGly: 0.712 ± 0.177
TrpHis: 0.419 ± 0.139
TrpIle: 0.963 ± 0.194
TrpLys: 0.879 ± 0.17
TrpLeu: 0.838 ± 0.199
TrpMet: 0.251 ± 0.124
TrpAsn: 0.377 ± 0.151
TrpPro: 0.168 ± 0.094
TrpGln: 0.419 ± 0.192
TrpArg: 0.712 ± 0.176
TrpSer: 0.628 ± 0.125
TrpThr: 0.712 ± 0.18
TrpVal: 0.503 ± 0.135
TrpTrp: 0.084 ± 0.059
TrpTyr: 0.544 ± 0.123
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.225 ± 0.369
TyrCys: 0.67 ± 0.191
TyrAsp: 3.015 ± 0.358
TyrGlu: 2.722 ± 0.397
TyrPhe: 1.675 ± 0.261
TyrGly: 2.303 ± 0.278
TyrHis: 0.754 ± 0.163
TyrIle: 2.68 ± 0.406
TyrLys: 3.267 ± 0.37
TyrLeu: 3.225 ± 0.352
TyrMet: 0.879 ± 0.19
TyrAsn: 2.638 ± 0.367
TyrPro: 0.712 ± 0.184
TyrGln: 1.298 ± 0.202
TyrArg: 1.885 ± 0.359
TyrSer: 2.261 ± 0.289
TyrThr: 2.178 ± 0.335
TyrVal: 2.89 ± 0.378
TyrTrp: 0.461 ± 0.104
TyrTyr: 1.717 ± 0.33
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 111 proteins (23879 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
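A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the replicate values is taken as the error. The function names, the fixed random seed, the denominator (total dipeptide windows) and the use of the sample standard deviation as the error measure are assumptions for illustration; the page only states that 100 bootstrap rounds at the protein level were used.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille, pooled over proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


toy_proteome = ["MKKLA", "MKXA", "AAKKG", "MLLKK"]
print(bootstrap_error(toy_proteome, "KK") >= 0.0)  # True
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.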

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski