Amino acid dipeptide frequency for Bacillus phage BCD7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the table marks dipeptides that are more frequent than expected in red and those that are underrepresented in blue.
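The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration: the function name `classify` and the strict greater-than/less-than threshold are assumptions, and the page may apply a different rule when colouring cells. The example values are taken from the table.

```python
# Uniform expectation: each of the 20 x 20 standard dipeptides equally likely.
N_STANDARD = 20
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25 %


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide frequency (in per mille) relative to the expectation.

    Hypothetical helper: a plain comparison with no tolerance band.
    """
    if freq_permille > expected:
        return "over"    # would be marked red
    if freq_permille < expected:
        return "under"   # would be marked blue
    return "expected"


# A few values copied from the table (per mille).
examples = {"GluGlu": 9.796, "AlaAla": 0.641, "TrpPro": 0.0}
labels = {dp: classify(value) for dp, value in examples.items()}

if __name__ == "__main__":
    for dp, label in labels.items():
        print(dp, examples[dp], label)
```

Note that this baseline ignores the unequal abundance of single amino acids, so a rare residue such as Trp will tend to look underrepresented in every pair.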

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
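As a minimal sketch of how such per mille values could be obtained, the following Python counts overlapping residue pairs within each protein and normalises by the total number of pairs. The function names, the use of one-letter codes, and the choice not to count pairs across protein boundaries are assumptions made for illustration, not a description of the pipeline behind this page.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumes one-letter amino acid codes and overlapping windows that never
    span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


if __name__ == "__main__":
    toy = ["AAAK"]  # dipeptides: AA, AA, AK
    print(dipeptide_permille(toy))
    print(permille_to_fraction(6.768))  # AlaGlu value from the table
```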

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.641 ± 0.21
AlaCys: 0.249 ± 0.098
AlaAsp: 3.277 ± 0.361
AlaGlu: 6.768 ± 0.576
AlaPhe: 2.066 ± 0.304
AlaGly: 4.666 ± 0.819
AlaHis: 0.962 ± 0.181
AlaIle: 3.883 ± 0.375
AlaLys: 5.628 ± 0.463
AlaLeu: 5.236 ± 0.563
AlaMet: 2.707 ± 0.471
AlaAsn: 3.348 ± 0.322
AlaPro: 2.244 ± 0.317
AlaGln: 3.028 ± 0.471
AlaArg: 3.74 ± 0.447
AlaSer: 3.704 ± 0.515
AlaThr: 3.989 ± 0.45
AlaVal: 3.74 ± 0.423
AlaTrp: 1.211 ± 0.288
AlaTyr: 2.244 ± 0.3
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.499 ± 0.136
CysCys: 0.142 ± 0.074
CysAsp: 0.321 ± 0.113
CysGlu: 0.499 ± 0.128
CysPhe: 0.178 ± 0.08
CysGly: 0.499 ± 0.178
CysHis: 0.178 ± 0.082
CysIle: 0.356 ± 0.123
CysLys: 0.748 ± 0.166
CysLeu: 0.142 ± 0.066
CysMet: 0.285 ± 0.107
CysAsn: 0.463 ± 0.138
CysPro: 0.427 ± 0.127
CysGln: 0.356 ± 0.114
CysArg: 0.249 ± 0.104
CysSer: 0.606 ± 0.134
CysThr: 0.499 ± 0.117
CysVal: 0.356 ± 0.126
CysTrp: 0.036 ± 0.04
CysTyr: 0.463 ± 0.149
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.954 ± 0.392
AspCys: 0.499 ± 0.14
AspAsp: 3.74 ± 0.586
AspGlu: 5.984 ± 0.647
AspPhe: 3.348 ± 0.373
AspGly: 4.524 ± 0.415
AspHis: 0.57 ± 0.147
AspIle: 4.381 ± 0.395
AspLys: 4.31 ± 0.394
AspLeu: 4.096 ± 0.378
AspMet: 1.817 ± 0.297
AspAsn: 2.707 ± 0.327
AspPro: 1.532 ± 0.189
AspGln: 1.389 ± 0.175
AspArg: 2.672 ± 0.294
AspSer: 3.348 ± 0.349
AspThr: 3.562 ± 0.433
AspVal: 3.954 ± 0.353
AspTrp: 0.926 ± 0.144
AspTyr: 2.493 ± 0.304
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.916 ± 0.476
GluCys: 0.606 ± 0.135
GluAsp: 5.628 ± 0.707
GluGlu: 9.796 ± 1.397
GluPhe: 3.811 ± 0.435
GluGly: 5.949 ± 0.472
GluHis: 1.674 ± 0.217
GluIle: 6.055 ± 0.597
GluLys: 7.017 ± 0.518
GluLeu: 7.16 ± 0.513
GluMet: 3.241 ± 0.381
GluAsn: 4.025 ± 0.359
GluPro: 1.674 ± 0.248
GluGln: 3.811 ± 0.376
GluArg: 3.883 ± 0.424
GluSer: 2.885 ± 0.329
GluThr: 4.168 ± 0.437
GluVal: 6.875 ± 0.572
GluTrp: 0.926 ± 0.173
GluTyr: 2.6 ± 0.365
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.315 ± 0.289
PheCys: 0.534 ± 0.14
PheAsp: 3.206 ± 0.381
PheGlu: 3.063 ± 0.368
PhePhe: 1.781 ± 0.257
PheGly: 3.348 ± 0.374
PheHis: 1.033 ± 0.241
PheIle: 1.745 ± 0.281
PheLys: 3.135 ± 0.334
PheLeu: 3.669 ± 0.373
PheMet: 1.175 ± 0.194
PheAsn: 2.208 ± 0.289
PhePro: 1.354 ± 0.219
PheGln: 1.496 ± 0.234
PheArg: 2.387 ± 0.348
PheSer: 2.137 ± 0.265
PheThr: 2.743 ± 0.384
PheVal: 2.636 ± 0.371
PheTrp: 0.534 ± 0.156
PheTyr: 1.282 ± 0.223
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.025 ± 0.672
GlyCys: 0.356 ± 0.122
GlyAsp: 4.274 ± 0.402
GlyGlu: 5.521 ± 0.438
GlyPhe: 2.672 ± 0.39
GlyGly: 4.737 ± 0.808
GlyHis: 1.354 ± 0.225
GlyIle: 4.524 ± 0.389
GlyLys: 6.732 ± 0.551
GlyLeu: 4.916 ± 0.41
GlyMet: 2.244 ± 0.391
GlyAsn: 3.633 ± 0.466
GlyPro: 0.392 ± 0.131
GlyGln: 2.244 ± 0.304
GlyArg: 3.633 ± 0.712
GlySer: 2.992 ± 0.291
GlyThr: 3.954 ± 0.422
GlyVal: 5.628 ± 0.698
GlyTrp: 0.926 ± 0.185
GlyTyr: 2.85 ± 0.343
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.247 ± 0.23
HisCys: 0.285 ± 0.079
HisAsp: 1.425 ± 0.236
HisGlu: 1.14 ± 0.213
HisPhe: 0.819 ± 0.171
HisGly: 1.389 ± 0.192
HisHis: 0.606 ± 0.167
HisIle: 1.674 ± 0.257
HisLys: 1.318 ± 0.244
HisLeu: 1.567 ± 0.308
HisMet: 0.606 ± 0.118
HisAsn: 1.282 ± 0.213
HisPro: 0.962 ± 0.164
HisGln: 0.392 ± 0.118
HisArg: 0.712 ± 0.181
HisSer: 0.748 ± 0.135
HisThr: 1.14 ± 0.216
HisVal: 1.14 ± 0.207
HisTrp: 0.178 ± 0.064
HisTyr: 0.962 ± 0.2
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.094 ± 0.485
IleCys: 0.499 ± 0.139
IleAsp: 4.274 ± 0.385
IleGlu: 6.091 ± 0.612
IlePhe: 2.315 ± 0.329
IleGly: 3.847 ± 0.499
IleHis: 1.211 ± 0.201
IleIle: 4.666 ± 0.465
IleLys: 5.201 ± 0.509
IleLeu: 3.954 ± 0.439
IleMet: 1.745 ± 0.322
IleAsn: 3.135 ± 0.354
IlePro: 2.6 ± 0.269
IleGln: 2.458 ± 0.341
IleArg: 3.241 ± 0.338
IleSer: 3.384 ± 0.306
IleThr: 4.168 ± 0.376
IleVal: 3.954 ± 0.363
IleTrp: 0.712 ± 0.181
IleTyr: 2.28 ± 0.32
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.234 ± 0.468
LysCys: 0.534 ± 0.18
LysAsp: 4.417 ± 0.397
LysGlu: 8.549 ± 0.666
LysPhe: 3.598 ± 0.329
LysGly: 4.274 ± 0.389
LysHis: 1.639 ± 0.245
LysIle: 4.809 ± 0.424
LysLys: 8.834 ± 1.036
LysLeu: 6.483 ± 0.489
LysMet: 3.348 ± 0.378
LysAsn: 3.455 ± 0.315
LysPro: 2.85 ± 0.438
LysGln: 3.918 ± 0.358
LysArg: 4.809 ± 0.48
LysSer: 4.524 ± 0.43
LysThr: 3.989 ± 0.459
LysVal: 6.127 ± 0.43
LysTrp: 0.784 ± 0.163
LysTyr: 3.348 ± 0.351
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.343 ± 0.367
LeuCys: 0.392 ± 0.129
LeuAsp: 4.381 ± 0.457
LeuGlu: 6.162 ± 0.434
LeuPhe: 2.992 ± 0.343
LeuGly: 4.987 ± 0.644
LeuHis: 1.71 ± 0.24
LeuIle: 4.346 ± 0.469
LeuLys: 6.59 ± 0.505
LeuLeu: 5.379 ± 0.427
LeuMet: 2.422 ± 0.288
LeuAsn: 3.206 ± 0.344
LeuPro: 3.526 ± 0.431
LeuGln: 2.778 ± 0.302
LeuArg: 3.526 ± 0.319
LeuSer: 4.025 ± 0.357
LeuThr: 4.096 ± 0.363
LeuVal: 5.022 ± 0.438
LeuTrp: 0.784 ± 0.157
LeuTyr: 2.707 ± 0.37
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.351 ± 0.374
MetCys: 0.178 ± 0.088
MetAsp: 1.674 ± 0.238
MetGlu: 2.707 ± 0.333
MetPhe: 1.318 ± 0.232
MetGly: 2.03 ± 0.327
MetHis: 0.748 ± 0.184
MetIle: 1.674 ± 0.238
MetLys: 3.099 ± 0.313
MetLeu: 2.03 ± 0.255
MetMet: 0.855 ± 0.163
MetAsn: 1.817 ± 0.229
MetPro: 1.282 ± 0.194
MetGln: 1.567 ± 0.245
MetArg: 1.567 ± 0.253
MetSer: 2.244 ± 0.355
MetThr: 2.137 ± 0.254
MetVal: 1.282 ± 0.214
MetTrp: 0.249 ± 0.093
MetTyr: 1.14 ± 0.285
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.384 ± 0.457
AsnCys: 0.499 ± 0.144
AsnAsp: 2.529 ± 0.276
AsnGlu: 3.633 ± 0.434
AsnPhe: 2.102 ± 0.311
AsnGly: 3.918 ± 0.341
AsnHis: 1.104 ± 0.222
AsnIle: 3.704 ± 0.379
AsnLys: 3.847 ± 0.362
AsnLeu: 3.633 ± 0.336
AsnMet: 1.603 ± 0.236
AsnAsn: 2.6 ± 0.288
AsnPro: 2.066 ± 0.238
AsnGln: 1.354 ± 0.176
AsnArg: 2.315 ± 0.273
AsnSer: 2.85 ± 0.363
AsnThr: 3.099 ± 0.402
AsnVal: 3.704 ± 0.359
AsnTrp: 0.534 ± 0.148
AsnTyr: 2.173 ± 0.283
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.351 ± 0.311
ProCys: 0.285 ± 0.098
ProAsp: 1.639 ± 0.274
ProGlu: 2.565 ± 0.37
ProPhe: 1.603 ± 0.2
ProGly: 1.069 ± 0.192
ProHis: 0.748 ± 0.16
ProIle: 1.959 ± 0.278
ProLys: 3.776 ± 0.504
ProLeu: 2.173 ± 0.283
ProMet: 0.962 ± 0.191
ProAsn: 1.389 ± 0.21
ProPro: 1.318 ± 0.277
ProGln: 1.247 ± 0.227
ProArg: 1.425 ± 0.248
ProSer: 1.603 ± 0.275
ProThr: 2.493 ± 0.253
ProVal: 2.315 ± 0.277
ProTrp: 0.178 ± 0.07
ProTyr: 1.567 ± 0.273
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.455 ± 0.687
GlnCys: 0.214 ± 0.077
GlnAsp: 1.923 ± 0.263
GlnGlu: 2.672 ± 0.28
GlnPhe: 1.781 ± 0.267
GlnGly: 2.28 ± 0.31
GlnHis: 0.677 ± 0.156
GlnIle: 2.066 ± 0.28
GlnLys: 2.743 ± 0.32
GlnLeu: 3.669 ± 0.347
GlnMet: 1.318 ± 0.261
GlnAsn: 1.71 ± 0.235
GlnPro: 1.033 ± 0.181
GlnGln: 1.639 ± 0.288
GlnArg: 2.066 ± 0.395
GlnSer: 1.888 ± 0.271
GlnThr: 1.995 ± 0.285
GlnVal: 2.565 ± 0.299
GlnTrp: 0.356 ± 0.132
GlnTyr: 1.532 ± 0.243
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.135 ± 0.676
ArgCys: 0.285 ± 0.114
ArgAsp: 3.063 ± 0.377
ArgGlu: 3.776 ± 0.372
ArgPhe: 2.208 ± 0.322
ArgGly: 3.491 ± 0.38
ArgHis: 0.997 ± 0.189
ArgIle: 3.135 ± 0.315
ArgLys: 4.737 ± 0.505
ArgLeu: 3.954 ± 0.395
ArgMet: 1.425 ± 0.224
ArgAsn: 2.814 ± 0.352
ArgPro: 1.211 ± 0.191
ArgGln: 2.173 ± 0.407
ArgArg: 2.814 ± 0.423
ArgSer: 1.532 ± 0.253
ArgThr: 2.778 ± 0.326
ArgVal: 3.206 ± 0.357
ArgTrp: 0.285 ± 0.086
ArgTyr: 1.781 ± 0.281
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.562 ± 0.468
SerCys: 0.499 ± 0.118
SerAsp: 3.42 ± 0.353
SerGlu: 3.562 ± 0.359
SerPhe: 1.71 ± 0.235
SerGly: 4.595 ± 0.517
SerHis: 0.819 ± 0.158
SerIle: 4.096 ± 0.477
SerLys: 4.488 ± 0.379
SerLeu: 3.526 ± 0.369
SerMet: 1.496 ± 0.267
SerAsn: 2.422 ± 0.272
SerPro: 1.71 ± 0.221
SerGln: 1.211 ± 0.183
SerArg: 1.817 ± 0.259
SerSer: 2.208 ± 0.351
SerThr: 2.885 ± 0.317
SerVal: 3.526 ± 0.362
SerTrp: 0.534 ± 0.165
SerTyr: 1.888 ± 0.203
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.633 ± 0.586
ThrCys: 0.427 ± 0.146
ThrAsp: 3.135 ± 0.359
ThrGlu: 4.346 ± 0.404
ThrPhe: 2.565 ± 0.305
ThrGly: 4.737 ± 0.477
ThrHis: 1.282 ± 0.241
ThrIle: 3.954 ± 0.462
ThrLys: 4.274 ± 0.427
ThrLeu: 4.631 ± 0.354
ThrMet: 1.674 ± 0.252
ThrAsn: 2.672 ± 0.295
ThrPro: 2.422 ± 0.302
ThrGln: 2.315 ± 0.321
ThrArg: 2.672 ± 0.356
ThrSer: 3.277 ± 0.351
ThrThr: 4.025 ± 0.491
ThrVal: 4.025 ± 0.386
ThrTrp: 0.427 ± 0.133
ThrTyr: 2.778 ± 0.375
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.168 ± 0.348
ValCys: 0.356 ± 0.109
ValAsp: 3.989 ± 0.326
ValGlu: 5.129 ± 0.462
ValPhe: 2.992 ± 0.304
ValGly: 3.918 ± 0.439
ValHis: 1.318 ± 0.26
ValIle: 4.666 ± 0.417
ValLys: 5.45 ± 0.446
ValLeu: 4.844 ± 0.381
ValMet: 1.888 ± 0.308
ValAsn: 4.132 ± 0.317
ValPro: 2.814 ± 0.266
ValGln: 2.672 ± 0.335
ValArg: 3.099 ± 0.287
ValSer: 3.42 ± 0.362
ValThr: 4.559 ± 0.374
ValVal: 4.773 ± 0.47
ValTrp: 0.855 ± 0.159
ValTyr: 2.493 ± 0.354
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.499 ± 0.137
TrpCys: 0.071 ± 0.046
TrpAsp: 1.14 ± 0.221
TrpGlu: 1.46 ± 0.266
TrpPhe: 0.534 ± 0.14
TrpGly: 0.677 ± 0.16
TrpHis: 0.107 ± 0.063
TrpIle: 0.784 ± 0.176
TrpLys: 0.997 ± 0.19
TrpLeu: 0.712 ± 0.177
TrpMet: 0.107 ± 0.066
TrpAsn: 0.997 ± 0.253
TrpPro: 0.0 ± 0.0
TrpGln: 0.285 ± 0.101
TrpArg: 0.427 ± 0.127
TrpSer: 0.57 ± 0.141
TrpThr: 0.748 ± 0.143
TrpVal: 0.463 ± 0.11
TrpTrp: 0.107 ± 0.067
TrpTyr: 0.356 ± 0.102
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.529 ± 0.293
TyrCys: 0.392 ± 0.117
TyrAsp: 2.351 ± 0.276
TyrGlu: 3.277 ± 0.324
TyrPhe: 1.389 ± 0.218
TyrGly: 2.672 ± 0.36
TyrHis: 0.855 ± 0.154
TyrIle: 2.351 ± 0.336
TyrLys: 3.633 ± 0.427
TyrLeu: 2.6 ± 0.447
TyrMet: 0.997 ± 0.179
TyrAsn: 2.636 ± 0.345
TyrPro: 1.247 ± 0.237
TyrGln: 1.247 ± 0.181
TyrArg: 1.745 ± 0.289
TyrSer: 2.03 ± 0.278
TyrThr: 2.244 ± 0.338
TyrVal: 2.244 ± 0.274
TyrTrp: 0.499 ± 0.136
TyrTyr: 1.745 ± 0.276
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 140 proteins (28075 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
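A protein-level bootstrap of this kind can be sketched as follows. This is an illustrative reading of the note, not the database's actual code: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each resample, and that the reported error is the standard deviation over resamples. The function name `bootstrap_error` and its parameters are hypothetical.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Assumption: proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    # Count the dipeptides of each protein once; resamples reuse these counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["AAAA", "KKKK", "AKAK", "AAKK"]
    print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins keeps the dipeptides of one protein together, so the error reflects variation between proteins rather than treating every dipeptide as an independent observation.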

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski