Amino acid dipeptide frequency for Brevibacillus phage Osiris

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
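As an illustration of what such a calculation might look like, here is a minimal Python sketch. It assumes one-letter sequences, overlapping pairs counted within each protein, and normalization by the total number of dipeptides; the function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the database's actual procedure may differ.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of two residues along each sequence;
        # pairs never span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_per_mille(["MKKLE", "AKK"])
for dipeptide, value in sorted(freqs.items()):
    print(f"{dipeptide}: {value:.3f}")
```

With this normalization the frequencies of all observed dipeptides sum to 1000 ‰.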

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
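The unit conversion and the comparison with the uniform expectation can be sketched as follows. This is only an illustration: it assumes a plain comparison against 1/400 (2.5 ‰), whereas the rule actually used for the red and blue marking is not stated here. The helper names are hypothetical; the two example values are the LysLys and HisCys entries from the table.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PER_MILLE):
    """Classify a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


lys_lys = 8.174  # LysLys, per mille
his_cys = 0.062  # HisCys, per mille

print(per_mille_to_fraction(lys_lys), representation(lys_lys))
print(per_mille_to_fraction(his_cys), representation(his_cys))
```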

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.309 ± 0.364
AlaCys: 0.811 ± 0.246
AlaAsp: 3.806 ± 0.47
AlaGlu: 5.179 ± 0.583
AlaPhe: 2.87 ± 0.371
AlaGly: 3.12 ± 0.425
AlaHis: 0.749 ± 0.241
AlaIle: 4.68 ± 0.559
AlaLys: 5.616 ± 0.501
AlaLeu: 4.742 ± 0.558
AlaMet: 2.059 ± 0.392
AlaAsn: 2.995 ± 0.489
AlaPro: 1.747 ± 0.526
AlaGln: 2.434 ± 0.376
AlaArg: 2.933 ± 0.51
AlaSer: 2.683 ± 0.403
AlaThr: 2.87 ± 0.435
AlaVal: 3.869 ± 0.564
AlaTrp: 1.248 ± 0.283
AlaTyr: 2.246 ± 0.34
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.25 ± 0.145
CysCys: 0.25 ± 0.143
CysAsp: 0.624 ± 0.176
CysGlu: 0.998 ± 0.319
CysPhe: 0.437 ± 0.161
CysGly: 0.499 ± 0.194
CysHis: 0.187 ± 0.113
CysIle: 0.811 ± 0.254
CysLys: 0.437 ± 0.156
CysLeu: 0.312 ± 0.144
CysMet: 0.0 ± 0.0
CysAsn: 0.624 ± 0.191
CysPro: 0.25 ± 0.126
CysGln: 0.187 ± 0.109
CysArg: 0.624 ± 0.231
CysSer: 0.437 ± 0.169
CysThr: 0.562 ± 0.173
CysVal: 0.562 ± 0.181
CysTrp: 0.187 ± 0.15
CysTyr: 0.437 ± 0.164
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.307 ± 0.492
AspCys: 0.437 ± 0.209
AspAsp: 3.557 ± 0.525
AspGlu: 5.304 ± 0.573
AspPhe: 2.746 ± 0.379
AspGly: 3.931 ± 0.557
AspHis: 1.061 ± 0.272
AspIle: 4.742 ± 0.61
AspLys: 5.429 ± 0.561
AspLeu: 4.43 ± 0.529
AspMet: 1.747 ± 0.244
AspAsn: 2.995 ± 0.441
AspPro: 2.496 ± 0.523
AspGln: 1.685 ± 0.32
AspArg: 2.246 ± 0.399
AspSer: 4.306 ± 0.483
AspThr: 2.683 ± 0.38
AspVal: 3.619 ± 0.5
AspTrp: 0.874 ± 0.184
AspTyr: 2.995 ± 0.436
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.243 ± 0.477
GluCys: 0.936 ± 0.277
GluAsp: 3.931 ± 0.472
GluGlu: 7.238 ± 0.892
GluPhe: 3.806 ± 0.527
GluGly: 4.43 ± 0.469
GluHis: 1.685 ± 0.335
GluIle: 6.677 ± 0.605
GluLys: 7.737 ± 0.622
GluLeu: 6.801 ± 0.795
GluMet: 2.496 ± 0.405
GluAsn: 2.621 ± 0.38
GluPro: 1.685 ± 0.332
GluGln: 4.43 ± 0.585
GluArg: 5.179 ± 0.558
GluSer: 4.118 ± 0.561
GluThr: 4.68 ± 0.536
GluVal: 5.928 ± 0.542
GluTrp: 1.248 ± 0.273
GluTyr: 2.434 ± 0.47
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.309 ± 0.373
PheCys: 0.25 ± 0.135
PheAsp: 3.432 ± 0.376
PheGlu: 3.494 ± 0.482
PhePhe: 1.622 ± 0.346
PheGly: 2.621 ± 0.37
PheHis: 0.562 ± 0.21
PheIle: 3.12 ± 0.45
PheLys: 3.307 ± 0.479
PheLeu: 4.181 ± 0.455
PheMet: 0.811 ± 0.204
PheAsn: 1.622 ± 0.429
PhePro: 0.811 ± 0.23
PheGln: 1.747 ± 0.26
PheArg: 1.56 ± 0.346
PheSer: 2.496 ± 0.369
PheThr: 2.059 ± 0.312
PheVal: 2.808 ± 0.443
PheTrp: 0.624 ± 0.19
PheTyr: 1.435 ± 0.33
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.994 ± 0.664
GlyCys: 0.312 ± 0.168
GlyAsp: 3.931 ± 0.596
GlyGlu: 4.118 ± 0.473
GlyPhe: 2.371 ± 0.429
GlyGly: 3.12 ± 0.479
GlyHis: 0.749 ± 0.227
GlyIle: 4.306 ± 0.502
GlyLys: 5.741 ± 0.579
GlyLeu: 4.243 ± 0.572
GlyMet: 1.747 ± 0.328
GlyAsn: 3.806 ± 0.551
GlyPro: 0.874 ± 0.218
GlyGln: 1.934 ± 0.387
GlyArg: 3.37 ± 0.57
GlySer: 3.37 ± 0.52
GlyThr: 3.619 ± 0.493
GlyVal: 5.117 ± 0.569
GlyTrp: 0.874 ± 0.272
GlyTyr: 2.808 ± 0.402
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.811 ± 0.224
HisCys: 0.062 ± 0.062
HisAsp: 0.998 ± 0.221
HisGlu: 1.186 ± 0.279
HisPhe: 0.811 ± 0.224
HisGly: 0.811 ± 0.192
HisHis: 0.312 ± 0.161
HisIle: 1.31 ± 0.247
HisLys: 0.998 ± 0.251
HisLeu: 0.998 ± 0.291
HisMet: 0.062 ± 0.054
HisAsn: 0.998 ± 0.227
HisPro: 0.811 ± 0.182
HisGln: 0.499 ± 0.159
HisArg: 0.562 ± 0.191
HisSer: 0.686 ± 0.22
HisThr: 1.186 ± 0.263
HisVal: 1.248 ± 0.279
HisTrp: 0.312 ± 0.133
HisTyr: 1.186 ± 0.306
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.491 ± 0.602
IleCys: 0.562 ± 0.16
IleAsp: 4.306 ± 0.482
IleGlu: 6.614 ± 0.635
IlePhe: 2.184 ± 0.341
IleGly: 3.994 ± 0.525
IleHis: 1.498 ± 0.273
IleIle: 3.931 ± 0.469
IleLys: 6.614 ± 0.566
IleLeu: 5.678 ± 0.59
IleMet: 1.31 ± 0.317
IleAsn: 4.181 ± 0.507
IlePro: 2.808 ± 0.406
IleGln: 2.621 ± 0.402
IleArg: 3.744 ± 0.479
IleSer: 5.366 ± 0.562
IleThr: 4.368 ± 0.522
IleVal: 5.366 ± 0.63
IleTrp: 0.499 ± 0.174
IleTyr: 3.432 ± 0.601
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.304 ± 0.599
LysCys: 0.874 ± 0.21
LysAsp: 4.118 ± 0.523
LysGlu: 7.613 ± 0.63
LysPhe: 2.558 ± 0.391
LysGly: 5.741 ± 0.608
LysHis: 1.56 ± 0.284
LysIle: 6.115 ± 0.523
LysLys: 8.174 ± 0.878
LysLeu: 6.739 ± 0.6
LysMet: 3.058 ± 0.418
LysAsn: 4.68 ± 0.705
LysPro: 2.558 ± 0.333
LysGln: 4.181 ± 0.536
LysArg: 4.617 ± 0.513
LysSer: 4.368 ± 0.628
LysThr: 4.68 ± 0.572
LysVal: 5.117 ± 0.644
LysTrp: 0.936 ± 0.207
LysTyr: 2.995 ± 0.428
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.427 ± 0.636
LeuCys: 0.874 ± 0.267
LeuAsp: 5.803 ± 0.524
LeuGlu: 6.926 ± 0.678
LeuPhe: 3.869 ± 0.475
LeuGly: 4.43 ± 0.45
LeuHis: 1.061 ± 0.251
LeuIle: 5.865 ± 0.579
LeuLys: 6.115 ± 0.766
LeuLeu: 6.926 ± 0.578
LeuMet: 2.184 ± 0.374
LeuAsn: 4.617 ± 0.432
LeuPro: 2.309 ± 0.274
LeuGln: 3.307 ± 0.456
LeuArg: 3.37 ± 0.47
LeuSer: 5.803 ± 0.551
LeuThr: 4.368 ± 0.522
LeuVal: 4.243 ± 0.49
LeuTrp: 0.811 ± 0.19
LeuTyr: 2.87 ± 0.427
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.81 ± 0.331
MetCys: 0.187 ± 0.1
MetAsp: 1.31 ± 0.255
MetGlu: 1.872 ± 0.361
MetPhe: 1.186 ± 0.339
MetGly: 1.373 ± 0.257
MetHis: 0.187 ± 0.118
MetIle: 1.56 ± 0.33
MetLys: 2.808 ± 0.478
MetLeu: 1.498 ± 0.292
MetMet: 1.248 ± 0.263
MetAsn: 1.498 ± 0.26
MetPro: 1.061 ± 0.327
MetGln: 1.123 ± 0.274
MetArg: 1.622 ± 0.307
MetSer: 2.184 ± 0.326
MetThr: 1.81 ± 0.341
MetVal: 1.435 ± 0.289
MetTrp: 0.062 ± 0.071
MetTyr: 0.998 ± 0.241
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.309 ± 0.631
AsnCys: 0.374 ± 0.143
AsnAsp: 2.184 ± 0.422
AsnGlu: 4.056 ± 0.514
AsnPhe: 1.81 ± 0.281
AsnGly: 3.744 ± 0.542
AsnHis: 0.998 ± 0.229
AsnIle: 3.619 ± 0.672
AsnLys: 5.054 ± 0.566
AsnLeu: 4.43 ± 0.499
AsnMet: 1.123 ± 0.28
AsnAsn: 2.558 ± 0.563
AsnPro: 2.496 ± 0.739
AsnGln: 1.81 ± 0.33
AsnArg: 2.558 ± 0.388
AsnSer: 2.683 ± 0.465
AsnThr: 2.87 ± 0.464
AsnVal: 2.621 ± 0.443
AsnTrp: 0.749 ± 0.211
AsnTyr: 2.309 ± 0.485
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.685 ± 0.3
ProCys: 0.187 ± 0.099
ProAsp: 2.683 ± 0.441
ProGlu: 2.059 ± 0.41
ProPhe: 1.747 ± 0.347
ProGly: 1.186 ± 0.314
ProHis: 0.437 ± 0.144
ProIle: 2.496 ± 0.321
ProLys: 1.934 ± 0.511
ProLeu: 2.059 ± 0.318
ProMet: 0.374 ± 0.168
ProAsn: 2.496 ± 0.767
ProPro: 0.811 ± 0.257
ProGln: 0.998 ± 0.254
ProArg: 1.373 ± 0.28
ProSer: 2.371 ± 0.368
ProThr: 2.059 ± 0.362
ProVal: 1.56 ± 0.272
ProTrp: 0.437 ± 0.165
ProTyr: 1.061 ± 0.231
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.808 ± 0.434
GlnCys: 0.187 ± 0.116
GlnAsp: 1.872 ± 0.384
GlnGlu: 2.87 ± 0.38
GlnPhe: 1.123 ± 0.218
GlnGly: 2.246 ± 0.381
GlnHis: 0.749 ± 0.233
GlnIle: 3.869 ± 0.496
GlnLys: 2.933 ± 0.435
GlnLeu: 4.929 ± 0.43
GlnMet: 0.874 ± 0.214
GlnAsn: 2.184 ± 0.469
GlnPro: 0.998 ± 0.218
GlnGln: 1.622 ± 0.436
GlnArg: 1.747 ± 0.309
GlnSer: 1.872 ± 0.325
GlnThr: 2.496 ± 0.389
GlnVal: 2.434 ± 0.354
GlnTrp: 0.374 ± 0.187
GlnTyr: 1.31 ± 0.325
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.309 ± 0.362
ArgCys: 0.624 ± 0.241
ArgAsp: 1.872 ± 0.331
ArgGlu: 4.555 ± 0.612
ArgPhe: 2.059 ± 0.316
ArgGly: 3.12 ± 0.457
ArgHis: 0.936 ± 0.28
ArgIle: 3.994 ± 0.454
ArgLys: 4.929 ± 0.722
ArgLeu: 3.806 ± 0.674
ArgMet: 1.872 ± 0.343
ArgAsn: 1.56 ± 0.262
ArgPro: 1.123 ± 0.279
ArgGln: 1.934 ± 0.353
ArgArg: 1.81 ± 0.407
ArgSer: 2.309 ± 0.402
ArgThr: 2.995 ± 0.491
ArgVal: 3.058 ± 0.446
ArgTrp: 0.562 ± 0.164
ArgTyr: 2.122 ± 0.349
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.496 ± 0.468
SerCys: 0.187 ± 0.101
SerAsp: 4.43 ± 0.463
SerGlu: 4.306 ± 0.519
SerPhe: 3.744 ± 0.662
SerGly: 3.557 ± 0.388
SerHis: 0.874 ± 0.233
SerIle: 4.742 ± 0.557
SerLys: 5.179 ± 0.634
SerLeu: 5.553 ± 0.481
SerMet: 1.498 ± 0.302
SerAsn: 2.558 ± 0.378
SerPro: 1.747 ± 0.409
SerGln: 2.184 ± 0.345
SerArg: 2.122 ± 0.325
SerSer: 4.056 ± 0.637
SerThr: 3.806 ± 0.476
SerVal: 3.557 ± 0.436
SerTrp: 0.749 ± 0.22
SerTyr: 2.246 ± 0.426
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.619 ± 0.477
ThrCys: 0.437 ± 0.159
ThrAsp: 3.307 ± 0.397
ThrGlu: 4.306 ± 0.495
ThrPhe: 1.997 ± 0.363
ThrGly: 4.306 ± 0.434
ThrHis: 0.998 ± 0.193
ThrIle: 4.056 ± 0.558
ThrLys: 4.742 ± 0.592
ThrLeu: 5.741 ± 0.56
ThrMet: 1.31 ± 0.278
ThrAsn: 2.746 ± 0.426
ThrPro: 2.122 ± 0.33
ThrGln: 1.997 ± 0.345
ThrArg: 1.872 ± 0.358
ThrSer: 3.432 ± 0.654
ThrThr: 2.995 ± 0.319
ThrVal: 3.806 ± 0.548
ThrTrp: 0.811 ± 0.215
ThrTyr: 2.933 ± 0.388
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.805 ± 0.556
ValCys: 0.437 ± 0.152
ValAsp: 4.555 ± 0.501
ValGlu: 4.493 ± 0.586
ValPhe: 1.747 ± 0.317
ValGly: 4.243 ± 0.487
ValHis: 0.562 ± 0.185
ValIle: 5.179 ± 0.575
ValLys: 4.056 ± 0.549
ValLeu: 4.118 ± 0.62
ValMet: 1.747 ± 0.308
ValAsn: 3.245 ± 0.485
ValPro: 1.872 ± 0.294
ValGln: 2.746 ± 0.37
ValArg: 3.37 ± 0.471
ValSer: 4.181 ± 0.392
ValThr: 4.306 ± 0.496
ValVal: 4.43 ± 0.697
ValTrp: 0.624 ± 0.193
ValTyr: 3.619 ± 0.588
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.374 ± 0.157
TrpCys: 0.125 ± 0.094
TrpAsp: 0.624 ± 0.203
TrpGlu: 1.186 ± 0.218
TrpPhe: 0.437 ± 0.165
TrpGly: 0.874 ± 0.25
TrpHis: 0.187 ± 0.108
TrpIle: 0.686 ± 0.204
TrpLys: 1.622 ± 0.293
TrpLeu: 1.498 ± 0.296
TrpMet: 0.25 ± 0.113
TrpAsn: 0.25 ± 0.094
TrpPro: 0.25 ± 0.14
TrpGln: 0.437 ± 0.16
TrpArg: 0.686 ± 0.227
TrpSer: 0.562 ± 0.206
TrpThr: 1.123 ± 0.255
TrpVal: 0.936 ± 0.223
TrpTrp: 0.312 ± 0.192
TrpTyr: 0.25 ± 0.121
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.309 ± 0.383
TyrCys: 0.562 ± 0.192
TyrAsp: 3.245 ± 0.41
TyrGlu: 3.931 ± 0.528
TyrPhe: 1.747 ± 0.285
TyrGly: 3.058 ± 0.507
TyrHis: 0.499 ± 0.18
TyrIle: 2.808 ± 0.383
TyrLys: 2.558 ± 0.399
TyrLeu: 3.494 ± 0.446
TyrMet: 0.998 ± 0.213
TyrAsn: 2.122 ± 0.354
TyrPro: 1.186 ± 0.26
TyrGln: 1.622 ± 0.316
TyrArg: 2.246 ± 0.428
TyrSer: 2.309 ± 0.599
TyrThr: 1.997 ± 0.397
TyrVal: 2.621 ± 0.477
TyrTrp: 0.374 ± 0.152
TyrTyr: 1.56 ± 0.312
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 103 proteins (16027 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
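A protein-level bootstrap of this kind might look like the sketch below. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resampled frequencies; both points are assumptions, as are the function names and the toy sequences, and the database's own implementation may differ.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKKLE", "AKK", "GGKKA", "MLEKA"]
print(bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.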

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski