Amino acid dipepetide frequency for Corynebacterium phage Bran

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
21.305AlaAla: 21.305 ± 2.302
0.423AlaCys: 0.423 ± 0.17
8.113AlaAsp: 8.113 ± 0.774
9.171AlaGlu: 9.171 ± 0.8
2.822AlaPhe: 2.822 ± 0.627
10.3AlaGly: 10.3 ± 1.25
2.116AlaHis: 2.116 ± 0.39
5.291AlaIle: 5.291 ± 0.894
4.233AlaLys: 4.233 ± 0.624
10.018AlaLeu: 10.018 ± 0.952
4.162AlaMet: 4.162 ± 0.625
2.469AlaAsn: 2.469 ± 0.702
6.984AlaPro: 6.984 ± 0.628
5.15AlaGln: 5.15 ± 0.711
7.69AlaArg: 7.69 ± 0.963
5.432AlaSer: 5.432 ± 0.65
7.196AlaThr: 7.196 ± 0.691
8.325AlaVal: 8.325 ± 0.966
2.963AlaTrp: 2.963 ± 0.492
2.399AlaTyr: 2.399 ± 0.44
0.0AlaXaa: 0.0 ± 0.0
Cys
0.564CysAla: 0.564 ± 0.226
0.141CysCys: 0.141 ± 0.113
0.282CysAsp: 0.282 ± 0.174
0.705CysGlu: 0.705 ± 0.227
0.282CysPhe: 0.282 ± 0.13
0.776CysGly: 0.776 ± 0.26
0.071CysHis: 0.071 ± 0.083
0.212CysIle: 0.212 ± 0.118
0.212CysLys: 0.212 ± 0.104
0.705CysLeu: 0.705 ± 0.224
0.0CysMet: 0.0 ± 0.0
0.141CysAsn: 0.141 ± 0.109
0.282CysPro: 0.282 ± 0.159
0.282CysGln: 0.282 ± 0.139
0.635CysArg: 0.635 ± 0.247
0.494CysSer: 0.494 ± 0.165
0.494CysThr: 0.494 ± 0.21
0.282CysVal: 0.282 ± 0.175
0.353CysTrp: 0.353 ± 0.179
0.141CysTyr: 0.141 ± 0.113
0.0CysXaa: 0.0 ± 0.0
Asp
9.453AspAla: 9.453 ± 0.674
0.353AspCys: 0.353 ± 0.164
5.926AspAsp: 5.926 ± 0.863
4.092AspGlu: 4.092 ± 0.425
1.905AspPhe: 1.905 ± 0.343
7.125AspGly: 7.125 ± 0.739
1.481AspHis: 1.481 ± 0.339
1.693AspIle: 1.693 ± 0.319
1.058AspLys: 1.058 ± 0.302
6.067AspLeu: 6.067 ± 0.724
1.058AspMet: 1.058 ± 0.256
1.34AspAsn: 1.34 ± 0.281
4.092AspPro: 4.092 ± 0.809
2.681AspGln: 2.681 ± 0.404
3.81AspArg: 3.81 ± 0.472
2.61AspSer: 2.61 ± 0.393
3.598AspThr: 3.598 ± 0.634
5.009AspVal: 5.009 ± 0.61
1.552AspTrp: 1.552 ± 0.342
1.34AspTyr: 1.34 ± 0.359
0.0AspXaa: 0.0 ± 0.0
Glu
6.067GluAla: 6.067 ± 0.675
0.282GluCys: 0.282 ± 0.148
3.034GluAsp: 3.034 ± 0.417
4.444GluGlu: 4.444 ± 0.72
1.481GluPhe: 1.481 ± 0.259
3.88GluGly: 3.88 ± 0.428
2.54GluHis: 2.54 ± 0.415
3.951GluIle: 3.951 ± 0.496
2.328GluLys: 2.328 ± 0.453
5.22GluLeu: 5.22 ± 0.672
1.764GluMet: 1.764 ± 0.382
1.481GluAsn: 1.481 ± 0.294
4.374GluPro: 4.374 ± 0.669
2.963GluGln: 2.963 ± 0.448
4.444GluArg: 4.444 ± 0.704
3.245GluSer: 3.245 ± 0.459
3.316GluThr: 3.316 ± 0.44
5.573GluVal: 5.573 ± 0.523
1.552GluTrp: 1.552 ± 0.349
1.975GluTyr: 1.975 ± 0.382
0.0GluXaa: 0.0 ± 0.0
Phe
2.116PheAla: 2.116 ± 0.371
0.141PheCys: 0.141 ± 0.152
2.399PheAsp: 2.399 ± 0.363
1.34PheGlu: 1.34 ± 0.313
0.988PhePhe: 0.988 ± 0.394
2.751PheGly: 2.751 ± 0.431
0.423PheHis: 0.423 ± 0.164
0.917PheIle: 0.917 ± 0.229
0.917PheLys: 0.917 ± 0.25
1.905PheLeu: 1.905 ± 0.366
0.564PheMet: 0.564 ± 0.169
0.494PheAsn: 0.494 ± 0.172
0.847PhePro: 0.847 ± 0.301
1.129PheGln: 1.129 ± 0.244
1.975PheArg: 1.975 ± 0.405
1.34PheSer: 1.34 ± 0.427
1.411PheThr: 1.411 ± 0.326
1.199PheVal: 1.199 ± 0.273
0.635PheTrp: 0.635 ± 0.19
0.705PheTyr: 0.705 ± 0.235
0.0PheXaa: 0.0 ± 0.0
Gly
7.76GlyAla: 7.76 ± 1.323
0.494GlyCys: 0.494 ± 0.237
5.785GlyAsp: 5.785 ± 0.555
5.079GlyGlu: 5.079 ± 0.519
2.328GlyPhe: 2.328 ± 0.43
8.113GlyGly: 8.113 ± 0.873
2.257GlyHis: 2.257 ± 0.413
4.586GlyIle: 4.586 ± 0.734
4.092GlyLys: 4.092 ± 0.677
6.349GlyLeu: 6.349 ± 0.739
3.245GlyMet: 3.245 ± 0.556
2.469GlyAsn: 2.469 ± 0.428
4.444GlyPro: 4.444 ± 0.598
2.399GlyGln: 2.399 ± 0.343
6.279GlyArg: 6.279 ± 0.653
4.444GlySer: 4.444 ± 0.709
5.926GlyThr: 5.926 ± 0.644
6.349GlyVal: 6.349 ± 0.629
2.399GlyTrp: 2.399 ± 0.35
2.54GlyTyr: 2.54 ± 0.494
0.0GlyXaa: 0.0 ± 0.0
His
2.399HisAla: 2.399 ± 0.474
0.141HisCys: 0.141 ± 0.099
1.693HisAsp: 1.693 ± 0.389
1.481HisGlu: 1.481 ± 0.326
0.494HisPhe: 0.494 ± 0.186
2.328HisGly: 2.328 ± 0.449
0.988HisHis: 0.988 ± 0.306
1.693HisIle: 1.693 ± 0.372
0.353HisLys: 0.353 ± 0.154
2.257HisLeu: 2.257 ± 0.446
0.282HisMet: 0.282 ± 0.15
0.212HisAsn: 0.212 ± 0.116
1.411HisPro: 1.411 ± 0.364
0.988HisGln: 0.988 ± 0.289
1.481HisArg: 1.481 ± 0.321
0.917HisSer: 0.917 ± 0.267
0.776HisThr: 0.776 ± 0.22
1.975HisVal: 1.975 ± 0.358
0.494HisTrp: 0.494 ± 0.17
0.635HisTyr: 0.635 ± 0.163
0.0HisXaa: 0.0 ± 0.0
Ile
5.926IleAla: 5.926 ± 0.644
0.282IleCys: 0.282 ± 0.154
3.527IleAsp: 3.527 ± 0.493
3.104IleGlu: 3.104 ± 0.548
0.776IlePhe: 0.776 ± 0.212
4.374IleGly: 4.374 ± 0.854
0.917IleHis: 0.917 ± 0.209
2.328IleIle: 2.328 ± 0.352
1.27IleLys: 1.27 ± 0.304
2.54IleLeu: 2.54 ± 0.367
0.635IleMet: 0.635 ± 0.184
1.27IleAsn: 1.27 ± 0.285
3.104IlePro: 3.104 ± 0.465
1.34IleGln: 1.34 ± 0.276
3.245IleArg: 3.245 ± 0.545
2.257IleSer: 2.257 ± 0.511
4.586IleThr: 4.586 ± 0.723
3.527IleVal: 3.527 ± 0.483
0.776IleTrp: 0.776 ± 0.257
0.847IleTyr: 0.847 ± 0.264
0.0IleXaa: 0.0 ± 0.0
Lys
5.079LysAla: 5.079 ± 0.944
0.141LysCys: 0.141 ± 0.111
1.34LysAsp: 1.34 ± 0.405
1.834LysGlu: 1.834 ± 0.296
0.564LysPhe: 0.564 ± 0.215
2.61LysGly: 2.61 ± 0.455
0.847LysHis: 0.847 ± 0.298
1.27LysIle: 1.27 ± 0.336
0.847LysLys: 0.847 ± 0.29
2.54LysLeu: 2.54 ± 0.441
0.776LysMet: 0.776 ± 0.259
0.635LysAsn: 0.635 ± 0.186
1.623LysPro: 1.623 ± 0.298
1.764LysGln: 1.764 ± 0.424
2.54LysArg: 2.54 ± 0.449
2.046LysSer: 2.046 ± 0.469
2.257LysThr: 2.257 ± 0.429
1.764LysVal: 1.764 ± 0.26
0.988LysTrp: 0.988 ± 0.244
0.635LysTyr: 0.635 ± 0.172
0.0LysXaa: 0.0 ± 0.0
Leu
10.511LeuAla: 10.511 ± 0.905
1.199LeuCys: 1.199 ± 0.371
6.349LeuAsp: 6.349 ± 0.869
3.668LeuGlu: 3.668 ± 0.568
1.411LeuPhe: 1.411 ± 0.307
7.337LeuGly: 7.337 ± 0.783
1.975LeuHis: 1.975 ± 0.423
3.457LeuIle: 3.457 ± 0.598
2.469LeuLys: 2.469 ± 0.435
5.573LeuLeu: 5.573 ± 0.649
1.764LeuMet: 1.764 ± 0.318
1.975LeuAsn: 1.975 ± 0.36
5.009LeuPro: 5.009 ± 0.678
1.975LeuGln: 1.975 ± 0.338
6.42LeuArg: 6.42 ± 0.759
3.951LeuSer: 3.951 ± 0.66
4.233LeuThr: 4.233 ± 0.587
4.374LeuVal: 4.374 ± 0.419
1.834LeuTrp: 1.834 ± 0.338
1.905LeuTyr: 1.905 ± 0.505
0.0LeuXaa: 0.0 ± 0.0
Met
2.751MetAla: 2.751 ± 0.409
0.212MetCys: 0.212 ± 0.134
1.411MetAsp: 1.411 ± 0.244
1.552MetGlu: 1.552 ± 0.377
0.282MetPhe: 0.282 ± 0.152
2.822MetGly: 2.822 ± 0.55
0.705MetHis: 0.705 ± 0.301
1.129MetIle: 1.129 ± 0.227
1.34MetLys: 1.34 ± 0.296
1.552MetLeu: 1.552 ± 0.429
0.282MetMet: 0.282 ± 0.133
0.776MetAsn: 0.776 ± 0.212
2.046MetPro: 2.046 ± 0.323
0.564MetGln: 0.564 ± 0.226
1.411MetArg: 1.411 ± 0.365
1.693MetSer: 1.693 ± 0.329
3.386MetThr: 3.386 ± 0.585
1.199MetVal: 1.199 ± 0.286
0.705MetTrp: 0.705 ± 0.286
0.282MetTyr: 0.282 ± 0.143
0.0MetXaa: 0.0 ± 0.0
Asn
2.892AsnAla: 2.892 ± 0.488
0.071AsnCys: 0.071 ± 0.087
1.481AsnAsp: 1.481 ± 0.333
1.058AsnGlu: 1.058 ± 0.215
0.564AsnPhe: 0.564 ± 0.213
2.469AsnGly: 2.469 ± 0.584
0.705AsnHis: 0.705 ± 0.338
0.705AsnIle: 0.705 ± 0.2
0.423AsnLys: 0.423 ± 0.167
2.399AsnLeu: 2.399 ± 0.441
0.635AsnMet: 0.635 ± 0.215
0.564AsnAsn: 0.564 ± 0.243
3.175AsnPro: 3.175 ± 0.82
0.917AsnGln: 0.917 ± 0.245
1.975AsnArg: 1.975 ± 0.363
1.058AsnSer: 1.058 ± 0.25
1.623AsnThr: 1.623 ± 0.272
1.693AsnVal: 1.693 ± 0.354
0.0AsnTrp: 0.0 ± 0.0
0.635AsnTyr: 0.635 ± 0.25
0.0AsnXaa: 0.0 ± 0.0
Pro
6.279ProAla: 6.279 ± 0.663
0.353ProCys: 0.353 ± 0.199
5.291ProAsp: 5.291 ± 0.673
3.668ProGlu: 3.668 ± 0.521
1.905ProPhe: 1.905 ± 0.312
5.362ProGly: 5.362 ± 0.677
1.27ProHis: 1.27 ± 0.369
1.905ProIle: 1.905 ± 0.273
1.975ProLys: 1.975 ± 0.502
3.386ProLeu: 3.386 ± 0.48
1.905ProMet: 1.905 ± 0.468
1.905ProAsn: 1.905 ± 0.531
3.386ProPro: 3.386 ± 0.583
1.481ProGln: 1.481 ± 0.352
4.656ProArg: 4.656 ± 0.679
3.034ProSer: 3.034 ± 1.237
3.739ProThr: 3.739 ± 0.601
4.162ProVal: 4.162 ± 0.445
1.27ProTrp: 1.27 ± 0.376
1.552ProTyr: 1.552 ± 0.387
0.0ProXaa: 0.0 ± 0.0
Gln
4.938GlnAla: 4.938 ± 0.777
0.282GlnCys: 0.282 ± 0.137
0.988GlnAsp: 0.988 ± 0.315
1.905GlnGlu: 1.905 ± 0.366
0.635GlnPhe: 0.635 ± 0.205
2.257GlnGly: 2.257 ± 0.435
0.705GlnHis: 0.705 ± 0.229
1.834GlnIle: 1.834 ± 0.394
1.129GlnLys: 1.129 ± 0.226
3.527GlnLeu: 3.527 ± 0.416
1.058GlnMet: 1.058 ± 0.245
0.776GlnAsn: 0.776 ± 0.31
1.481GlnPro: 1.481 ± 0.284
1.481GlnGln: 1.481 ± 0.342
2.751GlnArg: 2.751 ± 0.404
1.34GlnSer: 1.34 ± 0.308
1.34GlnThr: 1.34 ± 0.252
2.963GlnVal: 2.963 ± 0.408
1.058GlnTrp: 1.058 ± 0.248
0.705GlnTyr: 0.705 ± 0.212
0.0GlnXaa: 0.0 ± 0.0
Arg
8.183ArgAla: 8.183 ± 0.967
1.058ArgCys: 1.058 ± 0.34
5.785ArgAsp: 5.785 ± 0.826
5.362ArgGlu: 5.362 ± 0.704
2.469ArgPhe: 2.469 ± 0.367
5.432ArgGly: 5.432 ± 0.495
1.834ArgHis: 1.834 ± 0.426
4.162ArgIle: 4.162 ± 0.524
2.54ArgLys: 2.54 ± 0.384
5.22ArgLeu: 5.22 ± 0.79
2.54ArgMet: 2.54 ± 0.381
1.481ArgAsn: 1.481 ± 0.396
3.386ArgPro: 3.386 ± 0.675
1.905ArgGln: 1.905 ± 0.381
5.996ArgArg: 5.996 ± 0.813
3.598ArgSer: 3.598 ± 0.499
3.598ArgThr: 3.598 ± 0.389
4.656ArgVal: 4.656 ± 0.552
2.187ArgTrp: 2.187 ± 0.374
1.27ArgTyr: 1.27 ± 0.294
0.0ArgXaa: 0.0 ± 0.0
Ser
5.079SerAla: 5.079 ± 0.737
0.282SerCys: 0.282 ± 0.168
2.751SerAsp: 2.751 ± 0.475
3.034SerGlu: 3.034 ± 0.521
1.623SerPhe: 1.623 ± 0.355
4.727SerGly: 4.727 ± 0.948
0.776SerHis: 0.776 ± 0.288
2.116SerIle: 2.116 ± 0.335
1.34SerLys: 1.34 ± 0.285
3.739SerLeu: 3.739 ± 0.669
1.199SerMet: 1.199 ± 0.202
1.411SerAsn: 1.411 ± 0.36
2.892SerPro: 2.892 ± 0.466
1.552SerGln: 1.552 ± 0.269
3.527SerArg: 3.527 ± 0.476
2.61SerSer: 2.61 ± 0.458
4.303SerThr: 4.303 ± 0.65
4.233SerVal: 4.233 ± 0.665
1.27SerTrp: 1.27 ± 0.25
1.058SerTyr: 1.058 ± 0.239
0.0SerXaa: 0.0 ± 0.0
Thr
8.607ThrAla: 8.607 ± 0.804
0.282ThrCys: 0.282 ± 0.165
3.316ThrAsp: 3.316 ± 0.472
4.021ThrGlu: 4.021 ± 0.553
1.129ThrPhe: 1.129 ± 0.267
5.079ThrGly: 5.079 ± 0.467
1.34ThrHis: 1.34 ± 0.302
3.739ThrIle: 3.739 ± 0.788
2.257ThrLys: 2.257 ± 0.319
5.432ThrLeu: 5.432 ± 0.652
1.693ThrMet: 1.693 ± 0.339
1.623ThrAsn: 1.623 ± 0.325
5.079ThrPro: 5.079 ± 0.892
1.481ThrGln: 1.481 ± 0.317
4.444ThrArg: 4.444 ± 0.716
3.245ThrSer: 3.245 ± 0.708
4.868ThrThr: 4.868 ± 0.719
4.938ThrVal: 4.938 ± 0.847
0.988ThrTrp: 0.988 ± 0.261
1.481ThrTyr: 1.481 ± 0.307
0.0ThrXaa: 0.0 ± 0.0
Val
10.582ValAla: 10.582 ± 1.195
0.282ValCys: 0.282 ± 0.141
4.374ValAsp: 4.374 ± 0.624
4.727ValGlu: 4.727 ± 0.507
1.481ValPhe: 1.481 ± 0.391
6.067ValGly: 6.067 ± 0.816
1.27ValHis: 1.27 ± 0.328
3.104ValIle: 3.104 ± 0.416
2.257ValLys: 2.257 ± 0.412
5.714ValLeu: 5.714 ± 0.915
1.34ValMet: 1.34 ± 0.322
2.328ValAsn: 2.328 ± 0.426
2.963ValPro: 2.963 ± 0.417
1.905ValGln: 1.905 ± 0.354
5.291ValArg: 5.291 ± 0.707
4.092ValSer: 4.092 ± 0.489
5.009ValThr: 5.009 ± 0.727
5.009ValVal: 5.009 ± 0.555
1.552ValTrp: 1.552 ± 0.264
0.988ValTyr: 0.988 ± 0.236
0.0ValXaa: 0.0 ± 0.0
Trp
2.822TrpAla: 2.822 ± 0.509
0.635TrpCys: 0.635 ± 0.22
1.199TrpAsp: 1.199 ± 0.277
2.187TrpGlu: 2.187 ± 0.421
0.494TrpPhe: 0.494 ± 0.209
1.764TrpGly: 1.764 ± 0.328
0.282TrpHis: 0.282 ± 0.156
1.27TrpIle: 1.27 ± 0.272
0.353TrpLys: 0.353 ± 0.135
1.975TrpLeu: 1.975 ± 0.421
0.635TrpMet: 0.635 ± 0.231
0.988TrpAsn: 0.988 ± 0.279
0.776TrpPro: 0.776 ± 0.231
0.776TrpGln: 0.776 ± 0.178
2.328TrpArg: 2.328 ± 0.381
1.129TrpSer: 1.129 ± 0.268
1.411TrpThr: 1.411 ± 0.409
1.552TrpVal: 1.552 ± 0.387
0.494TrpTrp: 0.494 ± 0.148
0.212TrpTyr: 0.212 ± 0.175
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.245TyrAla: 3.245 ± 0.559
0.0TyrCys: 0.0 ± 0.0
1.199TyrAsp: 1.199 ± 0.277
1.481TyrGlu: 1.481 ± 0.344
0.564TyrPhe: 0.564 ± 0.224
1.764TyrGly: 1.764 ± 0.358
0.494TyrHis: 0.494 ± 0.183
1.27TyrIle: 1.27 ± 0.28
0.705TyrLys: 0.705 ± 0.26
1.27TyrLeu: 1.27 ± 0.338
0.494TyrMet: 0.494 ± 0.231
0.705TyrAsn: 0.705 ± 0.203
1.129TyrPro: 1.129 ± 0.291
0.423TyrGln: 0.423 ± 0.169
1.834TyrArg: 1.834 ± 0.373
0.988TyrSer: 0.988 ± 0.279
1.975TyrThr: 1.975 ± 0.358
1.552TyrVal: 1.552 ± 0.28
0.212TyrTrp: 0.212 ± 0.115
0.635TyrTyr: 0.635 ± 0.268
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (14176 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski