Amino acid dipepetide frequency for Gordonia phage BritBrat

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.699AlaAla: 14.699 ± 1.7
1.014AlaCys: 1.014 ± 0.272
7.378AlaAsp: 7.378 ± 0.608
7.997AlaGlu: 7.997 ± 0.76
3.267AlaPhe: 3.267 ± 0.681
9.18AlaGly: 9.18 ± 0.922
2.253AlaHis: 2.253 ± 0.414
4.731AlaIle: 4.731 ± 0.617
4.28AlaLys: 4.28 ± 0.429
8.842AlaLeu: 8.842 ± 0.951
3.773AlaMet: 3.773 ± 0.643
3.999AlaAsn: 3.999 ± 0.631
4.506AlaPro: 4.506 ± 0.528
3.999AlaGln: 3.999 ± 0.779
7.716AlaArg: 7.716 ± 0.802
6.871AlaSer: 6.871 ± 0.759
7.265AlaThr: 7.265 ± 0.623
7.885AlaVal: 7.885 ± 0.606
2.309AlaTrp: 2.309 ± 0.311
2.14AlaTyr: 2.14 ± 0.285
0.0AlaXaa: 0.0 ± 0.0
Cys
1.014CysAla: 1.014 ± 0.308
0.338CysCys: 0.338 ± 0.155
0.901CysAsp: 0.901 ± 0.232
0.845CysGlu: 0.845 ± 0.262
0.338CysPhe: 0.338 ± 0.167
2.084CysGly: 2.084 ± 0.506
0.563CysHis: 0.563 ± 0.222
0.338CysIle: 0.338 ± 0.15
0.394CysLys: 0.394 ± 0.173
0.732CysLeu: 0.732 ± 0.226
0.225CysMet: 0.225 ± 0.126
0.394CysAsn: 0.394 ± 0.169
0.62CysPro: 0.62 ± 0.203
0.676CysGln: 0.676 ± 0.229
0.676CysArg: 0.676 ± 0.256
0.507CysSer: 0.507 ± 0.191
0.732CysThr: 0.732 ± 0.232
0.62CysVal: 0.62 ± 0.213
0.225CysTrp: 0.225 ± 0.122
0.451CysTyr: 0.451 ± 0.155
0.0CysXaa: 0.0 ± 0.0
Asp
7.378AspAla: 7.378 ± 0.659
1.183AspCys: 1.183 ± 0.274
4.111AspAsp: 4.111 ± 0.532
3.83AspGlu: 3.83 ± 0.657
1.464AspPhe: 1.464 ± 0.31
6.927AspGly: 6.927 ± 0.657
1.859AspHis: 1.859 ± 0.387
2.703AspIle: 2.703 ± 0.386
1.014AspLys: 1.014 ± 0.213
5.801AspLeu: 5.801 ± 0.574
1.295AspMet: 1.295 ± 0.282
2.14AspAsn: 2.14 ± 0.417
4.843AspPro: 4.843 ± 0.583
2.365AspGln: 2.365 ± 0.503
5.125AspArg: 5.125 ± 0.643
2.76AspSer: 2.76 ± 0.441
3.492AspThr: 3.492 ± 0.452
4.337AspVal: 4.337 ± 0.606
1.126AspTrp: 1.126 ± 0.288
1.464AspTyr: 1.464 ± 0.322
0.0AspXaa: 0.0 ± 0.0
Glu
5.913GluAla: 5.913 ± 0.687
0.62GluCys: 0.62 ± 0.207
2.703GluAsp: 2.703 ± 0.415
2.929GluGlu: 2.929 ± 0.372
2.14GluPhe: 2.14 ± 0.381
3.379GluGly: 3.379 ± 0.421
1.577GluHis: 1.577 ± 0.267
3.717GluIle: 3.717 ± 0.542
2.196GluLys: 2.196 ± 0.319
5.294GluLeu: 5.294 ± 0.567
1.352GluMet: 1.352 ± 0.254
1.408GluAsn: 1.408 ± 0.251
2.422GluPro: 2.422 ± 0.402
2.027GluGln: 2.027 ± 0.306
5.519GluArg: 5.519 ± 0.662
2.929GluSer: 2.929 ± 0.456
2.872GluThr: 2.872 ± 0.373
3.492GluVal: 3.492 ± 0.394
1.464GluTrp: 1.464 ± 0.363
2.027GluTyr: 2.027 ± 0.359
0.0GluXaa: 0.0 ± 0.0
Phe
2.365PheAla: 2.365 ± 0.359
0.451PheCys: 0.451 ± 0.152
2.253PheAsp: 2.253 ± 0.369
1.746PheGlu: 1.746 ± 0.375
0.563PhePhe: 0.563 ± 0.187
3.717PheGly: 3.717 ± 0.515
0.788PheHis: 0.788 ± 0.232
1.183PheIle: 1.183 ± 0.247
0.845PheLys: 0.845 ± 0.219
1.633PheLeu: 1.633 ± 0.336
0.282PheMet: 0.282 ± 0.15
0.563PheAsn: 0.563 ± 0.177
1.577PhePro: 1.577 ± 0.324
0.788PheGln: 0.788 ± 0.182
1.746PheArg: 1.746 ± 0.239
2.084PheSer: 2.084 ± 0.35
1.915PheThr: 1.915 ± 0.313
2.027PheVal: 2.027 ± 0.4
0.507PheTrp: 0.507 ± 0.209
0.62PheTyr: 0.62 ± 0.169
0.0PheXaa: 0.0 ± 0.0
Gly
8.673GlyAla: 8.673 ± 0.897
0.62GlyCys: 0.62 ± 0.206
6.702GlyAsp: 6.702 ± 0.689
4.224GlyGlu: 4.224 ± 0.481
2.816GlyPhe: 2.816 ± 0.369
10.137GlyGly: 10.137 ± 1.411
2.534GlyHis: 2.534 ± 0.369
3.267GlyIle: 3.267 ± 0.466
3.041GlyLys: 3.041 ± 0.457
5.801GlyLeu: 5.801 ± 0.596
1.633GlyMet: 1.633 ± 0.3
2.534GlyAsn: 2.534 ± 0.375
3.548GlyPro: 3.548 ± 0.377
3.323GlyGln: 3.323 ± 0.624
6.871GlyArg: 6.871 ± 0.634
4.731GlySer: 4.731 ± 0.731
6.026GlyThr: 6.026 ± 0.788
6.646GlyVal: 6.646 ± 0.619
2.365GlyTrp: 2.365 ± 0.348
2.084GlyTyr: 2.084 ± 0.391
0.0GlyXaa: 0.0 ± 0.0
His
2.365HisAla: 2.365 ± 0.4
0.394HisCys: 0.394 ± 0.161
1.352HisAsp: 1.352 ± 0.302
1.352HisGlu: 1.352 ± 0.247
0.394HisPhe: 0.394 ± 0.164
2.084HisGly: 2.084 ± 0.47
0.676HisHis: 0.676 ± 0.21
1.239HisIle: 1.239 ± 0.293
0.507HisLys: 0.507 ± 0.152
2.084HisLeu: 2.084 ± 0.338
0.113HisMet: 0.113 ± 0.069
0.845HisAsn: 0.845 ± 0.225
1.295HisPro: 1.295 ± 0.362
0.62HisGln: 0.62 ± 0.243
1.802HisArg: 1.802 ± 0.384
1.352HisSer: 1.352 ± 0.34
1.802HisThr: 1.802 ± 0.357
1.802HisVal: 1.802 ± 0.351
0.563HisTrp: 0.563 ± 0.161
0.788HisTyr: 0.788 ± 0.232
0.0HisXaa: 0.0 ± 0.0
Ile
6.026IleAla: 6.026 ± 0.618
0.451IleCys: 0.451 ± 0.16
2.985IleAsp: 2.985 ± 0.448
3.267IleGlu: 3.267 ± 0.499
0.507IlePhe: 0.507 ± 0.157
3.492IleGly: 3.492 ± 0.41
1.07IleHis: 1.07 ± 0.257
1.464IleIle: 1.464 ± 0.292
1.126IleLys: 1.126 ± 0.358
3.323IleLeu: 3.323 ± 0.406
0.394IleMet: 0.394 ± 0.162
1.408IleAsn: 1.408 ± 0.284
2.647IlePro: 2.647 ± 0.405
1.239IleGln: 1.239 ± 0.256
3.267IleArg: 3.267 ± 0.423
2.309IleSer: 2.309 ± 0.356
3.21IleThr: 3.21 ± 0.415
3.21IleVal: 3.21 ± 0.482
0.732IleTrp: 0.732 ± 0.213
0.957IleTyr: 0.957 ± 0.307
0.0IleXaa: 0.0 ± 0.0
Lys
3.435LysAla: 3.435 ± 0.508
0.225LysCys: 0.225 ± 0.143
1.464LysAsp: 1.464 ± 0.292
1.352LysGlu: 1.352 ± 0.259
0.732LysPhe: 0.732 ± 0.169
2.084LysGly: 2.084 ± 0.403
0.788LysHis: 0.788 ± 0.237
1.07LysIle: 1.07 ± 0.228
1.577LysLys: 1.577 ± 0.344
2.534LysLeu: 2.534 ± 0.434
0.676LysMet: 0.676 ± 0.188
1.126LysAsn: 1.126 ± 0.253
2.14LysPro: 2.14 ± 0.513
0.676LysGln: 0.676 ± 0.243
3.21LysArg: 3.21 ± 0.433
2.76LysSer: 2.76 ± 0.332
2.816LysThr: 2.816 ± 0.466
2.365LysVal: 2.365 ± 0.406
0.507LysTrp: 0.507 ± 0.185
0.957LysTyr: 0.957 ± 0.184
0.0LysXaa: 0.0 ± 0.0
Leu
10.363LeuAla: 10.363 ± 0.884
1.126LeuCys: 1.126 ± 0.335
4.731LeuAsp: 4.731 ± 0.513
4.674LeuGlu: 4.674 ± 0.5
2.084LeuPhe: 2.084 ± 0.299
5.576LeuGly: 5.576 ± 0.904
1.577LeuHis: 1.577 ± 0.39
3.661LeuIle: 3.661 ± 0.53
2.478LeuLys: 2.478 ± 0.463
5.069LeuLeu: 5.069 ± 0.601
1.802LeuMet: 1.802 ± 0.295
2.309LeuAsn: 2.309 ± 0.446
4.787LeuPro: 4.787 ± 0.548
2.703LeuGln: 2.703 ± 0.385
5.238LeuArg: 5.238 ± 0.709
4.731LeuSer: 4.731 ± 0.613
5.632LeuThr: 5.632 ± 0.655
4.506LeuVal: 4.506 ± 0.563
1.69LeuTrp: 1.69 ± 0.353
1.464LeuTyr: 1.464 ± 0.232
0.0LeuXaa: 0.0 ± 0.0
Met
2.534MetAla: 2.534 ± 0.394
0.225MetCys: 0.225 ± 0.119
0.732MetAsp: 0.732 ± 0.188
0.845MetGlu: 0.845 ± 0.223
0.676MetPhe: 0.676 ± 0.206
1.464MetGly: 1.464 ± 0.335
0.169MetHis: 0.169 ± 0.094
0.901MetIle: 0.901 ± 0.201
0.901MetLys: 0.901 ± 0.226
1.633MetLeu: 1.633 ± 0.257
0.507MetMet: 0.507 ± 0.156
1.014MetAsn: 1.014 ± 0.218
1.521MetPro: 1.521 ± 0.269
0.901MetGln: 0.901 ± 0.316
2.027MetArg: 2.027 ± 0.349
2.027MetSer: 2.027 ± 0.343
2.929MetThr: 2.929 ± 0.333
1.07MetVal: 1.07 ± 0.196
0.507MetTrp: 0.507 ± 0.165
0.282MetTyr: 0.282 ± 0.158
0.0MetXaa: 0.0 ± 0.0
Asn
4.168AsnAla: 4.168 ± 0.821
0.507AsnCys: 0.507 ± 0.187
1.915AsnAsp: 1.915 ± 0.278
1.577AsnGlu: 1.577 ± 0.311
1.014AsnPhe: 1.014 ± 0.273
3.492AsnGly: 3.492 ± 0.469
0.676AsnHis: 0.676 ± 0.191
1.014AsnIle: 1.014 ± 0.325
0.957AsnLys: 0.957 ± 0.21
2.196AsnLeu: 2.196 ± 0.34
0.338AsnMet: 0.338 ± 0.137
0.845AsnAsn: 0.845 ± 0.232
2.591AsnPro: 2.591 ± 0.348
1.352AsnGln: 1.352 ± 0.289
2.591AsnArg: 2.591 ± 0.44
2.027AsnSer: 2.027 ± 0.371
1.352AsnThr: 1.352 ± 0.387
1.521AsnVal: 1.521 ± 0.339
0.901AsnTrp: 0.901 ± 0.205
0.676AsnTyr: 0.676 ± 0.226
0.0AsnXaa: 0.0 ± 0.0
Pro
5.519ProAla: 5.519 ± 0.611
0.451ProCys: 0.451 ± 0.19
4.787ProAsp: 4.787 ± 0.573
3.098ProGlu: 3.098 ± 0.372
1.521ProPhe: 1.521 ± 0.315
5.35ProGly: 5.35 ± 0.457
1.464ProHis: 1.464 ± 0.387
2.14ProIle: 2.14 ± 0.37
1.971ProLys: 1.971 ± 0.333
3.154ProLeu: 3.154 ± 0.354
0.957ProMet: 0.957 ± 0.25
1.915ProAsn: 1.915 ± 0.322
3.717ProPro: 3.717 ± 0.642
1.915ProGln: 1.915 ± 0.286
3.492ProArg: 3.492 ± 0.586
3.041ProSer: 3.041 ± 0.452
3.83ProThr: 3.83 ± 0.404
3.379ProVal: 3.379 ± 0.459
1.859ProTrp: 1.859 ± 0.461
1.408ProTyr: 1.408 ± 0.254
0.0ProXaa: 0.0 ± 0.0
Gln
4.168GlnAla: 4.168 ± 0.704
0.451GlnCys: 0.451 ± 0.209
1.464GlnAsp: 1.464 ± 0.263
1.408GlnGlu: 1.408 ± 0.277
1.295GlnPhe: 1.295 ± 0.315
2.647GlnGly: 2.647 ± 0.592
0.845GlnHis: 0.845 ± 0.231
2.027GlnIle: 2.027 ± 0.402
1.126GlnLys: 1.126 ± 0.401
3.267GlnLeu: 3.267 ± 0.599
1.07GlnMet: 1.07 ± 0.304
0.845GlnAsn: 0.845 ± 0.323
1.746GlnPro: 1.746 ± 0.275
2.253GlnGln: 2.253 ± 0.724
2.422GlnArg: 2.422 ± 0.368
2.365GlnSer: 2.365 ± 0.442
2.196GlnThr: 2.196 ± 0.336
2.591GlnVal: 2.591 ± 0.428
0.957GlnTrp: 0.957 ± 0.205
1.183GlnTyr: 1.183 ± 0.232
0.0GlnXaa: 0.0 ± 0.0
Arg
7.265ArgAla: 7.265 ± 0.578
1.352ArgCys: 1.352 ± 0.33
4.787ArgAsp: 4.787 ± 0.438
4.224ArgGlu: 4.224 ± 0.583
2.196ArgPhe: 2.196 ± 0.37
4.731ArgGly: 4.731 ± 0.543
2.084ArgHis: 2.084 ± 0.4
3.604ArgIle: 3.604 ± 0.482
3.886ArgLys: 3.886 ± 0.564
5.688ArgLeu: 5.688 ± 0.553
2.478ArgMet: 2.478 ± 0.334
3.098ArgAsn: 3.098 ± 0.47
3.604ArgPro: 3.604 ± 0.452
3.041ArgGln: 3.041 ± 0.434
7.04ArgArg: 7.04 ± 0.952
3.999ArgSer: 3.999 ± 0.487
3.717ArgThr: 3.717 ± 0.467
4.787ArgVal: 4.787 ± 0.654
1.859ArgTrp: 1.859 ± 0.377
2.14ArgTyr: 2.14 ± 0.352
0.0ArgXaa: 0.0 ± 0.0
Ser
7.49SerAla: 7.49 ± 1.029
0.451SerCys: 0.451 ± 0.198
3.999SerAsp: 3.999 ± 0.454
3.999SerGlu: 3.999 ± 0.414
1.464SerPhe: 1.464 ± 0.286
6.308SerGly: 6.308 ± 0.69
0.845SerHis: 0.845 ± 0.282
2.196SerIle: 2.196 ± 0.307
1.295SerLys: 1.295 ± 0.333
3.886SerLeu: 3.886 ± 0.479
1.915SerMet: 1.915 ± 0.293
1.746SerAsn: 1.746 ± 0.268
2.591SerPro: 2.591 ± 0.339
2.084SerGln: 2.084 ± 0.426
3.548SerArg: 3.548 ± 0.423
3.323SerSer: 3.323 ± 0.449
4.393SerThr: 4.393 ± 0.562
3.999SerVal: 3.999 ± 0.576
1.352SerTrp: 1.352 ± 0.249
1.408SerTyr: 1.408 ± 0.253
0.0SerXaa: 0.0 ± 0.0
Thr
8.448ThrAla: 8.448 ± 0.876
0.957ThrCys: 0.957 ± 0.288
4.674ThrAsp: 4.674 ± 0.467
2.534ThrGlu: 2.534 ± 0.372
2.027ThrPhe: 2.027 ± 0.335
5.801ThrGly: 5.801 ± 0.689
1.464ThrHis: 1.464 ± 0.394
3.21ThrIle: 3.21 ± 0.462
1.521ThrLys: 1.521 ± 0.318
6.139ThrLeu: 6.139 ± 0.583
1.07ThrMet: 1.07 ± 0.25
2.084ThrAsn: 2.084 ± 0.369
4.956ThrPro: 4.956 ± 0.741
2.196ThrGln: 2.196 ± 0.279
4.393ThrArg: 4.393 ± 0.635
3.379ThrSer: 3.379 ± 0.444
4.449ThrThr: 4.449 ± 0.575
4.562ThrVal: 4.562 ± 0.583
1.183ThrTrp: 1.183 ± 0.275
1.859ThrTyr: 1.859 ± 0.346
0.0ThrXaa: 0.0 ± 0.0
Val
7.659ValAla: 7.659 ± 0.581
1.014ValCys: 1.014 ± 0.339
5.012ValAsp: 5.012 ± 0.588
3.604ValGlu: 3.604 ± 0.45
1.577ValPhe: 1.577 ± 0.34
5.125ValGly: 5.125 ± 0.561
1.183ValHis: 1.183 ± 0.33
3.379ValIle: 3.379 ± 0.619
2.027ValLys: 2.027 ± 0.424
4.9ValLeu: 4.9 ± 0.546
1.577ValMet: 1.577 ± 0.305
2.422ValAsn: 2.422 ± 0.349
3.886ValPro: 3.886 ± 0.518
2.365ValGln: 2.365 ± 0.431
4.731ValArg: 4.731 ± 0.598
4.168ValSer: 4.168 ± 0.425
5.069ValThr: 5.069 ± 0.685
5.576ValVal: 5.576 ± 0.682
1.746ValTrp: 1.746 ± 0.342
0.957ValTyr: 0.957 ± 0.221
0.0ValXaa: 0.0 ± 0.0
Trp
2.478TrpAla: 2.478 ± 0.487
0.451TrpCys: 0.451 ± 0.166
1.521TrpAsp: 1.521 ± 0.287
1.239TrpGlu: 1.239 ± 0.205
0.563TrpPhe: 0.563 ± 0.207
1.577TrpGly: 1.577 ± 0.236
0.676TrpHis: 0.676 ± 0.213
0.732TrpIle: 0.732 ± 0.206
0.788TrpLys: 0.788 ± 0.232
2.591TrpLeu: 2.591 ± 0.398
0.957TrpMet: 0.957 ± 0.218
0.507TrpAsn: 0.507 ± 0.187
0.732TrpPro: 0.732 ± 0.243
0.845TrpGln: 0.845 ± 0.238
2.14TrpArg: 2.14 ± 0.392
1.352TrpSer: 1.352 ± 0.335
1.183TrpThr: 1.183 ± 0.263
1.746TrpVal: 1.746 ± 0.317
0.732TrpTrp: 0.732 ± 0.247
0.394TrpTyr: 0.394 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.196TyrAla: 2.196 ± 0.286
0.394TyrCys: 0.394 ± 0.154
1.746TyrAsp: 1.746 ± 0.276
1.239TyrGlu: 1.239 ± 0.232
1.014TyrPhe: 1.014 ± 0.258
2.422TyrGly: 2.422 ± 0.335
0.338TyrHis: 0.338 ± 0.138
0.507TyrIle: 0.507 ± 0.176
0.563TyrLys: 0.563 ± 0.195
1.633TyrLeu: 1.633 ± 0.276
0.394TyrMet: 0.394 ± 0.14
0.507TyrAsn: 0.507 ± 0.171
1.239TyrPro: 1.239 ± 0.323
0.957TyrGln: 0.957 ± 0.212
2.027TyrArg: 2.027 ± 0.357
1.577TyrSer: 1.577 ± 0.357
2.027TyrThr: 2.027 ± 0.3
1.859TyrVal: 1.859 ± 0.359
0.62TyrTrp: 0.62 ± 0.178
0.507TyrTyr: 0.507 ± 0.174
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 98 proteins (17757 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski