Amino acid dipepetide frequency for Arthrobacter phage DrRobert

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.703AlaAla: 11.703 ± 0.965
0.883AlaCys: 0.883 ± 0.322
4.858AlaAsp: 4.858 ± 0.449
6.698AlaGlu: 6.698 ± 0.906
2.944AlaPhe: 2.944 ± 0.421
8.244AlaGly: 8.244 ± 0.86
1.619AlaHis: 1.619 ± 0.385
4.637AlaIle: 4.637 ± 0.544
5.888AlaLys: 5.888 ± 0.878
8.906AlaLeu: 8.906 ± 1.085
2.061AlaMet: 2.061 ± 0.359
3.975AlaAsn: 3.975 ± 0.427
4.343AlaPro: 4.343 ± 0.573
4.932AlaGln: 4.932 ± 0.571
5.668AlaArg: 5.668 ± 0.622
6.33AlaSer: 6.33 ± 0.813
5.815AlaThr: 5.815 ± 0.702
6.772AlaVal: 6.772 ± 0.652
1.84AlaTrp: 1.84 ± 0.338
3.165AlaTyr: 3.165 ± 0.428
0.0AlaXaa: 0.0 ± 0.0
Cys
0.515CysAla: 0.515 ± 0.199
0.074CysCys: 0.074 ± 0.073
0.368CysAsp: 0.368 ± 0.159
0.589CysGlu: 0.589 ± 0.232
0.074CysPhe: 0.074 ± 0.083
1.325CysGly: 1.325 ± 0.411
0.147CysHis: 0.147 ± 0.102
0.589CysIle: 0.589 ± 0.21
0.294CysLys: 0.294 ± 0.135
0.442CysLeu: 0.442 ± 0.147
0.074CysMet: 0.074 ± 0.073
0.442CysAsn: 0.442 ± 0.213
0.294CysPro: 0.294 ± 0.133
0.662CysGln: 0.662 ± 0.185
0.589CysArg: 0.589 ± 0.214
0.368CysSer: 0.368 ± 0.185
0.515CysThr: 0.515 ± 0.192
0.147CysVal: 0.147 ± 0.151
0.221CysTrp: 0.221 ± 0.129
0.221CysTyr: 0.221 ± 0.12
0.0CysXaa: 0.0 ± 0.0
Asp
5.079AspAla: 5.079 ± 0.488
0.662AspCys: 0.662 ± 0.257
3.607AspAsp: 3.607 ± 0.563
4.637AspGlu: 4.637 ± 0.68
1.767AspPhe: 1.767 ± 0.318
5.668AspGly: 5.668 ± 0.704
1.251AspHis: 1.251 ± 0.386
2.135AspIle: 2.135 ± 0.402
1.914AspLys: 1.914 ± 0.31
5.594AspLeu: 5.594 ± 0.642
1.398AspMet: 1.398 ± 0.28
1.325AspAsn: 1.325 ± 0.329
3.386AspPro: 3.386 ± 0.544
2.429AspGln: 2.429 ± 0.393
4.343AspArg: 4.343 ± 0.575
3.754AspSer: 3.754 ± 0.488
2.944AspThr: 2.944 ± 0.517
2.503AspVal: 2.503 ± 0.503
1.84AspTrp: 1.84 ± 0.384
1.472AspTyr: 1.472 ± 0.331
0.0AspXaa: 0.0 ± 0.0
Glu
8.244GluAla: 8.244 ± 0.911
0.589GluCys: 0.589 ± 0.232
4.343GluAsp: 4.343 ± 0.672
4.416GluGlu: 4.416 ± 0.527
1.914GluPhe: 1.914 ± 0.416
4.784GluGly: 4.784 ± 0.615
0.957GluHis: 0.957 ± 0.31
2.797GluIle: 2.797 ± 0.509
3.239GluLys: 3.239 ± 0.537
5.962GluLeu: 5.962 ± 0.657
1.693GluMet: 1.693 ± 0.437
2.355GluAsn: 2.355 ± 0.284
2.576GluPro: 2.576 ± 0.474
3.165GluGln: 3.165 ± 0.569
3.827GluArg: 3.827 ± 0.644
2.65GluSer: 2.65 ± 0.46
3.754GluThr: 3.754 ± 0.611
4.564GluVal: 4.564 ± 0.593
1.251GluTrp: 1.251 ± 0.271
1.472GluTyr: 1.472 ± 0.351
0.0GluXaa: 0.0 ± 0.0
Phe
2.503PheAla: 2.503 ± 0.436
0.221PheCys: 0.221 ± 0.111
2.723PheAsp: 2.723 ± 0.364
1.767PheGlu: 1.767 ± 0.274
1.104PhePhe: 1.104 ± 0.454
3.312PheGly: 3.312 ± 0.674
0.662PheHis: 0.662 ± 0.249
0.662PheIle: 0.662 ± 0.191
1.693PheLys: 1.693 ± 0.371
1.619PheLeu: 1.619 ± 0.278
1.03PheMet: 1.03 ± 0.291
1.03PheAsn: 1.03 ± 0.277
0.81PhePro: 0.81 ± 0.298
1.251PheGln: 1.251 ± 0.301
1.914PheArg: 1.914 ± 0.448
2.429PheSer: 2.429 ± 0.55
1.987PheThr: 1.987 ± 0.425
2.429PheVal: 2.429 ± 0.423
0.074PheTrp: 0.074 ± 0.07
0.662PheTyr: 0.662 ± 0.201
0.0PheXaa: 0.0 ± 0.0
Gly
7.949GlyAla: 7.949 ± 0.811
0.81GlyCys: 0.81 ± 0.265
4.564GlyAsp: 4.564 ± 0.59
5.005GlyGlu: 5.005 ± 0.631
3.018GlyPhe: 3.018 ± 0.574
5.668GlyGly: 5.668 ± 0.626
2.061GlyHis: 2.061 ± 0.455
4.122GlyIle: 4.122 ± 0.738
4.122GlyLys: 4.122 ± 0.517
7.434GlyLeu: 7.434 ± 0.936
2.135GlyMet: 2.135 ± 0.294
3.68GlyAsn: 3.68 ± 0.587
4.269GlyPro: 4.269 ± 0.605
3.754GlyGln: 3.754 ± 0.452
3.975GlyArg: 3.975 ± 0.565
6.477GlySer: 6.477 ± 1.025
5.815GlyThr: 5.815 ± 1.155
6.698GlyVal: 6.698 ± 0.746
1.693GlyTrp: 1.693 ± 0.404
2.208GlyTyr: 2.208 ± 0.4
0.0GlyXaa: 0.0 ± 0.0
His
1.767HisAla: 1.767 ± 0.38
0.147HisCys: 0.147 ± 0.111
1.325HisAsp: 1.325 ± 0.266
0.883HisGlu: 0.883 ± 0.293
0.662HisPhe: 0.662 ± 0.252
1.619HisGly: 1.619 ± 0.311
0.515HisHis: 0.515 ± 0.184
1.178HisIle: 1.178 ± 0.265
1.03HisLys: 1.03 ± 0.243
1.767HisLeu: 1.767 ± 0.421
0.589HisMet: 0.589 ± 0.223
0.589HisAsn: 0.589 ± 0.254
0.662HisPro: 0.662 ± 0.233
0.515HisGln: 0.515 ± 0.205
0.81HisArg: 0.81 ± 0.299
1.251HisSer: 1.251 ± 0.209
1.251HisThr: 1.251 ± 0.322
1.325HisVal: 1.325 ± 0.362
0.515HisTrp: 0.515 ± 0.208
0.957HisTyr: 0.957 ± 0.323
0.0HisXaa: 0.0 ± 0.0
Ile
4.784IleAla: 4.784 ± 0.685
0.368IleCys: 0.368 ± 0.169
3.018IleAsp: 3.018 ± 0.495
3.312IleGlu: 3.312 ± 0.448
1.178IlePhe: 1.178 ± 0.351
3.533IleGly: 3.533 ± 0.596
0.589IleHis: 0.589 ± 0.189
2.429IleIle: 2.429 ± 0.529
2.503IleLys: 2.503 ± 0.369
3.68IleLeu: 3.68 ± 0.488
0.736IleMet: 0.736 ± 0.288
2.355IleAsn: 2.355 ± 0.355
2.282IlePro: 2.282 ± 0.346
1.693IleGln: 1.693 ± 0.294
3.533IleArg: 3.533 ± 0.542
3.827IleSer: 3.827 ± 0.671
3.239IleThr: 3.239 ± 0.365
2.797IleVal: 2.797 ± 0.574
0.662IleTrp: 0.662 ± 0.262
0.81IleTyr: 0.81 ± 0.253
0.0IleXaa: 0.0 ± 0.0
Lys
6.404LysAla: 6.404 ± 0.731
0.662LysCys: 0.662 ± 0.263
3.165LysAsp: 3.165 ± 0.556
2.797LysGlu: 2.797 ± 0.519
1.84LysPhe: 1.84 ± 0.39
3.975LysGly: 3.975 ± 0.448
1.398LysHis: 1.398 ± 0.378
2.797LysIle: 2.797 ± 0.534
2.429LysLys: 2.429 ± 0.52
4.564LysLeu: 4.564 ± 0.515
2.061LysMet: 2.061 ± 0.37
0.883LysAsn: 0.883 ± 0.255
2.871LysPro: 2.871 ± 0.509
1.178LysGln: 1.178 ± 0.271
2.871LysArg: 2.871 ± 0.476
2.723LysSer: 2.723 ± 0.458
3.239LysThr: 3.239 ± 0.649
3.459LysVal: 3.459 ± 0.578
0.368LysTrp: 0.368 ± 0.169
1.251LysTyr: 1.251 ± 0.248
0.0LysXaa: 0.0 ± 0.0
Leu
9.569LeuAla: 9.569 ± 0.922
0.662LeuCys: 0.662 ± 0.218
5.888LeuAsp: 5.888 ± 0.578
4.416LeuGlu: 4.416 ± 0.614
2.503LeuPhe: 2.503 ± 0.479
7.14LeuGly: 7.14 ± 0.848
1.546LeuHis: 1.546 ± 0.379
4.637LeuIle: 4.637 ± 0.542
3.239LeuLys: 3.239 ± 0.492
6.845LeuLeu: 6.845 ± 0.748
1.619LeuMet: 1.619 ± 0.404
3.018LeuAsn: 3.018 ± 0.373
4.711LeuPro: 4.711 ± 0.651
3.459LeuGln: 3.459 ± 0.436
5.373LeuArg: 5.373 ± 0.524
4.858LeuSer: 4.858 ± 0.702
4.711LeuThr: 4.711 ± 0.586
7.14LeuVal: 7.14 ± 0.675
1.693LeuTrp: 1.693 ± 0.322
2.355LeuTyr: 2.355 ± 0.445
0.0LeuXaa: 0.0 ± 0.0
Met
1.987MetAla: 1.987 ± 0.398
0.0MetCys: 0.0 ± 0.0
1.398MetAsp: 1.398 ± 0.359
1.325MetGlu: 1.325 ± 0.311
0.736MetPhe: 0.736 ± 0.224
1.693MetGly: 1.693 ± 0.411
0.515MetHis: 0.515 ± 0.206
1.619MetIle: 1.619 ± 0.423
1.104MetLys: 1.104 ± 0.307
1.619MetLeu: 1.619 ± 0.32
0.589MetMet: 0.589 ± 0.216
0.957MetAsn: 0.957 ± 0.197
2.061MetPro: 2.061 ± 0.327
0.81MetGln: 0.81 ± 0.3
0.81MetArg: 0.81 ± 0.296
1.693MetSer: 1.693 ± 0.303
2.135MetThr: 2.135 ± 0.388
2.282MetVal: 2.282 ± 0.358
0.368MetTrp: 0.368 ± 0.134
0.294MetTyr: 0.294 ± 0.152
0.0MetXaa: 0.0 ± 0.0
Asn
2.723AsnAla: 2.723 ± 0.437
0.589AsnCys: 0.589 ± 0.215
1.84AsnAsp: 1.84 ± 0.349
2.208AsnGlu: 2.208 ± 0.382
0.957AsnPhe: 0.957 ± 0.229
4.637AsnGly: 4.637 ± 0.64
0.294AsnHis: 0.294 ± 0.192
1.398AsnIle: 1.398 ± 0.352
1.767AsnLys: 1.767 ± 0.314
3.533AsnLeu: 3.533 ± 0.504
0.662AsnMet: 0.662 ± 0.237
1.03AsnAsn: 1.03 ± 0.273
2.135AsnPro: 2.135 ± 0.387
1.398AsnGln: 1.398 ± 0.327
1.546AsnArg: 1.546 ± 0.352
1.84AsnSer: 1.84 ± 0.273
2.797AsnThr: 2.797 ± 0.411
2.576AsnVal: 2.576 ± 0.389
0.883AsnTrp: 0.883 ± 0.222
0.883AsnTyr: 0.883 ± 0.258
0.0AsnXaa: 0.0 ± 0.0
Pro
5.594ProAla: 5.594 ± 0.707
0.147ProCys: 0.147 ± 0.099
2.503ProAsp: 2.503 ± 0.438
3.827ProGlu: 3.827 ± 0.559
0.957ProPhe: 0.957 ± 0.261
5.594ProGly: 5.594 ± 0.606
0.81ProHis: 0.81 ± 0.25
2.282ProIle: 2.282 ± 0.462
3.018ProLys: 3.018 ± 0.503
3.091ProLeu: 3.091 ± 0.48
1.104ProMet: 1.104 ± 0.235
1.693ProAsn: 1.693 ± 0.399
1.987ProPro: 1.987 ± 0.43
1.178ProGln: 1.178 ± 0.269
2.282ProArg: 2.282 ± 0.407
3.975ProSer: 3.975 ± 0.534
3.165ProThr: 3.165 ± 0.515
4.564ProVal: 4.564 ± 0.466
1.325ProTrp: 1.325 ± 0.371
1.03ProTyr: 1.03 ± 0.301
0.0ProXaa: 0.0 ± 0.0
Gln
3.901GlnAla: 3.901 ± 0.449
0.147GlnCys: 0.147 ± 0.106
1.987GlnAsp: 1.987 ± 0.356
2.135GlnGlu: 2.135 ± 0.44
1.251GlnPhe: 1.251 ± 0.302
2.871GlnGly: 2.871 ± 0.493
0.662GlnHis: 0.662 ± 0.194
2.135GlnIle: 2.135 ± 0.492
2.576GlnLys: 2.576 ± 0.453
4.637GlnLeu: 4.637 ± 0.643
0.81GlnMet: 0.81 ± 0.256
1.472GlnAsn: 1.472 ± 0.33
2.135GlnPro: 2.135 ± 0.408
1.546GlnGln: 1.546 ± 0.295
2.208GlnArg: 2.208 ± 0.4
1.546GlnSer: 1.546 ± 0.4
2.503GlnThr: 2.503 ± 0.344
3.165GlnVal: 3.165 ± 0.506
1.03GlnTrp: 1.03 ± 0.278
0.81GlnTyr: 0.81 ± 0.255
0.0GlnXaa: 0.0 ± 0.0
Arg
5.3ArgAla: 5.3 ± 0.534
0.442ArgCys: 0.442 ± 0.194
3.239ArgAsp: 3.239 ± 0.581
4.122ArgGlu: 4.122 ± 0.635
1.398ArgPhe: 1.398 ± 0.33
4.048ArgGly: 4.048 ± 0.542
1.178ArgHis: 1.178 ± 0.359
2.797ArgIle: 2.797 ± 0.462
3.754ArgLys: 3.754 ± 0.652
6.036ArgLeu: 6.036 ± 0.815
1.546ArgMet: 1.546 ± 0.258
2.429ArgAsn: 2.429 ± 0.466
3.091ArgPro: 3.091 ± 0.489
1.84ArgGln: 1.84 ± 0.363
4.048ArgArg: 4.048 ± 0.583
3.312ArgSer: 3.312 ± 0.518
2.503ArgThr: 2.503 ± 0.318
4.269ArgVal: 4.269 ± 0.816
0.957ArgTrp: 0.957 ± 0.292
1.325ArgTyr: 1.325 ± 0.396
0.0ArgXaa: 0.0 ± 0.0
Ser
4.637SerAla: 4.637 ± 0.621
0.294SerCys: 0.294 ± 0.14
3.239SerAsp: 3.239 ± 0.46
4.784SerGlu: 4.784 ± 0.563
2.355SerPhe: 2.355 ± 0.347
6.772SerGly: 6.772 ± 1.001
1.398SerHis: 1.398 ± 0.329
3.386SerIle: 3.386 ± 0.699
3.165SerLys: 3.165 ± 0.518
5.447SerLeu: 5.447 ± 0.784
1.987SerMet: 1.987 ± 0.422
2.282SerAsn: 2.282 ± 0.583
2.723SerPro: 2.723 ± 0.51
2.135SerGln: 2.135 ± 0.428
3.091SerArg: 3.091 ± 0.488
3.018SerSer: 3.018 ± 0.544
4.637SerThr: 4.637 ± 0.852
4.048SerVal: 4.048 ± 0.574
1.03SerTrp: 1.03 ± 0.307
1.767SerTyr: 1.767 ± 0.418
0.0SerXaa: 0.0 ± 0.0
Thr
7.14ThrAla: 7.14 ± 0.886
0.81ThrCys: 0.81 ± 0.226
3.239ThrAsp: 3.239 ± 0.507
3.68ThrGlu: 3.68 ± 0.601
1.546ThrPhe: 1.546 ± 0.34
5.152ThrGly: 5.152 ± 1.365
0.81ThrHis: 0.81 ± 0.254
2.135ThrIle: 2.135 ± 0.44
3.459ThrLys: 3.459 ± 0.506
5.447ThrLeu: 5.447 ± 0.509
1.178ThrMet: 1.178 ± 0.295
2.503ThrAsn: 2.503 ± 0.417
3.533ThrPro: 3.533 ± 0.437
2.429ThrGln: 2.429 ± 0.412
3.312ThrArg: 3.312 ± 0.578
3.975ThrSer: 3.975 ± 0.578
5.079ThrThr: 5.079 ± 1.01
5.005ThrVal: 5.005 ± 0.832
1.178ThrTrp: 1.178 ± 0.334
1.767ThrTyr: 1.767 ± 0.346
0.0ThrXaa: 0.0 ± 0.0
Val
7.434ValAla: 7.434 ± 0.699
0.221ValCys: 0.221 ± 0.123
3.239ValAsp: 3.239 ± 0.439
5.373ValGlu: 5.373 ± 0.621
1.987ValPhe: 1.987 ± 0.413
5.447ValGly: 5.447 ± 0.817
1.987ValHis: 1.987 ± 0.422
3.312ValIle: 3.312 ± 0.563
3.312ValLys: 3.312 ± 0.485
5.152ValLeu: 5.152 ± 0.603
2.061ValMet: 2.061 ± 0.426
2.355ValAsn: 2.355 ± 0.375
3.975ValPro: 3.975 ± 0.501
3.754ValGln: 3.754 ± 0.606
4.048ValArg: 4.048 ± 0.5
5.52ValSer: 5.52 ± 0.815
3.901ValThr: 3.901 ± 0.515
5.152ValVal: 5.152 ± 0.735
1.987ValTrp: 1.987 ± 0.431
1.84ValTyr: 1.84 ± 0.38
0.0ValXaa: 0.0 ± 0.0
Trp
2.208TrpAla: 2.208 ± 0.317
0.074TrpCys: 0.074 ± 0.075
1.472TrpAsp: 1.472 ± 0.409
1.03TrpGlu: 1.03 ± 0.323
0.662TrpPhe: 0.662 ± 0.212
1.84TrpGly: 1.84 ± 0.396
0.294TrpHis: 0.294 ± 0.138
1.03TrpIle: 1.03 ± 0.256
0.81TrpLys: 0.81 ± 0.235
1.398TrpLeu: 1.398 ± 0.241
0.589TrpMet: 0.589 ± 0.173
0.736TrpAsn: 0.736 ± 0.256
0.736TrpPro: 0.736 ± 0.323
0.662TrpGln: 0.662 ± 0.263
0.957TrpArg: 0.957 ± 0.222
1.398TrpSer: 1.398 ± 0.315
1.619TrpThr: 1.619 ± 0.385
1.84TrpVal: 1.84 ± 0.436
0.589TrpTrp: 0.589 ± 0.171
0.294TrpTyr: 0.294 ± 0.15
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.84TyrAla: 1.84 ± 0.409
0.147TyrCys: 0.147 ± 0.116
1.767TyrAsp: 1.767 ± 0.317
1.546TyrGlu: 1.546 ± 0.276
1.03TyrPhe: 1.03 ± 0.18
2.135TyrGly: 2.135 ± 0.434
0.662TyrHis: 0.662 ± 0.188
1.251TyrIle: 1.251 ± 0.231
1.619TyrLys: 1.619 ± 0.333
2.208TyrLeu: 2.208 ± 0.344
0.0TyrMet: 0.0 ± 0.0
0.589TyrAsn: 0.589 ± 0.192
1.546TyrPro: 1.546 ± 0.315
0.736TyrGln: 0.736 ± 0.259
2.429TyrArg: 2.429 ± 0.348
1.251TyrSer: 1.251 ± 0.31
1.767TyrThr: 1.767 ± 0.499
1.325TyrVal: 1.325 ± 0.347
0.736TyrTrp: 0.736 ± 0.222
0.736TyrTyr: 0.736 ± 0.324
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (13587 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski