Amino acid dipeptide frequency for Escherichia phage GA2A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
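A minimal Python sketch of how such a table could be computed and compared with the uniform expectation is shown here. The function names, the one-letter alphabet, and the choice to normalise by the total number of overlapping pairs are assumptions made for illustration; the source does not state how Proteome-pI normalises its values.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400   # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: frequencies are normalised by the total number of
    overlapping pairs; the database may normalise differently.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    return "over" if freq_per_mille > UNIFORM_PER_MILLE else "under"
```

With the values from the table, `classify(8.775)` (AlaAla) gives "over" and `classify(0.966)` (AlaCys) gives "under".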

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
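As a small illustration of the unit conversion, the following Python sketch turns a per mille value into a fraction and a percentage. The helper names are hypothetical and exist only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 8.775 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(8.775)
ala_ala_percent = per_mille_to_percent(8.775)
```

In the same units, the uniform expectation of 0.25% corresponds to 2.5‰.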

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.775 ± 1.074
AlaCys: 0.966 ± 0.274
AlaAsp: 5.233 ± 0.775
AlaGlu: 5.394 ± 0.725
AlaPhe: 3.381 ± 0.399
AlaGly: 7.89 ± 0.945
AlaHis: 0.886 ± 0.306
AlaIle: 4.992 ± 0.643
AlaLys: 5.797 ± 0.668
AlaLeu: 7.326 ± 1.003
AlaMet: 2.818 ± 0.508
AlaAsn: 3.864 ± 0.571
AlaPro: 2.898 ± 0.531
AlaGln: 2.898 ± 0.569
AlaArg: 3.784 ± 0.545
AlaSer: 4.831 ± 0.575
AlaThr: 3.864 ± 0.655
AlaVal: 6.119 ± 0.881
AlaTrp: 1.449 ± 0.427
AlaTyr: 2.657 ± 0.501
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.725 ± 0.236
CysCys: 0.0 ± 0.0
CysAsp: 0.644 ± 0.336
CysGlu: 0.564 ± 0.197
CysPhe: 0.725 ± 0.274
CysGly: 0.725 ± 0.234
CysHis: 0.161 ± 0.106
CysIle: 0.644 ± 0.287
CysLys: 0.483 ± 0.22
CysLeu: 1.047 ± 0.337
CysMet: 0.242 ± 0.205
CysAsn: 0.322 ± 0.145
CysPro: 0.403 ± 0.182
CysGln: 0.242 ± 0.122
CysArg: 0.644 ± 0.248
CysSer: 0.483 ± 0.21
CysThr: 0.242 ± 0.139
CysVal: 1.047 ± 0.55
CysTrp: 0.081 ± 0.091
CysTyr: 0.483 ± 0.244
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.958 ± 0.723
AspCys: 0.725 ± 0.418
AspAsp: 4.025 ± 0.698
AspGlu: 4.025 ± 0.593
AspPhe: 2.093 ± 0.438
AspGly: 6.28 ± 0.582
AspHis: 1.288 ± 0.315
AspIle: 3.14 ± 0.495
AspLys: 3.14 ± 0.497
AspLeu: 4.589 ± 0.539
AspMet: 2.174 ± 0.374
AspAsn: 2.254 ± 0.503
AspPro: 2.898 ± 0.479
AspGln: 2.093 ± 0.432
AspArg: 2.818 ± 0.509
AspSer: 3.14 ± 0.45
AspThr: 3.945 ± 0.574
AspVal: 4.831 ± 0.574
AspTrp: 0.805 ± 0.299
AspTyr: 2.576 ± 0.466
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.521 ± 1.051
GluCys: 0.483 ± 0.215
GluAsp: 4.508 ± 0.646
GluGlu: 4.75 ± 0.757
GluPhe: 2.415 ± 0.403
GluGly: 4.911 ± 0.771
GluHis: 1.127 ± 0.264
GluIle: 2.898 ± 0.386
GluLys: 2.979 ± 0.53
GluLeu: 6.441 ± 0.714
GluMet: 2.174 ± 0.466
GluAsn: 2.576 ± 0.56
GluPro: 2.013 ± 0.344
GluGln: 2.737 ± 0.531
GluArg: 3.623 ± 0.584
GluSer: 3.945 ± 0.706
GluThr: 3.864 ± 0.479
GluVal: 4.508 ± 0.552
GluTrp: 1.127 ± 0.203
GluTyr: 2.496 ± 0.444
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.657 ± 0.493
PheCys: 0.564 ± 0.217
PheAsp: 2.657 ± 0.375
PheGlu: 1.771 ± 0.268
PhePhe: 1.369 ± 0.372
PheGly: 2.496 ± 0.515
PheHis: 0.564 ± 0.209
PheIle: 1.852 ± 0.459
PheLys: 2.576 ± 0.402
PheLeu: 3.14 ± 0.399
PheMet: 0.805 ± 0.34
PheAsn: 2.818 ± 0.426
PhePro: 1.53 ± 0.383
PheGln: 0.886 ± 0.271
PheArg: 1.53 ± 0.338
PheSer: 2.657 ± 0.369
PheThr: 2.254 ± 0.34
PheVal: 2.335 ± 0.438
PheTrp: 0.242 ± 0.14
PheTyr: 1.208 ± 0.264
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.602 ± 1.004
GlyCys: 0.483 ± 0.223
GlyAsp: 4.831 ± 0.719
GlyGlu: 5.394 ± 0.679
GlyPhe: 2.093 ± 0.496
GlyGly: 5.394 ± 0.602
GlyHis: 1.288 ± 0.402
GlyIle: 4.347 ± 0.539
GlyLys: 6.199 ± 0.747
GlyLeu: 7.004 ± 0.987
GlyMet: 2.174 ± 0.449
GlyAsn: 2.737 ± 0.362
GlyPro: 1.53 ± 0.327
GlyGln: 3.22 ± 0.475
GlyArg: 5.394 ± 0.562
GlySer: 5.716 ± 0.553
GlyThr: 3.784 ± 0.439
GlyVal: 5.555 ± 0.612
GlyTrp: 1.288 ± 0.323
GlyTyr: 3.945 ± 0.644
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.564 ± 0.212
HisCys: 0.161 ± 0.119
HisAsp: 1.449 ± 0.505
HisGlu: 1.127 ± 0.346
HisPhe: 0.483 ± 0.202
HisGly: 1.127 ± 0.282
HisHis: 0.322 ± 0.15
HisIle: 1.127 ± 0.217
HisLys: 1.288 ± 0.287
HisLeu: 2.174 ± 0.458
HisMet: 0.322 ± 0.165
HisAsn: 0.725 ± 0.243
HisPro: 0.564 ± 0.195
HisGln: 0.564 ± 0.222
HisArg: 0.966 ± 0.245
HisSer: 1.127 ± 0.335
HisThr: 1.449 ± 0.311
HisVal: 1.369 ± 0.394
HisTrp: 0.322 ± 0.185
HisTyr: 0.644 ± 0.255
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.864 ± 0.615
IleCys: 0.564 ± 0.251
IleAsp: 2.979 ± 0.476
IleGlu: 2.818 ± 0.413
IlePhe: 1.369 ± 0.341
IleGly: 4.186 ± 0.521
IleHis: 1.288 ± 0.369
IleIle: 2.013 ± 0.343
IleLys: 3.462 ± 0.508
IleLeu: 3.381 ± 0.485
IleMet: 1.208 ± 0.286
IleAsn: 2.979 ± 0.551
IlePro: 2.174 ± 0.461
IleGln: 1.852 ± 0.453
IleArg: 3.059 ± 0.444
IleSer: 2.979 ± 0.538
IleThr: 3.22 ± 0.511
IleVal: 4.186 ± 0.49
IleTrp: 0.725 ± 0.239
IleTyr: 1.369 ± 0.327
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.843 ± 0.867
LysCys: 0.725 ± 0.239
LysAsp: 3.623 ± 0.449
LysGlu: 3.381 ± 0.479
LysPhe: 2.415 ± 0.42
LysGly: 4.186 ± 0.719
LysHis: 1.53 ± 0.413
LysIle: 2.335 ± 0.391
LysLys: 3.623 ± 0.759
LysLeu: 6.199 ± 0.545
LysMet: 1.932 ± 0.334
LysAsn: 2.013 ± 0.464
LysPro: 2.335 ± 0.573
LysGln: 1.852 ± 0.442
LysArg: 3.784 ± 0.593
LysSer: 4.67 ± 0.618
LysThr: 4.347 ± 0.589
LysVal: 4.831 ± 0.704
LysTrp: 1.449 ± 0.324
LysTyr: 2.335 ± 0.436
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.924 ± 0.836
LeuCys: 0.322 ± 0.179
LeuAsp: 4.186 ± 0.468
LeuGlu: 6.199 ± 0.747
LeuPhe: 2.254 ± 0.336
LeuGly: 5.153 ± 0.705
LeuHis: 1.208 ± 0.305
LeuIle: 3.703 ± 0.482
LeuLys: 7.165 ± 0.79
LeuLeu: 4.831 ± 0.531
LeuMet: 2.737 ± 0.504
LeuAsn: 4.831 ± 0.59
LeuPro: 3.542 ± 0.41
LeuGln: 4.106 ± 0.646
LeuArg: 4.992 ± 0.386
LeuSer: 5.153 ± 0.609
LeuThr: 5.636 ± 0.717
LeuVal: 5.072 ± 0.671
LeuTrp: 1.127 ± 0.265
LeuTyr: 2.496 ± 0.597
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.14 ± 0.48
MetCys: 0.564 ± 0.191
MetAsp: 1.449 ± 0.365
MetGlu: 1.932 ± 0.371
MetPhe: 1.047 ± 0.324
MetGly: 2.174 ± 0.436
MetHis: 0.322 ± 0.165
MetIle: 1.53 ± 0.253
MetLys: 1.288 ± 0.335
MetLeu: 2.254 ± 0.31
MetMet: 0.725 ± 0.267
MetAsn: 1.369 ± 0.296
MetPro: 1.047 ± 0.33
MetGln: 0.725 ± 0.244
MetArg: 1.449 ± 0.325
MetSer: 1.61 ± 0.42
MetThr: 2.093 ± 0.429
MetVal: 2.576 ± 0.46
MetTrp: 0.161 ± 0.132
MetTyr: 1.127 ± 0.277
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.623 ± 0.633
AsnCys: 0.564 ± 0.258
AsnAsp: 1.932 ± 0.44
AsnGlu: 2.496 ± 0.473
AsnPhe: 2.013 ± 0.276
AsnGly: 4.508 ± 0.565
AsnHis: 0.725 ± 0.201
AsnIle: 2.335 ± 0.365
AsnLys: 2.657 ± 0.458
AsnLeu: 3.623 ± 0.556
AsnMet: 1.288 ± 0.341
AsnAsn: 2.093 ± 0.411
AsnPro: 3.059 ± 0.475
AsnGln: 1.449 ± 0.314
AsnArg: 2.737 ± 0.558
AsnSer: 2.496 ± 0.559
AsnThr: 2.013 ± 0.375
AsnVal: 2.979 ± 0.479
AsnTrp: 0.403 ± 0.15
AsnTyr: 1.61 ± 0.354
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.059 ± 0.558
ProCys: 0.644 ± 0.276
ProAsp: 2.818 ± 0.446
ProGlu: 3.301 ± 0.481
ProPhe: 1.208 ± 0.231
ProGly: 1.61 ± 0.333
ProHis: 0.564 ± 0.187
ProIle: 2.335 ± 0.419
ProLys: 2.737 ± 0.632
ProLeu: 2.013 ± 0.289
ProMet: 0.966 ± 0.329
ProAsn: 2.576 ± 0.417
ProPro: 0.966 ± 0.287
ProGln: 1.449 ± 0.42
ProArg: 1.691 ± 0.379
ProSer: 2.093 ± 0.295
ProThr: 2.657 ± 0.438
ProVal: 3.301 ± 0.432
ProTrp: 0.644 ± 0.218
ProTyr: 0.886 ± 0.223
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.703 ± 0.629
GlnCys: 0.483 ± 0.209
GlnAsp: 3.301 ± 0.684
GlnGlu: 2.657 ± 0.403
GlnPhe: 1.932 ± 0.348
GlnGly: 2.496 ± 0.43
GlnHis: 0.483 ± 0.176
GlnIle: 1.449 ± 0.306
GlnLys: 1.932 ± 0.347
GlnLeu: 3.945 ± 0.765
GlnMet: 1.127 ± 0.407
GlnAsn: 1.691 ± 0.329
GlnPro: 1.127 ± 0.299
GlnGln: 1.691 ± 0.625
GlnArg: 2.657 ± 0.723
GlnSer: 2.979 ± 0.442
GlnThr: 1.932 ± 0.42
GlnVal: 2.415 ± 0.377
GlnTrp: 0.644 ± 0.175
GlnTyr: 0.886 ± 0.346
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.831 ± 0.965
ArgCys: 0.403 ± 0.167
ArgAsp: 4.025 ± 0.436
ArgGlu: 4.186 ± 0.558
ArgPhe: 2.737 ± 0.41
ArgGly: 4.186 ± 0.474
ArgHis: 0.886 ± 0.305
ArgIle: 3.059 ± 0.58
ArgLys: 3.462 ± 0.568
ArgLeu: 5.394 ± 0.672
ArgMet: 1.369 ± 0.334
ArgAsn: 2.174 ± 0.354
ArgPro: 1.61 ± 0.345
ArgGln: 2.496 ± 0.391
ArgArg: 2.657 ± 0.485
ArgSer: 3.301 ± 0.557
ArgThr: 2.174 ± 0.394
ArgVal: 3.381 ± 0.601
ArgTrp: 1.127 ± 0.261
ArgTyr: 1.691 ± 0.311
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.267 ± 0.608
SerCys: 0.886 ± 0.335
SerAsp: 5.314 ± 0.533
SerGlu: 3.784 ± 0.567
SerPhe: 2.174 ± 0.369
SerGly: 5.877 ± 0.781
SerHis: 2.093 ± 0.39
SerIle: 3.542 ± 0.58
SerLys: 3.381 ± 0.459
SerLeu: 4.186 ± 0.582
SerMet: 1.288 ± 0.334
SerAsn: 1.932 ± 0.402
SerPro: 2.818 ± 0.516
SerGln: 2.174 ± 0.426
SerArg: 2.979 ± 0.481
SerSer: 4.347 ± 0.643
SerThr: 3.784 ± 0.488
SerVal: 3.784 ± 0.65
SerTrp: 0.564 ± 0.207
SerTyr: 2.979 ± 0.459
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.462 ± 0.572
ThrCys: 0.483 ± 0.258
ThrAsp: 3.381 ± 0.375
ThrGlu: 4.75 ± 0.484
ThrPhe: 2.335 ± 0.443
ThrGly: 5.797 ± 0.695
ThrHis: 0.725 ± 0.277
ThrIle: 3.542 ± 0.457
ThrLys: 3.059 ± 0.459
ThrLeu: 4.75 ± 0.562
ThrMet: 1.691 ± 0.346
ThrAsn: 1.771 ± 0.425
ThrPro: 2.979 ± 0.36
ThrGln: 2.818 ± 0.546
ThrArg: 2.737 ± 0.432
ThrSer: 2.576 ± 0.413
ThrThr: 2.657 ± 0.424
ThrVal: 5.153 ± 0.631
ThrTrp: 0.644 ± 0.252
ThrTyr: 1.771 ± 0.285
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.877 ± 0.693
ValCys: 0.483 ± 0.215
ValAsp: 3.381 ± 0.535
ValGlu: 5.072 ± 0.639
ValPhe: 2.093 ± 0.504
ValGly: 5.877 ± 0.59
ValHis: 1.288 ± 0.563
ValIle: 3.059 ± 0.503
ValLys: 5.877 ± 0.748
ValLeu: 4.992 ± 0.609
ValMet: 2.174 ± 0.494
ValAsn: 3.703 ± 0.656
ValPro: 2.415 ± 0.463
ValGln: 3.623 ± 0.505
ValArg: 4.508 ± 0.644
ValSer: 5.314 ± 0.674
ValThr: 4.428 ± 0.607
ValVal: 5.958 ± 1.025
ValTrp: 0.644 ± 0.229
ValTyr: 2.254 ± 0.39
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.564 ± 0.208
TrpCys: 0.161 ± 0.116
TrpAsp: 0.644 ± 0.189
TrpGlu: 0.725 ± 0.244
TrpPhe: 0.725 ± 0.23
TrpGly: 0.805 ± 0.249
TrpHis: 0.322 ± 0.151
TrpIle: 0.242 ± 0.161
TrpLys: 1.449 ± 0.342
TrpLeu: 1.932 ± 0.428
TrpMet: 0.242 ± 0.14
TrpAsn: 0.725 ± 0.22
TrpPro: 0.322 ± 0.157
TrpGln: 0.805 ± 0.309
TrpArg: 0.725 ± 0.21
TrpSer: 1.208 ± 0.465
TrpThr: 0.564 ± 0.24
TrpVal: 1.127 ± 0.312
TrpTrp: 0.242 ± 0.129
TrpTyr: 0.644 ± 0.233
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.864 ± 0.603
TyrCys: 0.242 ± 0.158
TyrAsp: 2.415 ± 0.431
TyrGlu: 1.852 ± 0.418
TyrPhe: 1.047 ± 0.258
TyrGly: 3.301 ± 0.56
TyrHis: 0.805 ± 0.29
TyrIle: 1.53 ± 0.419
TyrLys: 1.852 ± 0.376
TyrLeu: 2.415 ± 0.378
TyrMet: 1.047 ± 0.235
TyrAsn: 1.449 ± 0.377
TyrPro: 1.288 ± 0.361
TyrGln: 1.852 ± 0.536
TyrArg: 2.415 ± 0.461
TyrSer: 1.53 ± 0.381
TyrThr: 2.174 ± 0.375
TyrVal: 2.496 ± 0.435
TyrTrp: 0.483 ± 0.172
TyrTyr: 1.047 ± 0.304
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (12422 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
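One way a protein-level bootstrap could work is sketched below in Python: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. This is an illustration under stated assumptions, not the database's actual procedure; the function names, the use of the population standard deviation as the error, and the normalisation by the number of overlapping pairs are all assumptions.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the per mille frequency of `pair` across bootstrap replicates.

    Assumption: the error is the standard deviation over replicates;
    the source only states that bootstrapping (x100) was used.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        total = sum(max(len(s) - 1, 0) for s in sample)
        hits = sum(count_pair(s, pair) for s in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)
```

Resampling at the protein level, rather than at the level of individual dipeptides, keeps each protein's internal composition intact, so the error reflects variation between proteins.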

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski