Amino acid dipeptide frequency for BtRf-AlphaCoV/HuB2013

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
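
The calculation described above can be sketched in Python. This is a minimal illustration, not the code used by Proteome-pI: the function names `dipeptide_permille` and `classify` and the toy sequences are hypothetical, and the sketch assumes that overlapping residue pairs are counted within each protein only and that any non-standard letter is mapped to Xaa (`X`).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: overlapping pairs are counted inside each protein only,
    and every non-standard letter is treated as 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def classify(freq_permille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (1/400 = 2.5 per mille)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


toy = ["MKVVLA", "VVAX"]  # hypothetical toy sequences, not the real proteome
freqs = dipeptide_permille(toy)
print(freqs["VV"], classify(freqs["VV"]))
```

With the values in the table, `classify(9.579)` (ValVal) gives "over" and `classify(0.11)` (TrpCys) gives "under", which corresponds to the red and blue marking described above.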

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.285 ± 0.871
AlaCys: 2.202 ± 0.658
AlaAsp: 2.643 ± 0.506
AlaGlu: 2.422 ± 1.569
AlaPhe: 4.735 ± 0.765
AlaGly: 3.303 ± 0.972
AlaHis: 0.991 ± 0.365
AlaIle: 4.514 ± 0.584
AlaLys: 3.193 ± 1.146
AlaLeu: 6.166 ± 1.89
AlaMet: 1.431 ± 0.225
AlaAsn: 3.193 ± 1.101
AlaPro: 2.422 ± 0.837
AlaGln: 1.652 ± 0.646
AlaArg: 2.092 ± 0.645
AlaSer: 4.514 ± 0.465
AlaThr: 4.184 ± 1.173
AlaVal: 6.276 ± 1.669
AlaTrp: 0.551 ± 0.173
AlaTyr: 2.643 ± 0.729
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.762 ± 0.456
CysCys: 1.211 ± 0.386
CysAsp: 1.542 ± 0.637
CysGlu: 0.991 ± 0.508
CysPhe: 2.202 ± 0.459
CysGly: 2.202 ± 0.575
CysHis: 0.551 ± 0.318
CysIle: 1.211 ± 0.308
CysLys: 1.982 ± 0.73
CysLeu: 1.872 ± 0.658
CysMet: 0.551 ± 0.282
CysAsn: 2.092 ± 0.691
CysPro: 0.771 ± 0.266
CysGln: 0.771 ± 0.402
CysArg: 1.101 ± 0.346
CysSer: 2.092 ± 0.624
CysThr: 2.202 ± 0.575
CysVal: 3.634 ± 1.025
CysTrp: 0.44 ± 0.226
CysTyr: 2.202 ± 0.94
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.523 ± 0.718
AspCys: 1.542 ± 0.531
AspAsp: 1.872 ± 0.641
AspGlu: 2.643 ± 0.693
AspPhe: 3.634 ± 0.946
AspGly: 4.404 ± 1.673
AspHis: 0.991 ± 0.347
AspIle: 3.523 ± 0.915
AspLys: 2.092 ± 0.489
AspLeu: 4.514 ± 0.558
AspMet: 0.991 ± 0.312
AspAsn: 2.753 ± 0.99
AspPro: 1.542 ± 0.525
AspGln: 1.211 ± 0.998
AspArg: 1.101 ± 0.253
AspSer: 2.973 ± 0.467
AspThr: 2.312 ± 0.562
AspVal: 5.065 ± 0.994
AspTrp: 0.771 ± 0.395
AspTyr: 3.303 ± 0.972
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.532 ± 0.647
GluCys: 1.101 ± 0.393
GluAsp: 2.753 ± 0.809
GluGlu: 2.202 ± 0.369
GluPhe: 2.863 ± 1.022
GluGly: 2.863 ± 0.809
GluHis: 1.321 ± 0.477
GluIle: 1.762 ± 0.992
GluLys: 2.312 ± 0.443
GluLeu: 3.413 ± 1.268
GluMet: 0.661 ± 0.366
GluAsn: 2.532 ± 0.686
GluPro: 2.092 ± 0.48
GluGln: 1.652 ± 0.914
GluArg: 1.872 ± 0.615
GluSer: 3.193 ± 0.594
GluThr: 0.991 ± 0.617
GluVal: 5.726 ± 1.162
GluTrp: 0.44 ± 0.142
GluTyr: 1.652 ± 0.44
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.625 ± 1.703
PheCys: 1.762 ± 0.625
PheAsp: 3.964 ± 0.731
PheGlu: 2.643 ± 0.893
PhePhe: 2.422 ± 0.442
PheGly: 3.193 ± 1.128
PheHis: 0.22 ± 0.14
PheIle: 2.202 ± 1.226
PheLys: 4.074 ± 1.187
PheLeu: 4.514 ± 1.528
PheMet: 1.431 ± 0.537
PheAsn: 4.294 ± 0.914
PhePro: 0.661 ± 0.258
PheGln: 1.321 ± 0.545
PheArg: 1.431 ± 0.529
PheSer: 4.625 ± 1.025
PheThr: 2.973 ± 0.616
PheVal: 6.166 ± 1.24
PheTrp: 1.211 ± 0.558
PheTyr: 2.973 ± 0.309
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.303 ± 1.04
GlyCys: 1.982 ± 0.486
GlyAsp: 4.625 ± 0.302
GlyGlu: 2.092 ± 0.501
GlyPhe: 3.854 ± 1.051
GlyGly: 4.514 ± 1.131
GlyHis: 0.551 ± 0.173
GlyIle: 3.744 ± 1.187
GlyLys: 5.065 ± 0.51
GlyLeu: 4.625 ± 0.555
GlyMet: 0.991 ± 0.508
GlyAsn: 4.184 ± 1.331
GlyPro: 1.872 ± 0.54
GlyGln: 1.652 ± 0.502
GlyArg: 1.762 ± 0.712
GlySer: 5.065 ± 1.035
GlyThr: 3.193 ± 1.268
GlyVal: 8.588 ± 0.922
GlyTrp: 0.551 ± 0.294
GlyTyr: 2.092 ± 0.501
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.762 ± 0.327
HisCys: 0.771 ± 0.395
HisAsp: 0.881 ± 0.313
HisGlu: 0.44 ± 0.226
HisPhe: 0.771 ± 0.314
HisGly: 0.44 ± 0.226
HisHis: 0.22 ± 0.378
HisIle: 0.661 ± 0.215
HisLys: 1.101 ± 0.418
HisLeu: 1.431 ± 0.58
HisMet: 0.33 ± 0.129
HisAsn: 1.321 ± 0.426
HisPro: 0.44 ± 0.226
HisGln: 0.661 ± 0.32
HisArg: 0.33 ± 0.355
HisSer: 1.211 ± 0.311
HisThr: 1.872 ± 0.503
HisVal: 1.321 ± 0.426
HisTrp: 0.22 ± 0.113
HisTyr: 0.991 ± 0.239
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.413 ± 0.935
IleCys: 0.881 ± 0.391
IleAsp: 2.422 ± 0.685
IleGlu: 1.652 ± 0.321
IlePhe: 2.202 ± 0.505
IleGly: 3.634 ± 0.944
IleHis: 0.991 ± 0.312
IleIle: 2.422 ± 0.926
IleLys: 4.074 ± 0.703
IleLeu: 3.854 ± 0.898
IleMet: 0.991 ± 0.333
IleAsn: 2.643 ± 0.938
IlePro: 2.092 ± 0.675
IleGln: 1.982 ± 1.063
IleArg: 1.211 ± 0.596
IleSer: 4.294 ± 1.088
IleThr: 4.955 ± 1.516
IleVal: 4.514 ± 0.65
IleTrp: 0.44 ± 0.142
IleTyr: 2.312 ± 0.839
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.413 ± 0.985
LysCys: 2.202 ± 0.526
LysAsp: 3.413 ± 0.436
LysGlu: 3.744 ± 0.684
LysPhe: 3.083 ± 0.584
LysGly: 3.303 ± 1.121
LysHis: 1.762 ± 0.68
LysIle: 3.193 ± 1.019
LysLys: 1.542 ± 0.557
LysLeu: 5.505 ± 1.363
LysMet: 1.211 ± 0.485
LysAsn: 2.643 ± 0.752
LysPro: 3.413 ± 0.624
LysGln: 2.753 ± 1.322
LysArg: 2.312 ± 0.748
LysSer: 3.523 ± 0.595
LysThr: 3.744 ± 0.623
LysVal: 5.175 ± 0.928
LysTrp: 0.771 ± 0.314
LysTyr: 3.523 ± 0.815
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.175 ± 0.643
LeuCys: 2.643 ± 0.861
LeuAsp: 3.193 ± 0.944
LeuGlu: 3.303 ± 0.7
LeuPhe: 4.294 ± 2.039
LeuGly: 5.065 ± 1.4
LeuHis: 2.202 ± 0.497
LeuIle: 3.193 ± 1.295
LeuLys: 5.285 ± 0.937
LeuLeu: 7.708 ± 1.435
LeuMet: 1.321 ± 0.436
LeuAsn: 5.946 ± 1.576
LeuPro: 3.964 ± 1.51
LeuGln: 3.634 ± 0.741
LeuArg: 2.422 ± 0.548
LeuSer: 6.386 ± 0.6
LeuThr: 4.294 ± 1.275
LeuVal: 7.267 ± 1.41
LeuTrp: 1.321 ± 1.132
LeuTyr: 5.175 ± 0.973
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.771 ± 0.339
MetCys: 1.321 ± 0.678
MetAsp: 0.771 ± 0.408
MetGlu: 0.44 ± 0.355
MetPhe: 1.321 ± 0.426
MetGly: 1.101 ± 0.393
MetHis: 0.44 ± 0.226
MetIle: 1.431 ± 0.464
MetLys: 0.881 ± 0.272
MetLeu: 2.312 ± 0.63
MetMet: 0.551 ± 0.356
MetAsn: 0.551 ± 0.358
MetPro: 0.771 ± 0.339
MetGln: 0.991 ± 0.773
MetArg: 0.661 ± 0.339
MetSer: 2.092 ± 0.97
MetThr: 0.991 ± 0.359
MetVal: 1.762 ± 0.331
MetTrp: 0.22 ± 0.113
MetTyr: 1.542 ± 0.386
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.193 ± 0.87
AsnCys: 2.312 ± 0.706
AsnAsp: 3.303 ± 0.689
AsnGlu: 3.303 ± 0.561
AsnPhe: 3.413 ± 0.628
AsnGly: 6.827 ± 1.179
AsnHis: 0.881 ± 0.313
AsnIle: 3.193 ± 0.837
AsnLys: 3.964 ± 0.598
AsnLeu: 3.964 ± 0.947
AsnMet: 1.101 ± 0.604
AsnAsn: 3.413 ± 0.542
AsnPro: 1.982 ± 1.091
AsnGln: 1.431 ± 1.071
AsnArg: 1.652 ± 0.493
AsnSer: 4.955 ± 1.577
AsnThr: 3.303 ± 0.518
AsnVal: 7.597 ± 2.296
AsnTrp: 0.661 ± 0.952
AsnTyr: 1.762 ± 0.568
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.413 ± 1.2
ProCys: 0.881 ± 0.336
ProAsp: 1.431 ± 0.476
ProGlu: 2.422 ± 0.363
ProPhe: 1.431 ± 0.644
ProGly: 2.422 ± 0.837
ProHis: 0.44 ± 0.337
ProIle: 1.101 ± 0.367
ProLys: 1.652 ± 0.985
ProLeu: 3.964 ± 0.571
ProMet: 0.551 ± 0.389
ProAsn: 1.872 ± 0.654
ProPro: 1.542 ± 0.362
ProGln: 0.881 ± 0.425
ProArg: 1.431 ± 0.4
ProSer: 2.643 ± 0.796
ProThr: 2.312 ± 2.033
ProVal: 3.413 ± 1.318
ProTrp: 0.661 ± 0.36
ProTyr: 1.211 ± 0.364
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.422 ± 0.49
GlnCys: 0.44 ± 0.226
GlnAsp: 1.321 ± 0.527
GlnGlu: 1.652 ± 0.493
GlnPhe: 1.211 ± 0.399
GlnGly: 1.542 ± 0.526
GlnHis: 0.33 ± 0.129
GlnIle: 1.872 ± 0.3
GlnLys: 1.542 ± 0.931
GlnLeu: 4.514 ± 2.078
GlnMet: 0.661 ± 0.308
GlnAsn: 1.982 ± 1.079
GlnPro: 1.652 ± 1.122
GlnGln: 1.321 ± 0.437
GlnArg: 1.762 ± 0.437
GlnSer: 1.652 ± 0.914
GlnThr: 2.202 ± 0.548
GlnVal: 1.872 ± 1.477
GlnTrp: 0.44 ± 0.142
GlnTyr: 1.321 ± 0.709
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.532 ± 0.873
ArgCys: 1.321 ± 0.527
ArgAsp: 1.211 ± 0.579
ArgGlu: 0.33 ± 0.362
ArgPhe: 2.312 ± 0.706
ArgGly: 2.422 ± 0.915
ArgHis: 0.44 ± 0.226
ArgIle: 1.321 ± 0.426
ArgLys: 2.422 ± 0.825
ArgLeu: 3.083 ± 0.623
ArgMet: 0.881 ± 0.358
ArgAsn: 2.422 ± 0.968
ArgPro: 0.44 ± 0.337
ArgGln: 0.881 ± 0.499
ArgArg: 0.44 ± 0.303
ArgSer: 2.092 ± 3.356
ArgThr: 2.422 ± 1.207
ArgVal: 3.083 ± 0.366
ArgTrp: 0.22 ± 0.113
ArgTyr: 1.431 ± 0.243
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.514 ± 0.742
SerCys: 2.312 ± 0.383
SerAsp: 3.413 ± 0.542
SerGlu: 1.872 ± 0.803
SerPhe: 5.175 ± 1.408
SerGly: 4.845 ± 0.898
SerHis: 1.101 ± 0.394
SerIle: 3.523 ± 1.091
SerLys: 4.184 ± 1.284
SerLeu: 4.625 ± 1.283
SerMet: 1.652 ± 0.609
SerAsn: 5.395 ± 0.998
SerPro: 1.321 ± 0.733
SerGln: 2.422 ± 2.829
SerArg: 2.973 ± 1.76
SerSer: 5.395 ± 1.371
SerThr: 4.845 ± 0.645
SerVal: 8.809 ± 1.413
SerTrp: 0.771 ± 0.623
SerTyr: 4.074 ± 0.684
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.092 ± 0.352
ThrCys: 1.542 ± 0.424
ThrAsp: 2.863 ± 0.493
ThrGlu: 3.303 ± 1.665
ThrPhe: 3.083 ± 0.601
ThrGly: 3.964 ± 1.876
ThrHis: 1.101 ± 0.719
ThrIle: 3.964 ± 0.845
ThrLys: 3.744 ± 1.136
ThrLeu: 4.955 ± 1.411
ThrMet: 1.762 ± 0.748
ThrAsn: 3.303 ± 0.298
ThrPro: 1.982 ± 0.835
ThrGln: 1.762 ± 0.493
ThrArg: 1.982 ± 0.885
ThrSer: 4.845 ± 1.13
ThrThr: 3.413 ± 0.212
ThrVal: 6.717 ± 1.464
ThrTrp: 0.551 ± 0.358
ThrTyr: 2.643 ± 0.945
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.717 ± 0.811
ValCys: 3.523 ± 0.94
ValAsp: 5.175 ± 1.036
ValGlu: 6.056 ± 0.793
ValPhe: 5.726 ± 0.953
ValGly: 4.625 ± 0.941
ValHis: 1.431 ± 0.389
ValIle: 5.395 ± 0.957
ValLys: 7.597 ± 2.39
ValLeu: 8.809 ± 2.449
ValMet: 2.202 ± 0.375
ValAsn: 6.496 ± 0.8
ValPro: 4.625 ± 2.238
ValGln: 3.634 ± 0.279
ValArg: 3.083 ± 0.842
ValSer: 7.597 ± 1.023
ValThr: 5.726 ± 0.492
ValVal: 9.579 ± 1.094
ValTrp: 0.991 ± 0.365
ValTyr: 2.753 ± 0.438
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.771 ± 1.078
TrpCys: 0.11 ± 0.056
TrpAsp: 0.771 ± 0.395
TrpGlu: 0.551 ± 0.282
TrpPhe: 0.771 ± 0.334
TrpGly: 0.22 ± 0.113
TrpHis: 0.22 ± 0.381
TrpIle: 0.771 ± 0.281
TrpLys: 0.551 ± 0.747
TrpLeu: 1.321 ± 0.436
TrpMet: 0.22 ± 0.14
TrpAsn: 1.211 ± 0.821
TrpPro: 0.44 ± 0.512
TrpGln: 0.11 ± 0.056
TrpArg: 0.44 ± 0.226
TrpSer: 1.211 ± 0.406
TrpThr: 0.44 ± 0.142
TrpVal: 0.881 ± 0.529
TrpTrp: 0.33 ± 0.129
TrpTyr: 0.661 ± 0.342
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.193 ± 1.019
TyrCys: 1.321 ± 0.426
TyrAsp: 3.193 ± 1.436
TyrGlu: 1.872 ± 0.594
TyrPhe: 2.202 ± 0.456
TyrGly: 3.413 ± 0.668
TyrHis: 0.991 ± 0.312
TyrIle: 1.982 ± 0.633
TyrLys: 2.973 ± 1.117
TyrLeu: 2.643 ± 0.39
TyrMet: 1.321 ± 0.375
TyrAsn: 3.964 ± 0.68
TyrPro: 1.652 ± 0.284
TyrGln: 1.101 ± 0.316
TyrArg: 1.652 ± 0.58
TyrSer: 2.863 ± 0.608
TyrThr: 3.193 ± 0.849
TyrVal: 4.294 ± 0.989
TyrTrp: 0.44 ± 0.761
TyrTyr: 3.303 ± 0.568
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (9083 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
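
A protein-level bootstrap of this kind can be sketched as follows. This is a generic illustration under stated assumptions, not the exact Proteome-pI procedure: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each resample, and that the reported error is the standard deviation across resamples. The function names, the fixed seed and the toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_fraction(proteins, dipeptide):
    """Fraction of overlapping residue pairs that equal `dipeptide`."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation (in per mille) over protein-level resamples.

    Assumption: each resample draws len(proteins) whole proteins
    with replacement; n_boot=100 mirrors the "x100" in the note.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(1000.0 * dipeptide_fraction(sample, dipeptide))
    return statistics.pstdev(estimates)


toy = ["VVAK", "AKLM", "VVVV"]  # hypothetical toy sequences
print(bootstrap_error(toy, "VV"))
```

Because whole proteins are resampled, a proteome with only a few proteins (7 here) can show large errors for dipeptides that are concentrated in a single protein.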

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski