Amino acid dipepetide frequency for Khurdun virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.249AlaAla: 3.249 ± 2.138
0.0AlaCys: 0.0 ± 0.0
3.544AlaAsp: 3.544 ± 1.521
3.249AlaGlu: 3.249 ± 0.258
2.658AlaPhe: 2.658 ± 0.429
3.544AlaGly: 3.544 ± 1.135
2.363AlaHis: 2.363 ± 0.445
4.43AlaIle: 4.43 ± 1.029
4.135AlaLys: 4.135 ± 0.415
5.611AlaLeu: 5.611 ± 0.885
1.477AlaMet: 1.477 ± 0.802
2.363AlaAsn: 2.363 ± 0.445
1.477AlaPro: 1.477 ± 1.05
0.886AlaGln: 0.886 ± 0.481
1.477AlaArg: 1.477 ± 0.632
5.021AlaSer: 5.021 ± 1.236
2.953AlaThr: 2.953 ± 0.638
2.658AlaVal: 2.658 ± 2.279
0.295AlaTrp: 0.295 ± 0.308
2.363AlaTyr: 2.363 ± 1.396
0.0AlaXaa: 0.0 ± 0.0
Cys
0.886CysAla: 0.886 ± 0.481
0.295CysCys: 0.295 ± 0.16
0.591CysAsp: 0.591 ± 0.321
1.181CysGlu: 1.181 ± 0.797
0.886CysPhe: 0.886 ± 0.492
1.477CysGly: 1.477 ± 1.103
0.886CysHis: 0.886 ± 0.492
2.363CysIle: 2.363 ± 0.792
3.839CysLys: 3.839 ± 1.859
1.772CysLeu: 1.772 ± 0.895
1.181CysMet: 1.181 ± 1.231
2.067CysAsn: 2.067 ± 0.879
1.477CysPro: 1.477 ± 0.684
0.295CysGln: 0.295 ± 0.16
0.295CysArg: 0.295 ± 0.308
2.953CysSer: 2.953 ± 1.368
2.067CysThr: 2.067 ± 0.879
1.181CysVal: 1.181 ± 0.797
0.295CysTrp: 0.295 ± 0.16
1.477CysTyr: 1.477 ± 0.426
0.0CysXaa: 0.0 ± 0.0
Asp
2.658AspAla: 2.658 ± 0.709
0.295AspCys: 0.295 ± 0.16
4.43AspAsp: 4.43 ± 1.029
3.839AspGlu: 3.839 ± 2.04
5.021AspPhe: 5.021 ± 0.94
1.181AspGly: 1.181 ± 0.396
1.181AspHis: 1.181 ± 0.396
6.497AspIle: 6.497 ± 2.0
2.953AspLys: 2.953 ± 0.47
5.021AspLeu: 5.021 ± 1.576
1.772AspMet: 1.772 ± 0.866
2.658AspAsn: 2.658 ± 0.563
1.477AspPro: 1.477 ± 0.802
1.181AspGln: 1.181 ± 0.287
2.953AspArg: 2.953 ± 1.379
2.363AspSer: 2.363 ± 1.283
3.839AspThr: 3.839 ± 0.365
3.839AspVal: 3.839 ± 0.365
1.181AspTrp: 1.181 ± 0.797
2.363AspTyr: 2.363 ± 1.41
0.0AspXaa: 0.0 ± 0.0
Glu
2.953GluAla: 2.953 ± 1.285
2.067GluCys: 2.067 ± 1.958
3.249GluAsp: 3.249 ± 0.89
5.611GluGlu: 5.611 ± 0.273
3.544GluPhe: 3.544 ± 2.083
2.953GluGly: 2.953 ± 0.333
1.181GluHis: 1.181 ± 0.396
5.907GluIle: 5.907 ± 1.437
5.611GluLys: 5.611 ± 0.411
5.021GluLeu: 5.021 ± 2.282
2.363GluMet: 2.363 ± 1.283
3.249GluAsn: 3.249 ± 0.741
2.067GluPro: 2.067 ± 0.538
2.363GluGln: 2.363 ± 0.575
2.067GluArg: 2.067 ± 1.123
5.021GluSer: 5.021 ± 1.079
4.725GluThr: 4.725 ± 1.234
2.953GluVal: 2.953 ± 1.78
0.886GluTrp: 0.886 ± 0.481
3.249GluTyr: 3.249 ± 0.258
0.0GluXaa: 0.0 ± 0.0
Phe
0.886PheAla: 0.886 ± 0.188
2.067PheCys: 2.067 ± 0.538
4.135PheAsp: 4.135 ± 1.133
3.544PheGlu: 3.544 ± 2.093
3.249PhePhe: 3.249 ± 0.926
2.658PheGly: 2.658 ± 0.429
0.886PheHis: 0.886 ± 0.481
5.021PheIle: 5.021 ± 1.078
4.725PheLys: 4.725 ± 2.095
5.316PheLeu: 5.316 ± 1.502
1.181PheMet: 1.181 ± 0.396
2.953PheAsn: 2.953 ± 0.636
1.181PhePro: 1.181 ± 0.396
1.181PheGln: 1.181 ± 0.779
2.363PheArg: 2.363 ± 0.887
4.135PheSer: 4.135 ± 0.861
2.658PheThr: 2.658 ± 0.709
2.658PheVal: 2.658 ± 0.429
0.591PheTrp: 0.591 ± 0.819
0.591PheTyr: 0.591 ± 0.198
0.0PheXaa: 0.0 ± 0.0
Gly
2.658GlyAla: 2.658 ± 2.271
2.363GlyCys: 2.363 ± 2.024
2.658GlyAsp: 2.658 ± 0.429
3.544GlyGlu: 3.544 ± 1.188
1.477GlyPhe: 1.477 ± 0.351
2.067GlyGly: 2.067 ± 1.642
0.591GlyHis: 0.591 ± 0.615
2.658GlyIle: 2.658 ± 2.279
3.249GlyLys: 3.249 ± 1.225
3.249GlyLeu: 3.249 ± 0.477
1.477GlyMet: 1.477 ± 0.351
1.772GlyAsn: 1.772 ± 0.594
1.477GlyPro: 1.477 ± 0.351
2.067GlyGln: 2.067 ± 0.616
2.067GlyArg: 2.067 ± 0.73
3.249GlySer: 3.249 ± 1.326
1.772GlyThr: 1.772 ± 1.275
1.772GlyVal: 1.772 ± 0.375
0.591GlyTrp: 0.591 ± 0.198
1.772GlyTyr: 1.772 ± 0.594
0.0GlyXaa: 0.0 ± 0.0
His
0.886HisAla: 0.886 ± 0.188
0.295HisCys: 0.295 ± 0.16
1.477HisAsp: 1.477 ± 0.426
2.658HisGlu: 2.658 ± 1.476
1.181HisPhe: 1.181 ± 0.287
0.591HisGly: 0.591 ± 0.198
1.181HisHis: 1.181 ± 0.396
1.772HisIle: 1.772 ± 0.375
2.363HisLys: 2.363 ± 0.955
2.363HisLeu: 2.363 ± 0.445
0.295HisMet: 0.295 ± 0.308
2.658HisAsn: 2.658 ± 0.709
0.295HisPro: 0.295 ± 0.883
0.295HisGln: 0.295 ± 0.16
0.886HisArg: 0.886 ± 0.188
2.953HisSer: 2.953 ± 0.638
0.886HisThr: 0.886 ± 0.492
1.477HisVal: 1.477 ± 0.684
0.886HisTrp: 0.886 ± 1.02
0.295HisTyr: 0.295 ± 0.16
0.0HisXaa: 0.0 ± 0.0
Ile
6.202IleAla: 6.202 ± 0.461
2.067IleCys: 2.067 ± 0.538
5.316IleAsp: 5.316 ± 0.324
4.135IleGlu: 4.135 ± 1.533
2.067IlePhe: 2.067 ± 0.458
2.658IleGly: 2.658 ± 0.454
1.477IleHis: 1.477 ± 0.632
6.497IleIle: 6.497 ± 1.417
6.793IleLys: 6.793 ± 1.6
5.316IleLeu: 5.316 ± 1.126
2.363IleMet: 2.363 ± 1.111
6.793IleAsn: 6.793 ± 0.496
4.725IlePro: 4.725 ± 0.566
2.363IleGln: 2.363 ± 0.792
2.067IleArg: 2.067 ± 1.123
7.974IleSer: 7.974 ± 0.208
7.974IleThr: 7.974 ± 1.733
4.135IleVal: 4.135 ± 1.027
0.295IleTrp: 0.295 ± 0.308
3.249IleTyr: 3.249 ± 0.741
0.0IleXaa: 0.0 ± 0.0
Lys
4.725LysAla: 4.725 ± 3.712
3.544LysCys: 3.544 ± 1.968
4.135LysAsp: 4.135 ± 1.533
6.497LysGlu: 6.497 ± 1.725
3.839LysPhe: 3.839 ± 0.169
4.135LysGly: 4.135 ± 1.951
1.477LysHis: 1.477 ± 0.684
6.793LysIle: 6.793 ± 1.461
8.565LysLys: 8.565 ± 3.893
6.497LysLeu: 6.497 ± 2.347
1.772LysMet: 1.772 ± 0.904
2.658LysAsn: 2.658 ± 1.045
1.772LysPro: 1.772 ± 0.576
2.067LysGln: 2.067 ± 0.73
2.658LysArg: 2.658 ± 0.454
5.316LysSer: 5.316 ± 1.138
5.611LysThr: 5.611 ± 2.258
5.611LysVal: 5.611 ± 1.424
0.591LysTrp: 0.591 ± 0.321
5.611LysTyr: 5.611 ± 1.487
0.0LysXaa: 0.0 ± 0.0
Leu
5.611LeuAla: 5.611 ± 1.603
3.544LeuCys: 3.544 ± 0.887
3.839LeuAsp: 3.839 ± 1.693
6.202LeuGlu: 6.202 ± 2.565
5.021LeuPhe: 5.021 ± 0.125
1.477LeuGly: 1.477 ± 0.741
2.067LeuHis: 2.067 ± 0.514
7.088LeuIle: 7.088 ± 0.545
5.907LeuLys: 5.907 ± 1.82
9.155LeuLeu: 9.155 ± 1.421
2.363LeuMet: 2.363 ± 0.887
4.135LeuAsn: 4.135 ± 0.94
2.953LeuPro: 2.953 ± 0.638
3.544LeuGln: 3.544 ± 1.206
3.249LeuArg: 3.249 ± 0.258
5.611LeuSer: 5.611 ± 1.713
7.383LeuThr: 7.383 ± 0.986
2.067LeuVal: 2.067 ± 0.538
0.295LeuTrp: 0.295 ± 0.308
1.772LeuTyr: 1.772 ± 0.375
0.0LeuXaa: 0.0 ± 0.0
Met
1.477MetAla: 1.477 ± 1.595
0.295MetCys: 0.295 ± 0.308
1.181MetAsp: 1.181 ± 0.641
2.067MetGlu: 2.067 ± 0.458
1.772MetPhe: 1.772 ± 0.576
0.295MetGly: 0.295 ± 0.16
0.591MetHis: 0.591 ± 0.198
2.363MetIle: 2.363 ± 0.527
2.067MetLys: 2.067 ± 0.458
3.249MetLeu: 3.249 ± 0.741
0.295MetMet: 0.295 ± 0.16
1.772MetAsn: 1.772 ± 0.617
0.591MetPro: 0.591 ± 0.615
0.591MetGln: 0.591 ± 0.321
1.477MetArg: 1.477 ± 0.802
2.363MetSer: 2.363 ± 1.047
1.772MetThr: 1.772 ± 0.603
1.477MetVal: 1.477 ± 0.351
0.0MetTrp: 0.0 ± 0.0
0.591MetTyr: 0.591 ± 0.198
0.0MetXaa: 0.0 ± 0.0
Asn
1.477AsnAla: 1.477 ± 0.351
1.772AsnCys: 1.772 ± 0.375
3.249AsnAsp: 3.249 ± 1.225
3.839AsnGlu: 3.839 ± 1.246
2.067AsnPhe: 2.067 ± 1.529
2.067AsnGly: 2.067 ± 0.458
2.067AsnHis: 2.067 ± 0.458
3.544AsnIle: 3.544 ± 0.75
2.953AsnLys: 2.953 ± 0.852
6.202AsnLeu: 6.202 ± 1.683
2.067AsnMet: 2.067 ± 0.879
1.772AsnAsn: 1.772 ± 0.617
2.363AsnPro: 2.363 ± 1.283
2.363AsnGln: 2.363 ± 0.575
3.249AsnArg: 3.249 ± 0.556
3.839AsnSer: 3.839 ± 0.365
2.953AsnThr: 2.953 ± 0.99
2.658AsnVal: 2.658 ± 0.709
0.886AsnTrp: 0.886 ± 0.188
2.067AsnTyr: 2.067 ± 1.123
0.0AsnXaa: 0.0 ± 0.0
Pro
2.067ProAla: 2.067 ± 1.555
1.181ProCys: 1.181 ± 0.287
1.772ProAsp: 1.772 ± 0.576
1.772ProGlu: 1.772 ± 0.594
3.544ProPhe: 3.544 ± 0.862
1.772ProGly: 1.772 ± 0.603
0.295ProHis: 0.295 ± 0.16
1.181ProIle: 1.181 ± 0.797
1.772ProLys: 1.772 ± 0.375
2.067ProLeu: 2.067 ± 0.538
1.477ProMet: 1.477 ± 0.351
1.772ProAsn: 1.772 ± 0.576
0.591ProPro: 0.591 ± 0.198
1.181ProGln: 1.181 ± 0.287
1.477ProArg: 1.477 ± 0.684
2.658ProSer: 2.658 ± 1.045
1.477ProThr: 1.477 ± 1.103
2.363ProVal: 2.363 ± 1.8
0.0ProTrp: 0.0 ± 0.0
1.181ProTyr: 1.181 ± 0.698
0.0ProXaa: 0.0 ± 0.0
Gln
2.363GlnAla: 2.363 ± 0.445
0.591GlnCys: 0.591 ± 0.321
1.772GlnAsp: 1.772 ± 0.866
1.181GlnGlu: 1.181 ± 0.698
0.886GlnPhe: 0.886 ± 0.783
1.477GlnGly: 1.477 ± 0.684
1.477GlnHis: 1.477 ± 0.351
2.363GlnIle: 2.363 ± 0.445
3.544GlnLys: 3.544 ± 1.731
0.886GlnLeu: 0.886 ± 0.188
0.295GlnMet: 0.295 ± 0.16
1.477GlnAsn: 1.477 ± 0.426
0.295GlnPro: 0.295 ± 0.16
1.477GlnGln: 1.477 ± 0.632
2.658GlnArg: 2.658 ± 0.563
2.658GlnSer: 2.658 ± 1.161
2.067GlnThr: 2.067 ± 0.73
1.477GlnVal: 1.477 ± 0.351
0.0GlnTrp: 0.0 ± 0.0
0.886GlnTyr: 0.886 ± 0.188
0.0GlnXaa: 0.0 ± 0.0
Arg
1.477ArgAla: 1.477 ± 0.351
0.886ArgCys: 0.886 ± 0.492
2.067ArgAsp: 2.067 ± 1.123
3.544ArgGlu: 3.544 ± 1.152
3.839ArgPhe: 3.839 ± 1.208
1.477ArgGly: 1.477 ± 0.684
0.591ArgHis: 0.591 ± 0.321
3.249ArgIle: 3.249 ± 0.926
5.316ArgLys: 5.316 ± 2.089
3.249ArgLeu: 3.249 ± 0.556
1.181ArgMet: 1.181 ± 0.641
1.477ArgAsn: 1.477 ± 0.808
1.181ArgPro: 1.181 ± 0.287
1.477ArgGln: 1.477 ± 1.581
2.363ArgArg: 2.363 ± 0.668
1.477ArgSer: 1.477 ± 0.741
3.839ArgThr: 3.839 ± 0.994
1.181ArgVal: 1.181 ± 0.287
0.295ArgTrp: 0.295 ± 0.16
0.886ArgTyr: 0.886 ± 0.492
0.0ArgXaa: 0.0 ± 0.0
Ser
5.316SerAla: 5.316 ± 0.74
0.886SerCys: 0.886 ± 0.481
4.43SerAsp: 4.43 ± 1.16
3.249SerGlu: 3.249 ± 0.556
4.135SerPhe: 4.135 ± 0.861
3.839SerGly: 3.839 ± 0.572
2.363SerHis: 2.363 ± 0.792
6.202SerIle: 6.202 ± 0.726
6.497SerLys: 6.497 ± 0.517
4.43SerLeu: 4.43 ± 1.085
0.886SerMet: 0.886 ± 0.481
3.544SerAsn: 3.544 ± 0.272
2.067SerPro: 2.067 ± 0.616
1.772SerGln: 1.772 ± 0.375
4.725SerArg: 4.725 ± 0.566
4.135SerSer: 4.135 ± 0.893
7.383SerThr: 7.383 ± 0.634
3.544SerVal: 3.544 ± 0.67
0.295SerTrp: 0.295 ± 0.16
2.363SerTyr: 2.363 ± 1.175
0.0SerXaa: 0.0 ± 0.0
Thr
5.021ThrAla: 5.021 ± 1.516
2.953ThrCys: 2.953 ± 3.077
3.839ThrAsp: 3.839 ± 0.801
3.839ThrGlu: 3.839 ± 0.822
1.772ThrPhe: 1.772 ± 0.603
2.658ThrGly: 2.658 ± 1.476
1.772ThrHis: 1.772 ± 0.594
6.793ThrIle: 6.793 ± 1.451
4.135ThrLys: 4.135 ± 0.415
5.316ThrLeu: 5.316 ± 1.539
1.477ThrMet: 1.477 ± 0.398
3.249ThrAsn: 3.249 ± 0.709
2.067ThrPro: 2.067 ± 0.879
1.772ThrGln: 1.772 ± 0.576
2.363ThrArg: 2.363 ± 1.283
5.316ThrSer: 5.316 ± 0.324
6.202ThrThr: 6.202 ± 1.229
4.135ThrVal: 4.135 ± 1.485
1.181ThrTrp: 1.181 ± 0.877
3.544ThrTyr: 3.544 ± 1.924
0.0ThrXaa: 0.0 ± 0.0
Val
1.772ValAla: 1.772 ± 1.41
0.886ValCys: 0.886 ± 0.492
2.658ValAsp: 2.658 ± 0.429
3.544ValGlu: 3.544 ± 0.862
2.363ValPhe: 2.363 ± 1.047
3.544ValGly: 3.544 ± 0.32
2.067ValHis: 2.067 ± 0.538
4.725ValIle: 4.725 ± 0.34
5.611ValLys: 5.611 ± 1.644
3.544ValLeu: 3.544 ± 1.968
1.181ValMet: 1.181 ± 0.779
2.953ValAsn: 2.953 ± 0.333
1.477ValPro: 1.477 ± 0.351
2.067ValGln: 2.067 ± 1.451
1.181ValArg: 1.181 ± 0.797
2.658ValSer: 2.658 ± 1.641
2.067ValThr: 2.067 ± 1.288
2.953ValVal: 2.953 ± 1.368
0.591ValTrp: 0.591 ± 0.321
2.658ValTyr: 2.658 ± 0.731
0.0ValXaa: 0.0 ± 0.0
Trp
0.886TrpAla: 0.886 ± 0.481
0.295TrpCys: 0.295 ± 0.16
0.591TrpAsp: 0.591 ± 0.198
1.181TrpGlu: 1.181 ± 0.396
0.886TrpPhe: 0.886 ± 0.492
1.181TrpGly: 1.181 ± 0.797
0.886TrpHis: 0.886 ± 0.791
0.591TrpIle: 0.591 ± 0.198
0.591TrpLys: 0.591 ± 0.903
0.886TrpLeu: 0.886 ± 0.481
0.295TrpMet: 0.295 ± 0.308
0.886TrpAsn: 0.886 ± 0.783
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.295TrpSer: 0.295 ± 0.16
0.591TrpThr: 0.591 ± 0.198
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.181TyrAla: 1.181 ± 0.287
1.181TyrCys: 1.181 ± 0.396
1.772TyrAsp: 1.772 ± 0.576
2.363TyrGlu: 2.363 ± 0.575
1.772TyrPhe: 1.772 ± 0.594
1.477TyrGly: 1.477 ± 1.05
0.295TyrHis: 0.295 ± 0.16
4.43TyrIle: 4.43 ± 0.711
3.249TyrLys: 3.249 ± 0.741
4.135TyrLeu: 4.135 ± 1.027
0.591TyrMet: 0.591 ± 0.819
3.249TyrAsn: 3.249 ± 0.741
2.067TyrPro: 2.067 ± 0.743
0.886TyrGln: 0.886 ± 0.481
1.477TyrArg: 1.477 ± 0.426
2.067TyrSer: 2.067 ± 0.458
1.477TyrThr: 1.477 ± 0.351
2.363TyrVal: 2.363 ± 0.445
0.886TyrTrp: 0.886 ± 0.188
2.067TyrTyr: 2.067 ± 0.743
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3387 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski