Amino acid dipepetide frequency for Nova virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.964AlaAla: 2.964 ± 0.996
1.078AlaCys: 1.078 ± 0.66
2.695AlaAsp: 2.695 ± 0.844
3.234AlaGlu: 3.234 ± 2.028
2.156AlaPhe: 2.156 ± 0.247
3.773AlaGly: 3.773 ± 0.438
1.886AlaHis: 1.886 ± 0.726
4.581AlaIle: 4.581 ± 1.142
4.042AlaLys: 4.042 ± 1.358
5.928AlaLeu: 5.928 ± 2.808
1.347AlaMet: 1.347 ± 0.461
1.347AlaAsn: 1.347 ± 0.263
1.886AlaPro: 1.886 ± 0.901
2.425AlaGln: 2.425 ± 0.72
2.156AlaArg: 2.156 ± 0.689
3.773AlaSer: 3.773 ± 0.438
3.503AlaThr: 3.503 ± 0.759
5.12AlaVal: 5.12 ± 2.247
0.539AlaTrp: 0.539 ± 0.437
1.617AlaTyr: 1.617 ± 0.311
0.0AlaXaa: 0.0 ± 0.0
Cys
1.617CysAla: 1.617 ± 0.613
0.269CysCys: 0.269 ± 0.259
0.808CysAsp: 0.808 ± 0.406
1.078CysGlu: 1.078 ± 0.66
1.617CysPhe: 1.617 ± 0.811
1.078CysGly: 1.078 ± 0.523
0.269CysHis: 0.269 ± 0.259
1.347CysIle: 1.347 ± 0.461
1.078CysLys: 1.078 ± 0.66
1.617CysLeu: 1.617 ± 1.175
0.269CysMet: 0.269 ± 0.259
1.617CysAsn: 1.617 ± 0.946
2.964CysPro: 2.964 ± 2.436
1.347CysGln: 1.347 ± 1.101
0.808CysArg: 0.808 ± 0.406
2.156CysSer: 2.156 ± 0.968
1.886CysThr: 1.886 ± 0.792
2.425CysVal: 2.425 ± 0.643
0.539CysTrp: 0.539 ± 0.167
1.347CysTyr: 1.347 ± 0.917
0.0CysXaa: 0.0 ± 0.0
Asp
4.042AspAla: 4.042 ± 1.373
1.886AspCys: 1.886 ± 0.726
2.695AspAsp: 2.695 ± 1.698
3.503AspGlu: 3.503 ± 1.533
1.886AspPhe: 1.886 ± 0.504
2.425AspGly: 2.425 ± 0.643
2.695AspHis: 2.695 ± 1.144
3.773AspIle: 3.773 ± 0.8
4.581AspLys: 4.581 ± 1.138
7.276AspLeu: 7.276 ± 1.292
1.347AspMet: 1.347 ± 0.263
2.425AspAsn: 2.425 ± 1.091
1.347AspPro: 1.347 ± 0.263
2.695AspGln: 2.695 ± 0.098
2.695AspArg: 2.695 ± 0.526
3.773AspSer: 3.773 ± 0.806
1.886AspThr: 1.886 ± 0.483
3.773AspVal: 3.773 ± 0.716
0.808AspTrp: 0.808 ± 0.477
1.886AspTyr: 1.886 ± 0.504
0.0AspXaa: 0.0 ± 0.0
Glu
4.312GluAla: 4.312 ± 1.607
0.808GluCys: 0.808 ± 0.406
3.503GluAsp: 3.503 ± 0.558
3.773GluGlu: 3.773 ± 0.836
2.964GluPhe: 2.964 ± 0.819
2.425GluGly: 2.425 ± 0.299
1.617GluHis: 1.617 ± 0.733
3.234GluIle: 3.234 ± 1.142
6.467GluLys: 6.467 ± 1.25
6.467GluLeu: 6.467 ± 0.707
1.078GluMet: 1.078 ± 0.874
1.886GluAsn: 1.886 ± 0.901
1.886GluPro: 1.886 ± 0.693
2.425GluGln: 2.425 ± 0.545
3.234GluArg: 3.234 ± 0.91
1.617GluSer: 1.617 ± 0.613
3.234GluThr: 3.234 ± 0.952
4.312GluVal: 4.312 ± 1.846
1.617GluTrp: 1.617 ± 0.395
1.347GluTyr: 1.347 ± 0.794
0.0GluXaa: 0.0 ± 0.0
Phe
2.156PheAla: 2.156 ± 0.159
1.347PheCys: 1.347 ± 0.917
2.156PheAsp: 2.156 ± 0.634
4.312PheGlu: 4.312 ± 0.489
2.156PhePhe: 2.156 ± 0.923
2.156PheGly: 2.156 ± 0.159
1.617PheHis: 1.617 ± 0.5
2.964PheIle: 2.964 ± 0.253
3.773PheLys: 3.773 ± 1.236
3.503PheLeu: 3.503 ± 0.558
2.964PheMet: 2.964 ± 0.607
4.042PheAsn: 4.042 ± 0.555
1.078PhePro: 1.078 ± 0.317
2.425PheGln: 2.425 ± 0.299
2.695PheArg: 2.695 ± 0.526
3.234PheSer: 3.234 ± 1.289
2.425PheThr: 2.425 ± 0.776
1.347PheVal: 1.347 ± 0.369
0.269PheTrp: 0.269 ± 0.159
1.347PheTyr: 1.347 ± 0.263
0.0PheXaa: 0.0 ± 0.0
Gly
3.773GlyAla: 3.773 ± 1.399
1.078GlyCys: 1.078 ± 0.317
2.156GlyAsp: 2.156 ± 0.641
2.695GlyGlu: 2.695 ± 0.834
3.234GlyPhe: 3.234 ± 0.441
2.425GlyGly: 2.425 ± 0.89
2.156GlyHis: 2.156 ± 0.968
4.042GlyIle: 4.042 ± 0.582
2.156GlyLys: 2.156 ± 0.159
7.815GlyLeu: 7.815 ± 0.408
1.886GlyMet: 1.886 ± 0.418
2.964GlyAsn: 2.964 ± 0.253
1.078GlyPro: 1.078 ± 0.66
1.886GlyGln: 1.886 ± 0.726
1.617GlyArg: 1.617 ± 1.048
4.042GlySer: 4.042 ± 0.908
3.773GlyThr: 3.773 ± 0.414
3.773GlyVal: 3.773 ± 1.802
1.347GlyTrp: 1.347 ± 0.917
2.156GlyTyr: 2.156 ± 0.516
0.0GlyXaa: 0.0 ± 0.0
His
0.539HisAla: 0.539 ± 0.318
1.078HisCys: 1.078 ± 0.66
2.425HisAsp: 2.425 ± 0.776
0.269HisGlu: 0.269 ± 0.159
0.808HisPhe: 0.808 ± 0.407
1.078HisGly: 1.078 ± 0.66
0.269HisHis: 0.269 ± 0.159
0.539HisIle: 0.539 ± 0.167
2.156HisLys: 2.156 ± 0.159
3.773HisLeu: 3.773 ± 0.8
0.808HisMet: 0.808 ± 0.482
0.539HisAsn: 0.539 ± 0.167
1.617HisPro: 1.617 ± 0.395
0.539HisGln: 0.539 ± 0.167
1.617HisArg: 1.617 ± 0.221
2.156HisSer: 2.156 ± 0.667
1.347HisThr: 1.347 ± 0.263
1.617HisVal: 1.617 ± 0.5
0.539HisTrp: 0.539 ± 0.167
1.078HisTyr: 1.078 ± 0.317
0.0HisXaa: 0.0 ± 0.0
Ile
2.964IleAla: 2.964 ± 0.553
2.425IleCys: 2.425 ± 1.231
3.503IleAsp: 3.503 ± 1.418
4.85IleGlu: 4.85 ± 1.44
2.964IlePhe: 2.964 ± 0.806
2.695IleGly: 2.695 ± 1.54
1.617IleHis: 1.617 ± 0.733
3.234IleIle: 3.234 ± 1.265
4.312IleLys: 4.312 ± 0.716
5.389IleLeu: 5.389 ± 0.648
2.695IleMet: 2.695 ± 0.359
2.695IleAsn: 2.695 ± 1.144
4.042IlePro: 4.042 ± 0.874
2.964IleGln: 2.964 ± 1.395
2.695IleArg: 2.695 ± 1.111
5.928IleSer: 5.928 ± 0.883
2.964IleThr: 2.964 ± 0.709
4.85IleVal: 4.85 ± 0.731
0.269IleTrp: 0.269 ± 0.159
2.695IleTyr: 2.695 ± 0.697
0.0IleXaa: 0.0 ± 0.0
Lys
4.042LysAla: 4.042 ± 0.65
0.808LysCys: 0.808 ± 0.477
5.389LysAsp: 5.389 ± 1.449
4.85LysGlu: 4.85 ± 1.44
3.773LysPhe: 3.773 ± 1.008
3.234LysGly: 3.234 ± 0.953
2.156LysHis: 2.156 ± 0.551
5.12LysIle: 5.12 ± 1.143
4.581LysLys: 4.581 ± 1.062
3.773LysLeu: 3.773 ± 0.465
1.347LysMet: 1.347 ± 0.369
3.234LysAsn: 3.234 ± 1.553
2.425LysPro: 2.425 ± 0.545
2.425LysGln: 2.425 ± 0.776
2.425LysArg: 2.425 ± 1.091
4.312LysSer: 4.312 ± 0.126
3.234LysThr: 3.234 ± 0.952
7.276LysVal: 7.276 ± 1.879
0.269LysTrp: 0.269 ± 0.159
2.695LysTyr: 2.695 ± 0.697
0.0LysXaa: 0.0 ± 0.0
Leu
5.389LeuAla: 5.389 ± 0.519
2.156LeuCys: 2.156 ± 0.968
6.198LeuAsp: 6.198 ± 0.212
5.389LeuGlu: 5.389 ± 1.513
5.389LeuPhe: 5.389 ± 1.688
5.389LeuGly: 5.389 ± 0.648
2.156LeuHis: 2.156 ± 0.634
8.084LeuIle: 8.084 ± 0.56
5.12LeuLys: 5.12 ± 0.471
9.162LeuLeu: 9.162 ± 0.609
3.503LeuMet: 3.503 ± 1.265
5.389LeuAsn: 5.389 ± 0.519
2.425LeuPro: 2.425 ± 0.624
5.659LeuGln: 5.659 ± 0.325
3.503LeuArg: 3.503 ± 1.095
4.312LeuSer: 4.312 ± 1.087
5.659LeuThr: 5.659 ± 1.324
4.85LeuVal: 4.85 ± 1.287
1.078LeuTrp: 1.078 ± 0.317
4.581LeuTyr: 4.581 ± 1.203
0.0LeuXaa: 0.0 ± 0.0
Met
2.695MetAla: 2.695 ± 1.103
0.808MetCys: 0.808 ± 0.68
1.347MetAsp: 1.347 ± 0.803
1.886MetGlu: 1.886 ± 0.767
1.886MetPhe: 1.886 ± 0.726
1.078MetGly: 1.078 ± 0.874
0.539MetHis: 0.539 ± 0.538
2.425MetIle: 2.425 ± 0.545
2.425MetLys: 2.425 ± 0.776
1.347MetLeu: 1.347 ± 0.562
1.617MetMet: 1.617 ± 0.395
1.078MetAsn: 1.078 ± 0.333
0.808MetPro: 0.808 ± 0.197
1.617MetGln: 1.617 ± 0.733
1.886MetArg: 1.886 ± 1.793
1.078MetSer: 1.078 ± 0.317
1.886MetThr: 1.886 ± 0.901
1.617MetVal: 1.617 ± 0.395
0.269MetTrp: 0.269 ± 0.159
0.808MetTyr: 0.808 ± 0.477
0.0MetXaa: 0.0 ± 0.0
Asn
0.808AsnAla: 0.808 ± 0.427
1.078AsnCys: 1.078 ± 0.333
2.425AsnAsp: 2.425 ± 1.129
2.964AsnGlu: 2.964 ± 0.607
2.425AsnPhe: 2.425 ± 0.643
2.425AsnGly: 2.425 ± 0.643
0.808AsnHis: 0.808 ± 0.406
3.503AsnIle: 3.503 ± 0.843
3.234AsnLys: 3.234 ± 1.225
3.234AsnLeu: 3.234 ± 0.441
0.808AsnMet: 0.808 ± 0.197
1.347AsnAsn: 1.347 ± 0.562
3.773AsnPro: 3.773 ± 0.465
1.078AsnGln: 1.078 ± 0.874
2.156AsnArg: 2.156 ± 0.159
2.156AsnSer: 2.156 ± 0.516
1.078AsnThr: 1.078 ± 0.333
3.773AsnVal: 3.773 ± 0.966
1.347AsnTrp: 1.347 ± 0.849
0.808AsnTyr: 0.808 ± 0.406
0.0AsnXaa: 0.0 ± 0.0
Pro
2.156ProAla: 2.156 ± 0.159
0.808ProCys: 0.808 ± 0.406
2.964ProAsp: 2.964 ± 1.115
1.886ProGlu: 1.886 ± 0.807
1.078ProPhe: 1.078 ± 0.303
3.503ProGly: 3.503 ± 0.964
0.808ProHis: 0.808 ± 0.406
2.425ProIle: 2.425 ± 1.08
1.886ProLys: 1.886 ± 0.108
2.425ProLeu: 2.425 ± 0.89
1.078ProMet: 1.078 ± 0.303
1.078ProAsn: 1.078 ± 0.333
1.078ProPro: 1.078 ± 0.874
1.078ProGln: 1.078 ± 0.333
0.808ProArg: 0.808 ± 0.197
4.312ProSer: 4.312 ± 0.494
3.234ProThr: 3.234 ± 1.467
1.617ProVal: 1.617 ± 0.561
0.808ProTrp: 0.808 ± 0.68
1.078ProTyr: 1.078 ± 0.333
0.0ProXaa: 0.0 ± 0.0
Gln
3.773GlnAla: 3.773 ± 0.836
1.886GlnCys: 1.886 ± 1.433
1.078GlnAsp: 1.078 ± 0.303
1.617GlnGlu: 1.617 ± 0.311
0.808GlnPhe: 0.808 ± 0.197
2.695GlnGly: 2.695 ± 0.75
1.347GlnHis: 1.347 ± 0.461
1.347GlnIle: 1.347 ± 0.564
1.886GlnLys: 1.886 ± 0.418
4.042GlnLeu: 4.042 ± 0.987
1.078GlnMet: 1.078 ± 0.874
1.617GlnAsn: 1.617 ± 0.311
0.808GlnPro: 0.808 ± 0.68
1.617GlnGln: 1.617 ± 0.5
2.156GlnArg: 2.156 ± 1.196
4.042GlnSer: 4.042 ± 0.789
2.425GlnThr: 2.425 ± 0.1
3.503GlnVal: 3.503 ± 0.292
0.808GlnTrp: 0.808 ± 0.197
2.425GlnTyr: 2.425 ± 0.695
0.0GlnXaa: 0.0 ± 0.0
Arg
1.347ArgAla: 1.347 ± 0.794
1.078ArgCys: 1.078 ± 0.896
2.964ArgAsp: 2.964 ± 0.607
1.886ArgGlu: 1.886 ± 0.418
2.425ArgPhe: 2.425 ± 1.106
2.964ArgGly: 2.964 ± 0.41
1.347ArgHis: 1.347 ± 0.562
3.234ArgIle: 3.234 ± 0.755
2.964ArgLys: 2.964 ± 0.587
4.581ArgLeu: 4.581 ± 1.661
0.539ArgMet: 0.539 ± 0.167
2.156ArgAsn: 2.156 ± 0.946
0.808ArgPro: 0.808 ± 0.477
2.695ArgGln: 2.695 ± 1.103
2.695ArgArg: 2.695 ± 0.452
2.695ArgSer: 2.695 ± 1.111
2.964ArgThr: 2.964 ± 0.553
2.695ArgVal: 2.695 ± 0.844
0.808ArgTrp: 0.808 ± 0.477
3.773ArgTyr: 3.773 ± 0.716
0.0ArgXaa: 0.0 ± 0.0
Ser
3.773SerAla: 3.773 ± 1.564
2.425SerCys: 2.425 ± 1.95
5.12SerAsp: 5.12 ± 1.22
3.503SerGlu: 3.503 ± 0.387
3.503SerPhe: 3.503 ± 0.558
5.659SerGly: 5.659 ± 1.08
1.078SerHis: 1.078 ± 0.636
5.928SerIle: 5.928 ± 1.316
5.389SerLys: 5.389 ± 1.016
7.545SerLeu: 7.545 ± 0.69
1.617SerMet: 1.617 ± 0.561
2.425SerAsn: 2.425 ± 0.889
2.695SerPro: 2.695 ± 0.452
1.617SerGln: 1.617 ± 0.395
2.695SerArg: 2.695 ± 0.923
4.581SerSer: 4.581 ± 0.738
3.234SerThr: 3.234 ± 0.503
4.581SerVal: 4.581 ± 0.551
1.078SerTrp: 1.078 ± 0.66
3.234SerTyr: 3.234 ± 0.366
0.0SerXaa: 0.0 ± 0.0
Thr
4.042ThrAla: 4.042 ± 0.713
1.078ThrCys: 1.078 ± 0.66
2.156ThrAsp: 2.156 ± 0.641
4.581ThrGlu: 4.581 ± 0.738
2.964ThrPhe: 2.964 ± 0.819
3.503ThrGly: 3.503 ± 1.136
0.808ThrHis: 0.808 ± 0.197
2.695ThrIle: 2.695 ± 1.296
2.695ThrLys: 2.695 ± 0.452
5.659ThrLeu: 5.659 ± 2.15
1.886ThrMet: 1.886 ± 0.767
1.078ThrAsn: 1.078 ± 0.523
1.617ThrPro: 1.617 ± 0.677
1.617ThrGln: 1.617 ± 0.311
3.234ThrArg: 3.234 ± 1.15
4.312ThrSer: 4.312 ± 0.704
4.042ThrThr: 4.042 ± 1.655
4.85ThrVal: 4.85 ± 0.601
0.539ThrTrp: 0.539 ± 0.518
2.156ThrTyr: 2.156 ± 0.247
0.0ThrXaa: 0.0 ± 0.0
Val
3.503ValAla: 3.503 ± 0.72
2.695ValCys: 2.695 ± 1.144
5.12ValAsp: 5.12 ± 1.143
4.042ValGlu: 4.042 ± 0.221
3.234ValPhe: 3.234 ± 0.79
2.695ValGly: 2.695 ± 0.658
0.539ValHis: 0.539 ± 0.167
4.581ValIle: 4.581 ± 0.342
5.12ValLys: 5.12 ± 0.749
6.198ValLeu: 6.198 ± 1.588
1.347ValMet: 1.347 ± 0.803
2.425ValAsn: 2.425 ± 0.776
2.425ValPro: 2.425 ± 0.89
2.156ValGln: 2.156 ± 0.516
5.659ValArg: 5.659 ± 1.254
7.545ValSer: 7.545 ± 1.251
4.042ValThr: 4.042 ± 0.582
3.773ValVal: 3.773 ± 0.716
1.347ValTrp: 1.347 ± 0.564
1.617ValTyr: 1.617 ± 0.811
0.0ValXaa: 0.0 ± 0.0
Trp
1.617TrpAla: 1.617 ± 0.5
0.269TrpCys: 0.269 ± 0.259
0.539TrpAsp: 0.539 ± 0.318
0.0TrpGlu: 0.0 ± 0.0
1.078TrpPhe: 1.078 ± 0.636
1.078TrpGly: 1.078 ± 0.303
0.539TrpHis: 0.539 ± 0.318
1.347TrpIle: 1.347 ± 0.917
1.078TrpLys: 1.078 ± 0.333
1.617TrpLeu: 1.617 ± 0.221
0.539TrpMet: 0.539 ± 0.518
0.269TrpAsn: 0.269 ± 0.259
0.269TrpPro: 0.269 ± 0.159
0.269TrpGln: 0.269 ± 0.259
1.078TrpArg: 1.078 ± 0.317
1.617TrpSer: 1.617 ± 0.677
0.539TrpThr: 0.539 ± 0.167
1.078TrpVal: 1.078 ± 0.473
0.269TrpTrp: 0.269 ± 0.259
0.269TrpTyr: 0.269 ± 0.259
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.539TyrAla: 0.539 ± 0.318
1.078TyrCys: 1.078 ± 0.333
2.156TyrAsp: 2.156 ± 0.923
1.886TyrGlu: 1.886 ± 0.504
1.617TyrPhe: 1.617 ± 0.311
3.773TyrGly: 3.773 ± 0.38
0.808TyrHis: 0.808 ± 0.197
1.347TyrIle: 1.347 ± 0.369
2.425TyrLys: 2.425 ± 0.592
4.042TyrLeu: 4.042 ± 1.239
1.078TyrMet: 1.078 ± 0.898
1.617TyrAsn: 1.617 ± 0.613
1.078TyrPro: 1.078 ± 0.523
2.425TyrGln: 2.425 ± 0.495
0.808TyrArg: 0.808 ± 0.197
4.042TyrSer: 4.042 ± 1.135
2.156TyrThr: 2.156 ± 1.036
3.234TyrVal: 3.234 ± 0.366
0.808TyrTrp: 0.808 ± 0.477
1.347TyrTyr: 1.347 ± 0.794
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3712 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski