Amino acid dipepetide frequency for Cedar virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.197AlaAla: 2.197 ± 0.603
0.732AlaCys: 0.732 ± 0.458
1.464AlaAsp: 1.464 ± 0.395
2.38AlaGlu: 2.38 ± 1.109
1.098AlaPhe: 1.098 ± 0.486
2.563AlaGly: 2.563 ± 0.829
0.915AlaHis: 0.915 ± 0.432
3.478AlaIle: 3.478 ± 0.875
3.295AlaLys: 3.295 ± 0.987
3.295AlaLeu: 3.295 ± 0.958
1.647AlaMet: 1.647 ± 0.652
2.014AlaAsn: 2.014 ± 0.496
1.098AlaPro: 1.098 ± 0.702
1.464AlaGln: 1.464 ± 0.394
1.281AlaArg: 1.281 ± 0.802
3.112AlaSer: 3.112 ± 0.841
2.563AlaThr: 2.563 ± 0.514
1.647AlaVal: 1.647 ± 0.475
0.183AlaTrp: 0.183 ± 0.115
0.915AlaTyr: 0.915 ± 0.347
0.0AlaXaa: 0.0 ± 0.0
Cys
0.549CysAla: 0.549 ± 0.261
0.183CysCys: 0.183 ± 0.115
0.183CysAsp: 0.183 ± 0.115
0.549CysGlu: 0.549 ± 0.284
0.732CysPhe: 0.732 ± 0.343
0.183CysGly: 0.183 ± 0.218
0.366CysHis: 0.366 ± 0.212
0.915CysIle: 0.915 ± 0.637
1.464CysLys: 1.464 ± 0.426
1.464CysLeu: 1.464 ± 0.563
0.915CysMet: 0.915 ± 0.44
1.281CysAsn: 1.281 ± 0.501
1.098CysPro: 1.098 ± 0.487
1.098CysGln: 1.098 ± 0.312
0.732CysArg: 0.732 ± 0.495
1.281CysSer: 1.281 ± 0.598
0.0CysThr: 0.0 ± 0.0
1.098CysVal: 1.098 ± 0.384
0.183CysTrp: 0.183 ± 0.115
1.464CysTyr: 1.464 ± 0.585
0.0CysXaa: 0.0 ± 0.0
Asp
1.83AspAla: 1.83 ± 0.554
0.0AspCys: 0.0 ± 0.0
4.393AspAsp: 4.393 ± 1.259
4.393AspGlu: 4.393 ± 1.79
3.295AspPhe: 3.295 ± 0.554
2.563AspGly: 2.563 ± 0.975
1.281AspHis: 1.281 ± 0.597
5.125AspIle: 5.125 ± 0.563
5.308AspLys: 5.308 ± 0.843
6.773AspLeu: 6.773 ± 1.147
1.83AspMet: 1.83 ± 0.788
6.224AspAsn: 6.224 ± 1.031
3.112AspPro: 3.112 ± 0.211
2.38AspGln: 2.38 ± 0.652
3.112AspArg: 3.112 ± 1.349
4.759AspSer: 4.759 ± 1.199
2.563AspThr: 2.563 ± 0.549
3.844AspVal: 3.844 ± 0.779
0.183AspTrp: 0.183 ± 0.115
2.746AspTyr: 2.746 ± 1.365
0.0AspXaa: 0.0 ± 0.0
Glu
2.014GluAla: 2.014 ± 0.918
0.915GluCys: 0.915 ± 0.306
4.942GluAsp: 4.942 ± 1.359
3.844GluGlu: 3.844 ± 0.967
2.197GluPhe: 2.197 ± 0.863
3.661GluGly: 3.661 ± 0.886
1.464GluHis: 1.464 ± 0.551
5.125GluIle: 5.125 ± 1.078
3.295GluLys: 3.295 ± 0.77
5.125GluLeu: 5.125 ± 0.748
1.281GluMet: 1.281 ± 0.443
3.844GluAsn: 3.844 ± 0.65
2.197GluPro: 2.197 ± 0.887
2.014GluGln: 2.014 ± 0.936
2.929GluArg: 2.929 ± 0.695
4.027GluSer: 4.027 ± 0.683
4.027GluThr: 4.027 ± 0.658
2.197GluVal: 2.197 ± 0.55
0.366GluTrp: 0.366 ± 0.211
2.38GluTyr: 2.38 ± 0.429
0.0GluXaa: 0.0 ± 0.0
Phe
2.563PheAla: 2.563 ± 0.655
1.464PheCys: 1.464 ± 0.719
2.563PheAsp: 2.563 ± 0.775
1.281PheGlu: 1.281 ± 0.459
2.014PhePhe: 2.014 ± 0.459
1.098PheGly: 1.098 ± 0.419
0.366PheHis: 0.366 ± 0.229
3.478PheIle: 3.478 ± 0.829
2.014PheLys: 2.014 ± 0.807
3.844PheLeu: 3.844 ± 0.903
1.281PheMet: 1.281 ± 0.797
3.295PheAsn: 3.295 ± 0.827
1.281PhePro: 1.281 ± 0.444
0.549PheGln: 0.549 ± 0.241
1.83PheArg: 1.83 ± 0.553
2.746PheSer: 2.746 ± 0.48
2.38PheThr: 2.38 ± 0.639
1.647PheVal: 1.647 ± 0.734
0.366PheTrp: 0.366 ± 0.229
1.098PheTyr: 1.098 ± 0.511
0.0PheXaa: 0.0 ± 0.0
Gly
1.464GlyAla: 1.464 ± 0.743
0.0GlyCys: 0.0 ± 0.0
2.563GlyAsp: 2.563 ± 0.452
2.38GlyGlu: 2.38 ± 0.451
2.929GlyPhe: 2.929 ± 0.555
3.844GlyGly: 3.844 ± 1.132
1.647GlyHis: 1.647 ± 0.652
5.308GlyIle: 5.308 ± 0.802
4.21GlyLys: 4.21 ± 0.915
4.759GlyLeu: 4.759 ± 1.035
1.098GlyMet: 1.098 ± 0.258
2.929GlyAsn: 2.929 ± 1.003
2.197GlyPro: 2.197 ± 0.506
1.464GlyGln: 1.464 ± 0.416
4.21GlyArg: 4.21 ± 1.419
4.942GlySer: 4.942 ± 1.717
1.83GlyThr: 1.83 ± 0.797
2.563GlyVal: 2.563 ± 0.527
0.183GlyTrp: 0.183 ± 0.115
1.281GlyTyr: 1.281 ± 0.443
0.0GlyXaa: 0.0 ± 0.0
His
1.281HisAla: 1.281 ± 0.462
0.915HisCys: 0.915 ± 0.573
1.098HisAsp: 1.098 ± 0.658
2.38HisGlu: 2.38 ± 1.193
0.549HisPhe: 0.549 ± 0.261
0.549HisGly: 0.549 ± 0.261
0.366HisHis: 0.366 ± 0.229
1.281HisIle: 1.281 ± 0.46
0.915HisLys: 0.915 ± 0.397
2.38HisLeu: 2.38 ± 0.678
0.0HisMet: 0.0 ± 0.0
1.281HisAsn: 1.281 ± 0.442
2.197HisPro: 2.197 ± 0.656
0.732HisGln: 0.732 ± 0.344
0.732HisArg: 0.732 ± 0.458
1.281HisSer: 1.281 ± 0.422
0.549HisThr: 0.549 ± 0.344
1.098HisVal: 1.098 ± 0.52
0.0HisTrp: 0.0 ± 0.0
0.915HisTyr: 0.915 ± 0.437
0.0HisXaa: 0.0 ± 0.0
Ile
3.844IleAla: 3.844 ± 0.994
2.746IleCys: 2.746 ± 0.657
5.125IleAsp: 5.125 ± 1.109
2.563IleGlu: 2.563 ± 0.565
2.563IlePhe: 2.563 ± 0.55
4.576IleGly: 4.576 ± 0.622
2.014IleHis: 2.014 ± 0.374
6.407IleIle: 6.407 ± 1.857
4.942IleLys: 4.942 ± 1.238
8.603IleLeu: 8.603 ± 1.127
2.929IleMet: 2.929 ± 0.667
6.773IleAsn: 6.773 ± 1.353
4.027IlePro: 4.027 ± 0.493
2.746IleGln: 2.746 ± 0.604
4.759IleArg: 4.759 ± 1.078
8.237IleSer: 8.237 ± 1.921
6.956IleThr: 6.956 ± 1.648
2.563IleVal: 2.563 ± 0.998
0.549IleTrp: 0.549 ± 0.263
2.563IleTyr: 2.563 ± 0.555
0.0IleXaa: 0.0 ± 0.0
Lys
2.929LysAla: 2.929 ± 0.706
0.366LysCys: 0.366 ± 0.212
5.858LysAsp: 5.858 ± 1.318
5.125LysGlu: 5.125 ± 1.294
3.295LysPhe: 3.295 ± 0.975
4.393LysGly: 4.393 ± 0.687
1.281LysHis: 1.281 ± 0.326
6.59LysIle: 6.59 ± 1.327
6.407LysLys: 6.407 ± 1.403
6.407LysLeu: 6.407 ± 0.913
2.38LysMet: 2.38 ± 0.499
5.858LysAsn: 5.858 ± 1.096
1.464LysPro: 1.464 ± 0.462
1.098LysGln: 1.098 ± 0.543
3.661LysArg: 3.661 ± 0.786
6.407LysSer: 6.407 ± 1.138
3.844LysThr: 3.844 ± 0.693
3.661LysVal: 3.661 ± 0.702
0.732LysTrp: 0.732 ± 0.223
3.112LysTyr: 3.112 ± 0.459
0.0LysXaa: 0.0 ± 0.0
Leu
2.38LeuAla: 2.38 ± 0.705
0.549LeuCys: 0.549 ± 0.344
5.491LeuAsp: 5.491 ± 1.116
6.041LeuGlu: 6.041 ± 1.503
4.393LeuPhe: 4.393 ± 1.313
4.759LeuGly: 4.759 ± 0.617
2.197LeuHis: 2.197 ± 0.538
7.505LeuIle: 7.505 ± 1.472
6.041LeuLys: 6.041 ± 0.823
7.139LeuLeu: 7.139 ± 1.574
2.38LeuMet: 2.38 ± 0.804
6.59LeuAsn: 6.59 ± 0.521
2.746LeuPro: 2.746 ± 0.755
2.746LeuGln: 2.746 ± 0.822
5.491LeuArg: 5.491 ± 0.922
8.603LeuSer: 8.603 ± 0.891
6.773LeuThr: 6.773 ± 0.964
3.844LeuVal: 3.844 ± 0.602
0.915LeuTrp: 0.915 ± 0.432
3.661LeuTyr: 3.661 ± 0.967
0.0LeuXaa: 0.0 ± 0.0
Met
1.098MetAla: 1.098 ± 0.549
0.0MetCys: 0.0 ± 0.0
2.014MetAsp: 2.014 ± 0.673
2.563MetGlu: 2.563 ± 0.927
0.366MetPhe: 0.366 ± 0.229
1.464MetGly: 1.464 ± 0.691
0.549MetHis: 0.549 ± 0.344
3.478MetIle: 3.478 ± 0.367
2.014MetLys: 2.014 ± 0.516
2.38MetLeu: 2.38 ± 0.604
0.366MetMet: 0.366 ± 0.197
2.197MetAsn: 2.197 ± 0.485
0.732MetPro: 0.732 ± 0.654
0.732MetGln: 0.732 ± 0.399
0.732MetArg: 0.732 ± 0.34
2.746MetSer: 2.746 ± 0.955
2.746MetThr: 2.746 ± 1.092
1.647MetVal: 1.647 ± 0.487
0.366MetTrp: 0.366 ± 0.229
1.098MetTyr: 1.098 ± 0.469
0.0MetXaa: 0.0 ± 0.0
Asn
1.83AsnAla: 1.83 ± 0.316
1.464AsnCys: 1.464 ± 0.625
4.027AsnAsp: 4.027 ± 0.898
3.661AsnGlu: 3.661 ± 0.419
2.014AsnPhe: 2.014 ± 0.719
3.661AsnGly: 3.661 ± 1.315
1.098AsnHis: 1.098 ± 0.368
6.956AsnIle: 6.956 ± 1.511
5.491AsnLys: 5.491 ± 1.458
5.491AsnLeu: 5.491 ± 0.491
2.746AsnMet: 2.746 ± 0.589
5.858AsnAsn: 5.858 ± 1.226
5.491AsnPro: 5.491 ± 0.735
3.295AsnGln: 3.295 ± 0.767
2.929AsnArg: 2.929 ± 0.926
4.576AsnSer: 4.576 ± 0.813
3.661AsnThr: 3.661 ± 0.849
2.38AsnVal: 2.38 ± 0.622
1.281AsnTrp: 1.281 ± 0.494
3.661AsnTyr: 3.661 ± 0.979
0.0AsnXaa: 0.0 ± 0.0
Pro
1.647ProAla: 1.647 ± 0.682
0.183ProCys: 0.183 ± 0.115
3.295ProAsp: 3.295 ± 0.967
3.295ProGlu: 3.295 ± 0.798
1.098ProPhe: 1.098 ± 0.258
2.197ProGly: 2.197 ± 0.404
0.732ProHis: 0.732 ± 0.458
3.661ProIle: 3.661 ± 0.673
4.027ProLys: 4.027 ± 0.669
3.478ProLeu: 3.478 ± 0.381
0.915ProMet: 0.915 ± 0.578
2.197ProAsn: 2.197 ± 0.872
1.83ProPro: 1.83 ± 0.424
1.647ProGln: 1.647 ± 0.505
1.83ProArg: 1.83 ± 0.419
3.478ProSer: 3.478 ± 0.805
1.83ProThr: 1.83 ± 0.512
2.197ProVal: 2.197 ± 0.501
0.732ProTrp: 0.732 ± 0.507
2.014ProTyr: 2.014 ± 0.89
0.0ProXaa: 0.0 ± 0.0
Gln
0.915GlnAla: 0.915 ± 0.398
0.183GlnCys: 0.183 ± 0.27
3.112GlnAsp: 3.112 ± 0.913
1.464GlnGlu: 1.464 ± 0.427
0.732GlnPhe: 0.732 ± 0.446
2.197GlnGly: 2.197 ± 0.723
0.549GlnHis: 0.549 ± 0.693
2.746GlnIle: 2.746 ± 1.394
2.563GlnLys: 2.563 ± 0.454
2.563GlnLeu: 2.563 ± 0.701
0.366GlnMet: 0.366 ± 0.462
2.38GlnAsn: 2.38 ± 0.545
1.464GlnPro: 1.464 ± 0.368
2.014GlnGln: 2.014 ± 0.761
1.464GlnArg: 1.464 ± 0.525
3.844GlnSer: 3.844 ± 0.933
2.197GlnThr: 2.197 ± 0.436
2.38GlnVal: 2.38 ± 0.71
0.366GlnTrp: 0.366 ± 0.367
1.098GlnTyr: 1.098 ± 0.436
0.0GlnXaa: 0.0 ± 0.0
Arg
2.014ArgAla: 2.014 ± 0.889
0.549ArgCys: 0.549 ± 0.236
3.478ArgAsp: 3.478 ± 1.231
4.027ArgGlu: 4.027 ± 0.935
1.281ArgPhe: 1.281 ± 0.332
2.563ArgGly: 2.563 ± 0.559
0.915ArgHis: 0.915 ± 0.573
3.478ArgIle: 3.478 ± 1.083
4.759ArgLys: 4.759 ± 0.939
4.393ArgLeu: 4.393 ± 0.91
1.098ArgMet: 1.098 ± 0.436
1.464ArgAsn: 1.464 ± 0.546
1.647ArgPro: 1.647 ± 0.735
1.647ArgGln: 1.647 ± 0.778
2.929ArgArg: 2.929 ± 0.72
5.125ArgSer: 5.125 ± 1.27
1.83ArgThr: 1.83 ± 0.669
2.929ArgVal: 2.929 ± 0.49
0.549ArgTrp: 0.549 ± 0.456
1.83ArgTyr: 1.83 ± 0.542
0.0ArgXaa: 0.0 ± 0.0
Ser
3.112SerAla: 3.112 ± 1.24
2.197SerCys: 2.197 ± 0.725
5.308SerAsp: 5.308 ± 0.92
3.478SerGlu: 3.478 ± 0.787
4.21SerPhe: 4.21 ± 1.06
3.295SerGly: 3.295 ± 1.143
1.464SerHis: 1.464 ± 0.482
7.505SerIle: 7.505 ± 0.591
6.407SerLys: 6.407 ± 0.886
9.336SerLeu: 9.336 ± 1.418
3.478SerMet: 3.478 ± 0.414
5.308SerAsn: 5.308 ± 0.709
2.929SerPro: 2.929 ± 0.555
3.112SerGln: 3.112 ± 0.466
3.478SerArg: 3.478 ± 0.847
5.308SerSer: 5.308 ± 1.384
5.491SerThr: 5.491 ± 0.889
4.942SerVal: 4.942 ± 1.328
0.183SerTrp: 0.183 ± 0.218
2.197SerTyr: 2.197 ± 0.425
0.0SerXaa: 0.0 ± 0.0
Thr
2.746ThrAla: 2.746 ± 0.914
0.732ThrCys: 0.732 ± 0.427
3.844ThrAsp: 3.844 ± 0.868
4.027ThrGlu: 4.027 ± 0.595
1.464ThrPhe: 1.464 ± 0.545
3.295ThrGly: 3.295 ± 0.771
1.098ThrHis: 1.098 ± 0.543
5.125ThrIle: 5.125 ± 0.911
5.125ThrLys: 5.125 ± 0.518
3.661ThrLeu: 3.661 ± 0.386
0.915ThrMet: 0.915 ± 0.693
4.759ThrAsn: 4.759 ± 0.722
2.38ThrPro: 2.38 ± 0.604
1.83ThrGln: 1.83 ± 0.462
2.38ThrArg: 2.38 ± 0.524
5.308ThrSer: 5.308 ± 1.036
4.942ThrThr: 4.942 ± 1.183
2.746ThrVal: 2.746 ± 0.535
0.915ThrTrp: 0.915 ± 0.453
2.38ThrTyr: 2.38 ± 0.992
0.0ThrXaa: 0.0 ± 0.0
Val
1.464ValAla: 1.464 ± 0.731
0.366ValCys: 0.366 ± 0.341
4.393ValAsp: 4.393 ± 0.812
2.014ValGlu: 2.014 ± 0.815
1.464ValPhe: 1.464 ± 0.691
3.112ValGly: 3.112 ± 0.782
1.281ValHis: 1.281 ± 0.268
4.759ValIle: 4.759 ± 1.221
3.478ValLys: 3.478 ± 0.79
4.21ValLeu: 4.21 ± 0.563
1.647ValMet: 1.647 ± 0.466
3.112ValAsn: 3.112 ± 0.513
2.38ValPro: 2.38 ± 0.54
2.563ValGln: 2.563 ± 0.452
2.197ValArg: 2.197 ± 0.646
2.563ValSer: 2.563 ± 0.911
2.929ValThr: 2.929 ± 0.665
1.281ValVal: 1.281 ± 0.649
0.183ValTrp: 0.183 ± 0.115
2.197ValTyr: 2.197 ± 0.711
0.0ValXaa: 0.0 ± 0.0
Trp
0.366TrpAla: 0.366 ± 0.227
0.549TrpCys: 0.549 ± 0.261
0.549TrpAsp: 0.549 ± 0.261
0.549TrpGlu: 0.549 ± 0.248
0.732TrpPhe: 0.732 ± 0.458
0.183TrpGly: 0.183 ± 0.231
0.0TrpHis: 0.0 ± 0.0
0.732TrpIle: 0.732 ± 0.454
0.366TrpLys: 0.366 ± 0.212
0.549TrpLeu: 0.549 ± 0.392
0.366TrpMet: 0.366 ± 0.212
0.549TrpAsn: 0.549 ± 0.303
0.183TrpPro: 0.183 ± 0.115
0.0TrpGln: 0.0 ± 0.0
0.915TrpArg: 0.915 ± 0.305
0.549TrpSer: 0.549 ± 0.266
0.366TrpThr: 0.366 ± 0.243
0.183TrpVal: 0.183 ± 0.115
0.366TrpTrp: 0.366 ± 0.229
0.549TrpTyr: 0.549 ± 0.241
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.098TyrAla: 1.098 ± 0.258
1.647TyrCys: 1.647 ± 0.813
2.014TyrAsp: 2.014 ± 0.83
1.83TyrGlu: 1.83 ± 0.349
0.732TyrPhe: 0.732 ± 0.302
1.647TyrGly: 1.647 ± 0.496
0.915TyrHis: 0.915 ± 0.461
1.83TyrIle: 1.83 ± 0.712
2.746TyrLys: 2.746 ± 0.577
4.393TyrLeu: 4.393 ± 1.174
1.464TyrMet: 1.464 ± 0.759
3.844TyrAsn: 3.844 ± 0.688
1.83TyrPro: 1.83 ± 0.576
1.464TyrGln: 1.464 ± 0.194
0.915TyrArg: 0.915 ± 0.352
3.844TyrSer: 3.844 ± 0.878
2.197TyrThr: 2.197 ± 0.668
2.746TyrVal: 2.746 ± 0.851
0.0TyrTrp: 0.0 ± 0.0
1.281TyrTyr: 1.281 ± 0.678
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (5464 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski