Amino acid dipepetide frequency for Horseradish latent virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.465AlaAla: 2.465 ± 0.604
0.0AlaCys: 0.0 ± 0.0
2.465AlaAsp: 2.465 ± 1.682
2.876AlaGlu: 2.876 ± 0.65
3.287AlaPhe: 3.287 ± 1.062
2.054AlaGly: 2.054 ± 0.883
1.233AlaHis: 1.233 ± 0.697
3.698AlaIle: 3.698 ± 0.898
6.163AlaLys: 6.163 ± 2.083
2.465AlaLeu: 2.465 ± 1.162
1.643AlaMet: 1.643 ± 0.868
2.054AlaAsn: 2.054 ± 0.706
1.233AlaPro: 1.233 ± 0.947
2.876AlaGln: 2.876 ± 0.798
2.054AlaArg: 2.054 ± 0.768
4.108AlaSer: 4.108 ± 1.577
2.465AlaThr: 2.465 ± 1.223
2.054AlaVal: 2.054 ± 0.935
0.0AlaTrp: 0.0 ± 0.0
0.822AlaTyr: 0.822 ± 0.477
0.0AlaXaa: 0.0 ± 0.0
Cys
0.822CysAla: 0.822 ± 0.433
0.822CysCys: 0.822 ± 0.392
1.233CysAsp: 1.233 ± 0.697
1.233CysGlu: 1.233 ± 0.492
0.822CysPhe: 0.822 ± 0.371
0.411CysGly: 0.411 ± 0.444
0.411CysHis: 0.411 ± 0.62
1.643CysIle: 1.643 ± 0.575
1.233CysLys: 1.233 ± 0.703
0.822CysLeu: 0.822 ± 0.477
0.0CysMet: 0.0 ± 0.0
1.643CysAsn: 1.643 ± 0.57
1.643CysPro: 1.643 ± 0.819
1.643CysGln: 1.643 ± 0.306
1.643CysArg: 1.643 ± 0.479
1.233CysSer: 1.233 ± 0.612
0.411CysThr: 0.411 ± 0.379
0.411CysVal: 0.411 ± 0.444
0.411CysTrp: 0.411 ± 0.379
0.411CysTyr: 0.411 ± 0.316
0.0CysXaa: 0.0 ± 0.0
Asp
2.054AspAla: 2.054 ± 0.934
2.876AspCys: 2.876 ± 0.667
2.465AspAsp: 2.465 ± 1.378
4.108AspGlu: 4.108 ± 1.52
2.465AspPhe: 2.465 ± 0.659
1.233AspGly: 1.233 ± 0.708
0.411AspHis: 0.411 ± 0.316
3.287AspIle: 3.287 ± 0.741
4.108AspLys: 4.108 ± 0.928
4.519AspLeu: 4.519 ± 0.933
1.233AspMet: 1.233 ± 1.108
4.108AspAsn: 4.108 ± 0.946
2.876AspPro: 2.876 ± 0.466
1.233AspGln: 1.233 ± 0.735
2.876AspArg: 2.876 ± 0.763
4.519AspSer: 4.519 ± 1.899
3.698AspThr: 3.698 ± 0.499
2.465AspVal: 2.465 ± 0.803
0.0AspTrp: 0.0 ± 0.0
2.465AspTyr: 2.465 ± 0.578
0.0AspXaa: 0.0 ± 0.0
Glu
4.108GluAla: 4.108 ± 1.512
0.822GluCys: 0.822 ± 0.433
6.574GluAsp: 6.574 ± 1.976
7.806GluGlu: 7.806 ± 1.664
2.465GluPhe: 2.465 ± 0.675
1.643GluGly: 1.643 ± 1.081
1.643GluHis: 1.643 ± 0.784
7.806GluIle: 7.806 ± 2.243
9.449GluLys: 9.449 ± 1.023
6.984GluLeu: 6.984 ± 1.592
0.0GluMet: 0.0 ± 0.0
2.054GluAsn: 2.054 ± 0.768
3.287GluPro: 3.287 ± 1.526
4.519GluGln: 4.519 ± 1.824
2.876GluArg: 2.876 ± 1.114
8.217GluSer: 8.217 ± 1.311
3.287GluThr: 3.287 ± 0.859
2.465GluVal: 2.465 ± 1.325
0.822GluTrp: 0.822 ± 0.392
2.054GluTyr: 2.054 ± 1.427
0.0GluXaa: 0.0 ± 0.0
Phe
2.054PheAla: 2.054 ± 0.801
1.233PheCys: 1.233 ± 0.677
2.876PheAsp: 2.876 ± 1.351
1.233PheGlu: 1.233 ± 0.464
0.822PhePhe: 0.822 ± 0.621
2.054PheGly: 2.054 ± 0.768
0.822PheHis: 0.822 ± 0.371
2.465PheIle: 2.465 ± 0.405
4.108PheLys: 4.108 ± 1.525
4.519PheLeu: 4.519 ± 1.851
0.411PheMet: 0.411 ± 0.444
0.822PheAsn: 0.822 ± 0.371
1.233PhePro: 1.233 ± 0.603
2.876PheGln: 2.876 ± 1.213
2.465PheArg: 2.465 ± 1.057
2.876PheSer: 2.876 ± 1.055
2.876PheThr: 2.876 ± 0.592
2.054PheVal: 2.054 ± 0.475
0.822PheTrp: 0.822 ± 0.392
0.411PheTyr: 0.411 ± 0.444
0.0PheXaa: 0.0 ± 0.0
Gly
2.876GlyAla: 2.876 ± 1.126
0.822GlyCys: 0.822 ± 0.433
2.465GlyAsp: 2.465 ± 0.497
4.108GlyGlu: 4.108 ± 0.497
2.054GlyPhe: 2.054 ± 0.741
1.643GlyGly: 1.643 ± 0.484
1.643GlyHis: 1.643 ± 0.7
3.698GlyIle: 3.698 ± 0.742
5.752GlyLys: 5.752 ± 1.419
2.465GlyLeu: 2.465 ± 0.835
1.233GlyMet: 1.233 ± 0.546
1.643GlyAsn: 1.643 ± 0.887
2.054GlyPro: 2.054 ± 1.357
1.643GlyGln: 1.643 ± 0.884
3.287GlyArg: 3.287 ± 0.542
4.519GlySer: 4.519 ± 2.206
2.054GlyThr: 2.054 ± 0.537
3.698GlyVal: 3.698 ± 0.885
0.822GlyTrp: 0.822 ± 0.632
0.822GlyTyr: 0.822 ± 0.477
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.822HisCys: 0.822 ± 0.713
0.0HisAsp: 0.0 ± 0.0
0.822HisGlu: 0.822 ± 0.524
0.411HisPhe: 0.411 ± 0.316
0.822HisGly: 0.822 ± 0.371
0.822HisHis: 0.822 ± 0.477
4.93HisIle: 4.93 ± 1.484
1.233HisLys: 1.233 ± 0.942
0.822HisLeu: 0.822 ± 0.392
0.822HisMet: 0.822 ± 0.632
0.411HisAsn: 0.411 ± 0.356
0.411HisPro: 0.411 ± 0.316
0.411HisGln: 0.411 ± 0.316
0.822HisArg: 0.822 ± 0.477
1.643HisSer: 1.643 ± 0.636
0.0HisThr: 0.0 ± 0.0
1.233HisVal: 1.233 ± 0.603
0.0HisTrp: 0.0 ± 0.0
0.822HisTyr: 0.822 ± 0.392
0.0HisXaa: 0.0 ± 0.0
Ile
2.465IleAla: 2.465 ± 0.821
2.465IleCys: 2.465 ± 0.774
4.519IleAsp: 4.519 ± 1.478
5.752IleGlu: 5.752 ± 0.899
2.465IlePhe: 2.465 ± 0.743
2.465IleGly: 2.465 ± 1.023
1.233IleHis: 1.233 ± 0.464
4.519IleIle: 4.519 ± 1.721
5.341IleLys: 5.341 ± 1.974
6.163IleLeu: 6.163 ± 1.367
0.411IleMet: 0.411 ± 0.356
5.341IleAsn: 5.341 ± 1.224
4.93IlePro: 4.93 ± 1.276
4.519IleGln: 4.519 ± 1.121
5.752IleArg: 5.752 ± 2.361
3.698IleSer: 3.698 ± 1.302
4.93IleThr: 4.93 ± 1.038
2.876IleVal: 2.876 ± 0.825
0.0IleTrp: 0.0 ± 0.0
3.698IleTyr: 3.698 ± 0.705
0.0IleXaa: 0.0 ± 0.0
Lys
5.752LysAla: 5.752 ± 0.934
1.233LysCys: 1.233 ± 0.612
5.752LysAsp: 5.752 ± 2.111
9.039LysGlu: 9.039 ± 1.65
4.108LysPhe: 4.108 ± 0.933
5.341LysGly: 5.341 ± 0.882
0.411LysHis: 0.411 ± 0.316
8.628LysIle: 8.628 ± 1.576
11.915LysLys: 11.915 ± 2.117
9.86LysLeu: 9.86 ± 1.323
0.822LysMet: 0.822 ± 0.392
4.108LysAsn: 4.108 ± 1.757
6.163LysPro: 6.163 ± 1.37
1.643LysGln: 1.643 ± 0.708
3.287LysArg: 3.287 ± 1.238
7.806LysSer: 7.806 ± 1.76
6.574LysThr: 6.574 ± 1.003
6.984LysVal: 6.984 ± 1.262
0.822LysTrp: 0.822 ± 0.678
5.341LysTyr: 5.341 ± 1.135
0.0LysXaa: 0.0 ± 0.0
Leu
4.108LeuAla: 4.108 ± 0.398
1.233LeuCys: 1.233 ± 0.464
4.93LeuAsp: 4.93 ± 1.915
5.752LeuGlu: 5.752 ± 1.315
0.822LeuPhe: 0.822 ± 0.392
6.163LeuGly: 6.163 ± 1.348
1.643LeuHis: 1.643 ± 0.575
4.108LeuIle: 4.108 ± 2.542
8.217LeuLys: 8.217 ± 1.572
10.271LeuLeu: 10.271 ± 2.084
3.287LeuMet: 3.287 ± 1.272
5.341LeuAsn: 5.341 ± 1.676
2.465LeuPro: 2.465 ± 0.743
4.93LeuGln: 4.93 ± 1.584
2.876LeuArg: 2.876 ± 0.814
4.519LeuSer: 4.519 ± 1.566
7.806LeuThr: 7.806 ± 1.932
5.752LeuVal: 5.752 ± 1.408
0.0LeuTrp: 0.0 ± 0.0
2.465LeuTyr: 2.465 ± 0.51
0.0LeuXaa: 0.0 ± 0.0
Met
1.233MetAla: 1.233 ± 0.484
0.0MetCys: 0.0 ± 0.0
1.643MetAsp: 1.643 ± 0.575
3.287MetGlu: 3.287 ± 1.384
1.233MetPhe: 1.233 ± 0.492
0.822MetGly: 0.822 ± 0.371
0.0MetHis: 0.0 ± 0.0
1.643MetIle: 1.643 ± 0.563
3.698MetLys: 3.698 ± 1.565
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
1.233MetAsn: 1.233 ± 0.677
0.411MetPro: 0.411 ± 0.379
1.233MetGln: 1.233 ± 0.329
0.0MetArg: 0.0 ± 0.0
1.233MetSer: 1.233 ± 1.116
1.233MetThr: 1.233 ± 0.895
2.465MetVal: 2.465 ± 0.675
0.0MetTrp: 0.0 ± 0.0
0.411MetTyr: 0.411 ± 0.459
0.0MetXaa: 0.0 ± 0.0
Asn
0.822AsnAla: 0.822 ± 0.621
1.233AsnCys: 1.233 ± 0.329
2.876AsnAsp: 2.876 ± 0.856
4.93AsnGlu: 4.93 ± 1.332
1.643AsnPhe: 1.643 ± 0.742
3.287AsnGly: 3.287 ± 1.296
0.822AsnHis: 0.822 ± 0.477
2.876AsnIle: 2.876 ± 1.079
4.93AsnLys: 4.93 ± 1.279
7.395AsnLeu: 7.395 ± 1.755
0.822AsnMet: 0.822 ± 0.713
3.287AsnAsn: 3.287 ± 0.947
4.108AsnPro: 4.108 ± 0.879
3.287AsnGln: 3.287 ± 1.267
0.822AsnArg: 0.822 ± 0.433
2.876AsnSer: 2.876 ± 1.126
2.465AsnThr: 2.465 ± 1.167
2.054AsnVal: 2.054 ± 0.834
0.822AsnTrp: 0.822 ± 0.498
2.876AsnTyr: 2.876 ± 1.396
0.0AsnXaa: 0.0 ± 0.0
Pro
2.876ProAla: 2.876 ± 0.851
0.822ProCys: 0.822 ± 0.494
0.822ProAsp: 0.822 ± 0.392
6.574ProGlu: 6.574 ± 1.935
1.643ProPhe: 1.643 ± 0.708
1.643ProGly: 1.643 ± 0.694
1.233ProHis: 1.233 ± 0.575
2.054ProIle: 2.054 ± 0.666
6.163ProLys: 6.163 ± 2.418
4.519ProLeu: 4.519 ± 1.746
1.233ProMet: 1.233 ± 0.782
3.698ProAsn: 3.698 ± 1.14
0.822ProPro: 0.822 ± 0.392
2.465ProGln: 2.465 ± 0.841
2.054ProArg: 2.054 ± 0.821
2.876ProSer: 2.876 ± 1.154
0.822ProThr: 0.822 ± 0.477
2.054ProVal: 2.054 ± 0.475
0.0ProTrp: 0.0 ± 0.0
1.643ProTyr: 1.643 ± 0.884
0.0ProXaa: 0.0 ± 0.0
Gln
2.465GlnAla: 2.465 ± 0.788
0.411GlnCys: 0.411 ± 0.356
0.0GlnAsp: 0.0 ± 0.0
4.519GlnGlu: 4.519 ± 0.794
1.233GlnPhe: 1.233 ± 0.748
1.233GlnGly: 1.233 ± 0.947
0.822GlnHis: 0.822 ± 0.371
2.054GlnIle: 2.054 ± 1.097
4.519GlnLys: 4.519 ± 0.989
3.287GlnLeu: 3.287 ± 1.357
0.411GlnMet: 0.411 ± 0.356
2.465GlnAsn: 2.465 ± 1.114
2.465GlnPro: 2.465 ± 0.713
2.054GlnGln: 2.054 ± 0.591
2.054GlnArg: 2.054 ± 0.381
2.054GlnSer: 2.054 ± 0.995
5.341GlnThr: 5.341 ± 1.651
2.465GlnVal: 2.465 ± 1.176
0.822GlnTrp: 0.822 ± 0.371
1.643GlnTyr: 1.643 ± 0.455
0.0GlnXaa: 0.0 ± 0.0
Arg
1.233ArgAla: 1.233 ± 0.655
0.411ArgCys: 0.411 ± 0.379
1.233ArgAsp: 1.233 ± 0.702
2.465ArgGlu: 2.465 ± 0.915
2.465ArgPhe: 2.465 ± 1.316
2.465ArgGly: 2.465 ± 1.531
0.411ArgHis: 0.411 ± 0.316
3.287ArgIle: 3.287 ± 1.247
4.93ArgLys: 4.93 ± 1.299
5.341ArgLeu: 5.341 ± 1.178
2.054ArgMet: 2.054 ± 0.726
2.054ArgAsn: 2.054 ± 0.862
3.287ArgPro: 3.287 ± 1.007
0.822ArgGln: 0.822 ± 0.524
2.465ArgArg: 2.465 ± 0.947
2.876ArgSer: 2.876 ± 1.187
2.876ArgThr: 2.876 ± 0.455
0.411ArgVal: 0.411 ± 0.499
1.233ArgTrp: 1.233 ± 0.947
0.822ArgTyr: 0.822 ± 0.392
0.0ArgXaa: 0.0 ± 0.0
Ser
3.698SerAla: 3.698 ± 2.139
0.0SerCys: 0.0 ± 0.0
6.163SerAsp: 6.163 ± 2.198
5.341SerGlu: 5.341 ± 2.319
2.465SerPhe: 2.465 ± 0.695
7.395SerGly: 7.395 ± 1.231
0.822SerHis: 0.822 ± 0.477
5.341SerIle: 5.341 ± 1.531
8.217SerLys: 8.217 ± 1.764
7.395SerLeu: 7.395 ± 1.408
2.876SerMet: 2.876 ± 0.952
2.876SerAsn: 2.876 ± 1.056
2.465SerPro: 2.465 ± 0.841
1.233SerGln: 1.233 ± 0.702
1.233SerArg: 1.233 ± 0.59
8.628SerSer: 8.628 ± 2.526
6.163SerThr: 6.163 ± 1.451
3.287SerVal: 3.287 ± 1.243
0.0SerTrp: 0.0 ± 0.0
0.411SerTyr: 0.411 ± 0.402
0.0SerXaa: 0.0 ± 0.0
Thr
3.287ThrAla: 3.287 ± 0.912
1.233ThrCys: 1.233 ± 0.703
2.465ThrAsp: 2.465 ± 0.647
2.876ThrGlu: 2.876 ± 0.765
2.054ThrPhe: 2.054 ± 1.16
3.287ThrGly: 3.287 ± 0.814
1.643ThrHis: 1.643 ± 0.658
5.752ThrIle: 5.752 ± 1.693
5.752ThrLys: 5.752 ± 2.273
5.341ThrLeu: 5.341 ± 1.081
0.822ThrMet: 0.822 ± 0.632
6.163ThrAsn: 6.163 ± 0.658
2.054ThrPro: 2.054 ± 0.676
1.643ThrGln: 1.643 ± 0.742
2.465ThrArg: 2.465 ± 1.009
5.341ThrSer: 5.341 ± 2.761
3.698ThrThr: 3.698 ± 2.319
1.643ThrVal: 1.643 ± 0.596
0.822ThrTrp: 0.822 ± 0.632
1.233ThrTyr: 1.233 ± 0.675
0.0ThrXaa: 0.0 ± 0.0
Val
1.233ValAla: 1.233 ± 0.435
2.054ValCys: 2.054 ± 0.63
2.465ValAsp: 2.465 ± 0.831
4.108ValGlu: 4.108 ± 1.998
4.519ValPhe: 4.519 ± 1.639
2.465ValGly: 2.465 ± 0.739
0.411ValHis: 0.411 ± 0.444
3.287ValIle: 3.287 ± 1.009
5.341ValLys: 5.341 ± 0.803
1.233ValLeu: 1.233 ± 0.329
2.054ValMet: 2.054 ± 0.815
2.876ValAsn: 2.876 ± 0.65
2.054ValPro: 2.054 ± 0.591
1.233ValGln: 1.233 ± 0.652
3.287ValArg: 3.287 ± 0.716
2.876ValSer: 2.876 ± 0.816
1.643ValThr: 1.643 ± 0.757
2.465ValVal: 2.465 ± 0.982
0.411ValTrp: 0.411 ± 0.356
2.876ValTyr: 2.876 ± 1.101
0.0ValXaa: 0.0 ± 0.0
Trp
0.411TrpAla: 0.411 ± 0.444
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.411TrpGlu: 0.411 ± 0.316
0.411TrpPhe: 0.411 ± 0.62
1.233TrpGly: 1.233 ± 0.329
0.0TrpHis: 0.0 ± 0.0
0.822TrpIle: 0.822 ± 0.433
0.411TrpLys: 0.411 ± 0.499
0.411TrpLeu: 0.411 ± 0.316
0.411TrpMet: 0.411 ± 0.316
0.411TrpAsn: 0.411 ± 0.316
0.411TrpPro: 0.411 ± 0.402
0.822TrpGln: 0.822 ± 0.632
0.411TrpArg: 0.411 ± 0.316
1.233TrpSer: 1.233 ± 0.731
0.411TrpThr: 0.411 ± 0.316
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.054TyrAla: 2.054 ± 0.381
0.411TyrCys: 0.411 ± 0.444
2.054TyrAsp: 2.054 ± 1.104
1.643TyrGlu: 1.643 ± 0.455
1.643TyrPhe: 1.643 ± 0.784
2.054TyrGly: 2.054 ± 1.22
0.822TyrHis: 0.822 ± 0.371
1.643TyrIle: 1.643 ± 0.695
4.108TyrLys: 4.108 ± 1.573
2.876TyrLeu: 2.876 ± 0.91
0.822TyrMet: 0.822 ± 0.371
1.643TyrAsn: 1.643 ± 0.455
1.643TyrPro: 1.643 ± 0.881
1.233TyrGln: 1.233 ± 0.77
0.411TyrArg: 0.411 ± 0.356
2.876TyrSer: 2.876 ± 0.553
0.822TyrThr: 0.822 ± 0.917
2.054TyrVal: 2.054 ± 0.645
0.411TyrTrp: 0.411 ± 0.316
0.411TyrTyr: 0.411 ± 0.356
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2435 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski