Amino acid dipeptide frequency for Euphorbia caput-medusae latent virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
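The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual code: the function names are hypothetical, pairs are assumed to be counted as overlapping windows within each protein (never across protein boundaries), and every non-standard letter is assumed to map to Xaa (X).

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; X stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["AAC", "CA"]  # toy sequences, not taken from the proteome
    freqs = dipeptide_frequencies(toy)
    print(freqs["AA"], classify(freqs["AA"]))
```

With the table values, `classify(4.706)` would label AlaAsp as overrepresented and `classify(2.353)` would label AlaAla as underrepresented, matching the red and blue marking described above.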

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
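As a small worked example of the unit conversion, the sketch below turns a per-mille value into a fraction and into percent. The helper names are hypothetical; the LeuLeu value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1% equals 10 per mille)."""
    return value / 10.0


leu_leu = 10.196  # LeuLeu entry from the table, in per mille
print(per_mille_to_fraction(leu_leu))  # about 0.0102
print(per_mille_to_percent(leu_leu))   # about 1.02 (percent)
print(per_mille_to_percent(2.5))       # 0.25, the uniform expectation in percent
```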

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.353AlaAla: 2.353 ± 0.965
0.0AlaCys: 0.0 ± 0.0
4.706AlaAsp: 4.706 ± 1.944
3.137AlaGlu: 3.137 ± 0.965
0.784AlaPhe: 0.784 ± 0.832
0.784AlaGly: 0.784 ± 0.66
1.569AlaHis: 1.569 ± 1.408
2.353AlaIle: 2.353 ± 1.699
1.569AlaLys: 1.569 ± 0.743
3.922AlaLeu: 3.922 ± 1.521
0.784AlaMet: 0.784 ± 0.746
3.137AlaAsn: 3.137 ± 1.035
0.0AlaPro: 0.0 ± 0.0
3.922AlaGln: 3.922 ± 2.561
7.059AlaArg: 7.059 ± 1.464
6.275AlaSer: 6.275 ± 1.081
4.706AlaThr: 4.706 ± 1.293
0.0AlaVal: 0.0 ± 0.0
1.569AlaTrp: 1.569 ± 0.763
1.569AlaTyr: 1.569 ± 0.954
0.0AlaXaa: 0.0 ± 0.0
Cys
1.569CysAla: 1.569 ± 0.933
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.784CysGly: 0.784 ± 0.671
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.784CysLys: 0.784 ± 0.671
2.353CysLeu: 2.353 ± 0.987
0.0CysMet: 0.0 ± 0.0
0.784CysAsn: 0.784 ± 0.671
1.569CysPro: 1.569 ± 0.743
2.353CysGln: 2.353 ± 1.229
0.0CysArg: 0.0 ± 0.0
0.784CysSer: 0.784 ± 0.804
3.137CysThr: 3.137 ± 0.761
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.784CysTyr: 0.784 ± 0.66
0.0CysXaa: 0.0 ± 0.0
Asp
2.353AspAla: 2.353 ± 0.646
1.569AspCys: 1.569 ± 1.321
1.569AspAsp: 1.569 ± 0.895
1.569AspGlu: 1.569 ± 1.408
2.353AspPhe: 2.353 ± 1.277
3.137AspGly: 3.137 ± 1.075
0.784AspHis: 0.784 ± 0.66
3.137AspIle: 3.137 ± 1.487
3.137AspLys: 3.137 ± 0.761
3.137AspLeu: 3.137 ± 1.583
0.784AspMet: 0.784 ± 0.768
3.137AspAsn: 3.137 ± 2.108
2.353AspPro: 2.353 ± 1.229
1.569AspGln: 1.569 ± 1.342
3.137AspArg: 3.137 ± 1.278
1.569AspSer: 1.569 ± 0.815
2.353AspThr: 2.353 ± 0.965
2.353AspVal: 2.353 ± 1.229
0.0AspTrp: 0.0 ± 0.0
4.706AspTyr: 4.706 ± 1.93
0.0AspXaa: 0.0 ± 0.0
Glu
2.353GluAla: 2.353 ± 1.757
0.0GluCys: 0.0 ± 0.0
2.353GluAsp: 2.353 ± 1.064
7.059GluGlu: 7.059 ± 2.932
3.137GluPhe: 3.137 ± 1.115
1.569GluGly: 1.569 ± 0.895
0.0GluHis: 0.0 ± 0.0
1.569GluIle: 1.569 ± 0.801
3.137GluLys: 3.137 ± 2.51
3.137GluLeu: 3.137 ± 1.641
1.569GluMet: 1.569 ± 0.801
2.353GluAsn: 2.353 ± 0.965
7.059GluPro: 7.059 ± 3.424
1.569GluGln: 1.569 ± 0.743
2.353GluArg: 2.353 ± 0.908
2.353GluSer: 2.353 ± 0.779
3.137GluThr: 3.137 ± 1.356
1.569GluVal: 1.569 ± 0.933
1.569GluTrp: 1.569 ± 0.801
5.49GluTyr: 5.49 ± 1.693
0.0GluXaa: 0.0 ± 0.0
Phe
0.784PheAla: 0.784 ± 0.671
0.784PheCys: 0.784 ± 0.832
1.569PheAsp: 1.569 ± 0.743
3.922PheGlu: 3.922 ± 1.227
2.353PhePhe: 2.353 ± 0.987
1.569PheGly: 1.569 ± 1.084
0.0PheHis: 0.0 ± 0.0
4.706PheIle: 4.706 ± 2.962
1.569PheLys: 1.569 ± 1.321
8.627PheLeu: 8.627 ± 1.163
0.784PheMet: 0.784 ± 0.66
4.706PheAsn: 4.706 ± 0.951
1.569PhePro: 1.569 ± 0.743
1.569PheGln: 1.569 ± 1.008
3.137PheArg: 3.137 ± 1.13
3.922PheSer: 3.922 ± 1.255
3.922PheThr: 3.922 ± 0.793
1.569PheVal: 1.569 ± 0.743
0.784PheTrp: 0.784 ± 0.766
3.137PheTyr: 3.137 ± 1.775
0.0PheXaa: 0.0 ± 0.0
Gly
1.569GlyAla: 1.569 ± 0.815
0.784GlyCys: 0.784 ± 0.671
1.569GlyAsp: 1.569 ± 1.664
1.569GlyGlu: 1.569 ± 0.743
1.569GlyPhe: 1.569 ± 0.954
3.922GlyGly: 3.922 ± 1.621
1.569GlyHis: 1.569 ± 1.098
3.922GlyIle: 3.922 ± 1.057
3.922GlyLys: 3.922 ± 1.288
5.49GlyLeu: 5.49 ± 3.735
0.0GlyMet: 0.0 ± 0.0
0.784GlyAsn: 0.784 ± 0.66
9.412GlyPro: 9.412 ± 3.704
3.137GlyGln: 3.137 ± 0.656
2.353GlyArg: 2.353 ± 0.965
4.706GlySer: 4.706 ± 2.611
1.569GlyThr: 1.569 ± 0.954
3.137GlyVal: 3.137 ± 1.035
0.784GlyTrp: 0.784 ± 0.66
2.353GlyTyr: 2.353 ± 1.326
0.0GlyXaa: 0.0 ± 0.0
His
0.784HisAla: 0.784 ± 0.704
1.569HisCys: 1.569 ± 0.743
1.569HisAsp: 1.569 ± 0.815
1.569HisGlu: 1.569 ± 0.743
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.784HisLys: 0.784 ± 0.66
1.569HisLeu: 1.569 ± 0.743
0.784HisMet: 0.784 ± 0.804
2.353HisAsn: 2.353 ± 1.277
3.922HisPro: 3.922 ± 0.984
1.569HisGln: 1.569 ± 0.743
1.569HisArg: 1.569 ± 0.933
1.569HisSer: 1.569 ± 1.321
1.569HisThr: 1.569 ± 1.609
1.569HisVal: 1.569 ± 1.321
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.137IleAla: 3.137 ± 1.053
0.0IleCys: 0.0 ± 0.0
3.922IleAsp: 3.922 ± 1.227
3.137IleGlu: 3.137 ± 1.328
7.059IlePhe: 7.059 ± 3.108
1.569IleGly: 1.569 ± 0.954
0.784IleHis: 0.784 ± 0.66
4.706IleIle: 4.706 ± 1.144
5.49IleLys: 5.49 ± 1.565
7.059IleLeu: 7.059 ± 1.951
0.0IleMet: 0.0 ± 0.0
6.275IleAsn: 6.275 ± 2.229
3.922IlePro: 3.922 ± 1.78
3.137IleGln: 3.137 ± 1.25
2.353IleArg: 2.353 ± 1.18
1.569IleSer: 1.569 ± 1.231
3.137IleThr: 3.137 ± 1.053
3.922IleVal: 3.922 ± 3.302
2.353IleTrp: 2.353 ± 0.779
1.569IleTyr: 1.569 ± 1.098
0.0IleXaa: 0.0 ± 0.0
Lys
0.784LysAla: 0.784 ± 0.832
1.569LysCys: 1.569 ± 0.743
5.49LysAsp: 5.49 ± 1.565
1.569LysGlu: 1.569 ± 1.008
2.353LysPhe: 2.353 ± 1.981
3.137LysGly: 3.137 ± 1.035
0.0LysHis: 0.0 ± 0.0
4.706LysIle: 4.706 ± 3.241
3.922LysLys: 3.922 ± 1.621
0.784LysLeu: 0.784 ± 0.832
0.0LysMet: 0.0 ± 0.0
3.137LysAsn: 3.137 ± 2.197
0.784LysPro: 0.784 ± 0.671
0.0LysGln: 0.0 ± 0.0
3.922LysArg: 3.922 ± 1.01
4.706LysSer: 4.706 ± 1.163
7.059LysThr: 7.059 ± 1.747
1.569LysVal: 1.569 ± 1.664
3.137LysTrp: 3.137 ± 1.13
2.353LysTyr: 2.353 ± 0.965
0.0LysXaa: 0.0 ± 0.0
Leu
3.922LeuAla: 3.922 ± 0.793
2.353LeuCys: 2.353 ± 0.952
1.569LeuAsp: 1.569 ± 1.068
2.353LeuGlu: 2.353 ± 0.965
3.137LeuPhe: 3.137 ± 1.641
7.059LeuGly: 7.059 ± 1.82
2.353LeuHis: 2.353 ± 0.908
5.49LeuIle: 5.49 ± 1.512
1.569LeuLys: 1.569 ± 1.321
10.196LeuLeu: 10.196 ± 3.924
0.784LeuMet: 0.784 ± 0.832
6.275LeuAsn: 6.275 ± 1.592
4.706LeuPro: 4.706 ± 1.634
3.922LeuGln: 3.922 ± 1.539
6.275LeuArg: 6.275 ± 1.594
3.137LeuSer: 3.137 ± 1.984
5.49LeuThr: 5.49 ± 2.504
3.922LeuVal: 3.922 ± 1.328
0.0LeuTrp: 0.0 ± 0.0
3.922LeuTyr: 3.922 ± 1.227
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
2.353MetAsp: 2.353 ± 0.646
2.353MetGlu: 2.353 ± 1.38
1.569MetPhe: 1.569 ± 1.098
0.784MetGly: 0.784 ± 0.832
0.0MetHis: 0.0 ± 0.0
0.784MetIle: 0.784 ± 0.832
0.0MetLys: 0.0 ± 0.0
0.784MetLeu: 0.784 ± 0.804
1.569MetMet: 1.569 ± 1.321
0.0MetAsn: 0.0 ± 0.0
1.569MetPro: 1.569 ± 0.743
2.353MetGln: 2.353 ± 1.715
0.0MetArg: 0.0 ± 0.0
1.569MetSer: 1.569 ± 1.609
0.0MetThr: 0.0 ± 0.0
0.784MetVal: 0.784 ± 0.66
0.784MetTrp: 0.784 ± 0.804
0.784MetTyr: 0.784 ± 0.671
0.0MetXaa: 0.0 ± 0.0
Asn
0.784AsnAla: 0.784 ± 0.66
1.569AsnCys: 1.569 ± 0.743
1.569AsnAsp: 1.569 ± 1.408
1.569AsnGlu: 1.569 ± 0.801
5.49AsnPhe: 5.49 ± 2.601
7.843AsnGly: 7.843 ± 2.401
2.353AsnHis: 2.353 ± 0.779
3.137AsnIle: 3.137 ± 0.733
0.0AsnLys: 0.0 ± 0.0
6.275AsnLeu: 6.275 ± 1.43
1.569AsnMet: 1.569 ± 1.084
4.706AsnAsn: 4.706 ± 1.796
4.706AsnPro: 4.706 ± 1.216
1.569AsnGln: 1.569 ± 0.743
0.784AsnArg: 0.784 ± 0.66
9.412AsnSer: 9.412 ± 2.677
4.706AsnThr: 4.706 ± 1.328
3.137AsnVal: 3.137 ± 1.405
0.0AsnTrp: 0.0 ± 0.0
3.137AsnTyr: 3.137 ± 1.527
0.0AsnXaa: 0.0 ± 0.0
Pro
0.784ProAla: 0.784 ± 0.66
0.784ProCys: 0.784 ± 0.671
3.137ProAsp: 3.137 ± 1.347
5.49ProGlu: 5.49 ± 1.714
2.353ProPhe: 2.353 ± 1.24
1.569ProGly: 1.569 ± 1.664
3.922ProHis: 3.922 ± 1.318
7.843ProIle: 7.843 ± 2.286
4.706ProLys: 4.706 ± 1.163
3.922ProLeu: 3.922 ± 1.67
0.0ProMet: 0.0 ± 0.0
7.059ProAsn: 7.059 ± 1.376
0.784ProPro: 0.784 ± 0.804
1.569ProGln: 1.569 ± 1.321
8.627ProArg: 8.627 ± 2.563
5.49ProSer: 5.49 ± 2.912
3.922ProThr: 3.922 ± 0.793
0.0ProVal: 0.0 ± 0.0
0.0ProTrp: 0.0 ± 0.0
2.353ProTyr: 2.353 ± 0.908
0.0ProXaa: 0.0 ± 0.0
Gln
2.353GlnAla: 2.353 ± 1.415
0.0GlnCys: 0.0 ± 0.0
1.569GlnAsp: 1.569 ± 0.743
2.353GlnGlu: 2.353 ± 1.757
3.137GlnPhe: 3.137 ± 0.906
3.137GlnGly: 3.137 ± 0.906
2.353GlnHis: 2.353 ± 0.646
2.353GlnIle: 2.353 ± 1.326
3.922GlnLys: 3.922 ± 1.457
5.49GlnLeu: 5.49 ± 1.609
3.922GlnMet: 3.922 ± 2.04
1.569GlnAsn: 1.569 ± 1.168
3.922GlnPro: 3.922 ± 1.293
3.137GlnGln: 3.137 ± 0.656
2.353GlnArg: 2.353 ± 0.965
1.569GlnSer: 1.569 ± 1.008
3.137GlnThr: 3.137 ± 1.984
3.922GlnVal: 3.922 ± 0.653
1.569GlnTrp: 1.569 ± 0.743
1.569GlnTyr: 1.569 ± 0.763
0.0GlnXaa: 0.0 ± 0.0
Arg
2.353ArgAla: 2.353 ± 0.987
0.0ArgCys: 0.0 ± 0.0
3.137ArgAsp: 3.137 ± 1.487
2.353ArgGlu: 2.353 ± 1.886
4.706ArgPhe: 4.706 ± 1.163
3.137ArgGly: 3.137 ± 1.076
1.569ArgHis: 1.569 ± 0.763
5.49ArgIle: 5.49 ± 1.577
2.353ArgLys: 2.353 ± 0.965
3.922ArgLeu: 3.922 ± 1.508
0.784ArgMet: 0.784 ± 0.744
4.706ArgAsn: 4.706 ± 1.641
2.353ArgPro: 2.353 ± 1.26
4.706ArgGln: 4.706 ± 1.854
8.627ArgArg: 8.627 ± 1.297
6.275ArgSer: 6.275 ± 1.604
3.922ArgThr: 3.922 ± 2.1
3.922ArgVal: 3.922 ± 1.227
2.353ArgTrp: 2.353 ± 1.064
1.569ArgTyr: 1.569 ± 0.954
0.0ArgXaa: 0.0 ± 0.0
Ser
2.353SerAla: 2.353 ± 1.18
0.784SerCys: 0.784 ± 0.66
2.353SerAsp: 2.353 ± 1.981
6.275SerGlu: 6.275 ± 1.312
1.569SerPhe: 1.569 ± 0.933
7.059SerGly: 7.059 ± 3.083
0.784SerHis: 0.784 ± 0.804
5.49SerIle: 5.49 ± 1.024
6.275SerLys: 6.275 ± 1.633
3.922SerLeu: 3.922 ± 1.292
0.784SerMet: 0.784 ± 0.804
3.922SerAsn: 3.922 ± 2.217
6.275SerPro: 6.275 ± 3.649
4.706SerGln: 4.706 ± 2.2
3.922SerArg: 3.922 ± 1.67
3.922SerSer: 3.922 ± 3.259
7.059SerThr: 7.059 ± 1.782
0.784SerVal: 0.784 ± 0.66
1.569SerTrp: 1.569 ± 1.138
2.353SerTyr: 2.353 ± 1.068
0.0SerXaa: 0.0 ± 0.0
Thr
3.922ThrAla: 3.922 ± 1.289
0.0ThrCys: 0.0 ± 0.0
1.569ThrAsp: 1.569 ± 0.763
4.706ThrGlu: 4.706 ± 1.944
3.137ThrPhe: 3.137 ± 1.076
3.922ThrGly: 3.922 ± 2.406
0.784ThrHis: 0.784 ± 0.671
6.275ThrIle: 6.275 ± 0.991
0.784ThrLys: 0.784 ± 0.804
2.353ThrLeu: 2.353 ± 0.965
0.0ThrMet: 0.0 ± 0.0
3.922ThrAsn: 3.922 ± 1.293
1.569ThrPro: 1.569 ± 1.02
4.706ThrGln: 4.706 ± 1.252
5.49ThrArg: 5.49 ± 1.511
5.49ThrSer: 5.49 ± 1.737
3.922ThrThr: 3.922 ± 1.058
6.275ThrVal: 6.275 ± 1.223
0.784ThrTrp: 0.784 ± 0.66
5.49ThrTyr: 5.49 ± 2.173
0.0ThrXaa: 0.0 ± 0.0
Val
4.706ValAla: 4.706 ± 1.538
1.569ValCys: 1.569 ± 0.743
2.353ValAsp: 2.353 ± 1.26
1.569ValGlu: 1.569 ± 1.084
2.353ValPhe: 2.353 ± 1.229
0.784ValGly: 0.784 ± 0.66
0.784ValHis: 0.784 ± 0.832
0.784ValIle: 0.784 ± 0.671
2.353ValLys: 2.353 ± 1.981
1.569ValLeu: 1.569 ± 0.763
1.569ValMet: 1.569 ± 1.183
3.922ValAsn: 3.922 ± 0.793
3.922ValPro: 3.922 ± 1.663
3.137ValGln: 3.137 ± 1.487
2.353ValArg: 2.353 ± 1.326
5.49ValSer: 5.49 ± 2.749
1.569ValThr: 1.569 ± 0.933
0.784ValVal: 0.784 ± 0.66
1.569ValTrp: 1.569 ± 0.743
0.784ValTyr: 0.784 ± 0.823
0.0ValXaa: 0.0 ± 0.0
Trp
9.412TrpAla: 9.412 ± 2.533
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.784TrpHis: 0.784 ± 0.66
0.0TrpIle: 0.0 ± 0.0
1.569TrpLys: 1.569 ± 1.098
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
2.353TrpPro: 2.353 ± 0.779
1.569TrpGln: 1.569 ± 1.231
0.0TrpArg: 0.0 ± 0.0
1.569TrpSer: 1.569 ± 0.801
0.784TrpThr: 0.784 ± 0.804
0.784TrpVal: 0.784 ± 0.66
0.784TrpTrp: 0.784 ± 0.766
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.922TyrAla: 3.922 ± 1.227
1.569TyrCys: 1.569 ± 0.895
2.353TyrAsp: 2.353 ± 1.26
0.784TyrGlu: 0.784 ± 0.671
3.922TyrPhe: 3.922 ± 2.901
2.353TyrGly: 2.353 ± 1.268
2.353TyrHis: 2.353 ± 0.646
2.353TyrIle: 2.353 ± 0.646
3.137TyrLys: 3.137 ± 1.842
3.922TyrLeu: 3.922 ± 1.094
1.569TyrMet: 1.569 ± 0.753
1.569TyrAsn: 1.569 ± 1.008
1.569TyrPro: 1.569 ± 0.743
3.137TyrGln: 3.137 ± 1.075
3.922TyrArg: 3.922 ± 2.041
0.784TyrSer: 0.784 ± 0.766
0.784TyrThr: 0.784 ± 0.66
3.922TyrVal: 3.922 ± 0.911
0.0TyrTrp: 0.0 ± 0.0
1.569TyrTyr: 1.569 ± 0.956
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1276 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level
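One way such a protein-level bootstrap could work is sketched below. The page does not describe the exact procedure, so everything here is an assumption: proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the sample standard deviation of the replicates is reported as the error. All function names are hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: whole proteins are drawn with replacement, so each
    replicate contains as many proteins as the original proteome.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["ACAC", "GGGG", "ACGT"]  # toy sequences, not the real proteome
    print(dipeptide_per_mille(toy, "AC"), bootstrap_error(toy, "AC"))
```

Under this scheme a dipeptide that never occurs in any protein has a frequency of zero in every replicate, so its estimated error is zero as well.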

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski