Amino acid dipepetide frequency for White clover mosaic virus (strain O) (WCMV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.208AlaAla: 5.208 ± 1.781
1.042AlaCys: 1.042 ± 1.232
3.646AlaAsp: 3.646 ± 1.299
3.125AlaGlu: 3.125 ± 2.459
5.729AlaPhe: 5.729 ± 2.656
4.167AlaGly: 4.167 ± 1.208
1.042AlaHis: 1.042 ± 0.82
5.208AlaIle: 5.208 ± 1.213
5.208AlaLys: 5.208 ± 2.217
10.938AlaLeu: 10.938 ± 2.938
1.042AlaMet: 1.042 ± 0.562
3.125AlaAsn: 3.125 ± 1.072
2.604AlaPro: 2.604 ± 1.507
1.562AlaGln: 1.562 ± 1.005
3.125AlaArg: 3.125 ± 0.658
3.125AlaSer: 3.125 ± 3.249
4.688AlaThr: 4.688 ± 1.642
3.646AlaVal: 3.646 ± 1.373
0.0AlaTrp: 0.0 ± 0.0
3.646AlaTyr: 3.646 ± 1.479
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.042CysAsp: 1.042 ± 1.581
0.521CysGlu: 0.521 ± 0.281
1.042CysPhe: 1.042 ± 0.562
1.042CysGly: 1.042 ± 0.562
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.521CysLys: 0.521 ± 0.859
1.562CysLeu: 1.562 ± 0.768
0.521CysMet: 0.521 ± 0.973
0.521CysAsn: 0.521 ± 0.859
0.521CysPro: 0.521 ± 0.281
1.562CysGln: 1.562 ± 0.832
1.042CysArg: 1.042 ± 0.562
1.042CysSer: 1.042 ± 0.562
2.604CysThr: 2.604 ± 1.598
1.042CysVal: 1.042 ± 0.888
0.521CysTrp: 0.521 ± 0.973
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.646AspAla: 3.646 ± 0.746
1.562AspCys: 1.562 ± 0.843
1.042AspAsp: 1.042 ± 0.562
3.646AspGlu: 3.646 ± 1.479
3.646AspPhe: 3.646 ± 1.409
2.083AspGly: 2.083 ± 2.218
3.125AspHis: 3.125 ± 2.887
4.688AspIle: 4.688 ± 1.131
0.0AspLys: 0.0 ± 0.0
6.771AspLeu: 6.771 ± 1.808
0.521AspMet: 0.521 ± 0.281
3.125AspAsn: 3.125 ± 1.902
3.646AspPro: 3.646 ± 2.155
0.521AspGln: 0.521 ± 0.281
1.562AspArg: 1.562 ± 0.843
4.688AspSer: 4.688 ± 1.221
3.646AspThr: 3.646 ± 0.746
2.604AspVal: 2.604 ± 1.541
1.042AspTrp: 1.042 ± 0.562
2.083AspTyr: 2.083 ± 1.124
0.0AspXaa: 0.0 ± 0.0
Glu
4.688GluAla: 4.688 ± 1.642
0.0GluCys: 0.0 ± 0.0
2.604GluAsp: 2.604 ± 1.405
4.688GluGlu: 4.688 ± 1.221
2.083GluPhe: 2.083 ± 0.867
1.042GluGly: 1.042 ± 0.562
1.042GluHis: 1.042 ± 0.562
6.771GluIle: 6.771 ± 2.922
4.688GluLys: 4.688 ± 2.529
3.646GluLeu: 3.646 ± 1.49
0.521GluMet: 0.521 ± 0.281
3.125GluAsn: 3.125 ± 1.686
3.646GluPro: 3.646 ± 1.479
0.521GluGln: 0.521 ± 0.281
2.604GluArg: 2.604 ± 1.036
3.646GluSer: 3.646 ± 0.746
2.083GluThr: 2.083 ± 0.771
2.604GluVal: 2.604 ± 1.036
1.042GluTrp: 1.042 ± 0.562
1.042GluTyr: 1.042 ± 1.289
0.0GluXaa: 0.0 ± 0.0
Phe
5.208PheAla: 5.208 ± 2.896
1.562PheCys: 1.562 ± 0.745
4.167PheAsp: 4.167 ± 1.942
2.604PheGlu: 2.604 ± 2.062
2.083PhePhe: 2.083 ± 0.812
1.562PheGly: 1.562 ± 1.938
3.646PheHis: 3.646 ± 1.967
4.167PheIle: 4.167 ± 1.682
2.604PheLys: 2.604 ± 1.497
4.167PheLeu: 4.167 ± 1.606
1.042PheMet: 1.042 ± 0.562
4.688PheAsn: 4.688 ± 2.529
2.083PhePro: 2.083 ± 1.124
2.604PheGln: 2.604 ± 1.573
0.521PheArg: 0.521 ± 0.281
1.562PheSer: 1.562 ± 0.832
4.688PheThr: 4.688 ± 1.635
1.562PheVal: 1.562 ± 0.832
0.521PheTrp: 0.521 ± 0.281
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.646GlyAla: 3.646 ± 1.409
1.562GlyCys: 1.562 ± 1.475
3.646GlyAsp: 3.646 ± 1.025
1.562GlyGlu: 1.562 ± 0.832
2.083GlyPhe: 2.083 ± 0.812
2.083GlyGly: 2.083 ± 0.868
1.042GlyHis: 1.042 ± 0.562
1.562GlyIle: 1.562 ± 0.843
2.604GlyLys: 2.604 ± 1.507
4.167GlyLeu: 4.167 ± 4.087
0.521GlyMet: 0.521 ± 0.281
0.521GlyAsn: 0.521 ± 0.281
3.646GlyPro: 3.646 ± 1.805
2.083GlyGln: 2.083 ± 0.867
0.521GlyArg: 0.521 ± 0.973
3.125GlySer: 3.125 ± 1.536
4.167GlyThr: 4.167 ± 2.134
2.083GlyVal: 2.083 ± 2.627
0.521GlyTrp: 0.521 ± 0.281
2.083GlyTyr: 2.083 ± 0.812
0.0GlyXaa: 0.0 ± 0.0
His
2.604HisAla: 2.604 ± 1.036
1.042HisCys: 1.042 ± 1.586
1.562HisAsp: 1.562 ± 0.843
2.083HisGlu: 2.083 ± 1.124
1.562HisPhe: 1.562 ± 0.843
3.646HisGly: 3.646 ± 1.748
1.562HisHis: 1.562 ± 0.768
2.083HisIle: 2.083 ± 0.867
0.521HisLys: 0.521 ± 0.281
3.646HisLeu: 3.646 ± 1.351
0.521HisMet: 0.521 ± 0.901
2.604HisAsn: 2.604 ± 1.699
2.604HisPro: 2.604 ± 0.89
2.604HisGln: 2.604 ± 0.684
2.083HisArg: 2.083 ± 0.771
2.083HisSer: 2.083 ± 1.777
1.042HisThr: 1.042 ± 0.765
0.521HisVal: 0.521 ± 0.281
0.0HisTrp: 0.0 ± 0.0
1.042HisTyr: 1.042 ± 1.289
0.0HisXaa: 0.0 ± 0.0
Ile
4.167IleAla: 4.167 ± 1.053
0.0IleCys: 0.0 ± 0.0
1.042IleAsp: 1.042 ± 0.562
7.812IleGlu: 7.812 ± 3.386
3.646IlePhe: 3.646 ± 0.897
4.688IleGly: 4.688 ± 2.15
2.083IleHis: 2.083 ± 0.812
3.646IleIle: 3.646 ± 1.777
5.208IleLys: 5.208 ± 0.625
10.417IleLeu: 10.417 ± 3.201
2.083IleMet: 2.083 ± 1.539
4.167IleAsn: 4.167 ± 1.71
4.688IlePro: 4.688 ± 0.533
2.083IleGln: 2.083 ± 1.124
3.125IleArg: 3.125 ± 2.546
5.208IleSer: 5.208 ± 5.975
6.25IleThr: 6.25 ± 1.908
2.604IleVal: 2.604 ± 1.441
0.521IleTrp: 0.521 ± 0.281
1.562IleTyr: 1.562 ± 0.843
0.0IleXaa: 0.0 ± 0.0
Lys
4.167LysAla: 4.167 ± 1.727
0.0LysCys: 0.0 ± 0.0
2.083LysAsp: 2.083 ± 0.868
1.562LysGlu: 1.562 ± 0.745
2.604LysPhe: 2.604 ± 0.89
1.562LysGly: 1.562 ± 0.843
2.604LysHis: 2.604 ± 0.986
6.771LysIle: 6.771 ± 1.263
2.604LysLys: 2.604 ± 1.405
4.688LysLeu: 4.688 ± 1.779
2.083LysMet: 2.083 ± 1.077
2.083LysAsn: 2.083 ± 1.124
4.167LysPro: 4.167 ± 1.682
3.646LysGln: 3.646 ± 1.49
2.083LysArg: 2.083 ± 0.812
5.729LysSer: 5.729 ± 0.83
7.292LysThr: 7.292 ± 2.709
4.167LysVal: 4.167 ± 1.053
0.0LysTrp: 0.0 ± 0.0
1.562LysTyr: 1.562 ± 1.381
0.0LysXaa: 0.0 ± 0.0
Leu
9.896LeuAla: 9.896 ± 4.611
3.125LeuCys: 3.125 ± 0.813
8.333LeuAsp: 8.333 ± 2.417
4.688LeuGlu: 4.688 ± 1.983
5.729LeuPhe: 5.729 ± 1.683
4.688LeuGly: 4.688 ± 1.903
4.688LeuHis: 4.688 ± 3.844
6.25LeuIle: 6.25 ± 4.15
8.333LeuLys: 8.333 ± 2.692
8.333LeuLeu: 8.333 ± 2.881
1.562LeuMet: 1.562 ± 0.745
3.125LeuAsn: 3.125 ± 1.245
8.333LeuPro: 8.333 ± 0.956
3.646LeuGln: 3.646 ± 1.967
3.125LeuArg: 3.125 ± 1.686
5.729LeuSer: 5.729 ± 2.303
5.208LeuThr: 5.208 ± 1.213
4.167LeuVal: 4.167 ± 3.742
1.042LeuTrp: 1.042 ± 0.562
2.604LeuTyr: 2.604 ± 1.036
0.0LeuXaa: 0.0 ± 0.0
Met
1.562MetAla: 1.562 ± 0.745
0.521MetCys: 0.521 ± 1.727
1.042MetAsp: 1.042 ± 1.936
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
0.521MetGly: 0.521 ± 0.281
0.0MetHis: 0.0 ± 0.0
2.083MetIle: 2.083 ± 1.124
1.042MetLys: 1.042 ± 0.562
1.042MetLeu: 1.042 ± 0.82
0.0MetMet: 0.0 ± 0.0
1.562MetAsn: 1.562 ± 0.843
1.562MetPro: 1.562 ± 1.381
1.042MetGln: 1.042 ± 0.562
1.562MetArg: 1.562 ± 0.843
2.604MetSer: 2.604 ± 1.405
0.521MetThr: 0.521 ± 0.281
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.521MetTyr: 0.521 ± 0.973
0.0MetXaa: 0.0 ± 0.0
Asn
2.083AsnAla: 2.083 ± 0.867
0.521AsnCys: 0.521 ± 0.281
2.604AsnAsp: 2.604 ± 0.89
2.604AsnGlu: 2.604 ± 1.405
1.562AsnPhe: 1.562 ± 0.843
1.042AsnGly: 1.042 ± 0.82
2.083AsnHis: 2.083 ± 0.771
4.688AsnIle: 4.688 ± 1.221
2.604AsnLys: 2.604 ± 0.986
6.25AsnLeu: 6.25 ± 1.611
1.042AsnMet: 1.042 ± 0.562
2.604AsnAsn: 2.604 ± 0.986
6.771AsnPro: 6.771 ± 1.71
2.604AsnGln: 2.604 ± 0.821
1.562AsnArg: 1.562 ± 0.832
2.083AsnSer: 2.083 ± 0.771
5.729AsnThr: 5.729 ± 1.529
2.083AsnVal: 2.083 ± 1.639
0.0AsnTrp: 0.0 ± 0.0
1.562AsnTyr: 1.562 ± 0.843
0.0AsnXaa: 0.0 ± 0.0
Pro
3.646ProAla: 3.646 ± 4.14
1.562ProCys: 1.562 ± 0.843
5.729ProAsp: 5.729 ± 2.475
3.646ProGlu: 3.646 ± 1.479
4.167ProPhe: 4.167 ± 2.06
0.521ProGly: 0.521 ± 0.281
1.042ProHis: 1.042 ± 0.765
3.646ProIle: 3.646 ± 0.746
3.125ProLys: 3.125 ± 1.162
3.125ProLeu: 3.125 ± 1.862
0.521ProMet: 0.521 ± 0.671
5.208ProAsn: 5.208 ± 2.887
3.646ProPro: 3.646 ± 2.578
3.646ProGln: 3.646 ± 1.025
2.083ProArg: 2.083 ± 1.124
5.208ProSer: 5.208 ± 1.752
5.729ProThr: 5.729 ± 2.475
2.604ProVal: 2.604 ± 1.036
1.042ProTrp: 1.042 ± 0.562
2.604ProTyr: 2.604 ± 2.36
0.0ProXaa: 0.0 ± 0.0
Gln
5.729GlnAla: 5.729 ± 1.896
0.521GlnCys: 0.521 ± 0.281
1.562GlnAsp: 1.562 ± 0.768
1.042GlnGlu: 1.042 ± 0.562
1.562GlnPhe: 1.562 ± 1.005
1.562GlnGly: 1.562 ± 0.843
1.042GlnHis: 1.042 ± 0.888
2.083GlnIle: 2.083 ± 0.868
1.562GlnLys: 1.562 ± 0.832
5.208GlnLeu: 5.208 ± 1.37
0.521GlnMet: 0.521 ± 0.692
1.042GlnAsn: 1.042 ± 0.888
2.083GlnPro: 2.083 ± 0.867
1.042GlnGln: 1.042 ± 0.562
1.042GlnArg: 1.042 ± 0.888
3.125GlnSer: 3.125 ± 1.245
3.646GlnThr: 3.646 ± 1.29
1.042GlnVal: 1.042 ± 0.82
1.562GlnTrp: 1.562 ± 0.768
1.562GlnTyr: 1.562 ± 0.843
0.0GlnXaa: 0.0 ± 0.0
Arg
4.167ArgAla: 4.167 ± 1.208
0.0ArgCys: 0.0 ± 0.0
3.125ArgAsp: 3.125 ± 1.072
2.604ArgGlu: 2.604 ± 1.405
0.521ArgPhe: 0.521 ± 0.859
2.604ArgGly: 2.604 ± 1.62
1.562ArgHis: 1.562 ± 1.602
2.083ArgIle: 2.083 ± 0.868
2.604ArgLys: 2.604 ± 0.684
2.083ArgLeu: 2.083 ± 0.867
0.0ArgMet: 0.0 ± 0.0
2.083ArgAsn: 2.083 ± 1.124
1.042ArgPro: 1.042 ± 0.888
2.083ArgGln: 2.083 ± 0.771
2.083ArgArg: 2.083 ± 1.206
2.604ArgSer: 2.604 ± 0.89
1.562ArgThr: 1.562 ± 0.768
1.042ArgVal: 1.042 ± 0.562
0.0ArgTrp: 0.0 ± 0.0
2.604ArgTyr: 2.604 ± 1.405
0.0ArgXaa: 0.0 ± 0.0
Ser
1.042SerAla: 1.042 ± 0.82
0.521SerCys: 0.521 ± 0.281
3.646SerAsp: 3.646 ± 0.739
3.646SerGlu: 3.646 ± 1.805
3.646SerPhe: 3.646 ± 1.614
3.125SerGly: 3.125 ± 3.061
3.646SerHis: 3.646 ± 0.897
6.771SerIle: 6.771 ± 3.023
6.25SerLys: 6.25 ± 1.928
5.729SerLeu: 5.729 ± 2.228
1.562SerMet: 1.562 ± 1.475
4.167SerAsn: 4.167 ± 1.318
5.208SerPro: 5.208 ± 0.903
1.562SerGln: 1.562 ± 0.843
1.042SerArg: 1.042 ± 0.82
5.729SerSer: 5.729 ± 2.817
2.083SerThr: 2.083 ± 0.812
4.167SerVal: 4.167 ± 3.507
1.042SerTrp: 1.042 ± 0.82
2.604SerTyr: 2.604 ± 1.405
0.0SerXaa: 0.0 ± 0.0
Thr
3.646ThrAla: 3.646 ± 1.49
1.042ThrCys: 1.042 ± 0.765
4.167ThrAsp: 4.167 ± 1.201
3.125ThrGlu: 3.125 ± 1.245
4.688ThrPhe: 4.688 ± 1.779
2.083ThrGly: 2.083 ± 0.771
2.604ThrHis: 2.604 ± 1.405
5.729ThrIle: 5.729 ± 2.645
4.167ThrLys: 4.167 ± 1.626
9.375ThrLeu: 9.375 ± 2.68
1.042ThrMet: 1.042 ± 0.562
3.125ThrAsn: 3.125 ± 1.489
3.646ThrPro: 3.646 ± 1.49
2.083ThrGln: 2.083 ± 0.867
5.208ThrArg: 5.208 ± 2.374
3.646ThrSer: 3.646 ± 1.362
6.771ThrThr: 6.771 ± 2.67
3.125ThrVal: 3.125 ± 1.348
1.042ThrTrp: 1.042 ± 0.82
4.688ThrTyr: 4.688 ± 1.048
0.0ThrXaa: 0.0 ± 0.0
Val
1.562ValAla: 1.562 ± 1.777
0.0ValCys: 0.0 ± 0.0
1.562ValAsp: 1.562 ± 0.768
1.562ValGlu: 1.562 ± 1.938
2.083ValPhe: 2.083 ± 1.418
2.604ValGly: 2.604 ± 1.2
2.083ValHis: 2.083 ± 0.867
4.688ValIle: 4.688 ± 1.635
5.729ValLys: 5.729 ± 3.091
5.729ValLeu: 5.729 ± 3.53
0.521ValMet: 0.521 ± 0.281
3.125ValAsn: 3.125 ± 1.489
1.042ValPro: 1.042 ± 1.581
1.562ValGln: 1.562 ± 0.843
1.562ValArg: 1.562 ± 0.843
2.083ValSer: 2.083 ± 1.824
1.562ValThr: 1.562 ± 1.602
2.604ValVal: 2.604 ± 1.62
0.521ValTrp: 0.521 ± 0.973
2.083ValTyr: 2.083 ± 3.163
0.0ValXaa: 0.0 ± 0.0
Trp
1.562TrpAla: 1.562 ± 0.745
0.0TrpCys: 0.0 ± 0.0
0.521TrpAsp: 0.521 ± 0.973
1.042TrpGlu: 1.042 ± 0.562
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.521TrpLys: 0.521 ± 0.281
1.562TrpLeu: 1.562 ± 0.843
0.521TrpMet: 0.521 ± 0.281
1.042TrpAsn: 1.042 ± 0.82
0.521TrpPro: 0.521 ± 0.859
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.042TrpSer: 1.042 ± 0.82
2.083TrpThr: 2.083 ± 1.124
0.521TrpVal: 0.521 ± 0.281
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.646TyrAla: 3.646 ± 1.967
0.521TyrCys: 0.521 ± 0.281
0.521TyrAsp: 0.521 ± 0.281
0.521TyrGlu: 0.521 ± 0.281
2.604TyrPhe: 2.604 ± 3.402
2.604TyrGly: 2.604 ± 1.036
0.521TyrHis: 0.521 ± 0.281
2.083TyrIle: 2.083 ± 1.124
1.562TyrLys: 1.562 ± 1.381
4.167TyrLeu: 4.167 ± 2.436
0.521TyrMet: 0.521 ± 0.281
1.042TyrAsn: 1.042 ± 0.562
1.042TyrPro: 1.042 ± 0.82
2.604TyrGln: 2.604 ± 1.573
0.521TyrArg: 0.521 ± 0.859
3.125TyrSer: 3.125 ± 1.245
3.646TyrThr: 3.646 ± 1.245
2.083TyrVal: 2.083 ± 1.124
0.521TyrTrp: 0.521 ± 0.281
0.521TyrTyr: 0.521 ± 0.281
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1921 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski