Amino acid dipeptide frequency for Cacao swollen shoot Ghana Q virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
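As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is only a minimal example under stated assumptions: the function name `dipeptide_per_mille` is hypothetical, sequences use one-letter codes rather than the three-letter codes of the table, and pairs are counted within each protein only. How the database itself treats protein boundaries is not stated here.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping residue pairs.

    Assumption: pairs are counted within each sequence and pooled over all sequences.
    """
    counts = Counter()
    for seq in sequences:
        # zip pairs each residue with its successor: "MKKL" -> MK, KK, KL
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real viral sequences): 5 pairs in total, 2 of them are KK
freqs = dipeptide_per_mille(["MKKL", "KKA"])
print(freqs["KK"])
```

Dipeptides that never occur are simply absent from the returned dictionary, which corresponds to the 0.0 entries in the table.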

There are 400 possibilities (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
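The expected value and the red/blue marking can be sketched as follows. This is an assumption-laden illustration: the helper names are hypothetical, and the comparison is a plain threshold against the uniform expectation that ignores the error estimates. The exact rule used to colour the table is not described here.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille (1000 / alphabet_size^2)."""
    return 1000.0 / alphabet_size ** 2

def classify(value_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation (simple threshold, an assumption)."""
    if value_per_mille > expected:
        return "over"    # would be marked red
    if value_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"

baseline = expected_per_mille(20)             # 400 dipeptides -> 2.5 per mille
baseline_with_xaa = expected_per_mille(21)    # 441 dipeptides -> about 2.27 per mille

print(classify(13.662, baseline))  # GluGlu value from the table
print(classify(0.441, baseline))   # AlaCys value from the table
```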

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
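For readers who prefer fractions or percentages, the conversion is a single multiplication. The two helper names below are hypothetical and serve only to make the units explicit.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

glu_glu = 13.662  # GluGlu entry from the table, in per mille
print(per_mille_to_fraction(glu_glu))
print(per_mille_to_percent(glu_glu))
```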

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.526AlaAla: 3.526 ± 1.115
0.441AlaCys: 0.441 ± 0.871
1.763AlaAsp: 1.763 ± 0.831
3.526AlaGlu: 3.526 ± 1.909
3.085AlaPhe: 3.085 ± 1.226
3.085AlaGly: 3.085 ± 0.92
0.441AlaHis: 0.441 ± 0.208
4.848AlaIle: 4.848 ± 1.925
4.407AlaLys: 4.407 ± 0.991
3.967AlaLeu: 3.967 ± 3.497
2.204AlaMet: 2.204 ± 1.039
1.322AlaAsn: 1.322 ± 1.034
0.441AlaPro: 0.441 ± 0.208
4.848AlaGln: 4.848 ± 1.645
3.085AlaArg: 3.085 ± 1.455
3.085AlaSer: 3.085 ± 1.455
3.526AlaThr: 3.526 ± 1.13
4.407AlaVal: 4.407 ± 0.889
1.763AlaTrp: 1.763 ± 0.831
3.085AlaTyr: 3.085 ± 0.92
0.0AlaXaa: 0.0 ± 0.0
Cys
0.881CysAla: 0.881 ± 0.758
0.441CysCys: 0.441 ± 0.208
0.441CysAsp: 0.441 ± 0.208
0.0CysGlu: 0.0 ± 0.0
1.322CysPhe: 1.322 ± 0.624
0.881CysGly: 0.881 ± 0.416
0.441CysHis: 0.441 ± 0.208
0.881CysIle: 0.881 ± 1.366
2.644CysLys: 2.644 ± 1.247
0.441CysLeu: 0.441 ± 0.208
0.881CysMet: 0.881 ± 1.174
0.441CysAsn: 0.441 ± 0.208
0.881CysPro: 0.881 ± 0.416
0.881CysGln: 0.881 ± 0.416
0.881CysArg: 0.881 ± 0.416
1.763CysSer: 1.763 ± 0.831
0.441CysThr: 0.441 ± 0.208
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.441CysTyr: 0.441 ± 0.208
0.0CysXaa: 0.0 ± 0.0
Asp
1.763AspAla: 1.763 ± 0.831
0.441AspCys: 0.441 ± 0.208
2.204AspAsp: 2.204 ± 1.039
5.729AspGlu: 5.729 ± 1.573
2.204AspPhe: 2.204 ± 0.844
2.204AspGly: 2.204 ± 1.039
0.881AspHis: 0.881 ± 0.416
2.204AspIle: 2.204 ± 1.039
2.644AspLys: 2.644 ± 2.068
3.526AspLeu: 3.526 ± 2.558
0.881AspMet: 0.881 ± 0.416
4.407AspAsn: 4.407 ± 1.484
2.204AspPro: 2.204 ± 0.998
2.204AspGln: 2.204 ± 0.844
1.322AspArg: 1.322 ± 0.624
2.644AspSer: 2.644 ± 4.841
2.204AspThr: 2.204 ± 1.039
0.441AspVal: 0.441 ± 0.208
0.441AspTrp: 0.441 ± 0.871
2.204AspTyr: 2.204 ± 1.039
0.0AspXaa: 0.0 ± 0.0
Glu
5.729GluAla: 5.729 ± 4.261
0.881GluCys: 0.881 ± 0.416
7.052GluAsp: 7.052 ± 1.306
13.662GluGlu: 13.662 ± 1.303
2.204GluPhe: 2.204 ± 1.039
5.729GluGly: 5.729 ± 2.702
1.763GluHis: 1.763 ± 0.831
5.289GluIle: 5.289 ± 3.159
9.255GluLys: 9.255 ± 4.286
3.526GluLeu: 3.526 ± 1.13
0.881GluMet: 0.881 ± 0.416
3.526GluAsn: 3.526 ± 0.855
2.204GluPro: 2.204 ± 0.844
4.848GluGln: 4.848 ± 3.349
4.407GluArg: 4.407 ± 1.396
4.848GluSer: 4.848 ± 1.562
3.967GluThr: 3.967 ± 1.871
5.729GluVal: 5.729 ± 2.702
0.441GluTrp: 0.441 ± 0.208
2.644GluTyr: 2.644 ± 1.379
0.0GluXaa: 0.0 ± 0.0
Phe
2.644PheAla: 2.644 ± 1.247
1.763PheCys: 1.763 ± 0.831
1.763PheAsp: 1.763 ± 0.831
1.763PheGlu: 1.763 ± 0.92
0.0PhePhe: 0.0 ± 0.0
0.441PheGly: 0.441 ± 0.208
0.881PheHis: 0.881 ± 1.174
4.407PheIle: 4.407 ± 2.079
3.526PheLys: 3.526 ± 1.909
2.644PheLeu: 2.644 ± 0.937
0.441PheMet: 0.441 ± 0.746
1.763PheAsn: 1.763 ± 0.831
0.441PhePro: 0.441 ± 0.208
0.881PheGln: 0.881 ± 0.416
1.763PheArg: 1.763 ± 0.831
2.644PheSer: 2.644 ± 0.814
2.204PheThr: 2.204 ± 0.844
0.441PheVal: 0.441 ± 0.208
0.441PheTrp: 0.441 ± 0.208
1.763PheTyr: 1.763 ± 1.966
0.0PheXaa: 0.0 ± 0.0
Gly
2.204GlyAla: 2.204 ± 1.039
1.322GlyCys: 1.322 ± 0.624
1.322GlyAsp: 1.322 ± 0.689
5.729GlyGlu: 5.729 ± 2.702
1.763GlyPhe: 1.763 ± 1.095
2.644GlyGly: 2.644 ± 1.247
0.441GlyHis: 0.441 ± 0.208
3.967GlyIle: 3.967 ± 1.303
5.289GlyLys: 5.289 ± 2.607
4.407GlyLeu: 4.407 ± 0.991
1.763GlyMet: 1.763 ± 0.774
1.322GlyAsn: 1.322 ± 0.624
1.763GlyPro: 1.763 ± 1.095
1.322GlyGln: 1.322 ± 0.624
5.729GlyArg: 5.729 ± 1.965
3.085GlySer: 3.085 ± 2.054
1.322GlyThr: 1.322 ± 0.624
3.967GlyVal: 3.967 ± 1.871
1.322GlyTrp: 1.322 ± 0.624
3.526GlyTyr: 3.526 ± 1.663
0.0GlyXaa: 0.0 ± 0.0
His
1.322HisAla: 1.322 ± 0.624
1.763HisCys: 1.763 ± 0.831
0.441HisAsp: 0.441 ± 1.331
0.881HisGlu: 0.881 ± 0.416
0.441HisPhe: 0.441 ± 0.208
1.322HisGly: 1.322 ± 0.689
0.881HisHis: 0.881 ± 0.758
2.204HisIle: 2.204 ± 1.039
1.322HisLys: 1.322 ± 1.034
1.322HisLeu: 1.322 ± 2.501
0.0HisMet: 0.0 ± 0.0
2.644HisAsn: 2.644 ± 1.36
0.0HisPro: 0.0 ± 0.0
1.322HisGln: 1.322 ± 0.624
1.763HisArg: 1.763 ± 0.831
2.644HisSer: 2.644 ± 0.937
0.441HisThr: 0.441 ± 0.208
2.204HisVal: 2.204 ± 1.039
1.322HisTrp: 1.322 ± 1.034
0.441HisTyr: 0.441 ± 0.208
0.0HisXaa: 0.0 ± 0.0
Ile
4.848IleAla: 4.848 ± 1.645
0.881IleCys: 0.881 ± 0.416
3.526IleAsp: 3.526 ± 0.95
5.289IleGlu: 5.289 ± 1.628
2.644IlePhe: 2.644 ± 1.247
3.526IleGly: 3.526 ± 1.13
2.644IleHis: 2.644 ± 1.247
4.848IleIle: 4.848 ± 1.26
3.967IleLys: 3.967 ± 2.448
5.729IleLeu: 5.729 ± 1.695
1.763IleMet: 1.763 ± 0.831
2.644IleAsn: 2.644 ± 0.814
5.289IlePro: 5.289 ± 1.864
4.407IleGln: 4.407 ± 1.996
4.407IleArg: 4.407 ± 1.484
4.407IleSer: 4.407 ± 2.195
3.085IleThr: 3.085 ± 0.837
2.644IleVal: 2.644 ± 1.247
0.0IleTrp: 0.0 ± 0.0
0.881IleTyr: 0.881 ± 1.366
0.0IleXaa: 0.0 ± 0.0
Lys
6.611LysAla: 6.611 ± 0.371
1.763LysCys: 1.763 ± 0.831
2.644LysAsp: 2.644 ± 1.36
8.814LysGlu: 8.814 ± 4.627
3.526LysPhe: 3.526 ± 2.558
4.848LysGly: 4.848 ± 4.444
1.322LysHis: 1.322 ± 1.034
6.17LysIle: 6.17 ± 3.893
8.374LysLys: 8.374 ± 2.874
5.289LysLeu: 5.289 ± 1.477
2.644LysMet: 2.644 ± 0.834
4.848LysAsn: 4.848 ± 1.309
5.289LysPro: 5.289 ± 0.676
4.848LysGln: 4.848 ± 2.864
4.848LysArg: 4.848 ± 2.504
6.17LysSer: 6.17 ± 3.227
1.763LysThr: 1.763 ± 0.831
3.967LysVal: 3.967 ± 5.54
0.881LysTrp: 0.881 ± 0.416
1.322LysTyr: 1.322 ± 0.624
0.0LysXaa: 0.0 ± 0.0
Leu
3.085LeuAla: 3.085 ± 0.961
1.763LeuCys: 1.763 ± 0.92
3.526LeuAsp: 3.526 ± 4.016
6.17LeuGlu: 6.17 ± 3.995
1.322LeuPhe: 1.322 ± 0.624
3.967LeuGly: 3.967 ± 1.042
3.526LeuHis: 3.526 ± 1.115
4.407LeuIle: 4.407 ± 3.447
6.611LeuLys: 6.611 ± 4.159
9.255LeuLeu: 9.255 ± 6.822
0.881LeuMet: 0.881 ± 2.024
3.967LeuAsn: 3.967 ± 1.042
3.085LeuPro: 3.085 ± 1.354
3.526LeuGln: 3.526 ± 1.13
3.967LeuArg: 3.967 ± 1.303
6.17LeuSer: 6.17 ± 2.972
4.407LeuThr: 4.407 ± 2.978
4.848LeuVal: 4.848 ± 0.991
0.441LeuTrp: 0.441 ± 0.208
2.644LeuTyr: 2.644 ± 1.247
0.0LeuXaa: 0.0 ± 0.0
Met
0.881MetAla: 0.881 ± 0.416
0.0MetCys: 0.0 ± 0.0
0.881MetAsp: 0.881 ± 0.416
4.407MetGlu: 4.407 ± 2.038
0.881MetPhe: 0.881 ± 0.758
0.881MetGly: 0.881 ± 1.366
0.881MetHis: 0.881 ± 0.416
0.881MetIle: 0.881 ± 0.416
3.526MetLys: 3.526 ± 0.907
0.441MetLeu: 0.441 ± 0.208
0.881MetMet: 0.881 ± 0.416
0.441MetAsn: 0.441 ± 0.208
1.322MetPro: 1.322 ± 0.624
2.204MetGln: 2.204 ± 1.039
2.204MetArg: 2.204 ± 1.039
2.204MetSer: 2.204 ± 1.771
1.322MetThr: 1.322 ± 0.624
0.881MetVal: 0.881 ± 0.416
0.441MetTrp: 0.441 ± 0.208
0.881MetTyr: 0.881 ± 0.416
0.0MetXaa: 0.0 ± 0.0
Asn
1.322AsnAla: 1.322 ± 1.221
0.0AsnCys: 0.0 ± 0.0
2.644AsnAsp: 2.644 ± 1.247
3.085AsnGlu: 3.085 ± 0.972
2.204AsnPhe: 2.204 ± 0.844
0.881AsnGly: 0.881 ± 0.416
1.763AsnHis: 1.763 ± 3.831
2.204AsnIle: 2.204 ± 1.039
2.204AsnLys: 2.204 ± 1.039
6.611AsnLeu: 6.611 ± 3.898
0.881AsnMet: 0.881 ± 0.758
2.644AsnAsn: 2.644 ± 1.096
1.763AsnPro: 1.763 ± 0.831
2.204AsnGln: 2.204 ± 0.844
0.881AsnArg: 0.881 ± 0.416
4.407AsnSer: 4.407 ± 1.484
3.967AsnThr: 3.967 ± 1.017
2.204AsnVal: 2.204 ± 1.039
0.881AsnTrp: 0.881 ± 0.416
2.204AsnTyr: 2.204 ± 0.733
0.0AsnXaa: 0.0 ± 0.0
Pro
3.967ProAla: 3.967 ± 1.871
0.0ProCys: 0.0 ± 0.0
1.322ProAsp: 1.322 ± 0.624
4.848ProGlu: 4.848 ± 1.26
1.763ProPhe: 1.763 ± 0.831
3.085ProGly: 3.085 ± 1.455
0.881ProHis: 0.881 ± 0.416
1.763ProIle: 1.763 ± 0.831
3.526ProLys: 3.526 ± 2.558
4.848ProLeu: 4.848 ± 1.305
0.881ProMet: 0.881 ± 0.416
0.441ProAsn: 0.441 ± 0.208
1.763ProPro: 1.763 ± 0.831
2.644ProGln: 2.644 ± 1.247
1.322ProArg: 1.322 ± 0.624
1.763ProSer: 1.763 ± 0.831
2.644ProThr: 2.644 ± 0.937
1.763ProVal: 1.763 ± 0.68
0.441ProTrp: 0.441 ± 0.208
1.763ProTyr: 1.763 ± 1.095
0.0ProXaa: 0.0 ± 0.0
Gln
4.407GlnAla: 4.407 ± 2.452
0.441GlnCys: 0.441 ± 0.208
1.322GlnAsp: 1.322 ± 0.624
5.729GlnGlu: 5.729 ± 1.212
1.322GlnPhe: 1.322 ± 1.034
3.967GlnGly: 3.967 ± 1.871
3.526GlnHis: 3.526 ± 0.95
3.526GlnIle: 3.526 ± 1.663
4.407GlnLys: 4.407 ± 1.876
6.611GlnLeu: 6.611 ± 5.849
2.204GlnMet: 2.204 ± 1.039
1.763GlnAsn: 1.763 ± 1.966
3.967GlnPro: 3.967 ± 1.017
5.729GlnGln: 5.729 ± 1.114
2.204GlnArg: 2.204 ± 1.039
2.204GlnSer: 2.204 ± 0.998
2.644GlnThr: 2.644 ± 0.814
1.763GlnVal: 1.763 ± 0.92
1.322GlnTrp: 1.322 ± 0.624
3.085GlnTyr: 3.085 ± 0.837
0.0GlnXaa: 0.0 ± 0.0
Arg
1.322ArgAla: 1.322 ± 0.624
0.441ArgCys: 0.441 ± 0.208
2.644ArgAsp: 2.644 ± 1.247
3.526ArgGlu: 3.526 ± 2.191
1.763ArgPhe: 1.763 ± 0.68
1.763ArgGly: 1.763 ± 0.831
1.322ArgHis: 1.322 ± 0.624
5.729ArgIle: 5.729 ± 1.035
7.052ArgLys: 7.052 ± 1.521
3.967ArgLeu: 3.967 ± 1.042
2.644ArgMet: 2.644 ± 1.247
2.644ArgAsn: 2.644 ± 1.247
2.644ArgPro: 2.644 ± 1.247
2.204ArgGln: 2.204 ± 0.733
4.407ArgArg: 4.407 ± 2.079
6.17ArgSer: 6.17 ± 1.943
2.204ArgThr: 2.204 ± 1.039
2.644ArgVal: 2.644 ± 2.068
1.322ArgTrp: 1.322 ± 0.624
0.441ArgTyr: 0.441 ± 0.208
0.0ArgXaa: 0.0 ± 0.0
Ser
3.526SerAla: 3.526 ± 1.115
1.322SerCys: 1.322 ± 1.221
2.644SerAsp: 2.644 ± 0.937
4.848SerGlu: 4.848 ± 2.282
2.204SerPhe: 2.204 ± 1.039
3.085SerGly: 3.085 ± 1.226
1.763SerHis: 1.763 ± 0.68
4.407SerIle: 4.407 ± 1.153
7.052SerLys: 7.052 ± 4.21
4.848SerLeu: 4.848 ± 2.798
2.204SerMet: 2.204 ± 0.733
4.407SerAsn: 4.407 ± 1.876
2.644SerPro: 2.644 ± 1.247
5.289SerGln: 5.289 ± 1.853
6.611SerArg: 6.611 ± 2.027
7.052SerSer: 7.052 ± 4.237
4.848SerThr: 4.848 ± 1.672
2.644SerVal: 2.644 ± 1.247
0.881SerTrp: 0.881 ± 0.416
1.763SerTyr: 1.763 ± 1.516
0.0SerXaa: 0.0 ± 0.0
Thr
1.763ThrAla: 1.763 ± 0.92
1.322ThrCys: 1.322 ± 0.624
3.085ThrAsp: 3.085 ± 1.455
3.085ThrGlu: 3.085 ± 0.837
1.763ThrPhe: 1.763 ± 0.831
6.17ThrGly: 6.17 ± 1.746
0.441ThrHis: 0.441 ± 0.208
3.967ThrIle: 3.967 ± 1.017
3.526ThrLys: 3.526 ± 2.166
1.763ThrLeu: 1.763 ± 0.68
2.204ThrMet: 2.204 ± 1.039
1.763ThrAsn: 1.763 ± 0.92
1.763ThrPro: 1.763 ± 0.831
3.967ThrGln: 3.967 ± 1.042
3.085ThrArg: 3.085 ± 1.455
5.289ThrSer: 5.289 ± 2.494
5.289ThrThr: 5.289 ± 2.494
2.204ThrVal: 2.204 ± 1.039
0.441ThrTrp: 0.441 ± 0.208
0.881ThrTyr: 0.881 ± 0.416
0.0ThrXaa: 0.0 ± 0.0
Val
3.967ValAla: 3.967 ± 1.871
0.0ValCys: 0.0 ± 0.0
2.204ValAsp: 2.204 ± 0.998
3.526ValGlu: 3.526 ± 1.841
1.322ValPhe: 1.322 ± 0.624
3.526ValGly: 3.526 ± 1.841
0.0ValHis: 0.0 ± 0.0
2.644ValIle: 2.644 ± 1.36
2.204ValLys: 2.204 ± 0.733
2.204ValLeu: 2.204 ± 0.998
0.881ValMet: 0.881 ± 0.416
1.322ValAsn: 1.322 ± 0.624
2.644ValPro: 2.644 ± 0.937
5.729ValGln: 5.729 ± 0.763
2.204ValArg: 2.204 ± 0.733
3.967ValSer: 3.967 ± 0.792
5.729ValThr: 5.729 ± 1.573
2.204ValVal: 2.204 ± 0.733
0.0ValTrp: 0.0 ± 0.0
1.322ValTyr: 1.322 ± 0.624
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
2.204TrpGlu: 2.204 ± 1.039
0.441TrpPhe: 0.441 ± 0.208
1.322TrpGly: 1.322 ± 0.624
0.0TrpHis: 0.0 ± 0.0
0.441TrpIle: 0.441 ± 0.208
2.204TrpLys: 2.204 ± 1.039
1.763TrpLeu: 1.763 ± 0.68
0.441TrpMet: 0.441 ± 0.208
0.441TrpAsn: 0.441 ± 0.208
0.0TrpPro: 0.0 ± 0.0
0.881TrpGln: 0.881 ± 0.416
0.441TrpArg: 0.441 ± 0.208
0.881TrpSer: 0.881 ± 0.416
1.322TrpThr: 1.322 ± 0.624
1.322TrpVal: 1.322 ± 1.034
0.441TrpTrp: 0.441 ± 0.208
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.085TyrAla: 3.085 ± 1.226
0.0TyrCys: 0.0 ± 0.0
1.322TyrAsp: 1.322 ± 1.034
1.763TyrGlu: 1.763 ± 0.831
0.441TyrPhe: 0.441 ± 0.208
0.881TyrGly: 0.881 ± 0.758
0.881TyrHis: 0.881 ± 0.416
2.644TyrIle: 2.644 ± 1.247
2.644TyrLys: 2.644 ± 0.937
4.407TyrLeu: 4.407 ± 0.78
0.441TyrMet: 0.441 ± 0.208
2.204TyrAsn: 2.204 ± 1.039
1.322TyrPro: 1.322 ± 0.624
2.204TyrGln: 2.204 ± 1.51
1.322TyrArg: 1.322 ± 0.624
2.644TyrSer: 2.644 ± 1.247
0.441TyrThr: 0.441 ± 0.208
1.322TyrVal: 1.322 ± 1.845
1.322TyrTrp: 1.322 ± 0.624
1.322TyrTyr: 1.322 ± 0.689
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2270 amino acids)
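Each table entry above follows the pattern "value, dipeptide name, value ± error". A minimal sketch for reading such lines into structured data is given below. The function name `parse_row` and the regular expression are assumptions based only on the layout shown here, not on any official format description.

```python
import re

# cell value, two three-letter residue codes, then "value ± error"
ROW = re.compile(
    r"^(?P<cell>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<value>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)

def parse_row(line):
    """Return (first, second, value, error) or None for lines that are not table entries."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    return (
        match["first"],
        match["second"],
        float(match["value"]),
        float(match["error"]),
    )

print(parse_row("3.526AlaAla: 3.526 ± 1.115"))
print(parse_row("Ala"))  # row label, not an entry
```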

Note: The error has been estimated by bootstrapping (×100) at the protein level.
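Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. This is a generic illustration, not the database's actual procedure. The function names, the use of one-letter codes, the fixed seed, and the choice of the population standard deviation as the spread measure are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread (in per mille) of the frequency of `pair` over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)

# Toy sequences (not real viral proteins)
error = bootstrap_error(["MKKLA", "AKKA", "MLLA", "GKKKG"], "KK")
print(error)
```

With only a handful of proteins, as in this proteome, each resample can differ strongly from the original set, which is consistent with some errors in the table being as large as the values themselves.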

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski