Amino acid dipepetide frequency for Clerodendron golden mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.943AlaAla: 2.943 ± 1.179
1.177AlaCys: 1.177 ± 0.742
0.589AlaAsp: 0.589 ± 0.553
3.531AlaGlu: 3.531 ± 0.793
0.589AlaPhe: 0.589 ± 0.504
2.354AlaGly: 2.354 ± 1.193
2.943AlaHis: 2.943 ± 0.86
0.589AlaIle: 0.589 ± 0.497
5.297AlaLys: 5.297 ± 0.901
4.709AlaLeu: 4.709 ± 1.795
0.0AlaMet: 0.0 ± 0.0
2.943AlaAsn: 2.943 ± 0.84
0.589AlaPro: 0.589 ± 0.497
2.943AlaGln: 2.943 ± 1.243
5.297AlaArg: 5.297 ± 1.554
7.063AlaSer: 7.063 ± 2.32
4.12AlaThr: 4.12 ± 1.572
1.177AlaVal: 1.177 ± 0.85
0.589AlaTrp: 0.589 ± 0.497
2.354AlaTyr: 2.354 ± 1.029
0.0AlaXaa: 0.0 ± 0.0
Cys
0.589CysAla: 0.589 ± 0.599
0.0CysCys: 0.0 ± 0.0
1.177CysAsp: 1.177 ± 1.007
0.589CysGlu: 0.589 ± 0.5
0.589CysPhe: 0.589 ± 0.733
1.177CysGly: 1.177 ± 0.672
0.589CysHis: 0.589 ± 0.618
2.943CysIle: 2.943 ± 1.199
1.177CysLys: 1.177 ± 0.64
0.589CysLeu: 0.589 ± 0.553
1.177CysMet: 1.177 ± 0.89
2.943CysAsn: 2.943 ± 1.211
1.177CysPro: 1.177 ± 1.197
0.589CysGln: 0.589 ± 0.497
1.177CysArg: 1.177 ± 0.603
1.766CysSer: 1.766 ± 0.719
0.589CysThr: 0.589 ± 0.5
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.177AspAla: 1.177 ± 0.993
0.589AspCys: 0.589 ± 0.497
1.177AspAsp: 1.177 ± 0.603
0.589AspGlu: 0.589 ± 0.5
2.354AspPhe: 2.354 ± 0.881
2.943AspGly: 2.943 ± 1.521
1.177AspHis: 1.177 ± 0.858
2.943AspIle: 2.943 ± 1.239
1.766AspLys: 1.766 ± 1.001
5.297AspLeu: 5.297 ± 1.526
0.589AspMet: 0.589 ± 0.599
1.177AspAsn: 1.177 ± 0.602
2.943AspPro: 2.943 ± 1.604
1.177AspGln: 1.177 ± 0.826
4.709AspArg: 4.709 ± 1.342
5.297AspSer: 5.297 ± 1.376
2.943AspThr: 2.943 ± 0.884
5.297AspVal: 5.297 ± 1.824
1.177AspTrp: 1.177 ± 0.672
1.177AspTyr: 1.177 ± 0.693
0.0AspXaa: 0.0 ± 0.0
Glu
4.12GluAla: 4.12 ± 1.945
1.177GluCys: 1.177 ± 0.943
0.589GluAsp: 0.589 ± 0.497
2.943GluGlu: 2.943 ± 1.46
2.943GluPhe: 2.943 ± 1.326
1.766GluGly: 1.766 ± 1.031
0.0GluHis: 0.0 ± 0.0
1.177GluIle: 1.177 ± 0.943
0.589GluLys: 0.589 ± 0.497
4.12GluLeu: 4.12 ± 1.762
0.589GluMet: 0.589 ± 0.553
4.12GluAsn: 4.12 ± 1.796
2.354GluPro: 2.354 ± 0.813
1.766GluGln: 1.766 ± 1.041
0.0GluArg: 0.0 ± 0.0
2.354GluSer: 2.354 ± 1.511
2.943GluThr: 2.943 ± 1.243
2.943GluVal: 2.943 ± 1.039
1.177GluTrp: 1.177 ± 0.672
2.354GluTyr: 2.354 ± 1.386
0.0GluXaa: 0.0 ± 0.0
Phe
1.177PheAla: 1.177 ± 0.826
1.177PheCys: 1.177 ± 0.602
1.177PheAsp: 1.177 ± 0.64
2.354PheGlu: 2.354 ± 0.73
1.177PhePhe: 1.177 ± 0.64
1.177PheGly: 1.177 ± 0.647
1.766PheHis: 1.766 ± 1.077
2.354PheIle: 2.354 ± 1.168
4.709PheLys: 4.709 ± 2.096
4.709PheLeu: 4.709 ± 2.156
0.589PheMet: 0.589 ± 0.497
2.943PheAsn: 2.943 ± 0.671
1.177PhePro: 1.177 ± 0.789
2.354PheGln: 2.354 ± 1.014
3.531PheArg: 3.531 ± 1.412
3.531PheSer: 3.531 ± 1.692
2.354PheThr: 2.354 ± 1.189
1.766PheVal: 1.766 ± 1.031
1.177PheTrp: 1.177 ± 0.826
1.766PheTyr: 1.766 ± 1.119
0.0PheXaa: 0.0 ± 0.0
Gly
2.354GlyAla: 2.354 ± 1.555
1.766GlyCys: 1.766 ± 0.817
3.531GlyAsp: 3.531 ± 1.103
4.12GlyGlu: 4.12 ± 0.86
0.589GlyPhe: 0.589 ± 0.618
2.354GlyGly: 2.354 ± 1.014
1.177GlyHis: 1.177 ± 0.603
1.766GlyIle: 1.766 ± 0.671
4.12GlyLys: 4.12 ± 1.689
3.531GlyLeu: 3.531 ± 1.329
0.589GlyMet: 0.589 ± 0.489
2.354GlyAsn: 2.354 ± 0.836
4.709GlyPro: 4.709 ± 1.939
2.943GlyGln: 2.943 ± 1.145
2.354GlyArg: 2.354 ± 0.742
4.709GlySer: 4.709 ± 1.999
3.531GlyThr: 3.531 ± 1.339
2.354GlyVal: 2.354 ± 1.445
0.0GlyTrp: 0.0 ± 0.0
0.589GlyTyr: 0.589 ± 0.599
0.0GlyXaa: 0.0 ± 0.0
His
1.177HisAla: 1.177 ± 0.647
1.177HisCys: 1.177 ± 0.868
1.766HisAsp: 1.766 ± 0.898
1.766HisGlu: 1.766 ± 1.129
2.354HisPhe: 2.354 ± 1.023
1.177HisGly: 1.177 ± 0.868
2.943HisHis: 2.943 ± 2.553
0.589HisIle: 0.589 ± 0.553
1.177HisLys: 1.177 ± 0.943
2.354HisLeu: 2.354 ± 1.051
0.0HisMet: 0.0 ± 0.0
3.531HisAsn: 3.531 ± 1.093
1.766HisPro: 1.766 ± 0.746
1.766HisGln: 1.766 ± 0.991
2.354HisArg: 2.354 ± 0.969
1.766HisSer: 1.766 ± 1.126
3.531HisThr: 3.531 ± 2.043
3.531HisVal: 3.531 ± 1.05
0.589HisTrp: 0.589 ± 0.681
1.177HisTyr: 1.177 ± 0.603
0.0HisXaa: 0.0 ± 0.0
Ile
1.766IleAla: 1.766 ± 1.037
1.177IleCys: 1.177 ± 0.64
4.709IleAsp: 4.709 ± 2.043
1.177IleGlu: 1.177 ± 0.693
4.12IlePhe: 4.12 ± 1.275
1.177IleGly: 1.177 ± 1.106
1.177IleHis: 1.177 ± 0.826
5.297IleIle: 5.297 ± 2.232
5.297IleLys: 5.297 ± 1.092
4.12IleLeu: 4.12 ± 1.148
0.589IleMet: 0.589 ± 0.746
4.709IleAsn: 4.709 ± 1.484
1.766IlePro: 1.766 ± 0.64
5.297IleGln: 5.297 ± 1.648
3.531IleArg: 3.531 ± 1.752
4.709IleSer: 4.709 ± 1.615
2.943IleThr: 2.943 ± 1.62
1.766IleVal: 1.766 ± 0.99
1.766IleTrp: 1.766 ± 1.495
1.766IleTyr: 1.766 ± 0.914
0.0IleXaa: 0.0 ± 0.0
Lys
1.177LysAla: 1.177 ± 0.693
1.177LysCys: 1.177 ± 0.793
1.766LysAsp: 1.766 ± 0.957
1.766LysGlu: 1.766 ± 1.49
2.943LysPhe: 2.943 ± 1.411
1.177LysGly: 1.177 ± 0.789
1.177LysHis: 1.177 ± 0.993
2.943LysIle: 2.943 ± 1.093
2.943LysLys: 2.943 ± 1.431
2.354LysLeu: 2.354 ± 1.4
0.589LysMet: 0.589 ± 0.681
6.474LysAsn: 6.474 ± 1.587
2.354LysPro: 2.354 ± 0.543
2.354LysGln: 2.354 ± 0.932
2.943LysArg: 2.943 ± 1.346
3.531LysSer: 3.531 ± 1.597
2.354LysThr: 2.354 ± 1.023
6.474LysVal: 6.474 ± 2.478
0.0LysTrp: 0.0 ± 0.0
4.12LysTyr: 4.12 ± 1.666
0.0LysXaa: 0.0 ± 0.0
Leu
3.531LeuAla: 3.531 ± 1.418
2.943LeuCys: 2.943 ± 0.937
5.297LeuAsp: 5.297 ± 2.293
4.12LeuGlu: 4.12 ± 1.327
1.766LeuPhe: 1.766 ± 1.078
5.297LeuGly: 5.297 ± 1.324
3.531LeuHis: 3.531 ± 1.613
2.354LeuIle: 2.354 ± 1.699
4.12LeuLys: 4.12 ± 1.937
4.12LeuLeu: 4.12 ± 2.049
1.766LeuMet: 1.766 ± 0.898
3.531LeuAsn: 3.531 ± 1.351
1.177LeuPro: 1.177 ± 0.916
1.177LeuGln: 1.177 ± 0.716
7.652LeuArg: 7.652 ± 1.964
7.063LeuSer: 7.063 ± 2.019
5.886LeuThr: 5.886 ± 1.918
3.531LeuVal: 3.531 ± 1.462
1.177LeuTrp: 1.177 ± 0.64
4.12LeuTyr: 4.12 ± 1.672
0.0LeuXaa: 0.0 ± 0.0
Met
1.177MetAla: 1.177 ± 0.999
0.0MetCys: 0.0 ± 0.0
1.766MetAsp: 1.766 ± 0.914
0.589MetGlu: 0.589 ± 0.553
1.177MetPhe: 1.177 ± 0.895
1.766MetGly: 1.766 ± 0.888
0.589MetHis: 0.589 ± 0.553
0.0MetIle: 0.0 ± 0.0
0.589MetLys: 0.589 ± 0.733
2.354MetLeu: 2.354 ± 1.448
0.589MetMet: 0.589 ± 0.681
0.0MetAsn: 0.0 ± 0.0
1.177MetPro: 1.177 ± 0.789
1.177MetGln: 1.177 ± 0.672
0.589MetArg: 0.589 ± 0.504
1.766MetSer: 1.766 ± 0.64
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
1.177MetTrp: 1.177 ± 0.716
1.766MetTyr: 1.766 ± 0.673
0.0MetXaa: 0.0 ± 0.0
Asn
4.709AsnAla: 4.709 ± 1.29
1.177AsnCys: 1.177 ± 0.821
4.12AsnAsp: 4.12 ± 1.072
1.766AsnGlu: 1.766 ± 0.914
1.766AsnPhe: 1.766 ± 0.899
2.354AsnGly: 2.354 ± 0.796
5.297AsnHis: 5.297 ± 2.321
2.943AsnIle: 2.943 ± 0.867
1.177AsnLys: 1.177 ± 0.609
1.766AsnLeu: 1.766 ± 1.128
1.177AsnMet: 1.177 ± 0.762
3.531AsnAsn: 3.531 ± 1.556
2.943AsnPro: 2.943 ± 1.43
2.354AsnGln: 2.354 ± 1.4
2.943AsnArg: 2.943 ± 1.286
7.063AsnSer: 7.063 ± 3.273
4.12AsnThr: 4.12 ± 1.076
4.709AsnVal: 4.709 ± 1.438
0.0AsnTrp: 0.0 ± 0.0
2.354AsnTyr: 2.354 ± 1.087
0.0AsnXaa: 0.0 ± 0.0
Pro
1.177ProAla: 1.177 ± 0.602
1.177ProCys: 1.177 ± 0.742
1.766ProAsp: 1.766 ± 1.053
1.177ProGlu: 1.177 ± 0.716
1.177ProPhe: 1.177 ± 0.609
2.354ProGly: 2.354 ± 1.207
2.943ProHis: 2.943 ± 1.535
5.886ProIle: 5.886 ± 2.397
2.943ProLys: 2.943 ± 1.439
2.354ProLeu: 2.354 ± 1.158
2.354ProMet: 2.354 ± 1.15
1.766ProAsn: 1.766 ± 0.817
2.354ProPro: 2.354 ± 1.127
2.943ProGln: 2.943 ± 1.125
4.709ProArg: 4.709 ± 1.288
4.12ProSer: 4.12 ± 1.934
7.063ProThr: 7.063 ± 2.135
3.531ProVal: 3.531 ± 0.89
0.589ProTrp: 0.589 ± 0.553
2.943ProTyr: 2.943 ± 1.199
0.0ProXaa: 0.0 ± 0.0
Gln
3.531GlnAla: 3.531 ± 1.631
1.177GlnCys: 1.177 ± 0.993
1.766GlnAsp: 1.766 ± 0.888
2.943GlnGlu: 2.943 ± 1.304
3.531GlnPhe: 3.531 ± 1.502
2.943GlnGly: 2.943 ± 0.982
0.589GlnHis: 0.589 ± 0.618
1.766GlnIle: 1.766 ± 1.128
0.589GlnLys: 0.589 ± 0.681
2.943GlnLeu: 2.943 ± 1.584
0.589GlnMet: 0.589 ± 0.618
1.766GlnAsn: 1.766 ± 0.991
3.531GlnPro: 3.531 ± 2.333
1.766GlnGln: 1.766 ± 0.84
2.943GlnArg: 2.943 ± 1.179
4.709GlnSer: 4.709 ± 1.133
1.766GlnThr: 1.766 ± 0.671
3.531GlnVal: 3.531 ± 1.094
0.0GlnTrp: 0.0 ± 0.0
1.177GlnTyr: 1.177 ± 0.647
0.0GlnXaa: 0.0 ± 0.0
Arg
4.12ArgAla: 4.12 ± 1.526
1.177ArgCys: 1.177 ± 0.789
4.12ArgAsp: 4.12 ± 1.618
1.766ArgGlu: 1.766 ± 0.671
6.474ArgPhe: 6.474 ± 1.394
4.12ArgGly: 4.12 ± 1.783
2.943ArgHis: 2.943 ± 1.118
6.474ArgIle: 6.474 ± 0.999
2.354ArgLys: 2.354 ± 1.499
4.709ArgLeu: 4.709 ± 1.411
0.589ArgMet: 0.589 ± 0.553
0.589ArgAsn: 0.589 ± 0.553
4.709ArgPro: 4.709 ± 1.447
1.766ArgGln: 1.766 ± 1.053
8.829ArgArg: 8.829 ± 3.088
5.886ArgSer: 5.886 ± 1.82
4.709ArgThr: 4.709 ± 1.445
7.652ArgVal: 7.652 ± 2.744
0.0ArgTrp: 0.0 ± 0.0
2.354ArgTyr: 2.354 ± 1.626
0.0ArgXaa: 0.0 ± 0.0
Ser
8.24SerAla: 8.24 ± 3.08
0.589SerCys: 0.589 ± 0.504
3.531SerAsp: 3.531 ± 1.06
2.943SerGlu: 2.943 ± 1.553
1.766SerPhe: 1.766 ± 0.914
2.354SerGly: 2.354 ± 1.209
1.766SerHis: 1.766 ± 1.075
5.886SerIle: 5.886 ± 1.664
4.12SerLys: 4.12 ± 1.068
6.474SerLeu: 6.474 ± 1.576
0.0SerMet: 0.0 ± 0.0
5.297SerAsn: 5.297 ± 1.969
7.063SerPro: 7.063 ± 1.22
2.943SerGln: 2.943 ± 1.44
6.474SerArg: 6.474 ± 1.693
12.949SerSer: 12.949 ± 3.297
10.006SerThr: 10.006 ± 2.725
4.709SerVal: 4.709 ± 1.801
0.589SerTrp: 0.589 ± 0.504
2.943SerTyr: 2.943 ± 0.888
0.0SerXaa: 0.0 ± 0.0
Thr
2.943ThrAla: 2.943 ± 1.143
0.589ThrCys: 0.589 ± 0.504
1.177ThrAsp: 1.177 ± 0.821
2.943ThrGlu: 2.943 ± 0.802
1.766ThrPhe: 1.766 ± 1.491
6.474ThrGly: 6.474 ± 1.374
3.531ThrHis: 3.531 ± 1.369
3.531ThrIle: 3.531 ± 1.089
4.12ThrLys: 4.12 ± 1.728
8.24ThrLeu: 8.24 ± 1.211
1.177ThrMet: 1.177 ± 0.932
3.531ThrAsn: 3.531 ± 0.964
3.531ThrPro: 3.531 ± 1.215
1.766ThrGln: 1.766 ± 0.787
5.297ThrArg: 5.297 ± 1.006
5.886ThrSer: 5.886 ± 2.266
1.766ThrThr: 1.766 ± 0.889
4.709ThrVal: 4.709 ± 1.548
1.177ThrTrp: 1.177 ± 0.603
3.531ThrTyr: 3.531 ± 1.505
0.0ThrXaa: 0.0 ± 0.0
Val
1.177ValAla: 1.177 ± 0.602
0.589ValCys: 0.589 ± 0.504
2.354ValAsp: 2.354 ± 0.981
1.177ValGlu: 1.177 ± 0.789
2.943ValPhe: 2.943 ± 1.446
2.943ValGly: 2.943 ± 1.199
2.354ValHis: 2.354 ± 0.96
7.652ValIle: 7.652 ± 2.885
2.943ValLys: 2.943 ± 1.087
5.297ValLeu: 5.297 ± 1.911
2.354ValMet: 2.354 ± 0.708
3.531ValAsn: 3.531 ± 1.178
7.063ValPro: 7.063 ± 1.166
4.12ValGln: 4.12 ± 1.171
4.709ValArg: 4.709 ± 1.988
2.943ValSer: 2.943 ± 1.091
4.12ValThr: 4.12 ± 1.423
2.943ValVal: 2.943 ± 1.387
1.177ValTrp: 1.177 ± 0.647
2.943ValTyr: 2.943 ± 1.451
0.0ValXaa: 0.0 ± 0.0
Trp
2.354TrpAla: 2.354 ± 0.894
0.0TrpCys: 0.0 ± 0.0
0.589TrpAsp: 0.589 ± 0.599
1.177TrpGlu: 1.177 ± 0.943
0.0TrpPhe: 0.0 ± 0.0
0.589TrpGly: 0.589 ± 0.497
0.0TrpHis: 0.0 ± 0.0
0.589TrpIle: 0.589 ± 0.681
0.0TrpLys: 0.0 ± 0.0
0.589TrpLeu: 0.589 ± 0.504
0.589TrpMet: 0.589 ± 0.5
0.589TrpAsn: 0.589 ± 0.681
0.0TrpPro: 0.0 ± 0.0
1.177TrpGln: 1.177 ± 0.789
1.177TrpArg: 1.177 ± 0.821
0.0TrpSer: 0.0 ± 0.0
0.589TrpThr: 0.589 ± 0.553
1.177TrpVal: 1.177 ± 0.609
0.0TrpTrp: 0.0 ± 0.0
1.766TrpTyr: 1.766 ± 0.77
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.354TyrAla: 2.354 ± 1.495
0.0TyrCys: 0.0 ± 0.0
3.531TyrAsp: 3.531 ± 1.436
0.589TyrGlu: 0.589 ± 0.5
2.943TyrPhe: 2.943 ± 0.867
2.943TyrGly: 2.943 ± 1.099
0.0TyrHis: 0.0 ± 0.0
1.766TyrIle: 1.766 ± 1.001
0.589TyrLys: 0.589 ± 0.504
3.531TyrLeu: 3.531 ± 1.062
1.766TyrMet: 1.766 ± 0.86
2.943TyrAsn: 2.943 ± 0.681
2.943TyrPro: 2.943 ± 0.972
1.177TyrGln: 1.177 ± 0.602
4.12TyrArg: 4.12 ± 1.348
3.531TyrSer: 3.531 ± 1.179
2.354TyrThr: 2.354 ± 1.126
3.531TyrVal: 3.531 ± 1.739
0.589TyrTrp: 0.589 ± 0.504
1.766TyrTyr: 1.766 ± 0.865
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1700 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski