Amino acid dipeptide frequency for Cauliflower mosaic virus (strain CM-1841) (CaMV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
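As a rough illustration of how such frequencies could be obtained, the sketch below counts overlapping dipeptides in a set of protein sequences and expresses them in per mille. The function name `dipeptide_per_mille`, the one-letter alphabet, the rule that pairs never span two proteins, and the toy sequences are assumptions made for this example; the page does not describe how Proteome-pI computes its values.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns (Ala, Cys, ..., Tyr).
ALPHABET = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences, alphabet=ALPHABET, unknown="X"):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    symbols = alphabet + unknown
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to the unknown symbol (Xaa).
        cleaned = "".join(c if c in alphabet else unknown for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins (assumption).
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(symbols, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(symbols, repeat=2)}


# Hypothetical toy sequences, not taken from the CaMV proteome.
freqs = dipeptide_per_mille(["MKKLA", "AKKE"])

# Uniform expectation for 20 x 20 dipeptides: 1/400 = 0.25% = 2.5 per mille.
uniform_expectation = 1000.0 / 400
```

The returned dictionary always has 441 entries (21 × 21), so dipeptides that never occur appear with a frequency of 0.0, as in the table.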

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
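A minimal sketch of the unit conversion and of the comparison against the uniform expectation follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the rule of comparing each value directly with 2.5‰ (ignoring the error estimate) is an assumption about how the red and blue marking could be reproduced, not a description of the site's code.

```python
# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per-mille value relative to the expectation (uncertainty ignored)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values taken from the table: AlaAla = 3.911 per mille, AlaTrp = 0.435 per mille.
ala_ala_fraction = per_mille_to_fraction(3.911)
labels = (classify(3.911), classify(0.435))
```

Under this rule AlaAla would be labelled as overrepresented and AlaTrp as underrepresented.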

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.911 ± 1.179
AlaCys: 0.0 ± 0.0
AlaAsp: 2.608 ± 0.93
AlaGlu: 3.477 ± 1.443
AlaPhe: 1.738 ± 0.957
AlaGly: 1.738 ± 0.84
AlaHis: 0.869 ± 0.433
AlaIle: 4.781 ± 1.063
AlaLys: 4.346 ± 0.989
AlaLeu: 3.477 ± 1.609
AlaMet: 1.738 ± 0.494
AlaAsn: 1.738 ± 0.845
AlaPro: 3.042 ± 1.518
AlaGln: 2.608 ± 0.865
AlaArg: 1.304 ± 0.841
AlaSer: 4.346 ± 1.461
AlaThr: 2.608 ± 1.227
AlaVal: 3.477 ± 1.461
AlaTrp: 0.435 ± 0.319
AlaTyr: 2.173 ± 1.101
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.869 ± 0.449
CysAsp: 1.304 ± 0.753
CysGlu: 0.0 ± 0.0
CysPhe: 0.435 ± 0.319
CysGly: 0.435 ± 0.431
CysHis: 0.869 ± 0.811
CysIle: 0.869 ± 0.412
CysLys: 0.869 ± 0.637
CysLeu: 0.869 ± 0.511
CysMet: 0.0 ± 0.0
CysAsn: 1.738 ± 0.597
CysPro: 2.173 ± 0.679
CysGln: 0.869 ± 0.449
CysArg: 0.869 ± 0.449
CysSer: 0.869 ± 0.449
CysThr: 0.869 ± 0.531
CysVal: 0.869 ± 0.476
CysTrp: 0.435 ± 0.422
CysTyr: 0.435 ± 0.422
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.608 ± 1.081
AspCys: 2.608 ± 1.152
AspAsp: 3.911 ± 0.77
AspGlu: 3.042 ± 1.228
AspPhe: 3.042 ± 0.64
AspGly: 2.608 ± 0.532
AspHis: 0.869 ± 0.637
AspIle: 3.477 ± 0.665
AspLys: 2.608 ± 0.928
AspLeu: 5.65 ± 1.054
AspMet: 0.869 ± 0.702
AspAsn: 2.608 ± 0.681
AspPro: 2.608 ± 0.584
AspGln: 2.173 ± 0.448
AspArg: 3.042 ± 0.856
AspSer: 2.608 ± 1.445
AspThr: 3.477 ± 1.064
AspVal: 1.304 ± 0.476
AspTrp: 0.869 ± 0.433
AspTyr: 2.608 ± 0.843
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.781 ± 1.364
GluCys: 0.869 ± 0.511
GluAsp: 5.65 ± 1.708
GluGlu: 10.43 ± 4.196
GluPhe: 2.173 ± 1.242
GluGly: 4.346 ± 0.992
GluHis: 2.173 ± 0.76
GluIle: 7.388 ± 1.305
GluLys: 8.257 ± 1.416
GluLeu: 6.953 ± 0.953
GluMet: 0.869 ± 0.5
GluAsn: 4.781 ± 1.834
GluPro: 2.608 ± 1.184
GluGln: 4.346 ± 1.381
GluArg: 1.304 ± 0.654
GluSer: 7.388 ± 2.224
GluThr: 2.608 ± 0.676
GluVal: 2.173 ± 1.315
GluTrp: 0.435 ± 0.319
GluTyr: 0.869 ± 0.845
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.173 ± 0.963
PheCys: 1.304 ± 0.685
PheAsp: 2.608 ± 0.85
PheGlu: 0.869 ± 0.511
PhePhe: 0.0 ± 0.0
PheGly: 1.304 ± 0.617
PheHis: 0.435 ± 0.406
PheIle: 3.042 ± 1.002
PheLys: 4.346 ± 1.365
PheLeu: 4.781 ± 1.498
PheMet: 0.869 ± 0.636
PheAsn: 1.738 ± 0.466
PhePro: 1.738 ± 0.898
PheGln: 1.304 ± 0.476
PheArg: 3.042 ± 0.422
PheSer: 2.608 ± 1.114
PheThr: 3.042 ± 0.624
PheVal: 1.738 ± 0.888
PheTrp: 0.869 ± 0.449
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.738 ± 1.261
GlyCys: 0.435 ± 0.406
GlyAsp: 3.477 ± 0.654
GlyGlu: 4.781 ± 1.935
GlyPhe: 2.608 ± 0.639
GlyGly: 1.738 ± 0.645
GlyHis: 1.304 ± 0.654
GlyIle: 5.215 ± 1.058
GlyLys: 4.781 ± 0.962
GlyLeu: 4.346 ± 1.181
GlyMet: 1.304 ± 0.57
GlyAsn: 4.346 ± 2.326
GlyPro: 1.738 ± 1.092
GlyGln: 0.869 ± 0.657
GlyArg: 2.173 ± 0.827
GlySer: 2.608 ± 0.842
GlyThr: 3.042 ± 0.785
GlyVal: 2.608 ± 1.267
GlyTrp: 0.435 ± 0.492
GlyTyr: 1.304 ± 0.473
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.435 ± 0.406
HisCys: 1.304 ± 0.753
HisAsp: 1.304 ± 0.878
HisGlu: 0.435 ± 0.422
HisPhe: 1.738 ± 0.888
HisGly: 0.435 ± 0.319
HisHis: 1.738 ± 0.745
HisIle: 2.608 ± 1.055
HisLys: 1.738 ± 0.645
HisLeu: 1.738 ± 0.721
HisMet: 0.869 ± 0.637
HisAsn: 0.0 ± 0.0
HisPro: 0.869 ± 0.412
HisGln: 0.435 ± 0.406
HisArg: 0.435 ± 0.406
HisSer: 1.304 ± 0.476
HisThr: 0.0 ± 0.0
HisVal: 0.869 ± 0.502
HisTrp: 0.869 ± 0.412
HisTyr: 1.304 ± 0.654
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.042 ± 1.24
IleCys: 2.608 ± 0.906
IleAsp: 5.215 ± 2.121
IleGlu: 6.519 ± 0.985
IlePhe: 3.911 ± 1.116
IleGly: 4.781 ± 1.361
IleHis: 1.304 ± 0.878
IleIle: 4.346 ± 1.821
IleLys: 7.388 ± 2.165
IleLeu: 7.388 ± 1.35
IleMet: 0.869 ± 0.476
IleAsn: 6.519 ± 1.62
IlePro: 4.781 ± 1.466
IleGln: 3.477 ± 1.464
IleArg: 4.346 ± 0.992
IleSer: 4.781 ± 1.136
IleThr: 3.477 ± 0.977
IleVal: 3.477 ± 1.201
IleTrp: 0.435 ± 0.505
IleTyr: 2.173 ± 0.624
IleXaa: 0.0 ± 0.0
Lys
LysAla: 9.561 ± 1.49
LysCys: 1.304 ± 0.338
LysAsp: 4.346 ± 1.713
LysGlu: 11.734 ± 3.222
LysPhe: 4.346 ± 0.739
LysGly: 5.65 ± 0.794
LysHis: 0.0 ± 0.0
LysIle: 11.299 ± 0.383
LysLys: 13.907 ± 4.438
LysLeu: 5.215 ± 1.652
LysMet: 0.435 ± 0.431
LysAsn: 6.084 ± 2.204
LysPro: 4.346 ± 0.567
LysGln: 3.477 ± 0.988
LysArg: 4.781 ± 0.98
LysSer: 5.215 ± 1.719
LysThr: 5.65 ± 1.734
LysVal: 5.215 ± 1.278
LysTrp: 0.435 ± 0.319
LysTyr: 2.608 ± 1.036
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.911 ± 1.189
LeuCys: 0.435 ± 0.319
LeuAsp: 3.477 ± 1.055
LeuGlu: 6.953 ± 1.669
LeuPhe: 1.738 ± 0.494
LeuGly: 7.823 ± 1.857
LeuHis: 2.173 ± 0.997
LeuIle: 6.084 ± 1.311
LeuLys: 7.388 ± 2.307
LeuLeu: 9.126 ± 0.952
LeuMet: 3.477 ± 1.183
LeuAsn: 5.215 ± 1.876
LeuPro: 3.042 ± 0.787
LeuGln: 4.346 ± 1.358
LeuArg: 1.304 ± 1.217
LeuSer: 7.823 ± 1.639
LeuThr: 6.519 ± 0.782
LeuVal: 1.738 ± 0.824
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.042 ± 1.048
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.173 ± 0.679
MetCys: 0.0 ± 0.0
MetAsp: 2.608 ± 0.639
MetGlu: 1.738 ± 0.865
MetPhe: 1.304 ± 0.449
MetGly: 0.435 ± 0.422
MetHis: 0.0 ± 0.0
MetIle: 1.304 ± 0.473
MetLys: 2.173 ± 0.945
MetLeu: 1.304 ± 0.338
MetMet: 0.0 ± 0.0
MetAsn: 1.738 ± 0.611
MetPro: 0.435 ± 0.431
MetGln: 1.738 ± 0.611
MetArg: 0.0 ± 0.0
MetSer: 0.869 ± 0.667
MetThr: 1.304 ± 0.833
MetVal: 2.173 ± 0.779
MetTrp: 0.0 ± 0.0
MetTyr: 0.869 ± 0.845
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.173 ± 0.417
AsnCys: 0.435 ± 0.406
AsnAsp: 2.608 ± 1.256
AsnGlu: 4.346 ± 0.691
AsnPhe: 1.738 ± 0.824
AsnGly: 2.173 ± 1.034
AsnHis: 0.869 ± 0.502
AsnIle: 5.215 ± 1.778
AsnLys: 6.953 ± 2.314
AsnLeu: 6.084 ± 1.147
AsnMet: 1.304 ± 0.449
AsnAsn: 3.042 ± 1.016
AsnPro: 3.477 ± 1.187
AsnGln: 2.608 ± 1.44
AsnArg: 2.608 ± 2.169
AsnSer: 4.781 ± 2.846
AsnThr: 4.346 ± 1.227
AsnVal: 2.173 ± 1.136
AsnTrp: 0.435 ± 0.431
AsnTyr: 2.608 ± 0.683
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.738 ± 0.84
ProCys: 0.435 ± 0.492
ProAsp: 0.435 ± 0.319
ProGlu: 4.781 ± 0.546
ProPhe: 1.304 ± 0.666
ProGly: 2.173 ± 1.122
ProHis: 2.173 ± 1.142
ProIle: 1.738 ± 0.26
ProLys: 6.953 ± 2.514
ProLeu: 5.215 ± 1.586
ProMet: 0.869 ± 0.637
ProAsn: 3.042 ± 1.049
ProPro: 1.738 ± 1.137
ProGln: 1.738 ± 0.824
ProArg: 0.435 ± 0.406
ProSer: 3.911 ± 0.904
ProThr: 3.042 ± 1.385
ProVal: 3.042 ± 1.002
ProTrp: 0.435 ± 0.319
ProTyr: 0.869 ± 0.845
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.738 ± 0.926
GlnCys: 0.0 ± 0.0
GlnAsp: 0.435 ± 0.406
GlnGlu: 3.042 ± 0.422
GlnPhe: 2.173 ± 0.448
GlnGly: 2.608 ± 1.236
GlnHis: 0.435 ± 0.319
GlnIle: 4.346 ± 0.993
GlnLys: 3.911 ± 0.885
GlnLeu: 3.911 ± 0.767
GlnMet: 0.435 ± 0.406
GlnAsn: 3.042 ± 2.153
GlnPro: 2.608 ± 1.262
GlnGln: 3.477 ± 0.836
GlnArg: 2.608 ± 0.928
GlnSer: 3.477 ± 2.05
GlnThr: 2.608 ± 0.862
GlnVal: 4.346 ± 1.056
GlnTrp: 0.435 ± 0.319
GlnTyr: 0.435 ± 0.422
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.608 ± 1.932
ArgCys: 0.435 ± 0.422
ArgAsp: 0.435 ± 0.431
ArgGlu: 2.608 ± 0.462
ArgPhe: 1.738 ± 0.645
ArgGly: 1.304 ± 0.672
ArgHis: 0.869 ± 0.449
ArgIle: 0.869 ± 0.667
ArgLys: 3.911 ± 0.699
ArgLeu: 3.911 ± 0.812
ArgMet: 1.738 ± 0.898
ArgAsn: 1.738 ± 0.623
ArgPro: 1.738 ± 0.824
ArgGln: 0.869 ± 0.433
ArgArg: 1.738 ± 0.623
ArgSer: 2.608 ± 0.989
ArgThr: 3.042 ± 1.365
ArgVal: 2.173 ± 0.851
ArgTrp: 0.869 ± 0.449
ArgTyr: 2.173 ± 1.626
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.304 ± 0.841
SerCys: 0.0 ± 0.0
SerAsp: 6.519 ± 1.992
SerGlu: 6.953 ± 1.652
SerPhe: 2.608 ± 1.23
SerGly: 4.346 ± 0.907
SerHis: 0.869 ± 0.476
SerIle: 6.953 ± 1.25
SerLys: 9.561 ± 1.631
SerLeu: 6.953 ± 2.379
SerMet: 2.173 ± 0.663
SerAsn: 3.042 ± 1.221
SerPro: 1.304 ± 0.615
SerGln: 3.911 ± 1.782
SerArg: 2.608 ± 0.879
SerSer: 6.953 ± 1.886
SerThr: 2.608 ± 0.532
SerVal: 1.304 ± 0.72
SerTrp: 0.0 ± 0.0
SerTyr: 1.304 ± 0.955
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.304 ± 0.523
ThrCys: 0.869 ± 0.449
ThrAsp: 3.042 ± 1.033
ThrGlu: 3.911 ± 1.18
ThrPhe: 0.869 ± 0.811
ThrGly: 3.042 ± 0.584
ThrHis: 0.869 ± 0.637
ThrIle: 6.084 ± 0.749
ThrLys: 4.781 ± 0.929
ThrLeu: 3.911 ± 0.918
ThrMet: 1.304 ± 0.725
ThrAsn: 3.477 ± 0.836
ThrPro: 2.173 ± 0.491
ThrGln: 4.346 ± 1.557
ThrArg: 2.608 ± 0.609
ThrSer: 4.346 ± 2.637
ThrThr: 2.608 ± 1.298
ThrVal: 1.738 ± 0.745
ThrTrp: 0.435 ± 0.492
ThrTyr: 1.738 ± 0.637
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.173 ± 0.669
ValCys: 0.869 ± 0.637
ValAsp: 2.173 ± 0.614
ValGlu: 3.042 ± 1.124
ValPhe: 2.608 ± 0.998
ValGly: 1.738 ± 0.816
ValHis: 1.738 ± 0.611
ValIle: 3.042 ± 1.033
ValLys: 5.65 ± 1.287
ValLeu: 0.869 ± 0.476
ValMet: 1.738 ± 0.785
ValAsn: 3.477 ± 1.695
ValPro: 2.608 ± 1.506
ValGln: 1.738 ± 0.686
ValArg: 2.173 ± 0.491
ValSer: 2.173 ± 1.204
ValThr: 0.435 ± 0.319
ValVal: 1.738 ± 0.824
ValTrp: 0.435 ± 0.406
ValTyr: 3.477 ± 0.607
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.869 ± 0.511
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.869 ± 0.412
TrpHis: 0.0 ± 0.0
TrpIle: 0.435 ± 0.422
TrpLys: 0.869 ± 0.502
TrpLeu: 0.869 ± 0.412
TrpMet: 0.435 ± 0.319
TrpAsn: 0.869 ± 0.449
TrpPro: 0.869 ± 0.983
TrpGln: 0.869 ± 0.637
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 0.869 ± 0.412
TrpVal: 0.435 ± 0.319
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.435 ± 0.422
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.173 ± 0.37
TyrCys: 0.869 ± 0.511
TyrAsp: 0.869 ± 0.811
TyrGlu: 1.738 ± 0.898
TyrPhe: 1.738 ± 0.898
TyrGly: 1.304 ± 0.704
TyrHis: 0.869 ± 0.449
TyrIle: 1.738 ± 0.623
TyrLys: 5.215 ± 1.995
TyrLeu: 3.042 ± 0.826
TyrMet: 0.435 ± 0.319
TyrAsn: 1.738 ± 0.645
TyrPro: 2.173 ± 0.935
TyrGln: 0.869 ± 0.476
TyrArg: 0.0 ± 0.0
TyrSer: 2.608 ± 0.681
TyrThr: 1.304 ± 0.618
TyrVal: 1.304 ± 0.523
TyrTrp: 0.435 ± 0.319
TyrTyr: 0.435 ± 0.406
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2302 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
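A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The function names, the use of the sample standard deviation as the error measure, the fixed seed, and the toy sequences are assumptions for illustration; the page states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def dipeptide_freq(sequences, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping residue pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the six CaMV proteins.
proteins = ["MKKLA", "AKKE", "MLLSK"]
kk_error = bootstrap_error(proteins, "KK")
ww_error = bootstrap_error(proteins, "WW")  # dipeptide absent from every protein
```

A dipeptide that is absent from every protein yields 0.0 in each resample and therefore an error of 0.0, which matches the `0.0 ± 0.0` entries in the table.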

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski