Amino acid dipeptide frequency for Camellia chlorotic dwarf-associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
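The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration: the two toy sequences, the function name `dipeptide_per_mille`, the use of one-letter codes, and the choice to count overlapping windows within each protein are assumptions, not details confirmed by the database.

```python
from collections import Counter

# Hypothetical toy sequences (one-letter codes); not taken from this proteome.
proteins = ["MSSKA", "ASSLV"]

def dipeptide_per_mille(seqs):
    """Return {dipeptide: frequency in per mille}, counting overlapping
    two-residue windows inside each sequence (assumed counting scheme)."""
    counts = Counter()
    for seq in seqs:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

freqs = dipeptide_per_mille(proteins)

# Uniform expectation over the 20 x 20 standard dipeptides: 0.25% = 2.5 per mille.
expected = 1000.0 / 400

for dp, f in sorted(freqs.items()):
    label = "over-represented" if f > expected else "under-represented"
    print(f"{dp}: {f:.1f} per mille ({label})")
```

With only eight windows in the toy input every observed dipeptide lies far above 2.5‰; on a real proteome most of the 400 cells sit near or below that value.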

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
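As a small worked example of the unit conversion, the sketch below turns one table value (AlaSer, 9.24‰) into a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

ala_ser = 9.24                      # AlaSer value from the table, in per mille
fraction = per_mille_to_fraction(ala_ser)
percent = fraction * 100            # same quantity expressed in percent
uniform = 1 / 400                   # uniform expectation for 400 dipeptides
ratio = fraction / uniform          # fold difference relative to the expectation

print(fraction, percent, ratio)
```

Under these assumptions AlaSer occurs at roughly 0.92%, a little under four times the uniform expectation of 0.25%.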

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.421 ± 1.242
AlaCys: 0.711 ± 0.621
AlaAsp: 2.843 ± 0.874
AlaGlu: 1.421 ± 0.729
AlaPhe: 0.711 ± 0.744
AlaGly: 2.132 ± 1.232
AlaHis: 0.0 ± 0.0
AlaIle: 2.843 ± 0.874
AlaLys: 3.554 ± 1.344
AlaLeu: 3.554 ± 1.484
AlaMet: 3.554 ± 1.291
AlaAsn: 2.843 ± 1.609
AlaPro: 2.132 ± 0.732
AlaGln: 3.554 ± 0.809
AlaArg: 3.554 ± 1.157
AlaSer: 9.24 ± 1.994
AlaThr: 3.554 ± 0.847
AlaVal: 2.843 ± 2.154
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.421 ± 0.897
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.421 ± 1.201
CysAsp: 0.711 ± 0.575
CysGlu: 2.132 ± 0.96
CysPhe: 0.711 ± 0.744
CysGly: 1.421 ± 0.802
CysHis: 0.0 ± 0.0
CysIle: 2.132 ± 1.093
CysLys: 2.843 ± 2.056
CysLeu: 0.711 ± 0.621
CysMet: 1.421 ± 1.032
CysAsn: 2.843 ± 1.24
CysPro: 2.132 ± 1.049
CysGln: 1.421 ± 0.897
CysArg: 2.132 ± 0.641
CysSer: 3.554 ± 1.266
CysThr: 0.0 ± 0.0
CysVal: 1.421 ± 0.884
CysTrp: 0.711 ± 0.74
CysTyr: 2.132 ± 1.225
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.132 ± 1.473
AspCys: 0.0 ± 0.0
AspAsp: 2.132 ± 1.371
AspGlu: 4.264 ± 1.402
AspPhe: 4.264 ± 1.061
AspGly: 6.397 ± 2.108
AspHis: 4.264 ± 1.464
AspIle: 2.132 ± 1.245
AspLys: 1.421 ± 1.001
AspLeu: 4.264 ± 1.819
AspMet: 2.132 ± 1.219
AspAsn: 2.843 ± 1.609
AspPro: 4.264 ± 1.02
AspGln: 0.711 ± 0.575
AspArg: 4.264 ± 1.823
AspSer: 2.132 ± 1.219
AspThr: 1.421 ± 1.14
AspVal: 4.264 ± 1.79
AspTrp: 0.711 ± 0.541
AspTyr: 4.975 ± 0.822
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.132 ± 0.641
GluCys: 5.686 ± 2.721
GluAsp: 2.132 ± 1.232
GluGlu: 2.132 ± 0.949
GluPhe: 2.132 ± 0.732
GluGly: 4.264 ± 2.213
GluHis: 0.711 ± 0.541
GluIle: 2.843 ± 1.17
GluLys: 2.843 ± 2.433
GluLeu: 4.264 ± 1.368
GluMet: 0.711 ± 0.621
GluAsn: 4.975 ± 0.921
GluPro: 4.264 ± 1.023
GluGln: 2.132 ± 0.83
GluArg: 1.421 ± 0.729
GluSer: 4.264 ± 1.304
GluThr: 0.711 ± 0.575
GluVal: 2.132 ± 2.008
GluTrp: 2.132 ± 0.732
GluTyr: 2.132 ± 1.371
GluXaa: 0.0 ± 0.0
Phe
PheAla: 7.107 ± 1.413
PheCys: 0.711 ± 0.92
PheAsp: 2.843 ± 1.186
PheGlu: 4.264 ± 1.981
PhePhe: 2.132 ± 0.801
PheGly: 0.711 ± 0.92
PheHis: 2.132 ± 1.245
PheIle: 4.975 ± 1.549
PheLys: 0.711 ± 0.744
PheLeu: 2.843 ± 0.703
PheMet: 0.711 ± 0.74
PheAsn: 0.711 ± 0.744
PhePro: 1.421 ± 1.487
PheGln: 1.421 ± 0.805
PheArg: 4.975 ± 1.672
PheSer: 2.132 ± 1.421
PheThr: 2.132 ± 1.01
PheVal: 4.264 ± 1.778
PheTrp: 0.0 ± 0.0
PheTyr: 3.554 ± 1.347
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.421 ± 0.71
GlyCys: 0.711 ± 0.621
GlyAsp: 3.554 ± 0.877
GlyGlu: 6.397 ± 1.926
GlyPhe: 2.132 ± 1.382
GlyGly: 4.264 ± 2.605
GlyHis: 0.711 ± 0.744
GlyIle: 3.554 ± 1.746
GlyLys: 5.686 ± 2.223
GlyLeu: 4.975 ± 2.156
GlyMet: 0.0 ± 0.0
GlyAsn: 4.975 ± 1.996
GlyPro: 5.686 ± 2.517
GlyGln: 0.711 ± 0.575
GlyArg: 2.843 ± 2.056
GlySer: 3.554 ± 2.187
GlyThr: 0.711 ± 0.621
GlyVal: 2.132 ± 1.382
GlyTrp: 1.421 ± 1.001
GlyTyr: 0.711 ± 0.92
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.843 ± 1.609
HisCys: 2.132 ± 1.421
HisAsp: 0.711 ± 0.621
HisGlu: 2.132 ± 0.801
HisPhe: 0.711 ± 0.744
HisGly: 1.421 ± 0.806
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 1.421 ± 0.805
HisLeu: 4.264 ± 2.414
HisMet: 0.0 ± 0.0
HisAsn: 1.421 ± 0.805
HisPro: 2.132 ± 1.289
HisGln: 0.0 ± 0.0
HisArg: 2.132 ± 1.245
HisSer: 1.421 ± 0.806
HisThr: 0.0 ± 0.0
HisVal: 2.132 ± 1.69
HisTrp: 0.0 ± 0.0
HisTyr: 2.132 ± 1.245
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.0 ± 0.0
IleCys: 1.421 ± 0.802
IleAsp: 3.554 ± 1.917
IleGlu: 2.843 ± 1.237
IlePhe: 2.132 ± 0.988
IleGly: 0.711 ± 0.621
IleHis: 0.0 ± 0.0
IleIle: 4.975 ± 1.642
IleLys: 3.554 ± 1.616
IleLeu: 4.264 ± 0.744
IleMet: 1.421 ± 0.73
IleAsn: 2.132 ± 1.371
IlePro: 3.554 ± 1.661
IleGln: 1.421 ± 0.805
IleArg: 3.554 ± 1.341
IleSer: 7.107 ± 2.675
IleThr: 2.843 ± 1.071
IleVal: 4.264 ± 1.904
IleTrp: 1.421 ± 1.48
IleTyr: 2.843 ± 1.732
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.132 ± 0.732
LysCys: 0.0 ± 0.0
LysAsp: 7.107 ± 1.976
LysGlu: 2.132 ± 1.473
LysPhe: 1.421 ± 0.73
LysGly: 2.132 ± 1.864
LysHis: 1.421 ± 0.805
LysIle: 3.554 ± 1.41
LysLys: 4.975 ± 1.187
LysLeu: 3.554 ± 2.291
LysMet: 2.132 ± 0.801
LysAsn: 2.132 ± 0.949
LysPro: 4.264 ± 1.216
LysGln: 3.554 ± 2.025
LysArg: 3.554 ± 2.055
LysSer: 6.397 ± 2.197
LysThr: 3.554 ± 0.847
LysVal: 6.397 ± 2.131
LysTrp: 0.711 ± 0.74
LysTyr: 2.843 ± 1.855
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.132 ± 1.69
LeuCys: 4.264 ± 1.234
LeuAsp: 4.975 ± 1.503
LeuGlu: 1.421 ± 0.806
LeuPhe: 2.843 ± 1.965
LeuGly: 5.686 ± 1.94
LeuHis: 5.686 ± 1.818
LeuIle: 2.843 ± 1.424
LeuLys: 3.554 ± 1.549
LeuLeu: 7.818 ± 2.338
LeuMet: 2.843 ± 0.982
LeuAsn: 3.554 ± 1.948
LeuPro: 4.975 ± 1.932
LeuGln: 4.264 ± 1.464
LeuArg: 5.686 ± 0.972
LeuSer: 4.975 ± 2.382
LeuThr: 1.421 ± 0.805
LeuVal: 4.264 ± 2.389
LeuTrp: 0.711 ± 0.92
LeuTyr: 1.421 ± 1.081
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.554 ± 1.48
MetCys: 0.711 ± 0.74
MetAsp: 3.554 ± 1.616
MetGlu: 1.421 ± 1.201
MetPhe: 1.421 ± 1.0
MetGly: 2.132 ± 1.77
MetHis: 1.421 ± 0.805
MetIle: 2.132 ± 1.421
MetLys: 2.132 ± 1.371
MetLeu: 2.132 ± 2.231
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.711 ± 0.74
MetGln: 1.421 ± 1.242
MetArg: 0.0 ± 0.0
MetSer: 3.554 ± 0.877
MetThr: 0.711 ± 0.621
MetVal: 1.421 ± 0.805
MetTrp: 0.711 ± 0.621
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.264 ± 1.345
AsnCys: 2.132 ± 1.01
AsnAsp: 2.843 ± 1.061
AsnGlu: 2.132 ± 1.421
AsnPhe: 7.818 ± 3.047
AsnGly: 2.843 ± 0.49
AsnHis: 0.711 ± 0.744
AsnIle: 3.554 ± 1.48
AsnLys: 2.132 ± 0.988
AsnLeu: 4.975 ± 2.244
AsnMet: 0.711 ± 0.558
AsnAsn: 4.264 ± 2.414
AsnPro: 4.264 ± 1.889
AsnGln: 0.0 ± 0.0
AsnArg: 3.554 ± 1.544
AsnSer: 3.554 ± 1.344
AsnThr: 0.711 ± 0.744
AsnVal: 5.686 ± 1.566
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.132 ± 1.047
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.975 ± 2.344
ProCys: 1.421 ± 0.71
ProAsp: 4.264 ± 1.286
ProGlu: 4.264 ± 2.128
ProPhe: 0.0 ± 0.0
ProGly: 4.975 ± 1.526
ProHis: 3.554 ± 2.025
ProIle: 2.132 ± 1.356
ProLys: 0.711 ± 0.744
ProLeu: 4.264 ± 1.464
ProMet: 2.132 ± 1.416
ProAsn: 0.0 ± 0.0
ProPro: 0.711 ± 0.744
ProGln: 2.132 ± 0.732
ProArg: 4.975 ± 1.466
ProSer: 8.529 ± 2.313
ProThr: 2.843 ± 1.061
ProVal: 0.711 ± 0.744
ProTrp: 0.711 ± 0.744
ProTyr: 2.132 ± 0.732
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.132 ± 0.83
GlnCys: 2.132 ± 0.801
GlnAsp: 1.421 ± 0.729
GlnGlu: 2.132 ± 1.289
GlnPhe: 4.264 ± 1.061
GlnGly: 1.421 ± 1.48
GlnHis: 0.0 ± 0.0
GlnIle: 1.421 ± 0.806
GlnLys: 4.264 ± 2.414
GlnLeu: 2.843 ± 0.703
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 0.0 ± 0.0
GlnGln: 0.0 ± 0.0
GlnArg: 0.711 ± 0.541
GlnSer: 2.132 ± 0.949
GlnThr: 2.132 ± 1.289
GlnVal: 3.554 ± 1.906
GlnTrp: 0.711 ± 0.541
GlnTyr: 0.711 ± 0.575
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.421 ± 0.805
ArgCys: 0.711 ± 0.744
ArgAsp: 2.132 ± 1.864
ArgGlu: 2.843 ± 1.13
ArgPhe: 4.264 ± 1.061
ArgGly: 2.843 ± 1.777
ArgHis: 2.132 ± 1.049
ArgIle: 2.132 ± 1.232
ArgLys: 3.554 ± 2.068
ArgLeu: 2.843 ± 1.545
ArgMet: 0.711 ± 0.744
ArgAsn: 7.107 ± 2.935
ArgPro: 5.686 ± 1.733
ArgGln: 1.421 ± 1.48
ArgArg: 4.264 ± 3.246
ArgSer: 7.818 ± 3.163
ArgThr: 2.132 ± 1.098
ArgVal: 5.686 ± 2.517
ArgTrp: 1.421 ± 0.805
ArgTyr: 0.711 ± 0.74
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.264 ± 2.959
SerCys: 0.711 ± 0.92
SerAsp: 7.107 ± 0.667
SerGlu: 3.554 ± 1.163
SerPhe: 3.554 ± 1.597
SerGly: 4.264 ± 1.064
SerHis: 1.421 ± 1.242
SerIle: 4.264 ± 1.304
SerLys: 7.107 ± 1.811
SerLeu: 5.686 ± 0.981
SerMet: 4.264 ± 3.572
SerAsn: 4.975 ± 1.08
SerPro: 3.554 ± 1.347
SerGln: 2.132 ± 1.356
SerArg: 4.975 ± 1.659
SerSer: 7.107 ± 3.85
SerThr: 4.975 ± 1.08
SerVal: 7.818 ± 0.78
SerTrp: 2.132 ± 1.371
SerTyr: 2.132 ± 0.732
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.421 ± 0.806
ThrCys: 0.0 ± 0.0
ThrAsp: 2.843 ± 1.186
ThrGlu: 2.843 ± 1.24
ThrPhe: 2.843 ± 1.095
ThrGly: 2.132 ± 0.674
ThrHis: 0.711 ± 0.621
ThrIle: 1.421 ± 0.73
ThrLys: 2.132 ± 1.421
ThrLeu: 2.843 ± 1.061
ThrMet: 0.711 ± 0.541
ThrAsn: 2.132 ± 0.674
ThrPro: 2.132 ± 1.289
ThrGln: 1.421 ± 0.805
ThrArg: 4.264 ± 1.041
ThrSer: 2.843 ± 1.186
ThrThr: 0.711 ± 0.621
ThrVal: 3.554 ± 1.896
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.843 ± 2.974
ValCys: 2.843 ± 0.991
ValAsp: 2.843 ± 1.03
ValGlu: 3.554 ± 1.99
ValPhe: 3.554 ± 1.784
ValGly: 3.554 ± 2.129
ValHis: 0.711 ± 0.575
ValIle: 2.843 ± 1.229
ValLys: 7.107 ± 2.274
ValLeu: 7.818 ± 2.332
ValMet: 2.132 ± 1.374
ValAsn: 5.686 ± 1.397
ValPro: 2.132 ± 1.421
ValGln: 2.132 ± 1.289
ValArg: 2.843 ± 2.433
ValSer: 4.264 ± 1.511
ValThr: 3.554 ± 1.167
ValVal: 0.711 ± 0.92
ValTrp: 2.843 ± 1.186
ValTyr: 2.843 ± 2.056
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.843 ± 1.061
TrpCys: 0.711 ± 0.74
TrpAsp: 0.711 ± 0.541
TrpGlu: 0.711 ± 0.744
TrpPhe: 0.0 ± 0.0
TrpGly: 0.711 ± 0.74
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.711 ± 0.621
TrpLeu: 0.711 ± 0.621
TrpMet: 0.0 ± 0.0
TrpAsn: 0.711 ± 0.74
TrpPro: 1.421 ± 0.805
TrpGln: 2.132 ± 0.949
TrpArg: 0.711 ± 0.621
TrpSer: 0.711 ± 0.744
TrpThr: 0.711 ± 0.541
TrpVal: 2.843 ± 1.332
TrpTrp: 0.711 ± 0.541
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.843 ± 1.477
TyrCys: 1.421 ± 0.884
TyrAsp: 0.711 ± 0.575
TyrGlu: 1.421 ± 0.884
TyrPhe: 2.843 ± 1.071
TyrGly: 2.132 ± 2.008
TyrHis: 1.421 ± 0.806
TyrIle: 4.264 ± 2.414
TyrLys: 3.554 ± 1.093
TyrLeu: 0.711 ± 0.74
TyrMet: 2.843 ± 1.766
TyrAsn: 4.975 ± 2.19
TyrPro: 0.711 ± 0.744
TyrGln: 0.0 ± 0.0
TyrArg: 1.421 ± 0.71
TyrSer: 0.711 ± 0.74
TyrThr: 2.132 ± 0.732
TyrVal: 0.711 ± 0.92
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.711 ± 0.744
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1408 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
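A protein-level bootstrap can be sketched as follows. The source states only that the error comes from 100 bootstrap rounds at the protein level; resampling whole proteins with replacement and reporting the population standard deviation of the resampled frequencies is an assumption here, as are the toy sequences and the function names `dipeptide_freq` and `bootstrap_error`.

```python
import random
import statistics
from collections import Counter

def dipeptide_freq(seqs, dipeptide):
    """Frequency (per mille) of one dipeptide over all overlapping windows."""
    counts = Counter()
    for seq in seqs:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(seqs, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement n_rounds times and return
    the standard deviation of the resulting frequencies (assumed estimator)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(seqs) for _ in seqs]
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)

# Hypothetical toy proteome (one-letter codes); not the viral proteins.
toy = ["MSSKA", "ASSLV", "MKKLS"]
print(dipeptide_freq(toy, "SS"), "±", bootstrap_error(toy, "SS"))
```

A dipeptide that never occurs in any protein yields 0.0 in every replicate, which would explain the `0.0 ± 0.0` entries in the table under this scheme.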

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski