Amino acid dipeptide frequency for Cymbidium chlorotic mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
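The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names `dipeptide_per_mille` and `classify` are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein (never across protein boundaries) and that any non-standard residue is mapped to Xaa.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (one-letter codes)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted within each sequence only.
    """
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        clean = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be shown in red
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be shown in blue
    return "expected"


if __name__ == "__main__":
    toy_proteome = ["MKLSSLLA", "MALSXGG"]  # made-up sequences, not real viral proteins
    for pair, freq in sorted(dipeptide_per_mille(toy_proteome).items()):
        print(pair, round(freq, 3), classify(freq))
```

With this convention, a table value such as LeuSer at 14.385‰ counts as overrepresented, while CysPhe at 0.533‰ counts as underrepresented.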

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.795 ± 2.498 | 3.729 ± 1.889 | 3.729 ± 1.186 | 9.057 ± 3.24 | 0.0 ± 0.0 | 5.86 ± 1.467 | 0.0 ± 0.0 | 1.066 ± 0.36 | 4.795 ± 0.963 | 4.795 ± 1.05 | 2.664 ± 0.612 | 1.066 ± 0.692 | 3.197 ± 1.879 | 2.131 ± 1.384 | 2.131 ± 0.484 | 11.188 ± 3.165 | 3.729 ± 1.776 | 4.262 ± 0.985 | 0.0 ± 0.0 | 2.131 ± 0.735 | 0.0 ± 0.0 |
| Cys | 2.131 ± 2.354 | 0.0 ± 0.0 | 2.131 ± 1.227 | 2.664 ± 2.008 | 0.533 ± 0.338 | 3.729 ± 0.628 | 0.0 ± 0.0 | 0.533 ± 0.338 | 0.533 ± 0.338 | 1.066 ± 0.677 | 0.0 ± 0.0 | 0.533 ± 0.518 | 2.131 ± 1.004 | 2.131 ± 0.922 | 1.598 ± 2.08 | 1.598 ± 0.47 | 2.131 ± 0.922 | 0.0 ± 0.0 | 0.533 ± 1.081 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.664 ± 0.612 | 3.197 ± 0.952 | 2.131 ± 0.72 | 3.197 ± 0.862 | 1.066 ± 1.167 | 3.197 ± 0.952 | 0.533 ± 1.081 | 3.729 ± 1.158 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.131 ± 0.922 | 1.598 ± 1.015 | 2.664 ± 0.968 | 3.197 ± 0.771 | 0.533 ± 0.338 | 4.795 ± 1.066 | 1.066 ± 0.677 | 3.729 ± 1.342 | 0.0 ± 0.0 |
| Glu | 6.926 ± 2.013 | 3.197 ± 4.282 | 2.131 ± 1.211 | 1.598 ± 0.825 | 2.664 ± 0.968 | 3.197 ± 0.862 | 1.066 ± 0.36 | 3.197 ± 1.27 | 1.066 ± 1.036 | 8.524 ± 2.756 | 0.0 ± 0.0 | 2.131 ± 1.33 | 3.197 ± 0.862 | 2.131 ± 1.354 | 3.729 ± 1.342 | 3.197 ± 0.939 | 4.795 ± 0.785 | 5.86 ± 1.998 | 1.598 ± 0.47 | 3.197 ± 1.991 | 0.0 ± 0.0 |
| Phe | 1.598 ± 0.47 | 1.598 ± 1.079 | 1.066 ± 1.026 | 1.066 ± 1.24 | 0.533 ± 1.081 | 3.197 ± 0.862 | 0.0 ± 0.0 | 3.197 ± 1.081 | 1.066 ± 0.36 | 4.262 ± 2.009 | 0.533 ± 0.338 | 0.0 ± 0.0 | 1.066 ± 0.677 | 1.066 ± 0.36 | 2.131 ± 0.735 | 3.729 ± 2.934 | 0.0 ± 0.0 | 2.131 ± 0.484 | 0.533 ± 0.338 | 0.533 ± 0.338 | 0.0 ± 0.0 |
| Gly | 5.328 ± 1.206 | 0.533 ± 0.738 | 2.664 ± 1.692 | 4.795 ± 1.337 | 2.131 ± 1.082 | 4.795 ± 1.773 | 0.533 ± 0.338 | 4.262 ± 1.116 | 4.262 ± 1.139 | 9.057 ± 1.912 | 3.197 ± 0.939 | 3.197 ± 0.5 | 2.131 ± 0.735 | 1.066 ± 0.802 | 4.262 ± 0.724 | 9.057 ± 1.166 | 6.926 ± 2.577 | 7.991 ± 1.23 | 2.131 ± 0.922 | 1.598 ± 1.731 | 0.0 ± 0.0 |
| His | 1.598 ± 0.47 | 1.066 ± 0.36 | 1.598 ± 0.47 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.533 ± 0.338 | 0.533 ± 1.081 | 1.066 ± 0.36 | 0.533 ± 0.738 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.598 ± 0.47 | 1.066 ± 1.167 | 1.066 ± 0.36 | 2.131 ± 0.735 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Ile | 4.795 ± 0.821 | 0.533 ± 0.338 | 2.664 ± 0.829 | 4.795 ± 1.888 | 2.664 ± 0.968 | 3.197 ± 0.952 | 1.066 ± 0.36 | 2.131 ± 0.922 | 0.0 ± 0.0 | 3.729 ± 1.822 | 2.664 ± 0.766 | 1.598 ± 0.47 | 1.598 ± 0.47 | 1.598 ± 0.47 | 1.066 ± 0.677 | 7.459 ± 2.201 | 1.066 ± 0.36 | 4.262 ± 0.854 | 1.598 ± 0.47 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Lys | 4.262 ± 1.218 | 0.0 ± 0.0 | 1.598 ± 0.47 | 1.598 ± 1.015 | 2.664 ± 1.954 | 4.795 ± 1.448 | 0.0 ± 0.0 | 1.066 ± 1.026 | 1.598 ± 0.995 | 4.795 ± 2.028 | 0.0 ± 0.0 | 1.066 ± 1.167 | 4.795 ± 1.759 | 0.533 ± 0.338 | 1.598 ± 0.569 | 4.262 ± 1.687 | 3.729 ± 1.186 | 3.197 ± 0.5 | 0.533 ± 0.738 | 0.533 ± 0.338 | 0.0 ± 0.0 |
| Leu | 4.795 ± 0.785 | 1.066 ± 1.026 | 3.729 ± 1.002 | 6.393 ± 2.117 | 2.131 ± 0.72 | 6.393 ± 1.0 | 0.533 ± 0.738 | 7.459 ± 2.124 | 3.197 ± 1.081 | 9.59 ± 2.589 | 2.131 ± 0.735 | 3.197 ± 0.5 | 2.131 ± 0.72 | 4.262 ± 0.527 | 6.926 ± 1.904 | 14.385 ± 2.56 | 6.393 ± 1.184 | 7.991 ± 0.48 | 2.664 ± 1.287 | 5.328 ± 1.223 | 0.0 ± 0.0 |
| Met | 2.664 ± 0.612 | 0.0 ± 0.0 | 1.066 ± 0.36 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.729 ± 0.605 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.533 ± 0.338 | 3.197 ± 1.081 | 1.066 ± 0.36 | 2.131 ± 0.484 | 1.066 ± 0.692 | 0.0 ± 0.0 | 2.131 ± 0.735 | 0.533 ± 0.738 | 0.533 ± 1.081 | 1.066 ± 0.36 | 0.0 ± 0.0 | 0.533 ± 0.738 | 0.0 ± 0.0 |
| Asn | 0.0 ± 0.0 | 1.066 ± 0.692 | 0.0 ± 0.0 | 1.066 ± 0.36 | 3.197 ± 0.5 | 7.459 ± 1.705 | 0.0 ± 0.0 | 0.533 ± 0.338 | 2.664 ± 0.766 | 2.131 ± 0.484 | 1.598 ± 0.569 | 0.533 ± 1.081 | 3.729 ± 1.223 | 1.598 ± 0.569 | 3.197 ± 1.014 | 2.664 ± 0.766 | 2.131 ± 1.211 | 4.262 ± 1.441 | 0.0 ± 0.0 | 2.131 ± 0.72 | 0.0 ± 0.0 |
| Pro | 5.328 ± 1.556 | 0.533 ± 0.338 | 1.598 ± 0.995 | 4.262 ± 1.106 | 1.066 ± 1.476 | 3.729 ± 1.158 | 0.533 ± 0.338 | 2.664 ± 0.829 | 0.533 ± 0.338 | 4.795 ± 0.785 | 0.0 ± 0.0 | 1.066 ± 0.36 | 1.598 ± 0.995 | 1.598 ± 1.079 | 3.197 ± 0.862 | 4.262 ± 1.992 | 4.262 ± 2.532 | 6.393 ± 1.361 | 0.533 ± 0.738 | 0.533 ± 0.738 | 0.0 ± 0.0 |
| Gln | 4.262 ± 1.47 | 0.533 ± 0.738 | 0.533 ± 0.338 | 2.131 ± 2.047 | 1.598 ± 0.47 | 1.598 ± 1.113 | 1.066 ± 0.36 | 1.066 ± 0.36 | 0.0 ± 0.0 | 2.664 ± 1.044 | 0.0 ± 0.0 | 1.598 ± 0.825 | 0.533 ± 0.338 | 0.533 ± 0.518 | 0.533 ± 0.338 | 3.197 ± 0.862 | 2.131 ± 1.211 | 0.533 ± 0.738 | 2.131 ± 1.017 | 0.533 ± 0.738 | 0.0 ± 0.0 |
| Arg | 4.262 ± 0.854 | 0.533 ± 0.338 | 3.197 ± 1.991 | 2.664 ± 1.287 | 2.664 ± 1.154 | 5.328 ± 1.648 | 0.533 ± 1.081 | 2.664 ± 1.692 | 4.262 ± 0.724 | 9.057 ± 2.255 | 0.533 ± 0.587 | 0.533 ± 0.338 | 1.066 ± 1.476 | 1.598 ± 0.825 | 5.328 ± 2.698 | 2.664 ± 1.506 | 2.664 ± 1.035 | 6.393 ± 2.161 | 2.131 ± 0.484 | 0.533 ± 0.338 | 0.0 ± 0.0 |
| Ser | 5.86 ± 1.68 | 1.598 ± 0.47 | 3.729 ± 1.342 | 4.795 ± 0.6 | 1.598 ± 0.47 | 7.991 ± 1.487 | 3.729 ± 1.1 | 5.86 ± 1.8 | 5.328 ± 0.766 | 9.057 ± 1.016 | 0.533 ± 0.738 | 6.926 ± 1.765 | 7.991 ± 2.068 | 0.533 ± 0.518 | 6.926 ± 1.4 | 10.123 ± 2.695 | 6.393 ± 2.134 | 7.991 ± 2.097 | 2.664 ± 0.968 | 0.533 ± 0.338 | 0.0 ± 0.0 |
| Thr | 6.926 ± 1.4 | 3.197 ± 1.12 | 2.131 ± 0.484 | 4.795 ± 1.423 | 0.533 ± 1.081 | 1.598 ± 2.214 | 1.598 ± 0.47 | 4.262 ± 0.985 | 1.066 ± 0.36 | 5.328 ± 0.916 | 2.131 ± 0.936 | 3.729 ± 1.158 | 1.598 ± 1.079 | 0.533 ± 0.738 | 3.197 ± 1.014 | 5.328 ± 1.029 | 9.057 ± 3.648 | 3.197 ± 2.064 | 2.131 ± 0.72 | 0.533 ± 0.338 | 0.0 ± 0.0 |
| Val | 1.598 ± 1.079 | 1.598 ± 1.13 | 2.131 ± 0.922 | 4.795 ± 1.857 | 2.131 ± 1.082 | 9.057 ± 1.514 | 0.0 ± 0.0 | 4.262 ± 0.724 | 9.057 ± 1.68 | 7.991 ± 1.711 | 1.066 ± 0.36 | 6.926 ± 1.377 | 5.328 ± 1.608 | 1.598 ± 0.47 | 3.197 ± 1.865 | 4.262 ± 0.724 | 4.795 ± 3.261 | 5.86 ± 2.237 | 3.197 ± 1.081 | 1.598 ± 0.569 | 0.0 ± 0.0 |
| Trp | 1.598 ± 0.47 | 1.598 ± 2.08 | 0.533 ± 0.338 | 1.598 ± 0.47 | 0.0 ± 0.0 | 0.533 ± 1.081 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.533 ± 0.338 | 3.729 ± 1.002 | 0.533 ± 0.311 | 2.664 ± 0.603 | 2.131 ± 0.735 | 1.066 ± 0.36 | 2.131 ± 0.484 | 4.262 ± 1.47 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.598 ± 0.801 | 0.0 ± 0.0 |
| Tyr | 1.598 ± 1.113 | 1.598 ± 0.801 | 0.533 ± 0.738 | 2.131 ± 0.484 | 1.066 ± 0.36 | 0.533 ± 0.338 | 1.066 ± 0.677 | 0.533 ± 1.081 | 1.066 ± 0.36 | 2.131 ± 0.72 | 0.533 ± 0.338 | 1.066 ± 0.36 | 1.066 ± 0.692 | 1.598 ± 1.079 | 1.066 ± 1.026 | 3.729 ± 0.605 | 1.066 ± 1.476 | 2.131 ± 1.017 | 1.066 ± 0.36 | 1.066 ± 0.36 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 4 proteins (1878 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
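As a rough illustration of what bootstrapping at the protein level could look like, the sketch below resamples whole proteins with replacement 100 times and recomputes the dipeptide frequency for each resample. The function name `bootstrap_error` is hypothetical, and taking the standard deviation of the replicates as the reported error is an assumption; the source does not state which statistic is used.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate draws len(proteins) whole proteins with replacement, so the
    resampling unit is the protein, not the individual residue.
    Assumption: the error is the standard deviation across replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKLSSLLA", "MALSGGLS", "LSLSAA", "MGGKKA"]  # made-up sequences
    mean, err = bootstrap_error(toy_proteome, "LS")
    print(f"LS: {mean:.3f} ± {err:.3f} per mille")
```

With only four proteins in this proteome, each replicate can differ substantially from the others, which is consistent with the fairly large errors relative to the values in the table.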

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski