Amino acid dipepetide frequency for Cucumber mosaic virus (strain Q) (CMV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.259AlaAla: 5.259 ± 1.37
0.751AlaCys: 0.751 ± 0.522
5.259AlaAsp: 5.259 ± 1.303
4.508AlaGlu: 4.508 ± 1.198
4.508AlaPhe: 4.508 ± 1.209
4.132AlaGly: 4.132 ± 0.828
1.503AlaHis: 1.503 ± 0.66
5.259AlaIle: 5.259 ± 1.371
2.63AlaLys: 2.63 ± 1.05
5.259AlaLeu: 5.259 ± 0.529
2.63AlaMet: 2.63 ± 0.73
2.63AlaAsn: 2.63 ± 0.641
1.878AlaPro: 1.878 ± 0.786
1.878AlaGln: 1.878 ± 1.076
2.63AlaArg: 2.63 ± 1.206
6.762AlaSer: 6.762 ± 2.662
3.381AlaThr: 3.381 ± 1.201
6.011AlaVal: 6.011 ± 0.919
0.376AlaTrp: 0.376 ± 0.317
1.127AlaTyr: 1.127 ± 0.532
0.0AlaXaa: 0.0 ± 0.0
Cys
1.878CysAla: 1.878 ± 0.432
0.376CysCys: 0.376 ± 0.261
2.254CysAsp: 2.254 ± 0.978
0.751CysGlu: 0.751 ± 0.522
1.503CysPhe: 1.503 ± 0.501
1.127CysGly: 1.127 ± 0.783
0.0CysHis: 0.0 ± 0.0
1.503CysIle: 1.503 ± 0.549
0.751CysLys: 0.751 ± 0.516
1.878CysLeu: 1.878 ± 0.676
0.376CysMet: 0.376 ± 0.397
0.0CysAsn: 0.0 ± 0.0
1.878CysPro: 1.878 ± 0.432
0.376CysGln: 0.376 ± 0.261
0.0CysArg: 0.0 ± 0.0
3.005CysSer: 3.005 ± 0.435
0.376CysThr: 0.376 ± 0.261
1.878CysVal: 1.878 ± 0.455
0.0CysTrp: 0.0 ± 0.0
0.751CysTyr: 0.751 ± 0.275
0.0CysXaa: 0.0 ± 0.0
Asp
4.884AspAla: 4.884 ± 0.901
1.503AspCys: 1.503 ± 0.853
4.132AspAsp: 4.132 ± 0.971
1.878AspGlu: 1.878 ± 0.503
3.381AspPhe: 3.381 ± 1.318
4.508AspGly: 4.508 ± 0.634
1.127AspHis: 1.127 ± 0.532
1.127AspIle: 1.127 ± 0.68
5.259AspLys: 5.259 ± 1.669
9.016AspLeu: 9.016 ± 0.402
2.254AspMet: 2.254 ± 0.57
1.878AspAsn: 1.878 ± 0.432
2.254AspPro: 2.254 ± 0.772
1.878AspGln: 1.878 ± 0.676
3.757AspArg: 3.757 ± 2.28
5.635AspSer: 5.635 ± 0.905
3.005AspThr: 3.005 ± 1.0
5.259AspVal: 5.259 ± 1.617
1.127AspTrp: 1.127 ± 0.841
2.63AspTyr: 2.63 ± 0.615
0.0AspXaa: 0.0 ± 0.0
Glu
1.878GluAla: 1.878 ± 0.786
1.503GluCys: 1.503 ± 0.686
2.63GluAsp: 2.63 ± 1.36
3.757GluGlu: 3.757 ± 2.188
1.878GluPhe: 1.878 ± 0.826
1.878GluGly: 1.878 ± 0.516
1.127GluHis: 1.127 ± 0.651
3.005GluIle: 3.005 ± 1.312
3.381GluLys: 3.381 ± 1.178
6.386GluLeu: 6.386 ± 1.719
1.503GluMet: 1.503 ± 0.816
0.376GluAsn: 0.376 ± 0.317
1.503GluPro: 1.503 ± 1.266
1.878GluGln: 1.878 ± 0.905
3.757GluArg: 3.757 ± 0.759
3.757GluSer: 3.757 ± 0.66
2.63GluThr: 2.63 ± 1.084
3.005GluVal: 3.005 ± 0.88
0.751GluTrp: 0.751 ± 0.455
0.751GluTyr: 0.751 ± 0.522
0.0GluXaa: 0.0 ± 0.0
Phe
3.381PheAla: 3.381 ± 0.746
0.751PheCys: 0.751 ± 0.275
5.259PheAsp: 5.259 ± 0.991
2.254PheGlu: 2.254 ± 1.899
1.878PhePhe: 1.878 ± 0.432
3.381PheGly: 3.381 ± 1.036
2.254PheHis: 2.254 ± 1.577
1.503PheIle: 1.503 ± 0.549
1.878PheLys: 1.878 ± 0.641
1.878PheLeu: 1.878 ± 0.826
1.878PheMet: 1.878 ± 0.662
2.63PheAsn: 2.63 ± 0.937
2.254PhePro: 2.254 ± 0.891
2.254PheGln: 2.254 ± 1.062
2.63PheArg: 2.63 ± 0.736
6.386PheSer: 6.386 ± 1.556
1.503PheThr: 1.503 ± 0.816
4.132PheVal: 4.132 ± 1.123
0.0PheTrp: 0.0 ± 0.0
1.503PheTyr: 1.503 ± 0.549
0.0PheXaa: 0.0 ± 0.0
Gly
2.63GlyAla: 2.63 ± 0.441
1.503GlyCys: 1.503 ± 0.56
5.259GlyAsp: 5.259 ± 0.645
1.878GlyGlu: 1.878 ± 0.696
1.878GlyPhe: 1.878 ± 0.676
3.381GlyGly: 3.381 ± 1.201
1.503GlyHis: 1.503 ± 0.462
1.127GlyIle: 1.127 ± 0.783
2.63GlyLys: 2.63 ± 1.107
5.635GlyLeu: 5.635 ± 0.989
1.503GlyMet: 1.503 ± 0.66
2.63GlyAsn: 2.63 ± 1.087
1.503GlyPro: 1.503 ± 0.462
1.127GlyGln: 1.127 ± 0.378
3.005GlyArg: 3.005 ± 1.814
4.884GlySer: 4.884 ± 2.737
3.381GlyThr: 3.381 ± 1.201
4.884GlyVal: 4.884 ± 0.827
0.0GlyTrp: 0.0 ± 0.0
3.757GlyTyr: 3.757 ± 1.983
0.0GlyXaa: 0.0 ± 0.0
His
1.503HisAla: 1.503 ± 0.462
1.503HisCys: 1.503 ± 0.549
1.503HisAsp: 1.503 ± 0.66
1.503HisGlu: 1.503 ± 1.045
1.503HisPhe: 1.503 ± 0.66
3.005HisGly: 3.005 ± 0.769
0.376HisHis: 0.376 ± 0.501
0.376HisIle: 0.376 ± 0.317
0.751HisLys: 0.751 ± 0.275
1.127HisLeu: 1.127 ± 0.418
0.751HisMet: 0.751 ± 0.633
0.751HisAsn: 0.751 ± 0.523
0.751HisPro: 0.751 ± 0.516
1.878HisGln: 1.878 ± 0.611
0.751HisArg: 0.751 ± 0.462
0.751HisSer: 0.751 ± 0.462
0.751HisThr: 0.751 ± 0.522
1.878HisVal: 1.878 ± 0.632
0.376HisTrp: 0.376 ± 0.501
0.751HisTyr: 0.751 ± 0.522
0.0HisXaa: 0.0 ± 0.0
Ile
3.005IleAla: 3.005 ± 1.534
0.751IleCys: 0.751 ± 0.633
1.878IleAsp: 1.878 ± 0.696
1.503IleGlu: 1.503 ± 0.563
0.751IlePhe: 0.751 ± 0.633
2.63IleGly: 2.63 ± 1.813
0.376IleHis: 0.376 ± 0.261
0.751IleIle: 0.751 ± 0.522
3.005IleLys: 3.005 ± 0.731
4.132IleLeu: 4.132 ± 1.501
0.0IleMet: 0.0 ± 0.0
1.878IleAsn: 1.878 ± 1.009
3.381IlePro: 3.381 ± 0.9
1.503IleGln: 1.503 ± 0.816
3.005IleArg: 3.005 ± 0.557
6.011IleSer: 6.011 ± 0.619
3.005IleThr: 3.005 ± 0.515
3.005IleVal: 3.005 ± 0.557
1.127IleTrp: 1.127 ± 0.783
1.503IleTyr: 1.503 ± 0.462
0.0IleXaa: 0.0 ± 0.0
Lys
2.254LysAla: 2.254 ± 0.749
1.127LysCys: 1.127 ± 0.433
2.63LysAsp: 2.63 ± 1.208
3.005LysGlu: 3.005 ± 0.915
3.005LysPhe: 3.005 ± 0.862
2.63LysGly: 2.63 ± 1.058
0.376LysHis: 0.376 ± 0.317
3.381LysIle: 3.381 ± 0.832
5.635LysLys: 5.635 ± 1.498
5.259LysLeu: 5.259 ± 1.941
1.127LysMet: 1.127 ± 0.657
1.127LysAsn: 1.127 ± 0.378
3.005LysPro: 3.005 ± 1.36
0.376LysGln: 0.376 ± 0.261
1.878LysArg: 1.878 ± 0.843
7.513LysSer: 7.513 ± 1.168
5.635LysThr: 5.635 ± 0.487
3.381LysVal: 3.381 ± 0.75
1.503LysTrp: 1.503 ± 0.549
2.63LysTyr: 2.63 ± 0.441
0.0LysXaa: 0.0 ± 0.0
Leu
6.386LeuAla: 6.386 ± 0.874
2.254LeuCys: 2.254 ± 0.866
3.005LeuAsp: 3.005 ± 1.67
3.757LeuGlu: 3.757 ± 0.682
4.884LeuPhe: 4.884 ± 1.205
4.508LeuGly: 4.508 ± 1.03
3.381LeuHis: 3.381 ± 1.055
2.63LeuIle: 2.63 ± 1.107
6.011LeuLys: 6.011 ± 1.003
6.762LeuLeu: 6.762 ± 0.865
2.254LeuMet: 2.254 ± 0.623
6.762LeuAsn: 6.762 ± 1.441
7.137LeuPro: 7.137 ± 2.219
2.254LeuGln: 2.254 ± 0.445
5.635LeuArg: 5.635 ± 0.642
8.264LeuSer: 8.264 ± 2.531
2.254LeuThr: 2.254 ± 1.004
10.518LeuVal: 10.518 ± 1.32
0.0LeuTrp: 0.0 ± 0.0
1.878LeuTyr: 1.878 ± 0.611
0.0LeuXaa: 0.0 ± 0.0
Met
2.63MetAla: 2.63 ± 0.798
0.376MetCys: 0.376 ± 0.261
2.254MetAsp: 2.254 ± 0.781
1.127MetGlu: 1.127 ± 0.614
1.878MetPhe: 1.878 ± 0.81
0.376MetGly: 0.376 ± 0.261
0.376MetHis: 0.376 ± 0.261
1.127MetIle: 1.127 ± 0.95
0.376MetLys: 0.376 ± 0.261
2.254MetLeu: 2.254 ± 1.173
0.751MetMet: 0.751 ± 0.522
0.751MetAsn: 0.751 ± 0.275
0.376MetPro: 0.376 ± 0.317
0.751MetGln: 0.751 ± 0.275
3.381MetArg: 3.381 ± 1.49
4.132MetSer: 4.132 ± 0.687
1.878MetThr: 1.878 ± 0.905
0.376MetVal: 0.376 ± 0.317
0.376MetTrp: 0.376 ± 0.261
0.376MetTyr: 0.376 ± 0.261
0.0MetXaa: 0.0 ± 0.0
Asn
3.757AsnAla: 3.757 ± 0.809
0.376AsnCys: 0.376 ± 0.261
2.254AsnAsp: 2.254 ± 0.865
2.254AsnGlu: 2.254 ± 0.631
1.127AsnPhe: 1.127 ± 0.95
2.63AsnGly: 2.63 ± 0.736
1.127AsnHis: 1.127 ± 0.957
0.751AsnIle: 0.751 ± 0.522
2.63AsnLys: 2.63 ± 1.208
3.381AsnLeu: 3.381 ± 0.746
0.376AsnMet: 0.376 ± 0.317
3.005AsnAsn: 3.005 ± 2.563
1.503AsnPro: 1.503 ± 1.049
0.376AsnGln: 0.376 ± 0.501
1.503AsnArg: 1.503 ± 1.248
2.254AsnSer: 2.254 ± 0.929
1.127AsnThr: 1.127 ± 0.95
3.005AsnVal: 3.005 ± 0.764
0.751AsnTrp: 0.751 ± 0.523
1.503AsnTyr: 1.503 ± 0.744
0.0AsnXaa: 0.0 ± 0.0
Pro
2.63ProAla: 2.63 ± 1.276
1.127ProCys: 1.127 ± 0.383
3.757ProAsp: 3.757 ± 0.746
3.005ProGlu: 3.005 ± 0.557
1.503ProPhe: 1.503 ± 0.485
1.878ProGly: 1.878 ± 0.611
0.376ProHis: 0.376 ± 0.261
3.005ProIle: 3.005 ± 0.99
2.254ProLys: 2.254 ± 1.037
3.757ProLeu: 3.757 ± 0.935
0.376ProMet: 0.376 ± 0.501
1.127ProAsn: 1.127 ± 0.957
3.757ProPro: 3.757 ± 1.441
1.878ProGln: 1.878 ± 0.856
2.254ProArg: 2.254 ± 0.464
5.259ProSer: 5.259 ± 1.606
5.635ProThr: 5.635 ± 0.983
4.884ProVal: 4.884 ± 1.583
0.0ProTrp: 0.0 ± 0.0
1.127ProTyr: 1.127 ± 0.614
0.0ProXaa: 0.0 ± 0.0
Gln
2.63GlnAla: 2.63 ± 0.629
1.127GlnCys: 1.127 ± 0.614
1.127GlnAsp: 1.127 ± 0.783
1.127GlnGlu: 1.127 ± 0.418
2.254GlnPhe: 2.254 ± 0.978
1.878GlnGly: 1.878 ± 0.641
0.751GlnHis: 0.751 ± 0.275
1.127GlnIle: 1.127 ± 0.68
1.503GlnLys: 1.503 ± 0.861
2.63GlnLeu: 2.63 ± 1.107
1.127GlnMet: 1.127 ± 0.963
0.376GlnAsn: 0.376 ± 0.501
1.127GlnPro: 1.127 ± 0.649
2.254GlnGln: 2.254 ± 0.608
3.381GlnArg: 3.381 ± 1.492
2.63GlnSer: 2.63 ± 0.862
1.503GlnThr: 1.503 ± 0.549
1.878GlnVal: 1.878 ± 1.335
0.376GlnTrp: 0.376 ± 0.261
0.751GlnTyr: 0.751 ± 0.755
0.0GlnXaa: 0.0 ± 0.0
Arg
4.508ArgAla: 4.508 ± 1.585
1.503ArgCys: 1.503 ± 0.457
1.127ArgAsp: 1.127 ± 0.433
3.381ArgGlu: 3.381 ± 1.589
2.63ArgPhe: 2.63 ± 1.36
2.63ArgGly: 2.63 ± 0.72
2.254ArgHis: 2.254 ± 1.568
3.757ArgIle: 3.757 ± 1.839
2.63ArgLys: 2.63 ± 1.355
7.137ArgLeu: 7.137 ± 1.108
0.751ArgMet: 0.751 ± 0.503
2.63ArgAsn: 2.63 ± 0.515
4.132ArgPro: 4.132 ± 1.479
1.878ArgGln: 1.878 ± 0.786
7.137ArgArg: 7.137 ± 4.117
4.884ArgSer: 4.884 ± 1.32
4.132ArgThr: 4.132 ± 0.597
3.381ArgVal: 3.381 ± 1.596
0.376ArgTrp: 0.376 ± 0.261
1.503ArgTyr: 1.503 ± 0.853
0.0ArgXaa: 0.0 ± 0.0
Ser
6.386SerAla: 6.386 ± 1.351
1.503SerCys: 1.503 ± 0.816
4.132SerAsp: 4.132 ± 0.727
6.011SerGlu: 6.011 ± 0.525
6.011SerPhe: 6.011 ± 1.116
6.386SerGly: 6.386 ± 2.292
1.878SerHis: 1.878 ± 0.549
3.005SerIle: 3.005 ± 0.996
6.386SerLys: 6.386 ± 0.981
6.011SerLeu: 6.011 ± 1.005
2.254SerMet: 2.254 ± 1.186
3.005SerAsn: 3.005 ± 0.764
7.137SerPro: 7.137 ± 2.107
4.132SerGln: 4.132 ± 1.668
7.889SerArg: 7.889 ± 2.791
8.264SerSer: 8.264 ± 1.829
5.635SerThr: 5.635 ± 1.305
6.762SerVal: 6.762 ± 1.037
0.751SerTrp: 0.751 ± 0.424
3.381SerTyr: 3.381 ± 1.184
0.0SerXaa: 0.0 ± 0.0
Thr
4.884ThrAla: 4.884 ± 1.494
0.376ThrCys: 0.376 ± 0.261
5.259ThrAsp: 5.259 ± 0.842
1.878ThrGlu: 1.878 ± 0.641
4.508ThrPhe: 4.508 ± 0.947
1.878ThrGly: 1.878 ± 0.786
1.127ThrHis: 1.127 ± 0.378
3.381ThrIle: 3.381 ± 1.318
3.757ThrLys: 3.757 ± 1.352
8.64ThrLeu: 8.64 ± 2.055
1.878ThrMet: 1.878 ± 0.605
0.0ThrAsn: 0.0 ± 0.0
0.751ThrPro: 0.751 ± 0.275
1.878ThrGln: 1.878 ± 0.863
3.381ThrArg: 3.381 ± 1.201
5.635ThrSer: 5.635 ± 1.258
3.381ThrThr: 3.381 ± 0.852
4.884ThrVal: 4.884 ± 1.125
0.0ThrTrp: 0.0 ± 0.0
2.254ThrTyr: 2.254 ± 0.866
0.0ThrXaa: 0.0 ± 0.0
Val
5.259ValAla: 5.259 ± 0.529
0.751ValCys: 0.751 ± 0.462
7.137ValAsp: 7.137 ± 2.65
2.254ValGlu: 2.254 ± 0.65
1.878ValPhe: 1.878 ± 0.536
3.757ValGly: 3.757 ± 0.847
2.254ValHis: 2.254 ± 0.865
3.381ValIle: 3.381 ± 1.298
3.381ValLys: 3.381 ± 0.728
7.513ValLeu: 7.513 ± 0.95
1.503ValMet: 1.503 ± 0.641
2.63ValAsn: 2.63 ± 0.531
4.884ValPro: 4.884 ± 0.546
1.503ValGln: 1.503 ± 0.563
5.635ValArg: 5.635 ± 1.443
6.386ValSer: 6.386 ± 1.126
8.264ValThr: 8.264 ± 0.594
6.011ValVal: 6.011 ± 1.353
0.751ValTrp: 0.751 ± 0.755
2.63ValTyr: 2.63 ± 1.704
0.0ValXaa: 0.0 ± 0.0
Trp
0.376TrpAla: 0.376 ± 0.425
1.127TrpCys: 1.127 ± 0.532
0.751TrpAsp: 0.751 ± 0.522
0.0TrpGlu: 0.0 ± 0.0
1.503TrpPhe: 1.503 ± 0.91
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.751TrpLys: 0.751 ± 0.522
0.0TrpLeu: 0.0 ± 0.0
1.127TrpMet: 1.127 ± 0.657
0.376TrpAsn: 0.376 ± 0.261
0.376TrpPro: 0.376 ± 0.317
0.376TrpGln: 0.376 ± 0.501
0.376TrpArg: 0.376 ± 0.261
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.503TrpVal: 1.503 ± 1.049
0.376TrpTrp: 0.376 ± 0.317
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.254TyrAla: 2.254 ± 1.36
0.751TyrCys: 0.751 ± 0.522
4.884TyrAsp: 4.884 ± 1.058
1.503TyrGlu: 1.503 ± 0.818
1.127TyrPhe: 1.127 ± 0.402
1.503TyrGly: 1.503 ± 0.457
0.376TyrHis: 0.376 ± 0.261
3.005TyrIle: 3.005 ± 0.731
1.878TyrLys: 1.878 ± 0.843
1.878TyrLeu: 1.878 ± 0.803
1.127TyrMet: 1.127 ± 0.614
1.127TyrAsn: 1.127 ± 0.657
0.0TyrPro: 0.0 ± 0.0
1.127TyrGln: 1.127 ± 0.72
0.376TyrArg: 0.376 ± 0.47
4.508TyrSer: 4.508 ± 1.192
2.254TyrThr: 2.254 ± 1.056
1.127TyrVal: 1.127 ± 0.433
0.0TyrTrp: 0.0 ± 0.0
0.376TyrTyr: 0.376 ± 0.317
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2663 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski