Amino acid dipepetide frequency for Cucumber mosaic virus (strain FNY) (CMV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.515AlaAla: 6.515 ± 1.362
1.221AlaCys: 1.221 ± 0.549
5.7AlaAsp: 5.7 ± 1.215
6.922AlaGlu: 6.922 ± 1.737
4.886AlaPhe: 4.886 ± 1.239
4.072AlaGly: 4.072 ± 1.275
2.036AlaHis: 2.036 ± 0.757
4.886AlaIle: 4.886 ± 1.111
2.443AlaLys: 2.443 ± 0.655
5.7AlaLeu: 5.7 ± 0.974
3.664AlaMet: 3.664 ± 1.601
2.036AlaAsn: 2.036 ± 1.13
2.85AlaPro: 2.85 ± 0.905
2.036AlaGln: 2.036 ± 0.962
2.85AlaArg: 2.85 ± 2.472
5.7AlaSer: 5.7 ± 2.457
4.072AlaThr: 4.072 ± 0.973
4.479AlaVal: 4.479 ± 1.25
0.814AlaTrp: 0.814 ± 0.303
0.814AlaTyr: 0.814 ± 0.53
0.0AlaXaa: 0.0 ± 0.0
Cys
1.221CysAla: 1.221 ± 0.476
0.407CysCys: 0.407 ± 0.265
2.036CysAsp: 2.036 ± 0.776
2.443CysGlu: 2.443 ± 1.099
1.629CysPhe: 1.629 ± 0.566
1.221CysGly: 1.221 ± 0.795
0.0CysHis: 0.0 ± 0.0
0.814CysIle: 0.814 ± 0.303
0.407CysLys: 0.407 ± 0.265
2.036CysLeu: 2.036 ± 0.757
0.0CysMet: 0.0 ± 0.0
0.814CysAsn: 0.814 ± 0.303
2.85CysPro: 2.85 ± 0.92
0.814CysGln: 0.814 ± 0.303
1.221CysArg: 1.221 ± 1.105
2.85CysSer: 2.85 ± 0.999
0.814CysThr: 0.814 ± 0.53
1.629CysVal: 1.629 ± 0.77
0.0CysTrp: 0.0 ± 0.0
0.814CysTyr: 0.814 ± 0.303
0.0CysXaa: 0.0 ± 0.0
Asp
3.257AspAla: 3.257 ± 0.922
1.221AspCys: 1.221 ± 0.622
4.886AspAsp: 4.886 ± 0.915
2.85AspGlu: 2.85 ± 0.905
2.85AspPhe: 2.85 ± 0.656
4.479AspGly: 4.479 ± 0.972
1.629AspHis: 1.629 ± 0.598
2.036AspIle: 2.036 ± 1.13
4.479AspLys: 4.479 ± 1.092
8.143AspLeu: 8.143 ± 1.914
2.443AspMet: 2.443 ± 0.894
1.221AspAsn: 1.221 ± 0.735
1.629AspPro: 1.629 ± 0.606
1.629AspGln: 1.629 ± 0.606
3.257AspArg: 3.257 ± 1.992
6.107AspSer: 6.107 ± 1.577
3.257AspThr: 3.257 ± 0.761
4.886AspVal: 4.886 ± 1.392
0.814AspTrp: 0.814 ± 0.731
2.443AspTyr: 2.443 ± 1.099
0.0AspXaa: 0.0 ± 0.0
Glu
5.293GluAla: 5.293 ± 1.81
2.443GluCys: 2.443 ± 0.791
3.257GluAsp: 3.257 ± 0.792
3.664GluGlu: 3.664 ± 1.345
3.257GluPhe: 3.257 ± 1.673
2.036GluGly: 2.036 ± 0.783
0.814GluHis: 0.814 ± 0.625
0.814GluIle: 0.814 ± 0.303
3.257GluLys: 3.257 ± 1.211
7.329GluLeu: 7.329 ± 3.414
0.814GluMet: 0.814 ± 0.303
0.814GluAsn: 0.814 ± 0.303
1.629GluPro: 1.629 ± 0.837
2.85GluGln: 2.85 ± 1.019
7.736GluArg: 7.736 ± 1.016
4.886GluSer: 4.886 ± 1.476
3.664GluThr: 3.664 ± 0.654
2.85GluVal: 2.85 ± 0.708
1.221GluTrp: 1.221 ± 0.789
1.221GluTyr: 1.221 ± 0.724
0.0GluXaa: 0.0 ± 0.0
Phe
4.072PheAla: 4.072 ± 0.531
0.814PheCys: 0.814 ± 0.303
4.886PheAsp: 4.886 ± 1.248
3.257PheGlu: 3.257 ± 2.044
1.629PhePhe: 1.629 ± 0.566
2.443PheGly: 2.443 ± 0.909
1.629PheHis: 1.629 ± 1.228
1.221PheIle: 1.221 ± 0.549
2.85PheLys: 2.85 ± 0.708
2.85PheLeu: 2.85 ± 0.854
0.814PheMet: 0.814 ± 0.641
2.443PheAsn: 2.443 ± 0.67
1.629PhePro: 1.629 ± 0.712
2.036PheGln: 2.036 ± 1.317
2.85PheArg: 2.85 ± 1.011
6.107PheSer: 6.107 ± 1.39
1.629PheThr: 1.629 ± 0.579
3.257PheVal: 3.257 ± 0.879
0.407PheTrp: 0.407 ± 0.265
1.629PheTyr: 1.629 ± 0.598
0.0PheXaa: 0.0 ± 0.0
Gly
3.257GlyAla: 3.257 ± 1.409
0.814GlyCys: 0.814 ± 0.303
4.072GlyAsp: 4.072 ± 0.44
2.85GlyGlu: 2.85 ± 0.661
2.036GlyPhe: 2.036 ± 0.832
3.257GlyGly: 3.257 ± 1.23
1.629GlyHis: 1.629 ± 0.598
2.443GlyIle: 2.443 ± 1.218
3.257GlyLys: 3.257 ± 0.885
4.072GlyLeu: 4.072 ± 1.114
1.629GlyMet: 1.629 ± 0.712
2.443GlyAsn: 2.443 ± 0.842
0.814GlyPro: 0.814 ± 0.303
1.221GlyGln: 1.221 ± 0.476
3.257GlyArg: 3.257 ± 1.556
4.479GlySer: 4.479 ± 2.843
4.072GlyThr: 4.072 ± 0.98
4.886GlyVal: 4.886 ± 1.255
0.407GlyTrp: 0.407 ± 0.307
3.257GlyTyr: 3.257 ± 2.221
0.0GlyXaa: 0.0 ± 0.0
His
1.629HisAla: 1.629 ± 0.712
1.221HisCys: 1.221 ± 0.479
2.036HisAsp: 2.036 ± 0.783
0.814HisGlu: 0.814 ± 0.53
1.629HisPhe: 1.629 ± 0.712
2.443HisGly: 2.443 ± 1.244
0.407HisHis: 0.407 ± 0.63
0.814HisIle: 0.814 ± 0.612
2.036HisLys: 2.036 ± 1.261
0.814HisLeu: 0.814 ± 0.303
0.814HisMet: 0.814 ± 0.614
0.814HisAsn: 0.814 ± 0.567
0.407HisPro: 0.407 ± 0.674
1.221HisGln: 1.221 ± 0.532
0.814HisArg: 0.814 ± 0.614
1.221HisSer: 1.221 ± 0.549
0.407HisThr: 0.407 ± 0.265
1.629HisVal: 1.629 ± 0.994
0.407HisTrp: 0.407 ± 0.63
0.814HisTyr: 0.814 ± 0.53
0.0HisXaa: 0.0 ± 0.0
Ile
3.664IleAla: 3.664 ± 0.908
2.036IleCys: 2.036 ± 0.832
3.257IleAsp: 3.257 ± 1.125
2.443IleGlu: 2.443 ± 1.056
0.814IlePhe: 0.814 ± 0.303
2.443IleGly: 2.443 ± 1.019
1.221IleHis: 1.221 ± 0.795
1.629IleIle: 1.629 ± 0.712
3.257IleLys: 3.257 ± 0.918
3.664IleLeu: 3.664 ± 1.15
0.814IleMet: 0.814 ± 0.303
3.664IleAsn: 3.664 ± 0.314
3.664IlePro: 3.664 ± 1.116
0.814IleGln: 0.814 ± 0.625
2.85IleArg: 2.85 ± 1.119
5.293IleSer: 5.293 ± 0.807
2.443IleThr: 2.443 ± 0.68
2.036IleVal: 2.036 ± 0.962
0.814IleTrp: 0.814 ± 0.53
0.814IleTyr: 0.814 ± 0.612
0.0IleXaa: 0.0 ± 0.0
Lys
2.443LysAla: 2.443 ± 1.218
2.036LysCys: 2.036 ± 1.134
1.221LysAsp: 1.221 ± 0.724
2.443LysGlu: 2.443 ± 0.548
2.85LysPhe: 2.85 ± 0.905
1.629LysGly: 1.629 ± 0.77
0.407LysHis: 0.407 ± 0.307
4.886LysIle: 4.886 ± 0.713
4.479LysLys: 4.479 ± 1.138
3.257LysLeu: 3.257 ± 1.389
1.629LysMet: 1.629 ± 0.77
1.221LysAsn: 1.221 ± 0.735
1.629LysPro: 1.629 ± 0.809
2.036LysGln: 2.036 ± 1.296
3.257LysArg: 3.257 ± 0.277
8.55LysSer: 8.55 ± 1.312
3.257LysThr: 3.257 ± 1.359
3.257LysVal: 3.257 ± 1.007
1.629LysTrp: 1.629 ± 0.606
2.443LysTyr: 2.443 ± 0.628
0.0LysXaa: 0.0 ± 0.0
Leu
4.886LeuAla: 4.886 ± 0.817
2.036LeuCys: 2.036 ± 1.16
3.257LeuAsp: 3.257 ± 1.23
3.257LeuGlu: 3.257 ± 0.734
4.072LeuPhe: 4.072 ± 0.853
5.293LeuGly: 5.293 ± 1.304
2.85LeuHis: 2.85 ± 0.689
2.85LeuIle: 2.85 ± 1.048
4.479LeuLys: 4.479 ± 0.751
7.329LeuLeu: 7.329 ± 1.697
1.221LeuMet: 1.221 ± 0.477
5.7LeuAsn: 5.7 ± 0.507
6.107LeuPro: 6.107 ± 2.446
2.036LeuGln: 2.036 ± 0.783
5.293LeuArg: 5.293 ± 0.852
7.329LeuSer: 7.329 ± 2.155
4.072LeuThr: 4.072 ± 1.071
8.55LeuVal: 8.55 ± 1.913
0.407LeuTrp: 0.407 ± 0.307
1.221LeuTyr: 1.221 ± 0.532
0.0LeuXaa: 0.0 ± 0.0
Met
4.072MetAla: 4.072 ± 0.98
0.407MetCys: 0.407 ± 0.265
0.407MetAsp: 0.407 ± 0.674
2.443MetGlu: 2.443 ± 0.842
1.221MetPhe: 1.221 ± 0.724
0.407MetGly: 0.407 ± 0.265
0.0MetHis: 0.0 ± 0.0
1.629MetIle: 1.629 ± 0.837
0.814MetLys: 0.814 ± 0.303
0.814MetLeu: 0.814 ± 0.303
0.814MetMet: 0.814 ± 0.53
1.221MetAsn: 1.221 ± 0.795
0.814MetPro: 0.814 ± 0.614
0.407MetGln: 0.407 ± 0.265
3.664MetArg: 3.664 ± 1.198
2.443MetSer: 2.443 ± 0.548
1.629MetThr: 1.629 ± 0.921
1.629MetVal: 1.629 ± 0.605
0.407MetTrp: 0.407 ± 0.265
0.814MetTyr: 0.814 ± 0.303
0.0MetXaa: 0.0 ± 0.0
Asn
3.257AsnAla: 3.257 ± 1.53
0.407AsnCys: 0.407 ± 0.265
2.036AsnAsp: 2.036 ± 0.757
2.85AsnGlu: 2.85 ± 1.54
1.629AsnPhe: 1.629 ± 0.911
2.443AsnGly: 2.443 ± 0.67
0.814AsnHis: 0.814 ± 0.909
2.036AsnIle: 2.036 ± 0.593
2.443AsnLys: 2.443 ± 1.166
3.664AsnLeu: 3.664 ± 1.045
1.221AsnMet: 1.221 ± 0.479
1.629AsnAsn: 1.629 ± 1.987
0.407AsnPro: 0.407 ± 0.674
0.407AsnGln: 0.407 ± 0.63
2.443AsnArg: 2.443 ± 1.586
2.85AsnSer: 2.85 ± 1.048
1.221AsnThr: 1.221 ± 0.921
2.85AsnVal: 2.85 ± 2.22
0.407AsnTrp: 0.407 ± 0.63
1.629AsnTyr: 1.629 ± 0.946
0.0AsnXaa: 0.0 ± 0.0
Pro
5.293ProAla: 5.293 ± 1.794
1.221ProCys: 1.221 ± 0.549
3.664ProAsp: 3.664 ± 1.116
3.664ProGlu: 3.664 ± 1.037
1.221ProPhe: 1.221 ± 1.007
1.221ProGly: 1.221 ± 0.532
0.407ProHis: 0.407 ± 0.265
2.036ProIle: 2.036 ± 0.776
2.036ProLys: 2.036 ± 1.347
3.664ProLeu: 3.664 ± 0.494
1.221ProMet: 1.221 ± 0.476
0.0ProAsn: 0.0 ± 0.0
3.257ProPro: 3.257 ± 1.125
1.629ProGln: 1.629 ± 1.134
1.221ProArg: 1.221 ± 0.532
2.85ProSer: 2.85 ± 1.011
4.886ProThr: 4.886 ± 1.405
4.072ProVal: 4.072 ± 1.517
0.407ProTrp: 0.407 ± 0.307
0.814ProTyr: 0.814 ± 0.614
0.0ProXaa: 0.0 ± 0.0
Gln
2.443GlnAla: 2.443 ± 0.972
1.221GlnCys: 1.221 ± 0.735
1.629GlnAsp: 1.629 ± 1.059
0.814GlnGlu: 0.814 ± 0.303
2.443GlnPhe: 2.443 ± 0.909
1.629GlnGly: 1.629 ± 0.468
0.407GlnHis: 0.407 ± 0.265
1.221GlnIle: 1.221 ± 0.532
1.221GlnLys: 1.221 ± 0.476
4.072GlnLeu: 4.072 ± 0.832
0.0GlnMet: 0.0 ± 0.0
0.814GlnAsn: 0.814 ± 1.114
1.629GlnPro: 1.629 ± 0.77
1.629GlnGln: 1.629 ± 0.778
3.664GlnArg: 3.664 ± 1.453
2.85GlnSer: 2.85 ± 0.92
2.443GlnThr: 2.443 ± 0.548
2.036GlnVal: 2.036 ± 1.388
0.407GlnTrp: 0.407 ± 0.265
1.221GlnTyr: 1.221 ± 0.804
0.0GlnXaa: 0.0 ± 0.0
Arg
4.479ArgAla: 4.479 ± 0.567
2.443ArgCys: 2.443 ± 0.378
1.629ArgAsp: 1.629 ± 0.606
3.664ArgGlu: 3.664 ± 0.862
2.85ArgPhe: 2.85 ± 0.658
3.664ArgGly: 3.664 ± 1.608
2.443ArgHis: 2.443 ± 1.441
4.072ArgIle: 4.072 ± 1.059
3.257ArgLys: 3.257 ± 1.111
6.107ArgLeu: 6.107 ± 1.647
1.221ArgMet: 1.221 ± 0.593
2.443ArgAsn: 2.443 ± 0.628
4.072ArgPro: 4.072 ± 1.694
1.629ArgGln: 1.629 ± 0.606
5.293ArgArg: 5.293 ± 3.6
4.886ArgSer: 4.886 ± 1.689
4.072ArgThr: 4.072 ± 1.96
4.886ArgVal: 4.886 ± 1.36
0.407ArgTrp: 0.407 ± 0.265
0.814ArgTyr: 0.814 ± 0.53
0.0ArgXaa: 0.0 ± 0.0
Ser
6.515SerAla: 6.515 ± 3.382
1.221SerCys: 1.221 ± 0.746
6.107SerAsp: 6.107 ± 1.869
7.736SerGlu: 7.736 ± 2.672
5.293SerPhe: 5.293 ± 1.304
4.886SerGly: 4.886 ± 1.519
1.221SerHis: 1.221 ± 0.593
4.072SerIle: 4.072 ± 1.126
5.293SerLys: 5.293 ± 0.584
5.293SerLeu: 5.293 ± 1.304
0.407SerMet: 0.407 ± 0.265
3.257SerAsn: 3.257 ± 1.196
4.072SerPro: 4.072 ± 1.819
6.107SerGln: 6.107 ± 1.05
6.107SerArg: 6.107 ± 1.708
9.365SerSer: 9.365 ± 1.665
6.107SerThr: 6.107 ± 1.493
5.293SerVal: 5.293 ± 1.234
0.407SerTrp: 0.407 ± 0.265
4.072SerTyr: 4.072 ± 1.381
0.0SerXaa: 0.0 ± 0.0
Thr
4.479ThrAla: 4.479 ± 1.338
0.0ThrCys: 0.0 ± 0.0
3.664ThrAsp: 3.664 ± 0.619
2.85ThrGlu: 2.85 ± 0.441
4.072ThrPhe: 4.072 ± 1.396
3.257ThrGly: 3.257 ± 0.792
2.036ThrHis: 2.036 ± 1.001
3.257ThrIle: 3.257 ± 0.879
4.072ThrLys: 4.072 ± 1.122
5.293ThrLeu: 5.293 ± 0.642
2.85ThrMet: 2.85 ± 1.048
0.814ThrAsn: 0.814 ± 1.114
2.443ThrPro: 2.443 ± 1.099
2.036ThrGln: 2.036 ± 0.969
3.257ThrArg: 3.257 ± 0.918
5.293ThrSer: 5.293 ± 1.373
2.85ThrThr: 2.85 ± 0.708
4.886ThrVal: 4.886 ± 1.506
0.0ThrTrp: 0.0 ± 0.0
3.257ThrTyr: 3.257 ± 0.752
0.0ThrXaa: 0.0 ± 0.0
Val
5.293ValAla: 5.293 ± 0.906
0.814ValCys: 0.814 ± 0.567
4.886ValAsp: 4.886 ± 0.908
2.85ValGlu: 2.85 ± 1.412
1.629ValPhe: 1.629 ± 0.712
4.479ValGly: 4.479 ± 0.931
1.629ValHis: 1.629 ± 1.059
3.664ValIle: 3.664 ± 1.512
2.443ValLys: 2.443 ± 1.099
6.922ValLeu: 6.922 ± 2.582
2.036ValMet: 2.036 ± 0.696
3.257ValAsn: 3.257 ± 0.79
4.886ValPro: 4.886 ± 1.083
1.629ValGln: 1.629 ± 0.579
4.072ValArg: 4.072 ± 1.349
6.107ValSer: 6.107 ± 0.931
7.329ValThr: 7.329 ± 1.308
3.257ValVal: 3.257 ± 0.917
0.407ValTrp: 0.407 ± 0.674
1.629ValTyr: 1.629 ± 1.294
0.0ValXaa: 0.0 ± 0.0
Trp
0.407TrpAla: 0.407 ± 0.767
1.221TrpCys: 1.221 ± 0.921
0.407TrpAsp: 0.407 ± 0.265
0.407TrpGlu: 0.407 ± 0.265
1.629TrpPhe: 1.629 ± 0.605
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.814TrpLys: 0.814 ± 0.303
0.0TrpLeu: 0.0 ± 0.0
1.221TrpMet: 1.221 ± 0.622
1.221TrpAsn: 1.221 ± 0.795
0.0TrpPro: 0.0 ± 0.0
0.407TrpGln: 0.407 ± 0.63
0.407TrpArg: 0.407 ± 0.265
0.407TrpSer: 0.407 ± 0.307
0.0TrpThr: 0.0 ± 0.0
1.221TrpVal: 1.221 ± 0.724
0.407TrpTrp: 0.407 ± 0.307
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.221TyrAla: 1.221 ± 1.295
0.814TyrCys: 0.814 ± 0.53
4.479TyrAsp: 4.479 ± 0.935
1.629TyrGlu: 1.629 ± 0.712
0.814TyrPhe: 0.814 ± 0.303
2.85TyrGly: 2.85 ± 0.905
0.814TyrHis: 0.814 ± 0.303
3.257TyrIle: 3.257 ± 0.752
1.221TyrLys: 1.221 ± 0.549
1.629TyrLeu: 1.629 ± 1.074
0.814TyrMet: 0.814 ± 0.612
0.814TyrAsn: 0.814 ± 0.53
0.0TyrPro: 0.0 ± 0.0
1.221TyrGln: 1.221 ± 1.105
0.814TyrArg: 0.814 ± 0.303
3.257TyrSer: 3.257 ± 1.168
2.443TyrThr: 2.443 ± 1.056
1.629TyrVal: 1.629 ± 0.712
0.0TyrTrp: 0.0 ± 0.0
1.221TyrTyr: 1.221 ± 0.718
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2457 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski