Amino acid dipepetide frequency for Cassava common mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.558AlaAla: 7.558 ± 3.416
1.417AlaCys: 1.417 ± 0.585
1.889AlaAsp: 1.889 ± 1.845
2.834AlaGlu: 2.834 ± 0.636
3.779AlaPhe: 3.779 ± 1.299
7.085AlaGly: 7.085 ± 2.326
0.472AlaHis: 0.472 ± 0.914
1.417AlaIle: 1.417 ± 0.691
6.141AlaLys: 6.141 ± 1.192
10.864AlaLeu: 10.864 ± 3.64
1.889AlaMet: 1.889 ± 1.035
4.251AlaAsn: 4.251 ± 1.719
3.779AlaPro: 3.779 ± 0.799
6.141AlaGln: 6.141 ± 1.5
5.196AlaArg: 5.196 ± 1.252
6.141AlaSer: 6.141 ± 2.048
6.141AlaThr: 6.141 ± 3.585
6.613AlaVal: 6.613 ± 2.85
0.0AlaTrp: 0.0 ± 0.0
1.889AlaTyr: 1.889 ± 0.661
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.945CysGlu: 0.945 ± 0.518
0.472CysPhe: 0.472 ± 0.914
0.472CysGly: 0.472 ± 0.259
0.0CysHis: 0.0 ± 0.0
0.472CysIle: 0.472 ± 0.974
0.945CysLys: 0.945 ± 0.618
3.779CysLeu: 3.779 ± 1.304
0.0CysMet: 0.0 ± 0.0
0.472CysAsn: 0.472 ± 0.259
0.945CysPro: 0.945 ± 1.49
0.472CysGln: 0.472 ± 0.259
0.945CysArg: 0.945 ± 1.422
1.417CysSer: 1.417 ± 1.759
0.472CysThr: 0.472 ± 0.745
0.472CysVal: 0.472 ± 0.259
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.417AspAla: 1.417 ± 0.691
0.945AspCys: 0.945 ± 1.401
1.417AspAsp: 1.417 ± 0.777
4.251AspGlu: 4.251 ± 2.223
1.889AspPhe: 1.889 ± 0.707
3.307AspGly: 3.307 ± 1.753
1.417AspHis: 1.417 ± 0.777
5.196AspIle: 5.196 ± 2.848
1.889AspLys: 1.889 ± 0.661
5.668AspLeu: 5.668 ± 1.947
0.945AspMet: 0.945 ± 0.518
2.834AspAsn: 2.834 ± 1.142
2.834AspPro: 2.834 ± 1.17
1.417AspGln: 1.417 ± 0.585
0.0AspArg: 0.0 ± 0.0
3.779AspSer: 3.779 ± 1.035
1.417AspThr: 1.417 ± 0.777
0.945AspVal: 0.945 ± 0.618
0.945AspTrp: 0.945 ± 0.768
0.945AspTyr: 0.945 ± 0.867
0.0AspXaa: 0.0 ± 0.0
Glu
7.558GluAla: 7.558 ± 2.634
0.472GluCys: 0.472 ± 0.259
1.889GluAsp: 1.889 ± 1.035
4.724GluGlu: 4.724 ± 1.858
1.889GluPhe: 1.889 ± 1.035
2.834GluGly: 2.834 ± 1.385
0.472GluHis: 0.472 ± 0.259
2.834GluIle: 2.834 ± 1.385
4.251GluLys: 4.251 ± 2.33
11.337GluLeu: 11.337 ± 3.212
0.945GluMet: 0.945 ± 0.733
2.834GluAsn: 2.834 ± 1.553
5.196GluPro: 5.196 ± 1.889
2.362GluGln: 2.362 ± 0.699
2.834GluArg: 2.834 ± 1.553
2.834GluSer: 2.834 ± 1.42
3.307GluThr: 3.307 ± 0.674
1.417GluVal: 1.417 ± 0.777
1.417GluTrp: 1.417 ± 0.777
2.834GluTyr: 2.834 ± 1.014
0.0GluXaa: 0.0 ± 0.0
Phe
1.889PheAla: 1.889 ± 0.707
2.362PheCys: 2.362 ± 2.266
3.779PheAsp: 3.779 ± 2.211
3.307PheGlu: 3.307 ± 0.674
1.417PhePhe: 1.417 ± 2.2
0.945PheGly: 0.945 ± 0.518
0.0PheHis: 0.0 ± 0.0
2.834PheIle: 2.834 ± 0.934
1.417PheLys: 1.417 ± 0.777
4.251PheLeu: 4.251 ± 1.574
1.889PheMet: 1.889 ± 1.035
0.945PheAsn: 0.945 ± 0.518
0.945PhePro: 0.945 ± 1.401
1.417PheGln: 1.417 ± 0.777
1.889PheArg: 1.889 ± 1.035
3.307PheSer: 3.307 ± 1.507
2.834PheThr: 2.834 ± 1.472
2.362PheVal: 2.362 ± 0.816
0.945PheTrp: 0.945 ± 0.867
0.472PheTyr: 0.472 ± 0.259
0.0PheXaa: 0.0 ± 0.0
Gly
4.724GlyAla: 4.724 ± 1.124
0.945GlyCys: 0.945 ± 1.401
3.779GlyAsp: 3.779 ± 1.473
3.307GlyGlu: 3.307 ± 1.812
2.834GlyPhe: 2.834 ± 1.095
3.307GlyGly: 3.307 ± 2.299
2.362GlyHis: 2.362 ± 0.953
3.307GlyIle: 3.307 ± 2.044
4.251GlyLys: 4.251 ± 1.32
4.724GlyLeu: 4.724 ± 2.72
0.0GlyMet: 0.0 ± 0.0
0.945GlyAsn: 0.945 ± 0.768
3.307GlyPro: 3.307 ± 3.677
0.945GlyGln: 0.945 ± 0.518
1.889GlyArg: 1.889 ± 1.44
2.362GlySer: 2.362 ± 1.438
2.834GlyThr: 2.834 ± 0.934
2.834GlyVal: 2.834 ± 1.486
1.417GlyTrp: 1.417 ± 0.585
2.362GlyTyr: 2.362 ± 0.947
0.0GlyXaa: 0.0 ± 0.0
His
2.362HisAla: 2.362 ± 1.296
0.0HisCys: 0.0 ± 0.0
0.472HisAsp: 0.472 ± 0.745
1.889HisGlu: 1.889 ± 1.035
2.834HisPhe: 2.834 ± 1.17
2.834HisGly: 2.834 ± 1.472
1.889HisHis: 1.889 ± 1.733
1.417HisIle: 1.417 ± 1.028
1.889HisLys: 1.889 ± 1.035
2.362HisLeu: 2.362 ± 0.953
0.0HisMet: 0.0 ± 0.0
1.417HisAsn: 1.417 ± 0.777
1.889HisPro: 1.889 ± 1.312
0.945HisGln: 0.945 ± 0.518
2.362HisArg: 2.362 ± 0.953
0.945HisSer: 0.945 ± 0.618
3.307HisThr: 3.307 ± 0.922
0.945HisVal: 0.945 ± 1.348
0.0HisTrp: 0.0 ± 0.0
0.945HisTyr: 0.945 ± 0.518
0.0HisXaa: 0.0 ± 0.0
Ile
5.668IleAla: 5.668 ± 1.272
0.472IleCys: 0.472 ± 0.745
0.945IleAsp: 0.945 ± 1.751
2.834IleGlu: 2.834 ± 1.014
2.834IlePhe: 2.834 ± 1.659
2.362IleGly: 2.362 ± 3.329
3.779IleHis: 3.779 ± 2.076
3.779IleIle: 3.779 ± 3.133
8.03IleLys: 8.03 ± 2.886
5.196IleLeu: 5.196 ± 2.257
1.889IleMet: 1.889 ± 1.035
2.362IleAsn: 2.362 ± 0.816
3.779IlePro: 3.779 ± 1.415
2.834IleGln: 2.834 ± 1.014
1.417IleArg: 1.417 ± 0.585
3.307IleSer: 3.307 ± 3.376
5.196IleThr: 5.196 ± 0.894
0.945IleVal: 0.945 ± 0.518
0.472IleTrp: 0.472 ± 0.914
1.889IleTyr: 1.889 ± 1.035
0.0IleXaa: 0.0 ± 0.0
Lys
7.558LysAla: 7.558 ± 1.9
0.472LysCys: 0.472 ± 0.259
3.307LysAsp: 3.307 ± 1.812
4.251LysGlu: 4.251 ± 2.33
1.889LysPhe: 1.889 ± 0.871
3.307LysGly: 3.307 ± 1.202
1.417LysHis: 1.417 ± 0.691
4.251LysIle: 4.251 ± 0.978
4.251LysLys: 4.251 ± 1.709
9.92LysLeu: 9.92 ± 4.157
0.945LysMet: 0.945 ± 0.518
0.945LysAsn: 0.945 ± 0.518
5.196LysPro: 5.196 ± 1.31
1.889LysGln: 1.889 ± 1.035
2.362LysArg: 2.362 ± 1.294
5.196LysSer: 5.196 ± 1.681
6.141LysThr: 6.141 ± 2.092
1.889LysVal: 1.889 ± 0.84
0.472LysTrp: 0.472 ± 0.259
0.945LysTyr: 0.945 ± 1.242
0.0LysXaa: 0.0 ± 0.0
Leu
7.085LeuAla: 7.085 ± 1.087
0.472LeuCys: 0.472 ± 0.974
5.668LeuAsp: 5.668 ± 1.272
7.558LeuGlu: 7.558 ± 2.092
4.251LeuPhe: 4.251 ± 1.316
6.141LeuGly: 6.141 ± 2.092
2.362LeuHis: 2.362 ± 1.334
6.613LeuIle: 6.613 ± 3.074
8.503LeuLys: 8.503 ± 3.24
8.503LeuLeu: 8.503 ± 2.354
0.945LeuMet: 0.945 ± 0.802
3.779LeuAsn: 3.779 ± 0.799
7.558LeuPro: 7.558 ± 2.56
6.613LeuGln: 6.613 ± 3.838
4.724LeuArg: 4.724 ± 1.582
11.337LeuSer: 11.337 ± 2.456
4.251LeuThr: 4.251 ± 2.897
4.724LeuVal: 4.724 ± 2.267
2.362LeuTrp: 2.362 ± 1.294
3.307LeuTyr: 3.307 ± 1.235
0.0LeuXaa: 0.0 ± 0.0
Met
1.417MetAla: 1.417 ± 0.691
0.472MetCys: 0.472 ± 0.259
1.889MetAsp: 1.889 ± 0.661
0.945MetGlu: 0.945 ± 1.401
0.945MetPhe: 0.945 ± 0.518
0.472MetGly: 0.472 ± 0.259
0.472MetHis: 0.472 ± 0.259
1.889MetIle: 1.889 ± 1.035
0.945MetLys: 0.945 ± 0.518
2.362MetLeu: 2.362 ± 1.294
0.0MetMet: 0.0 ± 0.0
0.945MetAsn: 0.945 ± 0.768
2.362MetPro: 2.362 ± 1.294
0.0MetGln: 0.0 ± 0.0
1.889MetArg: 1.889 ± 0.707
0.472MetSer: 0.472 ± 0.974
0.0MetThr: 0.0 ± 0.0
1.417MetVal: 1.417 ± 0.777
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.251AsnAla: 4.251 ± 2.074
1.417AsnCys: 1.417 ± 0.777
1.417AsnAsp: 1.417 ± 0.585
1.889AsnGlu: 1.889 ± 1.035
2.834AsnPhe: 2.834 ± 1.259
1.417AsnGly: 1.417 ± 0.777
0.472AsnHis: 0.472 ± 0.259
2.834AsnIle: 2.834 ± 1.095
2.362AsnLys: 2.362 ± 1.294
3.307AsnLeu: 3.307 ± 3.459
1.889AsnMet: 1.889 ± 1.648
0.945AsnAsn: 0.945 ± 0.768
1.417AsnPro: 1.417 ± 1.175
0.472AsnGln: 0.472 ± 0.259
0.945AsnArg: 0.945 ± 0.867
1.889AsnSer: 1.889 ± 0.871
2.362AsnThr: 2.362 ± 1.294
1.889AsnVal: 1.889 ± 0.707
0.472AsnTrp: 0.472 ± 0.259
2.362AsnTyr: 2.362 ± 1.294
0.0AsnXaa: 0.0 ± 0.0
Pro
3.307ProAla: 3.307 ± 1.346
0.472ProCys: 0.472 ± 0.259
2.362ProAsp: 2.362 ± 0.816
7.558ProGlu: 7.558 ± 2.701
1.417ProPhe: 1.417 ± 0.585
2.834ProGly: 2.834 ± 1.462
1.417ProHis: 1.417 ± 2.231
4.251ProIle: 4.251 ± 1.462
5.668ProLys: 5.668 ± 1.863
3.779ProLeu: 3.779 ± 1.59
0.472ProMet: 0.472 ± 0.259
1.417ProAsn: 1.417 ± 1.175
3.779ProPro: 3.779 ± 2.266
4.251ProGln: 4.251 ± 1.458
2.362ProArg: 2.362 ± 0.699
5.196ProSer: 5.196 ± 0.945
7.558ProThr: 7.558 ± 5.915
2.834ProVal: 2.834 ± 1.014
0.472ProTrp: 0.472 ± 0.259
2.362ProTyr: 2.362 ± 1.334
0.0ProXaa: 0.0 ± 0.0
Gln
3.307GlnAla: 3.307 ± 1.678
0.0GlnCys: 0.0 ± 0.0
2.362GlnAsp: 2.362 ± 0.816
3.307GlnGlu: 3.307 ± 0.674
1.889GlnPhe: 1.889 ± 1.535
1.889GlnGly: 1.889 ± 1.44
1.889GlnHis: 1.889 ± 0.661
4.251GlnIle: 4.251 ± 1.393
0.472GlnLys: 0.472 ± 0.259
3.779GlnLeu: 3.779 ± 1.187
1.417GlnMet: 1.417 ± 0.777
0.472GlnAsn: 0.472 ± 0.745
1.889GlnPro: 1.889 ± 1.312
2.362GlnGln: 2.362 ± 1.294
0.945GlnArg: 0.945 ± 0.518
4.251GlnSer: 4.251 ± 0.949
5.668GlnThr: 5.668 ± 2.453
2.362GlnVal: 2.362 ± 1.438
0.472GlnTrp: 0.472 ± 0.259
1.417GlnTyr: 1.417 ± 0.777
0.0GlnXaa: 0.0 ± 0.0
Arg
2.362ArgAla: 2.362 ± 0.699
0.0ArgCys: 0.0 ± 0.0
3.307ArgAsp: 3.307 ± 1.333
3.779ArgGlu: 3.779 ± 0.799
0.472ArgPhe: 0.472 ± 0.259
2.362ArgGly: 2.362 ± 0.811
1.889ArgHis: 1.889 ± 1.515
1.889ArgIle: 1.889 ± 0.661
2.362ArgLys: 2.362 ± 0.811
3.307ArgLeu: 3.307 ± 1.374
0.945ArgMet: 0.945 ± 0.518
2.362ArgAsn: 2.362 ± 1.334
2.834ArgPro: 2.834 ± 0.934
2.362ArgGln: 2.362 ± 0.699
2.362ArgArg: 2.362 ± 1.294
3.307ArgSer: 3.307 ± 1.753
3.307ArgThr: 3.307 ± 0.985
1.417ArgVal: 1.417 ± 1.028
0.0ArgTrp: 0.0 ± 0.0
2.362ArgTyr: 2.362 ± 0.811
0.0ArgXaa: 0.0 ± 0.0
Ser
6.141SerAla: 6.141 ± 3.321
1.417SerCys: 1.417 ± 1.825
1.889SerAsp: 1.889 ± 1.035
4.251SerGlu: 4.251 ± 1.62
2.834SerPhe: 2.834 ± 1.17
3.307SerGly: 3.307 ± 2.937
3.307SerHis: 3.307 ± 1.221
2.834SerIle: 2.834 ± 1.142
5.668SerLys: 5.668 ± 1.314
5.196SerLeu: 5.196 ± 4.181
0.945SerMet: 0.945 ± 0.518
3.307SerAsn: 3.307 ± 1.616
5.668SerPro: 5.668 ± 1.006
4.251SerGln: 4.251 ± 2.642
3.307SerArg: 3.307 ± 1.172
8.03SerSer: 8.03 ± 7.319
3.779SerThr: 3.779 ± 2.174
2.362SerVal: 2.362 ± 1.294
0.945SerTrp: 0.945 ± 0.518
2.834SerTyr: 2.834 ± 1.272
0.0SerXaa: 0.0 ± 0.0
Thr
7.085ThrAla: 7.085 ± 3.927
0.472ThrCys: 0.472 ± 0.259
3.307ThrAsp: 3.307 ± 0.922
3.307ThrGlu: 3.307 ± 1.235
2.362ThrPhe: 2.362 ± 1.294
3.307ThrGly: 3.307 ± 0.674
4.724ThrHis: 4.724 ± 1.632
4.251ThrIle: 4.251 ± 2.0
2.834ThrLys: 2.834 ± 1.142
7.085ThrLeu: 7.085 ± 1.576
0.945ThrMet: 0.945 ± 0.518
4.251ThrAsn: 4.251 ± 1.574
7.085ThrPro: 7.085 ± 3.465
3.307ThrGln: 3.307 ± 1.374
2.834ThrArg: 2.834 ± 1.826
4.251ThrSer: 4.251 ± 1.008
3.779ThrThr: 3.779 ± 4.342
4.724ThrVal: 4.724 ± 2.774
0.472ThrTrp: 0.472 ± 1.512
0.945ThrTyr: 0.945 ± 0.518
0.0ThrXaa: 0.0 ± 0.0
Val
4.251ValAla: 4.251 ± 2.79
0.0ValCys: 0.0 ± 0.0
2.362ValAsp: 2.362 ± 0.811
2.362ValGlu: 2.362 ± 0.811
1.417ValPhe: 1.417 ± 0.777
2.362ValGly: 2.362 ± 0.953
1.417ValHis: 1.417 ± 0.585
3.307ValIle: 3.307 ± 0.985
3.307ValLys: 3.307 ± 1.123
5.668ValLeu: 5.668 ± 1.481
0.945ValMet: 0.945 ± 0.768
1.417ValAsn: 1.417 ± 0.691
1.889ValPro: 1.889 ± 0.707
1.889ValGln: 1.889 ± 0.84
3.307ValArg: 3.307 ± 1.346
1.417ValSer: 1.417 ± 1.175
3.779ValThr: 3.779 ± 0.799
2.362ValVal: 2.362 ± 2.629
0.472ValTrp: 0.472 ± 0.259
0.945ValTyr: 0.945 ± 0.867
0.0ValXaa: 0.0 ± 0.0
Trp
3.307TrpAla: 3.307 ± 1.432
0.0TrpCys: 0.0 ± 0.0
0.472TrpAsp: 0.472 ± 0.259
1.417TrpGlu: 1.417 ± 0.777
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.472TrpLys: 0.472 ± 0.259
1.417TrpLeu: 1.417 ± 0.777
0.472TrpMet: 0.472 ± 0.259
0.945TrpAsn: 0.945 ± 0.768
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.472TrpArg: 0.472 ± 0.259
0.472TrpSer: 0.472 ± 0.745
1.417TrpThr: 1.417 ± 0.777
1.417TrpVal: 1.417 ± 0.777
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.251TyrAla: 4.251 ± 1.499
0.472TyrCys: 0.472 ± 0.745
0.945TyrAsp: 0.945 ± 0.518
0.0TyrGlu: 0.0 ± 0.0
0.945TyrPhe: 0.945 ± 0.518
1.889TyrGly: 1.889 ± 1.625
0.945TyrHis: 0.945 ± 0.518
2.362TyrIle: 2.362 ± 1.334
0.945TyrLys: 0.945 ± 0.518
4.251TyrLeu: 4.251 ± 1.709
0.945TyrMet: 0.945 ± 0.518
0.0TyrAsn: 0.0 ± 0.0
1.417TyrPro: 1.417 ± 0.585
0.472TyrGln: 0.472 ± 0.259
0.945TyrArg: 0.945 ± 0.867
2.362TyrSer: 2.362 ± 0.982
3.779TyrThr: 3.779 ± 1.035
0.945TyrVal: 0.945 ± 0.518
0.945TyrTrp: 0.945 ± 0.518
2.362TyrTyr: 2.362 ± 1.922
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2118 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski