Amino acid dipeptide frequency for Okra leaf curl Cameroon virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
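As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 1/400. The function names, the use of one-letter sequence strings, and the choice to count only within each protein are assumptions made for this example; the exact counting procedure used by the database is not stated here.

```python
from collections import Counter

# Uniform expectation: 1 of 400 possible dipeptides = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: dipeptides are overlapping windows of length 2 and are
    counted within each protein only (not across protein boundaries).
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "as expected"


if __name__ == "__main__":
    # Toy sequences (hypothetical, not taken from the proteome above).
    for pair, freq in sorted(dipeptide_permille(["MSSPK", "MAASS"]).items()):
        print(pair, round(freq, 3), classify(freq))
```

With values from the table, classify(6.206) would label AlaAla as overrepresented and classify(0.887) would label AlaCys as underrepresented.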

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
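The unit conversion can be sketched in a few lines of Python. The helper names below are hypothetical and exist only for this example.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


if __name__ == "__main__":
    # AlaAla is listed at 6.206 per mille in the table.
    print(permille_to_fraction(6.206))
    print(permille_to_percent(6.206))
```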

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.206 ± 3.082
AlaCys: 0.887 ± 0.76
AlaAsp: 0.887 ± 0.949
AlaGlu: 1.773 ± 0.871
AlaPhe: 0.887 ± 0.949
AlaGly: 0.887 ± 0.676
AlaHis: 1.773 ± 1.352
AlaIle: 2.66 ± 1.363
AlaLys: 5.319 ± 0.869
AlaLeu: 5.319 ± 2.281
AlaMet: 0.0 ± 0.0
AlaAsn: 3.546 ± 2.23
AlaPro: 3.546 ± 2.139
AlaGln: 1.773 ± 0.871
AlaArg: 6.206 ± 1.875
AlaSer: 5.319 ± 1.046
AlaThr: 3.546 ± 1.394
AlaVal: 1.773 ± 0.972
AlaTrp: 0.887 ± 0.676
AlaTyr: 0.887 ± 0.676
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.887 ± 0.949
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.887 ± 0.76
CysPhe: 1.773 ± 1.513
CysGly: 1.773 ± 0.871
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.773 ± 1.519
CysLeu: 1.773 ± 0.972
CysMet: 2.66 ± 1.303
CysAsn: 0.887 ± 0.676
CysPro: 2.66 ± 1.559
CysGln: 0.887 ± 0.676
CysArg: 1.773 ± 0.871
CysSer: 2.66 ± 1.559
CysThr: 2.66 ± 1.931
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.66 ± 1.237
AspCys: 0.0 ± 0.0
AspAsp: 2.66 ± 1.324
AspGlu: 0.887 ± 0.76
AspPhe: 1.773 ± 0.781
AspGly: 2.66 ± 2.028
AspHis: 0.0 ± 0.0
AspIle: 1.773 ± 1.119
AspLys: 2.66 ± 0.828
AspLeu: 6.206 ± 2.285
AspMet: 0.887 ± 0.629
AspAsn: 2.66 ± 1.931
AspPro: 1.773 ± 0.871
AspGln: 1.773 ± 0.781
AspArg: 3.546 ± 1.329
AspSer: 5.319 ± 2.75
AspThr: 1.773 ± 0.871
AspVal: 5.319 ± 1.117
AspTrp: 1.773 ± 0.871
AspTyr: 1.773 ± 1.237
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.206 ± 1.152
GluCys: 0.0 ± 0.0
GluAsp: 1.773 ± 0.871
GluGlu: 6.206 ± 4.0
GluPhe: 3.546 ± 1.286
GluGly: 4.433 ± 0.933
GluHis: 0.0 ± 0.0
GluIle: 1.773 ± 1.2
GluLys: 0.887 ± 0.676
GluLeu: 7.092 ± 2.036
GluMet: 0.887 ± 0.861
GluAsn: 5.319 ± 1.855
GluPro: 3.546 ± 1.52
GluGln: 0.887 ± 0.676
GluArg: 0.0 ± 0.0
GluSer: 4.433 ± 0.933
GluThr: 1.773 ± 1.237
GluVal: 0.0 ± 0.0
GluTrp: 2.66 ± 1.237
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 4.433 ± 2.117
PheGlu: 1.773 ± 0.871
PhePhe: 1.773 ± 0.781
PheGly: 0.887 ± 0.76
PheHis: 2.66 ± 1.057
PheIle: 0.887 ± 0.676
PheLys: 2.66 ± 1.313
PheLeu: 4.433 ± 1.715
PheMet: 1.773 ± 0.881
PheAsn: 1.773 ± 0.972
PhePro: 0.887 ± 0.828
PheGln: 4.433 ± 1.515
PheArg: 4.433 ± 2.07
PheSer: 1.773 ± 1.237
PheThr: 2.66 ± 2.202
PheVal: 1.773 ± 0.781
PheTrp: 0.887 ± 0.76
PheTyr: 0.887 ± 0.76
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.773 ± 1.352
GlyCys: 2.66 ± 1.66
GlyAsp: 3.546 ± 1.601
GlyGlu: 3.546 ± 1.632
GlyPhe: 1.773 ± 1.282
GlyGly: 3.546 ± 1.849
GlyHis: 0.887 ± 0.676
GlyIle: 3.546 ± 1.304
GlyLys: 4.433 ± 1.97
GlyLeu: 3.546 ± 1.697
GlyMet: 0.0 ± 0.0
GlyAsn: 4.433 ± 2.232
GlyPro: 3.546 ± 1.563
GlyGln: 4.433 ± 1.85
GlyArg: 2.66 ± 1.237
GlySer: 2.66 ± 1.559
GlyThr: 2.66 ± 0.828
GlyVal: 2.66 ± 2.164
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.773 ± 1.061
HisCys: 1.773 ± 1.282
HisAsp: 0.0 ± 0.0
HisGlu: 1.773 ± 0.871
HisPhe: 1.773 ± 1.352
HisGly: 1.773 ± 1.2
HisHis: 1.773 ± 1.119
HisIle: 1.773 ± 1.483
HisLys: 1.773 ± 1.2
HisLeu: 2.66 ± 2.028
HisMet: 0.0 ± 0.0
HisAsn: 2.66 ± 1.313
HisPro: 0.887 ± 0.676
HisGln: 1.773 ± 1.119
HisArg: 2.66 ± 1.931
HisSer: 0.887 ± 1.174
HisThr: 2.66 ± 2.279
HisVal: 2.66 ± 0.746
HisTrp: 0.0 ± 0.0
HisTyr: 1.773 ± 0.871
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.0 ± 0.0
IleCys: 1.773 ± 1.237
IleAsp: 3.546 ± 1.793
IleGlu: 0.887 ± 0.676
IlePhe: 2.66 ± 1.237
IleGly: 1.773 ± 1.483
IleHis: 0.887 ± 0.861
IleIle: 1.773 ± 1.519
IleLys: 6.206 ± 1.389
IleLeu: 0.887 ± 0.76
IleMet: 1.773 ± 1.298
IleAsn: 3.546 ± 1.409
IlePro: 2.66 ± 1.153
IleGln: 4.433 ± 1.26
IleArg: 7.979 ± 2.651
IleSer: 6.206 ± 1.483
IleThr: 2.66 ± 1.044
IleVal: 1.773 ± 0.781
IleTrp: 0.887 ± 0.861
IleTyr: 1.773 ± 0.871
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.773 ± 1.656
LysCys: 3.546 ± 1.762
LysAsp: 3.546 ± 2.704
LysGlu: 4.433 ± 1.445
LysPhe: 1.773 ± 0.871
LysGly: 1.773 ± 1.237
LysHis: 0.887 ± 0.676
LysIle: 4.433 ± 1.357
LysLys: 2.66 ± 0.828
LysLeu: 0.887 ± 0.861
LysMet: 0.0 ± 0.0
LysAsn: 3.546 ± 1.909
LysPro: 2.66 ± 0.88
LysGln: 1.773 ± 1.2
LysArg: 7.092 ± 3.592
LysSer: 5.319 ± 2.344
LysThr: 2.66 ± 1.313
LysVal: 3.546 ± 1.329
LysTrp: 0.0 ± 0.0
LysTyr: 3.546 ± 1.156
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.773 ± 0.871
LeuCys: 1.773 ± 1.352
LeuAsp: 3.546 ± 1.742
LeuGlu: 5.319 ± 2.641
LeuPhe: 1.773 ± 0.881
LeuGly: 6.206 ± 1.658
LeuHis: 1.773 ± 1.352
LeuIle: 5.319 ± 2.308
LeuLys: 6.206 ± 1.358
LeuLeu: 3.546 ± 1.439
LeuMet: 0.0 ± 1.063
LeuAsn: 3.546 ± 1.158
LeuPro: 1.773 ± 1.311
LeuGln: 6.206 ± 1.939
LeuArg: 2.66 ± 1.044
LeuSer: 7.092 ± 2.709
LeuThr: 5.319 ± 2.247
LeuVal: 2.66 ± 0.88
LeuTrp: 0.0 ± 0.0
LeuTyr: 6.206 ± 3.084
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.66 ± 1.106
MetCys: 0.887 ± 1.174
MetAsp: 2.66 ± 1.227
MetGlu: 0.887 ± 1.174
MetPhe: 2.66 ± 1.66
MetGly: 0.887 ± 0.676
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.887 ± 0.861
MetLeu: 1.773 ± 1.656
MetMet: 0.0 ± 0.0
MetAsn: 1.773 ± 0.972
MetPro: 0.0 ± 0.0
MetGln: 0.887 ± 0.861
MetArg: 0.887 ± 0.949
MetSer: 0.887 ± 0.76
MetThr: 0.887 ± 1.174
MetVal: 0.887 ± 0.861
MetTrp: 0.887 ± 0.828
MetTyr: 3.546 ± 3.038
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.319 ± 1.322
AsnCys: 0.0 ± 0.0
AsnAsp: 2.66 ± 1.248
AsnGlu: 1.773 ± 1.061
AsnPhe: 2.66 ± 0.746
AsnGly: 4.433 ± 2.351
AsnHis: 5.319 ± 2.954
AsnIle: 1.773 ± 0.881
AsnLys: 1.773 ± 1.352
AsnLeu: 4.433 ± 1.499
AsnMet: 1.773 ± 1.494
AsnAsn: 3.546 ± 1.84
AsnPro: 3.546 ± 1.158
AsnGln: 5.319 ± 1.595
AsnArg: 1.773 ± 1.237
AsnSer: 5.319 ± 3.263
AsnThr: 2.66 ± 0.88
AsnVal: 6.206 ± 1.752
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.546 ± 1.666
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.546 ± 1.472
ProCys: 2.66 ± 1.363
ProAsp: 2.66 ± 1.363
ProGlu: 2.66 ± 1.219
ProPhe: 0.887 ± 0.676
ProGly: 2.66 ± 1.248
ProHis: 3.546 ± 1.909
ProIle: 2.66 ± 1.692
ProLys: 2.66 ± 2.028
ProLeu: 4.433 ± 2.145
ProMet: 2.66 ± 1.301
ProAsn: 4.433 ± 2.541
ProPro: 2.66 ± 1.153
ProGln: 3.546 ± 2.384
ProArg: 2.66 ± 1.649
ProSer: 3.546 ± 2.238
ProThr: 4.433 ± 2.209
ProVal: 2.66 ± 1.385
ProTrp: 0.0 ± 0.0
ProTyr: 2.66 ± 1.385
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.773 ± 1.061
GlnCys: 0.0 ± 0.0
GlnAsp: 1.773 ± 0.781
GlnGlu: 4.433 ± 1.515
GlnPhe: 0.887 ± 0.676
GlnGly: 3.546 ± 1.943
GlnHis: 1.773 ± 1.483
GlnIle: 7.092 ± 3.097
GlnLys: 2.66 ± 1.559
GlnLeu: 1.773 ± 0.871
GlnMet: 0.887 ± 0.949
GlnAsn: 3.546 ± 1.687
GlnPro: 3.546 ± 1.329
GlnGln: 2.66 ± 1.237
GlnArg: 3.546 ± 2.143
GlnSer: 3.546 ± 1.017
GlnThr: 5.319 ± 2.728
GlnVal: 3.546 ± 1.329
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.887 ± 0.676
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.546 ± 1.578
ArgCys: 2.66 ± 1.363
ArgAsp: 6.206 ± 2.706
ArgGlu: 5.319 ± 2.621
ArgPhe: 4.433 ± 1.892
ArgGly: 3.546 ± 1.329
ArgHis: 0.887 ± 0.828
ArgIle: 3.546 ± 1.601
ArgLys: 2.66 ± 1.672
ArgLeu: 5.319 ± 2.09
ArgMet: 3.546 ± 2.278
ArgAsn: 0.0 ± 0.0
ArgPro: 6.206 ± 1.591
ArgGln: 3.546 ± 2.57
ArgArg: 7.979 ± 4.394
ArgSer: 5.319 ± 1.573
ArgThr: 3.546 ± 0.94
ArgVal: 1.773 ± 0.881
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.773 ± 1.061
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.546 ± 1.366
SerCys: 0.887 ± 0.828
SerAsp: 3.546 ± 0.953
SerGlu: 0.887 ± 0.861
SerPhe: 2.66 ± 0.828
SerGly: 3.546 ± 1.156
SerHis: 0.887 ± 0.949
SerIle: 2.66 ± 1.421
SerLys: 6.206 ± 1.74
SerLeu: 4.433 ± 1.577
SerMet: 0.0 ± 0.0
SerAsn: 10.638 ± 1.741
SerPro: 9.752 ± 2.207
SerGln: 0.887 ± 0.949
SerArg: 5.319 ± 1.744
SerSer: 8.865 ± 3.061
SerThr: 6.206 ± 3.105
SerVal: 5.319 ± 3.183
SerTrp: 1.773 ± 0.781
SerTyr: 2.66 ± 2.028
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.546 ± 1.04
ThrCys: 2.66 ± 1.363
ThrAsp: 0.887 ± 1.174
ThrGlu: 2.66 ± 0.746
ThrPhe: 1.773 ± 1.237
ThrGly: 5.319 ± 2.064
ThrHis: 7.979 ± 3.069
ThrIle: 3.546 ± 2.422
ThrLys: 0.887 ± 0.676
ThrLeu: 3.546 ± 1.472
ThrMet: 1.773 ± 1.2
ThrAsn: 1.773 ± 0.781
ThrPro: 2.66 ± 1.446
ThrGln: 2.66 ± 1.324
ThrArg: 2.66 ± 1.303
ThrSer: 5.319 ± 2.078
ThrThr: 0.887 ± 0.861
ThrVal: 5.319 ± 1.858
ThrTrp: 0.887 ± 1.174
ThrTyr: 3.546 ± 1.796
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.773 ± 1.376
ValCys: 0.887 ± 0.676
ValAsp: 0.887 ± 0.676
ValGlu: 2.66 ± 1.044
ValPhe: 2.66 ± 0.746
ValGly: 0.0 ± 0.0
ValHis: 0.887 ± 0.828
ValIle: 6.206 ± 1.589
ValLys: 1.773 ± 0.781
ValLeu: 6.206 ± 2.084
ValMet: 1.773 ± 1.061
ValAsn: 1.773 ± 1.2
ValPro: 4.433 ± 1.361
ValGln: 3.546 ± 1.687
ValArg: 3.546 ± 2.278
ValSer: 2.66 ± 1.385
ValThr: 3.546 ± 3.038
ValVal: 0.887 ± 0.676
ValTrp: 2.66 ± 0.746
ValTyr: 2.66 ± 1.66
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.66 ± 2.028
TrpCys: 0.0 ± 0.0
TrpAsp: 0.887 ± 0.828
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.887 ± 0.676
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.887 ± 0.76
TrpMet: 0.887 ± 0.76
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.887 ± 0.676
TrpArg: 0.887 ± 0.949
TrpSer: 1.773 ± 1.483
TrpThr: 2.66 ± 1.672
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.887 ± 0.676
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.66 ± 1.248
TyrCys: 0.0 ± 0.0
TyrAsp: 1.773 ± 0.972
TyrGlu: 2.66 ± 1.745
TyrPhe: 2.66 ± 1.227
TyrGly: 1.773 ± 0.781
TyrHis: 0.887 ± 0.676
TyrIle: 2.66 ± 1.248
TyrLys: 1.773 ± 0.781
TyrLeu: 4.433 ± 1.675
TyrMet: 1.773 ± 0.877
TyrAsn: 4.433 ± 1.51
TyrPro: 0.887 ± 0.676
TyrGln: 0.887 ± 0.76
TyrArg: 3.546 ± 2.194
TyrSer: 1.773 ± 1.237
TyrThr: 1.773 ± 1.061
TyrVal: 2.66 ± 1.057
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1129 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
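A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is taken as the error. This is only one plausible reading of the note. The function names, the use of the sample standard deviation, the fixed seed, and the within-protein counting are assumptions for illustration, not the database's documented procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Overlapping dipeptide frequencies in per mille (assumed counting)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    # Assumption: the reported error is the standard deviation of replicates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences (hypothetical, not taken from the proteome above).
    toy = ["MSSPKRAL", "MAASSNPT", "MRRSSLLK"]
    print(bootstrap_error(toy, "SS"))
```

Because whole proteins are the resampling unit, a dipeptide concentrated in a single protein tends to get a larger error than one spread evenly across all proteins.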

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski