Amino acid dipeptide frequency for Sweet potato leaf curl Canary virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
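The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration and not the code used by the database: the function names, the choice to count overlapping pairs within each protein, and the mapping of every non-standard letter to X are assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400   # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein and
    pooled over all proteins; pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"
```

With this sketch, a value such as 15.098‰ would be labelled "over" and 0.888‰ would be labelled "under".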

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
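As a quick illustration of the unit conversion, a per mille value can be turned into a fraction or a percentage as follows. The helper names are hypothetical and exist only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# Example: the uniform expectation of 2.5 per mille is 0.25%.
uniform_percent = per_mille_to_percent(2.5)
```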

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.993AlaAla: 7.993 ± 3.436
0.0AlaCys: 0.0 ± 0.0
0.888AlaAsp: 0.888 ± 1.16
6.217AlaGlu: 6.217 ± 3.06
0.888AlaPhe: 0.888 ± 1.022
1.776AlaGly: 1.776 ± 1.178
0.0AlaHis: 0.0 ± 0.0
3.552AlaIle: 3.552 ± 1.085
6.217AlaLys: 6.217 ± 2.216
7.105AlaLeu: 7.105 ± 2.605
0.0AlaMet: 0.0 ± 0.0
2.664AlaAsn: 2.664 ± 1.264
1.776AlaPro: 1.776 ± 1.409
1.776AlaGln: 1.776 ± 1.022
3.552AlaArg: 3.552 ± 1.648
1.776AlaSer: 1.776 ± 1.546
1.776AlaThr: 1.776 ± 1.093
2.664AlaVal: 2.664 ± 0.987
1.776AlaTrp: 1.776 ± 1.045
0.888AlaTyr: 0.888 ± 0.773
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.776CysCys: 1.776 ± 1.965
0.888CysAsp: 0.888 ± 0.589
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
2.664CysGly: 2.664 ± 2.033
0.0CysHis: 0.0 ± 0.0
1.776CysIle: 1.776 ± 1.387
2.664CysLys: 2.664 ± 0.958
0.0CysLeu: 0.0 ± 0.0
0.888CysMet: 0.888 ± 0.982
2.664CysAsn: 2.664 ± 2.034
5.329CysPro: 5.329 ± 3.213
0.0CysGln: 0.0 ± 0.0
1.776CysArg: 1.776 ± 1.079
5.329CysSer: 5.329 ± 2.044
0.888CysThr: 0.888 ± 1.022
1.776CysVal: 1.776 ± 1.546
0.0CysTrp: 0.0 ± 0.0
0.888CysTyr: 0.888 ± 1.022
0.0CysXaa: 0.0 ± 0.0
Asp
0.888AspAla: 0.888 ± 0.589
1.776AspCys: 1.776 ± 1.409
3.552AspAsp: 3.552 ± 1.012
1.776AspGlu: 1.776 ± 1.405
0.888AspPhe: 0.888 ± 0.773
3.552AspGly: 3.552 ± 1.648
0.888AspHis: 0.888 ± 0.773
2.664AspIle: 2.664 ± 1.355
1.776AspLys: 1.776 ± 1.178
4.44AspLeu: 4.44 ± 1.27
0.0AspMet: 0.0 ± 0.0
2.664AspAsn: 2.664 ± 1.71
2.664AspPro: 2.664 ± 1.169
0.888AspGln: 0.888 ± 0.982
5.329AspArg: 5.329 ± 2.022
3.552AspSer: 3.552 ± 1.063
1.776AspThr: 1.776 ± 1.405
4.44AspVal: 4.44 ± 1.441
3.552AspTrp: 3.552 ± 1.89
2.664AspTyr: 2.664 ± 0.958
0.0AspXaa: 0.0 ± 0.0
Glu
5.329GluAla: 5.329 ± 1.493
0.888GluCys: 0.888 ± 0.982
0.0GluAsp: 0.0 ± 0.0
5.329GluGlu: 5.329 ± 1.623
4.44GluPhe: 4.44 ± 1.441
3.552GluGly: 3.552 ± 1.851
0.888GluHis: 0.888 ± 1.022
2.664GluIle: 2.664 ± 2.319
4.44GluLys: 4.44 ± 2.374
2.664GluLeu: 2.664 ± 2.947
0.888GluMet: 0.888 ± 0.982
3.552GluAsn: 3.552 ± 1.355
3.552GluPro: 3.552 ± 1.912
4.44GluGln: 4.44 ± 1.621
1.776GluArg: 1.776 ± 2.19
4.44GluSer: 4.44 ± 2.625
2.664GluThr: 2.664 ± 1.576
1.776GluVal: 1.776 ± 1.121
0.0GluTrp: 0.0 ± 0.0
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.888PheAla: 0.888 ± 0.982
0.888PheCys: 0.888 ± 0.589
1.776PheAsp: 1.776 ± 1.022
1.776PheGlu: 1.776 ± 1.079
1.776PhePhe: 1.776 ± 1.21
1.776PheGly: 1.776 ± 1.051
2.664PheHis: 2.664 ± 2.034
1.776PheIle: 1.776 ± 1.093
6.217PheLys: 6.217 ± 2.133
2.664PheLeu: 2.664 ± 1.258
0.888PheMet: 0.888 ± 0.589
0.888PheAsn: 0.888 ± 0.589
0.0PhePro: 0.0 ± 0.0
3.552PheGln: 3.552 ± 1.678
3.552PheArg: 3.552 ± 2.02
4.44PheSer: 4.44 ± 1.21
2.664PheThr: 2.664 ± 1.536
1.776PheVal: 1.776 ± 1.21
0.888PheTrp: 0.888 ± 0.589
0.888PheTyr: 0.888 ± 0.773
0.0PheXaa: 0.0 ± 0.0
Gly
3.552GlyAla: 3.552 ± 1.678
3.552GlyCys: 3.552 ± 1.912
1.776GlyAsp: 1.776 ± 0.781
4.44GlyGlu: 4.44 ± 1.415
6.217GlyPhe: 6.217 ± 3.23
3.552GlyGly: 3.552 ± 1.561
1.776GlyHis: 1.776 ± 1.121
5.329GlyIle: 5.329 ± 1.646
7.105GlyLys: 7.105 ± 1.344
1.776GlyLeu: 1.776 ± 1.546
0.888GlyMet: 0.888 ± 0.832
0.888GlyAsn: 0.888 ± 0.773
3.552GlyPro: 3.552 ± 1.561
2.664GlyGln: 2.664 ± 1.147
4.44GlyArg: 4.44 ± 2.128
3.552GlySer: 3.552 ± 1.89
3.552GlyThr: 3.552 ± 1.491
1.776GlyVal: 1.776 ± 1.387
0.0GlyTrp: 0.0 ± 0.0
0.888GlyTyr: 0.888 ± 1.022
0.0GlyXaa: 0.0 ± 0.0
His
2.664HisAla: 2.664 ± 1.308
1.776HisCys: 1.776 ± 1.121
0.888HisAsp: 0.888 ± 1.16
0.888HisGlu: 0.888 ± 1.095
1.776HisPhe: 1.776 ± 1.178
0.888HisGly: 0.888 ± 1.022
0.888HisHis: 0.888 ± 0.589
0.888HisIle: 0.888 ± 1.16
2.664HisLys: 2.664 ± 1.71
2.664HisLeu: 2.664 ± 1.258
0.888HisMet: 0.888 ± 0.98
3.552HisAsn: 3.552 ± 1.648
3.552HisPro: 3.552 ± 1.194
1.776HisGln: 1.776 ± 1.093
1.776HisArg: 1.776 ± 1.051
0.888HisSer: 0.888 ± 1.095
3.552HisThr: 3.552 ± 1.844
2.664HisVal: 2.664 ± 0.958
0.888HisTrp: 0.888 ± 0.982
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.776IleAla: 1.776 ± 1.21
0.888IleCys: 0.888 ± 1.095
4.44IleAsp: 4.44 ± 1.854
1.776IleGlu: 1.776 ± 1.178
7.105IlePhe: 7.105 ± 2.287
0.888IleGly: 0.888 ± 0.589
1.776IleHis: 1.776 ± 1.541
3.552IleIle: 3.552 ± 1.331
1.776IleLys: 1.776 ± 0.781
3.552IleLeu: 3.552 ± 3.344
0.888IleMet: 0.888 ± 1.095
0.0IleAsn: 0.0 ± 0.0
6.217IlePro: 6.217 ± 4.801
2.664IleGln: 2.664 ± 1.225
7.993IleArg: 7.993 ± 1.951
5.329IleSer: 5.329 ± 3.374
4.44IleThr: 4.44 ± 1.916
2.664IleVal: 2.664 ± 0.958
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.664LysAla: 2.664 ± 2.033
1.776LysCys: 1.776 ± 1.093
3.552LysAsp: 3.552 ± 1.201
6.217LysGlu: 6.217 ± 1.442
2.664LysPhe: 2.664 ± 1.71
4.44LysGly: 4.44 ± 1.418
1.776LysHis: 1.776 ± 1.178
2.664LysIle: 2.664 ± 1.71
2.664LysLys: 2.664 ± 1.169
3.552LysLeu: 3.552 ± 1.851
0.888LysMet: 0.888 ± 0.773
3.552LysAsn: 3.552 ± 2.356
1.776LysPro: 1.776 ± 1.178
1.776LysGln: 1.776 ± 1.121
5.329LysArg: 5.329 ± 2.342
3.552LysSer: 3.552 ± 1.194
2.664LysThr: 2.664 ± 1.412
2.664LysVal: 2.664 ± 0.987
0.888LysTrp: 0.888 ± 1.095
5.329LysTyr: 5.329 ± 1.732
0.0LysXaa: 0.0 ± 0.0
Leu
0.0LeuAla: 0.0 ± 0.0
3.552LeuCys: 3.552 ± 2.09
5.329LeuAsp: 5.329 ± 2.854
3.552LeuGlu: 3.552 ± 1.302
0.888LeuPhe: 0.888 ± 0.589
4.44LeuGly: 4.44 ± 1.924
3.552LeuHis: 3.552 ± 1.851
3.552LeuIle: 3.552 ± 1.42
6.217LeuLys: 6.217 ± 1.833
2.664LeuLeu: 2.664 ± 2.516
1.776LeuMet: 1.776 ± 1.21
2.664LeuAsn: 2.664 ± 1.438
3.552LeuPro: 3.552 ± 1.526
4.44LeuGln: 4.44 ± 2.015
5.329LeuArg: 5.329 ± 2.024
5.329LeuSer: 5.329 ± 1.425
4.44LeuThr: 4.44 ± 2.071
2.664LeuVal: 2.664 ± 2.034
1.776LeuTrp: 1.776 ± 1.045
3.552LeuTyr: 3.552 ± 2.159
0.0LeuXaa: 0.0 ± 0.0
Met
0.888MetAla: 0.888 ± 0.982
0.0MetCys: 0.0 ± 0.0
4.44MetAsp: 4.44 ± 2.102
0.0MetGlu: 0.0 ± 0.0
0.888MetPhe: 0.888 ± 1.16
2.664MetGly: 2.664 ± 1.225
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.888MetLys: 0.888 ± 0.773
1.776MetLeu: 1.776 ± 1.079
0.888MetMet: 0.888 ± 0.982
0.0MetAsn: 0.0 ± 0.0
0.888MetPro: 0.888 ± 0.589
0.888MetGln: 0.888 ± 0.773
0.0MetArg: 0.0 ± 0.0
2.664MetSer: 2.664 ± 1.701
1.776MetThr: 1.776 ± 1.079
0.0MetVal: 0.0 ± 0.0
0.888MetTrp: 0.888 ± 0.982
2.664MetTyr: 2.664 ± 2.174
0.0MetXaa: 0.0 ± 0.0
Asn
3.552AsnAla: 3.552 ± 1.561
1.776AsnCys: 1.776 ± 1.045
1.776AsnAsp: 1.776 ± 1.022
1.776AsnGlu: 1.776 ± 1.21
2.664AsnPhe: 2.664 ± 1.438
0.0AsnGly: 0.0 ± 0.0
2.664AsnHis: 2.664 ± 1.536
1.776AsnIle: 1.776 ± 0.781
0.888AsnLys: 0.888 ± 0.773
3.552AsnLeu: 3.552 ± 2.043
0.0AsnMet: 0.0 ± 0.706
1.776AsnAsn: 1.776 ± 0.781
6.217AsnPro: 6.217 ± 1.068
0.888AsnGln: 0.888 ± 0.773
0.0AsnArg: 0.0 ± 0.0
4.44AsnSer: 4.44 ± 2.352
0.0AsnThr: 0.0 ± 0.0
4.44AsnVal: 4.44 ± 1.441
1.776AsnTrp: 1.776 ± 1.093
2.664AsnTyr: 2.664 ± 1.278
0.0AsnXaa: 0.0 ± 0.0
Pro
1.776ProAla: 1.776 ± 1.093
0.888ProCys: 0.888 ± 0.773
4.44ProAsp: 4.44 ± 2.099
3.552ProGlu: 3.552 ± 2.185
2.664ProPhe: 2.664 ± 1.258
2.664ProGly: 2.664 ± 1.941
3.552ProHis: 3.552 ± 1.194
2.664ProIle: 2.664 ± 0.958
3.552ProLys: 3.552 ± 1.201
3.552ProLeu: 3.552 ± 1.886
1.776ProMet: 1.776 ± 1.305
3.552ProAsn: 3.552 ± 1.356
4.44ProPro: 4.44 ± 2.268
3.552ProGln: 3.552 ± 2.123
6.217ProArg: 6.217 ± 1.65
5.329ProSer: 5.329 ± 3.035
1.776ProThr: 1.776 ± 0.781
2.664ProVal: 2.664 ± 1.438
0.0ProTrp: 0.0 ± 0.0
2.664ProTyr: 2.664 ± 2.319
0.0ProXaa: 0.0 ± 0.0
Gln
3.552GlnAla: 3.552 ± 1.648
0.0GlnCys: 0.0 ± 0.0
2.664GlnAsp: 2.664 ± 0.958
4.44GlnGlu: 4.44 ± 2.409
2.664GlnPhe: 2.664 ± 1.258
3.552GlnGly: 3.552 ± 1.371
0.888GlnHis: 0.888 ± 1.022
5.329GlnIle: 5.329 ± 1.543
0.888GlnLys: 0.888 ± 0.589
6.217GlnLeu: 6.217 ± 2.33
0.0GlnMet: 0.0 ± 0.0
1.776GlnAsn: 1.776 ± 1.305
0.888GlnPro: 0.888 ± 1.022
0.888GlnGln: 0.888 ± 0.589
0.888GlnArg: 0.888 ± 1.16
3.552GlnSer: 3.552 ± 1.542
2.664GlnThr: 2.664 ± 0.987
1.776GlnVal: 1.776 ± 1.546
0.0GlnTrp: 0.0 ± 0.0
2.664GlnTyr: 2.664 ± 1.383
0.0GlnXaa: 0.0 ± 0.0
Arg
5.329ArgAla: 5.329 ± 2.69
1.776ArgCys: 1.776 ± 1.079
3.552ArgAsp: 3.552 ± 2.27
2.664ArgGlu: 2.664 ± 1.278
0.888ArgPhe: 0.888 ± 0.982
5.329ArgGly: 5.329 ± 2.13
2.664ArgHis: 2.664 ± 1.355
8.881ArgIle: 8.881 ± 2.539
3.552ArgLys: 3.552 ± 1.821
4.44ArgLeu: 4.44 ± 2.612
3.552ArgMet: 3.552 ± 2.583
0.0ArgAsn: 0.0 ± 0.0
4.44ArgPro: 4.44 ± 1.871
3.552ArgGln: 3.552 ± 1.729
7.105ArgArg: 7.105 ± 3.786
4.44ArgSer: 4.44 ± 1.654
4.44ArgThr: 4.44 ± 3.41
6.217ArgVal: 6.217 ± 1.801
0.0ArgTrp: 0.0 ± 0.0
1.776ArgTyr: 1.776 ± 1.21
0.0ArgXaa: 0.0 ± 0.0
Ser
4.44SerAla: 4.44 ± 1.621
0.888SerCys: 0.888 ± 1.022
5.329SerAsp: 5.329 ± 2.152
1.776SerGlu: 1.776 ± 1.045
1.776SerPhe: 1.776 ± 1.178
3.552SerGly: 3.552 ± 0.98
5.329SerHis: 5.329 ± 2.367
1.776SerIle: 1.776 ± 1.121
2.664SerLys: 2.664 ± 1.365
5.329SerLeu: 5.329 ± 2.198
3.552SerMet: 3.552 ± 1.417
5.329SerAsn: 5.329 ± 2.152
5.329SerPro: 5.329 ± 2.096
4.44SerGln: 4.44 ± 3.099
8.881SerArg: 8.881 ± 5.478
15.098SerSer: 15.098 ± 5.937
3.552SerThr: 3.552 ± 3.578
2.664SerVal: 2.664 ± 1.356
1.776SerTrp: 1.776 ± 2.19
2.664SerTyr: 2.664 ± 0.978
0.0SerXaa: 0.0 ± 0.0
Thr
4.44ThrAla: 4.44 ± 1.674
0.888ThrCys: 0.888 ± 1.022
0.0ThrAsp: 0.0 ± 0.0
1.776ThrGlu: 1.776 ± 1.517
0.888ThrPhe: 0.888 ± 0.773
9.769ThrGly: 9.769 ± 3.617
3.552ThrHis: 3.552 ± 1.242
2.664ThrIle: 2.664 ± 1.278
0.888ThrLys: 0.888 ± 1.022
4.44ThrLeu: 4.44 ± 3.064
0.888ThrMet: 0.888 ± 0.773
1.776ThrAsn: 1.776 ± 1.21
1.776ThrPro: 1.776 ± 1.457
0.888ThrGln: 0.888 ± 1.095
4.44ThrArg: 4.44 ± 1.878
1.776ThrSer: 1.776 ± 2.32
6.217ThrThr: 6.217 ± 3.224
1.776ThrVal: 1.776 ± 0.781
0.0ThrTrp: 0.0 ± 0.0
3.552ThrTyr: 3.552 ± 1.715
0.0ThrXaa: 0.0 ± 0.0
Val
0.888ValAla: 0.888 ± 0.773
4.44ValCys: 4.44 ± 2.105
0.888ValAsp: 0.888 ± 1.095
0.888ValGlu: 0.888 ± 1.095
0.888ValPhe: 0.888 ± 1.095
1.776ValGly: 1.776 ± 1.21
0.888ValHis: 0.888 ± 1.095
2.664ValIle: 2.664 ± 1.258
2.664ValLys: 2.664 ± 1.264
2.664ValLeu: 2.664 ± 0.987
0.888ValMet: 0.888 ± 0.589
3.552ValAsn: 3.552 ± 2.17
4.44ValPro: 4.44 ± 2.927
3.552ValGln: 3.552 ± 2.356
2.664ValArg: 2.664 ± 1.6
7.105ValSer: 7.105 ± 2.938
1.776ValThr: 1.776 ± 1.546
0.0ValVal: 0.0 ± 0.0
2.664ValTrp: 2.664 ± 0.958
2.664ValTyr: 2.664 ± 1.147
0.0ValXaa: 0.0 ± 0.0
Trp
2.664TrpAla: 2.664 ± 1.767
0.888TrpCys: 0.888 ± 1.022
0.888TrpAsp: 0.888 ± 0.982
0.888TrpGlu: 0.888 ± 0.982
0.0TrpPhe: 0.0 ± 0.0
1.776TrpGly: 1.776 ± 1.045
0.0TrpHis: 0.0 ± 0.0
0.888TrpIle: 0.888 ± 1.095
0.888TrpLys: 0.888 ± 1.022
1.776TrpLeu: 1.776 ± 0.781
0.888TrpMet: 0.888 ± 0.773
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.888TrpGln: 0.888 ± 0.589
1.776TrpArg: 1.776 ± 1.405
0.0TrpSer: 0.0 ± 0.0
0.888TrpThr: 0.888 ± 1.095
0.888TrpVal: 0.888 ± 0.589
0.0TrpTrp: 0.0 ± 0.0
1.776TrpTyr: 1.776 ± 1.093
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.888TyrCys: 0.888 ± 0.589
2.664TyrAsp: 2.664 ± 1.6
3.552TyrGlu: 3.552 ± 1.491
1.776TyrPhe: 1.776 ± 1.051
2.664TyrGly: 2.664 ± 0.978
1.776TyrHis: 1.776 ± 0.781
2.664TyrIle: 2.664 ± 1.365
1.776TyrLys: 1.776 ± 1.305
4.44TyrLeu: 4.44 ± 1.712
0.888TyrMet: 0.888 ± 1.008
2.664TyrAsn: 2.664 ± 1.438
0.888TyrPro: 0.888 ± 0.589
1.776TyrGln: 1.776 ± 1.051
1.776TyrArg: 1.776 ± 1.405
3.552TyrSer: 3.552 ± 1.355
0.888TyrThr: 0.888 ± 0.773
2.664TyrVal: 2.664 ± 1.308
0.888TyrTrp: 0.888 ± 0.773
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1127 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
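A protein-level bootstrap of this kind could look roughly like the sketch below. The exact procedure used by the database is not described here, so everything in the code is an assumption: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each of 100 resamples, and the sample standard deviation of those estimates is reported as the error. The function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation across rounds.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the statistic depends on which proteins are in the proteome. With only 6 proteins, as here, the resulting errors are expected to be large relative to the values.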

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski