Amino acid dipepetide frequency for Citrus virus A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.714AlaAla: 2.714 ± 1.478
1.018AlaCys: 1.018 ± 0.531
3.053AlaAsp: 3.053 ± 1.133
3.392AlaGlu: 3.392 ± 1.069
1.357AlaPhe: 1.357 ± 0.708
1.018AlaGly: 1.018 ± 1.285
0.0AlaHis: 0.0 ± 0.0
2.374AlaIle: 2.374 ± 0.771
4.071AlaLys: 4.071 ± 1.317
6.106AlaLeu: 6.106 ± 0.552
0.678AlaMet: 0.678 ± 0.541
2.035AlaAsn: 2.035 ± 0.639
1.018AlaPro: 1.018 ± 0.531
2.035AlaGln: 2.035 ± 0.639
2.714AlaArg: 2.714 ± 1.309
4.41AlaSer: 4.41 ± 1.219
4.41AlaThr: 4.41 ± 1.219
2.374AlaVal: 2.374 ± 2.66
0.678AlaTrp: 0.678 ± 0.354
1.357AlaTyr: 1.357 ± 0.655
0.0AlaXaa: 0.0 ± 0.0
Cys
0.678CysAla: 0.678 ± 0.354
0.339CysCys: 0.339 ± 0.177
0.339CysAsp: 0.339 ± 0.177
1.018CysGlu: 1.018 ± 0.531
0.0CysPhe: 0.0 ± 0.0
1.018CysGly: 1.018 ± 0.531
0.0CysHis: 0.0 ± 0.0
0.339CysIle: 0.339 ± 0.71
1.018CysLys: 1.018 ± 0.531
1.696CysLeu: 1.696 ± 0.885
0.339CysMet: 0.339 ± 0.177
1.357CysAsn: 1.357 ± 0.708
0.339CysPro: 0.339 ± 0.71
0.678CysGln: 0.678 ± 0.354
1.018CysArg: 1.018 ± 0.531
3.392CysSer: 3.392 ± 0.543
0.339CysThr: 0.339 ± 0.177
0.339CysVal: 0.339 ± 0.177
0.0CysTrp: 0.0 ± 0.0
0.678CysTyr: 0.678 ± 0.354
0.0CysXaa: 0.0 ± 0.0
Asp
1.357AspAla: 1.357 ± 1.083
0.339AspCys: 0.339 ± 0.177
3.053AspAsp: 3.053 ± 0.964
4.071AspGlu: 4.071 ± 1.335
2.714AspPhe: 2.714 ± 1.54
0.678AspGly: 0.678 ± 0.581
1.018AspHis: 1.018 ± 0.531
5.088AspIle: 5.088 ± 1.548
3.392AspLys: 3.392 ± 1.236
7.123AspLeu: 7.123 ± 1.671
1.696AspMet: 1.696 ± 0.885
2.035AspAsn: 2.035 ± 1.624
2.374AspPro: 2.374 ± 1.239
2.714AspGln: 2.714 ± 0.062
2.374AspArg: 2.374 ± 0.126
5.088AspSer: 5.088 ± 0.865
5.767AspThr: 5.767 ± 2.29
3.731AspVal: 3.731 ± 1.401
0.678AspTrp: 0.678 ± 0.354
3.392AspTyr: 3.392 ± 0.543
0.0AspXaa: 0.0 ± 0.0
Glu
3.053GluAla: 3.053 ± 0.964
0.678GluCys: 0.678 ± 0.354
6.445GluAsp: 6.445 ± 0.527
10.176GluGlu: 10.176 ± 1.57
3.392GluPhe: 3.392 ± 1.359
4.071GluGly: 4.071 ± 1.335
2.035GluHis: 2.035 ± 1.829
6.106GluIle: 6.106 ± 0.466
3.392GluLys: 3.392 ± 1.069
5.088GluLeu: 5.088 ± 0.692
0.678GluMet: 0.678 ± 0.581
4.071GluAsn: 4.071 ± 0.603
3.392GluPro: 3.392 ± 0.523
2.714GluGln: 2.714 ± 2.672
2.374GluArg: 2.374 ± 0.771
7.123GluSer: 7.123 ± 1.158
4.41GluThr: 4.41 ± 1.208
4.071GluVal: 4.071 ± 1.083
1.018GluTrp: 1.018 ± 0.483
3.392GluTyr: 3.392 ± 1.124
0.0GluXaa: 0.0 ± 0.0
Phe
3.053PheAla: 3.053 ± 1.074
1.357PheCys: 1.357 ± 0.708
3.731PheAsp: 3.731 ± 1.499
2.714PheGlu: 2.714 ± 0.918
2.714PhePhe: 2.714 ± 1.417
2.035PheGly: 2.035 ± 0.967
0.339PheHis: 0.339 ± 0.177
1.696PheIle: 1.696 ± 1.005
2.714PheLys: 2.714 ± 0.758
4.41PheLeu: 4.41 ± 1.015
1.018PheMet: 1.018 ± 0.477
3.392PheAsn: 3.392 ± 2.21
2.035PhePro: 2.035 ± 1.113
2.035PheGln: 2.035 ± 0.547
2.374PheArg: 2.374 ± 0.906
3.731PheSer: 3.731 ± 0.586
3.731PheThr: 3.731 ± 0.499
2.374PheVal: 2.374 ± 0.771
1.018PheTrp: 1.018 ± 0.531
1.018PheTyr: 1.018 ± 0.531
0.0PheXaa: 0.0 ± 0.0
Gly
2.035GlyAla: 2.035 ± 0.301
0.678GlyCys: 0.678 ± 0.354
2.714GlyAsp: 2.714 ± 0.062
2.714GlyGlu: 2.714 ± 0.758
3.392GlyPhe: 3.392 ± 0.543
2.035GlyGly: 2.035 ± 0.639
0.0GlyHis: 0.0 ± 0.0
4.071GlyIle: 4.071 ± 1.569
8.141GlyLys: 8.141 ± 1.428
2.714GlyLeu: 2.714 ± 1.417
1.357GlyMet: 1.357 ± 0.439
0.0GlyAsn: 0.0 ± 0.0
1.357GlyPro: 1.357 ± 0.475
1.696GlyGln: 1.696 ± 0.478
3.053GlyArg: 3.053 ± 0.233
3.392GlySer: 3.392 ± 1.666
2.374GlyThr: 2.374 ± 1.805
2.374GlyVal: 2.374 ± 0.906
1.018GlyTrp: 1.018 ± 0.483
1.357GlyTyr: 1.357 ± 0.708
0.0GlyXaa: 0.0 ± 0.0
His
0.678HisAla: 0.678 ± 0.354
0.339HisCys: 0.339 ± 0.177
0.339HisAsp: 0.339 ± 0.71
1.357HisGlu: 1.357 ± 0.475
0.339HisPhe: 0.339 ± 0.177
1.018HisGly: 1.018 ± 0.477
0.0HisHis: 0.0 ± 0.0
2.374HisIle: 2.374 ± 0.126
1.018HisLys: 1.018 ± 0.531
1.018HisLeu: 1.018 ± 0.531
0.678HisMet: 0.678 ± 0.581
1.018HisAsn: 1.018 ± 0.832
1.357HisPro: 1.357 ± 0.708
1.018HisGln: 1.018 ± 1.182
1.018HisArg: 1.018 ± 0.531
0.0HisSer: 0.0 ± 0.0
1.357HisThr: 1.357 ± 0.475
1.018HisVal: 1.018 ± 0.531
1.018HisTrp: 1.018 ± 0.531
1.357HisTyr: 1.357 ± 0.475
0.0HisXaa: 0.0 ± 0.0
Ile
1.357IleAla: 1.357 ± 1.162
1.018IleCys: 1.018 ± 0.531
6.784IleAsp: 6.784 ± 1.337
5.427IleGlu: 5.427 ± 1.62
3.731IlePhe: 3.731 ± 1.401
2.714IleGly: 2.714 ± 0.918
2.714IleHis: 2.714 ± 1.417
4.071IleIle: 4.071 ± 1.424
6.106IleLys: 6.106 ± 0.552
5.088IleLeu: 5.088 ± 2.082
2.374IleMet: 2.374 ± 0.881
2.374IleAsn: 2.374 ± 1.543
3.731IlePro: 3.731 ± 0.779
1.357IleGln: 1.357 ± 1.083
4.41IleArg: 4.41 ± 0.427
8.141IleSer: 8.141 ± 1.041
3.731IleThr: 3.731 ± 1.339
5.088IleVal: 5.088 ± 1.548
0.678IleTrp: 0.678 ± 0.354
3.053IleTyr: 3.053 ± 0.964
0.0IleXaa: 0.0 ± 0.0
Lys
3.731LysAla: 3.731 ± 0.586
1.018LysCys: 1.018 ± 0.477
3.053LysAsp: 3.053 ± 1.501
8.141LysGlu: 8.141 ± 1.206
4.071LysPhe: 4.071 ± 2.226
3.731LysGly: 3.731 ± 2.068
1.018LysHis: 1.018 ± 0.531
9.498LysIle: 9.498 ± 1.545
8.48LysLys: 8.48 ± 1.029
7.802LysLeu: 7.802 ± 0.72
2.035LysMet: 2.035 ± 0.639
2.374LysAsn: 2.374 ± 1.681
3.731LysPro: 3.731 ± 0.435
2.374LysGln: 2.374 ± 0.771
2.714LysArg: 2.714 ± 0.949
8.141LysSer: 8.141 ± 1.897
3.731LysThr: 3.731 ± 0.435
7.123LysVal: 7.123 ± 2.001
1.018LysTrp: 1.018 ± 0.531
2.714LysTyr: 2.714 ± 0.918
0.0LysXaa: 0.0 ± 0.0
Leu
6.445LeuAla: 6.445 ± 1.46
1.357LeuCys: 1.357 ± 0.708
8.141LeuAsp: 8.141 ± 1.526
7.802LeuGlu: 7.802 ± 0.893
3.392LeuPhe: 3.392 ± 1.236
3.392LeuGly: 3.392 ± 0.926
1.696LeuHis: 1.696 ± 0.478
6.445LeuIle: 6.445 ± 1.331
9.837LeuLys: 9.837 ± 0.469
8.48LeuLeu: 8.48 ± 1.98
2.035LeuMet: 2.035 ± 0.639
3.392LeuAsn: 3.392 ± 1.124
3.392LeuPro: 3.392 ± 1.359
2.714LeuGln: 2.714 ± 0.878
3.731LeuArg: 3.731 ± 1.401
7.802LeuSer: 7.802 ± 1.38
6.784LeuThr: 6.784 ± 0.819
3.053LeuVal: 3.053 ± 1.501
1.018LeuTrp: 1.018 ± 0.531
1.696LeuTyr: 1.696 ± 0.478
0.0LeuXaa: 0.0 ± 0.0
Met
1.357MetAla: 1.357 ± 0.655
0.339MetCys: 0.339 ± 0.649
1.018MetAsp: 1.018 ± 0.531
1.357MetGlu: 1.357 ± 0.439
1.696MetPhe: 1.696 ± 0.478
2.374MetGly: 2.374 ± 0.771
0.339MetHis: 0.339 ± 0.177
1.018MetIle: 1.018 ± 0.477
1.357MetLys: 1.357 ± 0.708
2.035MetLeu: 2.035 ± 0.547
0.339MetMet: 0.339 ± 0.177
1.018MetAsn: 1.018 ± 0.483
1.018MetPro: 1.018 ± 0.531
0.678MetGln: 0.678 ± 0.354
1.696MetArg: 1.696 ± 0.885
2.374MetSer: 2.374 ± 0.668
1.357MetThr: 1.357 ± 0.475
1.357MetVal: 1.357 ± 0.708
0.339MetTrp: 0.339 ± 0.649
0.339MetTyr: 0.339 ± 0.177
0.0MetXaa: 0.0 ± 0.0
Asn
1.018AsnAla: 1.018 ± 0.477
0.339AsnCys: 0.339 ± 0.177
1.357AsnAsp: 1.357 ± 0.708
3.731AsnGlu: 3.731 ± 2.919
3.053AsnPhe: 3.053 ± 0.635
1.696AsnGly: 1.696 ± 0.463
0.339AsnHis: 0.339 ± 0.71
4.749AsnIle: 4.749 ± 0.559
7.802AsnLys: 7.802 ± 2.412
3.731AsnLeu: 3.731 ± 1.225
1.018AsnMet: 1.018 ± 0.531
1.696AsnAsn: 1.696 ± 0.463
2.374AsnPro: 2.374 ± 0.9
2.035AsnGln: 2.035 ± 0.639
1.696AsnArg: 1.696 ± 0.463
3.392AsnSer: 3.392 ± 0.523
1.018AsnThr: 1.018 ± 0.531
2.374AsnVal: 2.374 ± 0.951
0.0AsnTrp: 0.0 ± 0.0
1.696AsnTyr: 1.696 ± 0.885
0.0AsnXaa: 0.0 ± 0.0
Pro
1.357ProAla: 1.357 ± 0.655
0.339ProCys: 0.339 ± 0.177
2.714ProAsp: 2.714 ± 1.478
2.035ProGlu: 2.035 ± 0.547
1.018ProPhe: 1.018 ± 0.531
1.696ProGly: 1.696 ± 0.534
0.339ProHis: 0.339 ± 0.177
3.053ProIle: 3.053 ± 0.964
3.731ProLys: 3.731 ± 1.225
3.731ProLeu: 3.731 ± 0.435
0.0ProMet: 0.0 ± 0.0
2.374ProAsn: 2.374 ± 0.951
0.339ProPro: 0.339 ± 0.71
2.035ProGln: 2.035 ± 1.113
1.357ProArg: 1.357 ± 1.162
3.392ProSer: 3.392 ± 0.409
1.696ProThr: 1.696 ± 0.534
2.035ProVal: 2.035 ± 0.547
0.0ProTrp: 0.0 ± 0.0
2.714ProTyr: 2.714 ± 0.878
0.0ProXaa: 0.0 ± 0.0
Gln
2.714GlnAla: 2.714 ± 0.758
0.339GlnCys: 0.339 ± 0.177
1.696GlnAsp: 1.696 ± 0.463
2.035GlnGlu: 2.035 ± 1.113
2.374GlnPhe: 2.374 ± 0.126
2.374GlnGly: 2.374 ± 0.9
1.357GlnHis: 1.357 ± 0.708
2.374GlnIle: 2.374 ± 1.239
2.374GlnLys: 2.374 ± 1.632
3.731GlnLeu: 3.731 ± 0.779
0.339GlnMet: 0.339 ± 0.177
0.678GlnAsn: 0.678 ± 1.009
0.339GlnPro: 0.339 ± 0.177
1.696GlnGln: 1.696 ± 0.463
2.714GlnArg: 2.714 ± 1.309
2.714GlnSer: 2.714 ± 1.54
1.357GlnThr: 1.357 ± 0.475
1.018GlnVal: 1.018 ± 0.531
0.0GlnTrp: 0.0 ± 0.0
1.357GlnTyr: 1.357 ± 0.439
0.0GlnXaa: 0.0 ± 0.0
Arg
2.374ArgAla: 2.374 ± 0.951
0.339ArgCys: 0.339 ± 0.177
1.357ArgAsp: 1.357 ± 0.708
3.392ArgGlu: 3.392 ± 1.666
2.374ArgPhe: 2.374 ± 0.906
3.053ArgGly: 3.053 ± 1.074
2.374ArgHis: 2.374 ± 0.771
3.731ArgIle: 3.731 ± 0.499
3.053ArgLys: 3.053 ± 2.185
4.749ArgLeu: 4.749 ± 0.918
2.035ArgMet: 2.035 ± 0.301
2.714ArgAsn: 2.714 ± 0.878
1.018ArgPro: 1.018 ± 0.483
1.018ArgGln: 1.018 ± 0.483
2.374ArgArg: 2.374 ± 1.632
4.749ArgSer: 4.749 ± 0.975
1.018ArgThr: 1.018 ± 0.531
4.41ArgVal: 4.41 ± 2.302
0.339ArgTrp: 0.339 ± 0.177
1.357ArgTyr: 1.357 ± 0.708
0.0ArgXaa: 0.0 ± 0.0
Ser
5.427SerAla: 5.427 ± 0.847
0.678SerCys: 0.678 ± 0.354
5.767SerAsp: 5.767 ± 0.292
6.445SerGlu: 6.445 ± 1.165
4.41SerPhe: 4.41 ± 0.587
5.427SerGly: 5.427 ± 1.961
2.714SerHis: 2.714 ± 0.758
7.123SerIle: 7.123 ± 0.642
6.784SerLys: 6.784 ± 1.63
9.837SerLeu: 9.837 ± 1.097
2.035SerMet: 2.035 ± 1.062
7.802SerAsn: 7.802 ± 2.085
1.357SerPro: 1.357 ± 0.475
1.018SerGln: 1.018 ± 0.477
6.106SerArg: 6.106 ± 0.466
6.784SerSer: 6.784 ± 1.285
3.731SerThr: 3.731 ± 3.341
4.749SerVal: 4.749 ± 1.685
0.678SerTrp: 0.678 ± 0.354
3.053SerTyr: 3.053 ± 0.885
0.0SerXaa: 0.0 ± 0.0
Thr
3.392ThrAla: 3.392 ± 2.318
1.018ThrCys: 1.018 ± 0.531
2.714ThrAsp: 2.714 ± 0.062
4.749ThrGlu: 4.749 ± 0.975
2.714ThrPhe: 2.714 ± 0.758
2.714ThrGly: 2.714 ± 1.417
1.357ThrHis: 1.357 ± 0.475
4.41ThrIle: 4.41 ± 1.015
6.784ThrLys: 6.784 ± 1.912
6.445ThrLeu: 6.445 ± 1.165
1.018ThrMet: 1.018 ± 0.531
1.696ThrAsn: 1.696 ± 0.463
2.374ThrPro: 2.374 ± 1.632
1.357ThrGln: 1.357 ± 1.162
2.035ThrArg: 2.035 ± 0.301
5.427ThrSer: 5.427 ± 0.124
3.392ThrThr: 3.392 ± 2.054
3.731ThrVal: 3.731 ± 1.401
0.339ThrTrp: 0.339 ± 0.177
1.018ThrTyr: 1.018 ± 0.477
0.0ThrXaa: 0.0 ± 0.0
Val
2.035ValAla: 2.035 ± 0.967
1.018ValCys: 1.018 ± 0.531
2.035ValAsp: 2.035 ± 0.639
3.392ValGlu: 3.392 ± 0.409
2.714ValPhe: 2.714 ± 0.062
2.035ValGly: 2.035 ± 0.547
0.339ValHis: 0.339 ± 0.177
2.714ValIle: 2.714 ± 1.478
4.749ValLys: 4.749 ± 1.336
5.427ValLeu: 5.427 ± 0.841
2.035ValMet: 2.035 ± 0.826
2.714ValAsn: 2.714 ± 0.794
3.053ValPro: 3.053 ± 0.964
2.714ValGln: 2.714 ± 1.533
1.696ValArg: 1.696 ± 0.478
7.463ValSer: 7.463 ± 0.211
5.427ValThr: 5.427 ± 2.709
2.714ValVal: 2.714 ± 0.758
0.339ValTrp: 0.339 ± 0.177
1.018ValTyr: 1.018 ± 0.531
0.0ValXaa: 0.0 ± 0.0
Trp
0.339TrpAla: 0.339 ± 0.177
0.339TrpCys: 0.339 ± 0.177
0.678TrpAsp: 0.678 ± 0.354
0.678TrpGlu: 0.678 ± 0.581
0.678TrpPhe: 0.678 ± 0.354
1.357TrpGly: 1.357 ± 0.439
0.0TrpHis: 0.0 ± 0.0
0.339TrpIle: 0.339 ± 0.177
0.678TrpLys: 0.678 ± 0.354
0.339TrpLeu: 0.339 ± 0.177
0.678TrpMet: 0.678 ± 0.354
0.678TrpAsn: 0.678 ± 0.354
0.0TrpPro: 0.0 ± 0.0
0.339TrpGln: 0.339 ± 0.177
0.339TrpArg: 0.339 ± 0.177
1.357TrpSer: 1.357 ± 0.708
0.678TrpThr: 0.678 ± 0.541
0.678TrpVal: 0.678 ± 0.354
0.0TrpTrp: 0.0 ± 0.0
0.339TrpTyr: 0.339 ± 0.177
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.696TyrAla: 1.696 ± 0.463
2.035TyrCys: 2.035 ± 1.742
1.696TyrAsp: 1.696 ± 0.534
3.053TyrGlu: 3.053 ± 0.233
1.357TyrPhe: 1.357 ± 0.708
2.035TyrGly: 2.035 ± 0.639
0.339TyrHis: 0.339 ± 0.177
2.714TyrIle: 2.714 ± 0.81
1.018TyrLys: 1.018 ± 0.483
3.053TyrLeu: 3.053 ± 1.594
0.678TyrMet: 0.678 ± 0.354
1.696TyrAsn: 1.696 ± 0.885
1.357TyrPro: 1.357 ± 0.475
1.357TyrGln: 1.357 ± 0.475
2.035TyrArg: 2.035 ± 0.639
3.053TyrSer: 3.053 ± 0.649
2.374TyrThr: 2.374 ± 1.239
1.018TyrVal: 1.018 ± 0.483
0.339TyrTrp: 0.339 ± 0.177
1.018TyrTyr: 1.018 ± 0.483
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2949 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski