Amino acid dipepetide frequency for Opuntia virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.153AlaAla: 4.153 ± 0.737
1.278AlaCys: 1.278 ± 0.431
0.958AlaAsp: 0.958 ± 0.281
1.597AlaGlu: 1.597 ± 0.441
1.917AlaPhe: 1.917 ± 0.563
2.236AlaGly: 2.236 ± 1.282
0.958AlaHis: 0.958 ± 0.281
0.958AlaIle: 0.958 ± 0.637
4.153AlaLys: 4.153 ± 1.303
8.307AlaLeu: 8.307 ± 1.744
2.556AlaMet: 2.556 ± 0.782
1.278AlaAsn: 1.278 ± 0.7
2.236AlaPro: 2.236 ± 0.866
1.278AlaGln: 1.278 ± 0.391
0.958AlaArg: 0.958 ± 1.708
6.07AlaSer: 6.07 ± 0.451
8.307AlaThr: 8.307 ± 1.377
5.431AlaVal: 5.431 ± 0.536
0.639AlaTrp: 0.639 ± 0.55
0.319AlaTyr: 0.319 ± 0.2
0.0AlaXaa: 0.0 ± 0.0
Cys
1.597CysAla: 1.597 ± 0.633
1.278CysCys: 1.278 ± 0.391
1.597CysAsp: 1.597 ± 0.428
0.0CysGlu: 0.0 ± 0.0
1.597CysPhe: 1.597 ± 0.441
1.278CysGly: 1.278 ± 0.447
0.0CysHis: 0.0 ± 0.0
1.597CysIle: 1.597 ± 0.441
0.958CysLys: 0.958 ± 0.281
2.556CysLeu: 2.556 ± 0.45
0.0CysMet: 0.0 ± 0.0
0.958CysAsn: 0.958 ± 0.54
0.958CysPro: 0.958 ± 0.281
0.0CysGln: 0.0 ± 0.0
0.958CysArg: 0.958 ± 0.598
1.917CysSer: 1.917 ± 0.586
2.556CysThr: 2.556 ± 0.28
0.319CysVal: 0.319 ± 0.69
0.639CysTrp: 0.639 ± 0.195
1.597CysTyr: 1.597 ± 0.41
0.0CysXaa: 0.0 ± 0.0
Asp
3.514AspAla: 3.514 ± 0.709
0.958AspCys: 0.958 ± 0.281
2.236AspAsp: 2.236 ± 0.77
5.112AspGlu: 5.112 ± 0.847
3.195AspPhe: 3.195 ± 0.997
4.473AspGly: 4.473 ± 0.665
0.639AspHis: 0.639 ± 1.381
3.195AspIle: 3.195 ± 0.784
4.153AspLys: 4.153 ± 0.623
5.112AspLeu: 5.112 ± 1.16
0.639AspMet: 0.639 ± 0.195
2.556AspAsn: 2.556 ± 0.582
2.556AspPro: 2.556 ± 0.714
1.278AspGln: 1.278 ± 0.701
4.473AspArg: 4.473 ± 0.865
5.751AspSer: 5.751 ± 0.411
4.153AspThr: 4.153 ± 0.786
6.07AspVal: 6.07 ± 0.563
0.319AspTrp: 0.319 ± 0.2
2.556AspTyr: 2.556 ± 0.782
0.0AspXaa: 0.0 ± 0.0
Glu
2.236GluAla: 2.236 ± 0.72
2.236GluCys: 2.236 ± 0.622
2.556GluAsp: 2.556 ± 0.782
2.236GluGlu: 2.236 ± 0.866
4.792GluPhe: 4.792 ± 0.886
1.597GluGly: 1.597 ± 0.441
1.278GluHis: 1.278 ± 0.391
2.236GluIle: 2.236 ± 0.622
1.278GluLys: 1.278 ± 0.391
7.987GluLeu: 7.987 ± 0.397
0.958GluMet: 0.958 ± 0.281
2.556GluAsn: 2.556 ± 0.714
3.514GluPro: 3.514 ± 1.014
0.958GluGln: 0.958 ± 0.458
1.917GluArg: 1.917 ± 0.331
3.514GluSer: 3.514 ± 0.561
1.917GluThr: 1.917 ± 0.778
3.195GluVal: 3.195 ± 1.159
1.917GluTrp: 1.917 ± 0.586
1.917GluTyr: 1.917 ± 0.563
0.0GluXaa: 0.0 ± 0.0
Phe
3.834PheAla: 3.834 ± 0.662
3.514PheCys: 3.514 ± 0.766
4.153PheAsp: 4.153 ± 0.961
3.834PheGlu: 3.834 ± 0.218
2.236PhePhe: 2.236 ± 0.72
2.556PheGly: 2.556 ± 0.714
2.556PheHis: 2.556 ± 0.714
1.597PheIle: 1.597 ± 0.472
5.431PheLys: 5.431 ± 0.853
5.112PheLeu: 5.112 ± 0.873
1.597PheMet: 1.597 ± 0.441
1.278PheAsn: 1.278 ± 0.447
2.875PhePro: 2.875 ± 0.489
3.514PheGln: 3.514 ± 0.323
1.597PheArg: 1.597 ± 0.441
4.473PheSer: 4.473 ± 1.263
2.236PheThr: 2.236 ± 0.388
2.236PheVal: 2.236 ± 0.866
0.958PheTrp: 0.958 ± 0.54
1.597PheTyr: 1.597 ± 0.472
0.0PheXaa: 0.0 ± 0.0
Gly
2.556GlyAla: 2.556 ± 1.817
0.958GlyCys: 0.958 ± 0.281
3.195GlyAsp: 3.195 ± 0.416
3.514GlyGlu: 3.514 ± 0.561
2.556GlyPhe: 2.556 ± 1.727
3.195GlyGly: 3.195 ± 1.317
1.597GlyHis: 1.597 ± 0.441
3.195GlyIle: 3.195 ± 0.614
3.195GlyLys: 3.195 ± 0.351
5.431GlyLeu: 5.431 ± 0.756
1.278GlyMet: 1.278 ± 0.391
4.473GlyAsn: 4.473 ± 1.004
0.639GlyPro: 0.639 ± 0.55
0.639GlyGln: 0.639 ± 0.195
3.514GlyArg: 3.514 ± 0.594
4.153GlySer: 4.153 ± 1.954
1.597GlyThr: 1.597 ± 0.41
4.153GlyVal: 4.153 ± 0.927
0.639GlyTrp: 0.639 ± 0.195
1.597GlyTyr: 1.597 ± 0.441
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
2.236HisCys: 2.236 ± 0.622
2.236HisAsp: 2.236 ± 0.622
0.958HisGlu: 0.958 ± 0.281
1.597HisPhe: 1.597 ± 0.428
1.278HisGly: 1.278 ± 0.391
0.0HisHis: 0.0 ± 0.0
1.278HisIle: 1.278 ± 0.391
0.0HisLys: 0.0 ± 0.0
2.556HisLeu: 2.556 ± 0.714
0.319HisMet: 0.319 ± 0.2
0.639HisAsn: 0.639 ± 0.195
1.278HisPro: 1.278 ± 0.391
1.278HisGln: 1.278 ± 0.391
1.278HisArg: 1.278 ± 0.466
2.236HisSer: 2.236 ± 0.622
2.556HisThr: 2.556 ± 0.782
2.236HisVal: 2.236 ± 0.388
0.0HisTrp: 0.0 ± 0.0
1.917HisTyr: 1.917 ± 0.586
0.0HisXaa: 0.0 ± 0.0
Ile
0.639IleAla: 0.639 ± 0.633
0.319IleCys: 0.319 ± 0.69
5.431IleAsp: 5.431 ± 0.536
2.556IleGlu: 2.556 ± 1.161
2.875IlePhe: 2.875 ± 0.844
3.834IleGly: 3.834 ± 1.06
0.958IleHis: 0.958 ± 0.281
2.556IleIle: 2.556 ± 0.426
3.195IleLys: 3.195 ± 0.944
2.556IleLeu: 2.556 ± 0.593
0.0IleMet: 0.0 ± 0.235
1.917IleAsn: 1.917 ± 0.586
2.236IlePro: 2.236 ± 0.989
0.958IleGln: 0.958 ± 0.458
0.0IleArg: 0.0 ± 0.0
7.348IleSer: 7.348 ± 0.821
1.917IleThr: 1.917 ± 1.08
7.029IleVal: 7.029 ± 0.835
0.639IleTrp: 0.639 ± 0.4
1.597IleTyr: 1.597 ± 0.441
0.319IleXaa: 0.319 ± 0.2
Lys
3.514LysAla: 3.514 ± 0.766
0.319LysCys: 0.319 ± 0.2
2.556LysAsp: 2.556 ± 0.714
3.514LysGlu: 3.514 ± 0.991
2.875LysPhe: 2.875 ± 1.943
4.792LysGly: 4.792 ± 0.646
0.0LysHis: 0.0 ± 0.0
1.597LysIle: 1.597 ± 0.428
2.236LysLys: 2.236 ± 0.622
6.07LysLeu: 6.07 ± 0.398
0.0LysMet: 0.0 ± 0.0
2.875LysAsn: 2.875 ± 0.809
3.514LysPro: 3.514 ± 0.561
3.195LysGln: 3.195 ± 0.977
4.473LysArg: 4.473 ± 1.586
5.751LysSer: 5.751 ± 1.833
2.875LysThr: 2.875 ± 0.293
4.792LysVal: 4.792 ± 0.646
0.639LysTrp: 0.639 ± 0.195
1.278LysTyr: 1.278 ± 0.447
0.0LysXaa: 0.0 ± 0.0
Leu
5.431LeuAla: 5.431 ± 1.201
1.597LeuCys: 1.597 ± 0.428
5.751LeuAsp: 5.751 ± 1.213
7.348LeuGlu: 7.348 ± 0.795
2.875LeuPhe: 2.875 ± 0.607
3.834LeuGly: 3.834 ± 2.186
3.195LeuHis: 3.195 ± 0.977
7.348LeuIle: 7.348 ± 0.177
6.709LeuLys: 6.709 ± 0.73
9.904LeuLeu: 9.904 ± 0.546
2.875LeuMet: 2.875 ± 0.809
2.556LeuAsn: 2.556 ± 0.586
2.875LeuPro: 2.875 ± 0.916
7.348LeuGln: 7.348 ± 0.177
7.348LeuArg: 7.348 ± 0.569
9.904LeuSer: 9.904 ± 3.196
3.834LeuThr: 3.834 ± 0.921
5.112LeuVal: 5.112 ± 0.847
0.958LeuTrp: 0.958 ± 0.458
3.195LeuTyr: 3.195 ± 0.882
0.0LeuXaa: 0.0 ± 0.0
Met
1.597MetAla: 1.597 ± 0.428
0.639MetCys: 0.639 ± 0.195
0.958MetAsp: 0.958 ± 0.281
0.0MetGlu: 0.0 ± 0.0
0.639MetPhe: 0.639 ± 0.195
0.958MetGly: 0.958 ± 0.281
0.639MetHis: 0.639 ± 0.195
0.958MetIle: 0.958 ± 0.281
2.556MetLys: 2.556 ± 0.782
2.556MetLeu: 2.556 ± 0.782
1.917MetMet: 1.917 ± 0.586
1.597MetAsn: 1.597 ± 0.428
0.639MetPro: 0.639 ± 0.55
0.639MetGln: 0.639 ± 0.195
1.278MetArg: 1.278 ± 0.391
1.278MetSer: 1.278 ± 0.391
0.0MetThr: 0.0 ± 0.0
0.958MetVal: 0.958 ± 0.458
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.597AsnAla: 1.597 ± 0.982
1.278AsnCys: 1.278 ± 0.447
2.236AsnAsp: 2.236 ± 0.341
2.556AsnGlu: 2.556 ± 0.782
3.195AsnPhe: 3.195 ± 0.416
0.639AsnGly: 0.639 ± 0.195
1.917AsnHis: 1.917 ± 0.586
1.597AsnIle: 1.597 ± 0.428
1.917AsnLys: 1.917 ± 1.08
4.792AsnLeu: 4.792 ± 0.769
0.319AsnMet: 0.319 ± 0.184
1.917AsnAsn: 1.917 ± 0.586
1.278AsnPro: 1.278 ± 0.897
0.319AsnGln: 0.319 ± 0.2
1.278AsnArg: 1.278 ± 0.431
6.39AsnSer: 6.39 ± 1.114
2.236AsnThr: 2.236 ± 0.622
3.834AsnVal: 3.834 ± 0.495
0.639AsnTrp: 0.639 ± 0.195
0.639AsnTyr: 0.639 ± 0.4
0.0AsnXaa: 0.0 ± 0.0
Pro
4.153ProAla: 4.153 ± 0.809
0.0ProCys: 0.0 ± 0.0
4.792ProAsp: 4.792 ± 0.646
3.514ProGlu: 3.514 ± 0.767
1.917ProPhe: 1.917 ± 0.586
4.792ProGly: 4.792 ± 1.196
0.958ProHis: 0.958 ± 0.281
3.195ProIle: 3.195 ± 0.351
3.514ProLys: 3.514 ± 0.766
6.39ProLeu: 6.39 ± 1.789
0.0ProMet: 0.0 ± 0.0
0.958ProAsn: 0.958 ± 0.54
1.917ProPro: 1.917 ± 0.586
0.0ProGln: 0.0 ± 0.0
1.597ProArg: 1.597 ± 0.441
1.597ProSer: 1.597 ± 0.41
1.597ProThr: 1.597 ± 0.491
3.834ProVal: 3.834 ± 1.06
0.0ProTrp: 0.0 ± 0.0
0.319ProTyr: 0.319 ± 0.572
0.0ProXaa: 0.0 ± 0.0
Gln
2.875GlnAla: 2.875 ± 0.441
0.639GlnCys: 0.639 ± 0.195
3.195GlnAsp: 3.195 ± 0.977
1.278GlnGlu: 1.278 ± 0.447
2.236GlnPhe: 2.236 ± 0.341
2.556GlnGly: 2.556 ± 0.593
0.0GlnHis: 0.0 ± 0.0
1.917GlnIle: 1.917 ± 0.586
0.639GlnLys: 0.639 ± 0.195
4.153GlnLeu: 4.153 ± 0.788
0.958GlnMet: 0.958 ± 0.458
0.639GlnAsn: 0.639 ± 0.4
0.639GlnPro: 0.639 ± 0.55
0.639GlnGln: 0.639 ± 0.195
3.195GlnArg: 3.195 ± 0.882
4.792GlnSer: 4.792 ± 1.297
0.639GlnThr: 0.639 ± 0.55
2.875GlnVal: 2.875 ± 0.844
0.319GlnTrp: 0.319 ± 0.2
0.958GlnTyr: 0.958 ± 0.281
0.0GlnXaa: 0.0 ± 0.0
Arg
2.236ArgAla: 2.236 ± 0.341
0.958ArgCys: 0.958 ± 0.281
3.834ArgAsp: 3.834 ± 0.84
0.958ArgGlu: 0.958 ± 0.281
3.514ArgPhe: 3.514 ± 0.424
3.195ArgGly: 3.195 ± 0.977
1.917ArgHis: 1.917 ± 0.586
1.597ArgIle: 1.597 ± 0.816
2.875ArgLys: 2.875 ± 0.844
3.195ArgLeu: 3.195 ± 0.784
0.319ArgMet: 0.319 ± 0.578
4.473ArgAsn: 4.473 ± 1.515
4.153ArgPro: 4.153 ± 0.504
1.278ArgGln: 1.278 ± 0.391
3.514ArgArg: 3.514 ± 1.0
4.792ArgSer: 4.792 ± 1.297
3.834ArgThr: 3.834 ± 1.239
5.751ArgVal: 5.751 ± 0.956
0.319ArgTrp: 0.319 ± 0.2
1.597ArgTyr: 1.597 ± 0.41
0.0ArgXaa: 0.0 ± 0.0
Ser
8.307SerAla: 8.307 ± 1.335
1.597SerCys: 1.597 ± 0.441
4.792SerAsp: 4.792 ± 0.598
4.153SerGlu: 4.153 ± 0.665
6.709SerPhe: 6.709 ± 1.368
5.751SerGly: 5.751 ± 1.544
2.556SerHis: 2.556 ± 0.714
4.473SerIle: 4.473 ± 0.777
4.792SerLys: 4.792 ± 2.25
6.709SerLeu: 6.709 ± 0.796
1.917SerMet: 1.917 ± 0.586
2.556SerAsn: 2.556 ± 0.28
3.195SerPro: 3.195 ± 0.855
4.153SerGln: 4.153 ± 0.786
5.751SerArg: 5.751 ± 0.747
7.029SerSer: 7.029 ± 2.213
3.514SerThr: 3.514 ± 0.767
14.377SerVal: 14.377 ± 1.0
0.639SerTrp: 0.639 ± 0.195
0.639SerTyr: 0.639 ± 0.195
0.0SerXaa: 0.0 ± 0.0
Thr
2.236ThrAla: 2.236 ± 0.866
0.319ThrCys: 0.319 ± 0.2
2.556ThrAsp: 2.556 ± 1.269
0.0ThrGlu: 0.0 ± 0.0
5.112ThrPhe: 5.112 ± 0.737
0.958ThrGly: 0.958 ± 0.598
0.639ThrHis: 0.639 ± 0.195
2.875ThrIle: 2.875 ± 0.293
1.917ThrLys: 1.917 ± 0.916
5.751ThrLeu: 5.751 ± 1.593
1.917ThrMet: 1.917 ± 0.596
1.278ThrAsn: 1.278 ± 0.466
3.834ThrPro: 3.834 ± 0.86
2.236ThrGln: 2.236 ± 0.451
3.514ThrArg: 3.514 ± 0.835
4.153ThrSer: 4.153 ± 0.737
3.195ThrThr: 3.195 ± 0.542
7.348ThrVal: 7.348 ± 1.031
0.0ThrTrp: 0.0 ± 0.0
2.236ThrTyr: 2.236 ± 0.388
0.0ThrXaa: 0.0 ± 0.0
Val
4.473ValAla: 4.473 ± 0.826
0.958ValCys: 0.958 ± 0.458
5.751ValAsp: 5.751 ± 0.927
4.153ValGlu: 4.153 ± 0.504
4.153ValPhe: 4.153 ± 0.786
1.597ValGly: 1.597 ± 1.577
5.112ValHis: 5.112 ± 1.564
5.751ValIle: 5.751 ± 0.569
4.473ValLys: 4.473 ± 0.776
9.265ValLeu: 9.265 ± 1.555
0.0ValMet: 0.0 ± 0.0
5.112ValAsn: 5.112 ± 0.847
5.112ValPro: 5.112 ± 1.564
2.556ValGln: 2.556 ± 0.582
6.39ValArg: 6.39 ± 1.018
9.904ValSer: 9.904 ± 1.168
4.792ValThr: 4.792 ± 0.646
5.751ValVal: 5.751 ± 1.801
1.278ValTrp: 1.278 ± 0.466
2.236ValTyr: 2.236 ± 0.388
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.958TrpGlu: 0.958 ± 0.281
1.917TrpPhe: 1.917 ± 0.586
0.639TrpGly: 0.639 ± 0.195
0.319TrpHis: 0.319 ± 0.69
1.278TrpIle: 1.278 ± 0.391
1.597TrpLys: 1.597 ± 0.441
0.958TrpLeu: 0.958 ± 1.104
0.639TrpMet: 0.639 ± 0.195
0.319TrpAsn: 0.319 ± 0.2
0.0TrpPro: 0.0 ± 0.0
0.958TrpGln: 0.958 ± 0.281
0.0TrpArg: 0.0 ± 0.0
0.319TrpSer: 0.319 ± 0.2
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.639TrpTyr: 0.639 ± 0.633
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.639TyrAla: 0.639 ± 0.195
1.278TyrCys: 1.278 ± 0.466
4.153TyrAsp: 4.153 ± 0.737
2.556TyrGlu: 2.556 ± 0.714
1.278TyrPhe: 1.278 ± 0.447
1.917TyrGly: 1.917 ± 0.563
0.639TyrHis: 0.639 ± 0.195
0.0TyrIle: 0.0 ± 0.0
1.278TyrLys: 1.278 ± 0.466
0.639TyrLeu: 0.639 ± 0.195
1.278TyrMet: 1.278 ± 0.391
0.639TyrAsn: 0.639 ± 0.195
1.597TyrPro: 1.597 ± 0.441
1.597TyrGln: 1.597 ± 0.441
1.278TyrArg: 1.278 ± 0.391
2.236TyrSer: 2.236 ± 0.989
0.319TyrThr: 0.319 ± 0.2
3.514TyrVal: 3.514 ± 0.767
0.0TyrTrp: 0.0 ± 0.0
0.319TyrTyr: 0.319 ± 0.69
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.319XaaGln: 0.319 ± 0.2
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3131 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski