Amino acid dipepetide frequency for Opuntia virus X

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.323AlaAla: 7.323 ± 4.771
0.915AlaCys: 0.915 ± 1.718
4.119AlaAsp: 4.119 ± 0.702
1.831AlaGlu: 1.831 ± 0.687
5.95AlaPhe: 5.95 ± 0.913
4.577AlaGly: 4.577 ± 1.75
1.831AlaHis: 1.831 ± 0.932
3.661AlaIle: 3.661 ± 1.158
3.661AlaLys: 3.661 ± 1.992
10.984AlaLeu: 10.984 ± 3.802
2.288AlaMet: 2.288 ± 1.255
3.661AlaAsn: 3.661 ± 0.535
4.119AlaPro: 4.119 ± 0.988
4.119AlaGln: 4.119 ± 2.316
4.119AlaArg: 4.119 ± 2.162
5.034AlaSer: 5.034 ± 3.367
4.119AlaThr: 4.119 ± 1.446
4.577AlaVal: 4.577 ± 4.342
2.288AlaTrp: 2.288 ± 1.536
2.288AlaTyr: 2.288 ± 1.245
0.0AlaXaa: 0.0 ± 0.0
Cys
1.373CysAla: 1.373 ± 0.71
0.0CysCys: 0.0 ± 0.0
0.458CysAsp: 0.458 ± 0.249
0.915CysGlu: 0.915 ± 0.747
1.373CysPhe: 1.373 ± 0.71
0.915CysGly: 0.915 ± 1.009
0.0CysHis: 0.0 ± 0.0
0.458CysIle: 0.458 ± 0.249
0.458CysLys: 0.458 ± 0.249
1.831CysLeu: 1.831 ± 1.301
0.0CysMet: 0.0 ± 0.0
0.915CysAsn: 0.915 ± 0.498
0.458CysPro: 0.458 ± 0.859
1.373CysGln: 1.373 ± 0.991
0.915CysArg: 0.915 ± 0.747
2.288CysSer: 2.288 ± 0.99
0.458CysThr: 0.458 ± 0.249
0.458CysVal: 0.458 ± 0.249
0.0CysTrp: 0.0 ± 0.0
0.458CysTyr: 0.458 ± 0.994
0.0CysXaa: 0.0 ± 0.0
Asp
3.204AspAla: 3.204 ± 1.234
0.915AspCys: 0.915 ± 0.498
1.831AspAsp: 1.831 ± 0.996
3.204AspGlu: 3.204 ± 1.234
3.204AspPhe: 3.204 ± 0.451
2.288AspGly: 2.288 ± 2.163
1.373AspHis: 1.373 ± 0.747
1.831AspIle: 1.831 ± 0.757
2.288AspLys: 2.288 ± 1.245
4.119AspLeu: 4.119 ± 1.663
0.458AspMet: 0.458 ± 0.476
2.746AspAsn: 2.746 ± 1.084
5.034AspPro: 5.034 ± 1.906
2.288AspGln: 2.288 ± 0.74
1.831AspArg: 1.831 ± 1.301
4.119AspSer: 4.119 ± 1.576
2.746AspThr: 2.746 ± 1.494
2.746AspVal: 2.746 ± 1.144
0.458AspTrp: 0.458 ± 0.249
1.831AspTyr: 1.831 ± 1.173
0.0AspXaa: 0.0 ± 0.0
Glu
4.577GluAla: 4.577 ± 1.67
1.373GluCys: 1.373 ± 0.747
2.288GluAsp: 2.288 ± 0.875
5.034GluGlu: 5.034 ± 1.59
3.204GluPhe: 3.204 ± 1.234
2.288GluGly: 2.288 ± 0.64
1.831GluHis: 1.831 ± 0.687
5.034GluIle: 5.034 ± 1.131
3.204GluLys: 3.204 ± 1.743
2.746GluLeu: 2.746 ± 1.35
0.915GluMet: 0.915 ± 0.726
3.661GluAsn: 3.661 ± 1.801
3.204GluPro: 3.204 ± 1.234
2.746GluGln: 2.746 ± 1.494
1.373GluArg: 1.373 ± 0.721
2.288GluSer: 2.288 ± 1.245
5.034GluThr: 5.034 ± 1.693
3.204GluVal: 3.204 ± 1.743
1.831GluTrp: 1.831 ± 0.996
0.915GluTyr: 0.915 ± 0.747
0.0GluXaa: 0.0 ± 0.0
Phe
3.661PheAla: 3.661 ± 2.696
0.458PheCys: 0.458 ± 0.249
4.577PheAsp: 4.577 ± 2.099
3.661PheGlu: 3.661 ± 0.535
2.288PhePhe: 2.288 ± 0.74
2.746PheGly: 2.746 ± 1.419
0.915PheHis: 0.915 ± 0.498
4.119PheIle: 4.119 ± 1.667
2.746PheLys: 2.746 ± 1.494
5.492PheLeu: 5.492 ± 1.193
0.915PheMet: 0.915 ± 0.747
1.831PheAsn: 1.831 ± 1.301
1.831PhePro: 1.831 ± 1.495
2.746PheGln: 2.746 ± 1.494
2.288PheArg: 2.288 ± 1.536
5.034PheSer: 5.034 ± 0.95
1.373PheThr: 1.373 ± 0.721
0.458PheVal: 0.458 ± 0.249
1.373PheTrp: 1.373 ± 0.747
0.915PheTyr: 0.915 ± 1.009
0.0PheXaa: 0.0 ± 0.0
Gly
5.492GlyAla: 5.492 ± 1.018
0.915GlyCys: 0.915 ± 1.058
2.746GlyAsp: 2.746 ± 1.102
1.831GlyGlu: 1.831 ± 1.301
2.288GlyPhe: 2.288 ± 0.99
2.288GlyGly: 2.288 ± 1.193
1.831GlyHis: 1.831 ± 1.392
1.831GlyIle: 1.831 ± 0.836
4.577GlyLys: 4.577 ± 1.75
4.577GlyLeu: 4.577 ± 1.934
0.915GlyMet: 0.915 ± 0.822
1.831GlyAsn: 1.831 ± 0.687
4.119GlyPro: 4.119 ± 3.158
2.288GlyGln: 2.288 ± 1.245
0.915GlyArg: 0.915 ± 1.009
3.204GlySer: 3.204 ± 0.451
3.661GlyThr: 3.661 ± 3.02
4.119GlyVal: 4.119 ± 1.012
0.0GlyTrp: 0.0 ± 0.0
0.458GlyTyr: 0.458 ± 0.249
0.0GlyXaa: 0.0 ± 0.0
His
3.204HisAla: 3.204 ± 0.451
1.373HisCys: 1.373 ± 0.747
0.458HisAsp: 0.458 ± 0.249
2.288HisGlu: 2.288 ± 1.245
2.746HisPhe: 2.746 ± 1.419
2.746HisGly: 2.746 ± 2.507
0.915HisHis: 0.915 ± 0.498
1.373HisIle: 1.373 ± 0.939
2.746HisLys: 2.746 ± 1.494
2.746HisLeu: 2.746 ± 0.865
0.458HisMet: 0.458 ± 0.249
0.458HisAsn: 0.458 ± 0.249
1.373HisPro: 1.373 ± 1.324
1.831HisGln: 1.831 ± 0.996
1.831HisArg: 1.831 ± 1.495
3.204HisSer: 3.204 ± 2.396
2.288HisThr: 2.288 ± 1.2
0.458HisVal: 0.458 ± 0.859
0.458HisTrp: 0.458 ± 0.249
0.458HisTyr: 0.458 ± 0.994
0.0HisXaa: 0.0 ± 0.0
Ile
5.034IleAla: 5.034 ± 1.188
0.915IleCys: 0.915 ± 1.718
1.831IleAsp: 1.831 ± 0.996
2.746IleGlu: 2.746 ± 1.494
2.288IlePhe: 2.288 ± 0.875
2.746IleGly: 2.746 ± 1.264
1.373IleHis: 1.373 ± 0.747
3.661IleIle: 3.661 ± 1.027
3.661IleLys: 3.661 ± 2.138
2.746IleLeu: 2.746 ± 1.04
0.915IleMet: 0.915 ± 0.498
5.034IleAsn: 5.034 ± 3.358
4.577IlePro: 4.577 ± 1.452
3.661IleGln: 3.661 ± 1.469
1.831IleArg: 1.831 ± 0.836
4.119IleSer: 4.119 ± 1.468
4.119IleThr: 4.119 ± 1.667
3.661IleVal: 3.661 ± 1.192
0.458IleTrp: 0.458 ± 0.994
1.373IleTyr: 1.373 ± 0.939
0.0IleXaa: 0.0 ± 0.0
Lys
6.407LysAla: 6.407 ± 1.914
0.915LysCys: 0.915 ± 0.498
5.492LysAsp: 5.492 ± 1.193
4.119LysGlu: 4.119 ± 2.129
1.373LysPhe: 1.373 ± 0.747
4.577LysGly: 4.577 ± 1.469
2.288LysHis: 2.288 ± 0.99
3.661LysIle: 3.661 ± 1.992
1.831LysLys: 1.831 ± 0.996
5.492LysLeu: 5.492 ± 2.134
1.373LysMet: 1.373 ± 0.71
1.831LysAsn: 1.831 ± 0.996
4.577LysPro: 4.577 ± 0.934
2.746LysGln: 2.746 ± 1.04
2.288LysArg: 2.288 ± 1.245
4.577LysSer: 4.577 ± 0.971
4.577LysThr: 4.577 ± 1.308
4.577LysVal: 4.577 ± 1.75
0.458LysTrp: 0.458 ± 0.249
1.831LysTyr: 1.831 ± 0.932
0.0LysXaa: 0.0 ± 0.0
Leu
8.238LeuAla: 8.238 ± 3.721
0.0LeuCys: 0.0 ± 0.0
3.204LeuAsp: 3.204 ± 1.446
4.577LeuGlu: 4.577 ± 0.908
5.034LeuPhe: 5.034 ± 0.92
5.95LeuGly: 5.95 ± 1.304
3.661LeuHis: 3.661 ± 1.192
6.865LeuIle: 6.865 ± 2.327
8.238LeuLys: 8.238 ± 3.816
5.95LeuLeu: 5.95 ± 1.263
0.915LeuMet: 0.915 ± 0.831
2.746LeuAsn: 2.746 ± 1.441
8.238LeuPro: 8.238 ± 1.527
2.746LeuGln: 2.746 ± 0.494
4.577LeuArg: 4.577 ± 1.843
6.865LeuSer: 6.865 ± 2.749
8.696LeuThr: 8.696 ± 3.614
4.577LeuVal: 4.577 ± 4.186
0.458LeuTrp: 0.458 ± 0.249
3.661LeuTyr: 3.661 ± 1.992
0.0LeuXaa: 0.0 ± 0.0
Met
1.831MetAla: 1.831 ± 0.687
0.458MetCys: 0.458 ± 0.249
0.458MetAsp: 0.458 ± 0.249
0.458MetGlu: 0.458 ± 0.859
1.373MetPhe: 1.373 ± 0.747
1.373MetGly: 1.373 ± 0.747
0.458MetHis: 0.458 ± 0.249
1.373MetIle: 1.373 ± 0.71
1.373MetLys: 1.373 ± 0.71
3.204MetLeu: 3.204 ± 1.142
0.0MetMet: 0.0 ± 0.0
0.458MetAsn: 0.458 ± 0.249
1.373MetPro: 1.373 ± 0.721
1.373MetGln: 1.373 ± 0.747
1.373MetArg: 1.373 ± 0.747
1.373MetSer: 1.373 ± 0.939
0.915MetThr: 0.915 ± 0.498
0.458MetVal: 0.458 ± 0.249
0.0MetTrp: 0.0 ± 0.0
0.458MetTyr: 0.458 ± 0.249
0.0MetXaa: 0.0 ± 0.0
Asn
4.119AsnAla: 4.119 ± 2.855
1.373AsnCys: 1.373 ± 0.991
3.204AsnAsp: 3.204 ± 1.142
2.288AsnGlu: 2.288 ± 0.74
1.373AsnPhe: 1.373 ± 0.747
2.288AsnGly: 2.288 ± 1.307
2.288AsnHis: 2.288 ± 1.487
1.831AsnIle: 1.831 ± 0.757
2.288AsnLys: 2.288 ± 2.27
3.204AsnLeu: 3.204 ± 1.19
3.204AsnMet: 3.204 ± 1.3
0.458AsnAsn: 0.458 ± 0.994
4.577AsnPro: 4.577 ± 1.67
0.915AsnGln: 0.915 ± 0.498
1.373AsnArg: 1.373 ± 0.721
3.204AsnSer: 3.204 ± 3.472
1.831AsnThr: 1.831 ± 0.996
0.915AsnVal: 0.915 ± 0.831
0.0AsnTrp: 0.0 ± 0.0
1.373AsnTyr: 1.373 ± 0.721
0.0AsnXaa: 0.0 ± 0.0
Pro
2.288ProAla: 2.288 ± 0.74
0.458ProCys: 0.458 ± 0.249
3.661ProAsp: 3.661 ± 1.493
6.407ProGlu: 6.407 ± 0.902
1.831ProPhe: 1.831 ± 1.663
1.831ProGly: 1.831 ± 0.687
1.831ProHis: 1.831 ± 1.173
4.577ProIle: 4.577 ± 0.971
5.492ProLys: 5.492 ± 1.116
6.865ProLeu: 6.865 ± 3.924
0.458ProMet: 0.458 ± 0.249
1.831ProAsn: 1.831 ± 0.757
3.661ProPro: 3.661 ± 2.69
2.288ProGln: 2.288 ± 0.74
1.831ProArg: 1.831 ± 0.687
5.95ProSer: 5.95 ± 0.887
5.95ProThr: 5.95 ± 0.887
2.288ProVal: 2.288 ± 0.74
0.458ProTrp: 0.458 ± 0.249
1.831ProTyr: 1.831 ± 0.996
0.0ProXaa: 0.0 ± 0.0
Gln
4.119GlnAla: 4.119 ± 2.162
0.0GlnCys: 0.0 ± 0.0
2.746GlnAsp: 2.746 ± 1.147
2.288GlnGlu: 2.288 ± 0.875
2.288GlnPhe: 2.288 ± 2.27
1.831GlnGly: 1.831 ± 0.996
1.831GlnHis: 1.831 ± 0.757
2.288GlnIle: 2.288 ± 0.74
3.661GlnLys: 3.661 ± 1.233
5.492GlnLeu: 5.492 ± 2.988
1.831GlnMet: 1.831 ± 0.996
1.831GlnAsn: 1.831 ± 0.996
2.746GlnPro: 2.746 ± 1.04
2.288GlnGln: 2.288 ± 1.245
0.915GlnArg: 0.915 ± 0.498
3.204GlnSer: 3.204 ± 1.035
4.577GlnThr: 4.577 ± 1.28
3.204GlnVal: 3.204 ± 2.171
0.915GlnTrp: 0.915 ± 0.498
0.915GlnTyr: 0.915 ± 0.498
0.0GlnXaa: 0.0 ± 0.0
Arg
2.288ArgAla: 2.288 ± 1.534
0.0ArgCys: 0.0 ± 0.0
1.373ArgAsp: 1.373 ± 0.747
3.204ArgGlu: 3.204 ± 1.743
1.373ArgPhe: 1.373 ± 1.946
1.373ArgGly: 1.373 ± 0.721
2.288ArgHis: 2.288 ± 1.436
2.288ArgIle: 2.288 ± 0.64
3.204ArgLys: 3.204 ± 0.451
4.577ArgLeu: 4.577 ± 1.335
0.458ArgMet: 0.458 ± 0.249
1.831ArgAsn: 1.831 ± 1.663
0.458ArgPro: 0.458 ± 0.249
4.577ArgGln: 4.577 ± 1.67
1.831ArgArg: 1.831 ± 1.173
0.458ArgSer: 0.458 ± 1.131
1.831ArgThr: 1.831 ± 1.173
1.831ArgVal: 1.831 ± 0.687
0.915ArgTrp: 0.915 ± 0.498
2.288ArgTyr: 2.288 ± 1.536
0.0ArgXaa: 0.0 ± 0.0
Ser
5.492SerAla: 5.492 ± 1.686
0.458SerCys: 0.458 ± 1.131
2.746SerAsp: 2.746 ± 0.995
3.204SerGlu: 3.204 ± 1.385
4.119SerPhe: 4.119 ± 1.964
1.373SerGly: 1.373 ± 2.13
0.915SerHis: 0.915 ± 1.009
3.661SerIle: 3.661 ± 1.373
5.95SerLys: 5.95 ± 1.53
10.069SerLeu: 10.069 ± 5.37
0.458SerMet: 0.458 ± 0.249
3.661SerAsn: 3.661 ± 2.397
3.661SerPro: 3.661 ± 1.233
4.577SerGln: 4.577 ± 2.117
2.288SerArg: 2.288 ± 0.74
5.95SerSer: 5.95 ± 7.062
6.865SerThr: 6.865 ± 2.201
2.746SerVal: 2.746 ± 1.494
0.915SerTrp: 0.915 ± 0.747
2.746SerTyr: 2.746 ± 1.494
0.0SerXaa: 0.0 ± 0.0
Thr
5.034ThrAla: 5.034 ± 2.085
1.373ThrCys: 1.373 ± 1.324
3.204ThrAsp: 3.204 ± 0.451
5.034ThrGlu: 5.034 ± 2.739
4.577ThrPhe: 4.577 ± 1.67
5.034ThrGly: 5.034 ± 1.133
4.119ThrHis: 4.119 ± 2.129
2.746ThrIle: 2.746 ± 2.305
4.119ThrLys: 4.119 ± 1.384
6.865ThrLeu: 6.865 ± 2.066
1.373ThrMet: 1.373 ± 0.747
3.204ThrAsn: 3.204 ± 1.385
5.95ThrPro: 5.95 ± 0.914
1.831ThrGln: 1.831 ± 0.687
3.204ThrArg: 3.204 ± 2.733
4.119ThrSer: 4.119 ± 2.697
3.204ThrThr: 3.204 ± 0.451
5.492ThrVal: 5.492 ± 1.942
0.458ThrTrp: 0.458 ± 0.249
1.831ThrTyr: 1.831 ± 0.996
0.0ThrXaa: 0.0 ± 0.0
Val
2.288ValAla: 2.288 ± 1.436
1.373ValCys: 1.373 ± 0.747
2.288ValAsp: 2.288 ± 1.245
2.288ValGlu: 2.288 ± 0.875
1.373ValPhe: 1.373 ± 0.747
2.746ValGly: 2.746 ± 1.178
2.746ValHis: 2.746 ± 0.494
3.661ValIle: 3.661 ± 1.236
3.661ValLys: 3.661 ± 1.156
4.577ValLeu: 4.577 ± 2.942
0.915ValMet: 0.915 ± 0.498
1.373ValAsn: 1.373 ± 0.721
0.915ValPro: 0.915 ± 0.831
3.204ValGln: 3.204 ± 1.385
3.204ValArg: 3.204 ± 1.234
1.831ValSer: 1.831 ± 1.301
6.407ValThr: 6.407 ± 3.578
3.204ValVal: 3.204 ± 1.743
0.458ValTrp: 0.458 ± 0.249
2.288ValTyr: 2.288 ± 1.059
0.0ValXaa: 0.0 ± 0.0
Trp
1.831TrpAla: 1.831 ± 0.996
0.0TrpCys: 0.0 ± 0.0
1.373TrpAsp: 1.373 ± 0.71
0.915TrpGlu: 0.915 ± 0.831
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.458TrpHis: 0.458 ± 0.994
0.458TrpIle: 0.458 ± 0.249
0.0TrpLys: 0.0 ± 0.0
1.373TrpLeu: 1.373 ± 0.747
0.0TrpMet: 0.0 ± 0.0
0.915TrpAsn: 0.915 ± 0.831
0.458TrpPro: 0.458 ± 0.249
0.915TrpGln: 0.915 ± 0.498
0.458TrpArg: 0.458 ± 0.249
0.0TrpSer: 0.0 ± 0.0
1.831TrpThr: 1.831 ± 0.996
0.915TrpVal: 0.915 ± 0.498
0.915TrpTrp: 0.915 ± 0.498
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.661TyrAla: 3.661 ± 1.864
1.831TyrCys: 1.831 ± 0.836
0.915TyrAsp: 0.915 ± 0.831
0.458TyrGlu: 0.458 ± 0.249
1.373TyrPhe: 1.373 ± 0.721
0.915TyrGly: 0.915 ± 0.498
0.458TyrHis: 0.458 ± 0.249
1.373TyrIle: 1.373 ± 0.71
1.831TyrLys: 1.831 ± 0.932
2.288TyrLeu: 2.288 ± 1.193
1.831TyrMet: 1.831 ± 0.996
1.831TyrAsn: 1.831 ± 0.687
0.458TyrPro: 0.458 ± 0.249
0.458TyrGln: 0.458 ± 0.249
0.0TyrArg: 0.0 ± 0.0
5.034TyrSer: 5.034 ± 2.062
2.288TyrThr: 2.288 ± 0.875
0.915TyrVal: 0.915 ± 0.498
0.0TyrTrp: 0.0 ± 0.0
0.915TyrTyr: 0.915 ± 0.498
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2186 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski