Amino acid dipeptide frequency for Pepper yellow vein Mali virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
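As a minimal sketch of the calculation described above (not the Proteome-pI implementation), dipeptide frequencies can be obtained by counting overlapping residue pairs and dividing by the total number of pairs. The function name `dipeptide_frequencies`, the toy sequences, the mapping of non-standard letters to X, and the choice to count pairs within each protein separately are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted within each protein (no pair spans two
    proteins) and any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # overlapping pairs: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Toy input, not the real proteome
freqs = dipeptide_frequencies(["MSSTLA", "MASSB"])
expected = 1000.0 / 400  # uniform expectation over 400 dipeptides, in per mille

print(len(freqs))          # number of dipeptides, including those with Xaa
print(freqs["SS"], expected)
```

With one-letter codes, `freqs["SS"]` corresponds to the SerSer cell of the table; the sketch returns all 441 combinations so that dipeptides that never occur get an explicit 0.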

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
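To make the unit conversion concrete, the following sketch converts per mille values to plain fractions and compares them with the uniform expectation of 2.5‰. The helper names are hypothetical, and the comparison against a flat 2.5‰ threshold is an assumption; the page may apply a different rule for its red and blue marking. The two example values (SerSer and AlaPhe) are taken from the table.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value):
    """Label a per mille value relative to the uniform expectation."""
    if value > EXPECTED_PER_MILLE:
        return "over"
    if value < EXPECTED_PER_MILLE:
        return "under"
    return "as expected"


for name, value in [("SerSer", 12.026), ("AlaPhe", 0.0)]:
    print(name, per_mille_to_fraction(value), representation(value))
```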

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.475 ± 3.28
AlaCys: 0.925 ± 0.72
AlaAsp: 0.925 ± 0.681
AlaGlu: 3.7 ± 1.439
AlaPhe: 0.0 ± 0.0
AlaGly: 0.0 ± 0.0
AlaHis: 0.925 ± 1.062
AlaIle: 3.7 ± 1.938
AlaLys: 4.625 ± 0.893
AlaLeu: 6.475 ± 1.918
AlaMet: 0.925 ± 0.681
AlaAsn: 0.925 ± 0.681
AlaPro: 2.775 ± 1.347
AlaGln: 1.85 ± 0.831
AlaArg: 5.55 ± 2.34
AlaSer: 2.775 ± 1.409
AlaThr: 4.625 ± 1.789
AlaVal: 4.625 ± 1.825
AlaTrp: 0.925 ± 0.681
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.85 ± 2.124
CysAsp: 0.0 ± 0.0
CysGlu: 0.925 ± 0.72
CysPhe: 0.925 ± 1.09
CysGly: 1.85 ± 0.853
CysHis: 0.0 ± 0.0
CysIle: 1.85 ± 1.127
CysLys: 0.925 ± 0.72
CysLeu: 1.85 ± 1.469
CysMet: 0.925 ± 1.062
CysAsn: 0.925 ± 0.681
CysPro: 0.925 ± 1.062
CysGln: 0.0 ± 0.0
CysArg: 1.85 ± 1.363
CysSer: 3.7 ± 1.807
CysThr: 1.85 ± 0.831
CysVal: 1.85 ± 0.831
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.775 ± 1.338
AspCys: 1.85 ± 1.885
AspAsp: 2.775 ± 0.903
AspGlu: 0.925 ± 0.72
AspPhe: 0.925 ± 0.72
AspGly: 1.85 ± 1.363
AspHis: 0.925 ± 1.09
AspIle: 2.775 ± 2.025
AspLys: 1.85 ± 0.853
AspLeu: 7.401 ± 3.033
AspMet: 0.925 ± 0.625
AspAsn: 1.85 ± 1.127
AspPro: 1.85 ± 1.064
AspGln: 0.0 ± 0.0
AspArg: 3.7 ± 1.662
AspSer: 5.55 ± 0.91
AspThr: 0.925 ± 0.681
AspVal: 5.55 ± 1.805
AspTrp: 1.85 ± 0.853
AspTyr: 0.925 ± 1.062
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.475 ± 2.114
GluCys: 0.925 ± 0.681
GluAsp: 0.0 ± 0.0
GluGlu: 4.625 ± 2.592
GluPhe: 3.7 ± 1.982
GluGly: 4.625 ± 1.042
GluHis: 0.925 ± 1.09
GluIle: 1.85 ± 2.18
GluLys: 0.925 ± 0.681
GluLeu: 3.7 ± 1.439
GluMet: 0.925 ± 0.99
GluAsn: 3.7 ± 2.063
GluPro: 4.625 ± 1.042
GluGln: 2.775 ± 1.398
GluArg: 0.0 ± 0.0
GluSer: 0.925 ± 1.062
GluThr: 2.775 ± 1.894
GluVal: 0.925 ± 0.99
GluTrp: 1.85 ± 0.853
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.925 ± 0.72
PheAsp: 2.775 ± 1.398
PheGlu: 0.925 ± 0.681
PhePhe: 3.7 ± 1.662
PheGly: 1.85 ± 0.831
PheHis: 3.7 ± 1.035
PheIle: 2.775 ± 1.403
PheLys: 1.85 ± 1.056
PheLeu: 7.401 ± 2.064
PheMet: 0.0 ± 0.0
PheAsn: 3.7 ± 3.076
PhePro: 1.85 ± 1.064
PheGln: 3.7 ± 1.937
PheArg: 4.625 ± 1.825
PheSer: 0.925 ± 0.943
PheThr: 2.775 ± 1.165
PheVal: 0.925 ± 0.681
PheTrp: 0.0 ± 0.0
PheTyr: 0.925 ± 0.943
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.775 ± 2.044
GlyCys: 2.775 ± 0.903
GlyAsp: 4.625 ± 1.602
GlyGlu: 2.775 ± 1.284
GlyPhe: 1.85 ± 1.475
GlyGly: 3.7 ± 1.319
GlyHis: 3.7 ± 1.365
GlyIle: 5.55 ± 1.726
GlyLys: 4.625 ± 2.121
GlyLeu: 1.85 ± 1.435
GlyMet: 0.925 ± 0.72
GlyAsn: 1.85 ± 1.98
GlyPro: 4.625 ± 2.184
GlyGln: 1.85 ± 1.221
GlyArg: 0.925 ± 0.681
GlySer: 0.925 ± 1.062
GlyThr: 2.775 ± 1.949
GlyVal: 2.775 ± 2.035
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.925 ± 0.99
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.775 ± 1.434
HisCys: 1.85 ± 1.475
HisAsp: 2.775 ± 1.876
HisGlu: 1.85 ± 1.435
HisPhe: 1.85 ± 1.064
HisGly: 1.85 ± 1.475
HisHis: 0.925 ± 0.943
HisIle: 2.775 ± 1.766
HisLys: 1.85 ± 1.578
HisLeu: 1.85 ± 1.363
HisMet: 0.0 ± 0.0
HisAsn: 2.775 ± 1.403
HisPro: 2.775 ± 1.222
HisGln: 2.775 ± 1.409
HisArg: 2.775 ± 1.949
HisSer: 0.925 ± 0.943
HisThr: 4.625 ± 2.026
HisVal: 3.7 ± 1.012
HisTrp: 0.925 ± 0.681
HisTyr: 0.925 ± 0.681
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.925 ± 0.72
IleCys: 0.0 ± 0.0
IleAsp: 3.7 ± 1.706
IleGlu: 0.0 ± 0.0
IlePhe: 2.775 ± 2.044
IleGly: 0.0 ± 0.0
IleHis: 2.775 ± 1.876
IleIle: 2.775 ± 1.398
IleLys: 6.475 ± 0.664
IleLeu: 1.85 ± 1.059
IleMet: 1.85 ± 1.351
IleAsn: 4.625 ± 1.812
IlePro: 0.925 ± 0.681
IleGln: 3.7 ± 1.706
IleArg: 8.326 ± 3.56
IleSer: 5.55 ± 3.047
IleThr: 3.7 ± 2.283
IleVal: 1.85 ± 0.831
IleTrp: 1.85 ± 1.059
IleTyr: 1.85 ± 1.441
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.85 ± 1.475
LysCys: 1.85 ± 1.056
LysAsp: 0.925 ± 0.681
LysGlu: 5.55 ± 2.242
LysPhe: 2.775 ± 0.869
LysGly: 1.85 ± 0.853
LysHis: 1.85 ± 0.831
LysIle: 2.775 ± 1.398
LysLys: 1.85 ± 0.831
LysLeu: 0.925 ± 0.681
LysMet: 0.0 ± 0.0
LysAsn: 4.625 ± 1.675
LysPro: 3.7 ± 1.035
LysGln: 2.775 ± 1.409
LysArg: 4.625 ± 2.026
LysSer: 3.7 ± 1.954
LysThr: 3.7 ± 1.365
LysVal: 5.55 ± 1.022
LysTrp: 0.0 ± 0.0
LysTyr: 4.625 ± 1.061
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 0.925 ± 1.062
LeuCys: 2.775 ± 1.347
LeuAsp: 4.625 ± 1.995
LeuGlu: 4.625 ± 2.592
LeuPhe: 1.85 ± 1.435
LeuGly: 5.55 ± 2.07
LeuHis: 3.7 ± 1.365
LeuIle: 2.775 ± 2.364
LeuLys: 6.475 ± 1.859
LeuLeu: 3.7 ± 1.214
LeuMet: 0.925 ± 0.681
LeuAsn: 7.401 ± 2.331
LeuPro: 1.85 ± 1.064
LeuGln: 4.625 ± 2.902
LeuArg: 5.55 ± 3.125
LeuSer: 2.775 ± 2.044
LeuThr: 4.625 ± 1.274
LeuVal: 3.7 ± 1.197
LeuTrp: 0.0 ± 0.0
LeuTyr: 5.55 ± 2.743
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.775 ± 1.338
MetCys: 0.0 ± 0.0
MetAsp: 2.775 ± 1.36
MetGlu: 0.925 ± 1.09
MetPhe: 3.7 ± 2.269
MetGly: 2.775 ± 1.1
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.925 ± 0.72
MetLeu: 1.85 ± 1.469
MetMet: 0.0 ± 0.0
MetAsn: 0.925 ± 1.09
MetPro: 0.0 ± 0.0
MetGln: 1.85 ± 0.853
MetArg: 0.0 ± 0.0
MetSer: 0.925 ± 0.72
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.925 ± 1.062
MetTyr: 1.85 ± 1.441
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.7 ± 1.954
AsnCys: 0.0 ± 0.0
AsnAsp: 2.775 ± 1.338
AsnGlu: 1.85 ± 1.036
AsnPhe: 1.85 ± 1.059
AsnGly: 2.775 ± 1.284
AsnHis: 6.475 ± 2.503
AsnIle: 1.85 ± 0.831
AsnLys: 1.85 ± 0.853
AsnLeu: 6.475 ± 3.11
AsnMet: 2.775 ± 1.358
AsnAsn: 3.7 ± 3.089
AsnPro: 2.775 ± 0.869
AsnGln: 3.7 ± 1.133
AsnArg: 1.85 ± 0.831
AsnSer: 1.85 ± 1.363
AsnThr: 5.55 ± 1.372
AsnVal: 3.7 ± 1.534
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.775 ± 2.044
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.85 ± 1.064
ProCys: 1.85 ± 1.036
ProAsp: 0.925 ± 0.72
ProGlu: 1.85 ± 1.064
ProPhe: 1.85 ± 1.056
ProGly: 3.7 ± 1.552
ProHis: 3.7 ± 1.786
ProIle: 2.775 ± 1.247
ProLys: 3.7 ± 2.726
ProLeu: 4.625 ± 1.721
ProMet: 1.85 ± 1.284
ProAsn: 3.7 ± 1.365
ProPro: 2.775 ± 2.044
ProGln: 4.625 ± 1.185
ProArg: 4.625 ± 2.32
ProSer: 4.625 ± 1.926
ProThr: 4.625 ± 2.242
ProVal: 1.85 ± 1.441
ProTrp: 0.925 ± 0.681
ProTyr: 2.775 ± 1.398
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.55 ± 2.654
GlnCys: 0.925 ± 0.681
GlnAsp: 3.7 ± 1.012
GlnGlu: 2.775 ± 0.903
GlnPhe: 0.925 ± 0.681
GlnGly: 3.7 ± 1.136
GlnHis: 1.85 ± 1.372
GlnIle: 1.85 ± 1.056
GlnLys: 0.925 ± 1.062
GlnLeu: 1.85 ± 0.853
GlnMet: 0.925 ± 0.943
GlnAsn: 1.85 ± 1.036
GlnPro: 4.625 ± 3.133
GlnGln: 0.925 ± 0.99
GlnArg: 2.775 ± 1.073
GlnSer: 4.625 ± 1.602
GlnThr: 0.925 ± 1.09
GlnVal: 6.475 ± 1.527
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.925 ± 0.681
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.625 ± 2.425
ArgCys: 1.85 ± 1.036
ArgAsp: 5.55 ± 2.099
ArgGlu: 2.775 ± 1.347
ArgPhe: 6.475 ± 1.531
ArgGly: 4.625 ± 1.552
ArgHis: 2.775 ± 2.237
ArgIle: 3.7 ± 1.136
ArgLys: 2.775 ± 1.64
ArgLeu: 1.85 ± 1.036
ArgMet: 1.85 ± 1.441
ArgAsn: 0.925 ± 0.681
ArgPro: 8.326 ± 1.945
ArgGln: 0.925 ± 1.062
ArgArg: 8.326 ± 3.483
ArgSer: 5.55 ± 1.982
ArgThr: 3.7 ± 2.036
ArgVal: 2.775 ± 1.447
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.85 ± 1.036
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.775 ± 1.347
SerCys: 0.0 ± 0.0
SerAsp: 2.775 ± 0.903
SerGlu: 0.925 ± 0.681
SerPhe: 1.85 ± 0.853
SerGly: 0.925 ± 0.72
SerHis: 0.0 ± 0.0
SerIle: 6.475 ± 0.999
SerLys: 5.55 ± 1.887
SerLeu: 1.85 ± 1.363
SerMet: 0.0 ± 0.0
SerAsn: 2.775 ± 1.338
SerPro: 8.326 ± 2.154
SerGln: 3.7 ± 2.566
SerArg: 4.625 ± 1.942
SerSer: 12.026 ± 3.49
SerThr: 10.176 ± 3.652
SerVal: 2.775 ± 2.459
SerTrp: 1.85 ± 0.831
SerTyr: 3.7 ± 1.136
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.7 ± 1.706
ThrCys: 0.925 ± 0.99
ThrAsp: 0.925 ± 0.99
ThrGlu: 1.85 ± 1.221
ThrPhe: 2.775 ± 1.347
ThrGly: 5.55 ± 2.269
ThrHis: 6.475 ± 2.142
ThrIle: 2.775 ± 2.367
ThrLys: 2.775 ± 1.222
ThrLeu: 7.401 ± 1.822
ThrMet: 0.0 ± 0.0
ThrAsn: 7.401 ± 2.843
ThrPro: 3.7 ± 1.136
ThrGln: 2.775 ± 1.247
ThrArg: 3.7 ± 1.138
ThrSer: 5.55 ± 3.032
ThrThr: 1.85 ± 1.495
ThrVal: 2.775 ± 1.398
ThrTrp: 0.925 ± 0.99
ThrTyr: 1.85 ± 1.036
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 0.925 ± 0.681
ValAsp: 2.775 ± 1.403
ValGlu: 3.7 ± 2.15
ValPhe: 3.7 ± 1.788
ValGly: 1.85 ± 1.441
ValHis: 0.925 ± 1.062
ValIle: 4.625 ± 2.354
ValLys: 3.7 ± 2.063
ValLeu: 5.55 ± 2.946
ValMet: 2.775 ± 0.869
ValAsn: 1.85 ± 1.059
ValPro: 2.775 ± 0.903
ValGln: 2.775 ± 1.97
ValArg: 2.775 ± 2.161
ValSer: 6.475 ± 2.059
ValThr: 2.775 ± 1.398
ValVal: 3.7 ± 1.197
ValTrp: 1.85 ± 1.056
ValTyr: 2.775 ± 1.398
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.925 ± 0.681
TrpCys: 0.0 ± 0.0
TrpAsp: 0.925 ± 1.062
TrpGlu: 0.925 ± 1.09
TrpPhe: 0.0 ± 0.0
TrpGly: 0.925 ± 0.681
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.925 ± 0.681
TrpLeu: 0.0 ± 0.0
TrpMet: 0.925 ± 0.72
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.925 ± 0.681
TrpArg: 1.85 ± 0.853
TrpSer: 0.925 ± 0.943
TrpThr: 2.775 ± 0.869
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.85 ± 0.965
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.85 ± 0.831
TyrCys: 0.0 ± 0.0
TyrAsp: 0.925 ± 0.72
TyrGlu: 2.775 ± 1.434
TyrPhe: 1.85 ± 1.056
TyrGly: 1.85 ± 0.831
TyrHis: 0.925 ± 0.681
TyrIle: 1.85 ± 0.831
TyrLys: 0.925 ± 0.681
TyrLeu: 5.55 ± 0.876
TyrMet: 2.775 ± 1.328
TyrAsn: 2.775 ± 0.886
TyrPro: 0.925 ± 0.99
TyrGln: 1.85 ± 0.831
TyrArg: 2.775 ± 2.161
TyrSer: 2.775 ± 1.347
TyrThr: 0.925 ± 0.72
TyrVal: 2.775 ± 1.399
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.925 ± 0.943
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1082 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
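The note above can be illustrated with a minimal sketch of a protein-level bootstrap: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the toy sequences, the fixed random seed, and the use of the standard deviation as the error measure are assumptions; the exact procedure used by Proteome-pI is not described here.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


toy = ["MSSTLA", "MASSR", "MKVLSS"]  # toy sequences, not the real proteome
print(bootstrap_error(toy, "SS"))
```

Because resampling happens at the protein level, a proteome with only a few proteins (6 here) yields few distinct resamples, which is one reason the reported errors can be large relative to the values.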

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski