Amino acid dipepetide frequency for Caucasus prunus virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.936AlaAla: 3.936 ± 2.043
1.431AlaCys: 1.431 ± 0.997
2.504AlaAsp: 2.504 ± 2.516
2.504AlaGlu: 2.504 ± 1.395
2.862AlaPhe: 2.862 ± 1.034
3.936AlaGly: 3.936 ± 1.281
1.431AlaHis: 1.431 ± 0.758
3.578AlaIle: 3.578 ± 1.167
8.945AlaLys: 8.945 ± 1.13
5.009AlaLeu: 5.009 ± 1.113
1.073AlaMet: 1.073 ± 2.278
2.862AlaAsn: 2.862 ± 0.731
0.716AlaPro: 0.716 ± 0.495
1.431AlaGln: 1.431 ± 0.482
3.578AlaArg: 3.578 ± 3.269
3.22AlaSer: 3.22 ± 1.207
2.147AlaThr: 2.147 ± 0.774
2.504AlaVal: 2.504 ± 1.048
0.716AlaTrp: 0.716 ± 1.499
0.716AlaTyr: 0.716 ± 0.379
0.0AlaXaa: 0.0 ± 0.0
Cys
1.073CysAla: 1.073 ± 0.601
0.716CysCys: 0.716 ± 1.291
2.147CysAsp: 2.147 ± 0.779
0.716CysGlu: 0.716 ± 0.495
1.073CysPhe: 1.073 ± 0.569
2.147CysGly: 2.147 ± 1.132
1.431CysHis: 1.431 ± 2.186
1.073CysIle: 1.073 ± 0.569
1.431CysLys: 1.431 ± 0.482
1.789CysLeu: 1.789 ± 0.978
0.716CysMet: 0.716 ± 1.093
0.0CysAsn: 0.0 ± 0.0
0.716CysPro: 0.716 ± 0.495
0.0CysGln: 0.0 ± 0.0
1.073CysArg: 1.073 ± 0.601
2.862CysSer: 2.862 ± 1.981
1.789CysThr: 1.789 ± 0.948
0.716CysVal: 0.716 ± 0.495
0.358CysTrp: 0.358 ± 0.6
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.936AspAla: 3.936 ± 1.058
0.358AspCys: 0.358 ± 0.19
4.651AspAsp: 4.651 ± 1.868
5.725AspGlu: 5.725 ± 1.389
3.22AspPhe: 3.22 ± 1.219
5.009AspGly: 5.009 ± 1.113
1.789AspHis: 1.789 ± 0.669
2.147AspIle: 2.147 ± 0.712
3.22AspLys: 3.22 ± 0.758
5.367AspLeu: 5.367 ± 1.674
1.789AspMet: 1.789 ± 0.578
1.789AspAsn: 1.789 ± 0.948
2.147AspPro: 2.147 ± 0.774
2.504AspGln: 2.504 ± 1.076
4.651AspArg: 4.651 ± 2.222
4.293AspSer: 4.293 ± 0.716
2.862AspThr: 2.862 ± 1.981
3.22AspVal: 3.22 ± 0.676
1.073AspTrp: 1.073 ± 0.569
2.504AspTyr: 2.504 ± 1.327
0.0AspXaa: 0.0 ± 0.0
Glu
5.725GluAla: 5.725 ± 0.728
1.789GluCys: 1.789 ± 0.948
3.936GluAsp: 3.936 ± 0.965
5.009GluGlu: 5.009 ± 0.912
3.22GluPhe: 3.22 ± 1.351
2.862GluGly: 2.862 ± 0.529
1.073GluHis: 1.073 ± 0.45
3.578GluIle: 3.578 ± 1.167
6.082GluLys: 6.082 ± 2.042
6.082GluLeu: 6.082 ± 1.452
2.147GluMet: 2.147 ± 1.204
3.578GluAsn: 3.578 ± 1.83
3.22GluPro: 3.22 ± 1.207
2.862GluGln: 2.862 ± 1.211
2.862GluArg: 2.862 ± 2.197
3.22GluSer: 3.22 ± 1.207
3.936GluThr: 3.936 ± 1.724
5.009GluVal: 5.009 ± 2.043
0.716GluTrp: 0.716 ± 0.652
2.147GluTyr: 2.147 ± 0.774
0.0GluXaa: 0.0 ± 0.0
Phe
5.009PheAla: 5.009 ± 0.31
2.504PheCys: 2.504 ± 0.867
5.009PheAsp: 5.009 ± 1.072
5.725PheGlu: 5.725 ± 1.169
4.651PhePhe: 4.651 ± 2.465
3.578PheGly: 3.578 ± 0.735
0.716PheHis: 0.716 ± 0.495
3.22PheIle: 3.22 ± 0.676
3.578PheLys: 3.578 ± 1.357
6.44PheLeu: 6.44 ± 3.413
1.789PheMet: 1.789 ± 0.669
2.504PheAsn: 2.504 ± 0.455
3.578PhePro: 3.578 ± 1.339
1.789PheGln: 1.789 ± 0.948
3.578PheArg: 3.578 ± 1.966
5.009PheSer: 5.009 ± 2.097
2.862PheThr: 2.862 ± 0.712
3.22PheVal: 3.22 ± 0.809
0.358PheTrp: 0.358 ± 0.19
0.716PheTyr: 0.716 ± 1.093
0.0PheXaa: 0.0 ± 0.0
Gly
2.504GlyAla: 2.504 ± 0.907
1.073GlyCys: 1.073 ± 1.084
5.009GlyAsp: 5.009 ± 1.734
3.578GlyGlu: 3.578 ± 0.801
2.862GlyPhe: 2.862 ± 0.731
4.293GlyGly: 4.293 ± 0.81
2.147GlyHis: 2.147 ± 1.195
2.504GlyIle: 2.504 ± 0.827
4.651GlyLys: 4.651 ± 3.333
6.44GlyLeu: 6.44 ± 1.316
0.716GlyMet: 0.716 ± 0.495
6.44GlyAsn: 6.44 ± 0.839
1.073GlyPro: 1.073 ± 0.601
1.789GlyGln: 1.789 ± 0.928
2.504GlyArg: 2.504 ± 0.867
6.44GlySer: 6.44 ± 3.196
1.431GlyThr: 1.431 ± 1.305
5.009GlyVal: 5.009 ± 3.883
0.716GlyTrp: 0.716 ± 0.495
0.716GlyTyr: 0.716 ± 0.379
0.0GlyXaa: 0.0 ± 0.0
His
0.716HisAla: 0.716 ± 0.652
0.0HisCys: 0.0 ± 0.0
1.431HisAsp: 1.431 ± 0.758
1.789HisGlu: 1.789 ± 0.928
3.22HisPhe: 3.22 ± 1.951
1.789HisGly: 1.789 ± 0.928
1.073HisHis: 1.073 ± 0.569
1.431HisIle: 1.431 ± 0.982
1.073HisLys: 1.073 ± 0.569
3.936HisLeu: 3.936 ± 1.095
0.0HisMet: 0.0 ± 0.0
0.716HisAsn: 0.716 ± 0.379
0.716HisPro: 0.716 ± 0.379
0.716HisGln: 0.716 ± 0.652
2.862HisArg: 2.862 ± 0.712
2.504HisSer: 2.504 ± 0.455
0.716HisThr: 0.716 ± 0.652
1.789HisVal: 1.789 ± 0.874
1.073HisTrp: 1.073 ± 0.569
0.716HisTyr: 0.716 ± 0.379
0.0HisXaa: 0.0 ± 0.0
Ile
3.22IleAla: 3.22 ± 0.809
1.431IleCys: 1.431 ± 0.758
4.293IleAsp: 4.293 ± 0.81
3.936IleGlu: 3.936 ± 1.277
3.22IlePhe: 3.22 ± 1.219
1.789IleGly: 1.789 ± 0.874
3.22IleHis: 3.22 ± 1.047
1.073IleIle: 1.073 ± 0.569
4.651IleLys: 4.651 ± 1.804
3.936IleLeu: 3.936 ± 1.566
1.789IleMet: 1.789 ± 0.781
4.293IleAsn: 4.293 ± 1.802
1.789IlePro: 1.789 ± 0.578
2.147IleGln: 2.147 ± 1.581
4.293IleArg: 4.293 ± 0.974
2.504IleSer: 2.504 ± 0.455
1.789IleThr: 1.789 ± 0.578
2.862IleVal: 2.862 ± 1.057
0.0IleTrp: 0.0 ± 0.0
1.431IleTyr: 1.431 ± 0.758
0.0IleXaa: 0.0 ± 0.0
Lys
4.293LysAla: 4.293 ± 1.248
1.073LysCys: 1.073 ± 0.569
2.862LysAsp: 2.862 ± 1.034
5.009LysGlu: 5.009 ± 0.907
5.367LysPhe: 5.367 ± 1.421
3.936LysGly: 3.936 ± 1.389
1.073LysHis: 1.073 ± 1.022
3.578LysIle: 3.578 ± 1.385
6.082LysLys: 6.082 ± 2.675
7.871LysLeu: 7.871 ± 1.515
2.147LysMet: 2.147 ± 1.195
4.293LysAsn: 4.293 ± 1.082
1.431LysPro: 1.431 ± 0.991
1.431LysGln: 1.431 ± 0.758
5.367LysArg: 5.367 ± 2.921
5.725LysSer: 5.725 ± 1.928
3.22LysThr: 3.22 ± 2.151
5.725LysVal: 5.725 ± 1.199
1.073LysTrp: 1.073 ± 0.45
1.073LysTyr: 1.073 ± 0.569
0.0LysXaa: 0.0 ± 0.0
Leu
5.009LeuAla: 5.009 ± 2.043
3.22LeuCys: 3.22 ± 0.758
4.651LeuAsp: 4.651 ± 1.231
5.725LeuGlu: 5.725 ± 0.93
6.082LeuPhe: 6.082 ± 2.042
5.725LeuGly: 5.725 ± 1.496
1.789LeuHis: 1.789 ± 0.928
5.009LeuIle: 5.009 ± 0.31
8.587LeuLys: 8.587 ± 2.39
8.229LeuLeu: 8.229 ± 2.01
2.147LeuMet: 2.147 ± 1.138
4.651LeuAsn: 4.651 ± 2.465
3.936LeuPro: 3.936 ± 1.058
2.862LeuGln: 2.862 ± 1.297
5.725LeuArg: 5.725 ± 2.61
9.66LeuSer: 9.66 ± 1.33
5.725LeuThr: 5.725 ± 0.728
5.009LeuVal: 5.009 ± 2.02
0.358LeuTrp: 0.358 ± 0.6
2.504LeuTyr: 2.504 ± 0.867
0.0LeuXaa: 0.0 ± 0.0
Met
1.789MetAla: 1.789 ± 0.669
0.358MetCys: 0.358 ± 0.19
0.716MetAsp: 0.716 ± 0.652
1.789MetGlu: 1.789 ± 0.578
2.147MetPhe: 2.147 ± 1.204
2.504MetGly: 2.504 ± 1.501
1.431MetHis: 1.431 ± 0.997
1.789MetIle: 1.789 ± 0.948
1.431MetLys: 1.431 ± 0.982
2.147MetLeu: 2.147 ± 0.779
0.358MetMet: 0.358 ± 0.75
2.147MetAsn: 2.147 ± 1.011
1.073MetPro: 1.073 ± 0.601
0.358MetGln: 0.358 ± 0.75
1.789MetArg: 1.789 ± 1.277
2.147MetSer: 2.147 ± 0.952
0.716MetThr: 0.716 ± 1.291
1.789MetVal: 1.789 ± 0.527
0.0MetTrp: 0.0 ± 0.0
0.358MetTyr: 0.358 ± 0.19
0.0MetXaa: 0.0 ± 0.0
Asn
1.431AsnAla: 1.431 ± 1.68
1.431AsnCys: 1.431 ± 0.648
6.082AsnAsp: 6.082 ± 1.992
2.147AsnGlu: 2.147 ± 0.712
4.293AsnPhe: 4.293 ± 1.136
2.504AsnGly: 2.504 ± 0.721
2.147AsnHis: 2.147 ± 1.138
2.862AsnIle: 2.862 ± 1.057
1.431AsnLys: 1.431 ± 0.982
8.945AsnLeu: 8.945 ± 2.368
1.431AsnMet: 1.431 ± 0.648
1.073AsnAsn: 1.073 ± 0.45
1.073AsnPro: 1.073 ± 1.138
1.431AsnGln: 1.431 ± 0.758
3.578AsnArg: 3.578 ± 1.155
3.578AsnSer: 3.578 ± 1.385
1.073AsnThr: 1.073 ± 0.569
4.651AsnVal: 4.651 ± 2.133
1.073AsnTrp: 1.073 ± 0.569
1.789AsnTyr: 1.789 ± 0.948
0.0AsnXaa: 0.0 ± 0.0
Pro
0.716ProAla: 0.716 ± 1.093
0.0ProCys: 0.0 ± 0.0
0.716ProAsp: 0.716 ± 0.379
3.578ProGlu: 3.578 ± 1.551
2.862ProPhe: 2.862 ± 1.517
2.147ProGly: 2.147 ± 1.486
0.358ProHis: 0.358 ± 0.75
4.293ProIle: 4.293 ± 1.446
1.789ProLys: 1.789 ± 0.874
3.22ProLeu: 3.22 ± 0.752
0.716ProMet: 0.716 ± 0.39
1.431ProAsn: 1.431 ± 0.607
0.358ProPro: 0.358 ± 0.19
1.431ProGln: 1.431 ± 1.466
1.789ProArg: 1.789 ± 0.578
2.147ProSer: 2.147 ± 1.202
1.073ProThr: 1.073 ± 0.45
1.073ProVal: 1.073 ± 0.45
0.0ProTrp: 0.0 ± 0.0
0.716ProTyr: 0.716 ± 0.495
0.0ProXaa: 0.0 ± 0.0
Gln
2.862GlnAla: 2.862 ± 1.211
0.0GlnCys: 0.0 ± 0.0
1.789GlnAsp: 1.789 ± 1.24
1.431GlnGlu: 1.431 ± 1.518
1.073GlnPhe: 1.073 ± 0.569
2.504GlnGly: 2.504 ± 1.159
0.358GlnHis: 0.358 ± 0.6
2.147GlnIle: 2.147 ± 0.901
1.431GlnLys: 1.431 ± 0.648
3.22GlnLeu: 3.22 ± 0.652
0.358GlnMet: 0.358 ± 0.19
1.789GlnAsn: 1.789 ± 0.578
1.073GlnPro: 1.073 ± 0.45
0.0GlnGln: 0.0 ± 0.0
1.431GlnArg: 1.431 ± 0.482
2.147GlnSer: 2.147 ± 0.454
0.358GlnThr: 0.358 ± 0.19
1.431GlnVal: 1.431 ± 0.482
0.0GlnTrp: 0.0 ± 0.0
0.716GlnTyr: 0.716 ± 0.652
0.0GlnXaa: 0.0 ± 0.0
Arg
3.22ArgAla: 3.22 ± 1.029
2.147ArgCys: 2.147 ± 2.705
2.147ArgAsp: 2.147 ± 0.712
3.936ArgGlu: 3.936 ± 2.079
4.651ArgPhe: 4.651 ± 0.798
4.651ArgGly: 4.651 ± 1.773
1.073ArgHis: 1.073 ± 0.569
2.504ArgIle: 2.504 ± 1.076
2.504ArgLys: 2.504 ± 1.159
4.293ArgLeu: 4.293 ± 2.095
2.147ArgMet: 2.147 ± 1.594
3.578ArgAsn: 3.578 ± 0.951
1.789ArgPro: 1.789 ± 1.289
1.431ArgGln: 1.431 ± 0.482
2.504ArgArg: 2.504 ± 2.327
6.44ArgSer: 6.44 ± 1.58
4.293ArgThr: 4.293 ± 1.545
3.936ArgVal: 3.936 ± 2.399
0.716ArgTrp: 0.716 ± 0.379
3.22ArgTyr: 3.22 ± 1.606
0.0ArgXaa: 0.0 ± 0.0
Ser
2.504SerAla: 2.504 ± 1.44
1.789SerCys: 1.789 ± 0.578
5.725SerAsp: 5.725 ± 1.224
6.798SerGlu: 6.798 ± 2.14
6.44SerPhe: 6.44 ± 1.395
3.578SerGly: 3.578 ± 1.155
2.862SerHis: 2.862 ± 1.034
5.725SerIle: 5.725 ± 1.728
5.725SerLys: 5.725 ± 1.789
8.229SerLeu: 8.229 ± 3.833
1.431SerMet: 1.431 ± 0.482
3.578SerAsn: 3.578 ± 0.801
1.073SerPro: 1.073 ± 0.601
1.431SerGln: 1.431 ± 0.482
6.798SerArg: 6.798 ± 2.109
3.22SerSer: 3.22 ± 1.047
2.862SerThr: 2.862 ± 3.036
5.009SerVal: 5.009 ± 1.734
0.716SerTrp: 0.716 ± 0.379
1.789SerTyr: 1.789 ± 0.578
0.0SerXaa: 0.0 ± 0.0
Thr
1.073ThrAla: 1.073 ± 1.084
0.716ThrCys: 0.716 ± 1.291
1.431ThrAsp: 1.431 ± 0.482
1.431ThrGlu: 1.431 ± 0.758
4.293ThrPhe: 4.293 ± 0.727
3.936ThrGly: 3.936 ± 1.281
1.431ThrHis: 1.431 ± 1.305
2.147ThrIle: 2.147 ± 0.952
3.22ThrLys: 3.22 ± 0.809
5.009ThrLeu: 5.009 ± 2.317
0.0ThrMet: 0.0 ± 0.0
1.789ThrAsn: 1.789 ± 1.095
3.22ThrPro: 3.22 ± 0.652
0.716ThrGln: 0.716 ± 0.379
1.073ThrArg: 1.073 ± 1.138
4.651ThrSer: 4.651 ± 0.798
1.073ThrThr: 1.073 ± 1.138
2.862ThrVal: 2.862 ± 1.016
0.0ThrTrp: 0.0 ± 0.0
1.073ThrTyr: 1.073 ± 0.569
0.0ThrXaa: 0.0 ± 0.0
Val
3.22ValAla: 3.22 ± 0.752
1.431ValCys: 1.431 ± 1.371
5.367ValAsp: 5.367 ± 0.531
5.009ValGlu: 5.009 ± 1.613
2.862ValPhe: 2.862 ± 1.297
2.862ValGly: 2.862 ± 0.731
1.431ValHis: 1.431 ± 0.758
2.862ValIle: 2.862 ± 1.057
5.367ValLys: 5.367 ± 2.24
4.293ValLeu: 4.293 ± 1.056
3.578ValMet: 3.578 ± 1.497
5.725ValAsn: 5.725 ± 0.992
1.431ValPro: 1.431 ± 1.949
1.431ValGln: 1.431 ± 0.648
2.862ValArg: 2.862 ± 1.836
4.293ValSer: 4.293 ± 2.704
2.147ValThr: 2.147 ± 0.774
3.22ValVal: 3.22 ± 1.207
0.358ValTrp: 0.358 ± 0.6
1.431ValTyr: 1.431 ± 0.482
0.0ValXaa: 0.0 ± 0.0
Trp
0.358TrpAla: 0.358 ± 0.75
0.0TrpCys: 0.0 ± 0.0
0.358TrpAsp: 0.358 ± 0.19
0.716TrpGlu: 0.716 ± 0.652
0.358TrpPhe: 0.358 ± 0.19
0.716TrpGly: 0.716 ± 1.2
0.358TrpHis: 0.358 ± 0.19
0.358TrpIle: 0.358 ± 0.19
0.0TrpLys: 0.0 ± 0.0
0.358TrpLeu: 0.358 ± 0.19
1.073TrpMet: 1.073 ± 0.569
0.716TrpAsn: 0.716 ± 0.379
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.073TrpArg: 1.073 ± 0.797
1.789TrpSer: 1.789 ± 0.928
0.358TrpThr: 0.358 ± 0.19
1.073TrpVal: 1.073 ± 0.569
0.358TrpTrp: 0.358 ± 0.6
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.504TyrAla: 2.504 ± 0.867
0.358TyrCys: 0.358 ± 0.19
1.431TyrAsp: 1.431 ± 0.758
2.504TyrGlu: 2.504 ± 1.327
1.073TyrPhe: 1.073 ± 1.022
1.431TyrGly: 1.431 ± 0.482
1.073TyrHis: 1.073 ± 0.569
1.789TyrIle: 1.789 ± 0.527
1.431TyrLys: 1.431 ± 0.982
1.789TyrLeu: 1.789 ± 0.669
1.073TyrMet: 1.073 ± 0.601
1.073TyrAsn: 1.073 ± 0.569
0.358TyrPro: 0.358 ± 0.19
0.358TyrGln: 0.358 ± 0.19
1.789TyrArg: 1.789 ± 0.578
1.431TyrSer: 1.431 ± 0.607
0.716TyrThr: 0.716 ± 0.379
1.073TyrVal: 1.073 ± 0.569
0.358TyrTrp: 0.358 ± 0.19
1.073TyrTyr: 1.073 ± 0.601
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2796 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski