Amino acid dipepetide frequency for Chrysanthemum virus B (CVB)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.54AlaAla: 6.54 ± 1.576
1.377AlaCys: 1.377 ± 0.715
2.065AlaAsp: 2.065 ± 0.705
5.164AlaGlu: 5.164 ± 1.693
2.754AlaPhe: 2.754 ± 1.028
5.164AlaGly: 5.164 ± 1.984
2.065AlaHis: 2.065 ± 1.412
4.819AlaIle: 4.819 ± 2.08
5.508AlaLys: 5.508 ± 2.858
6.196AlaLeu: 6.196 ± 2.21
0.688AlaMet: 0.688 ± 0.518
2.754AlaAsn: 2.754 ± 1.501
0.688AlaPro: 0.688 ± 0.518
2.065AlaGln: 2.065 ± 0.52
4.475AlaArg: 4.475 ± 1.029
4.131AlaSer: 4.131 ± 0.536
1.721AlaThr: 1.721 ± 0.641
5.508AlaVal: 5.508 ± 1.401
0.688AlaTrp: 0.688 ± 0.518
2.754AlaTyr: 2.754 ± 1.223
0.0AlaXaa: 0.0 ± 0.0
Cys
1.721CysAla: 1.721 ± 0.641
0.344CysCys: 0.344 ± 0.179
1.033CysAsp: 1.033 ± 0.769
1.377CysGlu: 1.377 ± 0.715
2.065CysPhe: 2.065 ± 0.965
2.41CysGly: 2.41 ± 0.894
1.033CysHis: 1.033 ± 1.296
1.033CysIle: 1.033 ± 1.691
1.033CysLys: 1.033 ± 0.536
1.721CysLeu: 1.721 ± 0.658
0.344CysMet: 0.344 ± 0.179
1.377CysAsn: 1.377 ± 1.584
1.377CysPro: 1.377 ± 0.736
0.688CysGln: 0.688 ± 0.357
1.377CysArg: 1.377 ± 0.729
2.41CysSer: 2.41 ± 1.251
2.065CysThr: 2.065 ± 0.965
2.754CysVal: 2.754 ± 3.346
0.0CysTrp: 0.0 ± 0.0
1.377CysTyr: 1.377 ± 0.736
0.0CysXaa: 0.0 ± 0.0
Asp
1.721AspAla: 1.721 ± 1.583
0.688AspCys: 0.688 ± 0.357
1.721AspAsp: 1.721 ± 0.641
5.164AspGlu: 5.164 ± 1.459
2.41AspPhe: 2.41 ± 1.64
3.787AspGly: 3.787 ± 0.468
1.721AspHis: 1.721 ± 1.179
1.377AspIle: 1.377 ± 0.736
0.688AspLys: 0.688 ± 0.357
3.787AspLeu: 3.787 ± 1.574
1.033AspMet: 1.033 ± 0.502
2.065AspAsn: 2.065 ± 0.797
4.131AspPro: 4.131 ± 2.902
1.033AspGln: 1.033 ± 0.536
1.721AspArg: 1.721 ± 0.788
1.721AspSer: 1.721 ± 0.619
1.377AspThr: 1.377 ± 1.036
3.098AspVal: 3.098 ± 0.686
1.033AspTrp: 1.033 ± 0.502
2.065AspTyr: 2.065 ± 0.772
0.0AspXaa: 0.0 ± 0.0
Glu
7.229GluAla: 7.229 ± 2.015
1.033GluCys: 1.033 ± 0.536
2.41GluAsp: 2.41 ± 0.897
8.606GluGlu: 8.606 ± 3.203
1.721GluPhe: 1.721 ± 0.893
6.196GluGly: 6.196 ± 1.939
2.754GluHis: 2.754 ± 1.062
5.164GluIle: 5.164 ± 1.238
4.475GluLys: 4.475 ± 1.651
7.573GluLeu: 7.573 ± 2.082
1.377GluMet: 1.377 ± 1.036
3.787GluAsn: 3.787 ± 0.815
1.721GluPro: 1.721 ± 0.658
4.475GluGln: 4.475 ± 1.455
4.819GluArg: 4.819 ± 1.817
3.442GluSer: 3.442 ± 0.677
4.131GluThr: 4.131 ± 1.489
5.852GluVal: 5.852 ± 0.921
1.033GluTrp: 1.033 ± 0.502
2.065GluTyr: 2.065 ± 0.52
0.0GluXaa: 0.0 ± 0.0
Phe
1.721PheAla: 1.721 ± 1.49
0.0PheCys: 0.0 ± 0.0
2.754PheAsp: 2.754 ± 1.81
5.852PheGlu: 5.852 ± 2.325
2.065PhePhe: 2.065 ± 0.52
3.442PheGly: 3.442 ± 2.373
1.721PheHis: 1.721 ± 0.641
1.377PheIle: 1.377 ± 0.715
1.377PheLys: 1.377 ± 0.659
6.196PheLeu: 6.196 ± 1.216
1.377PheMet: 1.377 ± 0.66
3.442PheAsn: 3.442 ± 1.359
2.065PhePro: 2.065 ± 1.537
1.721PheGln: 1.721 ± 0.893
3.442PheArg: 3.442 ± 1.787
2.41PheSer: 2.41 ± 0.992
2.065PheThr: 2.065 ± 1.412
3.098PheVal: 3.098 ± 1.171
0.344PheTrp: 0.344 ± 0.179
2.065PheTyr: 2.065 ± 1.072
0.0PheXaa: 0.0 ± 0.0
Gly
4.131GlyAla: 4.131 ± 1.041
2.754GlyCys: 2.754 ± 1.592
3.442GlyAsp: 3.442 ± 0.811
6.196GlyGlu: 6.196 ± 2.062
3.098GlyPhe: 3.098 ± 1.411
5.852GlyGly: 5.852 ± 1.93
1.033GlyHis: 1.033 ± 0.578
2.41GlyIle: 2.41 ± 0.75
4.819GlyLys: 4.819 ± 1.96
6.54GlyLeu: 6.54 ± 1.706
0.688GlyMet: 0.688 ± 0.357
2.754GlyAsn: 2.754 ± 1.09
3.098GlyPro: 3.098 ± 0.844
2.41GlyGln: 2.41 ± 0.894
3.787GlyArg: 3.787 ± 1.596
4.475GlySer: 4.475 ± 1.127
4.131GlyThr: 4.131 ± 1.087
6.196GlyVal: 6.196 ± 1.284
0.688GlyTrp: 0.688 ± 0.357
2.065GlyTyr: 2.065 ± 0.797
0.0GlyXaa: 0.0 ± 0.0
His
1.721HisAla: 1.721 ± 0.893
0.344HisCys: 0.344 ± 0.905
0.344HisAsp: 0.344 ± 0.179
2.41HisGlu: 2.41 ± 1.251
2.065HisPhe: 2.065 ± 0.705
2.41HisGly: 2.41 ± 0.894
0.688HisHis: 0.688 ± 0.357
1.033HisIle: 1.033 ± 0.706
1.033HisLys: 1.033 ± 0.536
5.164HisLeu: 5.164 ± 2.117
0.688HisMet: 0.688 ± 1.112
1.033HisAsn: 1.033 ± 0.939
1.033HisPro: 1.033 ± 1.039
1.033HisGln: 1.033 ± 0.536
3.098HisArg: 3.098 ± 1.147
3.098HisSer: 3.098 ± 0.525
0.344HisThr: 0.344 ± 0.696
1.721HisVal: 1.721 ± 0.641
0.0HisTrp: 0.0 ± 0.0
0.344HisTyr: 0.344 ± 0.179
0.0HisXaa: 0.0 ± 0.0
Ile
1.721IleAla: 1.721 ± 0.893
2.065IleCys: 2.065 ± 0.705
0.344IleAsp: 0.344 ± 0.179
5.508IleGlu: 5.508 ± 1.966
1.377IlePhe: 1.377 ± 0.596
3.787IleGly: 3.787 ± 1.165
1.033IleHis: 1.033 ± 1.039
3.787IleIle: 3.787 ± 0.883
4.131IleLys: 4.131 ± 0.967
4.131IleLeu: 4.131 ± 2.945
2.065IleMet: 2.065 ± 1.049
2.065IleAsn: 2.065 ± 0.878
1.377IlePro: 1.377 ± 1.195
0.688IleGln: 0.688 ± 0.518
1.721IleArg: 1.721 ± 1.944
3.098IleSer: 3.098 ± 2.138
3.098IleThr: 3.098 ± 1.102
3.787IleVal: 3.787 ± 1.989
0.0IleTrp: 0.0 ± 0.0
2.754IleTyr: 2.754 ± 0.992
0.0IleXaa: 0.0 ± 0.0
Lys
4.131LysAla: 4.131 ± 0.967
1.721LysCys: 1.721 ± 1.49
2.754LysAsp: 2.754 ± 1.037
5.164LysGlu: 5.164 ± 1.172
3.098LysPhe: 3.098 ± 1.12
4.475LysGly: 4.475 ± 0.777
0.688LysHis: 0.688 ± 0.357
4.131LysIle: 4.131 ± 0.799
5.852LysLys: 5.852 ± 3.037
5.508LysLeu: 5.508 ± 2.362
0.688LysMet: 0.688 ± 1.17
2.754LysAsn: 2.754 ± 1.429
4.131LysPro: 4.131 ± 1.524
1.377LysGln: 1.377 ± 0.715
2.41LysArg: 2.41 ± 0.882
3.442LysSer: 3.442 ± 1.317
3.442LysThr: 3.442 ± 0.465
4.819LysVal: 4.819 ± 1.167
0.344LysTrp: 0.344 ± 0.179
1.377LysTyr: 1.377 ± 0.547
0.0LysXaa: 0.0 ± 0.0
Leu
6.196LeuAla: 6.196 ± 1.942
3.442LeuCys: 3.442 ± 0.864
3.787LeuAsp: 3.787 ± 1.305
7.573LeuGlu: 7.573 ± 1.966
4.475LeuPhe: 4.475 ± 1.126
6.196LeuGly: 6.196 ± 0.97
2.41LeuHis: 2.41 ± 1.354
3.787LeuIle: 3.787 ± 1.409
6.885LeuLys: 6.885 ± 2.388
9.639LeuLeu: 9.639 ± 2.44
1.721LeuMet: 1.721 ± 0.605
4.819LeuAsn: 4.819 ± 1.509
5.164LeuPro: 5.164 ± 1.714
2.065LeuGln: 2.065 ± 1.004
5.852LeuArg: 5.852 ± 1.097
7.229LeuSer: 7.229 ± 2.559
3.787LeuThr: 3.787 ± 1.041
5.164LeuVal: 5.164 ± 1.699
1.377LeuTrp: 1.377 ± 0.924
4.131LeuTyr: 4.131 ± 1.817
0.0LeuXaa: 0.0 ± 0.0
Met
2.754MetAla: 2.754 ± 1.095
0.344MetCys: 0.344 ± 0.179
1.377MetAsp: 1.377 ± 1.195
1.377MetGlu: 1.377 ± 0.547
0.0MetPhe: 0.0 ± 0.0
1.377MetGly: 1.377 ± 0.547
1.721MetHis: 1.721 ± 0.641
1.721MetIle: 1.721 ± 0.893
0.344MetLys: 0.344 ± 0.179
1.721MetLeu: 1.721 ± 0.833
1.033MetMet: 1.033 ± 0.536
0.344MetAsn: 0.344 ± 0.59
1.033MetPro: 1.033 ± 0.828
0.344MetGln: 0.344 ± 0.179
2.754MetArg: 2.754 ± 1.062
0.344MetSer: 0.344 ± 0.812
1.377MetThr: 1.377 ± 0.659
1.033MetVal: 1.033 ± 0.502
0.0MetTrp: 0.0 ± 0.0
0.344MetTyr: 0.344 ± 0.59
0.0MetXaa: 0.0 ± 0.0
Asn
2.065AsnAla: 2.065 ± 1.004
2.754AsnCys: 2.754 ± 1.425
1.033AsnAsp: 1.033 ± 0.502
3.787AsnGlu: 3.787 ± 1.965
3.098AsnPhe: 3.098 ± 0.686
4.131AsnGly: 4.131 ± 0.967
1.721AsnHis: 1.721 ± 0.658
1.721AsnIle: 1.721 ± 0.619
2.065AsnLys: 2.065 ± 1.242
3.787AsnLeu: 3.787 ± 0.468
1.377AsnMet: 1.377 ± 2.361
3.442AsnAsn: 3.442 ± 3.406
1.033AsnPro: 1.033 ± 0.706
0.344AsnGln: 0.344 ± 0.179
3.442AsnArg: 3.442 ± 1.26
2.41AsnSer: 2.41 ± 1.108
3.098AsnThr: 3.098 ± 1.503
3.442AsnVal: 3.442 ± 1.043
0.688AsnTrp: 0.688 ± 0.357
2.41AsnTyr: 2.41 ± 0.908
0.0AsnXaa: 0.0 ± 0.0
Pro
3.442ProAla: 3.442 ± 2.869
2.41ProCys: 2.41 ± 0.79
2.754ProAsp: 2.754 ± 1.502
2.065ProGlu: 2.065 ± 0.972
1.721ProPhe: 1.721 ± 0.893
3.098ProGly: 3.098 ± 1.693
1.377ProHis: 1.377 ± 1.195
1.721ProIle: 1.721 ± 1.074
2.41ProLys: 2.41 ± 1.54
2.754ProLeu: 2.754 ± 1.239
0.688ProMet: 0.688 ± 0.357
1.721ProAsn: 1.721 ± 1.004
5.852ProPro: 5.852 ± 5.463
2.065ProGln: 2.065 ± 0.642
2.41ProArg: 2.41 ± 1.349
1.721ProSer: 1.721 ± 1.004
3.442ProThr: 3.442 ± 2.009
2.41ProVal: 2.41 ± 1.599
1.377ProTrp: 1.377 ± 0.596
1.377ProTyr: 1.377 ± 0.715
0.0ProXaa: 0.0 ± 0.0
Gln
1.377GlnAla: 1.377 ± 0.715
0.344GlnCys: 0.344 ± 0.696
1.033GlnAsp: 1.033 ± 0.502
2.41GlnGlu: 2.41 ± 0.908
2.41GlnPhe: 2.41 ± 0.79
2.065GlnGly: 2.065 ± 1.115
0.688GlnHis: 0.688 ± 0.357
0.688GlnIle: 0.688 ± 0.357
1.033GlnLys: 1.033 ± 0.536
3.098GlnLeu: 3.098 ± 1.179
0.0GlnMet: 0.0 ± 0.0
1.033GlnAsn: 1.033 ± 0.706
2.754GlnPro: 2.754 ± 0.74
0.344GlnGln: 0.344 ± 0.59
1.721GlnArg: 1.721 ± 0.641
2.41GlnSer: 2.41 ± 1.251
1.721GlnThr: 1.721 ± 1.61
1.721GlnVal: 1.721 ± 1.193
0.0GlnTrp: 0.0 ± 0.0
0.344GlnTyr: 0.344 ± 0.179
0.0GlnXaa: 0.0 ± 0.0
Arg
5.852ArgAla: 5.852 ± 2.184
1.721ArgCys: 1.721 ± 3.093
2.41ArgAsp: 2.41 ± 0.882
4.475ArgGlu: 4.475 ± 1.144
5.508ArgPhe: 5.508 ± 2.007
2.41ArgGly: 2.41 ± 0.908
2.754ArgHis: 2.754 ± 1.606
1.721ArgIle: 1.721 ± 0.798
3.787ArgLys: 3.787 ± 0.96
5.164ArgLeu: 5.164 ± 2.177
1.721ArgMet: 1.721 ± 0.893
2.065ArgAsn: 2.065 ± 0.642
2.41ArgPro: 2.41 ± 1.035
0.344ArgGln: 0.344 ± 0.59
2.754ArgArg: 2.754 ± 3.396
5.164ArgSer: 5.164 ± 1.616
1.721ArgThr: 1.721 ± 2.488
2.065ArgVal: 2.065 ± 0.797
1.033ArgTrp: 1.033 ± 0.536
2.065ArgTyr: 2.065 ± 1.072
0.0ArgXaa: 0.0 ± 0.0
Ser
4.819SerAla: 4.819 ± 0.594
2.41SerCys: 2.41 ± 0.776
4.819SerAsp: 4.819 ± 1.291
3.442SerGlu: 3.442 ± 0.677
2.065SerPhe: 2.065 ± 1.072
4.475SerGly: 4.475 ± 1.14
1.377SerHis: 1.377 ± 1.036
3.787SerIle: 3.787 ± 1.721
6.54SerLys: 6.54 ± 2.732
5.508SerLeu: 5.508 ± 1.382
0.344SerMet: 0.344 ± 0.179
3.098SerAsn: 3.098 ± 1.179
1.033SerPro: 1.033 ± 0.536
1.033SerGln: 1.033 ± 0.502
4.475SerArg: 4.475 ± 1.432
4.819SerSer: 4.819 ± 2.036
5.164SerThr: 5.164 ± 1.858
4.475SerVal: 4.475 ± 1.729
0.344SerTrp: 0.344 ± 0.179
2.41SerTyr: 2.41 ± 2.265
0.0SerXaa: 0.0 ± 0.0
Thr
2.41ThrAla: 2.41 ± 0.471
1.033ThrCys: 1.033 ± 0.769
2.065ThrAsp: 2.065 ± 0.766
3.442ThrGlu: 3.442 ± 1.522
4.131ThrPhe: 4.131 ± 1.361
3.098ThrGly: 3.098 ± 0.741
1.721ThrHis: 1.721 ± 0.893
2.065ThrIle: 2.065 ± 0.766
2.41ThrLys: 2.41 ± 1.375
6.196ThrLeu: 6.196 ± 1.462
2.065ThrMet: 2.065 ± 0.673
2.754ThrAsn: 2.754 ± 1.501
2.41ThrPro: 2.41 ± 2.54
0.688ThrGln: 0.688 ± 0.518
2.754ThrArg: 2.754 ± 2.215
4.819ThrSer: 4.819 ± 0.942
1.377ThrThr: 1.377 ± 0.715
2.754ThrVal: 2.754 ± 1.318
0.0ThrTrp: 0.0 ± 0.0
1.721ThrTyr: 1.721 ± 0.663
0.0ThrXaa: 0.0 ± 0.0
Val
3.787ValAla: 3.787 ± 1.724
2.41ValCys: 2.41 ± 1.108
3.442ValAsp: 3.442 ± 0.583
3.442ValGlu: 3.442 ± 1.102
3.098ValPhe: 3.098 ± 1.034
4.475ValGly: 4.475 ± 1.385
2.41ValHis: 2.41 ± 0.471
4.475ValIle: 4.475 ± 3.168
3.787ValLys: 3.787 ± 1.883
5.852ValLeu: 5.852 ± 1.061
1.377ValMet: 1.377 ± 0.715
1.377ValAsn: 1.377 ± 0.736
3.098ValPro: 3.098 ± 2.813
3.098ValGln: 3.098 ± 0.801
2.754ValArg: 2.754 ± 1.022
5.508ValSer: 5.508 ± 1.117
3.098ValThr: 3.098 ± 0.829
2.41ValVal: 2.41 ± 0.471
0.688ValTrp: 0.688 ± 0.357
3.787ValTyr: 3.787 ± 1.667
0.0ValXaa: 0.0 ± 0.0
Trp
0.688TrpAla: 0.688 ± 0.357
0.344TrpCys: 0.344 ± 0.179
0.688TrpAsp: 0.688 ± 0.518
0.0TrpGlu: 0.0 ± 0.0
1.033TrpPhe: 1.033 ± 0.578
0.344TrpGly: 0.344 ± 0.938
0.344TrpHis: 0.344 ± 0.179
0.344TrpIle: 0.344 ± 0.179
0.344TrpLys: 0.344 ± 0.59
1.721TrpLeu: 1.721 ± 0.893
0.688TrpMet: 0.688 ± 0.357
1.377TrpAsn: 1.377 ± 0.547
0.344TrpPro: 0.344 ± 0.179
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.033TrpSer: 1.033 ± 0.502
0.344TrpThr: 0.344 ± 0.179
0.688TrpVal: 0.688 ± 0.357
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.098TyrAla: 3.098 ± 1.052
0.0TyrCys: 0.0 ± 0.0
2.065TyrAsp: 2.065 ± 0.705
2.41TyrGlu: 2.41 ± 1.354
0.344TyrPhe: 0.344 ± 0.179
1.377TyrGly: 1.377 ± 0.596
0.344TyrHis: 0.344 ± 0.179
1.377TyrIle: 1.377 ± 0.736
3.787TyrLys: 3.787 ± 0.96
3.787TyrLeu: 3.787 ± 1.574
1.033TyrMet: 1.033 ± 0.502
3.787TyrAsn: 3.787 ± 1.399
2.065TyrPro: 2.065 ± 0.797
1.377TyrGln: 1.377 ± 0.748
1.377TyrArg: 1.377 ± 1.195
2.754TyrSer: 2.754 ± 0.74
2.41TyrThr: 2.41 ± 2.279
1.377TyrVal: 1.377 ± 1.036
0.688TyrTrp: 0.688 ± 0.357
1.033TyrTyr: 1.033 ± 0.706
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2906 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski