Amino acid dipeptide frequency for Butterbur mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
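As a minimal sketch of how such frequencies could be computed (this is not the Proteome-pI pipeline; the function name `dipeptide_frequencies` and the toy sequences are hypothetical), overlapping residue pairs can be counted and normalized to per mille. One-letter codes are used for brevity, and counting pairs only within each protein is an assumption.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return overlapping dipeptide frequencies in per mille.

    Assumption: pairs are counted within each protein only, never
    across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # zip pairs each residue with its successor: "MKLA" -> MK, KL, LA
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences, not taken from the Butterbur mosaic virus proteome.
freqs = dipeptide_frequencies(["MKLLA", "LLS"])
```

Here `freqs` maps each observed pair (for example `"LL"`) to its share of all counted pairs; pairs that never occur are simply absent from the dictionary.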

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
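The uniform expectation and the red/blue marking can be illustrated with a short sketch. The rule below, which simply compares each value with the uniform expectation, is an assumption; the page does not state the exact criterion used for coloring. The function names are hypothetical, and the three sample values are taken from the table.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2

def mark(value, expected):
    """Hypothetical coloring rule: above expectation -> red, below -> blue."""
    if value > expected:
        return "red"
    if value < expected:
        return "blue"
    return "none"

expected = expected_per_mille()              # 2.5 per mille for 20 amino acids
expected_with_xaa = expected_per_mille(21)   # about 2.27 per mille with Xaa

# Values (in per mille) copied from the table for illustration.
sample = {"LeuLeu": 9.919, "AlaTrp": 0.354, "CysCys": 0.0}
marks = {pair: mark(value, expected) for pair, value in sample.items()}
```

Under this assumed rule, LeuLeu would be marked red and AlaTrp and CysCys blue.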

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions; for example, 4.605‰ corresponds to a fraction of 0.004605.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.605 ± 2.164
AlaCys: 1.063 ± 1.406
AlaAsp: 3.897 ± 1.547
AlaGlu: 3.897 ± 0.782
AlaPhe: 4.959 ± 0.901
AlaGly: 3.542 ± 0.77
AlaHis: 3.542 ± 0.746
AlaIle: 6.376 ± 2.185
AlaLys: 7.085 ± 1.463
AlaLeu: 7.085 ± 1.75
AlaMet: 2.125 ± 0.871
AlaAsn: 1.771 ± 0.619
AlaPro: 2.48 ± 1.225
AlaGln: 0.708 ± 0.606
AlaArg: 3.188 ± 1.629
AlaSer: 5.668 ± 1.426
AlaThr: 3.188 ± 1.44
AlaVal: 2.48 ± 1.088
AlaTrp: 0.354 ± 0.192
AlaTyr: 1.063 ± 0.576
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 2.125 ± 0.944
CysCys: 0.0 ± 0.0
CysAsp: 1.417 ± 1.758
CysGlu: 1.063 ± 0.576
CysPhe: 2.125 ± 0.782
CysGly: 1.417 ± 0.708
CysHis: 0.0 ± 0.0
CysIle: 1.771 ± 0.961
CysLys: 1.063 ± 0.576
CysLeu: 2.125 ± 1.606
CysMet: 0.354 ± 0.192
CysAsn: 0.0 ± 0.0
CysPro: 0.708 ± 0.699
CysGln: 0.354 ± 0.192
CysArg: 2.125 ± 0.732
CysSer: 2.48 ± 1.399
CysThr: 1.417 ± 0.612
CysVal: 2.834 ± 2.666
CysTrp: 0.0 ± 0.0
CysTyr: 1.771 ± 0.928
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.959 ± 1.873
AspCys: 0.708 ± 0.384
AspAsp: 2.834 ± 1.537
AspGlu: 4.251 ± 0.958
AspPhe: 1.063 ± 0.546
AspGly: 3.897 ± 0.876
AspHis: 1.771 ± 0.619
AspIle: 1.063 ± 0.576
AspLys: 1.063 ± 0.546
AspLeu: 6.73 ± 1.21
AspMet: 1.063 ± 0.459
AspAsn: 3.542 ± 1.406
AspPro: 3.542 ± 1.108
AspGln: 1.063 ± 0.606
AspArg: 2.125 ± 0.732
AspSer: 3.542 ± 1.351
AspThr: 2.125 ± 1.528
AspVal: 4.251 ± 2.516
AspTrp: 0.354 ± 0.192
AspTyr: 2.834 ± 1.416
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.605 ± 1.883
GluCys: 0.708 ± 0.384
GluAsp: 2.125 ± 0.782
GluGlu: 3.542 ± 0.77
GluPhe: 4.251 ± 1.083
GluGly: 4.605 ± 1.579
GluHis: 1.771 ± 1.137
GluIle: 1.417 ± 0.551
GluLys: 3.542 ± 1.404
GluLeu: 8.502 ± 3.36
GluMet: 1.063 ± 0.555
GluAsn: 1.771 ± 0.961
GluPro: 2.48 ± 0.988
GluGln: 2.125 ± 1.153
GluArg: 2.48 ± 0.988
GluSer: 5.313 ± 1.839
GluThr: 3.188 ± 1.638
GluVal: 4.605 ± 1.883
GluTrp: 0.354 ± 0.841
GluTyr: 1.417 ± 2.16
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.605 ± 1.35
PheCys: 2.125 ± 0.732
PheAsp: 3.542 ± 1.237
PheGlu: 5.313 ± 0.605
PhePhe: 0.354 ± 0.192
PheGly: 3.542 ± 3.349
PheHis: 0.708 ± 0.384
PheIle: 1.771 ± 1.251
PheLys: 2.48 ± 0.917
PheLeu: 6.022 ± 1.834
PheMet: 0.708 ± 0.384
PheAsn: 2.48 ± 1.08
PhePro: 1.063 ± 0.546
PheGln: 2.125 ± 1.153
PheArg: 2.48 ± 1.037
PheSer: 6.73 ± 1.62
PheThr: 3.897 ± 1.039
PheVal: 4.251 ± 0.93
PheTrp: 0.354 ± 0.713
PheTyr: 2.48 ± 1.303
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.834 ± 2.151
GlyCys: 1.063 ± 0.576
GlyAsp: 5.313 ± 1.991
GlyGlu: 2.834 ± 1.392
GlyPhe: 4.251 ± 1.725
GlyGly: 3.897 ± 3.252
GlyHis: 0.708 ± 0.384
GlyIle: 3.542 ± 1.222
GlyLys: 3.897 ± 1.719
GlyLeu: 7.793 ± 1.91
GlyMet: 1.417 ± 1.211
GlyAsn: 3.542 ± 1.442
GlyPro: 2.125 ± 1.153
GlyGln: 3.188 ± 0.993
GlyArg: 3.188 ± 1.129
GlySer: 5.313 ± 1.015
GlyThr: 4.605 ± 1.236
GlyVal: 4.959 ± 2.077
GlyTrp: 1.417 ± 0.551
GlyTyr: 1.771 ± 0.619
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.354 ± 0.192
HisCys: 1.063 ± 0.606
HisAsp: 0.354 ± 0.192
HisGlu: 1.417 ± 0.551
HisPhe: 1.063 ± 0.576
HisGly: 2.48 ± 0.979
HisHis: 1.771 ± 0.761
HisIle: 1.063 ± 1.218
HisLys: 2.125 ± 0.948
HisLeu: 3.542 ± 1.069
HisMet: 0.354 ± 0.192
HisAsn: 1.417 ± 0.844
HisPro: 0.708 ± 0.658
HisGln: 0.354 ± 0.757
HisArg: 1.417 ± 0.708
HisSer: 3.188 ± 1.729
HisThr: 2.834 ± 1.868
HisVal: 1.063 ± 0.704
HisTrp: 0.354 ± 0.192
HisTyr: 1.063 ± 0.576
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.897 ± 1.358
IleCys: 1.417 ± 1.105
IleAsp: 3.542 ± 1.442
IleGlu: 4.251 ± 1.759
IlePhe: 1.771 ± 0.676
IleGly: 2.834 ± 1.068
IleHis: 1.417 ± 1.053
IleIle: 2.48 ± 0.61
IleLys: 4.605 ± 1.37
IleLeu: 4.605 ± 0.976
IleMet: 1.771 ± 0.961
IleAsn: 2.125 ± 0.855
IlePro: 2.48 ± 0.873
IleGln: 1.063 ± 0.546
IleArg: 1.063 ± 1.65
IleSer: 4.251 ± 1.707
IleThr: 1.771 ± 1.008
IleVal: 2.125 ± 1.399
IleTrp: 0.0 ± 0.0
IleTyr: 2.125 ± 1.153
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.668 ± 1.334
LysCys: 1.417 ± 0.612
LysAsp: 1.771 ± 1.137
LysGlu: 4.251 ± 0.615
LysPhe: 3.188 ± 1.156
LysGly: 4.251 ± 1.433
LysHis: 1.771 ± 0.961
LysIle: 3.542 ± 0.85
LysLys: 2.125 ± 1.153
LysLeu: 9.564 ± 1.293
LysMet: 0.708 ± 0.699
LysAsn: 2.125 ± 1.092
LysPro: 1.771 ± 0.554
LysGln: 1.063 ± 0.546
LysArg: 3.897 ± 1.412
LysSer: 4.959 ± 1.231
LysThr: 5.313 ± 1.81
LysVal: 2.834 ± 0.965
LysTrp: 0.0 ± 0.0
LysTyr: 2.48 ± 1.328
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.73 ± 2.285
LeuCys: 2.834 ± 1.174
LeuAsp: 7.085 ± 1.679
LeuGlu: 6.022 ± 1.074
LeuPhe: 4.959 ± 2.067
LeuGly: 8.502 ± 3.328
LeuHis: 2.48 ± 1.689
LeuIle: 5.668 ± 2.161
LeuLys: 8.856 ± 2.512
LeuLeu: 9.919 ± 2.847
LeuMet: 2.834 ± 1.031
LeuAsn: 4.605 ± 0.886
LeuPro: 4.959 ± 1.951
LeuGln: 2.48 ± 2.788
LeuArg: 7.085 ± 1.463
LeuSer: 6.022 ± 2.099
LeuThr: 4.251 ± 2.621
LeuVal: 5.668 ± 1.501
LeuTrp: 0.354 ± 0.192
LeuTyr: 2.48 ± 0.917
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.063 ± 0.546
MetCys: 0.354 ± 0.192
MetAsp: 1.771 ± 0.554
MetGlu: 1.063 ± 0.576
MetPhe: 0.708 ± 0.606
MetGly: 0.708 ± 1.427
MetHis: 0.708 ± 0.384
MetIle: 1.063 ± 0.576
MetLys: 2.125 ± 0.617
MetLeu: 2.125 ± 1.092
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.771 ± 0.795
MetGln: 1.417 ± 1.329
MetArg: 1.063 ± 0.576
MetSer: 1.771 ± 0.748
MetThr: 1.771 ± 0.619
MetVal: 0.0 ± 0.0
MetTrp: 0.354 ± 0.192
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.48 ± 1.08
AsnCys: 1.771 ± 0.877
AsnAsp: 0.708 ± 0.606
AsnGlu: 2.125 ± 1.153
AsnPhe: 3.188 ± 0.599
AsnGly: 3.188 ± 1.156
AsnHis: 0.354 ± 0.192
AsnIle: 2.125 ± 1.151
AsnLys: 2.48 ± 1.987
AsnLeu: 4.959 ± 1.0
AsnMet: 0.708 ± 0.606
AsnAsn: 1.771 ± 1.137
AsnPro: 2.834 ± 1.174
AsnGln: 1.417 ± 0.551
AsnArg: 2.48 ± 1.08
AsnSer: 4.251 ± 1.546
AsnThr: 2.48 ± 0.866
AsnVal: 1.417 ± 0.68
AsnTrp: 0.354 ± 0.192
AsnTyr: 1.063 ± 0.576
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.771 ± 1.251
ProCys: 1.417 ± 0.768
ProAsp: 2.48 ± 1.603
ProGlu: 4.605 ± 1.7
ProPhe: 1.063 ± 0.576
ProGly: 4.605 ± 2.805
ProHis: 1.417 ± 1.503
ProIle: 2.125 ± 1.092
ProLys: 2.48 ± 1.104
ProLeu: 4.251 ± 1.781
ProMet: 0.354 ± 0.192
ProAsn: 2.48 ± 0.431
ProPro: 3.542 ± 3.703
ProGln: 0.708 ± 0.658
ProArg: 2.48 ± 0.988
ProSer: 2.48 ± 2.489
ProThr: 2.125 ± 0.732
ProVal: 3.897 ± 1.547
ProTrp: 0.708 ± 0.384
ProTyr: 2.125 ± 0.732
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.417 ± 0.68
GlnCys: 0.708 ± 0.384
GlnAsp: 1.771 ± 0.619
GlnGlu: 1.771 ± 0.676
GlnPhe: 1.063 ± 0.546
GlnGly: 1.771 ± 0.676
GlnHis: 1.063 ± 0.576
GlnIle: 1.417 ± 0.612
GlnLys: 1.771 ± 1.911
GlnLeu: 2.48 ± 1.345
GlnMet: 1.417 ± 1.694
GlnAsn: 0.354 ± 0.713
GlnPro: 1.771 ± 1.008
GlnGln: 0.354 ± 0.757
GlnArg: 1.417 ± 0.551
GlnSer: 4.251 ± 1.616
GlnThr: 0.354 ± 0.713
GlnVal: 1.417 ± 0.612
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.417 ± 0.68
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.313 ± 2.494
ArgCys: 2.125 ± 3.278
ArgAsp: 4.605 ± 0.983
ArgGlu: 1.417 ± 0.768
ArgPhe: 6.73 ± 1.723
ArgGly: 2.125 ± 0.915
ArgHis: 1.063 ± 0.851
ArgIle: 2.125 ± 0.458
ArgLys: 3.542 ± 1.921
ArgLeu: 3.897 ± 1.677
ArgMet: 1.063 ± 1.309
ArgAsn: 0.708 ± 0.606
ArgPro: 1.417 ± 1.211
ArgGln: 1.063 ± 0.546
ArgArg: 2.125 ± 0.906
ArgSer: 3.188 ± 0.82
ArgThr: 0.708 ± 0.384
ArgVal: 4.251 ± 1.496
ArgTrp: 0.708 ± 0.384
ArgTyr: 2.125 ± 0.729
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.959 ± 1.258
SerCys: 1.771 ± 0.676
SerAsp: 4.605 ± 1.702
SerGlu: 4.605 ± 1.042
SerPhe: 5.668 ± 1.43
SerGly: 4.251 ± 1.011
SerHis: 4.251 ± 1.897
SerIle: 3.897 ± 1.206
SerLys: 5.313 ± 2.33
SerLeu: 6.376 ± 1.202
SerMet: 2.125 ± 0.867
SerAsn: 5.313 ± 2.731
SerPro: 5.313 ± 2.054
SerGln: 3.897 ± 1.552
SerArg: 2.834 ± 0.919
SerSer: 8.502 ± 2.109
SerThr: 3.188 ± 1.384
SerVal: 4.251 ± 1.896
SerTrp: 0.354 ± 0.192
SerTyr: 2.125 ± 0.855
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.605 ± 1.368
ThrCys: 1.063 ± 1.896
ThrAsp: 1.063 ± 0.576
ThrGlu: 1.771 ± 0.676
ThrPhe: 4.605 ± 2.497
ThrGly: 6.022 ± 2.549
ThrHis: 2.125 ± 1.153
ThrIle: 1.063 ± 0.576
ThrLys: 4.605 ± 1.975
ThrLeu: 4.605 ± 1.384
ThrMet: 0.354 ± 0.192
ThrAsn: 1.417 ± 0.551
ThrPro: 2.48 ± 1.018
ThrGln: 1.063 ± 0.546
ThrArg: 3.542 ± 2.069
ThrSer: 3.897 ± 2.077
ThrThr: 4.959 ± 1.409
ThrVal: 2.125 ± 1.153
ThrTrp: 1.063 ± 0.546
ThrTyr: 2.834 ± 1.07
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.834 ± 1.23
ValCys: 2.125 ± 0.976
ValAsp: 1.417 ± 0.612
ValGlu: 3.897 ± 1.352
ValPhe: 3.188 ± 1.612
ValGly: 4.959 ± 1.958
ValHis: 0.708 ± 0.384
ValIle: 3.188 ± 1.233
ValLys: 2.125 ± 0.617
ValLeu: 4.251 ± 2.506
ValMet: 0.708 ± 0.635
ValAsn: 4.251 ± 1.653
ValPro: 2.834 ± 2.597
ValGln: 2.834 ± 0.991
ValArg: 2.834 ± 0.934
ValSer: 4.959 ± 2.434
ValThr: 4.251 ± 1.931
ValVal: 1.771 ± 1.693
ValTrp: 0.0 ± 0.0
ValTyr: 2.48 ± 0.917
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.417 ± 0.551
TrpCys: 0.354 ± 0.192
TrpAsp: 0.354 ± 0.192
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.354 ± 0.192
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.708 ± 0.384
TrpLys: 0.0 ± 0.0
TrpLeu: 0.708 ± 0.384
TrpMet: 0.0 ± 0.0
TrpAsn: 0.708 ± 0.606
TrpPro: 0.354 ± 0.192
TrpGln: 0.354 ± 0.713
TrpArg: 0.0 ± 0.0
TrpSer: 0.708 ± 0.384
TrpThr: 0.354 ± 0.192
TrpVal: 0.354 ± 0.192
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.708 ± 0.752
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.834 ± 1.23
TyrCys: 1.063 ± 1.027
TyrAsp: 2.125 ± 0.732
TyrGlu: 1.771 ± 0.961
TyrPhe: 2.48 ± 0.988
TyrGly: 1.063 ± 0.662
TyrHis: 0.354 ± 0.192
TyrIle: 3.188 ± 1.299
TyrLys: 1.417 ± 0.768
TyrLeu: 4.251 ± 1.341
TyrMet: 0.354 ± 0.192
TyrAsn: 1.771 ± 1.513
TyrPro: 2.48 ± 1.203
TyrGln: 0.354 ± 0.192
TyrArg: 2.48 ± 1.037
TyrSer: 2.125 ± 0.458
TyrThr: 2.48 ± 0.917
TyrVal: 1.417 ± 0.612
TyrTrp: 0.354 ± 0.192
TyrTyr: 0.708 ± 0.658
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6 proteins (2824 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
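A protein-level bootstrap of this kind could look like the sketch below. It is an illustration under stated assumptions, not the database's actual code: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the standard deviation across replicates is taken as the error. The function names, the seed, and the toy sequences (one-letter codes) are hypothetical.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            hits += (a + b == pair)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (assumed procedure).
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resampled, pair))
    return statistics.stdev(estimates)

# Toy sequences, not the real proteome.
toy = ["MKLLAALL", "MSSKLA", "MLLLLK", "MAKSA"]
mean = pair_per_mille(toy, "LL")
err = bootstrap_error(toy, "LL")
```

With only a handful of proteins, as in this proteome, the resampled sets differ strongly from one another, which is one plausible reason why some errors in the table are comparable to or larger than the values themselves.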

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski