Amino acid dipepetide frequency for African cassava mosaic Burkina Faso virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.665AlaAla: 3.665 ± 1.745
1.222AlaCys: 1.222 ± 0.605
0.611AlaAsp: 0.611 ± 0.48
1.833AlaGlu: 1.833 ± 0.754
0.611AlaPhe: 0.611 ± 0.507
3.054AlaGly: 3.054 ± 1.309
2.443AlaHis: 2.443 ± 1.283
3.054AlaIle: 3.054 ± 1.011
3.054AlaLys: 3.054 ± 1.247
4.887AlaLeu: 4.887 ± 2.027
1.833AlaMet: 1.833 ± 1.076
1.222AlaAsn: 1.222 ± 0.593
3.665AlaPro: 3.665 ± 1.392
2.443AlaGln: 2.443 ± 1.662
4.276AlaArg: 4.276 ± 1.688
4.887AlaSer: 4.887 ± 1.329
3.054AlaThr: 3.054 ± 2.236
0.0AlaVal: 0.0 ± 0.0
1.833AlaTrp: 1.833 ± 1.084
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.222CysGlu: 1.222 ± 0.605
0.611CysPhe: 0.611 ± 0.706
1.222CysGly: 1.222 ± 0.72
0.0CysHis: 0.0 ± 0.0
0.611CysIle: 0.611 ± 0.593
1.833CysLys: 1.833 ± 1.241
1.222CysLeu: 1.222 ± 0.872
1.222CysMet: 1.222 ± 0.803
1.833CysAsn: 1.833 ± 0.715
1.222CysPro: 1.222 ± 1.308
1.833CysGln: 1.833 ± 1.136
0.0CysArg: 0.0 ± 0.0
2.443CysSer: 2.443 ± 1.926
0.611CysThr: 0.611 ± 0.593
0.611CysVal: 0.611 ± 0.604
0.0CysTrp: 0.0 ± 0.0
1.222CysTyr: 1.222 ± 0.761
0.0CysXaa: 0.0 ± 0.0
Asp
2.443AspAla: 2.443 ± 1.919
0.611AspCys: 0.611 ± 0.604
3.665AspAsp: 3.665 ± 0.728
4.887AspGlu: 4.887 ± 1.301
1.833AspPhe: 1.833 ± 1.146
3.054AspGly: 3.054 ± 1.512
1.222AspHis: 1.222 ± 0.822
3.054AspIle: 3.054 ± 1.27
0.611AspLys: 0.611 ± 0.48
6.72AspLeu: 6.72 ± 1.844
0.611AspMet: 0.611 ± 0.507
1.833AspAsn: 1.833 ± 0.767
2.443AspPro: 2.443 ± 0.792
0.611AspGln: 0.611 ± 0.604
1.833AspArg: 1.833 ± 1.241
4.887AspSer: 4.887 ± 1.208
4.276AspThr: 4.276 ± 1.873
4.276AspVal: 4.276 ± 1.758
1.833AspTrp: 1.833 ± 1.031
2.443AspTyr: 2.443 ± 1.917
0.0AspXaa: 0.0 ± 0.0
Glu
4.276GluAla: 4.276 ± 1.052
0.611GluCys: 0.611 ± 0.706
1.833GluAsp: 1.833 ± 0.754
3.054GluGlu: 3.054 ± 1.171
3.665GluPhe: 3.665 ± 1.76
2.443GluGly: 2.443 ± 1.16
1.833GluHis: 1.833 ± 0.937
1.222GluIle: 1.222 ± 1.411
2.443GluLys: 2.443 ± 1.084
4.276GluLeu: 4.276 ± 1.825
0.0GluMet: 0.0 ± 0.0
3.665GluAsn: 3.665 ± 2.11
4.887GluPro: 4.887 ± 1.662
2.443GluGln: 2.443 ± 1.286
1.222GluArg: 1.222 ± 0.817
1.833GluSer: 1.833 ± 1.811
3.054GluThr: 3.054 ± 1.247
1.222GluVal: 1.222 ± 0.909
1.222GluTrp: 1.222 ± 0.72
1.222GluTyr: 1.222 ± 1.208
0.0GluXaa: 0.0 ± 0.0
Phe
0.611PheAla: 0.611 ± 0.604
0.611PheCys: 0.611 ± 0.593
2.443PheAsp: 2.443 ± 1.076
1.833PheGlu: 1.833 ± 0.641
0.611PhePhe: 0.611 ± 0.48
1.222PheGly: 1.222 ± 0.605
3.054PheHis: 3.054 ± 0.901
1.222PheIle: 1.222 ± 0.959
2.443PheLys: 2.443 ± 0.898
4.887PheLeu: 4.887 ± 2.717
1.222PheMet: 1.222 ± 0.959
4.887PheAsn: 4.887 ± 2.152
2.443PhePro: 2.443 ± 1.808
2.443PheGln: 2.443 ± 0.958
3.054PheArg: 3.054 ± 1.531
3.054PheSer: 3.054 ± 1.467
2.443PheThr: 2.443 ± 0.881
3.054PheVal: 3.054 ± 2.537
0.611PheTrp: 0.611 ± 0.604
1.833PheTyr: 1.833 ± 1.78
0.0PheXaa: 0.0 ± 0.0
Gly
1.222GlyAla: 1.222 ± 0.959
2.443GlyCys: 2.443 ± 0.754
5.498GlyAsp: 5.498 ± 1.665
3.665GlyGlu: 3.665 ± 1.649
0.611GlyPhe: 0.611 ± 0.66
3.054GlyGly: 3.054 ± 1.084
2.443GlyHis: 2.443 ± 0.821
3.054GlyIle: 3.054 ± 0.528
4.887GlyLys: 4.887 ± 1.808
3.665GlyLeu: 3.665 ± 1.486
1.222GlyMet: 1.222 ± 0.618
2.443GlyAsn: 2.443 ± 1.267
5.498GlyPro: 5.498 ± 1.653
1.222GlyGln: 1.222 ± 0.877
1.833GlyArg: 1.833 ± 0.715
4.887GlySer: 4.887 ± 0.92
1.222GlyThr: 1.222 ± 0.806
3.665GlyVal: 3.665 ± 2.004
0.0GlyTrp: 0.0 ± 0.0
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
2.443HisAla: 2.443 ± 1.264
1.222HisCys: 1.222 ± 0.839
2.443HisAsp: 2.443 ± 1.316
0.611HisGlu: 0.611 ± 0.706
1.833HisPhe: 1.833 ± 1.11
1.833HisGly: 1.833 ± 0.886
2.443HisHis: 2.443 ± 1.52
1.222HisIle: 1.222 ± 1.008
2.443HisLys: 2.443 ± 1.323
1.833HisLeu: 1.833 ± 1.084
0.0HisMet: 0.0 ± 0.0
3.665HisAsn: 3.665 ± 1.51
1.222HisPro: 1.222 ± 0.959
1.222HisGln: 1.222 ± 0.811
3.054HisArg: 3.054 ± 1.375
3.054HisSer: 3.054 ± 1.632
2.443HisThr: 2.443 ± 1.045
4.276HisVal: 4.276 ± 1.263
0.0HisTrp: 0.0 ± 0.0
1.222HisTyr: 1.222 ± 0.588
0.0HisXaa: 0.0 ± 0.0
Ile
0.611IleAla: 0.611 ± 0.604
1.222IleCys: 1.222 ± 0.73
3.054IleAsp: 3.054 ± 1.207
4.276IleGlu: 4.276 ± 1.888
3.054IlePhe: 3.054 ± 1.959
3.054IleGly: 3.054 ± 1.27
1.222IleHis: 1.222 ± 0.809
3.054IleIle: 3.054 ± 1.011
6.72IleLys: 6.72 ± 1.125
1.833IleLeu: 1.833 ± 0.726
1.222IleMet: 1.222 ± 0.817
3.054IleAsn: 3.054 ± 1.789
1.833IlePro: 1.833 ± 0.923
6.109IleGln: 6.109 ± 1.701
7.33IleArg: 7.33 ± 1.587
5.498IleSer: 5.498 ± 1.511
4.276IleThr: 4.276 ± 2.319
1.222IleVal: 1.222 ± 0.588
1.833IleTrp: 1.833 ± 0.756
1.222IleTyr: 1.222 ± 1.186
0.0IleXaa: 0.0 ± 0.0
Lys
3.054LysAla: 3.054 ± 1.265
2.443LysCys: 2.443 ± 1.119
3.054LysAsp: 3.054 ± 1.645
4.887LysGlu: 4.887 ± 2.298
2.443LysPhe: 2.443 ± 0.744
1.222LysGly: 1.222 ± 0.72
1.833LysHis: 1.833 ± 0.632
3.054LysIle: 3.054 ± 1.087
1.833LysLys: 1.833 ± 0.632
3.665LysLeu: 3.665 ± 2.317
0.0LysMet: 0.0 ± 0.0
3.665LysAsn: 3.665 ± 1.138
3.054LysPro: 3.054 ± 0.982
4.887LysGln: 4.887 ± 1.61
3.665LysArg: 3.665 ± 2.172
3.054LysSer: 3.054 ± 1.002
2.443LysThr: 2.443 ± 1.114
4.887LysVal: 4.887 ± 1.487
0.0LysTrp: 0.0 ± 0.0
4.887LysTyr: 4.887 ± 1.098
0.0LysXaa: 0.0 ± 0.0
Leu
0.611LeuAla: 0.611 ± 0.654
1.833LeuCys: 1.833 ± 0.952
4.276LeuAsp: 4.276 ± 1.295
3.054LeuGlu: 3.054 ± 1.339
3.054LeuPhe: 3.054 ± 1.803
4.276LeuGly: 4.276 ± 1.629
3.665LeuHis: 3.665 ± 1.108
3.665LeuIle: 3.665 ± 1.274
4.887LeuLys: 4.887 ± 1.61
3.054LeuLeu: 3.054 ± 1.536
1.833LeuMet: 1.833 ± 1.021
3.665LeuAsn: 3.665 ± 1.636
2.443LeuPro: 2.443 ± 1.092
3.665LeuGln: 3.665 ± 1.411
6.72LeuArg: 6.72 ± 2.938
4.887LeuSer: 4.887 ± 1.895
5.498LeuThr: 5.498 ± 1.642
3.665LeuVal: 3.665 ± 1.014
0.0LeuTrp: 0.0 ± 0.0
2.443LeuTyr: 2.443 ± 0.675
0.0LeuXaa: 0.0 ± 0.0
Met
2.443MetAla: 2.443 ± 1.509
0.611MetCys: 0.611 ± 0.744
2.443MetAsp: 2.443 ± 0.739
0.611MetGlu: 0.611 ± 0.507
1.833MetPhe: 1.833 ± 1.38
1.833MetGly: 1.833 ± 0.801
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.222MetLys: 1.222 ± 0.684
0.611MetLeu: 0.611 ± 0.654
0.611MetMet: 0.611 ± 0.593
1.833MetAsn: 1.833 ± 1.4
0.0MetPro: 0.0 ± 0.0
0.611MetGln: 0.611 ± 0.66
0.0MetArg: 0.0 ± 0.0
3.665MetSer: 3.665 ± 0.975
0.0MetThr: 0.0 ± 0.0
0.611MetVal: 0.611 ± 0.507
1.222MetTrp: 1.222 ± 0.774
2.443MetTyr: 2.443 ± 1.642
0.0MetXaa: 0.0 ± 0.0
Asn
4.276AsnAla: 4.276 ± 1.209
1.222AsnCys: 1.222 ± 0.72
3.054AsnAsp: 3.054 ± 0.923
2.443AsnGlu: 2.443 ± 0.976
1.222AsnPhe: 1.222 ± 0.605
3.054AsnGly: 3.054 ± 1.464
4.276AsnHis: 4.276 ± 2.726
3.054AsnIle: 3.054 ± 0.919
2.443AsnLys: 2.443 ± 0.762
4.276AsnLeu: 4.276 ± 1.513
0.611AsnMet: 0.611 ± 1.131
3.054AsnAsn: 3.054 ± 0.92
2.443AsnPro: 2.443 ± 0.744
3.665AsnGln: 3.665 ± 1.111
3.665AsnArg: 3.665 ± 2.117
2.443AsnSer: 2.443 ± 1.13
3.054AsnThr: 3.054 ± 1.073
6.109AsnVal: 6.109 ± 1.522
0.0AsnTrp: 0.0 ± 0.0
2.443AsnTyr: 2.443 ± 1.054
0.0AsnXaa: 0.0 ± 0.0
Pro
1.222ProAla: 1.222 ± 0.593
1.222ProCys: 1.222 ± 0.91
2.443ProAsp: 2.443 ± 0.976
2.443ProGlu: 2.443 ± 0.882
2.443ProPhe: 2.443 ± 0.812
4.887ProGly: 4.887 ± 1.598
2.443ProHis: 2.443 ± 1.525
5.498ProIle: 5.498 ± 2.198
3.665ProLys: 3.665 ± 1.763
3.665ProLeu: 3.665 ± 1.115
3.054ProMet: 3.054 ± 1.658
0.611ProAsn: 0.611 ± 0.48
1.222ProPro: 1.222 ± 0.959
2.443ProGln: 2.443 ± 1.621
1.833ProArg: 1.833 ± 0.975
5.498ProSer: 5.498 ± 1.775
6.109ProThr: 6.109 ± 1.849
3.054ProVal: 3.054 ± 1.375
1.222ProTrp: 1.222 ± 0.588
2.443ProTyr: 2.443 ± 1.084
0.0ProXaa: 0.0 ± 0.0
Gln
4.887GlnAla: 4.887 ± 1.352
0.0GlnCys: 0.0 ± 0.0
3.665GlnAsp: 3.665 ± 0.902
1.222GlnGlu: 1.222 ± 0.91
3.054GlnPhe: 3.054 ± 1.149
1.833GlnGly: 1.833 ± 0.754
1.833GlnHis: 1.833 ± 1.27
3.665GlnIle: 3.665 ± 0.985
0.611GlnLys: 0.611 ± 0.654
1.222GlnLeu: 1.222 ± 0.822
0.611GlnMet: 0.611 ± 0.66
3.665GlnAsn: 3.665 ± 0.728
3.054GlnPro: 3.054 ± 1.354
3.665GlnGln: 3.665 ± 1.3
3.665GlnArg: 3.665 ± 1.163
6.109GlnSer: 6.109 ± 1.811
2.443GlnThr: 2.443 ± 1.007
3.054GlnVal: 3.054 ± 1.228
0.0GlnTrp: 0.0 ± 0.0
1.222GlnTyr: 1.222 ± 0.588
0.0GlnXaa: 0.0 ± 0.0
Arg
3.665ArgAla: 3.665 ± 1.926
0.611ArgCys: 0.611 ± 0.654
4.276ArgAsp: 4.276 ± 1.523
0.611ArgGlu: 0.611 ± 0.48
4.887ArgPhe: 4.887 ± 1.217
4.276ArgGly: 4.276 ± 1.1
0.611ArgHis: 0.611 ± 0.654
3.665ArgIle: 3.665 ± 1.015
4.887ArgLys: 4.887 ± 1.59
4.276ArgLeu: 4.276 ± 1.733
0.611ArgMet: 0.611 ± 0.593
3.665ArgAsn: 3.665 ± 0.905
7.33ArgPro: 7.33 ± 1.402
1.222ArgGln: 1.222 ± 0.839
7.33ArgArg: 7.33 ± 2.568
4.276ArgSer: 4.276 ± 1.127
3.665ArgThr: 3.665 ± 1.698
3.665ArgVal: 3.665 ± 1.414
0.0ArgTrp: 0.0 ± 0.0
3.665ArgTyr: 3.665 ± 1.679
0.0ArgXaa: 0.0 ± 0.0
Ser
5.498SerAla: 5.498 ± 2.045
0.611SerCys: 0.611 ± 0.507
3.665SerAsp: 3.665 ± 0.963
1.833SerGlu: 1.833 ± 0.715
3.054SerPhe: 3.054 ± 0.992
2.443SerGly: 2.443 ± 0.897
1.833SerHis: 1.833 ± 0.696
5.498SerIle: 5.498 ± 1.37
6.109SerLys: 6.109 ± 1.664
2.443SerLeu: 2.443 ± 0.991
2.443SerMet: 2.443 ± 1.382
6.109SerAsn: 6.109 ± 1.811
5.498SerPro: 5.498 ± 1.691
4.887SerGln: 4.887 ± 1.876
3.665SerArg: 3.665 ± 1.639
12.217SerSer: 12.217 ± 3.345
7.33SerThr: 7.33 ± 2.665
6.72SerVal: 6.72 ± 3.126
0.0SerTrp: 0.0 ± 0.0
5.498SerTyr: 5.498 ± 2.393
0.0SerXaa: 0.0 ± 0.0
Thr
3.054ThrAla: 3.054 ± 1.579
0.0ThrCys: 0.0 ± 0.0
1.222ThrAsp: 1.222 ± 1.208
3.665ThrGlu: 3.665 ± 1.46
3.054ThrPhe: 3.054 ± 1.486
4.276ThrGly: 4.276 ± 1.873
4.276ThrHis: 4.276 ± 1.543
5.498ThrIle: 5.498 ± 1.458
3.054ThrLys: 3.054 ± 1.167
5.498ThrLeu: 5.498 ± 1.485
0.611ThrMet: 0.611 ± 0.48
3.054ThrAsn: 3.054 ± 0.916
3.665ThrPro: 3.665 ± 1.829
0.0ThrGln: 0.0 ± 0.0
4.887ThrArg: 4.887 ± 1.177
5.498ThrSer: 5.498 ± 2.207
3.054ThrThr: 3.054 ± 1.9
3.054ThrVal: 3.054 ± 1.27
1.833ThrTrp: 1.833 ± 1.094
2.443ThrTyr: 2.443 ± 0.85
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.611ValCys: 0.611 ± 0.66
3.665ValAsp: 3.665 ± 1.548
1.833ValGlu: 1.833 ± 0.917
3.054ValPhe: 3.054 ± 1.668
3.054ValGly: 3.054 ± 1.131
1.833ValHis: 1.833 ± 0.858
7.941ValIle: 7.941 ± 1.653
3.054ValLys: 3.054 ± 0.916
4.276ValLeu: 4.276 ± 1.574
1.833ValMet: 1.833 ± 0.931
3.054ValAsn: 3.054 ± 1.667
3.054ValPro: 3.054 ± 1.613
4.276ValGln: 4.276 ± 1.474
2.443ValArg: 2.443 ± 0.739
6.109ValSer: 6.109 ± 2.875
3.054ValThr: 3.054 ± 1.536
1.222ValVal: 1.222 ± 0.817
0.0ValTrp: 0.0 ± 0.0
3.054ValTyr: 3.054 ± 1.556
0.0ValXaa: 0.0 ± 0.0
Trp
1.833TrpAla: 1.833 ± 0.887
0.0TrpCys: 0.0 ± 0.0
0.611TrpAsp: 0.611 ± 0.654
0.611TrpGlu: 0.611 ± 0.706
0.0TrpPhe: 0.0 ± 0.0
0.611TrpGly: 0.611 ± 0.48
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
1.222TrpMet: 1.222 ± 1.186
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.222TrpGln: 1.222 ± 0.588
1.222TrpArg: 1.222 ± 0.72
1.222TrpSer: 1.222 ± 0.806
1.222TrpThr: 1.222 ± 0.809
0.611TrpVal: 0.611 ± 0.48
0.0TrpTrp: 0.0 ± 0.0
1.222TrpTyr: 1.222 ± 0.827
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.833TyrAla: 1.833 ± 0.726
0.611TyrCys: 0.611 ± 0.654
1.222TyrAsp: 1.222 ± 0.684
1.833TyrGlu: 1.833 ± 0.858
3.054TyrPhe: 3.054 ± 0.93
1.833TyrGly: 1.833 ± 0.632
1.222TyrHis: 1.222 ± 0.774
3.665TyrIle: 3.665 ± 1.138
1.833TyrLys: 1.833 ± 1.091
4.887TyrLeu: 4.887 ± 1.458
1.222TyrMet: 1.222 ± 0.792
2.443TyrAsn: 2.443 ± 0.958
2.443TyrPro: 2.443 ± 1.213
0.611TyrGln: 0.611 ± 0.706
5.498TyrArg: 5.498 ± 2.56
1.833TyrSer: 1.833 ± 0.952
2.443TyrThr: 2.443 ± 1.263
2.443TyrVal: 2.443 ± 1.557
0.0TyrTrp: 0.0 ± 0.0
0.611TyrTyr: 0.611 ± 0.66
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1638 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski