Amino acid dipeptide frequency for Brome mosaic virus (BMV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
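As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. It is only a minimal sketch under stated assumptions: the function name `dipeptide_per_mille` is hypothetical, one-letter residue codes are used instead of the three-letter codes shown on this page, and counting overlapping pairs that never span two proteins is an assumption, not necessarily how Proteome-pI computes its values.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: overlapping pairs are counted within each protein,
    and pairs never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input for illustration only; these are not BMV sequences.
freqs = dipeptide_per_mille(["MAAK", "AAL"])
```

In the toy input there are five dipeptides in total (MA, AA, AK, AA, AL), so "AA" accounts for two of the five.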

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
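The expected value and the over/under comparison can be sketched as follows. The function names are hypothetical, and the exact threshold used for the red and blue marking (1/400 versus 1/441, or any tolerance around the expected value) is not stated on this page, so a plain comparison against 1/400 is assumed here.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a plain comparison with no tolerance band.
    """
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"

# Values taken from the table: AlaAla = 7.916, AlaHis = 0.44.
ala_ala = classify(7.916)
ala_his = classify(0.44)
```

With 20 amino acids the expectation is 2.5‰, so AlaAla (7.916‰) falls above it and AlaHis (0.44‰) below it.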

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
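The unit conversion is simple arithmetic; a small sketch with hypothetical helper names is given here only to make the scaling explicit.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

# AlaAla from the table: 7.916 per mille.
ala_ala_fraction = per_mille_to_fraction(7.916)
ala_ala_percent = per_mille_to_percent(7.916)
```

For example, 7.916‰ corresponds to a fraction of 0.007916, or 0.7916%.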

For more information, see the sequence space article on Wikipedia.

Residue order (first residue per block, second residue within each block): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.916 ± 3.126
AlaCys: 1.319 ± 0.64
AlaAsp: 5.717 ± 1.363
AlaGlu: 5.277 ± 1.453
AlaPhe: 4.398 ± 1.373
AlaGly: 5.277 ± 2.9
AlaHis: 0.44 ± 0.361
AlaIle: 5.717 ± 1.735
AlaLys: 6.596 ± 1.082
AlaLeu: 8.795 ± 0.89
AlaMet: 2.639 ± 1.314
AlaAsn: 2.199 ± 0.658
AlaPro: 1.759 ± 0.557
AlaGln: 3.958 ± 1.471
AlaArg: 2.639 ± 1.443
AlaSer: 5.277 ± 1.462
AlaThr: 4.398 ± 2.38
AlaVal: 3.518 ± 2.396
AlaTrp: 0.44 ± 0.566
AlaTyr: 1.319 ± 0.677
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.88 ± 0.278
CysCys: 0.44 ± 0.29
CysAsp: 3.518 ± 0.626
CysGlu: 0.88 ± 0.721
CysPhe: 2.199 ± 0.927
CysGly: 0.88 ± 0.581
CysHis: 2.199 ± 0.677
CysIle: 0.88 ± 0.581
CysLys: 1.319 ± 0.385
CysLeu: 2.199 ± 0.677
CysMet: 0.0 ± 0.0
CysAsn: 0.44 ± 0.29
CysPro: 1.759 ± 0.952
CysGln: 0.0 ± 0.0
CysArg: 0.88 ± 0.278
CysSer: 1.759 ± 0.557
CysThr: 0.44 ± 0.361
CysVal: 1.759 ± 0.938
CysTrp: 0.0 ± 0.0
CysTyr: 0.88 ± 0.509
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.277 ± 1.343
AspCys: 3.518 ± 1.211
AspAsp: 4.398 ± 1.525
AspGlu: 4.837 ± 1.298
AspPhe: 3.518 ± 0.666
AspGly: 4.837 ± 1.477
AspHis: 0.88 ± 0.509
AspIle: 3.958 ± 1.321
AspLys: 3.518 ± 0.694
AspLeu: 7.036 ± 0.334
AspMet: 1.319 ± 0.44
AspAsn: 1.759 ± 0.454
AspPro: 3.958 ± 1.725
AspGln: 0.88 ± 0.509
AspArg: 4.837 ± 1.298
AspSer: 7.036 ± 1.467
AspThr: 4.398 ± 2.063
AspVal: 5.277 ± 1.237
AspTrp: 2.639 ± 0.835
AspTyr: 2.199 ± 0.829
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.717 ± 1.797
GluCys: 0.44 ± 0.566
GluAsp: 5.717 ± 1.245
GluGlu: 5.277 ± 1.237
GluPhe: 2.199 ± 0.677
GluGly: 0.88 ± 0.278
GluHis: 2.199 ± 0.687
GluIle: 4.398 ± 1.405
GluLys: 3.958 ± 1.437
GluLeu: 3.518 ± 1.572
GluMet: 1.759 ± 0.919
GluAsn: 1.759 ± 0.557
GluPro: 1.319 ± 0.929
GluGln: 1.319 ± 0.44
GluArg: 3.078 ± 1.094
GluSer: 5.717 ± 2.634
GluThr: 2.639 ± 0.835
GluVal: 5.717 ± 0.836
GluTrp: 0.88 ± 0.581
GluTyr: 1.759 ± 0.692
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.88 ± 0.278
PheCys: 0.88 ± 0.278
PheAsp: 5.717 ± 0.517
PheGlu: 3.078 ± 1.123
PhePhe: 1.319 ± 0.64
PheGly: 1.759 ± 0.454
PheHis: 1.759 ± 0.919
PheIle: 0.88 ± 0.721
PheLys: 4.398 ± 1.068
PheLeu: 3.078 ± 0.873
PheMet: 0.44 ± 0.566
PheAsn: 1.319 ± 0.44
PhePro: 2.639 ± 0.88
PheGln: 3.078 ± 1.571
PheArg: 2.199 ± 0.687
PheSer: 3.078 ± 1.119
PheThr: 2.639 ± 1.251
PheVal: 2.199 ± 1.272
PheTrp: 0.88 ± 0.581
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.078 ± 1.628
GlyCys: 0.88 ± 0.589
GlyAsp: 5.277 ± 1.545
GlyGlu: 2.639 ± 0.471
GlyPhe: 2.199 ± 0.658
GlyGly: 3.958 ± 1.093
GlyHis: 0.44 ± 0.29
GlyIle: 3.078 ± 0.766
GlyLys: 3.518 ± 1.637
GlyLeu: 3.078 ± 0.868
GlyMet: 0.88 ± 0.278
GlyAsn: 2.199 ± 0.677
GlyPro: 1.319 ± 0.385
GlyGln: 1.319 ± 0.793
GlyArg: 3.518 ± 2.042
GlySer: 4.837 ± 1.063
GlyThr: 2.639 ± 1.107
GlyVal: 6.157 ± 1.118
GlyTrp: 1.319 ± 0.871
GlyTyr: 2.199 ± 0.686
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.639 ± 0.563
HisCys: 1.759 ± 0.557
HisAsp: 0.44 ± 0.29
HisGlu: 2.639 ± 0.869
HisPhe: 1.759 ± 0.557
HisGly: 3.958 ± 2.145
HisHis: 0.88 ± 0.581
HisIle: 0.44 ± 0.29
HisLys: 1.319 ± 0.44
HisLeu: 2.639 ± 0.568
HisMet: 0.88 ± 0.721
HisAsn: 0.88 ± 0.278
HisPro: 0.44 ± 0.361
HisGln: 0.44 ± 0.361
HisArg: 1.759 ± 0.557
HisSer: 2.199 ± 0.677
HisThr: 0.0 ± 0.0
HisVal: 0.88 ± 1.026
HisTrp: 0.44 ± 0.29
HisTyr: 0.88 ± 0.581
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.717 ± 1.179
IleCys: 0.88 ± 0.278
IleAsp: 5.717 ± 0.923
IleGlu: 2.639 ± 0.471
IlePhe: 1.759 ± 0.557
IleGly: 2.639 ± 1.617
IleHis: 0.88 ± 0.581
IleIle: 1.759 ± 0.557
IleLys: 3.078 ± 1.372
IleLeu: 2.199 ± 0.829
IleMet: 2.199 ± 0.658
IleAsn: 1.759 ± 0.557
IlePro: 1.759 ± 0.557
IleGln: 0.88 ± 0.509
IleArg: 1.319 ± 0.677
IleSer: 5.277 ± 0.963
IleThr: 3.078 ± 1.465
IleVal: 3.078 ± 1.106
IleTrp: 0.0 ± 0.0
IleTyr: 1.319 ± 0.945
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.277 ± 4.045
LysCys: 3.078 ± 1.532
LysAsp: 2.199 ± 0.658
LysGlu: 4.398 ± 1.121
LysPhe: 3.078 ± 1.49
LysGly: 1.759 ± 0.454
LysHis: 1.319 ± 0.44
LysIle: 2.199 ± 0.677
LysLys: 4.398 ± 0.847
LysLeu: 4.837 ± 1.063
LysMet: 1.759 ± 0.783
LysAsn: 1.759 ± 0.966
LysPro: 2.639 ± 1.056
LysGln: 1.319 ± 0.929
LysArg: 4.837 ± 1.477
LysSer: 6.157 ± 1.582
LysThr: 3.958 ± 1.655
LysVal: 4.837 ± 1.038
LysTrp: 0.88 ± 0.795
LysTyr: 3.078 ± 0.693
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.036 ± 2.235
LeuCys: 1.759 ± 0.459
LeuAsp: 3.958 ± 0.75
LeuGlu: 5.717 ± 2.475
LeuPhe: 2.639 ± 0.911
LeuGly: 4.398 ± 1.808
LeuHis: 3.078 ± 0.519
LeuIle: 3.078 ± 0.791
LeuLys: 7.476 ± 1.602
LeuLeu: 7.036 ± 2.3
LeuMet: 0.0 ± 0.0
LeuAsn: 3.078 ± 0.685
LeuPro: 3.518 ± 2.042
LeuGln: 3.958 ± 0.242
LeuArg: 7.916 ± 1.652
LeuSer: 6.157 ± 0.88
LeuThr: 5.277 ± 1.237
LeuVal: 5.717 ± 1.039
LeuTrp: 0.88 ± 0.739
LeuTyr: 2.199 ± 0.587
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.759 ± 0.557
MetCys: 0.44 ± 0.29
MetAsp: 1.759 ± 1.442
MetGlu: 1.759 ± 0.459
MetPhe: 0.88 ± 0.278
MetGly: 2.199 ± 0.829
MetHis: 0.88 ± 0.581
MetIle: 0.88 ± 0.721
MetLys: 1.759 ± 0.557
MetLeu: 2.199 ± 0.829
MetMet: 0.0 ± 0.0
MetAsn: 1.319 ± 0.677
MetPro: 0.44 ± 0.566
MetGln: 0.0 ± 0.0
MetArg: 0.44 ± 0.29
MetSer: 3.078 ± 1.465
MetThr: 1.759 ± 0.597
MetVal: 1.759 ± 0.454
MetTrp: 0.0 ± 0.0
MetTyr: 0.44 ± 0.794
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.88 ± 0.795
AsnCys: 0.88 ± 0.278
AsnAsp: 0.44 ± 0.361
AsnGlu: 1.319 ± 0.736
AsnPhe: 1.759 ± 0.459
AsnGly: 1.759 ± 0.692
AsnHis: 0.0 ± 0.0
AsnIle: 1.319 ± 0.385
AsnLys: 1.319 ± 0.793
AsnLeu: 3.958 ± 1.618
AsnMet: 1.319 ± 0.575
AsnAsn: 1.319 ± 0.385
AsnPro: 1.319 ± 1.082
AsnGln: 1.759 ± 0.927
AsnArg: 3.518 ± 1.071
AsnSer: 0.88 ± 0.581
AsnThr: 0.88 ± 0.721
AsnVal: 3.518 ± 0.626
AsnTrp: 1.319 ± 0.575
AsnTyr: 1.759 ± 2.264
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.639 ± 1.066
ProCys: 0.0 ± 0.0
ProAsp: 2.639 ± 1.15
ProGlu: 2.639 ± 0.88
ProPhe: 1.759 ± 1.018
ProGly: 0.88 ± 0.721
ProHis: 1.319 ± 0.64
ProIle: 1.759 ± 0.919
ProLys: 2.639 ± 0.563
ProLeu: 3.518 ± 0.8
ProMet: 0.44 ± 0.566
ProAsn: 1.319 ± 0.736
ProPro: 1.319 ± 0.44
ProGln: 1.319 ± 0.44
ProArg: 1.759 ± 1.177
ProSer: 3.518 ± 1.486
ProThr: 2.199 ± 0.686
ProVal: 4.837 ± 0.95
ProTrp: 0.44 ± 0.361
ProTyr: 0.44 ± 0.361
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.958 ± 1.422
GlnCys: 0.0 ± 0.0
GlnAsp: 2.199 ± 1.452
GlnGlu: 2.199 ± 0.927
GlnPhe: 0.44 ± 0.361
GlnGly: 2.199 ± 0.587
GlnHis: 0.44 ± 0.361
GlnIle: 1.759 ± 0.597
GlnLys: 0.88 ± 0.278
GlnLeu: 2.639 ± 1.468
GlnMet: 1.319 ± 1.082
GlnAsn: 0.0 ± 0.0
GlnPro: 0.88 ± 0.795
GlnGln: 1.759 ± 0.557
GlnArg: 3.078 ± 1.571
GlnSer: 2.639 ± 0.77
GlnThr: 1.319 ± 0.44
GlnVal: 3.078 ± 0.432
GlnTrp: 0.44 ± 0.361
GlnTyr: 0.88 ± 0.581
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.398 ± 1.121
ArgCys: 1.759 ± 0.952
ArgAsp: 5.277 ± 2.076
ArgGlu: 3.078 ± 0.937
ArgPhe: 2.199 ± 0.829
ArgGly: 3.518 ± 1.6
ArgHis: 2.199 ± 1.452
ArgIle: 3.078 ± 0.911
ArgLys: 2.199 ± 0.677
ArgLeu: 6.157 ± 1.44
ArgMet: 3.518 ± 0.644
ArgAsn: 2.199 ± 1.181
ArgPro: 1.319 ± 0.929
ArgGln: 0.88 ± 0.509
ArgArg: 3.518 ± 1.196
ArgSer: 1.759 ± 0.692
ArgThr: 4.837 ± 0.875
ArgVal: 5.717 ± 1.804
ArgTrp: 1.319 ± 0.64
ArgTyr: 2.199 ± 0.829
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.277 ± 2.767
SerCys: 0.88 ± 0.581
SerAsp: 7.916 ± 3.695
SerGlu: 3.518 ± 1.637
SerPhe: 4.837 ± 1.638
SerGly: 4.398 ± 2.675
SerHis: 1.759 ± 0.459
SerIle: 4.398 ± 1.192
SerLys: 7.476 ± 0.734
SerLeu: 7.036 ± 2.505
SerMet: 1.319 ± 0.575
SerAsn: 3.518 ± 1.292
SerPro: 2.199 ± 1.487
SerGln: 1.319 ± 0.44
SerArg: 4.837 ± 1.562
SerSer: 7.036 ± 1.834
SerThr: 3.078 ± 0.658
SerVal: 7.036 ± 1.968
SerTrp: 1.319 ± 0.736
SerTyr: 2.199 ± 0.363
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.398 ± 1.131
ThrCys: 0.88 ± 0.581
ThrAsp: 4.398 ± 1.433
ThrGlu: 3.518 ± 1.114
ThrPhe: 2.199 ± 0.959
ThrGly: 2.639 ± 1.361
ThrHis: 1.319 ± 0.677
ThrIle: 2.199 ± 0.855
ThrLys: 2.639 ± 0.88
ThrLeu: 5.717 ± 0.868
ThrMet: 0.88 ± 0.278
ThrAsn: 1.759 ± 0.731
ThrPro: 1.319 ± 0.929
ThrGln: 2.199 ± 0.363
ThrArg: 2.199 ± 1.141
ThrSer: 4.837 ± 1.355
ThrThr: 4.398 ± 1.123
ThrVal: 3.078 ± 1.18
ThrTrp: 0.44 ± 0.361
ThrTyr: 3.518 ± 1.818
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.795 ± 3.409
ValCys: 1.319 ± 0.677
ValAsp: 4.837 ± 0.788
ValGlu: 3.958 ± 1.098
ValPhe: 0.44 ± 0.361
ValGly: 3.518 ± 0.832
ValHis: 2.639 ± 1.375
ValIle: 3.078 ± 0.868
ValLys: 3.518 ± 0.917
ValLeu: 3.958 ± 0.535
ValMet: 1.759 ± 0.459
ValAsn: 1.319 ± 0.44
ValPro: 6.596 ± 0.388
ValGln: 2.639 ± 0.769
ValArg: 7.036 ± 1.152
ValSer: 7.476 ± 1.821
ValThr: 3.958 ± 0.75
ValVal: 3.958 ± 2.095
ValTrp: 0.44 ± 0.29
ValTyr: 1.759 ± 1.19
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.88 ± 0.589
TrpCys: 0.44 ± 0.29
TrpAsp: 0.88 ± 0.278
TrpGlu: 0.44 ± 0.794
TrpPhe: 0.88 ± 0.278
TrpGly: 1.319 ± 0.575
TrpHis: 0.44 ± 0.29
TrpIle: 1.319 ± 0.575
TrpLys: 1.319 ± 0.44
TrpLeu: 1.759 ± 0.967
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.319 ± 0.736
TrpArg: 0.0 ± 0.0
TrpSer: 0.88 ± 0.278
TrpThr: 1.759 ± 0.938
TrpVal: 0.0 ± 0.0
TrpTrp: 1.319 ± 0.44
TrpTyr: 0.88 ± 0.721
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.199 ± 0.587
TyrCys: 1.759 ± 0.692
TyrAsp: 3.078 ± 1.169
TyrGlu: 0.44 ± 0.29
TyrPhe: 1.759 ± 1.442
TyrGly: 1.759 ± 0.557
TyrHis: 2.199 ± 0.677
TyrIle: 1.759 ± 1.177
TyrLys: 0.88 ± 0.589
TyrLeu: 3.518 ± 0.834
TyrMet: 0.88 ± 0.721
TyrAsn: 1.319 ± 0.736
TyrPro: 0.88 ± 0.278
TyrGln: 1.759 ± 0.557
TyrArg: 1.759 ± 0.731
TyrSer: 1.759 ± 1.19
TyrThr: 0.88 ± 0.739
TyrVal: 0.88 ± 0.278
TyrTrp: 0.44 ± 0.566
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2275 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
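Bootstrapping at the protein level can be sketched as resampling whole proteins with replacement and recomputing the frequency for each replicate. The following is a minimal sketch under stated assumptions, not the Proteome-pI implementation: the function names are hypothetical, one-letter residue codes are used, and reporting the population standard deviation across 100 replicates as the error is an assumption, since the page does not say which spread measure is used.

```python
import random
import statistics

def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: the error is the population standard deviation of the
    replicate estimates; the original method may differ.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the input, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)

# Toy input for illustration only; these are not BMV sequences.
toy = ["MAAKL", "AALAA", "MKKLV", "GAAVA"]
err = bootstrap_error(toy, "AA")
```

With only 4 proteins, as in this proteome, each replicate is drawn from very few units, which is one reason the reported errors can be large relative to the values. A dipeptide that never occurs yields 0.0 in every replicate, matching the "0.0 ± 0.0" entries in the table.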

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski