Amino acid dipepetide frequency for Mushroom bacilliform virus (isolate Australia/AUS LF-1) (MBV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.506AlaAla: 5.506 ± 1.32
0.688AlaCys: 0.688 ± 0.415
2.065AlaAsp: 2.065 ± 0.71
3.441AlaGlu: 3.441 ± 1.422
0.0AlaPhe: 0.0 ± 0.0
4.818AlaGly: 4.818 ± 1.766
1.376AlaHis: 1.376 ± 0.851
2.065AlaIle: 2.065 ± 0.673
2.753AlaLys: 2.753 ± 1.016
4.129AlaLeu: 4.129 ± 1.511
1.376AlaMet: 1.376 ± 0.967
2.753AlaAsn: 2.753 ± 1.801
3.441AlaPro: 3.441 ± 0.765
3.441AlaGln: 3.441 ± 1.482
6.194AlaArg: 6.194 ± 1.027
4.129AlaSer: 4.129 ± 1.409
6.194AlaThr: 6.194 ± 1.922
3.441AlaVal: 3.441 ± 1.744
1.376AlaTrp: 1.376 ± 0.851
2.065AlaTyr: 2.065 ± 0.957
0.0AlaXaa: 0.0 ± 0.0
Cys
0.688CysAla: 0.688 ± 0.415
0.0CysCys: 0.0 ± 0.0
1.376CysAsp: 1.376 ± 0.967
1.376CysGlu: 1.376 ± 0.467
0.0CysPhe: 0.0 ± 0.0
0.688CysGly: 0.688 ± 0.903
0.688CysHis: 0.688 ± 0.903
2.065CysIle: 2.065 ± 1.244
2.065CysLys: 2.065 ± 0.748
2.065CysLeu: 2.065 ± 0.967
2.065CysMet: 2.065 ± 0.748
0.688CysAsn: 0.688 ± 0.415
1.376CysPro: 1.376 ± 0.967
0.0CysGln: 0.0 ± 0.0
2.065CysArg: 2.065 ± 1.955
4.129CysSer: 4.129 ± 1.449
0.0CysThr: 0.0 ± 0.0
1.376CysVal: 1.376 ± 0.467
0.688CysTrp: 0.688 ± 0.571
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.818AspAla: 4.818 ± 0.949
0.688AspCys: 0.688 ± 0.571
2.753AspAsp: 2.753 ± 0.659
2.753AspGlu: 2.753 ± 1.025
3.441AspPhe: 3.441 ± 1.081
3.441AspGly: 3.441 ± 0.841
0.688AspHis: 0.688 ± 0.903
1.376AspIle: 1.376 ± 0.958
0.688AspLys: 0.688 ± 0.415
5.506AspLeu: 5.506 ± 1.895
0.0AspMet: 0.0 ± 0.0
1.376AspAsn: 1.376 ± 0.467
1.376AspPro: 1.376 ± 0.958
1.376AspGln: 1.376 ± 0.467
2.065AspArg: 2.065 ± 1.244
5.506AspSer: 5.506 ± 1.466
0.0AspThr: 0.0 ± 0.0
4.818AspVal: 4.818 ± 2.446
1.376AspTrp: 1.376 ± 0.467
0.688AspTyr: 0.688 ± 0.571
0.0AspXaa: 0.0 ± 0.0
Glu
2.753GluAla: 2.753 ± 1.659
2.065GluCys: 2.065 ± 0.985
1.376GluAsp: 1.376 ± 0.829
5.506GluGlu: 5.506 ± 2.032
4.129GluPhe: 4.129 ± 0.605
2.065GluGly: 2.065 ± 0.957
0.688GluHis: 0.688 ± 0.571
5.506GluIle: 5.506 ± 2.929
4.818GluLys: 4.818 ± 2.198
3.441GluLeu: 3.441 ± 0.765
1.376GluMet: 1.376 ± 0.806
1.376GluAsn: 1.376 ± 0.851
1.376GluPro: 1.376 ± 0.467
2.753GluGln: 2.753 ± 0.727
2.753GluArg: 2.753 ± 1.016
6.194GluSer: 6.194 ± 2.289
4.818GluThr: 4.818 ± 0.847
4.129GluVal: 4.129 ± 0.551
0.688GluTrp: 0.688 ± 0.415
2.065GluTyr: 2.065 ± 1.714
0.0GluXaa: 0.0 ± 0.0
Phe
4.818PheAla: 4.818 ± 2.26
2.753PheCys: 2.753 ± 0.659
2.753PheAsp: 2.753 ± 1.702
4.818PheGlu: 4.818 ± 0.956
0.688PhePhe: 0.688 ± 0.571
2.065PheGly: 2.065 ± 0.957
0.0PheHis: 0.0 ± 0.0
0.688PheIle: 0.688 ± 0.571
0.688PheLys: 0.688 ± 0.571
4.129PheLeu: 4.129 ± 1.266
0.0PheMet: 0.0 ± 0.0
2.065PheAsn: 2.065 ± 0.748
2.753PhePro: 2.753 ± 0.659
1.376PheGln: 1.376 ± 0.829
3.441PheArg: 3.441 ± 1.482
4.129PheSer: 4.129 ± 1.4
1.376PheThr: 1.376 ± 0.829
4.129PheVal: 4.129 ± 0.605
0.0PheTrp: 0.0 ± 0.0
1.376PheTyr: 1.376 ± 0.967
0.0PheXaa: 0.0 ± 0.0
Gly
2.753GlyAla: 2.753 ± 1.917
1.376GlyCys: 1.376 ± 0.851
3.441GlyAsp: 3.441 ± 2.856
2.065GlyGlu: 2.065 ± 1.277
6.882GlyPhe: 6.882 ± 1.409
2.753GlyGly: 2.753 ± 1.506
2.065GlyHis: 2.065 ± 0.957
6.194GlyIle: 6.194 ± 1.433
3.441GlyLys: 3.441 ± 0.599
10.323GlyLeu: 10.323 ± 2.309
2.065GlyMet: 2.065 ± 0.748
3.441GlyAsn: 3.441 ± 1.683
2.065GlyPro: 2.065 ± 1.06
2.753GlyGln: 2.753 ± 1.025
3.441GlyArg: 3.441 ± 0.841
4.818GlySer: 4.818 ± 1.417
6.194GlyThr: 6.194 ± 1.473
2.065GlyVal: 2.065 ± 0.967
4.129GlyTrp: 4.129 ± 0.879
2.753GlyTyr: 2.753 ± 1.506
0.0GlyXaa: 0.0 ± 0.0
His
0.688HisAla: 0.688 ± 0.571
0.688HisCys: 0.688 ± 0.903
0.688HisAsp: 0.688 ± 0.415
0.0HisGlu: 0.0 ± 0.0
0.688HisPhe: 0.688 ± 0.903
2.753HisGly: 2.753 ± 0.727
0.688HisHis: 0.688 ± 0.571
0.688HisIle: 0.688 ± 0.926
0.688HisLys: 0.688 ± 0.415
1.376HisLeu: 1.376 ± 0.967
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.376HisPro: 1.376 ± 1.143
0.688HisGln: 0.688 ± 0.903
0.688HisArg: 0.688 ± 0.903
3.441HisSer: 3.441 ± 1.635
1.376HisThr: 1.376 ± 0.467
0.688HisVal: 0.688 ± 0.415
0.0HisTrp: 0.0 ± 0.0
0.688HisTyr: 0.688 ± 0.926
0.0HisXaa: 0.0 ± 0.0
Ile
3.441IleAla: 3.441 ± 1.422
2.065IleCys: 2.065 ± 1.277
1.376IleAsp: 1.376 ± 0.467
2.753IleGlu: 2.753 ± 1.251
2.065IlePhe: 2.065 ± 1.714
2.753IleGly: 2.753 ± 1.88
0.688IleHis: 0.688 ± 0.415
0.688IleIle: 0.688 ± 0.571
2.065IleLys: 2.065 ± 1.307
3.441IleLeu: 3.441 ± 2.073
2.065IleMet: 2.065 ± 0.967
1.376IleAsn: 1.376 ± 0.829
2.065IlePro: 2.065 ± 0.967
0.0IleGln: 0.0 ± 0.0
3.441IleArg: 3.441 ± 0.96
2.065IleSer: 2.065 ± 0.748
2.753IleThr: 2.753 ± 0.659
4.129IleVal: 4.129 ± 0.605
1.376IleTrp: 1.376 ± 1.143
1.376IleTyr: 1.376 ± 0.829
0.0IleXaa: 0.0 ± 0.0
Lys
3.441LysAla: 3.441 ± 1.482
0.0LysCys: 0.0 ± 0.0
0.688LysAsp: 0.688 ± 0.903
1.376LysGlu: 1.376 ± 0.829
2.753LysPhe: 2.753 ± 1.22
2.065LysGly: 2.065 ± 0.748
0.688LysHis: 0.688 ± 0.415
0.688LysIle: 0.688 ± 0.415
2.065LysLys: 2.065 ± 1.244
7.571LysLeu: 7.571 ± 2.388
2.753LysMet: 2.753 ± 0.972
1.376LysAsn: 1.376 ± 0.851
0.688LysPro: 0.688 ± 0.415
3.441LysGln: 3.441 ± 1.744
1.376LysArg: 1.376 ± 0.467
4.818LysSer: 4.818 ± 2.903
3.441LysThr: 3.441 ± 1.679
6.194LysVal: 6.194 ± 1.308
1.376LysTrp: 1.376 ± 0.851
4.129LysTyr: 4.129 ± 1.384
0.0LysXaa: 0.0 ± 0.0
Leu
4.818LeuAla: 4.818 ± 1.766
2.753LeuCys: 2.753 ± 0.727
4.818LeuAsp: 4.818 ± 2.198
8.947LeuGlu: 8.947 ± 1.654
2.065LeuPhe: 2.065 ± 1.06
8.947LeuGly: 8.947 ± 1.94
0.688LeuHis: 0.688 ± 0.903
2.065LeuIle: 2.065 ± 0.71
0.688LeuLys: 0.688 ± 0.571
6.882LeuLeu: 6.882 ± 2.083
5.506LeuMet: 5.506 ± 1.895
2.753LeuAsn: 2.753 ± 0.933
2.753LeuPro: 2.753 ± 0.933
2.753LeuGln: 2.753 ± 0.834
6.882LeuArg: 6.882 ± 2.946
10.323LeuSer: 10.323 ± 2.519
1.376LeuThr: 1.376 ± 0.467
12.388LeuVal: 12.388 ± 3.652
2.065LeuTrp: 2.065 ± 1.244
2.753LeuTyr: 2.753 ± 1.77
0.0LeuXaa: 0.0 ± 0.0
Met
2.753MetAla: 2.753 ± 0.834
0.688MetCys: 0.688 ± 0.571
2.753MetAsp: 2.753 ± 1.016
2.065MetGlu: 2.065 ± 1.244
0.688MetPhe: 0.688 ± 0.571
3.441MetGly: 3.441 ± 1.393
0.0MetHis: 0.0 ± 0.0
0.688MetIle: 0.688 ± 0.415
0.688MetLys: 0.688 ± 0.571
2.753MetLeu: 2.753 ± 0.933
0.688MetMet: 0.688 ± 0.415
1.376MetAsn: 1.376 ± 0.85
1.376MetPro: 1.376 ± 0.467
0.0MetGln: 0.0 ± 0.0
0.688MetArg: 0.688 ± 0.571
2.753MetSer: 2.753 ± 1.033
0.688MetThr: 0.688 ± 0.571
2.753MetVal: 2.753 ± 1.251
0.0MetTrp: 0.0 ± 0.0
1.376MetTyr: 1.376 ± 0.958
0.0MetXaa: 0.0 ± 0.0
Asn
1.376AsnAla: 1.376 ± 1.143
1.376AsnCys: 1.376 ± 1.234
1.376AsnAsp: 1.376 ± 0.967
0.688AsnGlu: 0.688 ± 0.926
1.376AsnPhe: 1.376 ± 0.467
6.194AsnGly: 6.194 ± 1.589
0.0AsnHis: 0.0 ± 0.0
1.376AsnIle: 1.376 ± 0.967
5.506AsnLys: 5.506 ± 2.136
0.0AsnLeu: 0.0 ± 0.0
2.065AsnMet: 2.065 ± 0.892
2.065AsnAsn: 2.065 ± 1.307
0.0AsnPro: 0.0 ± 0.0
0.0AsnGln: 0.0 ± 0.0
2.753AsnArg: 2.753 ± 2.642
2.753AsnSer: 2.753 ± 0.659
3.441AsnThr: 3.441 ± 1.398
1.376AsnVal: 1.376 ± 0.85
1.376AsnTrp: 1.376 ± 0.85
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.065ProAla: 2.065 ± 0.71
0.0ProCys: 0.0 ± 0.0
2.065ProAsp: 2.065 ± 0.748
2.753ProGlu: 2.753 ± 1.033
1.376ProPhe: 1.376 ± 0.85
4.818ProGly: 4.818 ± 1.825
1.376ProHis: 1.376 ± 0.958
2.065ProIle: 2.065 ± 1.244
3.441ProLys: 3.441 ± 0.86
2.753ProLeu: 2.753 ± 0.933
1.376ProMet: 1.376 ± 0.829
2.065ProAsn: 2.065 ± 1.73
4.818ProPro: 4.818 ± 1.553
0.688ProGln: 0.688 ± 0.415
1.376ProArg: 1.376 ± 0.85
8.259ProSer: 8.259 ± 1.74
2.753ProThr: 2.753 ± 1.583
4.129ProVal: 4.129 ± 1.795
0.688ProTrp: 0.688 ± 0.571
0.688ProTyr: 0.688 ± 0.926
0.0ProXaa: 0.0 ± 0.0
Gln
0.688GlnAla: 0.688 ± 0.415
0.0GlnCys: 0.0 ± 0.0
1.376GlnAsp: 1.376 ± 0.829
2.065GlnGlu: 2.065 ± 0.748
1.376GlnPhe: 1.376 ± 1.807
0.0GlnGly: 0.0 ± 0.0
0.0GlnHis: 0.0 ± 0.0
1.376GlnIle: 1.376 ± 0.967
2.753GlnLys: 2.753 ± 1.025
4.129GlnLeu: 4.129 ± 2.172
0.0GlnMet: 0.0 ± 0.0
2.065GlnAsn: 2.065 ± 1.307
0.688GlnPro: 0.688 ± 0.415
2.065GlnGln: 2.065 ± 0.673
2.753GlnArg: 2.753 ± 1.025
4.818GlnSer: 4.818 ± 2.609
1.376GlnThr: 1.376 ± 0.467
2.753GlnVal: 2.753 ± 1.506
1.376GlnTrp: 1.376 ± 0.467
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.818ArgAla: 4.818 ± 2.488
1.376ArgCys: 1.376 ± 0.85
2.753ArgAsp: 2.753 ± 1.699
3.441ArgGlu: 3.441 ± 0.96
2.065ArgPhe: 2.065 ± 0.957
3.441ArgGly: 3.441 ± 0.599
2.753ArgHis: 2.753 ± 0.659
5.506ArgIle: 5.506 ± 1.751
4.129ArgLys: 4.129 ± 1.971
6.194ArgLeu: 6.194 ± 2.116
0.688ArgMet: 0.688 ± 0.571
2.065ArgAsn: 2.065 ± 1.06
2.753ArgPro: 2.753 ± 1.016
0.688ArgGln: 0.688 ± 0.926
6.194ArgArg: 6.194 ± 3.452
0.688ArgSer: 0.688 ± 0.415
4.129ArgThr: 4.129 ± 0.879
6.194ArgVal: 6.194 ± 2.155
0.0ArgTrp: 0.0 ± 0.0
2.065ArgTyr: 2.065 ± 0.71
0.0ArgXaa: 0.0 ± 0.0
Ser
4.818SerAla: 4.818 ± 1.553
1.376SerCys: 1.376 ± 0.467
4.818SerAsp: 4.818 ± 1.766
8.259SerGlu: 8.259 ± 3.048
2.753SerPhe: 2.753 ± 0.659
8.947SerGly: 8.947 ± 0.863
2.753SerHis: 2.753 ± 1.934
0.688SerIle: 0.688 ± 0.415
7.571SerLys: 7.571 ± 3.191
6.194SerLeu: 6.194 ± 2.016
1.376SerMet: 1.376 ± 0.657
3.441SerAsn: 3.441 ± 0.765
4.129SerPro: 4.129 ± 1.934
2.753SerGln: 2.753 ± 1.592
4.818SerArg: 4.818 ± 2.273
13.076SerSer: 13.076 ± 3.746
6.194SerThr: 6.194 ± 1.619
6.194SerVal: 6.194 ± 2.067
1.376SerTrp: 1.376 ± 0.829
2.065SerTyr: 2.065 ± 0.967
0.0SerXaa: 0.0 ± 0.0
Thr
4.129ThrAla: 4.129 ± 1.151
0.688ThrCys: 0.688 ± 0.415
2.753ThrAsp: 2.753 ± 1.016
1.376ThrGlu: 1.376 ± 0.829
2.753ThrPhe: 2.753 ± 1.22
4.818ThrGly: 4.818 ± 1.05
0.0ThrHis: 0.0 ± 0.0
2.065ThrIle: 2.065 ± 0.748
1.376ThrLys: 1.376 ± 0.851
6.194ThrLeu: 6.194 ± 2.245
2.065ThrMet: 2.065 ± 0.673
0.0ThrAsn: 0.0 ± 0.0
4.129ThrPro: 4.129 ± 1.151
3.441ThrGln: 3.441 ± 0.841
3.441ThrArg: 3.441 ± 0.918
4.129ThrSer: 4.129 ± 0.76
2.065ThrThr: 2.065 ± 0.967
6.882ThrVal: 6.882 ± 1.198
1.376ThrTrp: 1.376 ± 0.467
0.688ThrTyr: 0.688 ± 0.415
0.0ThrXaa: 0.0 ± 0.0
Val
4.818ValAla: 4.818 ± 1.581
1.376ValCys: 1.376 ± 0.85
2.753ValAsp: 2.753 ± 0.933
2.753ValGlu: 2.753 ± 1.025
4.129ValPhe: 4.129 ± 1.179
8.259ValGly: 8.259 ± 1.609
2.065ValHis: 2.065 ± 1.173
4.818ValIle: 4.818 ± 0.956
4.129ValLys: 4.129 ± 2.475
8.259ValLeu: 8.259 ± 2.256
2.065ValMet: 2.065 ± 0.957
1.376ValAsn: 1.376 ± 0.958
7.571ValPro: 7.571 ± 2.717
3.441ValGln: 3.441 ± 1.482
4.129ValArg: 4.129 ± 1.905
4.818ValSer: 4.818 ± 1.581
4.129ValThr: 4.129 ± 1.795
9.635ValVal: 9.635 ± 2.226
2.065ValTrp: 2.065 ± 0.985
1.376ValTyr: 1.376 ± 1.853
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
2.753TrpCys: 2.753 ± 0.727
1.376TrpAsp: 1.376 ± 1.143
2.065TrpGlu: 2.065 ± 0.957
3.441TrpPhe: 3.441 ± 1.081
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.688TrpIle: 0.688 ± 0.926
0.688TrpLys: 0.688 ± 0.415
2.753TrpLeu: 2.753 ± 1.251
0.0TrpMet: 0.0 ± 0.0
0.688TrpAsn: 0.688 ± 0.926
2.065TrpPro: 2.065 ± 0.985
0.0TrpGln: 0.0 ± 0.0
0.688TrpArg: 0.688 ± 0.415
2.065TrpSer: 2.065 ± 0.957
1.376TrpThr: 1.376 ± 0.829
0.0TrpVal: 0.0 ± 0.0
0.688TrpTrp: 0.688 ± 0.415
0.688TrpTyr: 0.688 ± 0.415
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.065TyrAla: 2.065 ± 1.73
0.688TyrCys: 0.688 ± 0.903
1.376TyrAsp: 1.376 ± 1.853
1.376TyrGlu: 1.376 ± 0.851
1.376TyrPhe: 1.376 ± 0.467
2.753TyrGly: 2.753 ± 0.968
0.688TyrHis: 0.688 ± 0.415
0.688TyrIle: 0.688 ± 0.903
0.688TyrLys: 0.688 ± 0.415
4.129TyrLeu: 4.129 ± 1.341
0.0TyrMet: 0.0 ± 0.0
2.753TyrAsn: 2.753 ± 1.025
2.753TyrPro: 2.753 ± 1.506
0.0TyrGln: 0.0 ± 0.0
2.753TyrArg: 2.753 ± 0.968
1.376TyrSer: 1.376 ± 1.143
0.688TyrThr: 0.688 ± 0.415
0.688TyrVal: 0.688 ± 0.571
0.0TyrTrp: 0.0 ± 0.0
1.376TyrTyr: 1.376 ± 0.958
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1454 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski