Amino acid dipepetide frequency for Banana streak MY virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.298AlaAla: 2.298 ± 1.073
1.379AlaCys: 1.379 ± 0.644
5.515AlaAsp: 5.515 ± 0.538
4.596AlaGlu: 4.596 ± 2.145
1.379AlaPhe: 1.379 ± 0.644
2.757AlaGly: 2.757 ± 0.737
1.379AlaHis: 1.379 ± 1.087
3.676AlaIle: 3.676 ± 2.411
3.676AlaLys: 3.676 ± 1.716
5.515AlaLeu: 5.515 ± 2.81
2.757AlaMet: 2.757 ± 1.287
4.136AlaAsn: 4.136 ± 1.151
2.298AlaPro: 2.298 ± 1.088
2.757AlaGln: 2.757 ± 1.287
3.676AlaArg: 3.676 ± 1.716
3.676AlaSer: 3.676 ± 3.168
2.757AlaThr: 2.757 ± 1.003
2.298AlaVal: 2.298 ± 1.073
0.919AlaTrp: 0.919 ± 0.429
2.298AlaTyr: 2.298 ± 1.073
0.0AlaXaa: 0.0 ± 0.0
Cys
0.919CysAla: 0.919 ± 0.429
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.379CysGlu: 1.379 ± 0.644
0.46CysPhe: 0.46 ± 0.215
0.919CysGly: 0.919 ± 0.429
0.919CysHis: 0.919 ± 0.429
0.0CysIle: 0.0 ± 0.0
2.298CysLys: 2.298 ± 1.073
0.0CysLeu: 0.0 ± 0.0
0.46CysMet: 0.46 ± 0.215
0.46CysAsn: 0.46 ± 0.215
0.46CysPro: 0.46 ± 1.438
0.919CysGln: 0.919 ± 0.429
2.298CysArg: 2.298 ± 1.073
0.0CysSer: 0.0 ± 0.0
1.838CysThr: 1.838 ± 0.858
0.46CysVal: 0.46 ± 0.215
0.0CysTrp: 0.0 ± 0.0
0.919CysTyr: 0.919 ± 0.429
0.0CysXaa: 0.0 ± 0.0
Asp
3.676AspAla: 3.676 ± 1.716
0.46AspCys: 0.46 ± 0.215
4.596AspAsp: 4.596 ± 1.102
4.596AspGlu: 4.596 ± 1.102
3.217AspPhe: 3.217 ± 1.502
1.838AspGly: 1.838 ± 0.858
1.838AspHis: 1.838 ± 0.858
2.757AspIle: 2.757 ± 1.003
5.055AspLys: 5.055 ± 3.813
5.974AspLeu: 5.974 ± 6.656
1.838AspMet: 1.838 ± 0.858
0.46AspAsn: 0.46 ± 0.215
2.757AspPro: 2.757 ± 1.287
2.298AspGln: 2.298 ± 1.073
3.217AspArg: 3.217 ± 1.502
1.838AspSer: 1.838 ± 0.858
2.757AspThr: 2.757 ± 0.737
1.379AspVal: 1.379 ± 1.087
0.46AspTrp: 0.46 ± 0.215
3.676AspTyr: 3.676 ± 1.362
0.0AspXaa: 0.0 ± 0.0
Glu
6.434GluAla: 6.434 ± 1.589
0.46GluCys: 0.46 ± 0.215
4.596GluAsp: 4.596 ± 0.989
13.787GluGlu: 13.787 ± 0.577
1.838GluPhe: 1.838 ± 0.858
5.974GluGly: 5.974 ± 1.951
1.379GluHis: 1.379 ± 0.644
6.893GluIle: 6.893 ± 1.909
12.408GluLys: 12.408 ± 2.523
11.949GluLeu: 11.949 ± 5.352
2.298GluMet: 2.298 ± 1.073
4.136GluAsn: 4.136 ± 0.855
1.379GluPro: 1.379 ± 0.644
2.757GluGln: 2.757 ± 3.267
5.055GluArg: 5.055 ± 0.737
3.217GluSer: 3.217 ± 1.502
3.676GluThr: 3.676 ± 1.716
7.353GluVal: 7.353 ± 1.925
1.379GluTrp: 1.379 ± 0.644
4.136GluTyr: 4.136 ± 0.855
0.0GluXaa: 0.0 ± 0.0
Phe
0.919PheAla: 0.919 ± 0.429
0.46PheCys: 0.46 ± 0.215
1.838PheAsp: 1.838 ± 0.858
2.298PheGlu: 2.298 ± 0.815
0.46PhePhe: 0.46 ± 0.215
1.838PheGly: 1.838 ± 0.937
0.46PheHis: 0.46 ± 0.215
1.838PheIle: 1.838 ± 0.858
3.676PheLys: 3.676 ± 1.362
4.136PheLeu: 4.136 ± 1.931
0.46PheMet: 0.46 ± 0.215
1.379PheAsn: 1.379 ± 0.644
0.46PhePro: 0.46 ± 0.215
3.217PheGln: 3.217 ± 1.502
1.838PheArg: 1.838 ± 0.858
0.919PheSer: 0.919 ± 0.429
1.838PheThr: 1.838 ± 1.206
0.46PheVal: 0.46 ± 0.215
0.0PheTrp: 0.0 ± 0.0
2.757PheTyr: 2.757 ± 1.287
0.0PheXaa: 0.0 ± 0.0
Gly
2.298GlyAla: 2.298 ± 1.088
0.919GlyCys: 0.919 ± 0.429
3.217GlyAsp: 3.217 ± 1.573
4.596GlyGlu: 4.596 ± 2.145
1.838GlyPhe: 1.838 ± 2.212
2.757GlyGly: 2.757 ± 1.003
0.46GlyHis: 0.46 ± 0.215
3.676GlyIle: 3.676 ± 0.759
4.136GlyLys: 4.136 ± 1.931
4.596GlyLeu: 4.596 ± 1.102
0.46GlyMet: 0.46 ± 1.02
3.676GlyAsn: 3.676 ± 0.962
2.757GlyPro: 2.757 ± 1.287
2.757GlyGln: 2.757 ± 1.003
2.298GlyArg: 2.298 ± 1.073
0.919GlySer: 0.919 ± 0.429
5.055GlyThr: 5.055 ± 1.224
4.596GlyVal: 4.596 ± 1.102
1.379GlyTrp: 1.379 ± 1.347
2.298GlyTyr: 2.298 ± 1.073
0.0GlyXaa: 0.0 ± 0.0
His
0.46HisAla: 0.46 ± 0.215
0.0HisCys: 0.0 ± 0.0
0.46HisAsp: 0.46 ± 0.215
1.838HisGlu: 1.838 ± 0.858
0.46HisPhe: 0.46 ± 0.215
0.46HisGly: 0.46 ± 0.215
0.46HisHis: 0.46 ± 0.215
2.757HisIle: 2.757 ± 1.287
0.919HisLys: 0.919 ± 1.257
1.838HisLeu: 1.838 ± 0.858
0.46HisMet: 0.46 ± 0.215
0.46HisAsn: 0.46 ± 1.438
0.0HisPro: 0.0 ± 0.0
1.838HisGln: 1.838 ± 1.206
3.217HisArg: 3.217 ± 0.717
0.0HisSer: 0.0 ± 0.0
1.379HisThr: 1.379 ± 1.087
0.919HisVal: 0.919 ± 1.257
0.46HisTrp: 0.46 ± 0.215
0.46HisTyr: 0.46 ± 0.215
0.0HisXaa: 0.0 ± 0.0
Ile
2.298IleAla: 2.298 ± 2.341
0.919IleCys: 0.919 ± 0.429
4.136IleAsp: 4.136 ± 1.931
4.596IleGlu: 4.596 ± 2.776
0.919IlePhe: 0.919 ± 0.429
5.974IleGly: 5.974 ± 1.507
1.379IleHis: 1.379 ± 2.425
2.757IleIle: 2.757 ± 0.737
5.515IleLys: 5.515 ± 1.368
5.055IleLeu: 5.055 ± 2.081
2.757IleMet: 2.757 ± 1.287
3.676IleAsn: 3.676 ± 1.716
2.757IlePro: 2.757 ± 1.287
2.757IleGln: 2.757 ± 1.003
2.757IleArg: 2.757 ± 1.003
6.434IleSer: 6.434 ± 0.245
4.596IleThr: 4.596 ± 1.102
2.298IleVal: 2.298 ± 0.815
1.838IleTrp: 1.838 ± 0.858
2.298IleTyr: 2.298 ± 1.088
0.0IleXaa: 0.0 ± 0.0
Lys
3.676LysAla: 3.676 ± 3.168
3.217LysCys: 3.217 ± 1.502
4.136LysAsp: 4.136 ± 2.632
10.11LysGlu: 10.11 ± 1.474
4.596LysPhe: 4.596 ± 2.145
2.757LysGly: 2.757 ± 1.287
1.379LysHis: 1.379 ± 0.644
7.353LysIle: 7.353 ± 1.706
4.596LysLys: 4.596 ± 4.017
6.893LysLeu: 6.893 ± 3.888
2.298LysMet: 2.298 ± 1.073
3.217LysAsn: 3.217 ± 1.573
2.757LysPro: 2.757 ± 1.287
0.919LysGln: 0.919 ± 0.429
3.676LysArg: 3.676 ± 0.962
5.055LysSer: 5.055 ± 2.584
3.217LysThr: 3.217 ± 0.717
9.191LysVal: 9.191 ± 5.309
0.919LysTrp: 0.919 ± 0.429
1.379LysTyr: 1.379 ± 0.644
0.0LysXaa: 0.0 ± 0.0
Leu
5.974LeuAla: 5.974 ± 3.359
1.379LeuCys: 1.379 ± 0.644
3.217LeuAsp: 3.217 ± 1.573
11.949LeuGlu: 11.949 ± 3.976
1.379LeuPhe: 1.379 ± 1.087
6.434LeuGly: 6.434 ± 1.699
2.298LeuHis: 2.298 ± 2.341
5.055LeuIle: 5.055 ± 0.737
8.272LeuLys: 8.272 ± 1.431
6.434LeuLeu: 6.434 ± 3.003
1.838LeuMet: 1.838 ± 0.858
5.515LeuAsn: 5.515 ± 3.61
3.217LeuPro: 3.217 ± 1.502
4.596LeuGln: 4.596 ± 2.421
5.974LeuArg: 5.974 ± 1.795
5.515LeuSer: 5.515 ± 3.61
3.217LeuThr: 3.217 ± 4.637
5.055LeuVal: 5.055 ± 1.538
0.0LeuTrp: 0.0 ± 0.0
3.676LeuTyr: 3.676 ± 0.962
0.0LeuXaa: 0.0 ± 0.0
Met
3.217MetAla: 3.217 ± 1.502
0.0MetCys: 0.0 ± 0.0
2.757MetAsp: 2.757 ± 0.737
1.838MetGlu: 1.838 ± 0.858
1.379MetPhe: 1.379 ± 0.644
0.919MetGly: 0.919 ± 0.429
0.0MetHis: 0.0 ± 0.0
1.379MetIle: 1.379 ± 0.644
2.298MetLys: 2.298 ± 1.073
0.919MetLeu: 0.919 ± 0.429
1.379MetMet: 1.379 ± 0.644
0.919MetAsn: 0.919 ± 0.429
0.919MetPro: 0.919 ± 0.429
0.0MetGln: 0.0 ± 0.0
1.379MetArg: 1.379 ± 0.644
2.298MetSer: 2.298 ± 1.088
1.838MetThr: 1.838 ± 0.858
2.757MetVal: 2.757 ± 1.287
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.757AsnAla: 2.757 ± 1.287
0.46AsnCys: 0.46 ± 0.215
0.919AsnAsp: 0.919 ± 0.429
2.757AsnGlu: 2.757 ± 1.287
1.379AsnPhe: 1.379 ± 0.644
3.217AsnGly: 3.217 ± 0.959
0.46AsnHis: 0.46 ± 0.215
1.838AsnIle: 1.838 ± 0.858
3.676AsnLys: 3.676 ± 1.873
8.732AsnLeu: 8.732 ± 2.248
1.379AsnMet: 1.379 ± 0.644
2.757AsnAsn: 2.757 ± 1.287
2.298AsnPro: 2.298 ± 1.088
2.757AsnGln: 2.757 ± 0.737
0.919AsnArg: 0.919 ± 1.257
1.379AsnSer: 1.379 ± 1.087
4.596AsnThr: 4.596 ± 2.776
1.379AsnVal: 1.379 ± 0.644
0.0AsnTrp: 0.0 ± 0.0
3.217AsnTyr: 3.217 ± 0.717
0.0AsnXaa: 0.0 ± 0.0
Pro
3.217ProAla: 3.217 ± 1.502
0.0ProCys: 0.0 ± 0.0
2.298ProAsp: 2.298 ± 1.088
3.676ProGlu: 3.676 ± 1.716
0.919ProPhe: 0.919 ± 0.429
2.298ProGly: 2.298 ± 1.073
1.379ProHis: 1.379 ± 0.644
1.838ProIle: 1.838 ± 0.858
3.217ProLys: 3.217 ± 0.717
2.757ProLeu: 2.757 ± 1.786
0.46ProMet: 0.46 ± 0.215
1.838ProAsn: 1.838 ± 0.858
1.379ProPro: 1.379 ± 0.644
0.0ProGln: 0.0 ± 0.0
1.379ProArg: 1.379 ± 0.644
3.217ProSer: 3.217 ± 1.502
2.298ProThr: 2.298 ± 1.073
0.919ProVal: 0.919 ± 0.429
0.46ProTrp: 0.46 ± 0.215
0.46ProTyr: 0.46 ± 1.678
0.0ProXaa: 0.0 ± 0.0
Gln
4.136GlnAla: 4.136 ± 0.855
0.0GlnCys: 0.0 ± 0.0
3.217GlnAsp: 3.217 ± 1.502
3.217GlnGlu: 3.217 ± 1.502
0.0GlnPhe: 0.0 ± 0.0
2.757GlnGly: 2.757 ± 2.695
1.838GlnHis: 1.838 ± 0.858
5.055GlnIle: 5.055 ± 1.224
1.379GlnLys: 1.379 ± 0.644
2.298GlnLeu: 2.298 ± 1.088
0.919GlnMet: 0.919 ± 0.992
3.676GlnAsn: 3.676 ± 0.759
1.838GlnPro: 1.838 ± 0.937
3.217GlnGln: 3.217 ± 0.717
3.217GlnArg: 3.217 ± 1.573
1.379GlnSer: 1.379 ± 2.425
1.379GlnThr: 1.379 ± 1.087
1.838GlnVal: 1.838 ± 0.858
0.0GlnTrp: 0.0 ± 0.0
1.838GlnTyr: 1.838 ± 0.858
0.0GlnXaa: 0.0 ± 0.0
Arg
3.217ArgAla: 3.217 ± 1.502
1.379ArgCys: 1.379 ± 1.087
2.298ArgAsp: 2.298 ± 1.073
6.893ArgGlu: 6.893 ± 1.386
2.298ArgPhe: 2.298 ± 1.073
1.838ArgGly: 1.838 ± 1.206
0.919ArgHis: 0.919 ± 0.429
4.136ArgIle: 4.136 ± 0.855
3.676ArgLys: 3.676 ± 0.759
5.055ArgLeu: 5.055 ± 0.737
1.379ArgMet: 1.379 ± 0.644
3.217ArgAsn: 3.217 ± 1.502
2.298ArgPro: 2.298 ± 1.073
1.379ArgGln: 1.379 ± 1.347
4.596ArgArg: 4.596 ± 2.145
5.055ArgSer: 5.055 ± 3.813
4.596ArgThr: 4.596 ± 0.989
3.217ArgVal: 3.217 ± 0.959
1.379ArgTrp: 1.379 ± 0.644
0.919ArgTyr: 0.919 ± 0.429
0.0ArgXaa: 0.0 ± 0.0
Ser
2.757SerAla: 2.757 ± 1.003
0.0SerCys: 0.0 ± 0.0
3.676SerAsp: 3.676 ± 1.873
7.353SerGlu: 7.353 ± 1.187
0.919SerPhe: 0.919 ± 0.429
4.596SerGly: 4.596 ± 1.102
0.919SerHis: 0.919 ± 1.257
4.596SerIle: 4.596 ± 0.942
3.217SerLys: 3.217 ± 0.717
6.434SerLeu: 6.434 ± 6.736
0.919SerMet: 0.919 ± 0.429
1.838SerAsn: 1.838 ± 2.212
1.838SerPro: 1.838 ± 1.206
3.217SerGln: 3.217 ± 0.717
4.136SerArg: 4.136 ± 0.855
3.217SerSer: 3.217 ± 2.548
4.136SerThr: 4.136 ± 1.931
1.838SerVal: 1.838 ± 1.206
0.919SerTrp: 0.919 ± 0.429
0.46SerTyr: 0.46 ± 0.215
0.0SerXaa: 0.0 ± 0.0
Thr
5.515ThrAla: 5.515 ± 1.368
0.919ThrCys: 0.919 ± 0.429
2.298ThrAsp: 2.298 ± 1.073
7.353ThrGlu: 7.353 ± 1.187
2.757ThrPhe: 2.757 ± 0.737
3.217ThrGly: 3.217 ± 1.502
0.46ThrHis: 0.46 ± 0.215
3.217ThrIle: 3.217 ± 0.959
5.055ThrLys: 5.055 ± 3.784
2.757ThrLeu: 2.757 ± 1.003
1.838ThrMet: 1.838 ± 0.858
0.919ThrAsn: 0.919 ± 0.429
0.919ThrPro: 0.919 ± 0.429
1.838ThrGln: 1.838 ± 1.206
3.217ThrArg: 3.217 ± 1.573
5.974ThrSer: 5.974 ± 1.529
4.136ThrThr: 4.136 ± 1.931
3.217ThrVal: 3.217 ± 1.502
0.46ThrTrp: 0.46 ± 0.215
2.757ThrTyr: 2.757 ± 2.175
0.0ThrXaa: 0.0 ± 0.0
Val
2.298ValAla: 2.298 ± 1.073
1.838ValCys: 1.838 ± 0.858
2.757ValAsp: 2.757 ± 0.737
4.136ValGlu: 4.136 ± 3.262
3.676ValPhe: 3.676 ± 0.962
3.676ValGly: 3.676 ± 2.411
0.46ValHis: 0.46 ± 0.215
2.757ValIle: 2.757 ± 2.175
2.757ValLys: 2.757 ± 2.695
4.136ValLeu: 4.136 ± 0.855
0.919ValMet: 0.919 ± 0.429
2.757ValAsn: 2.757 ± 0.737
1.838ValPro: 1.838 ± 0.858
2.757ValGln: 2.757 ± 1.287
3.217ValArg: 3.217 ± 0.717
4.136ValSer: 4.136 ± 0.855
3.676ValThr: 3.676 ± 0.962
2.298ValVal: 2.298 ± 1.998
0.0ValTrp: 0.0 ± 0.0
1.838ValTyr: 1.838 ± 0.858
0.0ValXaa: 0.0 ± 0.0
Trp
0.46TrpAla: 0.46 ± 0.215
0.0TrpCys: 0.0 ± 0.0
0.919TrpAsp: 0.919 ± 1.257
1.838TrpGlu: 1.838 ± 1.206
0.0TrpPhe: 0.0 ± 0.0
0.46TrpGly: 0.46 ± 0.215
0.0TrpHis: 0.0 ± 0.0
0.919TrpIle: 0.919 ± 0.429
0.919TrpLys: 0.919 ± 0.429
1.379TrpLeu: 1.379 ± 0.644
0.0TrpMet: 0.0 ± 0.0
0.919TrpAsn: 0.919 ± 0.429
0.0TrpPro: 0.0 ± 0.0
1.379TrpGln: 1.379 ± 0.644
0.919TrpArg: 0.919 ± 0.429
0.0TrpSer: 0.0 ± 0.0
1.838TrpThr: 1.838 ± 0.858
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.217TyrAla: 3.217 ± 1.502
0.919TyrCys: 0.919 ± 0.429
1.838TyrAsp: 1.838 ± 0.858
2.298TyrGlu: 2.298 ± 1.088
1.379TyrPhe: 1.379 ± 0.644
0.46TyrGly: 0.46 ± 0.215
0.46TyrHis: 0.46 ± 0.215
3.217TyrIle: 3.217 ± 1.502
4.596TyrLys: 4.596 ± 2.776
4.136TyrLeu: 4.136 ± 1.742
0.919TyrMet: 0.919 ± 0.429
0.919TyrAsn: 0.919 ± 0.429
1.838TyrPro: 1.838 ± 0.858
1.838TyrGln: 1.838 ± 0.858
2.757TyrArg: 2.757 ± 0.737
2.757TyrSer: 2.757 ± 1.003
0.0TyrThr: 0.0 ± 0.0
0.46TyrVal: 0.46 ± 0.215
1.379TyrTrp: 1.379 ± 1.087
1.838TyrTyr: 1.838 ± 0.858
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2177 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski