Amino acid dipeptide frequency for Banana bunchy top virus (BBTV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
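
As a rough illustration of how such frequencies could be computed, the sketch below counts overlapping residue pairs. It uses one-letter codes instead of the three-letter codes shown in the table, counts pairs within each protein only, and maps any non-standard letter to X (Xaa). These conventions, the function name and the toy sequences are assumptions made for the example, not the documented Proteome-pI procedure.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids

def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    Assumptions: pairs are counted within each protein only, and any
    non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Slide a window of width 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not BBTV proteins).
freqs = dipeptide_frequencies(["MARKAR", "AAR"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Because pairs are not formed across protein boundaries in this sketch, the last residue of one protein and the first residue of the next never contribute a dipeptide.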

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
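
The arithmetic behind the expected value can be written out as a short sketch. The `classify` helper is hypothetical and simply compares an observed value with the uniform expectation; the exact threshold used for the red and blue marking is not stated on the page, so treat this as one plausible reading.

```python
N_STANDARD = 20

n_pairs = N_STANDARD ** 2                 # 400 ordered pairs of standard residues
n_pairs_with_xaa = (N_STANDARD + 1) ** 2  # 441 when Xaa is included
expected_permille = 1000.0 / n_pairs      # uniform expectation: 2.5 per mille (0.25%)

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Two entries taken from the table, in per mille.
print("ArgArg", classify(11.274))
print("AlaAla", classify(1.127))
```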

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
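
For example, the unit conversion can be sketched as follows; the function names are made up for this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (1% equals 10 per mille)."""
    return value_permille / 10.0

arg_arg = 11.274  # ArgArg entry from the table, in per mille
print(permille_to_fraction(arg_arg))
print(permille_to_percent(arg_arg))
```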

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.127 ± 0.971
AlaCys: 0.0 ± 0.0
AlaAsp: 1.127 ± 0.988
AlaGlu: 2.255 ± 1.078
AlaPhe: 3.382 ± 1.504
AlaGly: 3.382 ± 1.289
AlaHis: 1.127 ± 0.757
AlaIle: 2.255 ± 2.056
AlaLys: 1.127 ± 0.757
AlaLeu: 2.255 ± 1.38
AlaMet: 3.382 ± 1.027
AlaAsn: 1.127 ± 0.757
AlaPro: 2.255 ± 1.127
AlaGln: 1.127 ± 0.757
AlaArg: 3.382 ± 1.504
AlaSer: 2.255 ± 1.38
AlaThr: 3.382 ± 2.2
AlaVal: 2.255 ± 1.377
AlaTrp: 1.127 ± 0.757
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.127 ± 1.137
CysAsp: 4.51 ± 2.985
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 1.127 ± 0.757
CysIle: 0.0 ± 0.0
CysLys: 2.255 ± 1.015
CysLeu: 2.255 ± 1.942
CysMet: 2.255 ± 1.399
CysAsn: 2.255 ± 1.078
CysPro: 1.127 ± 0.757
CysGln: 2.255 ± 2.274
CysArg: 1.127 ± 0.757
CysSer: 3.382 ± 1.857
CysThr: 1.127 ± 0.988
CysVal: 3.382 ± 2.316
CysTrp: 1.127 ± 0.757
CysTyr: 1.127 ± 0.988
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.127 ± 1.137
AspCys: 2.255 ± 1.052
AspAsp: 4.51 ± 2.18
AspGlu: 2.255 ± 1.052
AspPhe: 1.127 ± 0.988
AspGly: 1.127 ± 0.988
AspHis: 0.0 ± 0.0
AspIle: 4.51 ± 2.23
AspLys: 1.127 ± 0.971
AspLeu: 3.382 ± 2.316
AspMet: 2.255 ± 1.078
AspAsn: 2.255 ± 1.078
AspPro: 0.0 ± 0.0
AspGln: 3.382 ± 2.151
AspArg: 4.51 ± 1.263
AspSer: 2.255 ± 1.977
AspThr: 2.255 ± 1.514
AspVal: 6.764 ± 2.836
AspTrp: 3.382 ± 2.965
AspTyr: 6.764 ± 1.87
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.637 ± 1.746
GluCys: 2.255 ± 1.127
GluAsp: 9.019 ± 3.041
GluGlu: 5.637 ± 3.062
GluPhe: 4.51 ± 1.263
GluGly: 5.637 ± 2.932
GluHis: 1.127 ± 0.971
GluIle: 4.51 ± 1.915
GluLys: 2.255 ± 1.052
GluLeu: 3.382 ± 1.289
GluMet: 4.51 ± 2.005
GluAsn: 2.255 ± 1.015
GluPro: 2.255 ± 1.015
GluGln: 0.0 ± 0.0
GluArg: 2.255 ± 1.044
GluSer: 2.255 ± 1.37
GluThr: 1.127 ± 0.757
GluVal: 6.764 ± 0.286
GluTrp: 1.127 ± 1.028
GluTyr: 5.637 ± 1.873
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.127 ± 1.137
PheAsp: 3.382 ± 2.272
PheGlu: 4.51 ± 2.008
PhePhe: 4.51 ± 1.833
PheGly: 4.51 ± 2.008
PheHis: 0.0 ± 0.0
PheIle: 3.382 ± 2.2
PheLys: 3.382 ± 1.579
PheLeu: 3.382 ± 1.241
PheMet: 2.255 ± 1.35
PheAsn: 1.127 ± 0.988
PhePro: 1.127 ± 0.757
PheGln: 0.0 ± 0.0
PheArg: 1.127 ± 0.971
PheSer: 4.51 ± 1.37
PheThr: 1.127 ± 0.757
PheVal: 0.0 ± 0.0
PheTrp: 1.127 ± 1.137
PheTyr: 4.51 ± 0.775
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.51 ± 1.527
GlyCys: 1.127 ± 0.988
GlyAsp: 3.382 ± 2.31
GlyGlu: 3.382 ± 0.984
GlyPhe: 2.255 ± 1.078
GlyGly: 4.51 ± 1.37
GlyHis: 0.0 ± 0.0
GlyIle: 4.51 ± 1.527
GlyLys: 5.637 ± 2.596
GlyLeu: 2.255 ± 1.015
GlyMet: 0.0 ± 0.0
GlyAsn: 2.255 ± 1.377
GlyPro: 3.382 ± 1.506
GlyGln: 1.127 ± 0.757
GlyArg: 5.637 ± 2.724
GlySer: 4.51 ± 1.631
GlyThr: 2.255 ± 1.015
GlyVal: 4.51 ± 2.629
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.382 ± 1.925
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 1.127 ± 0.971
HisGlu: 1.127 ± 1.137
HisPhe: 0.0 ± 0.0
HisGly: 1.127 ± 0.988
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 1.127 ± 0.757
HisLeu: 3.382 ± 1.504
HisMet: 0.0 ± 0.0
HisAsn: 1.127 ± 0.988
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 1.127 ± 0.757
HisSer: 1.127 ± 0.971
HisThr: 0.0 ± 0.0
HisVal: 2.255 ± 1.015
HisTrp: 1.127 ± 0.971
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.127 ± 1.028
IleCys: 2.255 ± 1.052
IleAsp: 1.127 ± 0.988
IleGlu: 3.382 ± 0.984
IlePhe: 3.382 ± 1.506
IleGly: 0.0 ± 0.0
IleHis: 0.0 ± 0.0
IleIle: 3.382 ± 1.438
IleLys: 9.019 ± 1.836
IleLeu: 5.637 ± 2.721
IleMet: 2.255 ± 1.052
IleAsn: 3.382 ± 1.836
IlePro: 2.255 ± 1.044
IleGln: 2.255 ± 1.514
IleArg: 3.382 ± 1.533
IleSer: 4.51 ± 2.985
IleThr: 2.255 ± 1.416
IleVal: 7.892 ± 1.79
IleTrp: 2.255 ± 1.015
IleTyr: 2.255 ± 2.274
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.255 ± 1.015
LysCys: 1.127 ± 0.988
LysAsp: 2.255 ± 1.052
LysGlu: 5.637 ± 1.896
LysPhe: 1.127 ± 1.137
LysGly: 2.255 ± 1.078
LysHis: 1.127 ± 0.757
LysIle: 3.382 ± 1.142
LysLys: 5.637 ± 2.409
LysLeu: 6.764 ± 1.571
LysMet: 1.127 ± 0.971
LysAsn: 2.255 ± 1.015
LysPro: 3.382 ± 2.012
LysGln: 2.255 ± 1.044
LysArg: 5.637 ± 1.775
LysSer: 6.764 ± 1.87
LysThr: 7.892 ± 1.459
LysVal: 4.51 ± 2.253
LysTrp: 0.0 ± 0.0
LysTyr: 4.51 ± 1.497
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.255 ± 1.315
LeuCys: 1.127 ± 0.988
LeuAsp: 1.127 ± 0.757
LeuGlu: 6.764 ± 2.003
LeuPhe: 6.764 ± 2.76
LeuGly: 2.255 ± 1.315
LeuHis: 1.127 ± 0.971
LeuIle: 1.127 ± 0.988
LeuLys: 6.764 ± 2.898
LeuLeu: 9.019 ± 2.145
LeuMet: 2.255 ± 0.916
LeuAsn: 2.255 ± 1.052
LeuPro: 3.382 ± 1.504
LeuGln: 0.0 ± 0.0
LeuArg: 4.51 ± 2.253
LeuSer: 3.382 ± 1.084
LeuThr: 2.255 ± 2.056
LeuVal: 10.147 ± 4.339
LeuTrp: 0.0 ± 0.0
LeuTyr: 7.892 ± 0.879
LeuXaa: 0.0 ± 0.0
Met
MetAla: 6.764 ± 1.463
MetCys: 0.0 ± 0.0
MetAsp: 1.127 ± 0.988
MetGlu: 2.255 ± 1.37
MetPhe: 3.382 ± 1.245
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 1.127 ± 0.971
MetLys: 3.382 ± 2.272
MetLeu: 1.127 ± 0.971
MetMet: 0.0 ± 0.0
MetAsn: 2.255 ± 1.514
MetPro: 1.127 ± 1.137
MetGln: 1.127 ± 1.028
MetArg: 4.51 ± 2.23
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 2.255 ± 1.052
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.127 ± 0.757
AsnCys: 1.127 ± 0.988
AsnAsp: 2.255 ± 1.078
AsnGlu: 1.127 ± 0.988
AsnPhe: 1.127 ± 0.757
AsnGly: 3.382 ± 1.579
AsnHis: 0.0 ± 0.0
AsnIle: 1.127 ± 0.757
AsnLys: 2.255 ± 1.052
AsnLeu: 1.127 ± 0.757
AsnMet: 0.0 ± 0.0
AsnAsn: 3.382 ± 1.925
AsnPro: 2.255 ± 1.044
AsnGln: 1.127 ± 0.971
AsnArg: 3.382 ± 1.884
AsnSer: 3.382 ± 1.142
AsnThr: 2.255 ± 1.015
AsnVal: 4.51 ± 1.463
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.382 ± 1.504
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.127 ± 1.028
ProCys: 3.382 ± 1.142
ProAsp: 1.127 ± 1.137
ProGlu: 2.255 ± 1.942
ProPhe: 2.255 ± 1.078
ProGly: 5.637 ± 1.737
ProHis: 0.0 ± 0.0
ProIle: 0.0 ± 0.0
ProLys: 3.382 ± 1.245
ProLeu: 1.127 ± 0.757
ProMet: 1.127 ± 1.028
ProAsn: 2.255 ± 1.514
ProPro: 0.0 ± 0.0
ProGln: 1.127 ± 1.028
ProArg: 3.382 ± 0.984
ProSer: 1.127 ± 1.028
ProThr: 3.382 ± 1.836
ProVal: 3.382 ± 1.579
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.255 ± 1.044
GlnCys: 0.0 ± 0.0
GlnAsp: 2.255 ± 1.052
GlnGlu: 4.51 ± 2.104
GlnPhe: 1.127 ± 0.988
GlnGly: 2.255 ± 1.044
GlnHis: 0.0 ± 0.0
GlnIle: 1.127 ± 1.028
GlnLys: 2.255 ± 2.274
GlnLeu: 2.255 ± 2.056
GlnMet: 1.127 ± 0.757
GlnAsn: 1.127 ± 0.971
GlnPro: 3.382 ± 1.245
GlnGln: 2.255 ± 2.056
GlnArg: 1.127 ± 1.028
GlnSer: 1.127 ± 0.757
GlnThr: 0.0 ± 0.0
GlnVal: 2.255 ± 1.044
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.127 ± 0.988
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.0 ± 0.0
ArgCys: 1.127 ± 0.757
ArgAsp: 5.637 ± 2.51
ArgGlu: 2.255 ± 1.052
ArgPhe: 0.0 ± 0.0
ArgGly: 5.637 ± 2.069
ArgHis: 1.127 ± 0.757
ArgIle: 9.019 ± 2.056
ArgLys: 7.892 ± 2.248
ArgLeu: 2.255 ± 1.052
ArgMet: 0.0 ± 0.0
ArgAsn: 3.382 ± 1.142
ArgPro: 2.255 ± 1.078
ArgGln: 1.127 ± 0.988
ArgArg: 11.274 ± 4.659
ArgSer: 4.51 ± 2.23
ArgThr: 2.255 ± 1.044
ArgVal: 4.51 ± 2.046
ArgTrp: 0.0 ± 0.0
ArgTyr: 5.637 ± 1.723
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.382 ± 1.142
SerCys: 2.255 ± 1.514
SerAsp: 5.637 ± 2.477
SerGlu: 4.51 ± 1.263
SerPhe: 2.255 ± 1.078
SerGly: 5.637 ± 1.975
SerHis: 2.255 ± 1.942
SerIle: 9.019 ± 3.346
SerLys: 2.255 ± 1.015
SerLeu: 4.51 ± 1.347
SerMet: 1.127 ± 1.137
SerAsn: 3.382 ± 2.101
SerPro: 2.255 ± 1.078
SerGln: 4.51 ± 2.008
SerArg: 1.127 ± 0.988
SerSer: 11.274 ± 3.905
SerThr: 2.255 ± 2.274
SerVal: 1.127 ± 1.137
SerTrp: 3.382 ± 1.027
SerTyr: 2.255 ± 1.052
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.0 ± 0.0
ThrCys: 2.255 ± 1.127
ThrAsp: 1.127 ± 0.971
ThrGlu: 3.382 ± 1.531
ThrPhe: 1.127 ± 0.757
ThrGly: 2.255 ± 1.416
ThrHis: 2.255 ± 1.078
ThrIle: 2.255 ± 1.044
ThrLys: 1.127 ± 0.757
ThrLeu: 4.51 ± 1.237
ThrMet: 0.0 ± 0.0
ThrAsn: 0.0 ± 0.0
ThrPro: 0.0 ± 0.0
ThrGln: 2.255 ± 1.38
ThrArg: 4.51 ± 2.143
ThrSer: 5.637 ± 1.696
ThrThr: 5.637 ± 2.025
ThrVal: 2.255 ± 1.942
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.255 ± 1.015
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.255 ± 1.127
ValCys: 4.51 ± 1.631
ValAsp: 2.255 ± 1.127
ValGlu: 11.274 ± 3.496
ValPhe: 3.382 ± 1.289
ValGly: 3.382 ± 1.531
ValHis: 0.0 ± 0.0
ValIle: 9.019 ± 3.792
ValLys: 6.764 ± 3.285
ValLeu: 6.764 ± 1.378
ValMet: 2.255 ± 1.481
ValAsn: 1.127 ± 1.028
ValPro: 4.51 ± 2.083
ValGln: 2.255 ± 1.514
ValArg: 3.382 ± 1.533
ValSer: 5.637 ± 1.449
ValThr: 2.255 ± 1.315
ValVal: 3.382 ± 1.142
ValTrp: 1.127 ± 1.028
ValTyr: 5.637 ± 3.062
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.255 ± 1.127
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 3.382 ± 2.258
TrpPhe: 1.127 ± 1.028
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 2.255 ± 1.127
TrpLeu: 0.0 ± 0.0
TrpMet: 2.255 ± 1.044
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.127 ± 0.757
TrpSer: 2.255 ± 1.37
TrpThr: 1.127 ± 0.757
TrpVal: 1.127 ± 0.757
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.127 ± 0.757
TyrCys: 3.382 ± 1.027
TyrAsp: 2.255 ± 1.052
TyrGlu: 2.255 ± 1.514
TyrPhe: 2.255 ± 1.127
TyrGly: 5.637 ± 0.829
TyrHis: 3.382 ± 1.289
TyrIle: 2.255 ± 2.056
TyrLys: 0.0 ± 0.0
TyrLeu: 9.019 ± 2.125
TyrMet: 1.127 ± 0.757
TyrAsn: 1.127 ± 0.757
TyrPro: 1.127 ± 0.971
TyrGln: 3.382 ± 1.027
TyrArg: 3.382 ± 2.056
TyrSer: 4.51 ± 1.915
TyrThr: 0.0 ± 0.0
TyrVal: 9.019 ± 3.685
TyrTrp: 1.127 ± 1.137
TyrTyr: 1.127 ± 0.988
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (888 amino acids)
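
With only 888 residues the table values are coarse: each one looks like a small integer count divided by a fixed total. The sketch below checks one possible reading, namely that the denominator is 887 (the number of residues minus one). This denominator is inferred from the numbers only and is not documented on the page, so it should be treated as an assumption.

```python
TOTAL_RESIDUES = 888
ASSUMED_PAIRS = TOTAL_RESIDUES - 1  # assumption: inferred, not documented

def permille_for_count(count, total_pairs=ASSUMED_PAIRS):
    """Per-mille frequency of a dipeptide seen `count` times."""
    return round(1000.0 * count / total_pairs, 3)

# Distinct non-zero values that occur in the table, in per mille.
table_values = [1.127, 2.255, 3.382, 4.51, 5.637,
                6.764, 7.892, 9.019, 10.147, 11.274]

# Values predicted for 1 to 10 occurrences under the assumed denominator.
reconstructed = [permille_for_count(k) for k in range(1, 11)]
print(reconstructed == table_values)
```

Under this reading, the smallest non-zero entry (1.127‰) would correspond to a dipeptide observed once in the whole proteome.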

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
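
A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates serves as the error. The pooled counting, the use of the standard deviation as the error, the function names and the toy sequences are assumptions for illustration; the page does not describe the exact estimator.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille
    frequency over n_boot resamples of whole proteins."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins (not residues) with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input (hypothetical sequences, not BBTV proteins).
toy = ["MARKAR", "AAR", "MKKLA"]
print(bootstrap_error(toy, "AR"))
print(bootstrap_error(toy, "WW"))  # a pair that never occurs
```

A dipeptide that is absent from every protein yields zero in every replicate, which matches the 0.0 ± 0.0 entries in the table.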

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski