Amino acid dipeptide frequency for Hibiscus bacilliform virus GD1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25%. This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
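The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function name `dipeptide_frequencies` is hypothetical, sequences are assumed to be given in one-letter code, and dipeptides are assumed to be counted as overlapping windows within each protein (never across protein boundaries).

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption (not stated in the source): every overlapping pair of
    adjacent residues inside a protein counts once, and pairs are never
    formed across two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real viral sequences.
freqs = dipeptide_frequencies(["AAK", "KA"])
```

For the toy input the three observed dipeptides (AA, AK, KA) each occur once, so each gets one third of the total.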

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
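As a small worked example of the unit conversion: the uniform expectation of 0.25% corresponds to 2.5‰, and a table value such as 3.893‰ corresponds to a fraction of 0.003893. The helper names below are hypothetical, and the simple comparison against 2.5‰ is an assumption about how "more than expected" could be decided; the page does not state the exact rule used for the red and blue marking.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Classify a dipeptide against the expected frequency.

    Assumption: a plain threshold comparison, with no allowance for the
    reported error.
    """
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values taken from the table: GluGlu 11.246, AlaCys 0.433.
glu_glu = representation(11.246)
ala_cys = representation(0.433)
```

Under this assumed rule GluGlu would count as overrepresented and AlaCys as underrepresented.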

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.893 ± 1.141
AlaCys: 0.433 ± 1.363
AlaAsp: 1.298 ± 0.65
AlaGlu: 3.893 ± 1.285
AlaPhe: 4.758 ± 1.517
AlaGly: 3.028 ± 0.968
AlaHis: 0.865 ± 1.237
AlaIle: 3.893 ± 1.175
AlaLys: 2.163 ± 0.963
AlaLeu: 4.758 ± 3.378
AlaMet: 2.595 ± 1.299
AlaAsn: 1.298 ± 1.943
AlaPro: 3.028 ± 0.968
AlaGln: 2.595 ± 2.276
AlaArg: 3.893 ± 1.175
AlaSer: 5.19 ± 3.564
AlaThr: 4.325 ± 3.757
AlaVal: 3.893 ± 2.438
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.028 ± 1.516
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.865 ± 1.212
CysCys: 0.433 ± 0.217
CysAsp: 0.0 ± 0.0
CysGlu: 0.433 ± 0.217
CysPhe: 0.865 ± 0.433
CysGly: 0.865 ± 0.433
CysHis: 0.0 ± 0.0
CysIle: 0.433 ± 0.217
CysLys: 1.73 ± 0.866
CysLeu: 0.0 ± 0.0
CysMet: 0.433 ± 0.217
CysAsn: 0.433 ± 0.217
CysPro: 0.865 ± 0.433
CysGln: 0.433 ± 0.217
CysArg: 0.865 ± 0.433
CysSer: 1.73 ± 0.866
CysThr: 0.865 ± 0.433
CysVal: 0.433 ± 0.217
CysTrp: 0.0 ± 0.0
CysTyr: 0.865 ± 0.433
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.595 ± 1.299
AspCys: 0.433 ± 0.217
AspAsp: 2.595 ± 1.299
AspGlu: 5.623 ± 2.815
AspPhe: 0.865 ± 0.433
AspGly: 2.595 ± 1.299
AspHis: 0.433 ± 1.363
AspIle: 3.028 ± 1.379
AspLys: 3.46 ± 1.092
AspLeu: 5.19 ± 1.928
AspMet: 0.865 ± 0.433
AspAsn: 2.595 ± 2.905
AspPro: 3.46 ± 1.092
AspGln: 1.73 ± 0.866
AspArg: 1.73 ± 2.424
AspSer: 1.73 ± 1.762
AspThr: 4.758 ± 1.844
AspVal: 0.865 ± 0.433
AspTrp: 0.865 ± 0.433
AspTyr: 2.595 ± 0.925
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.353 ± 0.119
GluCys: 1.298 ± 0.65
GluAsp: 6.92 ± 0.122
GluGlu: 11.246 ± 1.334
GluPhe: 0.865 ± 0.433
GluGly: 6.488 ± 1.759
GluHis: 1.73 ± 0.866
GluIle: 3.893 ± 2.298
GluLys: 9.083 ± 3.672
GluLeu: 4.758 ± 1.483
GluMet: 2.595 ± 1.299
GluAsn: 3.028 ± 1.516
GluPro: 2.163 ± 0.963
GluGln: 6.92 ± 1.677
GluArg: 6.055 ± 4.05
GluSer: 4.758 ± 3.206
GluThr: 6.055 ± 2.063
GluVal: 6.488 ± 1.3
GluTrp: 1.298 ± 0.65
GluTyr: 2.595 ± 1.018
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.46 ± 2.027
PheCys: 0.865 ± 0.433
PheAsp: 2.163 ± 0.963
PheGlu: 3.028 ± 1.516
PhePhe: 0.865 ± 0.433
PheGly: 0.865 ± 0.433
PheHis: 1.73 ± 0.866
PheIle: 3.46 ± 1.163
PheLys: 2.163 ± 0.963
PheLeu: 2.163 ± 1.083
PheMet: 0.865 ± 0.433
PheAsn: 1.298 ± 0.65
PhePro: 1.298 ± 0.65
PheGln: 1.73 ± 0.866
PheArg: 2.595 ± 1.299
PheSer: 1.298 ± 0.65
PheThr: 2.163 ± 1.083
PheVal: 0.865 ± 0.433
PheTrp: 0.433 ± 0.217
PheTyr: 0.865 ± 1.212
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.595 ± 0.96
GlyCys: 0.865 ± 0.433
GlyAsp: 1.73 ± 0.866
GlyGlu: 5.19 ± 1.658
GlyPhe: 1.73 ± 1.013
GlyGly: 2.595 ± 1.299
GlyHis: 1.298 ± 0.65
GlyIle: 3.893 ± 1.908
GlyLys: 3.46 ± 1.732
GlyLeu: 4.325 ± 1.069
GlyMet: 0.0 ± 0.0
GlyAsn: 2.595 ± 1.299
GlyPro: 0.865 ± 0.433
GlyGln: 1.73 ± 0.866
GlyArg: 4.325 ± 1.32
GlySer: 2.163 ± 0.932
GlyThr: 3.028 ± 1.005
GlyVal: 2.163 ± 1.083
GlyTrp: 0.865 ± 0.433
GlyTyr: 3.893 ± 1.949
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.298 ± 0.65
HisCys: 0.433 ± 0.217
HisAsp: 0.433 ± 0.217
HisGlu: 1.73 ± 0.987
HisPhe: 0.433 ± 0.217
HisGly: 0.433 ± 0.217
HisHis: 0.865 ± 0.433
HisIle: 3.46 ± 1.247
HisLys: 1.298 ± 1.125
HisLeu: 1.298 ± 0.65
HisMet: 0.433 ± 0.217
HisAsn: 0.433 ± 1.374
HisPro: 1.73 ± 0.866
HisGln: 1.73 ± 0.987
HisArg: 2.163 ± 0.932
HisSer: 1.298 ± 1.948
HisThr: 0.865 ± 1.212
HisVal: 1.298 ± 0.65
HisTrp: 0.433 ± 0.217
HisTyr: 0.865 ± 0.433
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.758 ± 1.943
IleCys: 0.433 ± 0.217
IleAsp: 2.163 ± 1.083
IleGlu: 3.46 ± 3.193
IlePhe: 1.73 ± 0.866
IleGly: 3.893 ± 1.175
IleHis: 2.163 ± 0.932
IleIle: 3.028 ± 2.754
IleLys: 7.353 ± 2.552
IleLeu: 4.758 ± 1.517
IleMet: 0.865 ± 0.433
IleAsn: 2.163 ± 2.29
IlePro: 3.028 ± 1.516
IleGln: 4.325 ± 3.886
IleArg: 4.325 ± 1.32
IleSer: 4.325 ± 2.019
IleThr: 3.46 ± 1.054
IleVal: 2.163 ± 1.083
IleTrp: 0.865 ± 1.212
IleTyr: 1.298 ± 1.125
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.595 ± 0.96
LysCys: 1.73 ± 0.866
LysAsp: 4.758 ± 1.038
LysGlu: 7.785 ± 2.807
LysPhe: 1.73 ± 0.866
LysGly: 4.325 ± 1.069
LysHis: 3.893 ± 1.949
LysIle: 3.893 ± 1.285
LysLys: 6.055 ± 1.767
LysLeu: 6.488 ± 6.54
LysMet: 1.298 ± 0.65
LysAsn: 5.623 ± 2.146
LysPro: 3.028 ± 1.005
LysGln: 3.46 ± 2.592
LysArg: 3.028 ± 0.968
LysSer: 2.595 ± 1.299
LysThr: 0.433 ± 1.374
LysVal: 3.893 ± 1.027
LysTrp: 1.73 ± 1.047
LysTyr: 1.298 ± 0.65
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.028 ± 2.618
LeuCys: 1.73 ± 0.866
LeuAsp: 2.595 ± 2.783
LeuGlu: 8.218 ± 3.417
LeuPhe: 1.73 ± 1.013
LeuGly: 3.46 ± 1.732
LeuHis: 2.163 ± 1.573
LeuIle: 2.595 ± 1.53
LeuLys: 6.488 ± 0.331
LeuLeu: 6.055 ± 0.546
LeuMet: 2.163 ± 1.99
LeuAsn: 2.595 ± 1.018
LeuPro: 4.758 ± 1.038
LeuGln: 4.325 ± 1.41
LeuArg: 3.028 ± 1.516
LeuSer: 6.488 ± 1.772
LeuThr: 6.92 ± 4.915
LeuVal: 4.758 ± 2.016
LeuTrp: 0.865 ± 0.433
LeuTyr: 1.73 ± 0.866
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.595 ± 2.168
MetCys: 0.0 ± 0.0
MetAsp: 2.163 ± 0.932
MetGlu: 3.028 ± 1.516
MetPhe: 2.163 ± 1.083
MetGly: 1.73 ± 0.866
MetHis: 0.865 ± 0.433
MetIle: 0.433 ± 0.217
MetLys: 1.73 ± 0.866
MetLeu: 0.865 ± 0.433
MetMet: 0.433 ± 0.217
MetAsn: 1.73 ± 0.866
MetPro: 1.73 ± 0.866
MetGln: 2.163 ± 1.083
MetArg: 1.298 ± 0.65
MetSer: 1.73 ± 1.762
MetThr: 0.865 ± 0.433
MetVal: 1.298 ± 0.65
MetTrp: 0.0 ± 0.0
MetTyr: 0.433 ± 0.217
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.163 ± 2.29
AsnCys: 0.0 ± 0.0
AsnAsp: 2.163 ± 0.932
AsnGlu: 2.595 ± 1.299
AsnPhe: 2.595 ± 0.925
AsnGly: 1.298 ± 0.65
AsnHis: 0.865 ± 1.237
AsnIle: 2.163 ± 1.083
AsnLys: 3.893 ± 0.996
AsnLeu: 3.028 ± 3.589
AsnMet: 1.73 ± 0.866
AsnAsn: 1.298 ± 2.055
AsnPro: 1.73 ± 0.866
AsnGln: 1.73 ± 1.047
AsnArg: 1.73 ± 1.754
AsnSer: 2.163 ± 1.083
AsnThr: 3.46 ± 2.027
AsnVal: 3.028 ± 2.109
AsnTrp: 1.73 ± 0.987
AsnTyr: 2.163 ± 1.01
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.028 ± 1.005
ProCys: 0.0 ± 0.0
ProAsp: 3.028 ± 1.516
ProGlu: 3.46 ± 1.732
ProPhe: 0.865 ± 0.433
ProGly: 0.865 ± 0.433
ProHis: 1.298 ± 0.65
ProIle: 1.298 ± 0.65
ProLys: 3.46 ± 1.13
ProLeu: 3.028 ± 1.243
ProMet: 1.298 ± 0.609
ProAsn: 1.298 ± 0.65
ProPro: 2.595 ± 0.96
ProGln: 4.758 ± 2.382
ProArg: 1.73 ± 0.866
ProSer: 4.325 ± 3.391
ProThr: 3.893 ± 1.212
ProVal: 3.028 ± 1.516
ProTrp: 0.0 ± 0.0
ProTyr: 3.028 ± 1.005
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.46 ± 3.509
GlnCys: 0.433 ± 0.217
GlnAsp: 1.73 ± 0.866
GlnGlu: 9.083 ± 1.89
GlnPhe: 2.163 ± 1.083
GlnGly: 2.163 ± 1.083
GlnHis: 0.865 ± 0.433
GlnIle: 3.46 ± 1.105
GlnLys: 2.163 ± 1.573
GlnLeu: 5.19 ± 2.341
GlnMet: 3.028 ± 0.968
GlnAsn: 2.595 ± 0.96
GlnPro: 3.893 ± 1.027
GlnGln: 6.488 ± 1.698
GlnArg: 3.893 ± 1.175
GlnSer: 4.325 ± 2.741
GlnThr: 3.028 ± 0.968
GlnVal: 2.163 ± 1.083
GlnTrp: 1.73 ± 1.047
GlnTyr: 0.865 ± 0.433
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.595 ± 1.53
ArgCys: 0.433 ± 0.217
ArgAsp: 3.028 ± 2.062
ArgGlu: 4.758 ± 1.038
ArgPhe: 1.298 ± 0.65
ArgGly: 2.595 ± 0.925
ArgHis: 0.433 ± 1.374
ArgIle: 6.488 ± 2.82
ArgLys: 3.028 ± 1.005
ArgLeu: 7.353 ± 2.222
ArgMet: 2.163 ± 1.083
ArgAsn: 3.46 ± 1.054
ArgPro: 3.028 ± 1.262
ArgGln: 3.893 ± 1.908
ArgArg: 7.785 ± 1.773
ArgSer: 3.028 ± 1.516
ArgThr: 3.893 ± 1.141
ArgVal: 3.028 ± 2.162
ArgTrp: 1.73 ± 1.047
ArgTyr: 1.298 ± 1.084
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 0.865 ± 1.212
SerCys: 1.73 ± 0.866
SerAsp: 3.028 ± 2.733
SerGlu: 3.893 ± 1.027
SerPhe: 3.46 ± 1.732
SerGly: 3.028 ± 1.516
SerHis: 0.865 ± 1.227
SerIle: 3.46 ± 1.105
SerLys: 2.595 ± 2.25
SerLeu: 5.19 ± 3.341
SerMet: 1.73 ± 0.827
SerAsn: 3.46 ± 2.093
SerPro: 3.028 ± 1.516
SerGln: 3.893 ± 2.045
SerArg: 4.758 ± 4.995
SerSer: 6.92 ± 6.497
SerThr: 9.083 ± 2.604
SerVal: 3.028 ± 1.262
SerTrp: 0.433 ± 0.217
SerTyr: 1.298 ± 1.105
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.325 ± 1.069
ThrCys: 0.0 ± 0.0
ThrAsp: 3.893 ± 1.965
ThrGlu: 8.218 ± 3.55
ThrPhe: 2.163 ± 1.01
ThrGly: 5.19 ± 2.598
ThrHis: 1.298 ± 1.084
ThrIle: 4.758 ± 1.038
ThrLys: 2.163 ± 1.01
ThrLeu: 4.758 ± 5.621
ThrMet: 2.595 ± 0.914
ThrAsn: 1.298 ± 0.65
ThrPro: 2.595 ± 0.925
ThrGln: 3.893 ± 1.285
ThrArg: 6.488 ± 0.331
ThrSer: 5.623 ± 1.837
ThrThr: 5.19 ± 1.764
ThrVal: 1.298 ± 0.65
ThrTrp: 0.865 ± 0.433
ThrTyr: 1.298 ± 0.65
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.46 ± 1.092
ValCys: 0.433 ± 0.217
ValAsp: 2.595 ± 1.299
ValGlu: 4.758 ± 2.016
ValPhe: 3.893 ± 1.212
ValGly: 1.73 ± 2.455
ValHis: 0.433 ± 0.217
ValIle: 3.46 ± 2.434
ValLys: 1.73 ± 0.866
ValLeu: 2.595 ± 1.299
ValMet: 1.298 ± 0.65
ValAsn: 2.163 ± 0.963
ValPro: 2.595 ± 1.299
ValGln: 3.46 ± 1.163
ValArg: 3.46 ± 1.247
ValSer: 3.46 ± 2.093
ValThr: 4.325 ± 1.43
ValVal: 2.163 ± 1.083
ValTrp: 0.0 ± 0.0
ValTyr: 1.298 ± 0.65
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.298 ± 1.084
TrpCys: 0.0 ± 0.0
TrpAsp: 0.433 ± 0.217
TrpGlu: 1.73 ± 1.047
TrpPhe: 0.0 ± 0.0
TrpGly: 0.865 ± 0.433
TrpHis: 0.0 ± 0.0
TrpIle: 1.298 ± 0.65
TrpLys: 1.298 ± 0.65
TrpLeu: 0.865 ± 0.433
TrpMet: 0.0 ± 0.0
TrpAsn: 0.433 ± 0.217
TrpPro: 0.433 ± 0.217
TrpGln: 2.163 ± 0.932
TrpArg: 0.433 ± 0.217
TrpSer: 0.433 ± 1.374
TrpThr: 0.865 ± 0.433
TrpVal: 1.298 ± 0.65
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.865 ± 1.237
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.595 ± 1.299
TyrCys: 0.865 ± 0.433
TyrAsp: 1.298 ± 0.65
TyrGlu: 3.46 ± 1.163
TyrPhe: 0.0 ± 0.0
TyrGly: 1.298 ± 0.65
TyrHis: 0.865 ± 1.212
TyrIle: 2.163 ± 0.932
TyrLys: 4.325 ± 1.41
TyrLeu: 3.028 ± 1.072
TyrMet: 0.433 ± 0.217
TyrAsn: 1.73 ± 1.047
TyrPro: 0.433 ± 0.217
TyrGln: 1.298 ± 0.65
TyrArg: 1.73 ± 0.866
TyrSer: 2.163 ± 1.083
TyrThr: 0.865 ± 0.433
TyrVal: 2.163 ± 0.963
TyrTrp: 0.865 ± 0.433
TyrTyr: 0.433 ± 0.217
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2313 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
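A protein-level bootstrap can be sketched as follows. The page does not describe the exact procedure, so this is an assumption-laden illustration: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the standard deviation across replicates is taken as the error. The function names and the fixed seed are hypothetical, and sequences are assumed to be in one-letter code.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        sample = [rng.choice(proteins) for _ in proteins]
        counts = Counter()
        for seq in sample:
            counts.update(dipeptide_counts(seq))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not the real proteome.
mean_ee, sd_ee = bootstrap_error(["MEEKLA", "AEEST", "KKLEE", "MSTAA"], "EE")
```

With only 4 proteins, as in this proteome, each replicate can differ strongly from the others, which may explain why some reported errors are larger than the values themselves.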

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski