Amino acid dipepetide frequency for Giardia lamblia virus (isolate Wang) (GLV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.298AlaAla: 4.298 ± 0.665
0.0AlaCys: 0.0 ± 0.0
4.298AlaAsp: 4.298 ± 0.108
3.223AlaGlu: 3.223 ± 0.081
2.149AlaPhe: 2.149 ± 0.054
2.865AlaGly: 2.865 ± 0.258
1.074AlaHis: 1.074 ± 0.027
3.582AlaIle: 3.582 ± 0.461
3.94AlaLys: 3.94 ± 0.841
10.029AlaLeu: 10.029 ± 0.623
0.716AlaMet: 0.716 ± 0.173
0.0AlaAsn: 0.0 ± 0.0
5.731AlaPro: 5.731 ± 0.515
2.149AlaGln: 2.149 ± 0.611
4.656AlaArg: 4.656 ± 1.182
6.805AlaSer: 6.805 ± 0.014
4.298AlaThr: 4.298 ± 0.108
2.149AlaVal: 2.149 ± 0.503
1.433AlaTrp: 1.433 ± 0.15
2.507AlaTyr: 2.507 ± 0.434
0.0AlaXaa: 0.0 ± 0.0
Cys
1.074CysAla: 1.074 ± 0.027
0.0CysCys: 0.0 ± 0.0
1.074CysAsp: 1.074 ± 0.027
0.358CysGlu: 0.358 ± 0.177
0.0CysPhe: 0.0 ± 0.0
0.358CysGly: 0.358 ± 0.177
0.0CysHis: 0.0 ± 0.0
0.358CysIle: 0.358 ± 0.177
0.358CysLys: 0.358 ± 0.177
1.074CysLeu: 1.074 ± 0.584
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.433CysPro: 1.433 ± 0.407
0.0CysGln: 0.0 ± 0.0
0.358CysArg: 0.358 ± 0.177
1.433CysSer: 1.433 ± 0.407
0.358CysThr: 0.358 ± 0.38
0.358CysVal: 0.358 ± 0.177
0.0CysTrp: 0.0 ± 0.0
1.433CysTyr: 1.433 ± 0.407
0.0CysXaa: 0.0 ± 0.0
Asp
3.223AspAla: 3.223 ± 0.638
0.0AspCys: 0.0 ± 0.0
1.433AspAsp: 1.433 ± 0.15
1.074AspGlu: 1.074 ± 0.027
3.223AspPhe: 3.223 ± 0.081
2.149AspGly: 2.149 ± 0.054
1.074AspHis: 1.074 ± 0.027
3.582AspIle: 3.582 ± 0.096
3.223AspLys: 3.223 ± 0.476
4.298AspLeu: 4.298 ± 0.449
2.507AspMet: 2.507 ± 0.434
2.507AspAsn: 2.507 ± 0.123
1.791AspPro: 1.791 ± 0.326
0.358AspGln: 0.358 ± 0.177
4.656AspArg: 4.656 ± 0.068
0.358AspSer: 0.358 ± 0.177
5.014AspThr: 5.014 ± 0.312
4.298AspVal: 4.298 ± 0.665
1.074AspTrp: 1.074 ± 0.027
1.433AspTyr: 1.433 ± 0.407
0.0AspXaa: 0.0 ± 0.0
Glu
1.433GluAla: 1.433 ± 0.15
0.0GluCys: 0.0 ± 0.0
1.791GluAsp: 1.791 ± 0.231
3.582GluGlu: 3.582 ± 0.096
1.791GluPhe: 1.791 ± 0.326
2.149GluGly: 2.149 ± 0.611
1.074GluHis: 1.074 ± 0.027
2.507GluIle: 2.507 ± 0.434
0.358GluLys: 0.358 ± 0.177
4.656GluLeu: 4.656 ± 1.739
0.0GluMet: 0.0 ± 0.0
2.865GluAsn: 2.865 ± 0.258
2.507GluPro: 2.507 ± 0.679
1.433GluGln: 1.433 ± 0.706
4.298GluArg: 4.298 ± 0.665
2.149GluSer: 2.149 ± 0.054
1.791GluThr: 1.791 ± 0.326
4.656GluVal: 4.656 ± 0.488
0.358GluTrp: 0.358 ± 0.177
0.716GluTyr: 0.716 ± 0.353
0.0GluXaa: 0.0 ± 0.0
Phe
2.865PheAla: 2.865 ± 0.814
0.0PheCys: 0.0 ± 0.0
2.149PheAsp: 2.149 ± 0.054
1.433PheGlu: 1.433 ± 0.15
1.791PhePhe: 1.791 ± 0.231
2.507PheGly: 2.507 ± 0.434
0.716PheHis: 0.716 ± 0.353
1.791PheIle: 1.791 ± 0.231
1.791PheLys: 1.791 ± 0.326
5.014PheLeu: 5.014 ± 0.245
0.358PheMet: 0.358 ± 0.177
4.298PheAsn: 4.298 ± 0.108
2.149PhePro: 2.149 ± 0.054
1.791PheGln: 1.791 ± 0.231
2.507PheArg: 2.507 ± 0.434
4.656PheSer: 4.656 ± 0.068
2.507PheThr: 2.507 ± 0.679
2.507PheVal: 2.507 ± 0.679
0.716PheTrp: 0.716 ± 0.353
2.149PheTyr: 2.149 ± 0.054
0.0PheXaa: 0.0 ± 0.0
Gly
2.149GlyAla: 2.149 ± 0.054
0.716GlyCys: 0.716 ± 0.204
2.507GlyAsp: 2.507 ± 1.236
1.433GlyGlu: 1.433 ± 0.407
2.507GlyPhe: 2.507 ± 0.679
4.298GlyGly: 4.298 ± 0.108
0.0GlyHis: 0.0 ± 0.0
2.865GlyIle: 2.865 ± 0.258
2.507GlyLys: 2.507 ± 0.679
6.805GlyLeu: 6.805 ± 0.542
1.074GlyMet: 1.074 ± 0.027
4.298GlyAsn: 4.298 ± 0.665
3.582GlyPro: 3.582 ± 0.461
2.507GlyGln: 2.507 ± 0.434
3.94GlyArg: 3.94 ± 0.272
6.089GlySer: 6.089 ± 0.218
3.94GlyThr: 3.94 ± 0.285
2.865GlyVal: 2.865 ± 0.258
1.791GlyTrp: 1.791 ± 0.231
5.014GlyTyr: 5.014 ± 0.245
0.0GlyXaa: 0.0 ± 0.0
His
2.149HisAla: 2.149 ± 0.054
0.716HisCys: 0.716 ± 0.204
1.433HisAsp: 1.433 ± 0.407
1.433HisGlu: 1.433 ± 0.15
0.716HisPhe: 0.716 ± 0.353
1.791HisGly: 1.791 ± 0.231
0.358HisHis: 0.358 ± 0.38
3.223HisIle: 3.223 ± 0.638
0.0HisLys: 0.0 ± 0.0
1.791HisLeu: 1.791 ± 0.231
1.074HisMet: 1.074 ± 0.027
0.716HisAsn: 0.716 ± 0.204
0.716HisPro: 0.716 ± 0.353
0.0HisGln: 0.0 ± 0.0
3.582HisArg: 3.582 ± 0.096
1.074HisSer: 1.074 ± 0.027
1.791HisThr: 1.791 ± 0.787
2.507HisVal: 2.507 ± 0.123
0.0HisTrp: 0.0 ± 0.0
1.791HisTyr: 1.791 ± 0.326
0.0HisXaa: 0.0 ± 0.0
Ile
3.582IleAla: 3.582 ± 0.461
0.0IleCys: 0.0 ± 0.0
3.223IleAsp: 3.223 ± 0.638
1.074IleGlu: 1.074 ± 0.027
4.656IlePhe: 4.656 ± 0.488
4.656IleGly: 4.656 ± 0.068
1.074IleHis: 1.074 ± 0.027
1.074IleIle: 1.074 ± 0.027
2.149IleLys: 2.149 ± 0.054
7.163IleLeu: 7.163 ± 0.191
1.791IleMet: 1.791 ± 0.231
1.433IleAsn: 1.433 ± 0.15
6.089IlePro: 6.089 ± 0.339
0.716IleGln: 0.716 ± 0.353
3.223IleArg: 3.223 ± 0.081
3.94IleSer: 3.94 ± 0.272
3.223IleThr: 3.223 ± 0.638
5.731IleVal: 5.731 ± 0.041
0.716IleTrp: 0.716 ± 0.204
1.791IleTyr: 1.791 ± 0.883
0.0IleXaa: 0.0 ± 0.0
Lys
3.223LysAla: 3.223 ± 0.081
0.358LysCys: 0.358 ± 0.177
2.149LysAsp: 2.149 ± 0.611
1.433LysGlu: 1.433 ± 0.706
1.074LysPhe: 1.074 ± 0.53
1.791LysGly: 1.791 ± 0.231
1.433LysHis: 1.433 ± 0.15
2.865LysIle: 2.865 ± 0.299
1.433LysLys: 1.433 ± 0.706
2.865LysLeu: 2.865 ± 0.258
0.0LysMet: 0.0 ± 0.0
0.358LysAsn: 0.358 ± 0.177
1.433LysPro: 1.433 ± 0.706
1.433LysGln: 1.433 ± 0.407
1.791LysArg: 1.791 ± 0.883
1.433LysSer: 1.433 ± 0.15
3.223LysThr: 3.223 ± 0.081
4.656LysVal: 4.656 ± 0.625
0.716LysTrp: 0.716 ± 0.353
1.074LysTyr: 1.074 ± 0.53
0.0LysXaa: 0.0 ± 0.0
Leu
8.954LeuAla: 8.954 ± 0.04
1.433LeuCys: 1.433 ± 0.15
5.731LeuAsp: 5.731 ± 0.515
2.865LeuGlu: 2.865 ± 0.856
2.149LeuPhe: 2.149 ± 0.054
7.163LeuGly: 7.163 ± 0.366
3.223LeuHis: 3.223 ± 0.638
5.731LeuIle: 5.731 ± 0.041
2.149LeuLys: 2.149 ± 0.503
9.312LeuLeu: 9.312 ± 1.807
2.149LeuMet: 2.149 ± 0.439
2.507LeuAsn: 2.507 ± 0.434
5.731LeuPro: 5.731 ± 0.041
5.372LeuGln: 5.372 ± 0.422
4.656LeuArg: 4.656 ± 1.182
11.103LeuSer: 11.103 ± 0.65
10.387LeuThr: 10.387 ± 0.667
6.805LeuVal: 6.805 ± 0.571
2.507LeuTrp: 2.507 ± 0.123
5.014LeuTyr: 5.014 ± 0.312
0.0LeuXaa: 0.0 ± 0.0
Met
1.433MetAla: 1.433 ± 0.15
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.433MetGlu: 1.433 ± 0.407
2.149MetPhe: 2.149 ± 0.611
0.716MetGly: 0.716 ± 0.353
1.074MetHis: 1.074 ± 0.027
0.716MetIle: 0.716 ± 0.353
0.0MetLys: 0.0 ± 0.0
1.791MetLeu: 1.791 ± 0.326
1.433MetMet: 1.433 ± 0.15
1.074MetAsn: 1.074 ± 0.027
1.074MetPro: 1.074 ± 0.027
0.716MetGln: 0.716 ± 0.204
1.433MetArg: 1.433 ± 0.407
1.433MetSer: 1.433 ± 0.15
2.149MetThr: 2.149 ± 0.054
2.149MetVal: 2.149 ± 0.611
0.358MetTrp: 0.358 ± 0.177
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.507AsnAla: 2.507 ± 0.434
1.433AsnCys: 1.433 ± 0.407
1.074AsnAsp: 1.074 ± 0.027
1.074AsnGlu: 1.074 ± 0.027
0.716AsnPhe: 0.716 ± 0.353
2.865AsnGly: 2.865 ± 0.258
1.791AsnHis: 1.791 ± 0.231
1.791AsnIle: 1.791 ± 0.231
0.716AsnLys: 0.716 ± 0.353
7.163AsnLeu: 7.163 ± 0.366
1.074AsnMet: 1.074 ± 0.027
3.582AsnAsn: 3.582 ± 0.461
3.223AsnPro: 3.223 ± 0.081
0.716AsnGln: 0.716 ± 0.204
2.149AsnArg: 2.149 ± 0.611
0.716AsnSer: 0.716 ± 0.204
2.149AsnThr: 2.149 ± 0.054
4.298AsnVal: 4.298 ± 0.108
0.358AsnTrp: 0.358 ± 0.177
2.507AsnTyr: 2.507 ± 0.991
0.0AsnXaa: 0.0 ± 0.0
Pro
2.865ProAla: 2.865 ± 0.299
0.0ProCys: 0.0 ± 0.0
2.507ProAsp: 2.507 ± 0.679
4.656ProGlu: 4.656 ± 0.068
0.716ProPhe: 0.716 ± 0.204
2.149ProGly: 2.149 ± 0.503
2.865ProHis: 2.865 ± 0.258
7.88ProIle: 7.88 ± 1.683
3.582ProLys: 3.582 ± 0.652
5.731ProLeu: 5.731 ± 0.515
0.716ProMet: 0.716 ± 0.204
2.149ProAsn: 2.149 ± 0.503
2.507ProPro: 2.507 ± 0.123
2.507ProGln: 2.507 ± 0.434
1.074ProArg: 1.074 ± 0.53
4.656ProSer: 4.656 ± 0.068
4.656ProThr: 4.656 ± 0.488
4.656ProVal: 4.656 ± 0.625
2.149ProTrp: 2.149 ± 0.054
3.582ProTyr: 3.582 ± 0.096
0.0ProXaa: 0.0 ± 0.0
Gln
1.074GlnAla: 1.074 ± 0.53
0.0GlnCys: 0.0 ± 0.0
0.716GlnAsp: 0.716 ± 0.353
2.149GlnGlu: 2.149 ± 0.611
2.507GlnPhe: 2.507 ± 0.434
1.791GlnGly: 1.791 ± 0.326
0.0GlnHis: 0.0 ± 0.0
1.433GlnIle: 1.433 ± 0.15
0.358GlnLys: 0.358 ± 0.177
3.582GlnLeu: 3.582 ± 1.209
0.716GlnMet: 0.716 ± 0.353
3.223GlnAsn: 3.223 ± 0.638
3.582GlnPro: 3.582 ± 0.461
2.149GlnGln: 2.149 ± 0.054
2.865GlnArg: 2.865 ± 0.258
3.582GlnSer: 3.582 ± 0.096
2.149GlnThr: 2.149 ± 0.054
1.791GlnVal: 1.791 ± 0.231
1.074GlnTrp: 1.074 ± 0.027
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.94ArgAla: 3.94 ± 0.272
1.433ArgCys: 1.433 ± 0.407
3.223ArgAsp: 3.223 ± 0.081
2.149ArgGlu: 2.149 ± 0.503
5.014ArgPhe: 5.014 ± 0.312
4.298ArgGly: 4.298 ± 0.108
2.865ArgHis: 2.865 ± 0.299
2.865ArgIle: 2.865 ± 0.258
1.433ArgLys: 1.433 ± 0.706
6.447ArgLeu: 6.447 ± 0.395
1.791ArgMet: 1.791 ± 0.326
1.074ArgAsn: 1.074 ± 0.027
1.433ArgPro: 1.433 ± 0.15
2.149ArgGln: 2.149 ± 0.503
1.791ArgArg: 1.791 ± 0.326
5.014ArgSer: 5.014 ± 1.359
5.372ArgThr: 5.372 ± 0.422
3.223ArgVal: 3.223 ± 0.081
0.358ArgTrp: 0.358 ± 0.177
1.791ArgTyr: 1.791 ± 0.231
0.0ArgXaa: 0.0 ± 0.0
Ser
7.163SerAla: 7.163 ± 0.366
0.716SerCys: 0.716 ± 0.204
4.298SerAsp: 4.298 ± 0.108
5.372SerGlu: 5.372 ± 0.978
3.582SerPhe: 3.582 ± 0.461
4.656SerGly: 4.656 ± 0.625
2.149SerHis: 2.149 ± 0.054
5.731SerIle: 5.731 ± 0.515
3.223SerLys: 3.223 ± 0.638
6.805SerLeu: 6.805 ± 0.014
2.149SerMet: 2.149 ± 0.503
4.656SerAsn: 4.656 ± 0.068
6.805SerPro: 6.805 ± 1.099
2.507SerGln: 2.507 ± 0.679
4.656SerArg: 4.656 ± 0.068
10.745SerSer: 10.745 ± 1.384
4.298SerThr: 4.298 ± 0.108
5.014SerVal: 5.014 ± 0.245
0.358SerTrp: 0.358 ± 0.177
3.582SerTyr: 3.582 ± 0.652
0.0SerXaa: 0.0 ± 0.0
Thr
5.372ThrAla: 5.372 ± 0.135
1.433ThrCys: 1.433 ± 0.407
1.433ThrAsp: 1.433 ± 0.15
1.433ThrGlu: 1.433 ± 0.15
4.298ThrPhe: 4.298 ± 0.449
5.731ThrGly: 5.731 ± 0.041
2.865ThrHis: 2.865 ± 0.258
1.433ThrIle: 1.433 ± 0.15
1.433ThrLys: 1.433 ± 0.15
12.536ThrLeu: 12.536 ± 1.058
1.433ThrMet: 1.433 ± 0.15
0.716ThrAsn: 0.716 ± 0.204
2.865ThrPro: 2.865 ± 0.299
2.149ThrGln: 2.149 ± 0.054
2.865ThrArg: 2.865 ± 0.258
8.954ThrSer: 8.954 ± 0.04
6.447ThrThr: 6.447 ± 0.162
5.731ThrVal: 5.731 ± 0.515
0.0ThrTrp: 0.0 ± 0.0
3.582ThrTyr: 3.582 ± 0.096
0.0ThrXaa: 0.0 ± 0.0
Val
3.94ValAla: 3.94 ± 0.285
0.716ValCys: 0.716 ± 0.353
5.014ValAsp: 5.014 ± 0.245
2.507ValGlu: 2.507 ± 0.434
3.223ValPhe: 3.223 ± 0.081
3.223ValGly: 3.223 ± 0.081
1.791ValHis: 1.791 ± 0.231
3.223ValIle: 3.223 ± 1.032
2.507ValLys: 2.507 ± 0.123
3.582ValLeu: 3.582 ± 0.096
1.791ValMet: 1.791 ± 0.787
3.582ValAsn: 3.582 ± 1.018
6.089ValPro: 6.089 ± 1.332
3.94ValGln: 3.94 ± 0.272
4.656ValArg: 4.656 ± 1.182
6.447ValSer: 6.447 ± 0.162
3.94ValThr: 3.94 ± 0.272
3.582ValVal: 3.582 ± 0.652
2.149ValTrp: 2.149 ± 0.611
3.223ValTyr: 3.223 ± 0.638
0.0ValXaa: 0.0 ± 0.0
Trp
0.358TrpAla: 0.358 ± 0.177
0.0TrpCys: 0.0 ± 0.0
1.791TrpAsp: 1.791 ± 0.787
0.358TrpGlu: 0.358 ± 0.177
0.358TrpPhe: 0.358 ± 0.177
1.791TrpGly: 1.791 ± 0.326
0.716TrpHis: 0.716 ± 0.204
0.716TrpIle: 0.716 ± 0.353
0.716TrpLys: 0.716 ± 0.353
1.791TrpLeu: 1.791 ± 0.326
0.0TrpMet: 0.0 ± 0.0
1.433TrpAsn: 1.433 ± 0.407
1.433TrpPro: 1.433 ± 0.407
0.358TrpGln: 0.358 ± 0.177
0.358TrpArg: 0.358 ± 0.177
1.791TrpSer: 1.791 ± 0.326
0.716TrpThr: 0.716 ± 0.353
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.433TrpTyr: 1.433 ± 0.407
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.656TyrAla: 4.656 ± 0.488
1.074TyrCys: 1.074 ± 0.027
1.433TyrAsp: 1.433 ± 0.15
1.074TyrGlu: 1.074 ± 0.027
1.074TyrPhe: 1.074 ± 0.53
3.94TyrGly: 3.94 ± 0.285
0.716TyrHis: 0.716 ± 0.204
3.94TyrIle: 3.94 ± 0.829
2.865TyrLys: 2.865 ± 0.299
2.507TyrLeu: 2.507 ± 0.123
0.0TyrMet: 0.0 ± 0.0
1.433TyrAsn: 1.433 ± 0.407
1.433TyrPro: 1.433 ± 0.15
1.791TyrGln: 1.791 ± 0.231
2.149TyrArg: 2.149 ± 0.503
6.089TyrSer: 6.089 ± 0.895
4.298TyrThr: 4.298 ± 0.665
2.149TyrVal: 2.149 ± 0.054
0.0TyrTrp: 0.0 ± 0.0
2.149TyrTyr: 2.149 ± 0.054
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2793 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski