Amino acid dipepetide frequency for Gonium pectorale (Green alga)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
38.589AlaAla: 38.589 ± 0.26
1.836AlaCys: 1.836 ± 0.018
6.497AlaAsp: 6.497 ± 0.034
8.474AlaGlu: 8.474 ± 0.064
3.076AlaPhe: 3.076 ± 0.023
15.716AlaGly: 15.716 ± 0.077
2.303AlaHis: 2.303 ± 0.018
2.925AlaIle: 2.925 ± 0.022
3.59AlaLys: 3.59 ± 0.031
12.819AlaLeu: 12.819 ± 0.057
2.687AlaMet: 2.687 ± 0.02
2.657AlaAsn: 2.657 ± 0.024
9.304AlaPro: 9.304 ± 0.068
4.82AlaGln: 4.82 ± 0.039
7.867AlaArg: 7.867 ± 0.039
10.484AlaSer: 10.484 ± 0.055
7.07AlaThr: 7.07 ± 0.041
10.163AlaVal: 10.163 ± 0.052
1.591AlaTrp: 1.591 ± 0.019
2.123AlaTyr: 2.123 ± 0.019
0.0AlaXaa: 0.0 ± 0.0
Cys
1.629CysAla: 1.629 ± 0.017
0.386CysCys: 0.386 ± 0.008
0.717CysAsp: 0.717 ± 0.011
0.681CysGlu: 0.681 ± 0.008
0.439CysPhe: 0.439 ± 0.008
1.542CysGly: 1.542 ± 0.019
0.302CysHis: 0.302 ± 0.006
0.415CysIle: 0.415 ± 0.007
0.428CysLys: 0.428 ± 0.007
1.451CysLeu: 1.451 ± 0.017
0.305CysMet: 0.305 ± 0.006
0.397CysAsn: 0.397 ± 0.008
0.982CysPro: 0.982 ± 0.015
0.431CysGln: 0.431 ± 0.008
1.052CysArg: 1.052 ± 0.012
1.109CysSer: 1.109 ± 0.015
0.773CysThr: 0.773 ± 0.013
0.995CysVal: 0.995 ± 0.012
0.266CysTrp: 0.266 ± 0.007
0.32CysTyr: 0.32 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
6.373AspAla: 6.373 ± 0.034
0.701AspCys: 0.701 ± 0.011
3.193AspAsp: 3.193 ± 0.031
3.279AspGlu: 3.279 ± 0.029
1.349AspPhe: 1.349 ± 0.016
5.762AspGly: 5.762 ± 0.041
0.721AspHis: 0.721 ± 0.01
1.371AspIle: 1.371 ± 0.014
1.485AspLys: 1.485 ± 0.022
4.202AspLeu: 4.202 ± 0.028
1.103AspMet: 1.103 ± 0.012
1.019AspAsn: 1.019 ± 0.011
3.171AspPro: 3.171 ± 0.021
1.19AspGln: 1.19 ± 0.013
2.572AspArg: 2.572 ± 0.022
2.969AspSer: 2.969 ± 0.022
2.069AspThr: 2.069 ± 0.018
3.627AspVal: 3.627 ± 0.019
0.762AspTrp: 0.762 ± 0.012
0.994AspTyr: 0.994 ± 0.013
0.0AspXaa: 0.0 ± 0.0
Glu
9.25GluAla: 9.25 ± 0.065
0.601GluCys: 0.601 ± 0.009
2.998GluAsp: 2.998 ± 0.027
5.146GluGlu: 5.146 ± 0.054
1.105GluPhe: 1.105 ± 0.013
4.932GluGly: 4.932 ± 0.031
1.065GluHis: 1.065 ± 0.013
1.121GluIle: 1.121 ± 0.017
1.511GluLys: 1.511 ± 0.021
6.203GluLeu: 6.203 ± 0.037
1.049GluMet: 1.049 ± 0.012
0.822GluAsn: 0.822 ± 0.012
2.957GluPro: 2.957 ± 0.031
2.531GluGln: 2.531 ± 0.023
4.146GluArg: 4.146 ± 0.035
2.516GluSer: 2.516 ± 0.019
1.773GluThr: 1.773 ± 0.017
3.912GluVal: 3.912 ± 0.025
0.803GluTrp: 0.803 ± 0.011
1.076GluTyr: 1.076 ± 0.012
0.0GluXaa: 0.0 ± 0.0
Phe
2.676PheAla: 2.676 ± 0.021
0.431PheCys: 0.431 ± 0.007
1.333PheAsp: 1.333 ± 0.014
1.253PheGlu: 1.253 ± 0.013
0.854PhePhe: 0.854 ± 0.012
2.329PheGly: 2.329 ± 0.022
0.472PheHis: 0.472 ± 0.008
0.769PheIle: 0.769 ± 0.011
0.92PheLys: 0.92 ± 0.012
2.273PheLeu: 2.273 ± 0.02
0.582PheMet: 0.582 ± 0.009
0.806PheAsn: 0.806 ± 0.012
1.206PhePro: 1.206 ± 0.012
0.753PheGln: 0.753 ± 0.011
1.525PheArg: 1.525 ± 0.015
1.705PheSer: 1.705 ± 0.016
1.409PheThr: 1.409 ± 0.017
1.763PheVal: 1.763 ± 0.018
0.351PheTrp: 0.351 ± 0.007
0.61PheTyr: 0.61 ± 0.011
0.0PheXaa: 0.0 ± 0.0
Gly
14.743GlyAla: 14.743 ± 0.079
1.709GlyCys: 1.709 ± 0.019
4.821GlyAsp: 4.821 ± 0.034
4.474GlyGlu: 4.474 ± 0.032
2.184GlyPhe: 2.184 ± 0.021
18.648GlyGly: 18.648 ± 0.154
1.998GlyHis: 1.998 ± 0.02
2.036GlyIle: 2.036 ± 0.018
2.519GlyLys: 2.519 ± 0.023
7.898GlyLeu: 7.898 ± 0.038
1.945GlyMet: 1.945 ± 0.025
2.094GlyAsn: 2.094 ± 0.021
6.374GlyPro: 6.374 ± 0.041
3.098GlyGln: 3.098 ± 0.021
6.715GlyArg: 6.715 ± 0.039
8.942GlySer: 8.942 ± 0.061
4.39GlyThr: 4.39 ± 0.03
5.724GlyVal: 5.724 ± 0.028
1.283GlyTrp: 1.283 ± 0.015
1.797GlyTyr: 1.797 ± 0.023
0.0GlyXaa: 0.0 ± 0.0
His
2.35HisAla: 2.35 ± 0.018
0.318HisCys: 0.318 ± 0.007
0.892HisAsp: 0.892 ± 0.012
0.945HisGlu: 0.945 ± 0.012
0.591HisPhe: 0.591 ± 0.009
2.041HisGly: 2.041 ± 0.019
0.785HisHis: 0.785 ± 0.018
0.638HisIle: 0.638 ± 0.01
0.584HisLys: 0.584 ± 0.01
2.038HisLeu: 2.038 ± 0.018
0.499HisMet: 0.499 ± 0.008
0.494HisAsn: 0.494 ± 0.008
1.442HisPro: 1.442 ± 0.016
0.785HisGln: 0.785 ± 0.011
1.39HisArg: 1.39 ± 0.013
1.372HisSer: 1.372 ± 0.014
0.95HisThr: 0.95 ± 0.01
1.497HisVal: 1.497 ± 0.017
0.323HisTrp: 0.323 ± 0.006
0.453HisTyr: 0.453 ± 0.008
0.0HisXaa: 0.0 ± 0.0
Ile
2.855IleAla: 2.855 ± 0.023
0.391IleCys: 0.391 ± 0.009
1.251IleAsp: 1.251 ± 0.014
1.214IleGlu: 1.214 ± 0.015
0.735IlePhe: 0.735 ± 0.012
1.744IleGly: 1.744 ± 0.016
0.509IleHis: 0.509 ± 0.009
0.892IleIle: 0.892 ± 0.015
1.067IleLys: 1.067 ± 0.016
2.005IleLeu: 2.005 ± 0.019
0.599IleMet: 0.599 ± 0.01
0.82IleAsn: 0.82 ± 0.012
1.274IlePro: 1.274 ± 0.015
0.83IleGln: 0.83 ± 0.011
1.63IleArg: 1.63 ± 0.017
1.762IleSer: 1.762 ± 0.017
1.495IleThr: 1.495 ± 0.017
1.742IleVal: 1.742 ± 0.016
0.301IleTrp: 0.301 ± 0.007
0.576IleTyr: 0.576 ± 0.01
0.0IleXaa: 0.0 ± 0.0
Lys
3.755LysAla: 3.755 ± 0.035
0.327LysCys: 0.327 ± 0.006
1.466LysAsp: 1.466 ± 0.02
1.896LysGlu: 1.896 ± 0.024
0.662LysPhe: 0.662 ± 0.011
2.2LysGly: 2.2 ± 0.024
0.574LysHis: 0.574 ± 0.009
0.747LysIle: 0.747 ± 0.012
1.473LysLys: 1.473 ± 0.026
2.842LysLeu: 2.842 ± 0.024
0.585LysMet: 0.585 ± 0.011
0.624LysAsn: 0.624 ± 0.011
1.74LysPro: 1.74 ± 0.02
1.236LysGln: 1.236 ± 0.015
2.021LysArg: 2.021 ± 0.02
1.429LysSer: 1.429 ± 0.016
1.195LysThr: 1.195 ± 0.014
1.98LysVal: 1.98 ± 0.02
0.358LysTrp: 0.358 ± 0.007
0.723LysTyr: 0.723 ± 0.013
0.0LysXaa: 0.0 ± 0.0
Leu
13.079LeuAla: 13.079 ± 0.06
1.387LeuCys: 1.387 ± 0.016
4.617LeuAsp: 4.617 ± 0.029
5.729LeuGlu: 5.729 ± 0.038
2.244LeuPhe: 2.244 ± 0.023
7.35LeuGly: 7.35 ± 0.039
2.351LeuHis: 2.351 ± 0.021
2.011LeuIle: 2.011 ± 0.02
2.673LeuLys: 2.673 ± 0.025
11.401LeuLeu: 11.401 ± 0.066
1.921LeuMet: 1.921 ± 0.017
1.929LeuAsn: 1.929 ± 0.018
6.993LeuPro: 6.993 ± 0.038
4.74LeuGln: 4.74 ± 0.031
7.803LeuArg: 7.803 ± 0.043
6.396LeuSer: 6.396 ± 0.035
4.537LeuThr: 4.537 ± 0.029
6.219LeuVal: 6.219 ± 0.036
1.178LeuTrp: 1.178 ± 0.014
1.994LeuTyr: 1.994 ± 0.019
0.0LeuXaa: 0.0 ± 0.0
Met
2.871MetAla: 2.871 ± 0.023
0.29MetCys: 0.29 ± 0.006
1.014MetAsp: 1.014 ± 0.011
1.216MetGlu: 1.216 ± 0.014
0.494MetPhe: 0.494 ± 0.009
1.675MetGly: 1.675 ± 0.023
0.419MetHis: 0.419 ± 0.007
0.445MetIle: 0.445 ± 0.008
0.519MetLys: 0.519 ± 0.011
2.082MetLeu: 2.082 ± 0.02
0.508MetMet: 0.508 ± 0.015
0.397MetAsn: 0.397 ± 0.007
1.333MetPro: 1.333 ± 0.013
0.918MetGln: 0.918 ± 0.013
1.425MetArg: 1.425 ± 0.012
1.29MetSer: 1.29 ± 0.012
0.89MetThr: 0.89 ± 0.011
1.254MetVal: 1.254 ± 0.014
0.261MetTrp: 0.261 ± 0.006
0.493MetTyr: 0.493 ± 0.009
0.0MetXaa: 0.0 ± 0.0
Asn
2.506AsnAla: 2.506 ± 0.023
0.347AsnCys: 0.347 ± 0.007
0.965AsnAsp: 0.965 ± 0.01
0.922AsnGlu: 0.922 ± 0.013
0.645AsnPhe: 0.645 ± 0.01
2.369AsnGly: 2.369 ± 0.026
0.392AsnHis: 0.392 ± 0.007
0.81AsnIle: 0.81 ± 0.012
0.825AsnLys: 0.825 ± 0.012
1.925AsnLeu: 1.925 ± 0.019
0.523AsnMet: 0.523 ± 0.01
0.706AsnAsn: 0.706 ± 0.012
1.356AsnPro: 1.356 ± 0.017
0.616AsnGln: 0.616 ± 0.011
1.216AsnArg: 1.216 ± 0.014
1.452AsnSer: 1.452 ± 0.015
1.188AsnThr: 1.188 ± 0.015
1.568AsnVal: 1.568 ± 0.018
0.304AsnTrp: 0.304 ± 0.007
0.514AsnTyr: 0.514 ± 0.009
0.0AsnXaa: 0.0 ± 0.0
Pro
10.401ProAla: 10.401 ± 0.061
0.77ProCys: 0.77 ± 0.012
3.151ProAsp: 3.151 ± 0.023
3.636ProGlu: 3.636 ± 0.032
1.419ProPhe: 1.419 ± 0.013
6.811ProGly: 6.811 ± 0.04
1.463ProHis: 1.463 ± 0.016
1.235ProIle: 1.235 ± 0.014
1.464ProLys: 1.464 ± 0.018
6.035ProLeu: 6.035 ± 0.033
1.025ProMet: 1.025 ± 0.013
1.279ProAsn: 1.279 ± 0.015
8.416ProPro: 8.416 ± 0.076
2.699ProGln: 2.699 ± 0.028
3.881ProArg: 3.881 ± 0.027
5.659ProSer: 5.659 ± 0.04
3.092ProThr: 3.092 ± 0.02
3.961ProVal: 3.961 ± 0.029
0.819ProTrp: 0.819 ± 0.012
1.333ProTyr: 1.333 ± 0.017
0.0ProXaa: 0.0 ± 0.0
Gln
5.133GlnAla: 5.133 ± 0.037
0.418GlnCys: 0.418 ± 0.008
1.409GlnAsp: 1.409 ± 0.014
2.191GlnGlu: 2.191 ± 0.02
0.715GlnPhe: 0.715 ± 0.011
2.801GlnGly: 2.801 ± 0.023
1.158GlnHis: 1.158 ± 0.013
0.798GlnIle: 0.798 ± 0.012
0.877GlnLys: 0.877 ± 0.014
4.976GlnLeu: 4.976 ± 0.035
0.714GlnMet: 0.714 ± 0.012
0.581GlnAsn: 0.581 ± 0.009
3.308GlnPro: 3.308 ± 0.027
4.41GlnGln: 4.41 ± 0.07
3.286GlnArg: 3.286 ± 0.024
1.872GlnSer: 1.872 ± 0.017
1.261GlnThr: 1.261 ± 0.014
2.273GlnVal: 2.273 ± 0.022
0.444GlnTrp: 0.444 ± 0.009
0.774GlnTyr: 0.774 ± 0.011
0.0GlnXaa: 0.0 ± 0.0
Arg
7.908ArgAla: 7.908 ± 0.04
1.182ArgCys: 1.182 ± 0.012
3.062ArgAsp: 3.062 ± 0.026
3.767ArgGlu: 3.767 ± 0.029
1.679ArgPhe: 1.679 ± 0.015
5.799ArgGly: 5.799 ± 0.035
1.612ArgHis: 1.612 ± 0.017
1.754ArgIle: 1.754 ± 0.019
1.963ArgLys: 1.963 ± 0.019
7.428ArgLeu: 7.428 ± 0.038
1.454ArgMet: 1.454 ± 0.014
1.513ArgAsn: 1.513 ± 0.016
4.409ArgPro: 4.409 ± 0.029
3.056ArgGln: 3.056 ± 0.023
6.35ArgArg: 6.35 ± 0.047
4.704ArgSer: 4.704 ± 0.03
2.859ArgThr: 2.859 ± 0.021
4.149ArgVal: 4.149 ± 0.025
0.967ArgTrp: 0.967 ± 0.012
1.346ArgTyr: 1.346 ± 0.013
0.0ArgXaa: 0.0 ± 0.0
Ser
9.949SerAla: 9.949 ± 0.06
1.118SerCys: 1.118 ± 0.014
3.146SerAsp: 3.146 ± 0.022
2.987SerGlu: 2.987 ± 0.022
1.8SerPhe: 1.8 ± 0.016
8.759SerGly: 8.759 ± 0.05
1.274SerHis: 1.274 ± 0.014
1.718SerIle: 1.718 ± 0.017
1.771SerLys: 1.771 ± 0.015
6.299SerLeu: 6.299 ± 0.034
1.29SerMet: 1.29 ± 0.014
1.526SerAsn: 1.526 ± 0.018
5.109SerPro: 5.109 ± 0.041
2.213SerGln: 2.213 ± 0.02
4.545SerArg: 4.545 ± 0.032
6.993SerSer: 6.993 ± 0.056
3.522SerThr: 3.522 ± 0.027
4.238SerVal: 4.238 ± 0.026
0.912SerTrp: 0.912 ± 0.012
1.472SerTyr: 1.472 ± 0.016
0.0SerXaa: 0.0 ± 0.0
Thr
7.624ThrAla: 7.624 ± 0.043
0.771ThrCys: 0.771 ± 0.015
2.095ThrAsp: 2.095 ± 0.017
2.051ThrGlu: 2.051 ± 0.017
1.315ThrPhe: 1.315 ± 0.016
4.414ThrGly: 4.414 ± 0.027
0.893ThrHis: 0.893 ± 0.011
1.366ThrIle: 1.366 ± 0.015
1.19ThrLys: 1.19 ± 0.015
4.27ThrLeu: 4.27 ± 0.028
0.805ThrMet: 0.805 ± 0.01
1.031ThrAsn: 1.031 ± 0.013
3.348ThrPro: 3.348 ± 0.028
1.41ThrGln: 1.41 ± 0.015
2.508ThrArg: 2.508 ± 0.016
3.529ThrSer: 3.529 ± 0.029
2.591ThrThr: 2.591 ± 0.025
3.278ThrVal: 3.278 ± 0.026
0.624ThrTrp: 0.624 ± 0.01
1.037ThrTyr: 1.037 ± 0.013
0.0ThrXaa: 0.0 ± 0.0
Val
9.621ValAla: 9.621 ± 0.045
1.03ValCys: 1.03 ± 0.012
3.247ValAsp: 3.247 ± 0.022
3.829ValGlu: 3.829 ± 0.028
1.777ValPhe: 1.777 ± 0.019
5.184ValGly: 5.184 ± 0.028
1.374ValHis: 1.374 ± 0.013
1.674ValIle: 1.674 ± 0.016
1.908ValLys: 1.908 ± 0.019
6.905ValLeu: 6.905 ± 0.039
1.415ValMet: 1.415 ± 0.014
1.435ValAsn: 1.435 ± 0.016
4.155ValPro: 4.155 ± 0.022
2.414ValGln: 2.414 ± 0.019
4.512ValArg: 4.512 ± 0.028
4.298ValSer: 4.298 ± 0.028
3.432ValThr: 3.432 ± 0.025
5.299ValVal: 5.299 ± 0.04
0.926ValTrp: 0.926 ± 0.012
1.428ValTyr: 1.428 ± 0.014
0.0ValXaa: 0.0 ± 0.0
Trp
1.451TrpAla: 1.451 ± 0.014
0.22TrpCys: 0.22 ± 0.006
0.721TrpAsp: 0.721 ± 0.012
0.748TrpGlu: 0.748 ± 0.01
0.313TrpPhe: 0.313 ± 0.007
1.036TrpGly: 1.036 ± 0.013
0.308TrpHis: 0.308 ± 0.008
0.31TrpIle: 0.31 ± 0.007
0.355TrpLys: 0.355 ± 0.007
1.637TrpLeu: 1.637 ± 0.022
0.28TrpMet: 0.28 ± 0.007
0.313TrpAsn: 0.313 ± 0.008
0.728TrpPro: 0.728 ± 0.009
0.587TrpGln: 0.587 ± 0.009
1.222TrpArg: 1.222 ± 0.014
0.849TrpSer: 0.849 ± 0.011
0.566TrpThr: 0.566 ± 0.009
0.833TrpVal: 0.833 ± 0.011
0.247TrpTrp: 0.247 ± 0.006
0.295TrpTyr: 0.295 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.828TyrAla: 1.828 ± 0.014
0.39TyrCys: 0.39 ± 0.008
1.315TyrAsp: 1.315 ± 0.015
1.081TyrGlu: 1.081 ± 0.014
0.62TyrPhe: 0.62 ± 0.01
2.278TyrGly: 2.278 ± 0.029
0.413TyrHis: 0.413 ± 0.007
0.656TyrIle: 0.656 ± 0.011
0.712TyrLys: 0.712 ± 0.011
1.867TyrLeu: 1.867 ± 0.017
0.506TyrMet: 0.506 ± 0.009
0.705TyrAsn: 0.705 ± 0.012
0.945TyrPro: 0.945 ± 0.013
0.663TyrGln: 0.663 ± 0.01
1.295TyrArg: 1.295 ± 0.016
1.306TyrSer: 1.306 ± 0.014
1.089TyrThr: 1.089 ± 0.015
1.389TyrVal: 1.389 ± 0.015
0.3TyrTrp: 0.3 ± 0.007
0.597TyrTyr: 0.597 ± 0.011
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 16224 proteins (8125833 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski