Amino acid dipepetide frequency for Thiohalocapsa sp. PB-PSB1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.346AlaAla: 13.346 ± 0.119
1.249AlaCys: 1.249 ± 0.031
7.173AlaAsp: 7.173 ± 0.066
7.23AlaGlu: 7.23 ± 0.066
3.624AlaPhe: 3.624 ± 0.048
8.663AlaGly: 8.663 ± 0.094
2.191AlaHis: 2.191 ± 0.038
5.509AlaIle: 5.509 ± 0.061
3.187AlaLys: 3.187 ± 0.051
12.841AlaLeu: 12.841 ± 0.11
2.854AlaMet: 2.854 ± 0.042
3.077AlaAsn: 3.077 ± 0.043
4.608AlaPro: 4.608 ± 0.06
4.14AlaGln: 4.14 ± 0.054
7.922AlaArg: 7.922 ± 0.084
5.997AlaSer: 5.997 ± 0.062
5.096AlaThr: 5.096 ± 0.054
7.174AlaVal: 7.174 ± 0.083
1.643AlaTrp: 1.643 ± 0.032
2.438AlaTyr: 2.438 ± 0.038
0.0AlaXaa: 0.0 ± 0.0
Cys
1.13CysAla: 1.13 ± 0.028
0.211CysCys: 0.211 ± 0.011
0.656CysAsp: 0.656 ± 0.023
0.585CysGlu: 0.585 ± 0.019
0.434CysPhe: 0.434 ± 0.017
0.985CysGly: 0.985 ± 0.022
0.335CysHis: 0.335 ± 0.017
0.582CysIle: 0.582 ± 0.02
0.299CysLys: 0.299 ± 0.013
1.059CysLeu: 1.059 ± 0.025
0.233CysMet: 0.233 ± 0.01
0.355CysAsn: 0.355 ± 0.014
0.598CysPro: 0.598 ± 0.021
0.388CysGln: 0.388 ± 0.014
0.874CysArg: 0.874 ± 0.021
0.651CysSer: 0.651 ± 0.021
0.521CysThr: 0.521 ± 0.019
0.652CysVal: 0.652 ± 0.022
0.195CysTrp: 0.195 ± 0.011
0.304CysTyr: 0.304 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
7.371AspAla: 7.371 ± 0.073
0.709AspCys: 0.709 ± 0.023
3.689AspAsp: 3.689 ± 0.054
3.482AspGlu: 3.482 ± 0.054
2.237AspPhe: 2.237 ± 0.038
5.128AspGly: 5.128 ± 0.079
1.292AspHis: 1.292 ± 0.033
2.924AspIle: 2.924 ± 0.046
1.677AspLys: 1.677 ± 0.034
6.916AspLeu: 6.916 ± 0.067
1.232AspMet: 1.232 ± 0.027
1.767AspAsn: 1.767 ± 0.041
3.746AspPro: 3.746 ± 0.057
2.67AspGln: 2.67 ± 0.048
4.35AspArg: 4.35 ± 0.049
3.184AspSer: 3.184 ± 0.049
2.747AspThr: 2.747 ± 0.052
3.64AspVal: 3.64 ± 0.046
1.188AspTrp: 1.188 ± 0.031
1.828AspTyr: 1.828 ± 0.039
0.0AspXaa: 0.0 ± 0.0
Glu
5.861GluAla: 5.861 ± 0.066
0.469GluCys: 0.469 ± 0.018
2.831GluAsp: 2.831 ± 0.047
2.807GluGlu: 2.807 ± 0.044
1.865GluPhe: 1.865 ± 0.033
3.384GluGly: 3.384 ± 0.048
1.674GluHis: 1.674 ± 0.037
3.357GluIle: 3.357 ± 0.046
1.74GluLys: 1.74 ± 0.041
6.675GluLeu: 6.675 ± 0.071
1.294GluMet: 1.294 ± 0.028
1.561GluAsn: 1.561 ± 0.03
3.1GluPro: 3.1 ± 0.047
3.65GluGln: 3.65 ± 0.056
5.417GluArg: 5.417 ± 0.062
2.983GluSer: 2.983 ± 0.043
3.179GluThr: 3.179 ± 0.043
3.747GluVal: 3.747 ± 0.053
0.711GluTrp: 0.711 ± 0.019
1.287GluTyr: 1.287 ± 0.028
0.0GluXaa: 0.0 ± 0.0
Phe
3.837PheAla: 3.837 ± 0.049
0.462PheCys: 0.462 ± 0.018
2.805PheAsp: 2.805 ± 0.042
2.21PheGlu: 2.21 ± 0.035
1.437PhePhe: 1.437 ± 0.038
3.295PheGly: 3.295 ± 0.05
0.838PheHis: 0.838 ± 0.021
1.697PheIle: 1.697 ± 0.032
0.96PheLys: 0.96 ± 0.028
3.337PheLeu: 3.337 ± 0.05
0.739PheMet: 0.739 ± 0.021
1.091PheAsn: 1.091 ± 0.028
1.584PhePro: 1.584 ± 0.031
1.166PheGln: 1.166 ± 0.025
2.309PheArg: 2.309 ± 0.039
2.38PheSer: 2.38 ± 0.042
1.68PheThr: 1.68 ± 0.035
2.461PheVal: 2.461 ± 0.038
0.56PheTrp: 0.56 ± 0.021
0.982PheTyr: 0.982 ± 0.022
0.0PheXaa: 0.0 ± 0.0
Gly
7.241GlyAla: 7.241 ± 0.076
1.019GlyCys: 1.019 ± 0.024
4.51GlyAsp: 4.51 ± 0.06
4.405GlyGlu: 4.405 ± 0.053
3.241GlyPhe: 3.241 ± 0.044
5.748GlyGly: 5.748 ± 0.081
1.795GlyHis: 1.795 ± 0.033
4.588GlyIle: 4.588 ± 0.055
2.761GlyLys: 2.761 ± 0.049
8.642GlyLeu: 8.642 ± 0.075
2.023GlyMet: 2.023 ± 0.037
2.496GlyAsn: 2.496 ± 0.044
2.821GlyPro: 2.821 ± 0.045
2.924GlyGln: 2.924 ± 0.048
5.511GlyArg: 5.511 ± 0.064
4.665GlySer: 4.665 ± 0.065
3.903GlyThr: 3.903 ± 0.054
5.054GlyVal: 5.054 ± 0.058
1.338GlyTrp: 1.338 ± 0.028
2.371GlyTyr: 2.371 ± 0.057
0.0GlyXaa: 0.0 ± 0.0
His
2.464HisAla: 2.464 ± 0.037
0.386HisCys: 0.386 ± 0.017
1.196HisAsp: 1.196 ± 0.029
1.123HisGlu: 1.123 ± 0.029
0.987HisPhe: 0.987 ± 0.026
1.908HisGly: 1.908 ± 0.04
0.652HisHis: 0.652 ± 0.021
1.058HisIle: 1.058 ± 0.024
0.585HisLys: 0.585 ± 0.019
2.59HisLeu: 2.59 ± 0.045
0.488HisMet: 0.488 ± 0.018
0.621HisAsn: 0.621 ± 0.019
1.58HisPro: 1.58 ± 0.034
1.022HisGln: 1.022 ± 0.023
1.911HisArg: 1.911 ± 0.041
1.134HisSer: 1.134 ± 0.026
0.942HisThr: 0.942 ± 0.024
1.354HisVal: 1.354 ± 0.03
0.5HisTrp: 0.5 ± 0.017
0.728HisTyr: 0.728 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
6.435IleAla: 6.435 ± 0.061
0.588IleCys: 0.588 ± 0.02
4.213IleAsp: 4.213 ± 0.058
3.759IleGlu: 3.759 ± 0.05
1.485IlePhe: 1.485 ± 0.029
4.918IleGly: 4.918 ± 0.067
1.081IleHis: 1.081 ± 0.025
2.178IleIle: 2.178 ± 0.038
1.505IleLys: 1.505 ± 0.028
4.664IleLeu: 4.664 ± 0.061
0.88IleMet: 0.88 ± 0.026
1.505IleAsn: 1.505 ± 0.03
2.593IlePro: 2.593 ± 0.04
1.775IleGln: 1.775 ± 0.034
3.316IleArg: 3.316 ± 0.042
2.902IleSer: 2.902 ± 0.044
2.425IleThr: 2.425 ± 0.038
3.333IleVal: 3.333 ± 0.045
0.654IleTrp: 0.654 ± 0.018
1.185IleTyr: 1.185 ± 0.026
0.0IleXaa: 0.0 ± 0.0
Lys
3.093LysAla: 3.093 ± 0.047
0.212LysCys: 0.212 ± 0.012
1.514LysAsp: 1.514 ± 0.035
1.488LysGlu: 1.488 ± 0.03
0.754LysPhe: 0.754 ± 0.023
1.948LysGly: 1.948 ± 0.042
0.731LysHis: 0.731 ± 0.023
1.417LysIle: 1.417 ± 0.032
1.07LysLys: 1.07 ± 0.034
3.041LysLeu: 3.041 ± 0.05
0.612LysMet: 0.612 ± 0.02
0.82LysAsn: 0.82 ± 0.025
1.8LysPro: 1.8 ± 0.036
1.473LysGln: 1.473 ± 0.035
2.459LysArg: 2.459 ± 0.047
1.598LysSer: 1.598 ± 0.032
1.755LysThr: 1.755 ± 0.035
1.893LysVal: 1.893 ± 0.041
0.332LysTrp: 0.332 ± 0.016
0.625LysTyr: 0.625 ± 0.02
0.0LysXaa: 0.0 ± 0.0
Leu
13.213LeuAla: 13.213 ± 0.116
1.245LeuCys: 1.245 ± 0.026
6.945LeuAsp: 6.945 ± 0.064
6.172LeuGlu: 6.172 ± 0.066
4.182LeuPhe: 4.182 ± 0.059
8.257LeuGly: 8.257 ± 0.075
2.661LeuHis: 2.661 ± 0.037
5.682LeuIle: 5.682 ± 0.061
2.919LeuLys: 2.919 ± 0.044
13.08LeuLeu: 13.08 ± 0.134
2.411LeuMet: 2.411 ± 0.043
2.911LeuAsn: 2.911 ± 0.046
6.168LeuPro: 6.168 ± 0.069
4.03LeuGln: 4.03 ± 0.045
8.328LeuArg: 8.328 ± 0.093
6.811LeuSer: 6.811 ± 0.066
5.546LeuThr: 5.546 ± 0.059
7.508LeuVal: 7.508 ± 0.067
1.413LeuTrp: 1.413 ± 0.035
2.594LeuTyr: 2.594 ± 0.038
0.0LeuXaa: 0.0 ± 0.0
Met
2.486MetAla: 2.486 ± 0.041
0.174MetCys: 0.174 ± 0.011
1.221MetAsp: 1.221 ± 0.024
1.085MetGlu: 1.085 ± 0.026
0.647MetPhe: 0.647 ± 0.021
1.388MetGly: 1.388 ± 0.032
0.596MetHis: 0.596 ± 0.018
1.15MetIle: 1.15 ± 0.029
0.725MetLys: 0.725 ± 0.021
2.796MetLeu: 2.796 ± 0.044
0.519MetMet: 0.519 ± 0.019
0.772MetAsn: 0.772 ± 0.021
1.516MetPro: 1.516 ± 0.029
1.145MetGln: 1.145 ± 0.03
1.787MetArg: 1.787 ± 0.036
1.397MetSer: 1.397 ± 0.032
1.437MetThr: 1.437 ± 0.032
1.382MetVal: 1.382 ± 0.033
0.141MetTrp: 0.141 ± 0.009
0.327MetTyr: 0.327 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
3.31AsnAla: 3.31 ± 0.048
0.337AsnCys: 0.337 ± 0.015
1.688AsnAsp: 1.688 ± 0.039
1.423AsnGlu: 1.423 ± 0.028
0.957AsnPhe: 0.957 ± 0.029
2.351AsnGly: 2.351 ± 0.047
0.638AsnHis: 0.638 ± 0.019
1.317AsnIle: 1.317 ± 0.031
0.767AsnLys: 0.767 ± 0.025
3.197AsnLeu: 3.197 ± 0.041
0.611AsnMet: 0.611 ± 0.017
0.921AsnAsn: 0.921 ± 0.028
2.054AsnPro: 2.054 ± 0.032
1.282AsnGln: 1.282 ± 0.03
2.229AsnArg: 2.229 ± 0.037
1.555AsnSer: 1.555 ± 0.032
1.395AsnThr: 1.395 ± 0.032
1.638AsnVal: 1.638 ± 0.027
0.458AsnTrp: 0.458 ± 0.017
0.751AsnTyr: 0.751 ± 0.023
0.0AsnXaa: 0.0 ± 0.0
Pro
5.656ProAla: 5.656 ± 0.062
0.449ProCys: 0.449 ± 0.018
3.782ProAsp: 3.782 ± 0.055
3.846ProGlu: 3.846 ± 0.047
1.824ProPhe: 1.824 ± 0.035
4.344ProGly: 4.344 ± 0.066
1.02ProHis: 1.02 ± 0.028
2.574ProIle: 2.574 ± 0.036
1.524ProLys: 1.524 ± 0.038
5.204ProLeu: 5.204 ± 0.06
1.309ProMet: 1.309 ± 0.03
1.63ProAsn: 1.63 ± 0.031
2.708ProPro: 2.708 ± 0.059
1.69ProGln: 1.69 ± 0.031
3.267ProArg: 3.267 ± 0.049
3.018ProSer: 3.018 ± 0.043
2.578ProThr: 2.578 ± 0.038
3.692ProVal: 3.692 ± 0.054
0.83ProTrp: 0.83 ± 0.021
1.185ProTyr: 1.185 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
4.938GlnAla: 4.938 ± 0.062
0.372GlnCys: 0.372 ± 0.015
2.078GlnAsp: 2.078 ± 0.035
1.907GlnGlu: 1.907 ± 0.041
1.227GlnPhe: 1.227 ± 0.025
2.821GlnGly: 2.821 ± 0.039
1.058GlnHis: 1.058 ± 0.026
2.206GlnIle: 2.206 ± 0.032
0.96GlnLys: 0.96 ± 0.025
4.18GlnLeu: 4.18 ± 0.05
0.921GlnMet: 0.921 ± 0.025
0.964GlnAsn: 0.964 ± 0.026
2.446GlnPro: 2.446 ± 0.035
2.482GlnGln: 2.482 ± 0.057
3.855GlnArg: 3.855 ± 0.062
2.151GlnSer: 2.151 ± 0.038
2.167GlnThr: 2.167 ± 0.037
3.01GlnVal: 3.01 ± 0.043
0.542GlnTrp: 0.542 ± 0.016
0.808GlnTyr: 0.808 ± 0.022
0.0GlnXaa: 0.0 ± 0.0
Arg
7.278ArgAla: 7.278 ± 0.076
0.862ArgCys: 0.862 ± 0.022
4.208ArgAsp: 4.208 ± 0.055
4.238ArgGlu: 4.238 ± 0.057
3.283ArgPhe: 3.283 ± 0.048
4.765ArgGly: 4.765 ± 0.055
1.909ArgHis: 1.909 ± 0.038
4.551ArgIle: 4.551 ± 0.055
2.128ArgLys: 2.128 ± 0.039
9.344ArgLeu: 9.344 ± 0.096
1.927ArgMet: 1.927 ± 0.038
2.166ArgAsn: 2.166 ± 0.039
3.549ArgPro: 3.549 ± 0.052
3.23ArgGln: 3.23 ± 0.05
6.22ArgArg: 6.22 ± 0.079
3.926ArgSer: 3.926 ± 0.059
3.226ArgThr: 3.226 ± 0.042
4.921ArgVal: 4.921 ± 0.055
1.336ArgTrp: 1.336 ± 0.031
2.144ArgTyr: 2.144 ± 0.036
0.0ArgXaa: 0.0 ± 0.0
Ser
6.36SerAla: 6.36 ± 0.057
0.573SerCys: 0.573 ± 0.019
3.516SerAsp: 3.516 ± 0.053
3.209SerGlu: 3.209 ± 0.045
1.98SerPhe: 1.98 ± 0.038
5.158SerGly: 5.158 ± 0.059
1.174SerHis: 1.174 ± 0.028
3.067SerIle: 3.067 ± 0.048
1.655SerLys: 1.655 ± 0.035
5.957SerLeu: 5.957 ± 0.071
1.458SerMet: 1.458 ± 0.028
1.684SerAsn: 1.684 ± 0.033
2.843SerPro: 2.843 ± 0.042
1.875SerGln: 1.875 ± 0.031
3.792SerArg: 3.792 ± 0.047
3.33SerSer: 3.33 ± 0.054
2.923SerThr: 2.923 ± 0.043
3.804SerVal: 3.804 ± 0.052
0.815SerTrp: 0.815 ± 0.022
1.368SerTyr: 1.368 ± 0.034
0.0SerXaa: 0.0 ± 0.0
Thr
5.348ThrAla: 5.348 ± 0.055
0.501ThrCys: 0.501 ± 0.018
3.149ThrAsp: 3.149 ± 0.06
2.763ThrGlu: 2.763 ± 0.048
1.606ThrPhe: 1.606 ± 0.031
4.382ThrGly: 4.382 ± 0.052
1.064ThrHis: 1.064 ± 0.029
2.303ThrIle: 2.303 ± 0.039
1.304ThrLys: 1.304 ± 0.029
6.049ThrLeu: 6.049 ± 0.076
0.983ThrMet: 0.983 ± 0.026
1.403ThrAsn: 1.403 ± 0.033
2.991ThrPro: 2.991 ± 0.044
1.698ThrGln: 1.698 ± 0.032
3.33ThrArg: 3.33 ± 0.045
2.792ThrSer: 2.792 ± 0.039
2.637ThrThr: 2.637 ± 0.057
3.114ThrVal: 3.114 ± 0.062
0.708ThrTrp: 0.708 ± 0.021
1.225ThrTyr: 1.225 ± 0.032
0.0ThrXaa: 0.0 ± 0.0
Val
6.992ValAla: 6.992 ± 0.077
0.747ValCys: 0.747 ± 0.022
4.296ValAsp: 4.296 ± 0.047
4.02ValGlu: 4.02 ± 0.051
2.426ValPhe: 2.426 ± 0.041
4.825ValGly: 4.825 ± 0.059
1.491ValHis: 1.491 ± 0.03
3.437ValIle: 3.437 ± 0.05
1.86ValLys: 1.86 ± 0.033
7.436ValLeu: 7.436 ± 0.068
1.421ValMet: 1.421 ± 0.026
1.931ValAsn: 1.931 ± 0.04
3.299ValPro: 3.299 ± 0.047
2.497ValGln: 2.497 ± 0.039
4.639ValArg: 4.639 ± 0.058
3.703ValSer: 3.703 ± 0.047
3.391ValThr: 3.391 ± 0.048
4.912ValVal: 4.912 ± 0.064
0.851ValTrp: 0.851 ± 0.025
1.589ValTyr: 1.589 ± 0.033
0.0ValXaa: 0.0 ± 0.0
Trp
1.114TrpAla: 1.114 ± 0.028
0.166TrpCys: 0.166 ± 0.011
0.747TrpAsp: 0.747 ± 0.021
0.69TrpGlu: 0.69 ± 0.02
0.594TrpPhe: 0.594 ± 0.019
0.888TrpGly: 0.888 ± 0.025
0.437TrpHis: 0.437 ± 0.017
0.763TrpIle: 0.763 ± 0.021
0.414TrpLys: 0.414 ± 0.018
2.083TrpLeu: 2.083 ± 0.036
0.36TrpMet: 0.36 ± 0.015
0.487TrpAsn: 0.487 ± 0.017
0.763TrpPro: 0.763 ± 0.022
0.837TrpGln: 0.837 ± 0.025
1.328TrpArg: 1.328 ± 0.028
0.951TrpSer: 0.951 ± 0.024
0.704TrpThr: 0.704 ± 0.023
0.935TrpVal: 0.935 ± 0.022
0.268TrpTrp: 0.268 ± 0.013
0.377TrpTyr: 0.377 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.489TyrAla: 2.489 ± 0.04
0.315TyrCys: 0.315 ± 0.013
1.56TyrAsp: 1.56 ± 0.051
1.15TyrGlu: 1.15 ± 0.028
1.04TyrPhe: 1.04 ± 0.028
1.901TyrGly: 1.901 ± 0.043
0.594TyrHis: 0.594 ± 0.018
1.033TyrIle: 1.033 ± 0.025
0.596TyrLys: 0.596 ± 0.019
2.984TyrLeu: 2.984 ± 0.042
0.416TyrMet: 0.416 ± 0.015
0.719TyrAsn: 0.719 ± 0.026
1.36TyrPro: 1.36 ± 0.028
1.12TyrGln: 1.12 ± 0.025
2.366TyrArg: 2.366 ± 0.037
1.411TyrSer: 1.411 ± 0.03
1.098TyrThr: 1.098 ± 0.03
1.54TyrVal: 1.54 ± 0.029
0.425TyrTrp: 0.425 ± 0.016
0.71TyrTyr: 0.71 ± 0.023
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6200 proteins (1699814 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski