Amino acid dipeptide frequency for Thiohalocapsa marina

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
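As a rough illustration of how such frequencies might be computed, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The helper name `dipeptide_frequencies` and the toy sequences are assumptions made for this example and are not taken from Proteome-pI; whether the database counts overlapping pairs, or maps non-standard residues to Xaa in exactly this way, is not stated on this page.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes Xaa.
        residues = [THREE_LETTER.get(ch, "Xaa") for ch in seq.upper()]
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for first, second in zip(residues, residues[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


toy_proteins = ["MAAL", "AAG"]  # made-up sequences, not real data
freqs = dipeptide_frequencies(toy_proteins)
```

With these toy sequences there are five pairs in total (MetAla, AlaAla, AlaLeu, AlaAla, AlaGly), so `freqs["AlaAla"]` is 400‰.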

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
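The arithmetic behind the expected value can be sketched as follows. The function names are hypothetical, and using the uniform expectation as the exact cut-off for the red/blue marking is an assumption; the page does not say how the threshold is applied.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Frequency of each dipeptide if all ordered pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, expected_per_mille):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    if observed_per_mille > expected_per_mille:
        return "over"   # marked in red on the original page
    if observed_per_mille < expected_per_mille:
        return "under"  # marked in blue on the original page
    return "as expected"


expected_20 = expected_uniform_per_mille(20)  # 400 pairs: 2.5 per mille = 0.25%
expected_21 = expected_uniform_per_mille(21)  # 441 pairs: roughly 2.27 per mille

# Two values taken from the table: AlaAla and CysCys.
labels = {
    "AlaAla": classify(16.298, expected_20),
    "CysCys": classify(0.184, expected_20),
}
```

Under this assumption AlaAla (16.298‰) is overrepresented and CysCys (0.184‰) is underrepresented.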

All values are given in per mille (‰); to obtain fractions, multiply them by 10^-3.
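A small worked conversion, using the AlaAla value from the table, may make the unit concrete. The helper names are illustrative only.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala_fraction = per_mille_to_fraction(16.298)  # about 0.016298
ala_ala_percent = per_mille_to_percent(16.298)    # about 1.6298 %
```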

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 16.298 ± 0.165
AlaCys: 1.172 ± 0.037
AlaAsp: 7.769 ± 0.089
AlaGlu: 8.176 ± 0.093
AlaPhe: 3.904 ± 0.065
AlaGly: 10.319 ± 0.112
AlaHis: 2.397 ± 0.053
AlaIle: 5.152 ± 0.072
AlaLys: 2.461 ± 0.046
AlaLeu: 14.997 ± 0.143
AlaMet: 2.99 ± 0.054
AlaAsn: 2.445 ± 0.055
AlaPro: 5.677 ± 0.084
AlaGln: 4.478 ± 0.071
AlaArg: 9.457 ± 0.115
AlaSer: 5.783 ± 0.083
AlaThr: 5.292 ± 0.08
AlaVal: 8.533 ± 0.093
AlaTrp: 1.693 ± 0.039
AlaTyr: 2.465 ± 0.047
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.094 ± 0.032
CysCys: 0.184 ± 0.014
CysAsp: 0.605 ± 0.02
CysGlu: 0.532 ± 0.025
CysPhe: 0.332 ± 0.017
CysGly: 1.026 ± 0.034
CysHis: 0.386 ± 0.025
CysIle: 0.437 ± 0.021
CysLys: 0.205 ± 0.014
CysLeu: 0.997 ± 0.028
CysMet: 0.179 ± 0.011
CysAsn: 0.243 ± 0.016
CysPro: 0.625 ± 0.026
CysGln: 0.292 ± 0.016
CysArg: 0.864 ± 0.029
CysSer: 0.522 ± 0.022
CysThr: 0.443 ± 0.021
CysVal: 0.653 ± 0.026
CysTrp: 0.137 ± 0.011
CysTyr: 0.251 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.89 ± 0.096
AspCys: 0.682 ± 0.026
AspAsp: 3.559 ± 0.057
AspGlu: 3.402 ± 0.056
AspPhe: 2.061 ± 0.047
AspGly: 5.323 ± 0.08
AspHis: 1.188 ± 0.031
AspIle: 2.815 ± 0.052
AspLys: 1.408 ± 0.039
AspLeu: 6.887 ± 0.077
AspMet: 1.247 ± 0.037
AspAsn: 1.356 ± 0.038
AspPro: 3.774 ± 0.071
AspGln: 2.42 ± 0.047
AspArg: 4.252 ± 0.061
AspSer: 2.681 ± 0.053
AspThr: 2.812 ± 0.059
AspVal: 3.589 ± 0.068
AspTrp: 1.272 ± 0.032
AspTyr: 1.719 ± 0.039
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.22 ± 0.084
GluCys: 0.394 ± 0.02
GluAsp: 2.785 ± 0.047
GluGlu: 2.682 ± 0.064
GluPhe: 1.547 ± 0.043
GluGly: 3.707 ± 0.049
GluHis: 1.631 ± 0.038
GluIle: 2.826 ± 0.049
GluLys: 1.369 ± 0.037
GluLeu: 6.516 ± 0.089
GluMet: 1.175 ± 0.031
GluAsn: 1.158 ± 0.035
GluPro: 3.351 ± 0.063
GluGln: 3.618 ± 0.062
GluArg: 5.979 ± 0.084
GluSer: 2.519 ± 0.048
GluThr: 3.109 ± 0.055
GluVal: 3.932 ± 0.052
GluTrp: 0.567 ± 0.023
GluTyr: 1.096 ± 0.031
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.76 ± 0.07
PheCys: 0.37 ± 0.017
PheAsp: 2.553 ± 0.042
PheGlu: 2.045 ± 0.044
PhePhe: 1.279 ± 0.038
PheGly: 3.191 ± 0.052
PheHis: 0.745 ± 0.025
PheIle: 1.407 ± 0.039
PheLys: 0.789 ± 0.027
PheLeu: 3.22 ± 0.071
PheMet: 0.686 ± 0.025
PheAsn: 0.941 ± 0.033
PhePro: 1.495 ± 0.037
PheGln: 1.131 ± 0.03
PheArg: 2.287 ± 0.042
PheSer: 2.043 ± 0.05
PheThr: 1.605 ± 0.039
PheVal: 2.432 ± 0.051
PheTrp: 0.533 ± 0.025
PheTyr: 0.913 ± 0.027
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.655 ± 0.101
GlyCys: 1.026 ± 0.034
GlyAsp: 4.729 ± 0.073
GlyGlu: 4.634 ± 0.06
GlyPhe: 3.168 ± 0.054
GlyGly: 6.773 ± 0.196
GlyHis: 1.961 ± 0.041
GlyIle: 4.306 ± 0.066
GlyLys: 2.419 ± 0.041
GlyLeu: 9.872 ± 0.108
GlyMet: 2.018 ± 0.047
GlyAsn: 2.028 ± 0.052
GlyPro: 3.499 ± 0.063
GlyGln: 3.208 ± 0.05
GlyArg: 6.554 ± 0.081
GlySer: 4.507 ± 0.071
GlyThr: 4.003 ± 0.066
GlyVal: 5.855 ± 0.078
GlyTrp: 1.364 ± 0.038
GlyTyr: 2.334 ± 0.049
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.513 ± 0.05
HisCys: 0.38 ± 0.016
HisAsp: 1.239 ± 0.034
HisGlu: 1.08 ± 0.033
HisPhe: 0.853 ± 0.029
HisGly: 2.142 ± 0.051
HisHis: 0.652 ± 0.024
HisIle: 0.885 ± 0.027
HisLys: 0.508 ± 0.022
HisLeu: 2.596 ± 0.047
HisMet: 0.47 ± 0.019
HisAsn: 0.481 ± 0.019
HisPro: 1.616 ± 0.039
HisGln: 0.908 ± 0.03
HisArg: 1.905 ± 0.047
HisSer: 1.007 ± 0.027
HisThr: 0.863 ± 0.027
HisVal: 1.421 ± 0.038
HisTrp: 0.534 ± 0.02
HisTyr: 0.678 ± 0.023
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.866 ± 0.072
IleCys: 0.45 ± 0.018
IleAsp: 3.548 ± 0.064
IleGlu: 3.252 ± 0.051
IlePhe: 1.226 ± 0.036
IleGly: 4.489 ± 0.074
IleHis: 1.013 ± 0.032
IleIle: 1.876 ± 0.044
IleLys: 1.195 ± 0.034
IleLeu: 4.339 ± 0.065
IleMet: 0.767 ± 0.026
IleAsn: 1.214 ± 0.031
IlePro: 2.301 ± 0.043
IleGln: 1.436 ± 0.036
IleArg: 3.437 ± 0.058
IleSer: 2.173 ± 0.043
IleThr: 2.28 ± 0.045
IleVal: 3.018 ± 0.061
IleTrp: 0.476 ± 0.02
IleTyr: 1.029 ± 0.035
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.907 ± 0.055
LysCys: 0.154 ± 0.013
LysAsp: 1.314 ± 0.036
LysGlu: 1.183 ± 0.038
LysPhe: 0.606 ± 0.022
LysGly: 1.871 ± 0.046
LysHis: 0.556 ± 0.019
LysIle: 1.048 ± 0.037
LysLys: 0.808 ± 0.038
LysLeu: 2.318 ± 0.045
LysMet: 0.442 ± 0.02
LysAsn: 0.58 ± 0.023
LysPro: 1.492 ± 0.041
LysGln: 1.077 ± 0.028
LysArg: 2.055 ± 0.041
LysSer: 1.191 ± 0.03
LysThr: 1.428 ± 0.039
LysVal: 1.716 ± 0.042
LysTrp: 0.249 ± 0.015
LysTyr: 0.53 ± 0.023
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.811 ± 0.13
LeuCys: 1.06 ± 0.032
LeuAsp: 7.084 ± 0.085
LeuGlu: 6.459 ± 0.08
LeuPhe: 4.001 ± 0.066
LeuGly: 9.262 ± 0.103
LeuHis: 2.461 ± 0.043
LeuIle: 5.188 ± 0.083
LeuLys: 2.783 ± 0.05
LeuLeu: 13.709 ± 0.18
LeuMet: 2.428 ± 0.049
LeuAsn: 2.428 ± 0.047
LeuPro: 6.5 ± 0.08
LeuGln: 4.495 ± 0.064
LeuArg: 9.1 ± 0.118
LeuSer: 6.581 ± 0.084
LeuThr: 5.751 ± 0.083
LeuVal: 8.211 ± 0.102
LeuTrp: 1.385 ± 0.042
LeuTyr: 2.486 ± 0.051
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.72 ± 0.043
MetCys: 0.123 ± 0.01
MetAsp: 1.256 ± 0.037
MetGlu: 0.998 ± 0.028
MetPhe: 0.529 ± 0.02
MetGly: 1.533 ± 0.039
MetHis: 0.499 ± 0.023
MetIle: 0.958 ± 0.029
MetLys: 0.669 ± 0.026
MetLeu: 2.521 ± 0.053
MetMet: 0.483 ± 0.023
MetAsn: 0.639 ± 0.023
MetPro: 1.417 ± 0.034
MetGln: 1.043 ± 0.029
MetArg: 1.669 ± 0.037
MetSer: 1.35 ± 0.033
MetThr: 1.432 ± 0.033
MetVal: 1.452 ± 0.035
MetTrp: 0.129 ± 0.01
MetTyr: 0.308 ± 0.015
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.796 ± 0.057
AsnCys: 0.262 ± 0.017
AsnAsp: 1.223 ± 0.035
AsnGlu: 1.028 ± 0.031
AsnPhe: 0.736 ± 0.028
AsnGly: 1.948 ± 0.044
AsnHis: 0.491 ± 0.022
AsnIle: 1.104 ± 0.034
AsnLys: 0.539 ± 0.019
AsnLeu: 2.631 ± 0.053
AsnMet: 0.477 ± 0.024
AsnAsn: 0.574 ± 0.027
AsnPro: 1.681 ± 0.04
AsnGln: 0.875 ± 0.029
AsnArg: 1.849 ± 0.036
AsnSer: 1.028 ± 0.034
AsnThr: 1.093 ± 0.038
AsnVal: 1.411 ± 0.039
AsnTrp: 0.342 ± 0.018
AsnTyr: 0.596 ± 0.028
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.928 ± 0.098
ProCys: 0.445 ± 0.019
ProAsp: 3.968 ± 0.068
ProGlu: 4.135 ± 0.083
ProPhe: 1.85 ± 0.042
ProGly: 4.918 ± 0.065
ProHis: 1.044 ± 0.032
ProIle: 2.101 ± 0.042
ProLys: 1.175 ± 0.033
ProLeu: 5.567 ± 0.078
ProMet: 1.272 ± 0.029
ProAsn: 1.175 ± 0.033
ProPro: 3.204 ± 0.063
ProGln: 1.916 ± 0.041
ProArg: 3.47 ± 0.052
ProSer: 2.823 ± 0.052
ProThr: 2.559 ± 0.047
ProVal: 4.394 ± 0.067
ProTrp: 0.873 ± 0.026
ProTyr: 1.251 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.651 ± 0.08
GlnCys: 0.317 ± 0.015
GlnAsp: 1.947 ± 0.039
GlnGlu: 1.788 ± 0.047
GlnPhe: 1.127 ± 0.031
GlnGly: 3.28 ± 0.053
GlnHis: 0.96 ± 0.026
GlnIle: 1.887 ± 0.036
GlnLys: 0.774 ± 0.028
GlnLeu: 4.129 ± 0.069
GlnMet: 0.85 ± 0.027
GlnAsn: 0.71 ± 0.027
GlnPro: 2.383 ± 0.049
GlnGln: 2.293 ± 0.058
GlnArg: 4.015 ± 0.066
GlnSer: 1.867 ± 0.049
GlnThr: 2.013 ± 0.044
GlnVal: 3.155 ± 0.054
GlnTrp: 0.528 ± 0.024
GlnTyr: 0.713 ± 0.026
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.383 ± 0.096
ArgCys: 0.809 ± 0.029
ArgAsp: 4.516 ± 0.069
ArgGlu: 4.541 ± 0.071
ArgPhe: 3.276 ± 0.059
ArgGly: 5.138 ± 0.067
ArgHis: 2.204 ± 0.048
ArgIle: 4.244 ± 0.072
ArgLys: 1.823 ± 0.04
ArgLeu: 10.73 ± 0.128
ArgMet: 1.883 ± 0.041
ArgAsn: 1.762 ± 0.042
ArgPro: 4.208 ± 0.07
ArgGln: 3.605 ± 0.061
ArgArg: 7.353 ± 0.108
ArgSer: 3.552 ± 0.064
ArgThr: 3.428 ± 0.057
ArgVal: 5.583 ± 0.082
ArgTrp: 1.301 ± 0.038
ArgTyr: 2.127 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.19 ± 0.076
SerCys: 0.504 ± 0.022
SerAsp: 2.872 ± 0.054
SerGlu: 2.614 ± 0.052
SerPhe: 1.635 ± 0.04
SerGly: 5.251 ± 0.087
SerHis: 1.082 ± 0.03
SerIle: 2.272 ± 0.049
SerLys: 1.177 ± 0.038
SerLeu: 5.469 ± 0.077
SerMet: 1.192 ± 0.032
SerAsn: 1.234 ± 0.039
SerPro: 2.921 ± 0.051
SerGln: 1.684 ± 0.04
SerArg: 3.795 ± 0.062
SerSer: 2.537 ± 0.058
SerThr: 2.526 ± 0.053
SerVal: 3.553 ± 0.06
SerTrp: 0.643 ± 0.023
SerTyr: 1.128 ± 0.03
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.875 ± 0.089
ThrCys: 0.468 ± 0.021
ThrAsp: 3.005 ± 0.063
ThrGlu: 2.618 ± 0.044
ThrPhe: 1.446 ± 0.031
ThrGly: 4.623 ± 0.075
ThrHis: 1.059 ± 0.033
ThrIle: 1.918 ± 0.045
ThrLys: 0.88 ± 0.03
ThrLeu: 6.34 ± 0.087
ThrMet: 0.854 ± 0.03
ThrAsn: 1.047 ± 0.037
ThrPro: 3.28 ± 0.059
ThrGln: 1.605 ± 0.036
ThrArg: 3.544 ± 0.068
ThrSer: 2.291 ± 0.05
ThrThr: 2.441 ± 0.063
ThrVal: 3.48 ± 0.066
ThrTrp: 0.648 ± 0.021
ThrTyr: 1.034 ± 0.036
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.243 ± 0.081
ValCys: 0.772 ± 0.026
ValAsp: 4.32 ± 0.065
ValGlu: 4.312 ± 0.063
ValPhe: 2.445 ± 0.057
ValGly: 5.259 ± 0.083
ValHis: 1.482 ± 0.036
ValIle: 3.45 ± 0.062
ValLys: 1.658 ± 0.045
ValLeu: 8.285 ± 0.105
ValMet: 1.546 ± 0.032
ValAsn: 1.729 ± 0.044
ValPro: 3.634 ± 0.057
ValGln: 2.466 ± 0.053
ValArg: 5.318 ± 0.07
ValSer: 3.852 ± 0.062
ValThr: 3.712 ± 0.097
ValVal: 5.453 ± 0.083
ValTrp: 0.865 ± 0.029
ValTyr: 1.559 ± 0.033
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.156 ± 0.032
TrpCys: 0.154 ± 0.01
TrpAsp: 0.677 ± 0.024
TrpGlu: 0.605 ± 0.02
TrpPhe: 0.54 ± 0.02
TrpGly: 0.912 ± 0.028
TrpHis: 0.366 ± 0.015
TrpIle: 0.66 ± 0.022
TrpLys: 0.354 ± 0.019
TrpLeu: 2.201 ± 0.05
TrpMet: 0.35 ± 0.018
TrpAsn: 0.373 ± 0.016
TrpPro: 0.746 ± 0.026
TrpGln: 0.793 ± 0.026
TrpArg: 1.288 ± 0.036
TrpSer: 0.822 ± 0.029
TrpThr: 0.639 ± 0.025
TrpVal: 0.954 ± 0.029
TrpTrp: 0.242 ± 0.017
TrpTyr: 0.352 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.501 ± 0.049
TyrCys: 0.281 ± 0.016
TyrAsp: 1.37 ± 0.037
TyrGlu: 1.096 ± 0.031
TyrPhe: 0.863 ± 0.027
TyrGly: 1.947 ± 0.043
TyrHis: 0.556 ± 0.023
TyrIle: 0.85 ± 0.026
TyrLys: 0.53 ± 0.02
TyrLeu: 2.858 ± 0.059
TyrMet: 0.416 ± 0.021
TyrAsn: 0.566 ± 0.025
TyrPro: 1.263 ± 0.038
TyrGln: 1.05 ± 0.033
TyrArg: 2.276 ± 0.046
TyrSer: 1.15 ± 0.033
TyrThr: 1.029 ± 0.029
TyrVal: 1.563 ± 0.04
TyrTrp: 0.403 ± 0.02
TyrTyr: 0.671 ± 0.024
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3686 proteins (1237040 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
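A protein-level bootstrap of this kind might look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled proteome, and the spread of the estimates is reported. The function names, the toy sequences, and the use of the standard deviation as the reported error are assumptions for illustration; the exact procedure used by Proteome-pI is not described on this page.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Mean and standard deviation (per mille) of one dipeptide frequency
    over proteomes resampled at the protein level (hypothetical helper)."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteins = ["MAALAG", "MKAAV", "MLLGAA", "MAGR"]  # made-up sequences
mean_freq, error = bootstrap_error(toy_proteins, ("A", "A"))
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.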

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski