Amino acid dipepetide frequency for Ephemeroptericola cinctiostellae

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.657AlaAla: 11.657 ± 0.2
1.17AlaCys: 1.17 ± 0.037
5.448AlaAsp: 5.448 ± 0.101
5.326AlaGlu: 5.326 ± 0.112
3.912AlaPhe: 3.912 ± 0.075
7.518AlaGly: 7.518 ± 0.136
3.085AlaHis: 3.085 ± 0.068
5.862AlaIle: 5.862 ± 0.135
4.653AlaLys: 4.653 ± 0.107
11.822AlaLeu: 11.822 ± 0.137
3.166AlaMet: 3.166 ± 0.074
3.463AlaAsn: 3.463 ± 0.077
4.107AlaPro: 4.107 ± 0.097
5.737AlaGln: 5.737 ± 0.111
5.232AlaArg: 5.232 ± 0.104
5.687AlaSer: 5.687 ± 0.108
5.854AlaThr: 5.854 ± 0.358
7.847AlaVal: 7.847 ± 0.156
1.412AlaTrp: 1.412 ± 0.048
2.753AlaTyr: 2.753 ± 0.066
0.0AlaXaa: 0.0 ± 0.0
Cys
1.139CysAla: 1.139 ± 0.044
0.149CysCys: 0.149 ± 0.015
0.589CysAsp: 0.589 ± 0.029
0.558CysGlu: 0.558 ± 0.029
0.365CysPhe: 0.365 ± 0.022
0.905CysGly: 0.905 ± 0.039
0.29CysHis: 0.29 ± 0.021
0.54CysIle: 0.54 ± 0.027
0.286CysLys: 0.286 ± 0.021
0.912CysLeu: 0.912 ± 0.034
0.231CysMet: 0.231 ± 0.018
0.3CysAsn: 0.3 ± 0.021
0.466CysPro: 0.466 ± 0.025
0.318CysGln: 0.318 ± 0.022
0.372CysArg: 0.372 ± 0.022
0.58CysSer: 0.58 ± 0.027
0.572CysThr: 0.572 ± 0.026
0.767CysVal: 0.767 ± 0.033
0.126CysTrp: 0.126 ± 0.014
0.259CysTyr: 0.259 ± 0.019
0.0CysXaa: 0.0 ± 0.0
Asp
6.718AspAla: 6.718 ± 0.149
0.506AspCys: 0.506 ± 0.025
3.302AspAsp: 3.302 ± 0.081
3.312AspGlu: 3.312 ± 0.076
2.286AspPhe: 2.286 ± 0.065
4.125AspGly: 4.125 ± 0.085
1.171AspHis: 1.171 ± 0.043
3.081AspIle: 3.081 ± 0.066
2.424AspLys: 2.424 ± 0.079
5.245AspLeu: 5.245 ± 0.092
1.564AspMet: 1.564 ± 0.047
2.957AspAsn: 2.957 ± 0.449
1.885AspPro: 1.885 ± 0.054
1.608AspGln: 1.608 ± 0.045
2.142AspArg: 2.142 ± 0.065
2.479AspSer: 2.479 ± 0.07
2.885AspThr: 2.885 ± 0.09
5.323AspVal: 5.323 ± 0.118
0.894AspTrp: 0.894 ± 0.037
1.648AspTyr: 1.648 ± 0.052
0.0AspXaa: 0.0 ± 0.0
Glu
6.35GluAla: 6.35 ± 0.345
0.46GluCys: 0.46 ± 0.025
2.142GluAsp: 2.142 ± 0.057
2.358GluGlu: 2.358 ± 0.076
1.999GluPhe: 1.999 ± 0.044
3.195GluGly: 3.195 ± 0.081
1.751GluHis: 1.751 ± 0.053
2.865GluIle: 2.865 ± 0.074
2.79GluLys: 2.79 ± 0.084
5.765GluLeu: 5.765 ± 0.186
1.46GluMet: 1.46 ± 0.046
1.896GluAsn: 1.896 ± 0.056
1.934GluPro: 1.934 ± 0.056
2.881GluGln: 2.881 ± 0.063
3.318GluArg: 3.318 ± 0.095
3.218GluSer: 3.218 ± 0.231
2.814GluThr: 2.814 ± 0.063
5.005GluVal: 5.005 ± 0.457
0.821GluTrp: 0.821 ± 0.029
1.367GluTyr: 1.367 ± 0.045
0.0GluXaa: 0.0 ± 0.0
Phe
3.885PheAla: 3.885 ± 0.073
0.464PheCys: 0.464 ± 0.025
2.713PheAsp: 2.713 ± 0.064
2.43PheGlu: 2.43 ± 0.068
1.741PhePhe: 1.741 ± 0.064
3.209PheGly: 3.209 ± 0.075
0.771PheHis: 0.771 ± 0.033
2.335PheIle: 2.335 ± 0.067
2.03PheLys: 2.03 ± 0.053
3.246PheLeu: 3.246 ± 0.082
1.099PheMet: 1.099 ± 0.046
1.866PheAsn: 1.866 ± 0.055
1.321PhePro: 1.321 ± 0.049
1.0PheGln: 1.0 ± 0.034
1.379PheArg: 1.379 ± 0.043
2.655PheSer: 2.655 ± 0.078
2.195PheThr: 2.195 ± 0.051
2.839PheVal: 2.839 ± 0.062
0.59PheTrp: 0.59 ± 0.031
1.24PheTyr: 1.24 ± 0.047
0.0PheXaa: 0.0 ± 0.0
Gly
6.848GlyAla: 6.848 ± 0.114
0.786GlyCys: 0.786 ± 0.034
3.794GlyAsp: 3.794 ± 0.104
3.839GlyGlu: 3.839 ± 0.076
3.331GlyPhe: 3.331 ± 0.075
5.608GlyGly: 5.608 ± 0.153
1.849GlyHis: 1.849 ± 0.049
4.049GlyIle: 4.049 ± 0.078
3.408GlyLys: 3.408 ± 0.073
7.544GlyLeu: 7.544 ± 0.128
2.294GlyMet: 2.294 ± 0.061
2.554GlyAsn: 2.554 ± 0.126
1.362GlyPro: 1.362 ± 0.049
3.1GlyGln: 3.1 ± 0.066
3.482GlyArg: 3.482 ± 0.085
3.952GlySer: 3.952 ± 0.081
3.714GlyThr: 3.714 ± 0.08
6.472GlyVal: 6.472 ± 0.106
1.058GlyTrp: 1.058 ± 0.042
2.125GlyTyr: 2.125 ± 0.054
0.0GlyXaa: 0.0 ± 0.0
His
2.938HisAla: 2.938 ± 0.069
0.303HisCys: 0.303 ± 0.019
1.472HisAsp: 1.472 ± 0.05
1.269HisGlu: 1.269 ± 0.04
1.149HisPhe: 1.149 ± 0.034
1.876HisGly: 1.876 ± 0.049
0.916HisHis: 0.916 ± 0.042
1.701HisIle: 1.701 ± 0.051
0.903HisLys: 0.903 ± 0.032
2.519HisLeu: 2.519 ± 0.074
0.702HisMet: 0.702 ± 0.03
0.99HisAsn: 0.99 ± 0.039
1.476HisPro: 1.476 ± 0.039
0.972HisGln: 0.972 ± 0.038
1.201HisArg: 1.201 ± 0.044
1.456HisSer: 1.456 ± 0.047
1.526HisThr: 1.526 ± 0.05
2.05HisVal: 2.05 ± 0.051
0.432HisTrp: 0.432 ± 0.027
0.803HisTyr: 0.803 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
6.448IleAla: 6.448 ± 0.104
0.554IleCys: 0.554 ± 0.032
3.664IleAsp: 3.664 ± 0.079
4.112IleGlu: 4.112 ± 0.096
1.839IlePhe: 1.839 ± 0.056
4.503IleGly: 4.503 ± 0.088
1.347IleHis: 1.347 ± 0.042
2.876IleIle: 2.876 ± 0.073
2.919IleLys: 2.919 ± 0.084
4.588IleLeu: 4.588 ± 0.09
1.325IleMet: 1.325 ± 0.045
2.535IleAsn: 2.535 ± 0.075
2.287IlePro: 2.287 ± 0.058
1.943IleGln: 1.943 ± 0.051
2.483IleArg: 2.483 ± 0.064
3.24IleSer: 3.24 ± 0.07
3.17IleThr: 3.17 ± 0.071
4.008IleVal: 4.008 ± 0.084
0.571IleTrp: 0.571 ± 0.028
1.321IleTyr: 1.321 ± 0.044
0.0IleXaa: 0.0 ± 0.0
Lys
4.436LysAla: 4.436 ± 0.112
0.256LysCys: 0.256 ± 0.017
2.379LysAsp: 2.379 ± 0.067
2.235LysGlu: 2.235 ± 0.075
1.349LysPhe: 1.349 ± 0.043
3.082LysGly: 3.082 ± 0.075
1.289LysHis: 1.289 ± 0.04
2.789LysIle: 2.789 ± 0.073
2.62LysLys: 2.62 ± 0.071
3.968LysLeu: 3.968 ± 0.079
1.537LysMet: 1.537 ± 0.05
2.206LysAsn: 2.206 ± 0.068
2.893LysPro: 2.893 ± 0.285
1.964LysGln: 1.964 ± 0.055
2.339LysArg: 2.339 ± 0.069
2.671LysSer: 2.671 ± 0.059
3.104LysThr: 3.104 ± 0.067
3.25LysVal: 3.25 ± 0.075
0.501LysTrp: 0.501 ± 0.028
1.133LysTyr: 1.133 ± 0.041
0.0LysXaa: 0.0 ± 0.0
Leu
10.452LeuAla: 10.452 ± 0.14
1.098LeuCys: 1.098 ± 0.038
5.14LeuAsp: 5.14 ± 0.089
4.615LeuGlu: 4.615 ± 0.092
3.914LeuPhe: 3.914 ± 0.084
7.04LeuGly: 7.04 ± 0.116
2.378LeuHis: 2.378 ± 0.066
5.666LeuIle: 5.666 ± 0.103
5.792LeuLys: 5.792 ± 0.301
9.857LeuLeu: 9.857 ± 0.159
2.929LeuMet: 2.929 ± 0.067
4.731LeuAsn: 4.731 ± 0.092
5.283LeuPro: 5.283 ± 0.116
3.642LeuGln: 3.642 ± 0.087
4.912LeuArg: 4.912 ± 0.091
7.299LeuSer: 7.299 ± 0.101
6.068LeuThr: 6.068 ± 0.093
6.401LeuVal: 6.401 ± 0.107
1.21LeuTrp: 1.21 ± 0.045
2.421LeuTyr: 2.421 ± 0.066
0.0LeuXaa: 0.0 ± 0.0
Met
3.068MetAla: 3.068 ± 0.079
0.238MetCys: 0.238 ± 0.017
1.401MetAsp: 1.401 ± 0.05
1.004MetGlu: 1.004 ± 0.04
0.976MetPhe: 0.976 ± 0.042
2.249MetGly: 2.249 ± 0.061
0.726MetHis: 0.726 ± 0.03
1.272MetIle: 1.272 ± 0.046
1.299MetLys: 1.299 ± 0.045
2.896MetLeu: 2.896 ± 0.079
0.969MetMet: 0.969 ± 0.048
1.289MetAsn: 1.289 ± 0.039
1.477MetPro: 1.477 ± 0.05
1.281MetGln: 1.281 ± 0.04
1.677MetArg: 1.677 ± 0.051
2.082MetSer: 2.082 ± 0.054
1.848MetThr: 1.848 ± 0.054
1.995MetVal: 1.995 ± 0.058
0.316MetTrp: 0.316 ± 0.019
0.576MetTyr: 0.576 ± 0.03
0.0MetXaa: 0.0 ± 0.0
Asn
4.35AsnAla: 4.35 ± 0.101
0.344AsnCys: 0.344 ± 0.024
2.955AsnAsp: 2.955 ± 0.14
2.908AsnGlu: 2.908 ± 0.345
1.325AsnPhe: 1.325 ± 0.041
3.055AsnGly: 3.055 ± 0.069
1.006AsnHis: 1.006 ± 0.034
2.245AsnIle: 2.245 ± 0.063
1.67AsnLys: 1.67 ± 0.051
3.45AsnLeu: 3.45 ± 0.074
1.01AsnMet: 1.01 ± 0.035
1.682AsnAsn: 1.682 ± 0.064
2.228AsnPro: 2.228 ± 0.061
1.644AsnGln: 1.644 ± 0.05
1.705AsnArg: 1.705 ± 0.052
2.03AsnSer: 2.03 ± 0.056
2.317AsnThr: 2.317 ± 0.057
2.912AsnVal: 2.912 ± 0.096
0.571AsnTrp: 0.571 ± 0.031
1.009AsnTyr: 1.009 ± 0.038
0.0AsnXaa: 0.0 ± 0.0
Pro
4.376ProAla: 4.376 ± 0.092
0.338ProCys: 0.338 ± 0.024
2.357ProAsp: 2.357 ± 0.06
2.796ProGlu: 2.796 ± 0.079
1.771ProPhe: 1.771 ± 0.047
1.58ProGly: 1.58 ± 0.046
1.226ProHis: 1.226 ± 0.041
2.492ProIle: 2.492 ± 0.064
1.955ProLys: 1.955 ± 0.058
4.184ProLeu: 4.184 ± 0.091
1.262ProMet: 1.262 ± 0.042
1.955ProAsn: 1.955 ± 0.06
1.427ProPro: 1.427 ± 0.047
1.593ProGln: 1.593 ± 0.053
1.794ProArg: 1.794 ± 0.059
2.64ProSer: 2.64 ± 0.061
2.567ProThr: 2.567 ± 0.072
4.165ProVal: 4.165 ± 0.323
0.53ProTrp: 0.53 ± 0.028
1.272ProTyr: 1.272 ± 0.044
0.0ProXaa: 0.0 ± 0.0
Gln
4.758GlnAla: 4.758 ± 0.098
0.357GlnCys: 0.357 ± 0.023
1.671GlnAsp: 1.671 ± 0.053
1.702GlnGlu: 1.702 ± 0.053
1.602GlnPhe: 1.602 ± 0.054
2.706GlnGly: 2.706 ± 0.059
1.34GlnHis: 1.34 ± 0.046
2.207GlnIle: 2.207 ± 0.061
1.854GlnLys: 1.854 ± 0.052
4.423GlnLeu: 4.423 ± 0.089
1.274GlnMet: 1.274 ± 0.04
1.577GlnAsn: 1.577 ± 0.047
1.696GlnPro: 1.696 ± 0.048
1.905GlnGln: 1.905 ± 0.053
2.352GlnArg: 2.352 ± 0.065
2.259GlnSer: 2.259 ± 0.059
2.535GlnThr: 2.535 ± 0.059
3.098GlnVal: 3.098 ± 0.071
0.591GlnTrp: 0.591 ± 0.029
1.151GlnTyr: 1.151 ± 0.042
0.0GlnXaa: 0.0 ± 0.0
Arg
4.934ArgAla: 4.934 ± 0.097
0.423ArgCys: 0.423 ± 0.025
2.691ArgAsp: 2.691 ± 0.077
2.918ArgGlu: 2.918 ± 0.069
2.086ArgPhe: 2.086 ± 0.051
3.084ArgGly: 3.084 ± 0.079
1.361ArgHis: 1.361 ± 0.052
3.049ArgIle: 3.049 ± 0.069
1.87ArgLys: 1.87 ± 0.059
5.195ArgLeu: 5.195 ± 0.101
1.491ArgMet: 1.491 ± 0.042
1.742ArgAsn: 1.742 ± 0.059
1.844ArgPro: 1.844 ± 0.061
1.964ArgGln: 1.964 ± 0.054
2.564ArgArg: 2.564 ± 0.073
2.481ArgSer: 2.481 ± 0.066
2.619ArgThr: 2.619 ± 0.068
3.767ArgVal: 3.767 ± 0.077
0.728ArgTrp: 0.728 ± 0.032
1.575ArgTyr: 1.575 ± 0.047
0.0ArgXaa: 0.0 ± 0.0
Ser
5.648SerAla: 5.648 ± 0.112
0.508SerCys: 0.508 ± 0.025
3.019SerAsp: 3.019 ± 0.067
2.933SerGlu: 2.933 ± 0.07
2.309SerPhe: 2.309 ± 0.066
4.82SerGly: 4.82 ± 0.084
1.588SerHis: 1.588 ± 0.039
3.46SerIle: 3.46 ± 0.076
2.537SerLys: 2.537 ± 0.061
5.55SerLeu: 5.55 ± 0.086
1.688SerMet: 1.688 ± 0.05
2.488SerAsn: 2.488 ± 0.068
2.629SerPro: 2.629 ± 0.059
2.146SerGln: 2.146 ± 0.055
2.723SerArg: 2.723 ± 0.061
3.762SerSer: 3.762 ± 0.091
4.196SerThr: 4.196 ± 0.226
4.14SerVal: 4.14 ± 0.078
0.733SerTrp: 0.733 ± 0.036
1.539SerTyr: 1.539 ± 0.05
0.0SerXaa: 0.0 ± 0.0
Thr
6.176ThrAla: 6.176 ± 0.15
0.518ThrCys: 0.518 ± 0.027
2.992ThrAsp: 2.992 ± 0.072
2.985ThrGlu: 2.985 ± 0.082
1.991ThrPhe: 1.991 ± 0.055
4.24ThrGly: 4.24 ± 0.096
1.741ThrHis: 1.741 ± 0.051
3.081ThrIle: 3.081 ± 0.075
1.965ThrLys: 1.965 ± 0.058
6.265ThrLeu: 6.265 ± 0.088
1.299ThrMet: 1.299 ± 0.041
1.79ThrAsn: 1.79 ± 0.051
3.282ThrPro: 3.282 ± 0.089
2.541ThrGln: 2.541 ± 0.06
2.637ThrArg: 2.637 ± 0.062
3.099ThrSer: 3.099 ± 0.069
3.175ThrThr: 3.175 ± 0.07
5.362ThrVal: 5.362 ± 0.47
0.733ThrTrp: 0.733 ± 0.03
1.415ThrTyr: 1.415 ± 0.051
0.0ThrXaa: 0.0 ± 0.0
Val
7.673ValAla: 7.673 ± 0.119
0.804ValCys: 0.804 ± 0.035
5.582ValAsp: 5.582 ± 0.574
5.334ValGlu: 5.334 ± 0.655
3.13ValPhe: 3.13 ± 0.074
5.356ValGly: 5.356 ± 0.115
1.722ValHis: 1.722 ± 0.051
4.21ValIle: 4.21 ± 0.085
3.499ValLys: 3.499 ± 0.078
8.93ValLeu: 8.93 ± 0.297
2.238ValMet: 2.238 ± 0.063
2.916ValAsn: 2.916 ± 0.079
2.997ValPro: 2.997 ± 0.074
2.897ValGln: 2.897 ± 0.074
3.644ValArg: 3.644 ± 0.08
4.629ValSer: 4.629 ± 0.083
4.054ValThr: 4.054 ± 0.089
6.3ValVal: 6.3 ± 0.127
1.009ValTrp: 1.009 ± 0.042
1.826ValTyr: 1.826 ± 0.048
0.0ValXaa: 0.0 ± 0.0
Trp
1.164TrpAla: 1.164 ± 0.042
0.149TrpCys: 0.149 ± 0.014
0.657TrpAsp: 0.657 ± 0.03
0.477TrpGlu: 0.477 ± 0.025
0.626TrpPhe: 0.626 ± 0.03
0.869TrpGly: 0.869 ± 0.039
0.437TrpHis: 0.437 ± 0.025
0.642TrpIle: 0.642 ± 0.032
0.398TrpLys: 0.398 ± 0.025
1.919TrpLeu: 1.919 ± 0.066
0.442TrpMet: 0.442 ± 0.025
0.45TrpAsn: 0.45 ± 0.026
0.463TrpPro: 0.463 ± 0.023
0.884TrpGln: 0.884 ± 0.034
0.914TrpArg: 0.914 ± 0.035
0.675TrpSer: 0.675 ± 0.027
0.58TrpThr: 0.58 ± 0.031
1.166TrpVal: 1.166 ± 0.039
0.22TrpTrp: 0.22 ± 0.018
0.334TrpTyr: 0.334 ± 0.024
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.692TyrAla: 2.692 ± 0.069
0.299TyrCys: 0.299 ± 0.021
1.59TyrAsp: 1.59 ± 0.052
1.28TyrGlu: 1.28 ± 0.045
1.307TyrPhe: 1.307 ± 0.052
2.152TyrGly: 2.152 ± 0.059
0.662TyrHis: 0.662 ± 0.028
1.309TyrIle: 1.309 ± 0.043
1.128TyrLys: 1.128 ± 0.042
2.571TyrLeu: 2.571 ± 0.066
0.653TyrMet: 0.653 ± 0.032
1.031TyrAsn: 1.031 ± 0.037
1.22TyrPro: 1.22 ± 0.037
1.039TyrGln: 1.039 ± 0.037
1.514TyrArg: 1.514 ± 0.05
1.522TyrSer: 1.522 ± 0.047
1.495TyrThr: 1.495 ± 0.046
1.884TyrVal: 1.884 ± 0.056
0.418TyrTrp: 0.418 ± 0.024
0.745TyrTyr: 0.745 ± 0.036
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2476 proteins (776007 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski