Amino acid dipeptide frequency for Urbifossiella limnaea

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
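As a rough illustration of what such a frequency means, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the use of one-letter codes are assumptions made for this example; the page does not describe the exact counting procedure used by the database.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs.

    Hypothetical helper: assumes one-letter amino acid codes and overlapping
    pairs counted inside each protein (pairs never span two proteins).
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy "proteome" (made-up sequences, not from Urbifossiella limnaea).
toy_proteome = ["MAAL", "AAG"]
freqs = dipeptide_per_mille(toy_proteome)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

In the toy input there are five adjacent pairs in total, two of which are AA, so AA comes out at 400‰.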

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
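The arithmetic can be sketched as follows: a uniform expectation of 1/400 corresponds to 2.5‰, and a table value is converted to a fraction by multiplying by 10^-3. The names `EXPECTED_PER_MILLE`, `per_mille_to_fraction`, and `classify` are hypothetical, and the assumption that the red/blue marking uses exactly this uniform threshold is mine; the two example values (AlaAla and CysCys) are taken from the table.

```python
# Uniform expectation over the 20 x 20 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation (assumed threshold)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Two values copied from the table: AlaAla and CysCys.
examples = {"AlaAla": 24.536, "CysCys": 0.166}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), classify(value))
```

Under this assumption AlaAla is overrepresented and CysCys is underrepresented relative to the uniform 2.5‰ baseline.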

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by the first residue of the dipeptide; each line gives the dipeptide, its frequency (‰), and the estimated error.
Ala
AlaAla: 24.536 ± 0.21
AlaCys: 1.222 ± 0.03
AlaAsp: 9.751 ± 0.085
AlaGlu: 7.828 ± 0.085
AlaPhe: 4.394 ± 0.049
AlaGly: 13.556 ± 0.125
AlaHis: 2.103 ± 0.031
AlaIle: 3.012 ± 0.041
AlaLys: 4.365 ± 0.057
AlaLeu: 11.862 ± 0.096
AlaMet: 1.897 ± 0.031
AlaAsn: 2.706 ± 0.05
AlaPro: 7.215 ± 0.086
AlaGln: 2.872 ± 0.038
AlaArg: 9.707 ± 0.101
AlaSer: 4.631 ± 0.09
AlaThr: 6.956 ± 0.135
AlaVal: 12.692 ± 0.099
AlaTrp: 1.861 ± 0.036
AlaTyr: 2.489 ± 0.039
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.893 ± 0.024
CysCys: 0.166 ± 0.011
CysAsp: 0.571 ± 0.017
CysGlu: 0.49 ± 0.015
CysPhe: 0.313 ± 0.011
CysGly: 1.093 ± 0.027
CysHis: 0.412 ± 0.017
CysIle: 0.195 ± 0.01
CysLys: 0.245 ± 0.012
CysLeu: 0.884 ± 0.021
CysMet: 0.133 ± 0.008
CysAsn: 0.19 ± 0.008
CysPro: 0.615 ± 0.018
CysGln: 0.253 ± 0.013
CysArg: 0.82 ± 0.022
CysSer: 0.379 ± 0.014
CysThr: 0.444 ± 0.015
CysVal: 0.815 ± 0.023
CysTrp: 0.162 ± 0.009
CysTyr: 0.249 ± 0.01
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.449 ± 0.073
AspCys: 0.413 ± 0.015
AspAsp: 3.759 ± 0.058
AspGlu: 3.26 ± 0.044
AspPhe: 2.03 ± 0.032
AspGly: 6.49 ± 0.092
AspHis: 1.348 ± 0.03
AspIle: 1.599 ± 0.03
AspLys: 1.654 ± 0.031
AspLeu: 6.09 ± 0.06
AspMet: 0.754 ± 0.021
AspAsn: 1.089 ± 0.022
AspPro: 4.952 ± 0.054
AspGln: 1.455 ± 0.027
AspArg: 5.381 ± 0.064
AspSer: 1.912 ± 0.036
AspThr: 2.936 ± 0.049
AspVal: 4.727 ± 0.049
AspTrp: 1.019 ± 0.023
AspTyr: 1.421 ± 0.031
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.48 ± 0.072
GluCys: 0.418 ± 0.013
GluAsp: 1.936 ± 0.035
GluGlu: 2.253 ± 0.037
GluPhe: 2.024 ± 0.03
GluGly: 3.25 ± 0.045
GluHis: 1.093 ± 0.023
GluIle: 1.509 ± 0.03
GluLys: 2.152 ± 0.042
GluLeu: 6.102 ± 0.077
GluMet: 0.938 ± 0.021
GluAsn: 1.072 ± 0.023
GluPro: 3.36 ± 0.045
GluGln: 1.825 ± 0.033
GluArg: 4.3 ± 0.058
GluSer: 2.099 ± 0.032
GluThr: 2.382 ± 0.042
GluVal: 4.552 ± 0.049
GluTrp: 0.836 ± 0.023
GluTyr: 1.327 ± 0.024
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.669 ± 0.053
PheCys: 0.36 ± 0.013
PheAsp: 2.762 ± 0.038
PheGlu: 1.839 ± 0.034
PhePhe: 1.166 ± 0.027
PheGly: 3.577 ± 0.05
PheHis: 0.752 ± 0.021
PheIle: 0.717 ± 0.019
PheLys: 0.868 ± 0.022
PheLeu: 3.4 ± 0.042
PheMet: 0.37 ± 0.013
PheAsn: 0.918 ± 0.026
PhePro: 1.915 ± 0.03
PheGln: 0.893 ± 0.023
PheArg: 2.86 ± 0.039
PheSer: 1.52 ± 0.031
PheThr: 2.314 ± 0.056
PheVal: 3.083 ± 0.037
PheTrp: 0.474 ± 0.016
PheTyr: 0.778 ± 0.019
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.29 ± 0.116
GlyCys: 1.041 ± 0.024
GlyAsp: 5.021 ± 0.056
GlyGlu: 4.367 ± 0.052
GlyPhe: 3.463 ± 0.044
GlyGly: 9.94 ± 0.158
GlyHis: 1.823 ± 0.033
GlyIle: 2.624 ± 0.041
GlyLys: 4.015 ± 0.061
GlyLeu: 8.499 ± 0.074
GlyMet: 1.885 ± 0.035
GlyAsn: 2.284 ± 0.048
GlyPro: 5.297 ± 0.063
GlyGln: 2.48 ± 0.044
GlyArg: 7.677 ± 0.094
GlySer: 4.605 ± 0.085
GlyThr: 5.863 ± 0.135
GlyVal: 7.992 ± 0.076
GlyTrp: 1.758 ± 0.032
GlyTyr: 2.423 ± 0.039
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.462 ± 0.042
HisCys: 0.233 ± 0.01
HisAsp: 1.224 ± 0.03
HisGlu: 0.923 ± 0.023
HisPhe: 0.771 ± 0.018
HisGly: 1.909 ± 0.034
HisHis: 0.624 ± 0.017
HisIle: 0.45 ± 0.015
HisLys: 0.509 ± 0.017
HisLeu: 2.147 ± 0.035
HisMet: 0.258 ± 0.01
HisAsn: 0.451 ± 0.015
HisPro: 1.74 ± 0.034
HisGln: 0.505 ± 0.016
HisArg: 1.607 ± 0.033
HisSer: 0.751 ± 0.02
HisThr: 1.116 ± 0.021
HisVal: 1.684 ± 0.028
HisTrp: 0.382 ± 0.014
HisTyr: 0.537 ± 0.015
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.406 ± 0.043
IleCys: 0.238 ± 0.01
IleAsp: 2.396 ± 0.034
IleGlu: 1.619 ± 0.031
IlePhe: 0.717 ± 0.021
IleGly: 2.701 ± 0.048
IleHis: 0.599 ± 0.015
IleIle: 0.72 ± 0.024
IleLys: 0.78 ± 0.021
IleLeu: 2.458 ± 0.036
IleMet: 0.293 ± 0.011
IleAsn: 0.703 ± 0.024
IlePro: 1.837 ± 0.031
IleGln: 0.691 ± 0.018
IleArg: 2.171 ± 0.03
IleSer: 1.118 ± 0.03
IleThr: 1.721 ± 0.046
IleVal: 2.108 ± 0.037
IleTrp: 0.267 ± 0.013
IleTyr: 0.545 ± 0.016
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.44 ± 0.055
LysCys: 0.267 ± 0.011
LysAsp: 1.938 ± 0.036
LysGlu: 1.659 ± 0.035
LysPhe: 0.923 ± 0.019
LysGly: 2.589 ± 0.048
LysHis: 0.695 ± 0.018
LysIle: 0.882 ± 0.022
LysLys: 1.567 ± 0.033
LysLeu: 3.768 ± 0.058
LysMet: 0.704 ± 0.022
LysAsn: 0.785 ± 0.02
LysPro: 2.169 ± 0.04
LysGln: 1.099 ± 0.028
LysArg: 2.213 ± 0.036
LysSer: 1.399 ± 0.027
LysThr: 1.909 ± 0.037
LysVal: 2.767 ± 0.04
LysTrp: 0.471 ± 0.016
LysTyr: 0.816 ± 0.019
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.693 ± 0.126
LeuCys: 0.885 ± 0.02
LeuAsp: 6.279 ± 0.052
LeuGlu: 4.19 ± 0.052
LeuPhe: 3.323 ± 0.043
LeuGly: 8.252 ± 0.064
LeuHis: 1.83 ± 0.03
LeuIle: 2.783 ± 0.041
LeuLys: 3.456 ± 0.052
LeuLeu: 9.903 ± 0.101
LeuMet: 1.481 ± 0.028
LeuAsn: 2.254 ± 0.036
LeuPro: 6.6 ± 0.067
LeuGln: 2.024 ± 0.035
LeuArg: 7.578 ± 0.075
LeuSer: 4.438 ± 0.049
LeuThr: 6.701 ± 0.084
LeuVal: 8.395 ± 0.084
LeuTrp: 1.253 ± 0.03
LeuTyr: 2.058 ± 0.035
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.892 ± 0.031
MetCys: 0.145 ± 0.008
MetAsp: 0.685 ± 0.018
MetGlu: 0.615 ± 0.018
MetPhe: 0.486 ± 0.015
MetGly: 1.21 ± 0.029
MetHis: 0.313 ± 0.011
MetIle: 0.576 ± 0.017
MetLys: 0.634 ± 0.016
MetLeu: 1.509 ± 0.025
MetMet: 0.301 ± 0.012
MetAsn: 0.473 ± 0.015
MetPro: 1.349 ± 0.031
MetGln: 0.394 ± 0.015
MetArg: 1.218 ± 0.026
MetSer: 1.031 ± 0.023
MetThr: 1.416 ± 0.028
MetVal: 1.113 ± 0.027
MetTrp: 0.197 ± 0.01
MetTyr: 0.297 ± 0.012
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.46 ± 0.052
AsnCys: 0.244 ± 0.011
AsnAsp: 1.335 ± 0.042
AsnGlu: 0.945 ± 0.022
AsnPhe: 0.815 ± 0.021
AsnGly: 2.382 ± 0.049
AsnHis: 0.506 ± 0.017
AsnIle: 0.657 ± 0.021
AsnLys: 0.631 ± 0.019
AsnLeu: 2.335 ± 0.043
AsnMet: 0.347 ± 0.014
AsnAsn: 0.662 ± 0.025
AsnPro: 2.204 ± 0.03
AsnGln: 0.694 ± 0.029
AsnArg: 1.795 ± 0.032
AsnSer: 0.956 ± 0.027
AsnThr: 1.301 ± 0.046
AsnVal: 1.897 ± 0.045
AsnTrp: 0.388 ± 0.014
AsnTyr: 0.596 ± 0.018
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 10.727 ± 0.095
ProCys: 0.393 ± 0.015
ProAsp: 5.088 ± 0.062
ProGlu: 3.756 ± 0.054
ProPhe: 2.192 ± 0.032
ProGly: 6.591 ± 0.064
ProHis: 1.299 ± 0.028
ProIle: 1.56 ± 0.026
ProLys: 2.224 ± 0.047
ProLeu: 5.25 ± 0.059
ProMet: 0.961 ± 0.028
ProAsn: 1.66 ± 0.032
ProPro: 5.9 ± 0.097
ProGln: 1.492 ± 0.03
ProArg: 4.212 ± 0.058
ProSer: 2.641 ± 0.036
ProThr: 4.339 ± 0.056
ProVal: 5.661 ± 0.063
ProTrp: 0.81 ± 0.021
ProTyr: 1.191 ± 0.025
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.276 ± 0.044
GlnCys: 0.228 ± 0.011
GlnAsp: 1.073 ± 0.027
GlnGlu: 1.135 ± 0.023
GlnPhe: 1.067 ± 0.02
GlnGly: 1.684 ± 0.031
GlnHis: 0.551 ± 0.017
GlnIle: 0.893 ± 0.021
GlnLys: 0.987 ± 0.024
GlnLeu: 3.037 ± 0.041
GlnMet: 0.488 ± 0.015
GlnAsn: 0.641 ± 0.021
GlnPro: 1.999 ± 0.039
GlnGln: 0.946 ± 0.023
GlnArg: 1.743 ± 0.029
GlnSer: 1.115 ± 0.025
GlnThr: 1.464 ± 0.03
GlnVal: 2.29 ± 0.038
GlnTrp: 0.38 ± 0.014
GlnTyr: 0.647 ± 0.016
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.147 ± 0.091
ArgCys: 0.719 ± 0.018
ArgAsp: 4.538 ± 0.057
ArgGlu: 4.258 ± 0.054
ArgPhe: 3.173 ± 0.048
ArgGly: 5.948 ± 0.068
ArgHis: 1.755 ± 0.037
ArgIle: 2.19 ± 0.035
ArgLys: 2.239 ± 0.038
ArgLeu: 8.638 ± 0.084
ArgMet: 1.557 ± 0.03
ArgAsn: 1.598 ± 0.027
ArgPro: 5.351 ± 0.065
ArgGln: 2.256 ± 0.033
ArgArg: 6.844 ± 0.091
ArgSer: 3.081 ± 0.042
ArgThr: 4.358 ± 0.05
ArgVal: 7.318 ± 0.073
ArgTrp: 1.365 ± 0.026
ArgTyr: 1.96 ± 0.031
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.08 ± 0.082
SerCys: 0.4 ± 0.014
SerAsp: 2.361 ± 0.044
SerGlu: 1.741 ± 0.031
SerPhe: 1.612 ± 0.033
SerGly: 4.481 ± 0.085
SerHis: 0.901 ± 0.018
SerIle: 1.285 ± 0.046
SerLys: 1.245 ± 0.029
SerLeu: 3.926 ± 0.043
SerMet: 0.717 ± 0.018
SerAsn: 1.115 ± 0.047
SerPro: 3.244 ± 0.049
SerGln: 1.031 ± 0.021
SerArg: 3.136 ± 0.037
SerSer: 1.952 ± 0.06
SerThr: 2.234 ± 0.06
SerVal: 3.534 ± 0.056
SerTrp: 0.665 ± 0.018
SerTyr: 1.03 ± 0.022
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.924 ± 0.145
ThrCys: 0.505 ± 0.018
ThrAsp: 3.732 ± 0.052
ThrGlu: 2.407 ± 0.034
ThrPhe: 2.273 ± 0.053
ThrGly: 6.154 ± 0.102
ThrHis: 1.209 ± 0.028
ThrIle: 1.934 ± 0.061
ThrLys: 1.597 ± 0.029
ThrLeu: 5.661 ± 0.073
ThrMet: 0.738 ± 0.018
ThrAsn: 1.448 ± 0.046
ThrPro: 4.769 ± 0.055
ThrGln: 1.325 ± 0.029
ThrArg: 4.093 ± 0.049
ThrSer: 2.256 ± 0.073
ThrThr: 3.594 ± 0.094
ThrVal: 5.542 ± 0.139
ThrTrp: 0.79 ± 0.021
ThrTyr: 1.288 ± 0.033
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.48 ± 0.1
ValCys: 1.012 ± 0.024
ValAsp: 3.815 ± 0.046
ValGlu: 4.71 ± 0.053
ValPhe: 3.071 ± 0.038
ValGly: 7.486 ± 0.089
ValHis: 1.488 ± 0.03
ValIle: 2.747 ± 0.035
ValLys: 2.735 ± 0.044
ValLeu: 8.669 ± 0.078
ValMet: 1.434 ± 0.028
ValAsn: 2.131 ± 0.052
ValPro: 5.326 ± 0.059
ValGln: 2.116 ± 0.035
ValArg: 7.577 ± 0.06
ValSer: 4.05 ± 0.072
ValThr: 6.022 ± 0.141
ValVal: 8.871 ± 0.069
ValTrp: 1.417 ± 0.034
ValTyr: 2.106 ± 0.036
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.732 ± 0.034
TrpCys: 0.179 ± 0.008
TrpAsp: 1.017 ± 0.027
TrpGlu: 0.757 ± 0.019
TrpPhe: 0.564 ± 0.015
TrpGly: 1.066 ± 0.025
TrpHis: 0.39 ± 0.014
TrpIle: 0.305 ± 0.014
TrpLys: 0.546 ± 0.016
TrpLeu: 1.739 ± 0.034
TrpMet: 0.292 ± 0.011
TrpAsn: 0.43 ± 0.014
TrpPro: 0.783 ± 0.02
TrpGln: 0.531 ± 0.016
TrpArg: 1.164 ± 0.025
TrpSer: 0.733 ± 0.019
TrpThr: 0.831 ± 0.022
TrpVal: 1.431 ± 0.028
TrpTrp: 0.277 ± 0.012
TrpTyr: 0.371 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.618 ± 0.033
TyrCys: 0.255 ± 0.01
TyrAsp: 1.456 ± 0.031
TyrGlu: 1.155 ± 0.026
TyrPhe: 0.858 ± 0.018
TyrGly: 1.987 ± 0.031
TyrHis: 0.592 ± 0.018
TyrIle: 0.446 ± 0.016
TyrLys: 0.612 ± 0.016
TyrLeu: 2.501 ± 0.035
TyrMet: 0.304 ± 0.01
TyrAsn: 0.541 ± 0.016
TyrPro: 1.327 ± 0.029
TyrGln: 0.76 ± 0.019
TyrArg: 2.199 ± 0.035
TyrSer: 1.022 ± 0.023
TyrThr: 1.339 ± 0.037
TyrVal: 1.79 ± 0.031
TyrTrp: 0.368 ± 0.013
TyrTyr: 0.632 ± 0.017
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6172 proteins (2292864 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
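A minimal sketch of protein-level bootstrapping is given below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicates serves as the error. The function names, the toy sequences, the one-letter codes, and the choice of the sample standard deviation as the reported error are all assumptions; the page only states that bootstrapping with 100 replicates at the protein level was used.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the error is the sample standard deviation across replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (made-up sequences) to exercise the functions.
toy = ["MAAL", "AAG", "MKV", "GAAV", "LLK"]
print(pair_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.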

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski