Amino acid dipeptide frequency for Thermomonospora echinospora

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
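The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function names `dipeptide_permille` and `classify`, the one-letter alphabet, and the toy sequences are all hypothetical, and the uniform expectation of 1/400 is simply the baseline mentioned in the text.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2, never spanning two proteins.
        for a, b in zip(seq, seq[1:]):
            if a not in alphabet:
                a = "X"  # assumption: any non-standard residue becomes Xaa
            if b not in alphabet:
                b = "X"
            counts[a + b] += 1
    total = sum(counts.values())
    letters = alphabet + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(letters, repeat=2)}

def classify(freq_permille, n_dipeptides=400):
    """Compare a frequency with the uniform expectation of 1000 / n per mille."""
    expected = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"

toy = ["AAAG", "GAAL"]  # made-up sequences, not real proteins
freqs = dipeptide_permille(toy)
print(len(freqs))       # 441 keys, including the Xaa combinations
print(freqs["AA"])      # 3 of the 6 dipeptides, so 500.0
print(classify(21.2), classify(0.105))  # AlaAla and CysCys values from the table
```

With the table values, AlaAla (21.2‰) would be classified as overrepresented and CysCys (0.105‰) as underrepresented relative to the uniform 2.5‰ baseline.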

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
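As a small worked example of the unit conversion, the snippet below turns the AlaAla value from the table into a fraction and compares it with the uniform expectation of 1/400. The helper name `permille_to_fraction` is hypothetical and only serves the illustration.

```python
def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

ala_ala = permille_to_fraction(21.2)  # AlaAla value taken from the table
uniform = 1 / 400                     # uniform expectation for 400 dipeptides

print(round(ala_ala, 6))              # 0.0212
print(round(ala_ala / uniform, 2))    # 8.48 times the uniform expectation
```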

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
21.2AlaAla: 21.2 ± 0.146
1.163AlaCys: 1.163 ± 0.025
8.346AlaAsp: 8.346 ± 0.058
9.285AlaGlu: 9.285 ± 0.074
3.501AlaPhe: 3.501 ± 0.042
13.752AlaGly: 13.752 ± 0.092
2.725AlaHis: 2.725 ± 0.038
3.908AlaIle: 3.908 ± 0.046
2.496AlaLys: 2.496 ± 0.039
14.096AlaLeu: 14.096 ± 0.089
2.669AlaMet: 2.669 ± 0.033
1.771AlaAsn: 1.771 ± 0.027
6.674AlaPro: 6.674 ± 0.062
3.533AlaGln: 3.533 ± 0.041
11.124AlaArg: 11.124 ± 0.083
5.08AlaSer: 5.08 ± 0.051
6.437AlaThr: 6.437 ± 0.058
11.924AlaVal: 11.924 ± 0.091
1.876AlaTrp: 1.876 ± 0.028
2.581AlaTyr: 2.581 ± 0.034
0.0AlaXaa: 0.0 ± 0.0
Cys
1.068CysAla: 1.068 ± 0.019
0.105CysCys: 0.105 ± 0.006
0.522CysAsp: 0.522 ± 0.016
0.435CysGlu: 0.435 ± 0.013
0.224CysPhe: 0.224 ± 0.01
0.986CysGly: 0.986 ± 0.021
0.213CysHis: 0.213 ± 0.01
0.172CysIle: 0.172 ± 0.009
0.113CysLys: 0.113 ± 0.006
0.766CysLeu: 0.766 ± 0.019
0.136CysMet: 0.136 ± 0.007
0.138CysAsn: 0.138 ± 0.008
0.534CysPro: 0.534 ± 0.016
0.186CysGln: 0.186 ± 0.009
0.734CysArg: 0.734 ± 0.019
0.413CysSer: 0.413 ± 0.015
0.476CysThr: 0.476 ± 0.014
0.668CysVal: 0.668 ± 0.015
0.123CysTrp: 0.123 ± 0.008
0.18CysTyr: 0.18 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
7.004AspAla: 7.004 ± 0.057
0.41AspCys: 0.41 ± 0.013
3.544AspAsp: 3.544 ± 0.043
3.895AspGlu: 3.895 ± 0.044
1.547AspPhe: 1.547 ± 0.03
6.205AspGly: 6.205 ± 0.057
1.442AspHis: 1.442 ± 0.026
1.857AspIle: 1.857 ± 0.026
1.066AspLys: 1.066 ± 0.024
6.624AspLeu: 6.624 ± 0.06
0.901AspMet: 0.901 ± 0.021
0.855AspAsn: 0.855 ± 0.021
4.943AspPro: 4.943 ± 0.052
1.55AspGln: 1.55 ± 0.024
5.472AspArg: 5.472 ± 0.045
2.111AspSer: 2.111 ± 0.031
2.726AspThr: 2.726 ± 0.035
4.793AspVal: 4.793 ± 0.043
0.926AspTrp: 0.926 ± 0.02
1.141AspTyr: 1.141 ± 0.024
0.0AspXaa: 0.0 ± 0.0
Glu
7.206GluAla: 7.206 ± 0.061
0.392GluCys: 0.392 ± 0.012
2.897GluAsp: 2.897 ± 0.036
3.645GluGlu: 3.645 ± 0.045
1.521GluPhe: 1.521 ± 0.025
4.15GluGly: 4.15 ± 0.046
1.683GluHis: 1.683 ± 0.027
2.511GluIle: 2.511 ± 0.028
1.027GluLys: 1.027 ± 0.023
7.041GluLeu: 7.041 ± 0.064
0.958GluMet: 0.958 ± 0.019
0.891GluAsn: 0.891 ± 0.018
3.784GluPro: 3.784 ± 0.047
2.215GluGln: 2.215 ± 0.035
6.282GluArg: 6.282 ± 0.056
2.286GluSer: 2.286 ± 0.034
2.848GluThr: 2.848 ± 0.035
4.84GluVal: 4.84 ± 0.052
0.819GluTrp: 0.819 ± 0.021
1.125GluTyr: 1.125 ± 0.02
0.0GluXaa: 0.0 ± 0.0
Phe
3.569PheAla: 3.569 ± 0.041
0.293PheCys: 0.293 ± 0.012
1.877PheAsp: 1.877 ± 0.03
1.457PheGlu: 1.457 ± 0.028
0.828PhePhe: 0.828 ± 0.019
3.062PheGly: 3.062 ± 0.034
0.628PheHis: 0.628 ± 0.016
0.717PheIle: 0.717 ± 0.016
0.454PheLys: 0.454 ± 0.016
2.529PheLeu: 2.529 ± 0.037
0.399PheMet: 0.399 ± 0.013
0.555PheAsn: 0.555 ± 0.015
1.447PhePro: 1.447 ± 0.026
0.693PheGln: 0.693 ± 0.018
1.87PheArg: 1.87 ± 0.03
1.29PheSer: 1.29 ± 0.024
2.049PheThr: 2.049 ± 0.031
2.272PheVal: 2.272 ± 0.031
0.405PheTrp: 0.405 ± 0.012
0.547PheTyr: 0.547 ± 0.014
0.0PheXaa: 0.0 ± 0.0
Gly
10.56GlyAla: 10.56 ± 0.079
0.873GlyCys: 0.873 ± 0.02
5.384GlyAsp: 5.384 ± 0.043
5.304GlyGlu: 5.304 ± 0.048
2.855GlyPhe: 2.855 ± 0.037
9.336GlyGly: 9.336 ± 0.075
2.324GlyHis: 2.324 ± 0.036
3.273GlyIle: 3.273 ± 0.037
2.031GlyLys: 2.031 ± 0.034
9.932GlyLeu: 9.932 ± 0.071
2.076GlyMet: 2.076 ± 0.029
1.611GlyAsn: 1.611 ± 0.029
5.836GlyPro: 5.836 ± 0.06
2.774GlyGln: 2.774 ± 0.041
9.147GlyArg: 9.147 ± 0.074
4.774GlySer: 4.774 ± 0.055
5.823GlyThr: 5.823 ± 0.05
7.628GlyVal: 7.628 ± 0.056
1.791GlyTrp: 1.791 ± 0.026
2.324GlyTyr: 2.324 ± 0.033
0.0GlyXaa: 0.0 ± 0.0
His
2.607HisAla: 2.607 ± 0.037
0.215HisCys: 0.215 ± 0.008
1.342HisAsp: 1.342 ± 0.022
1.224HisGlu: 1.224 ± 0.025
0.594HisPhe: 0.594 ± 0.016
2.442HisGly: 2.442 ± 0.035
0.679HisHis: 0.679 ± 0.024
0.712HisIle: 0.712 ± 0.018
0.322HisLys: 0.322 ± 0.012
2.451HisLeu: 2.451 ± 0.032
0.37HisMet: 0.37 ± 0.013
0.361HisAsn: 0.361 ± 0.013
1.782HisPro: 1.782 ± 0.023
0.637HisGln: 0.637 ± 0.017
2.239HisArg: 2.239 ± 0.03
0.926HisSer: 0.926 ± 0.019
1.128HisThr: 1.128 ± 0.02
1.738HisVal: 1.738 ± 0.025
0.39HisTrp: 0.39 ± 0.013
0.507HisTyr: 0.507 ± 0.016
0.0HisXaa: 0.0 ± 0.0
Ile
5.001IleAla: 5.001 ± 0.046
0.331IleCys: 0.331 ± 0.011
2.511IleAsp: 2.511 ± 0.032
2.219IleGlu: 2.219 ± 0.031
0.765IlePhe: 0.765 ± 0.018
3.607IleGly: 3.607 ± 0.045
0.615IleHis: 0.615 ± 0.016
1.09IleIle: 1.09 ± 0.024
0.729IleLys: 0.729 ± 0.016
2.511IleLeu: 2.511 ± 0.031
0.525IleMet: 0.525 ± 0.015
0.737IleAsn: 0.737 ± 0.017
1.869IlePro: 1.869 ± 0.027
0.71IleGln: 0.71 ± 0.018
2.637IleArg: 2.637 ± 0.034
1.715IleSer: 1.715 ± 0.028
2.326IleThr: 2.326 ± 0.033
3.056IleVal: 3.056 ± 0.038
0.384IleTrp: 0.384 ± 0.011
0.591IleTyr: 0.591 ± 0.016
0.0IleXaa: 0.0 ± 0.0
Lys
2.448LysAla: 2.448 ± 0.039
0.11LysCys: 0.11 ± 0.007
1.014LysAsp: 1.014 ± 0.023
0.982LysGlu: 0.982 ± 0.021
0.39LysPhe: 0.39 ± 0.014
1.575LysGly: 1.575 ± 0.032
0.369LysHis: 0.369 ± 0.014
0.853LysIle: 0.853 ± 0.02
0.536LysLys: 0.536 ± 0.019
1.696LysLeu: 1.696 ± 0.031
0.345LysMet: 0.345 ± 0.012
0.417LysAsn: 0.417 ± 0.014
1.195LysPro: 1.195 ± 0.025
0.524LysGln: 0.524 ± 0.016
1.398LysArg: 1.398 ± 0.025
0.955LysSer: 0.955 ± 0.021
1.105LysThr: 1.105 ± 0.025
1.852LysVal: 1.852 ± 0.033
0.237LysTrp: 0.237 ± 0.009
0.4LysTyr: 0.4 ± 0.015
0.0LysXaa: 0.0 ± 0.0
Leu
15.735LeuAla: 15.735 ± 0.113
0.81LeuCys: 0.81 ± 0.019
6.549LeuAsp: 6.549 ± 0.052
5.172LeuGlu: 5.172 ± 0.049
2.516LeuPhe: 2.516 ± 0.036
9.438LeuGly: 9.438 ± 0.067
2.214LeuHis: 2.214 ± 0.035
3.359LeuIle: 3.359 ± 0.039
1.762LeuLys: 1.762 ± 0.03
10.949LeuLeu: 10.949 ± 0.089
1.693LeuMet: 1.693 ± 0.027
1.76LeuAsn: 1.76 ± 0.028
6.526LeuPro: 6.526 ± 0.056
2.222LeuGln: 2.222 ± 0.03
9.403LeuArg: 9.403 ± 0.067
5.025LeuSer: 5.025 ± 0.049
6.661LeuThr: 6.661 ± 0.059
8.884LeuVal: 8.884 ± 0.068
1.311LeuTrp: 1.311 ± 0.029
1.824LeuTyr: 1.824 ± 0.03
0.0LeuXaa: 0.0 ± 0.0
Met
2.501MetAla: 2.501 ± 0.032
0.138MetCys: 0.138 ± 0.008
0.908MetAsp: 0.908 ± 0.02
0.793MetGlu: 0.793 ± 0.018
0.528MetPhe: 0.528 ± 0.015
1.402MetGly: 1.402 ± 0.026
0.347MetHis: 0.347 ± 0.012
0.794MetIle: 0.794 ± 0.02
0.39MetLys: 0.39 ± 0.011
1.895MetLeu: 1.895 ± 0.026
0.314MetMet: 0.314 ± 0.012
0.464MetAsn: 0.464 ± 0.013
1.22MetPro: 1.22 ± 0.023
0.445MetGln: 0.445 ± 0.012
1.664MetArg: 1.664 ± 0.024
1.281MetSer: 1.281 ± 0.021
1.592MetThr: 1.592 ± 0.025
1.469MetVal: 1.469 ± 0.028
0.203MetTrp: 0.203 ± 0.009
0.289MetTyr: 0.289 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
2.202AsnAla: 2.202 ± 0.029
0.172AsnCys: 0.172 ± 0.008
0.913AsnAsp: 0.913 ± 0.02
0.86AsnGlu: 0.86 ± 0.02
0.438AsnPhe: 0.438 ± 0.016
1.844AsnGly: 1.844 ± 0.029
0.353AsnHis: 0.353 ± 0.012
0.598AsnIle: 0.598 ± 0.016
0.33AsnLys: 0.33 ± 0.012
1.657AsnLeu: 1.657 ± 0.026
0.288AsnMet: 0.288 ± 0.01
0.347AsnAsn: 0.347 ± 0.013
1.3AsnPro: 1.3 ± 0.021
0.419AsnGln: 0.419 ± 0.014
1.284AsnArg: 1.284 ± 0.026
0.736AsnSer: 0.736 ± 0.019
0.959AsnThr: 0.959 ± 0.024
1.427AsnVal: 1.427 ± 0.026
0.258AsnTrp: 0.258 ± 0.01
0.358AsnTyr: 0.358 ± 0.011
0.0AsnXaa: 0.0 ± 0.0
Pro
9.082ProAla: 9.082 ± 0.082
0.343ProCys: 0.343 ± 0.012
4.75ProAsp: 4.75 ± 0.047
4.312ProGlu: 4.312 ± 0.053
1.573ProPhe: 1.573 ± 0.025
7.303ProGly: 7.303 ± 0.061
1.389ProHis: 1.389 ± 0.025
1.655ProIle: 1.655 ± 0.025
1.125ProLys: 1.125 ± 0.027
5.364ProLeu: 5.364 ± 0.054
1.16ProMet: 1.16 ± 0.022
0.938ProAsn: 0.938 ± 0.02
4.491ProPro: 4.491 ± 0.071
1.96ProGln: 1.96 ± 0.036
4.427ProArg: 4.427 ± 0.049
3.205ProSer: 3.205 ± 0.045
2.96ProThr: 2.96 ± 0.04
5.475ProVal: 5.475 ± 0.054
0.966ProTrp: 0.966 ± 0.02
1.388ProTyr: 1.388 ± 0.023
0.0ProXaa: 0.0 ± 0.0
Gln
4.077GlnAla: 4.077 ± 0.051
0.177GlnCys: 0.177 ± 0.008
1.331GlnAsp: 1.331 ± 0.022
1.466GlnGlu: 1.466 ± 0.03
0.637GlnPhe: 0.637 ± 0.017
2.275GlnGly: 2.275 ± 0.035
0.573GlnHis: 0.573 ± 0.016
1.231GlnIle: 1.231 ± 0.024
0.432GlnLys: 0.432 ± 0.014
2.478GlnLeu: 2.478 ± 0.033
0.519GlnMet: 0.519 ± 0.015
0.468GlnAsn: 0.468 ± 0.012
1.644GlnPro: 1.644 ± 0.034
1.084GlnGln: 1.084 ± 0.031
2.508GlnArg: 2.508 ± 0.029
1.101GlnSer: 1.101 ± 0.023
1.392GlnThr: 1.392 ± 0.021
2.752GlnVal: 2.752 ± 0.035
0.456GlnTrp: 0.456 ± 0.015
0.501GlnTyr: 0.501 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
10.52ArgAla: 10.52 ± 0.081
0.667ArgCys: 0.667 ± 0.017
4.714ArgAsp: 4.714 ± 0.043
5.153ArgGlu: 5.153 ± 0.052
2.473ArgPhe: 2.473 ± 0.031
6.198ArgGly: 6.198 ± 0.053
2.369ArgHis: 2.369 ± 0.03
3.586ArgIle: 3.586 ± 0.043
1.572ArgLys: 1.572 ± 0.031
10.362ArgLeu: 10.362 ± 0.081
2.082ArgMet: 2.082 ± 0.026
1.451ArgAsn: 1.451 ± 0.023
6.08ArgPro: 6.08 ± 0.063
2.574ArgGln: 2.574 ± 0.033
9.433ArgArg: 9.433 ± 0.075
4.116ArgSer: 4.116 ± 0.047
5.21ArgThr: 5.21 ± 0.047
6.491ArgVal: 6.491 ± 0.055
1.593ArgTrp: 1.593 ± 0.025
2.036ArgTyr: 2.036 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
5.903SerAla: 5.903 ± 0.056
0.371SerCys: 0.371 ± 0.013
2.349SerAsp: 2.349 ± 0.03
2.118SerGlu: 2.118 ± 0.034
1.439SerPhe: 1.439 ± 0.023
5.526SerGly: 5.526 ± 0.059
0.91SerHis: 0.91 ± 0.017
1.487SerIle: 1.487 ± 0.025
0.844SerLys: 0.844 ± 0.023
4.409SerLeu: 4.409 ± 0.047
1.091SerMet: 1.091 ± 0.022
0.737SerAsn: 0.737 ± 0.018
3.244SerPro: 3.244 ± 0.039
1.163SerGln: 1.163 ± 0.023
3.737SerArg: 3.737 ± 0.04
2.34SerSer: 2.34 ± 0.038
2.502SerThr: 2.502 ± 0.035
3.681SerVal: 3.681 ± 0.045
0.852SerTrp: 0.852 ± 0.021
1.106SerTyr: 1.106 ± 0.024
0.0SerXaa: 0.0 ± 0.0
Thr
8.082ThrAla: 8.082 ± 0.059
0.448ThrCys: 0.448 ± 0.015
3.17ThrAsp: 3.17 ± 0.034
2.999ThrGlu: 2.999 ± 0.038
1.627ThrPhe: 1.627 ± 0.026
6.651ThrGly: 6.651 ± 0.057
1.006ThrHis: 1.006 ± 0.022
1.859ThrIle: 1.859 ± 0.031
0.971ThrLys: 0.971 ± 0.02
5.528ThrLeu: 5.528 ± 0.049
1.042ThrMet: 1.042 ± 0.021
0.845ThrAsn: 0.845 ± 0.021
3.837ThrPro: 3.837 ± 0.045
1.166ThrGln: 1.166 ± 0.027
4.203ThrArg: 4.203 ± 0.045
2.702ThrSer: 2.702 ± 0.038
3.326ThrThr: 3.326 ± 0.045
5.638ThrVal: 5.638 ± 0.051
0.919ThrTrp: 0.919 ± 0.021
1.197ThrTyr: 1.197 ± 0.026
0.0ThrXaa: 0.0 ± 0.0
Val
11.062ValAla: 11.062 ± 0.073
0.806ValCys: 0.806 ± 0.019
4.689ValAsp: 4.689 ± 0.048
4.776ValGlu: 4.776 ± 0.046
2.393ValPhe: 2.393 ± 0.039
6.543ValGly: 6.543 ± 0.048
2.002ValHis: 2.002 ± 0.032
3.211ValIle: 3.211 ± 0.042
1.52ValLys: 1.52 ± 0.024
9.51ValLeu: 9.51 ± 0.074
1.488ValMet: 1.488 ± 0.025
1.667ValAsn: 1.667 ± 0.032
5.508ValPro: 5.508 ± 0.054
2.101ValGln: 2.101 ± 0.035
7.634ValArg: 7.634 ± 0.062
4.002ValSer: 4.002 ± 0.045
5.615ValThr: 5.615 ± 0.056
7.976ValVal: 7.976 ± 0.076
1.123ValTrp: 1.123 ± 0.025
1.593ValTyr: 1.593 ± 0.026
0.0ValXaa: 0.0 ± 0.0
Trp
1.709TrpAla: 1.709 ± 0.029
0.159TrpCys: 0.159 ± 0.008
0.819TrpAsp: 0.819 ± 0.023
0.81TrpGlu: 0.81 ± 0.017
0.469TrpPhe: 0.469 ± 0.013
1.063TrpGly: 1.063 ± 0.019
0.392TrpHis: 0.392 ± 0.01
0.559TrpIle: 0.559 ± 0.015
0.328TrpLys: 0.328 ± 0.012
1.787TrpLeu: 1.787 ± 0.031
0.317TrpMet: 0.317 ± 0.01
0.37TrpAsn: 0.37 ± 0.013
0.89TrpPro: 0.89 ± 0.02
0.542TrpGln: 0.542 ± 0.014
1.587TrpArg: 1.587 ± 0.025
0.869TrpSer: 0.869 ± 0.024
0.992TrpThr: 0.992 ± 0.022
0.994TrpVal: 0.994 ± 0.021
0.361TrpTrp: 0.361 ± 0.012
0.344TrpTyr: 0.344 ± 0.011
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.608TyrAla: 2.608 ± 0.032
0.209TyrCys: 0.209 ± 0.009
1.382TyrAsp: 1.382 ± 0.031
1.217TyrGlu: 1.217 ± 0.024
0.624TyrPhe: 0.624 ± 0.015
2.296TyrGly: 2.296 ± 0.036
0.434TyrHis: 0.434 ± 0.014
0.497TyrIle: 0.497 ± 0.013
0.355TyrLys: 0.355 ± 0.013
2.224TyrLeu: 2.224 ± 0.03
0.264TyrMet: 0.264 ± 0.012
0.359TyrAsn: 0.359 ± 0.012
1.062TyrPro: 1.062 ± 0.023
0.605TyrGln: 0.605 ± 0.017
1.898TyrArg: 1.898 ± 0.028
0.882TyrSer: 0.882 ± 0.019
1.089TyrThr: 1.089 ± 0.022
1.659TyrVal: 1.659 ± 0.029
0.368TyrTrp: 0.368 ± 0.014
0.472TyrTyr: 0.472 ± 0.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.001XaaXaa: 0.001 ± 0.001
Statistics based on 7946 proteins (2513009 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
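A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread is summarised with the sample standard deviation (the page does not state which spread measure is used). The names `pair_counts` and `bootstrap_permille` and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_permille(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]  # count once, reuse below
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)

toy = ["AAAG", "GAAL", "MKAAV", "LLGA"]  # made-up sequences, not real proteins
m, s = bootstrap_permille(toy, "AA")
print(f"AA: {m:.1f} ± {s:.1f} per mille")
```

Resampling at the protein level, rather than resampling individual dipeptides, keeps the residues of each protein together, so the estimated error reflects variation between proteins.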

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski