Amino acid dipeptide frequency for Streptomyces sp. WAC06614

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
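As an illustration of the reasoning above, here is a minimal Python sketch of how dipeptide frequencies could be counted and compared with the uniform expectation of 1/400. It is not the code used to build this page. The function names, the use of overlapping residue pairs, and the mapping of every non-standard letter to X (Xaa) are assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter codes
UNIFORM_PER_MILLE = 1000 / 400      # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: any letter outside the 20 standard codes is counted as X (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs never span two proteins: each sequence is scanned separately.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(per_mille):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if per_mille > UNIFORM_PER_MILLE:
        return "over"
    if per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Toy input, not real proteome data.
toy = ["AAAL", "ALGA"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With real data, the sequences would come from the proteome FASTA file, and the "over" and "under" labels would correspond to the red and blue marking described above.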

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
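To make the unit conversion concrete, the short sketch below converts two values taken from the table (AlaAla and CysCys) from per mille to fractions and expresses them relative to the uniform expectation of 1/400. The helper names are hypothetical and chosen only for this example.

```python
UNIFORM_FRACTION = 1 / 400  # 0.0025, i.e. 0.25% or 2.5 per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value_per_mille):
    """Ratio of an observed frequency to the uniform expectation of 1/400."""
    return per_mille_to_fraction(value_per_mille) / UNIFORM_FRACTION


ala_ala = per_mille_to_fraction(24.007)  # AlaAla, from the table
cys_cys = per_mille_to_fraction(0.101)   # CysCys, from the table

print(f"AlaAla: fraction {ala_ala:.6f}, {fold_vs_uniform(24.007):.1f}x uniform")
print(f"CysCys: fraction {cys_cys:.6f}, {fold_vs_uniform(0.101):.2f}x uniform")
```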

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Within each group the second residue follows the order: Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 24.007 ± 0.179
AlaCys: 1.104 ± 0.021
AlaAsp: 8.392 ± 0.069
AlaGlu: 9.121 ± 0.097
AlaPhe: 3.623 ± 0.042
AlaGly: 14.538 ± 0.097
AlaHis: 3.041 ± 0.048
AlaIle: 2.88 ± 0.048
AlaLys: 2.709 ± 0.049
AlaLeu: 15.498 ± 0.115
AlaMet: 2.431 ± 0.037
AlaAsn: 1.728 ± 0.031
AlaPro: 7.935 ± 0.078
AlaGln: 3.937 ± 0.048
AlaArg: 11.006 ± 0.092
AlaSer: 5.521 ± 0.054
AlaThr: 7.233 ± 0.063
AlaVal: 12.939 ± 0.099
AlaTrp: 1.908 ± 0.029
AlaTyr: 2.869 ± 0.038
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.197 ± 0.027
CysCys: 0.101 ± 0.007
CysAsp: 0.452 ± 0.014
CysGlu: 0.402 ± 0.014
CysPhe: 0.224 ± 0.01
CysGly: 0.972 ± 0.019
CysHis: 0.204 ± 0.011
CysIle: 0.19 ± 0.01
CysLys: 0.12 ± 0.007
CysLeu: 0.774 ± 0.02
CysMet: 0.129 ± 0.008
CysAsn: 0.15 ± 0.009
CysPro: 0.515 ± 0.019
CysGln: 0.169 ± 0.01
CysArg: 0.639 ± 0.018
CysSer: 0.4 ± 0.016
CysThr: 0.579 ± 0.019
CysVal: 0.655 ± 0.018
CysTrp: 0.136 ± 0.007
CysTyr: 0.146 ± 0.008
CysXaa: 0.0 ± 0.0

Asp
AspAla: 7.462 ± 0.063
AspCys: 0.432 ± 0.017
AspAsp: 3.031 ± 0.039
AspGlu: 3.555 ± 0.048
AspPhe: 1.527 ± 0.029
AspGly: 6.358 ± 0.067
AspHis: 1.445 ± 0.027
AspIle: 1.616 ± 0.03
AspLys: 1.159 ± 0.029
AspLeu: 6.226 ± 0.055
AspMet: 0.744 ± 0.019
AspAsn: 0.863 ± 0.021
AspPro: 4.789 ± 0.057
AspGln: 1.503 ± 0.028
AspArg: 5.225 ± 0.056
AspSer: 2.165 ± 0.03
AspThr: 3.11 ± 0.035
AspVal: 4.504 ± 0.054
AspTrp: 0.968 ± 0.023
AspTyr: 1.023 ± 0.024
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.671 ± 0.085
GluCys: 0.372 ± 0.012
GluAsp: 2.895 ± 0.042
GluGlu: 3.55 ± 0.049
GluPhe: 1.436 ± 0.026
GluGly: 4.314 ± 0.046
GluHis: 1.462 ± 0.03
GluIle: 2.148 ± 0.03
GluLys: 1.263 ± 0.032
GluLeu: 6.889 ± 0.069
GluMet: 0.828 ± 0.021
GluAsn: 0.928 ± 0.022
GluPro: 3.321 ± 0.038
GluGln: 2.206 ± 0.037
GluArg: 5.324 ± 0.055
GluSer: 2.208 ± 0.032
GluThr: 2.54 ± 0.037
GluVal: 4.63 ± 0.048
GluTrp: 0.699 ± 0.021
GluTyr: 1.041 ± 0.022
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.705 ± 0.048
PheCys: 0.27 ± 0.012
PheAsp: 1.871 ± 0.031
PheGlu: 1.399 ± 0.026
PhePhe: 0.883 ± 0.02
PheGly: 2.97 ± 0.04
PheHis: 0.641 ± 0.018
PheIle: 0.626 ± 0.016
PheLys: 0.494 ± 0.015
PheLeu: 2.633 ± 0.04
PheMet: 0.389 ± 0.012
PheAsn: 0.52 ± 0.017
PhePro: 1.413 ± 0.024
PheGln: 0.634 ± 0.018
PheArg: 1.867 ± 0.028
PheSer: 1.384 ± 0.025
PheThr: 2.028 ± 0.031
PheVal: 2.227 ± 0.032
PheTrp: 0.406 ± 0.015
PheTyr: 0.551 ± 0.017
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 12.006 ± 0.087
GlyCys: 0.867 ± 0.021
GlyAsp: 4.934 ± 0.054
GlyGlu: 4.948 ± 0.05
GlyPhe: 2.865 ± 0.035
GlyGly: 9.469 ± 0.1
GlyHis: 2.428 ± 0.034
GlyIle: 3.324 ± 0.039
GlyLys: 2.338 ± 0.041
GlyLeu: 9.73 ± 0.082
GlyMet: 1.931 ± 0.029
GlyAsn: 1.686 ± 0.032
GlyPro: 6.237 ± 0.069
GlyGln: 2.606 ± 0.04
GlyArg: 8.514 ± 0.074
GlySer: 5.342 ± 0.057
GlyThr: 6.658 ± 0.066
GlyVal: 7.465 ± 0.059
GlyTrp: 1.709 ± 0.027
GlyTyr: 2.216 ± 0.036
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.819 ± 0.046
HisCys: 0.228 ± 0.01
HisAsp: 1.304 ± 0.026
HisGlu: 1.248 ± 0.028
HisPhe: 0.649 ± 0.017
HisGly: 2.544 ± 0.032
HisHis: 0.746 ± 0.02
HisIle: 0.587 ± 0.018
HisLys: 0.349 ± 0.014
HisLeu: 2.553 ± 0.033
HisMet: 0.345 ± 0.012
HisAsn: 0.342 ± 0.013
HisPro: 1.934 ± 0.034
HisGln: 0.677 ± 0.019
HisArg: 2.248 ± 0.034
HisSer: 0.947 ± 0.021
HisThr: 1.347 ± 0.025
HisVal: 1.731 ± 0.029
HisTrp: 0.385 ± 0.013
HisTyr: 0.461 ± 0.016
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.312 ± 0.058
IleCys: 0.285 ± 0.01
IleAsp: 1.961 ± 0.033
IleGlu: 1.787 ± 0.026
IlePhe: 0.641 ± 0.018
IleGly: 3.316 ± 0.047
IleHis: 0.559 ± 0.016
IleIle: 0.716 ± 0.023
IleLys: 0.698 ± 0.02
IleLeu: 2.134 ± 0.033
IleMet: 0.397 ± 0.014
IleAsn: 0.641 ± 0.018
IlePro: 1.546 ± 0.03
IleGln: 0.657 ± 0.018
IleArg: 2.063 ± 0.033
IleSer: 1.465 ± 0.025
IleThr: 2.028 ± 0.034
IleVal: 2.441 ± 0.037
IleTrp: 0.321 ± 0.012
IleTyr: 0.468 ± 0.015
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.866 ± 0.046
LysCys: 0.111 ± 0.008
LysAsp: 1.313 ± 0.032
LysGlu: 1.174 ± 0.025
LysPhe: 0.465 ± 0.017
LysGly: 1.762 ± 0.031
LysHis: 0.386 ± 0.013
LysIle: 0.833 ± 0.021
LysLys: 0.772 ± 0.03
LysLeu: 1.973 ± 0.037
LysMet: 0.345 ± 0.011
LysAsn: 0.489 ± 0.017
LysPro: 1.242 ± 0.032
LysGln: 0.645 ± 0.02
LysArg: 1.26 ± 0.028
LysSer: 0.961 ± 0.025
LysThr: 1.124 ± 0.025
LysVal: 1.847 ± 0.032
LysTrp: 0.232 ± 0.011
LysTyr: 0.41 ± 0.015
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 16.032 ± 0.115
LeuCys: 0.92 ± 0.021
LeuAsp: 6.719 ± 0.06
LeuGlu: 4.644 ± 0.046
LeuPhe: 2.61 ± 0.038
LeuGly: 9.669 ± 0.078
LeuHis: 2.437 ± 0.034
LeuIle: 2.851 ± 0.035
LeuLys: 1.869 ± 0.032
LeuLeu: 11.733 ± 0.107
LeuMet: 1.567 ± 0.028
LeuAsn: 1.527 ± 0.033
LeuPro: 6.881 ± 0.064
LeuGln: 2.258 ± 0.032
LeuArg: 8.774 ± 0.078
LeuSer: 5.007 ± 0.051
LeuThr: 7.046 ± 0.056
LeuVal: 9.199 ± 0.069
LeuTrp: 1.367 ± 0.027
LeuTyr: 1.871 ± 0.029
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.242 ± 0.034
MetCys: 0.126 ± 0.006
MetAsp: 0.89 ± 0.021
MetGlu: 0.727 ± 0.017
MetPhe: 0.443 ± 0.014
MetGly: 1.302 ± 0.029
MetHis: 0.334 ± 0.012
MetIle: 0.616 ± 0.019
MetLys: 0.383 ± 0.013
MetLeu: 1.648 ± 0.029
MetMet: 0.271 ± 0.012
MetAsn: 0.429 ± 0.017
MetPro: 1.069 ± 0.02
MetGln: 0.429 ± 0.014
MetArg: 1.34 ± 0.025
MetSer: 1.206 ± 0.021
MetThr: 1.463 ± 0.029
MetVal: 1.203 ± 0.023
MetTrp: 0.199 ± 0.008
MetTyr: 0.289 ± 0.012
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.054 ± 0.032
AsnCys: 0.16 ± 0.009
AsnAsp: 0.846 ± 0.02
AsnGlu: 0.75 ± 0.021
AsnPhe: 0.445 ± 0.014
AsnGly: 1.736 ± 0.034
AsnHis: 0.375 ± 0.016
AsnIle: 0.573 ± 0.02
AsnLys: 0.367 ± 0.016
AsnLeu: 1.575 ± 0.028
AsnMet: 0.277 ± 0.011
AsnAsn: 0.38 ± 0.013
AsnPro: 1.294 ± 0.03
AsnGln: 0.492 ± 0.015
AsnArg: 1.171 ± 0.024
AsnSer: 0.787 ± 0.019
AsnThr: 1.042 ± 0.022
AsnVal: 1.288 ± 0.028
AsnTrp: 0.265 ± 0.011
AsnTyr: 0.349 ± 0.014
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 10.148 ± 0.105
ProCys: 0.353 ± 0.013
ProAsp: 4.603 ± 0.048
ProGlu: 4.391 ± 0.049
ProPhe: 1.555 ± 0.026
ProGly: 7.59 ± 0.067
ProHis: 1.487 ± 0.029
ProIle: 1.056 ± 0.025
ProLys: 1.09 ± 0.023
ProLeu: 5.583 ± 0.059
ProMet: 1.023 ± 0.022
ProAsn: 0.834 ± 0.022
ProPro: 3.77 ± 0.059
ProGln: 1.936 ± 0.035
ProArg: 4.223 ± 0.046
ProSer: 2.984 ± 0.041
ProThr: 3.246 ± 0.042
ProVal: 5.868 ± 0.062
ProTrp: 0.932 ± 0.022
ProTyr: 1.538 ± 0.031
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.857 ± 0.049
GlnCys: 0.178 ± 0.01
GlnAsp: 1.471 ± 0.025
GlnGlu: 1.571 ± 0.029
GlnPhe: 0.65 ± 0.019
GlnGly: 2.419 ± 0.032
GlnHis: 0.649 ± 0.016
GlnIle: 1.021 ± 0.022
GlnLys: 0.589 ± 0.02
GlnLeu: 2.993 ± 0.04
GlnMet: 0.471 ± 0.013
GlnAsn: 0.49 ± 0.016
GlnPro: 1.632 ± 0.027
GlnGln: 1.296 ± 0.032
GlnArg: 2.276 ± 0.031
GlnSer: 1.104 ± 0.025
GlnThr: 1.268 ± 0.026
GlnVal: 2.477 ± 0.037
GlnTrp: 0.463 ± 0.017
GlnTyr: 0.582 ± 0.018
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 10.594 ± 0.091
ArgCys: 0.625 ± 0.018
ArgAsp: 4.039 ± 0.045
ArgGlu: 4.605 ± 0.054
ArgPhe: 2.316 ± 0.03
ArgGly: 5.889 ± 0.056
ArgHis: 2.188 ± 0.035
ArgIle: 3.344 ± 0.039
ArgLys: 1.57 ± 0.028
ArgLeu: 8.905 ± 0.086
ArgMet: 1.746 ± 0.031
ArgAsn: 1.282 ± 0.029
ArgPro: 5.668 ± 0.066
ArgGln: 2.206 ± 0.037
ArgArg: 7.862 ± 0.088
ArgSer: 4.109 ± 0.043
ArgThr: 6.013 ± 0.06
ArgVal: 5.559 ± 0.06
ArgTrp: 1.389 ± 0.029
ArgTyr: 1.74 ± 0.031
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.411 ± 0.071
SerCys: 0.445 ± 0.014
SerAsp: 2.37 ± 0.034
SerGlu: 2.07 ± 0.032
SerPhe: 1.5 ± 0.025
SerGly: 5.59 ± 0.062
SerHis: 0.948 ± 0.02
SerIle: 1.234 ± 0.025
SerLys: 0.905 ± 0.023
SerLeu: 4.57 ± 0.048
SerMet: 0.949 ± 0.022
SerAsn: 0.754 ± 0.021
SerPro: 3.061 ± 0.044
SerGln: 1.133 ± 0.024
SerArg: 3.47 ± 0.038
SerSer: 2.553 ± 0.038
SerThr: 2.818 ± 0.041
SerVal: 4.011 ± 0.036
SerTrp: 0.875 ± 0.02
SerTyr: 1.142 ± 0.024
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 9.372 ± 0.068
ThrCys: 0.465 ± 0.014
ThrAsp: 3.585 ± 0.044
ThrGlu: 3.178 ± 0.039
ThrPhe: 1.622 ± 0.034
ThrGly: 7.021 ± 0.061
ThrHis: 1.212 ± 0.025
ThrIle: 1.426 ± 0.033
ThrLys: 1.08 ± 0.031
ThrLeu: 5.793 ± 0.057
ThrMet: 0.896 ± 0.02
ThrAsn: 0.91 ± 0.02
ThrPro: 4.255 ± 0.046
ThrGln: 1.309 ± 0.028
ThrArg: 3.917 ± 0.044
ThrSer: 2.895 ± 0.039
ThrThr: 3.841 ± 0.053
ThrVal: 6.187 ± 0.056
ThrTrp: 0.934 ± 0.022
ThrTyr: 1.375 ± 0.027
ThrXaa: 0.0 ± 0.0

Val
ValAla: 11.074 ± 0.086
ValCys: 0.779 ± 0.019
ValAsp: 4.82 ± 0.048
ValGlu: 4.613 ± 0.047
ValPhe: 2.327 ± 0.036
ValGly: 6.471 ± 0.053
ValHis: 2.134 ± 0.035
ValIle: 2.489 ± 0.041
ValLys: 1.733 ± 0.032
ValLeu: 9.91 ± 0.084
ValMet: 1.358 ± 0.027
ValAsn: 1.542 ± 0.025
ValPro: 5.76 ± 0.058
ValGln: 2.179 ± 0.034
ValArg: 7.386 ± 0.065
ValSer: 4.007 ± 0.045
ValThr: 5.71 ± 0.054
ValVal: 8.24 ± 0.075
ValTrp: 1.122 ± 0.022
ValTyr: 1.515 ± 0.024
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.706 ± 0.025
TrpCys: 0.167 ± 0.008
TrpAsp: 0.781 ± 0.02
TrpGlu: 0.716 ± 0.019
TrpPhe: 0.501 ± 0.015
TrpGly: 1.039 ± 0.025
TrpHis: 0.376 ± 0.014
TrpIle: 0.521 ± 0.014
TrpLys: 0.347 ± 0.013
TrpLeu: 1.775 ± 0.034
TrpMet: 0.277 ± 0.01
TrpAsn: 0.391 ± 0.014
TrpPro: 0.803 ± 0.021
TrpGln: 0.62 ± 0.018
TrpArg: 1.286 ± 0.025
TrpSer: 0.911 ± 0.022
TrpThr: 1.077 ± 0.023
TrpVal: 0.995 ± 0.027
TrpTrp: 0.333 ± 0.012
TrpTyr: 0.361 ± 0.013
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.887 ± 0.042
TyrCys: 0.168 ± 0.009
TyrAsp: 1.429 ± 0.028
TyrGlu: 1.273 ± 0.023
TyrPhe: 0.628 ± 0.018
TyrGly: 2.307 ± 0.035
TyrHis: 0.391 ± 0.015
TyrIle: 0.384 ± 0.012
TyrLys: 0.375 ± 0.015
TyrLeu: 2.086 ± 0.036
TyrMet: 0.233 ± 0.012
TyrAsn: 0.34 ± 0.014
TyrPro: 1.083 ± 0.024
TyrGln: 0.568 ± 0.017
TyrArg: 1.797 ± 0.027
TyrSer: 0.856 ± 0.021
TyrThr: 1.101 ± 0.023
TyrVal: 1.701 ± 0.026
TyrTrp: 0.338 ± 0.013
TyrTyr: 0.415 ± 0.014
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6875 proteins (2221611 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
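For readers unfamiliar with the technique, the sketch below shows one way a protein-level bootstrap could be set up: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is an illustration only. The function names are hypothetical, and it is an assumption here that the reported error corresponds to the standard deviation of the bootstrap estimates.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_per_mille(sequences, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of one dipeptide's frequency in per mille.

    Resampling is done at the protein level: each resample draws
    len(sequences) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]  # counted once, reused
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy input, not real proteome data.
toy = ["AAAL", "ALGA", "GAAV", "LLAA"]
freq, err = bootstrap_per_mille(toy, "AA")
print(f"AA: {freq:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps all pairs from one protein together, so the estimate reflects variation between proteins.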

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski