Amino acid dipepetide frequency for Streptomyces olivochromogenes

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.45AlaAla: 20.45 ± 0.117
1.055AlaCys: 1.055 ± 0.017
7.948AlaAsp: 7.948 ± 0.05
8.312AlaGlu: 8.312 ± 0.071
3.528AlaPhe: 3.528 ± 0.035
12.249AlaGly: 12.249 ± 0.067
2.973AlaHis: 2.973 ± 0.031
3.563AlaIle: 3.563 ± 0.037
2.976AlaLys: 2.976 ± 0.039
14.364AlaLeu: 14.364 ± 0.095
2.468AlaMet: 2.468 ± 0.026
2.041AlaAsn: 2.041 ± 0.03
6.929AlaPro: 6.929 ± 0.053
3.932AlaGln: 3.932 ± 0.039
9.927AlaArg: 9.927 ± 0.069
6.308AlaSer: 6.308 ± 0.047
7.358AlaThr: 7.358 ± 0.059
11.962AlaVal: 11.962 ± 0.085
1.894AlaTrp: 1.894 ± 0.027
2.829AlaTyr: 2.829 ± 0.032
0.0AlaXaa: 0.0 ± 0.0
Cys
1.063CysAla: 1.063 ± 0.021
0.094CysCys: 0.094 ± 0.006
0.469CysAsp: 0.469 ± 0.013
0.399CysGlu: 0.399 ± 0.012
0.233CysPhe: 0.233 ± 0.009
0.964CysGly: 0.964 ± 0.02
0.194CysHis: 0.194 ± 0.008
0.163CysIle: 0.163 ± 0.008
0.113CysLys: 0.113 ± 0.006
0.76CysLeu: 0.76 ± 0.016
0.121CysMet: 0.121 ± 0.006
0.132CysAsn: 0.132 ± 0.007
0.496CysPro: 0.496 ± 0.014
0.182CysGln: 0.182 ± 0.008
0.602CysArg: 0.602 ± 0.016
0.457CysSer: 0.457 ± 0.012
0.536CysThr: 0.536 ± 0.013
0.674CysVal: 0.674 ± 0.014
0.13CysTrp: 0.13 ± 0.006
0.155CysTyr: 0.155 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
7.385AspAla: 7.385 ± 0.052
0.414AspCys: 0.414 ± 0.012
3.482AspAsp: 3.482 ± 0.036
3.711AspGlu: 3.711 ± 0.041
1.677AspPhe: 1.677 ± 0.026
6.141AspGly: 6.141 ± 0.052
1.477AspHis: 1.477 ± 0.025
1.952AspIle: 1.952 ± 0.025
1.23AspLys: 1.23 ± 0.022
6.169AspLeu: 6.169 ± 0.048
0.839AspMet: 0.839 ± 0.016
1.037AspAsn: 1.037 ± 0.02
4.394AspPro: 4.394 ± 0.038
1.663AspGln: 1.663 ± 0.025
4.698AspArg: 4.698 ± 0.041
2.7AspSer: 2.7 ± 0.028
3.303AspThr: 3.303 ± 0.034
4.62AspVal: 4.62 ± 0.039
1.01AspTrp: 1.01 ± 0.017
1.128AspTyr: 1.128 ± 0.02
0.0AspXaa: 0.0 ± 0.0
Glu
7.001GluAla: 7.001 ± 0.065
0.356GluCys: 0.356 ± 0.01
2.681GluAsp: 2.681 ± 0.029
3.339GluGlu: 3.339 ± 0.043
1.431GluPhe: 1.431 ± 0.022
4.192GluGly: 4.192 ± 0.036
1.536GluHis: 1.536 ± 0.021
2.211GluIle: 2.211 ± 0.024
1.43GluLys: 1.43 ± 0.025
6.648GluLeu: 6.648 ± 0.061
0.834GluMet: 0.834 ± 0.017
1.036GluAsn: 1.036 ± 0.019
3.275GluPro: 3.275 ± 0.035
2.181GluGln: 2.181 ± 0.031
5.251GluArg: 5.251 ± 0.045
2.615GluSer: 2.615 ± 0.03
2.879GluThr: 2.879 ± 0.027
4.241GluVal: 4.241 ± 0.039
0.769GluTrp: 0.769 ± 0.015
1.098GluTyr: 1.098 ± 0.016
0.0GluXaa: 0.0 ± 0.0
Phe
3.651PheAla: 3.651 ± 0.034
0.268PheCys: 0.268 ± 0.009
1.911PheAsp: 1.911 ± 0.024
1.401PheGlu: 1.401 ± 0.023
0.862PhePhe: 0.862 ± 0.017
3.01PheGly: 3.01 ± 0.035
0.634PheHis: 0.634 ± 0.014
0.749PheIle: 0.749 ± 0.017
0.558PheLys: 0.558 ± 0.013
2.58PheLeu: 2.58 ± 0.03
0.415PheMet: 0.415 ± 0.012
0.597PheAsn: 0.597 ± 0.013
1.437PhePro: 1.437 ± 0.023
0.711PheGln: 0.711 ± 0.015
1.813PheArg: 1.813 ± 0.023
1.51PheSer: 1.51 ± 0.019
2.064PheThr: 2.064 ± 0.025
2.256PheVal: 2.256 ± 0.028
0.433PheTrp: 0.433 ± 0.012
0.591PheTyr: 0.591 ± 0.015
0.0PheXaa: 0.0 ± 0.0
Gly
10.778GlyAla: 10.778 ± 0.071
0.838GlyCys: 0.838 ± 0.018
5.016GlyAsp: 5.016 ± 0.042
4.819GlyGlu: 4.819 ± 0.041
2.876GlyPhe: 2.876 ± 0.024
8.711GlyGly: 8.711 ± 0.087
2.334GlyHis: 2.334 ± 0.028
3.55GlyIle: 3.55 ± 0.037
2.471GlyLys: 2.471 ± 0.036
9.287GlyLeu: 9.287 ± 0.064
1.931GlyMet: 1.931 ± 0.028
1.847GlyAsn: 1.847 ± 0.034
5.103GlyPro: 5.103 ± 0.046
2.738GlyGln: 2.738 ± 0.036
7.491GlyArg: 7.491 ± 0.048
5.611GlySer: 5.611 ± 0.048
6.454GlyThr: 6.454 ± 0.054
7.385GlyVal: 7.385 ± 0.05
1.716GlyTrp: 1.716 ± 0.026
2.266GlyTyr: 2.266 ± 0.031
0.0GlyXaa: 0.0 ± 0.0
His
2.778HisAla: 2.778 ± 0.029
0.217HisCys: 0.217 ± 0.008
1.409HisAsp: 1.409 ± 0.02
1.246HisGlu: 1.246 ± 0.02
0.683HisPhe: 0.683 ± 0.012
2.499HisGly: 2.499 ± 0.029
0.765HisHis: 0.765 ± 0.017
0.719HisIle: 0.719 ± 0.015
0.387HisLys: 0.387 ± 0.011
2.459HisLeu: 2.459 ± 0.03
0.364HisMet: 0.364 ± 0.011
0.404HisAsn: 0.404 ± 0.011
1.865HisPro: 1.865 ± 0.026
0.713HisGln: 0.713 ± 0.017
2.145HisArg: 2.145 ± 0.027
1.088HisSer: 1.088 ± 0.02
1.455HisThr: 1.455 ± 0.021
1.706HisVal: 1.706 ± 0.025
0.408HisTrp: 0.408 ± 0.01
0.497HisTyr: 0.497 ± 0.012
0.0HisXaa: 0.0 ± 0.0
Ile
4.855IleAla: 4.855 ± 0.037
0.295IleCys: 0.295 ± 0.009
2.279IleAsp: 2.279 ± 0.025
1.96IleGlu: 1.96 ± 0.027
0.709IlePhe: 0.709 ± 0.015
3.569IleGly: 3.569 ± 0.038
0.672IleHis: 0.672 ± 0.014
0.894IleIle: 0.894 ± 0.017
0.76IleLys: 0.76 ± 0.017
2.474IleLeu: 2.474 ± 0.03
0.451IleMet: 0.451 ± 0.012
0.747IleAsn: 0.747 ± 0.015
1.892IlePro: 1.892 ± 0.025
0.8IleGln: 0.8 ± 0.015
2.323IleArg: 2.323 ± 0.03
1.746IleSer: 1.746 ± 0.025
2.365IleThr: 2.365 ± 0.026
2.749IleVal: 2.749 ± 0.03
0.399IleTrp: 0.399 ± 0.01
0.548IleTyr: 0.548 ± 0.014
0.0IleXaa: 0.0 ± 0.0
Lys
3.081LysAla: 3.081 ± 0.041
0.125LysCys: 0.125 ± 0.007
1.408LysAsp: 1.408 ± 0.025
1.179LysGlu: 1.179 ± 0.021
0.485LysPhe: 0.485 ± 0.014
1.908LysGly: 1.908 ± 0.029
0.464LysHis: 0.464 ± 0.012
0.907LysIle: 0.907 ± 0.017
0.921LysLys: 0.921 ± 0.024
2.072LysLeu: 2.072 ± 0.032
0.392LysMet: 0.392 ± 0.011
0.601LysAsn: 0.601 ± 0.017
1.395LysPro: 1.395 ± 0.024
0.725LysGln: 0.725 ± 0.017
1.477LysArg: 1.477 ± 0.025
1.292LysSer: 1.292 ± 0.025
1.452LysThr: 1.452 ± 0.025
1.963LysVal: 1.963 ± 0.026
0.307LysTrp: 0.307 ± 0.011
0.478LysTyr: 0.478 ± 0.015
0.0LysXaa: 0.0 ± 0.0
Leu
14.673LeuAla: 14.673 ± 0.092
0.86LeuCys: 0.86 ± 0.013
6.601LeuAsp: 6.601 ± 0.048
4.542LeuGlu: 4.542 ± 0.045
2.62LeuPhe: 2.62 ± 0.033
9.165LeuGly: 9.165 ± 0.075
2.387LeuHis: 2.387 ± 0.032
3.376LeuIle: 3.376 ± 0.035
2.143LeuLys: 2.143 ± 0.03
11.064LeuLeu: 11.064 ± 0.084
1.634LeuMet: 1.634 ± 0.025
1.775LeuAsn: 1.775 ± 0.027
6.45LeuPro: 6.45 ± 0.05
2.282LeuGln: 2.282 ± 0.03
8.518LeuArg: 8.518 ± 0.062
5.569LeuSer: 5.569 ± 0.044
6.972LeuThr: 6.972 ± 0.049
8.717LeuVal: 8.717 ± 0.058
1.342LeuTrp: 1.342 ± 0.024
1.885LeuTyr: 1.885 ± 0.025
0.0LeuXaa: 0.0 ± 0.0
Met
2.241MetAla: 2.241 ± 0.032
0.133MetCys: 0.133 ± 0.007
0.884MetAsp: 0.884 ± 0.016
0.74MetGlu: 0.74 ± 0.016
0.429MetPhe: 0.429 ± 0.012
1.325MetGly: 1.325 ± 0.024
0.359MetHis: 0.359 ± 0.012
0.651MetIle: 0.651 ± 0.015
0.453MetLys: 0.453 ± 0.013
1.64MetLeu: 1.64 ± 0.025
0.296MetMet: 0.296 ± 0.01
0.48MetAsn: 0.48 ± 0.012
1.132MetPro: 1.132 ± 0.018
0.433MetGln: 0.433 ± 0.012
1.43MetArg: 1.43 ± 0.02
1.363MetSer: 1.363 ± 0.02
1.611MetThr: 1.611 ± 0.022
1.237MetVal: 1.237 ± 0.021
0.221MetTrp: 0.221 ± 0.008
0.33MetTyr: 0.33 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
2.324AsnAla: 2.324 ± 0.027
0.164AsnCys: 0.164 ± 0.007
1.02AsnAsp: 1.02 ± 0.02
0.808AsnGlu: 0.808 ± 0.016
0.496AsnPhe: 0.496 ± 0.012
2.036AsnGly: 2.036 ± 0.031
0.435AsnHis: 0.435 ± 0.011
0.692AsnIle: 0.692 ± 0.015
0.45AsnLys: 0.45 ± 0.012
1.718AsnLeu: 1.718 ± 0.022
0.306AsnMet: 0.306 ± 0.01
0.481AsnAsn: 0.481 ± 0.015
1.419AsnPro: 1.419 ± 0.021
0.564AsnGln: 0.564 ± 0.015
1.338AsnArg: 1.338 ± 0.022
1.063AsnSer: 1.063 ± 0.022
1.239AsnThr: 1.239 ± 0.025
1.417AsnVal: 1.417 ± 0.025
0.317AsnTrp: 0.317 ± 0.011
0.443AsnTyr: 0.443 ± 0.013
0.0AsnXaa: 0.0 ± 0.0
Pro
8.253ProAla: 8.253 ± 0.062
0.363ProCys: 0.363 ± 0.011
4.416ProAsp: 4.416 ± 0.039
4.162ProGlu: 4.162 ± 0.041
1.534ProPhe: 1.534 ± 0.019
6.381ProGly: 6.381 ± 0.051
1.468ProHis: 1.468 ± 0.027
1.366ProIle: 1.366 ± 0.022
1.267ProLys: 1.267 ± 0.019
5.447ProLeu: 5.447 ± 0.048
1.017ProMet: 1.017 ± 0.017
0.955ProAsn: 0.955 ± 0.019
3.539ProPro: 3.539 ± 0.052
1.819ProGln: 1.819 ± 0.03
4.078ProArg: 4.078 ± 0.042
3.533ProSer: 3.533 ± 0.046
3.468ProThr: 3.468 ± 0.035
5.429ProVal: 5.429 ± 0.045
0.935ProTrp: 0.935 ± 0.018
1.431ProTyr: 1.431 ± 0.022
0.0ProXaa: 0.0 ± 0.0
Gln
3.736GlnAla: 3.736 ± 0.039
0.188GlnCys: 0.188 ± 0.007
1.474GlnAsp: 1.474 ± 0.021
1.49GlnGlu: 1.49 ± 0.021
0.729GlnPhe: 0.729 ± 0.017
2.424GlnGly: 2.424 ± 0.032
0.729GlnHis: 0.729 ± 0.015
1.125GlnIle: 1.125 ± 0.022
0.66GlnLys: 0.66 ± 0.014
3.143GlnLeu: 3.143 ± 0.032
0.509GlnMet: 0.509 ± 0.013
0.578GlnAsn: 0.578 ± 0.016
1.759GlnPro: 1.759 ± 0.029
1.33GlnGln: 1.33 ± 0.029
2.406GlnArg: 2.406 ± 0.028
1.412GlnSer: 1.412 ± 0.022
1.446GlnThr: 1.446 ± 0.021
2.411GlnVal: 2.411 ± 0.033
0.498GlnTrp: 0.498 ± 0.012
0.664GlnTyr: 0.664 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
9.589ArgAla: 9.589 ± 0.072
0.603ArgCys: 0.603 ± 0.015
4.132ArgAsp: 4.132 ± 0.042
4.567ArgGlu: 4.567 ± 0.041
2.311ArgPhe: 2.311 ± 0.031
5.687ArgGly: 5.687 ± 0.043
2.158ArgHis: 2.158 ± 0.026
3.213ArgIle: 3.213 ± 0.028
1.627ArgLys: 1.627 ± 0.023
8.647ArgLeu: 8.647 ± 0.068
1.726ArgMet: 1.726 ± 0.023
1.348ArgAsn: 1.348 ± 0.022
4.946ArgPro: 4.946 ± 0.05
2.358ArgGln: 2.358 ± 0.026
7.644ArgArg: 7.644 ± 0.066
4.102ArgSer: 4.102 ± 0.038
5.446ArgThr: 5.446 ± 0.042
5.722ArgVal: 5.722 ± 0.039
1.412ArgTrp: 1.412 ± 0.021
1.765ArgTyr: 1.765 ± 0.027
0.0ArgXaa: 0.0 ± 0.0
Ser
7.163SerAla: 7.163 ± 0.053
0.438SerCys: 0.438 ± 0.011
2.918SerAsp: 2.918 ± 0.034
2.507SerGlu: 2.507 ± 0.029
1.556SerPhe: 1.556 ± 0.023
6.193SerGly: 6.193 ± 0.062
1.123SerHis: 1.123 ± 0.019
1.536SerIle: 1.536 ± 0.024
1.192SerLys: 1.192 ± 0.022
5.001SerLeu: 5.001 ± 0.042
1.166SerMet: 1.166 ± 0.018
0.975SerAsn: 0.975 ± 0.021
3.396SerPro: 3.396 ± 0.038
1.366SerGln: 1.366 ± 0.025
3.755SerArg: 3.755 ± 0.036
3.278SerSer: 3.278 ± 0.045
3.48SerThr: 3.48 ± 0.04
4.434SerVal: 4.434 ± 0.035
0.973SerTrp: 0.973 ± 0.02
1.312SerTyr: 1.312 ± 0.022
0.0SerXaa: 0.0 ± 0.0
Thr
9.063ThrAla: 9.063 ± 0.06
0.464ThrCys: 0.464 ± 0.011
3.731ThrAsp: 3.731 ± 0.037
3.247ThrGlu: 3.247 ± 0.029
1.694ThrPhe: 1.694 ± 0.026
6.855ThrGly: 6.855 ± 0.058
1.332ThrHis: 1.332 ± 0.022
1.811ThrIle: 1.811 ± 0.025
1.329ThrLys: 1.329 ± 0.021
5.956ThrLeu: 5.956 ± 0.047
0.964ThrMet: 0.964 ± 0.018
1.093ThrAsn: 1.093 ± 0.019
4.431ThrPro: 4.431 ± 0.04
1.495ThrGln: 1.495 ± 0.026
3.975ThrArg: 3.975 ± 0.038
3.555ThrSer: 3.555 ± 0.04
4.294ThrThr: 4.294 ± 0.047
6.19ThrVal: 6.19 ± 0.053
0.989ThrTrp: 0.989 ± 0.018
1.453ThrTyr: 1.453 ± 0.027
0.0ThrXaa: 0.0 ± 0.0
Val
10.505ValAla: 10.505 ± 0.07
0.738ValCys: 0.738 ± 0.015
4.884ValAsp: 4.884 ± 0.045
4.554ValGlu: 4.554 ± 0.039
2.407ValPhe: 2.407 ± 0.033
6.532ValGly: 6.532 ± 0.05
1.96ValHis: 1.96 ± 0.024
2.907ValIle: 2.907 ± 0.033
1.822ValLys: 1.822 ± 0.033
9.248ValLeu: 9.248 ± 0.06
1.394ValMet: 1.394 ± 0.023
1.711ValAsn: 1.711 ± 0.024
5.126ValPro: 5.126 ± 0.04
2.139ValGln: 2.139 ± 0.025
6.97ValArg: 6.97 ± 0.049
4.503ValSer: 4.503 ± 0.036
5.77ValThr: 5.77 ± 0.048
7.689ValVal: 7.689 ± 0.06
1.126ValTrp: 1.126 ± 0.019
1.546ValTyr: 1.546 ± 0.024
0.0ValXaa: 0.0 ± 0.0
Trp
1.675TrpAla: 1.675 ± 0.027
0.147TrpCys: 0.147 ± 0.007
0.832TrpAsp: 0.832 ± 0.015
0.737TrpGlu: 0.737 ± 0.015
0.511TrpPhe: 0.511 ± 0.013
1.094TrpGly: 1.094 ± 0.019
0.408TrpHis: 0.408 ± 0.011
0.59TrpIle: 0.59 ± 0.015
0.401TrpLys: 0.401 ± 0.011
1.787TrpLeu: 1.787 ± 0.026
0.304TrpMet: 0.304 ± 0.009
0.449TrpAsn: 0.449 ± 0.012
0.799TrpPro: 0.799 ± 0.017
0.651TrpGln: 0.651 ± 0.014
1.361TrpArg: 1.361 ± 0.022
1.041TrpSer: 1.041 ± 0.02
1.081TrpThr: 1.081 ± 0.018
1.011TrpVal: 1.011 ± 0.017
0.366TrpTrp: 0.366 ± 0.011
0.372TrpTyr: 0.372 ± 0.012
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.802TyrAla: 2.802 ± 0.032
0.177TyrCys: 0.177 ± 0.007
1.535TyrAsp: 1.535 ± 0.023
1.283TyrGlu: 1.283 ± 0.023
0.678TyrPhe: 0.678 ± 0.014
2.288TyrGly: 2.288 ± 0.031
0.397TyrHis: 0.397 ± 0.011
0.51TyrIle: 0.51 ± 0.012
0.442TyrLys: 0.442 ± 0.013
2.11TyrLeu: 2.11 ± 0.023
0.26TyrMet: 0.26 ± 0.008
0.443TyrAsn: 0.443 ± 0.012
1.077TyrPro: 1.077 ± 0.017
0.628TyrGln: 0.628 ± 0.014
1.801TyrArg: 1.801 ± 0.023
1.02TyrSer: 1.02 ± 0.021
1.247TyrThr: 1.247 ± 0.022
1.719TyrVal: 1.719 ± 0.027
0.376TyrTrp: 0.376 ± 0.011
0.483TyrTyr: 0.483 ± 0.014
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10549 proteins (3319167 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski