Amino acid dipeptide frequency for Streptomyces alkaliterrae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
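As an illustration of how such a frequency can be computed, here is a minimal sketch. It assumes one-letter sequence codes and overlapping windows of two residues that never span two proteins; the function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the database's own counting procedure may differ.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2, counted within each protein only.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real proteome data): MA, AA, AL from the first
# sequence and AA, AG from the second, five dipeptides in total.
toy = ["MAAL", "AAG"]
freqs = dipeptide_frequencies(toy)
print(freqs["AA"])  # 2 of 5 dipeptides
```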

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
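The uniform expectation and the red/blue marking can be sketched as follows. This assumes that the marking threshold is exactly the uniform expectation described above, which the page implies but does not state precisely; the function names are hypothetical.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)

def classify(observed_per_mille, n_residue_types=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"    # would be marked red
    if observed_per_mille < expected:
        return "under"   # would be marked blue
    return "as expected"

print(expected_per_mille())    # 400 possibilities: 2.5 per mille
print(expected_per_mille(21))  # 441 possibilities, Xaa included
print(classify(22.019))        # AlaAla value from the table
print(classify(0.106))         # CysCys value from the table
```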

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
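A short sketch of the unit conversion, using the AlaAla value from the table as input (the helper names are hypothetical):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0

ala_ala = 22.019  # AlaAla, in per mille, from the table
fraction = per_mille_to_fraction(ala_ala)  # about 0.022
percent = per_mille_to_percent(ala_ala)    # about 2.2 %
print(fraction, percent)
```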

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 22.019 ± 0.193
AlaCys: 1.058 ± 0.029
AlaAsp: 8.86 ± 0.083
AlaGlu: 9.724 ± 0.11
AlaPhe: 3.318 ± 0.046
AlaGly: 12.89 ± 0.088
AlaHis: 2.838 ± 0.049
AlaIle: 3.036 ± 0.053
AlaLys: 2.694 ± 0.059
AlaLeu: 14.523 ± 0.129
AlaMet: 2.377 ± 0.04
AlaAsn: 1.815 ± 0.033
AlaPro: 6.848 ± 0.077
AlaGln: 3.384 ± 0.047
AlaArg: 11.01 ± 0.105
AlaSer: 5.352 ± 0.059
AlaThr: 6.87 ± 0.063
AlaVal: 12.376 ± 0.103
AlaTrp: 1.749 ± 0.033
AlaTyr: 2.477 ± 0.035
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.152 ± 0.031
CysCys: 0.106 ± 0.009
CysAsp: 0.483 ± 0.016
CysGlu: 0.445 ± 0.018
CysPhe: 0.231 ± 0.012
CysGly: 1.014 ± 0.024
CysHis: 0.2 ± 0.01
CysIle: 0.134 ± 0.009
CysLys: 0.11 ± 0.01
CysLeu: 0.775 ± 0.021
CysMet: 0.122 ± 0.009
CysAsn: 0.12 ± 0.008
CysPro: 0.51 ± 0.018
CysGln: 0.183 ± 0.009
CysArg: 0.711 ± 0.024
CysSer: 0.409 ± 0.017
CysThr: 0.46 ± 0.015
CysVal: 0.753 ± 0.021
CysTrp: 0.13 ± 0.009
CysTyr: 0.156 ± 0.01
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.597 ± 0.077
AspCys: 0.443 ± 0.015
AspAsp: 3.63 ± 0.059
AspGlu: 4.135 ± 0.059
AspPhe: 1.614 ± 0.034
AspGly: 6.549 ± 0.08
AspHis: 1.453 ± 0.03
AspIle: 1.589 ± 0.033
AspLys: 1.203 ± 0.037
AspLeu: 5.99 ± 0.067
AspMet: 0.826 ± 0.023
AspAsn: 1.047 ± 0.032
AspPro: 4.48 ± 0.058
AspGln: 1.499 ± 0.032
AspArg: 5.377 ± 0.059
AspSer: 2.561 ± 0.042
AspThr: 3.125 ± 0.043
AspVal: 4.851 ± 0.061
AspTrp: 0.995 ± 0.025
AspTyr: 1.143 ± 0.03
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.635 ± 0.099
GluCys: 0.431 ± 0.016
GluAsp: 2.988 ± 0.052
GluGlu: 4.141 ± 0.067
GluPhe: 1.522 ± 0.029
GluGly: 4.6 ± 0.057
GluHis: 1.576 ± 0.031
GluIle: 2.239 ± 0.038
GluLys: 1.479 ± 0.037
GluLeu: 7.636 ± 0.08
GluMet: 0.969 ± 0.031
GluAsn: 1.059 ± 0.029
GluPro: 3.82 ± 0.055
GluGln: 2.317 ± 0.042
GluArg: 6.368 ± 0.08
GluSer: 2.755 ± 0.043
GluThr: 3.023 ± 0.04
GluVal: 4.844 ± 0.057
GluTrp: 0.833 ± 0.025
GluTyr: 1.183 ± 0.026
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.434 ± 0.05
PheCys: 0.263 ± 0.011
PheAsp: 1.982 ± 0.035
PheGlu: 1.462 ± 0.031
PhePhe: 0.847 ± 0.025
PheGly: 2.942 ± 0.043
PheHis: 0.624 ± 0.02
PheIle: 0.593 ± 0.021
PheLys: 0.454 ± 0.018
PheLeu: 2.539 ± 0.045
PheMet: 0.353 ± 0.013
PheAsn: 0.546 ± 0.018
PhePro: 1.311 ± 0.026
PheGln: 0.655 ± 0.021
PheArg: 1.902 ± 0.03
PheSer: 1.399 ± 0.031
PheThr: 1.884 ± 0.034
PheVal: 2.235 ± 0.036
PheTrp: 0.432 ± 0.019
PheTyr: 0.532 ± 0.017
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.786 ± 0.095
GlyCys: 0.87 ± 0.026
GlyAsp: 5.329 ± 0.079
GlyGlu: 5.832 ± 0.053
GlyPhe: 2.703 ± 0.039
GlyGly: 9.717 ± 0.094
GlyHis: 2.253 ± 0.041
GlyIle: 2.803 ± 0.038
GlyLys: 2.305 ± 0.052
GlyLeu: 9.264 ± 0.086
GlyMet: 1.964 ± 0.033
GlyAsn: 1.713 ± 0.039
GlyPro: 5.354 ± 0.067
GlyGln: 2.581 ± 0.04
GlyArg: 8.649 ± 0.072
GlySer: 5.007 ± 0.063
GlyThr: 5.818 ± 0.072
GlyVal: 7.877 ± 0.084
GlyTrp: 1.662 ± 0.035
GlyTyr: 2.291 ± 0.042
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.641 ± 0.046
HisCys: 0.221 ± 0.013
HisAsp: 1.365 ± 0.033
HisGlu: 1.205 ± 0.028
HisPhe: 0.622 ± 0.022
HisGly: 2.424 ± 0.041
HisHis: 0.765 ± 0.026
HisIle: 0.57 ± 0.02
HisLys: 0.333 ± 0.015
HisLeu: 2.413 ± 0.034
HisMet: 0.309 ± 0.013
HisAsn: 0.398 ± 0.019
HisPro: 1.777 ± 0.033
HisGln: 0.679 ± 0.021
HisArg: 2.329 ± 0.035
HisSer: 1.02 ± 0.024
HisThr: 1.345 ± 0.029
HisVal: 1.749 ± 0.035
HisTrp: 0.373 ± 0.015
HisTyr: 0.455 ± 0.015
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.053 ± 0.061
IleCys: 0.257 ± 0.013
IleAsp: 1.932 ± 0.036
IleGlu: 1.803 ± 0.036
IlePhe: 0.625 ± 0.02
IleGly: 3.091 ± 0.057
IleHis: 0.551 ± 0.02
IleIle: 0.786 ± 0.023
IleLys: 0.674 ± 0.023
IleLeu: 2.023 ± 0.037
IleMet: 0.395 ± 0.016
IleAsn: 0.7 ± 0.021
IlePro: 1.532 ± 0.029
IleGln: 0.625 ± 0.018
IleArg: 2.163 ± 0.037
IleSer: 1.587 ± 0.034
IleThr: 1.907 ± 0.037
IleVal: 2.44 ± 0.048
IleTrp: 0.31 ± 0.013
IleTyr: 0.455 ± 0.017
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.651 ± 0.05
LysCys: 0.115 ± 0.008
LysAsp: 1.229 ± 0.036
LysGlu: 1.182 ± 0.026
LysPhe: 0.385 ± 0.015
LysGly: 1.731 ± 0.061
LysHis: 0.415 ± 0.015
LysIle: 0.765 ± 0.022
LysLys: 0.782 ± 0.037
LysLeu: 1.946 ± 0.04
LysMet: 0.342 ± 0.014
LysAsn: 0.476 ± 0.022
LysPro: 1.357 ± 0.035
LysGln: 0.655 ± 0.021
LysArg: 1.548 ± 0.03
LysSer: 1.093 ± 0.029
LysThr: 1.161 ± 0.033
LysVal: 1.783 ± 0.04
LysTrp: 0.235 ± 0.011
LysTyr: 0.401 ± 0.017
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.621 ± 0.137
LeuCys: 0.903 ± 0.021
LeuAsp: 6.703 ± 0.07
LeuGlu: 5.101 ± 0.067
LeuPhe: 2.539 ± 0.054
LeuGly: 9.264 ± 0.091
LeuHis: 2.26 ± 0.037
LeuIle: 2.891 ± 0.05
LeuLys: 1.768 ± 0.039
LeuLeu: 12.093 ± 0.128
LeuMet: 1.593 ± 0.037
LeuAsn: 1.651 ± 0.035
LeuPro: 6.779 ± 0.074
LeuGln: 1.919 ± 0.042
LeuArg: 9.564 ± 0.085
LeuSer: 5.237 ± 0.059
LeuThr: 6.881 ± 0.067
LeuVal: 9.282 ± 0.094
LeuTrp: 1.347 ± 0.032
LeuTyr: 1.803 ± 0.034
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.239 ± 0.036
MetCys: 0.142 ± 0.009
MetAsp: 0.833 ± 0.02
MetGlu: 0.751 ± 0.021
MetPhe: 0.45 ± 0.017
MetGly: 1.26 ± 0.03
MetHis: 0.316 ± 0.014
MetIle: 0.624 ± 0.022
MetLys: 0.39 ± 0.016
MetLeu: 1.717 ± 0.036
MetMet: 0.285 ± 0.014
MetAsn: 0.378 ± 0.017
MetPro: 1.09 ± 0.025
MetGln: 0.376 ± 0.015
MetArg: 1.512 ± 0.027
MetSer: 1.289 ± 0.026
MetThr: 1.405 ± 0.025
MetVal: 1.248 ± 0.029
MetTrp: 0.206 ± 0.013
MetTyr: 0.306 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.193 ± 0.035
AsnCys: 0.166 ± 0.009
AsnAsp: 0.888 ± 0.026
AsnGlu: 0.845 ± 0.024
AsnPhe: 0.472 ± 0.017
AsnGly: 1.798 ± 0.041
AsnHis: 0.39 ± 0.014
AsnIle: 0.593 ± 0.019
AsnLys: 0.442 ± 0.019
AsnLeu: 1.6 ± 0.03
AsnMet: 0.289 ± 0.015
AsnAsn: 0.459 ± 0.019
AsnPro: 1.307 ± 0.027
AsnGln: 0.457 ± 0.016
AsnArg: 1.393 ± 0.029
AsnSer: 0.977 ± 0.028
AsnThr: 1.048 ± 0.03
AsnVal: 1.383 ± 0.03
AsnTrp: 0.264 ± 0.012
AsnTyr: 0.38 ± 0.018
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.644 ± 0.094
ProCys: 0.361 ± 0.016
ProAsp: 4.624 ± 0.064
ProGlu: 4.812 ± 0.054
ProPhe: 1.48 ± 0.034
ProGly: 6.831 ± 0.081
ProHis: 1.456 ± 0.034
ProIle: 1.227 ± 0.027
ProLys: 1.136 ± 0.038
ProLeu: 5.596 ± 0.071
ProMet: 0.971 ± 0.024
ProAsn: 0.928 ± 0.023
ProPro: 3.892 ± 0.067
ProGln: 1.561 ± 0.037
ProArg: 4.279 ± 0.055
ProSer: 3.09 ± 0.047
ProThr: 3.324 ± 0.047
ProVal: 5.403 ± 0.067
ProTrp: 0.855 ± 0.024
ProTyr: 1.28 ± 0.028
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.49 ± 0.049
GlnCys: 0.167 ± 0.01
GlnAsp: 1.251 ± 0.027
GlnGlu: 1.424 ± 0.028
GlnPhe: 0.615 ± 0.019
GlnGly: 2.075 ± 0.041
GlnHis: 0.605 ± 0.017
GlnIle: 0.936 ± 0.025
GlnLys: 0.495 ± 0.019
GlnLeu: 3.023 ± 0.046
GlnMet: 0.467 ± 0.018
GlnAsn: 0.475 ± 0.017
GlnPro: 1.781 ± 0.053
GlnGln: 1.126 ± 0.033
GlnArg: 2.526 ± 0.049
GlnSer: 1.143 ± 0.025
GlnThr: 1.225 ± 0.028
GlnVal: 2.208 ± 0.036
GlnTrp: 0.402 ± 0.016
GlnTyr: 0.524 ± 0.02
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.84 ± 0.098
ArgCys: 0.689 ± 0.022
ArgAsp: 4.609 ± 0.051
ArgGlu: 5.574 ± 0.068
ArgPhe: 2.436 ± 0.039
ArgGly: 6.251 ± 0.068
ArgHis: 2.338 ± 0.036
ArgIle: 2.875 ± 0.04
ArgLys: 1.708 ± 0.034
ArgLeu: 10.022 ± 0.091
ArgMet: 1.807 ± 0.04
ArgAsn: 1.412 ± 0.03
ArgPro: 5.586 ± 0.072
ArgGln: 2.509 ± 0.046
ArgArg: 9.14 ± 0.09
ArgSer: 4.227 ± 0.055
ArgThr: 5.3 ± 0.064
ArgVal: 6.696 ± 0.069
ArgTrp: 1.522 ± 0.033
ArgTyr: 1.964 ± 0.033
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.567 ± 0.072
SerCys: 0.441 ± 0.017
SerAsp: 2.572 ± 0.042
SerGlu: 2.544 ± 0.04
SerPhe: 1.444 ± 0.031
SerGly: 5.802 ± 0.065
SerHis: 1.036 ± 0.024
SerIle: 1.207 ± 0.026
SerLys: 0.932 ± 0.025
SerLeu: 4.671 ± 0.058
SerMet: 0.965 ± 0.026
SerAsn: 0.846 ± 0.023
SerPro: 3.176 ± 0.042
SerGln: 1.173 ± 0.029
SerArg: 3.935 ± 0.05
SerSer: 2.547 ± 0.039
SerThr: 2.874 ± 0.045
SerVal: 4.114 ± 0.052
SerTrp: 0.858 ± 0.023
SerTyr: 1.105 ± 0.027
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.695 ± 0.081
ThrCys: 0.397 ± 0.017
ThrAsp: 3.465 ± 0.043
ThrGlu: 3.291 ± 0.053
ThrPhe: 1.585 ± 0.034
ThrGly: 6.475 ± 0.063
ThrHis: 1.182 ± 0.028
ThrIle: 1.579 ± 0.031
ThrLys: 1.019 ± 0.032
ThrLeu: 5.562 ± 0.051
ThrMet: 0.912 ± 0.024
ThrAsn: 0.913 ± 0.024
ThrPro: 3.906 ± 0.054
ThrGln: 1.176 ± 0.027
ThrArg: 4.038 ± 0.053
ThrSer: 2.95 ± 0.045
ThrThr: 3.741 ± 0.07
ThrVal: 5.832 ± 0.056
ThrTrp: 0.807 ± 0.023
ThrTyr: 1.181 ± 0.032
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.783 ± 0.089
ValCys: 0.836 ± 0.024
ValAsp: 5.24 ± 0.062
ValGlu: 5.176 ± 0.052
ValPhe: 2.4 ± 0.04
ValGly: 6.962 ± 0.068
ValHis: 1.94 ± 0.035
ValIle: 2.585 ± 0.048
ValLys: 1.629 ± 0.04
ValLeu: 9.816 ± 0.104
ValMet: 1.365 ± 0.03
ValAsn: 1.676 ± 0.037
ValPro: 5.363 ± 0.055
ValGln: 1.888 ± 0.029
ValArg: 7.585 ± 0.07
ValSer: 4.337 ± 0.055
ValThr: 5.294 ± 0.065
ValVal: 8.239 ± 0.094
ValTrp: 1.159 ± 0.028
ValTyr: 1.575 ± 0.032
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.621 ± 0.032
TrpCys: 0.167 ± 0.01
TrpAsp: 0.76 ± 0.023
TrpGlu: 0.728 ± 0.02
TrpPhe: 0.505 ± 0.017
TrpGly: 1.005 ± 0.027
TrpHis: 0.378 ± 0.014
TrpIle: 0.486 ± 0.02
TrpLys: 0.336 ± 0.017
TrpLeu: 1.858 ± 0.04
TrpMet: 0.267 ± 0.014
TrpAsn: 0.349 ± 0.016
TrpPro: 0.787 ± 0.024
TrpGln: 0.575 ± 0.021
TrpArg: 1.514 ± 0.032
TrpSer: 0.897 ± 0.021
TrpThr: 0.957 ± 0.023
TrpVal: 0.965 ± 0.026
TrpTrp: 0.34 ± 0.015
TrpTyr: 0.327 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.555 ± 0.038
TyrCys: 0.172 ± 0.009
TyrAsp: 1.364 ± 0.04
TyrGlu: 1.241 ± 0.028
TyrPhe: 0.595 ± 0.02
TyrGly: 2.096 ± 0.037
TyrHis: 0.418 ± 0.018
TyrIle: 0.391 ± 0.017
TyrLys: 0.364 ± 0.016
TyrLeu: 2.132 ± 0.032
TyrMet: 0.238 ± 0.011
TyrAsn: 0.383 ± 0.015
TyrPro: 1.095 ± 0.029
TyrGln: 0.619 ± 0.019
TyrArg: 1.961 ± 0.039
TyrSer: 0.934 ± 0.026
TyrThr: 1.066 ± 0.028
TyrVal: 1.573 ± 0.029
TyrTrp: 0.342 ± 0.015
TyrTyr: 0.435 ± 0.019
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5331 proteins (1683650 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
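A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement and that the reported error is the standard deviation of the resampled estimates; the page does not specify either detail, and the function names and toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev

def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_dipeptide(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, not single residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)

# Toy sequences, not real proteome data.
toy = ["MAAL", "AAG", "GGAV", "MKLAA"]
estimate, error = bootstrap_dipeptide(toy, "AA")
print(f"AA: {estimate:.3f} ± {error:.3f}")
```

Resampling at the protein level keeps residues from the same protein together, so the estimate reflects variation between proteins rather than between individual positions.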

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski