Amino acid dipeptide frequency for Streptomyces sp. IB201691-2A2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
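As an illustration of what such a calculation could look like, here is a minimal sketch in Python. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that counts are pooled over the whole proteome, and that any residue outside the 20 standard one-letter codes is mapped to X (Xaa). The function names and the toy sequences are hypothetical; the exact procedure used to build the table is not described on this page.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_counts(sequence):
    """Count overlapping dipeptides in one protein; non-standard residues become X."""
    cleaned = "".join(aa if aa in STANDARD else "X" for aa in sequence.upper())
    return Counter(cleaned[i:i + 2] for i in range(len(cleaned) - 1))


def dipeptide_per_mille(proteins):
    """Pool counts over all proteins and express every dipeptide in per mille."""
    total = Counter()
    for seq in proteins:
        total.update(dipeptide_counts(seq))
    n = sum(total.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * total[a + b] / n if n else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy proteome, not taken from the real data.
toy_proteome = ["MAAL", "AAG"]
freqs = dipeptide_per_mille(toy_proteome)
print(freqs["AA"])  # AA occurs in 2 of the 5 dipeptides of the toy proteome
```

The result always has 441 keys (21 x 21), so dipeptides that never occur show up with a frequency of 0, as in the Xaa row and column of the table.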

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
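The uniform expectation and the over/under classification can be sketched as follows. This is only an illustration: it assumes the reference value is exactly 1/400 (or 1/441 with Xaa) and that any value above it counts as overrepresented; the threshold actually used for the colouring is not stated here. The function names are hypothetical, and the two example values are taken from the table (AlaAla and CysCys).

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(uniform_expectation_per_mille(20))  # 400 possible dipeptides
print(uniform_expectation_per_mille(21))  # 441 possible dipeptides, Xaa included
print(classify(19.773))  # AlaAla
print(classify(0.084))   # CysCys
```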

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
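For readers who want to work with fractions or percentages instead, the unit conversion is a one-liner. The helper names below are hypothetical and the input values are only examples.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


print(f"{per_mille_to_fraction(19.773):.6f}")  # fraction of all dipeptides
print(f"{per_mille_to_percent(2.5):.2f}")      # the uniform expectation in percent
```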

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 19.773 ± 0.131
AlaCys: 0.995 ± 0.02
AlaAsp: 8.064 ± 0.054
AlaGlu: 8.508 ± 0.073
AlaPhe: 3.456 ± 0.034
AlaGly: 12.447 ± 0.07
AlaHis: 2.77 ± 0.034
AlaIle: 3.566 ± 0.038
AlaLys: 2.978 ± 0.041
AlaLeu: 14.032 ± 0.092
AlaMet: 2.457 ± 0.028
AlaAsn: 2.018 ± 0.026
AlaPro: 6.66 ± 0.06
AlaGln: 3.708 ± 0.036
AlaArg: 9.679 ± 0.072
AlaSer: 6.276 ± 0.044
AlaThr: 7.075 ± 0.055
AlaVal: 11.994 ± 0.071
AlaTrp: 1.865 ± 0.029
AlaTyr: 2.824 ± 0.031
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.036 ± 0.019
CysCys: 0.084 ± 0.006
CysAsp: 0.465 ± 0.011
CysGlu: 0.403 ± 0.011
CysPhe: 0.206 ± 0.007
CysGly: 0.907 ± 0.019
CysHis: 0.179 ± 0.008
CysIle: 0.168 ± 0.008
CysLys: 0.109 ± 0.006
CysLeu: 0.734 ± 0.014
CysMet: 0.12 ± 0.007
CysAsn: 0.139 ± 0.007
CysPro: 0.447 ± 0.013
CysGln: 0.161 ± 0.009
CysArg: 0.564 ± 0.013
CysSer: 0.425 ± 0.013
CysThr: 0.497 ± 0.012
CysVal: 0.638 ± 0.013
CysTrp: 0.122 ± 0.007
CysTyr: 0.146 ± 0.007
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.345 ± 0.051
AspCys: 0.412 ± 0.013
AspAsp: 3.676 ± 0.038
AspGlu: 3.991 ± 0.035
AspPhe: 1.696 ± 0.026
AspGly: 6.44 ± 0.058
AspHis: 1.436 ± 0.022
AspIle: 1.996 ± 0.029
AspLys: 1.321 ± 0.026
AspLeu: 6.143 ± 0.047
AspMet: 0.85 ± 0.016
AspAsn: 1.055 ± 0.022
AspPro: 4.378 ± 0.04
AspGln: 1.623 ± 0.024
AspArg: 4.781 ± 0.045
AspSer: 2.878 ± 0.032
AspThr: 3.424 ± 0.034
AspVal: 4.783 ± 0.044
AspTrp: 1.079 ± 0.02
AspTyr: 1.135 ± 0.022
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.261 ± 0.061
GluCys: 0.355 ± 0.011
GluAsp: 2.85 ± 0.033
GluGlu: 3.517 ± 0.039
GluPhe: 1.532 ± 0.022
GluGly: 4.491 ± 0.041
GluHis: 1.496 ± 0.024
GluIle: 2.238 ± 0.033
GluLys: 1.547 ± 0.028
GluLeu: 6.83 ± 0.058
GluMet: 0.898 ± 0.019
GluAsn: 1.088 ± 0.02
GluPro: 3.4 ± 0.037
GluGln: 2.266 ± 0.029
GluArg: 5.305 ± 0.045
GluSer: 2.786 ± 0.032
GluThr: 2.962 ± 0.03
GluVal: 4.395 ± 0.04
GluTrp: 0.804 ± 0.015
GluTyr: 1.168 ± 0.019
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.629 ± 0.034
PheCys: 0.245 ± 0.008
PheAsp: 1.964 ± 0.028
PheGlu: 1.433 ± 0.025
PhePhe: 0.882 ± 0.017
PheGly: 3.079 ± 0.035
PheHis: 0.609 ± 0.015
PheIle: 0.757 ± 0.018
PheLys: 0.565 ± 0.014
PheLeu: 2.615 ± 0.031
PheMet: 0.426 ± 0.013
PheAsn: 0.606 ± 0.015
PhePro: 1.402 ± 0.024
PheGln: 0.699 ± 0.016
PheArg: 1.837 ± 0.023
PheSer: 1.523 ± 0.027
PheThr: 2.057 ± 0.026
PheVal: 2.277 ± 0.025
PheTrp: 0.447 ± 0.012
PheTyr: 0.59 ± 0.016
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.718 ± 0.069
GlyCys: 0.812 ± 0.018
GlyAsp: 5.255 ± 0.05
GlyGlu: 5.109 ± 0.04
GlyPhe: 2.95 ± 0.032
GlyGly: 8.805 ± 0.083
GlyHis: 2.261 ± 0.026
GlyIle: 3.59 ± 0.032
GlyLys: 2.555 ± 0.038
GlyLeu: 9.456 ± 0.063
GlyMet: 1.994 ± 0.026
GlyAsn: 1.916 ± 0.03
GlyPro: 5.159 ± 0.058
GlyGln: 2.771 ± 0.033
GlyArg: 7.485 ± 0.055
GlySer: 5.722 ± 0.047
GlyThr: 6.488 ± 0.056
GlyVal: 7.455 ± 0.056
GlyTrp: 1.705 ± 0.024
GlyTyr: 2.254 ± 0.025
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.586 ± 0.028
HisCys: 0.199 ± 0.008
HisAsp: 1.396 ± 0.024
HisGlu: 1.255 ± 0.018
HisPhe: 0.656 ± 0.016
HisGly: 2.354 ± 0.034
HisHis: 0.682 ± 0.018
HisIle: 0.715 ± 0.014
HisLys: 0.376 ± 0.01
HisLeu: 2.337 ± 0.03
HisMet: 0.328 ± 0.008
HisAsn: 0.404 ± 0.012
HisPro: 1.784 ± 0.027
HisGln: 0.649 ± 0.016
HisArg: 2.051 ± 0.028
HisSer: 1.057 ± 0.019
HisThr: 1.358 ± 0.022
HisVal: 1.655 ± 0.022
HisTrp: 0.382 ± 0.011
HisTyr: 0.492 ± 0.013
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.836 ± 0.046
IleCys: 0.287 ± 0.009
IleAsp: 2.303 ± 0.029
IleGlu: 2.009 ± 0.029
IlePhe: 0.752 ± 0.019
IleGly: 3.617 ± 0.04
IleHis: 0.65 ± 0.015
IleIle: 0.887 ± 0.019
IleLys: 0.761 ± 0.017
IleLeu: 2.492 ± 0.03
IleMet: 0.455 ± 0.013
IleAsn: 0.726 ± 0.017
IlePro: 1.873 ± 0.024
IleGln: 0.749 ± 0.017
IleArg: 2.327 ± 0.029
IleSer: 1.746 ± 0.022
IleThr: 2.317 ± 0.026
IleVal: 2.807 ± 0.034
IleTrp: 0.399 ± 0.011
IleTyr: 0.534 ± 0.015
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.067 ± 0.043
LysCys: 0.123 ± 0.007
LysAsp: 1.497 ± 0.027
LysGlu: 1.284 ± 0.025
LysPhe: 0.481 ± 0.013
LysGly: 2.011 ± 0.035
LysHis: 0.458 ± 0.011
LysIle: 0.892 ± 0.02
LysLys: 0.96 ± 0.029
LysLeu: 2.112 ± 0.03
LysMet: 0.398 ± 0.013
LysAsn: 0.553 ± 0.015
LysPro: 1.43 ± 0.024
LysGln: 0.759 ± 0.018
LysArg: 1.493 ± 0.021
LysSer: 1.304 ± 0.022
LysThr: 1.376 ± 0.027
LysVal: 2.001 ± 0.03
LysTrp: 0.32 ± 0.011
LysTyr: 0.515 ± 0.014
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.685 ± 0.092
LeuCys: 0.81 ± 0.016
LeuAsp: 6.671 ± 0.055
LeuGlu: 4.559 ± 0.043
LeuPhe: 2.648 ± 0.035
LeuGly: 9.226 ± 0.062
LeuHis: 2.274 ± 0.027
LeuIle: 3.384 ± 0.034
LeuLys: 2.219 ± 0.031
LeuLeu: 11.165 ± 0.086
LeuMet: 1.704 ± 0.023
LeuAsn: 1.779 ± 0.022
LeuPro: 6.342 ± 0.053
LeuGln: 2.239 ± 0.029
LeuArg: 8.43 ± 0.061
LeuSer: 5.498 ± 0.042
LeuThr: 6.894 ± 0.049
LeuVal: 8.794 ± 0.065
LeuTrp: 1.305 ± 0.021
LeuTyr: 1.892 ± 0.027
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.247 ± 0.03
MetCys: 0.127 ± 0.007
MetAsp: 0.89 ± 0.017
MetGlu: 0.78 ± 0.017
MetPhe: 0.456 ± 0.011
MetGly: 1.351 ± 0.019
MetHis: 0.377 ± 0.011
MetIle: 0.674 ± 0.017
MetLys: 0.454 ± 0.012
MetLeu: 1.704 ± 0.025
MetMet: 0.316 ± 0.009
MetAsn: 0.452 ± 0.012
MetPro: 1.157 ± 0.022
MetGln: 0.455 ± 0.013
MetArg: 1.447 ± 0.021
MetSer: 1.341 ± 0.023
MetThr: 1.55 ± 0.021
MetVal: 1.291 ± 0.022
MetTrp: 0.215 ± 0.009
MetTyr: 0.334 ± 0.011
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.264 ± 0.028
AsnCys: 0.17 ± 0.007
AsnAsp: 1.068 ± 0.02
AsnGlu: 0.888 ± 0.018
AsnPhe: 0.512 ± 0.013
AsnGly: 2.058 ± 0.036
AsnHis: 0.422 ± 0.013
AsnIle: 0.68 ± 0.014
AsnLys: 0.451 ± 0.015
AsnLeu: 1.687 ± 0.026
AsnMet: 0.315 ± 0.009
AsnAsn: 0.467 ± 0.013
AsnPro: 1.437 ± 0.024
AsnGln: 0.557 ± 0.015
AsnArg: 1.316 ± 0.02
AsnSer: 1.132 ± 0.022
AsnThr: 1.217 ± 0.022
AsnVal: 1.437 ± 0.023
AsnTrp: 0.333 ± 0.011
AsnTyr: 0.459 ± 0.015
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.962 ± 0.059
ProCys: 0.317 ± 0.009
ProAsp: 4.49 ± 0.043
ProGlu: 4.356 ± 0.035
ProPhe: 1.567 ± 0.021
ProGly: 6.515 ± 0.057
ProHis: 1.371 ± 0.019
ProIle: 1.317 ± 0.021
ProLys: 1.285 ± 0.022
ProLeu: 5.266 ± 0.05
ProMet: 1.029 ± 0.018
ProAsn: 0.98 ± 0.017
ProPro: 3.44 ± 0.048
ProGln: 1.751 ± 0.034
ProArg: 3.859 ± 0.039
ProSer: 3.381 ± 0.038
ProThr: 3.32 ± 0.041
ProVal: 5.404 ± 0.046
ProTrp: 0.917 ± 0.02
ProTyr: 1.403 ± 0.024
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.617 ± 0.035
GlnCys: 0.172 ± 0.008
GlnAsp: 1.477 ± 0.024
GlnGlu: 1.51 ± 0.021
GlnPhe: 0.706 ± 0.015
GlnGly: 2.33 ± 0.03
GlnHis: 0.659 ± 0.017
GlnIle: 1.084 ± 0.021
GlnLys: 0.64 ± 0.015
GlnLeu: 3.221 ± 0.028
GlnMet: 0.512 ± 0.012
GlnAsn: 0.551 ± 0.016
GlnPro: 1.726 ± 0.033
GlnGln: 1.309 ± 0.026
GlnArg: 2.377 ± 0.027
GlnSer: 1.361 ± 0.022
GlnThr: 1.401 ± 0.024
GlnVal: 2.35 ± 0.028
GlnTrp: 0.489 ± 0.014
GlnTyr: 0.631 ± 0.016
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.361 ± 0.069
ArgCys: 0.568 ± 0.014
ArgAsp: 4.212 ± 0.042
ArgGlu: 4.653 ± 0.042
ArgPhe: 2.309 ± 0.027
ArgGly: 5.695 ± 0.045
ArgHis: 2.079 ± 0.026
ArgIle: 3.274 ± 0.035
ArgLys: 1.662 ± 0.026
ArgLeu: 8.511 ± 0.064
ArgMet: 1.69 ± 0.024
ArgAsn: 1.409 ± 0.025
ArgPro: 4.773 ± 0.041
ArgGln: 2.29 ± 0.028
ArgArg: 7.478 ± 0.062
ArgSer: 4.138 ± 0.044
ArgThr: 5.442 ± 0.046
ArgVal: 5.787 ± 0.044
ArgTrp: 1.367 ± 0.021
ArgTyr: 1.777 ± 0.025
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.016 ± 0.053
SerCys: 0.397 ± 0.012
SerAsp: 2.953 ± 0.036
SerGlu: 2.672 ± 0.03
SerPhe: 1.566 ± 0.024
SerGly: 6.321 ± 0.06
SerHis: 1.084 ± 0.018
SerIle: 1.528 ± 0.022
SerLys: 1.182 ± 0.023
SerLeu: 5.014 ± 0.042
SerMet: 1.131 ± 0.019
SerAsn: 0.982 ± 0.02
SerPro: 3.363 ± 0.038
SerGln: 1.337 ± 0.022
SerArg: 3.789 ± 0.038
SerSer: 3.193 ± 0.042
SerThr: 3.321 ± 0.033
SerVal: 4.482 ± 0.04
SerTrp: 0.964 ± 0.02
SerTyr: 1.298 ± 0.019
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.863 ± 0.06
ThrCys: 0.417 ± 0.011
ThrAsp: 3.787 ± 0.04
ThrGlu: 3.369 ± 0.035
ThrPhe: 1.665 ± 0.022
ThrGly: 6.754 ± 0.06
ThrHis: 1.247 ± 0.019
ThrIle: 1.774 ± 0.024
ThrLys: 1.325 ± 0.025
ThrLeu: 5.831 ± 0.048
ThrMet: 0.961 ± 0.019
ThrAsn: 1.11 ± 0.019
ThrPro: 4.168 ± 0.039
ThrGln: 1.412 ± 0.024
ThrArg: 3.986 ± 0.035
ThrSer: 3.411 ± 0.038
ThrThr: 4.019 ± 0.038
ThrVal: 6.215 ± 0.047
ThrTrp: 0.958 ± 0.019
ThrTyr: 1.457 ± 0.023
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.438 ± 0.059
ValCys: 0.728 ± 0.017
ValAsp: 5.021 ± 0.043
ValGlu: 4.74 ± 0.042
ValPhe: 2.44 ± 0.032
ValGly: 6.632 ± 0.05
ValHis: 1.924 ± 0.026
ValIle: 2.882 ± 0.033
ValLys: 1.809 ± 0.03
ValLeu: 9.476 ± 0.069
ValMet: 1.459 ± 0.022
ValAsn: 1.721 ± 0.027
ValPro: 5.124 ± 0.046
ValGln: 2.128 ± 0.024
ValArg: 7.112 ± 0.055
ValSer: 4.418 ± 0.038
ValThr: 5.668 ± 0.044
ValVal: 7.98 ± 0.064
ValTrp: 1.209 ± 0.021
ValTyr: 1.565 ± 0.022
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.659 ± 0.025
TrpCys: 0.153 ± 0.007
TrpAsp: 0.885 ± 0.017
TrpGlu: 0.767 ± 0.016
TrpPhe: 0.5 ± 0.012
TrpGly: 1.101 ± 0.021
TrpHis: 0.374 ± 0.012
TrpIle: 0.596 ± 0.013
TrpLys: 0.394 ± 0.011
TrpLeu: 1.809 ± 0.028
TrpMet: 0.303 ± 0.01
TrpAsn: 0.448 ± 0.014
TrpPro: 0.8 ± 0.015
TrpGln: 0.64 ± 0.015
TrpArg: 1.335 ± 0.022
TrpSer: 0.972 ± 0.019
TrpThr: 1.126 ± 0.02
TrpVal: 1.001 ± 0.017
TrpTrp: 0.363 ± 0.012
TrpTyr: 0.386 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.78 ± 0.032
TyrCys: 0.177 ± 0.008
TyrAsp: 1.518 ± 0.026
TyrGlu: 1.384 ± 0.021
TyrPhe: 0.665 ± 0.017
TyrGly: 2.323 ± 0.033
TyrHis: 0.367 ± 0.011
TyrIle: 0.526 ± 0.012
TyrLys: 0.441 ± 0.013
TyrLeu: 2.081 ± 0.03
TyrMet: 0.273 ± 0.01
TyrAsn: 0.463 ± 0.013
TyrPro: 1.067 ± 0.021
TyrGln: 0.619 ± 0.014
TyrArg: 1.827 ± 0.028
TyrSer: 1.033 ± 0.019
TyrThr: 1.216 ± 0.021
TyrVal: 1.729 ± 0.022
TyrTrp: 0.373 ± 0.013
TyrTyr: 0.498 ± 0.015
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9596 proteins (3162214 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
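A protein-level bootstrap of this kind could be sketched as below. This is an illustration under stated assumptions, not the procedure used by Proteome-pI: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed from pooled counts in each of the 100 replicates, and that the reported error is the standard deviation across replicates. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(sequence):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(sequence[i:i + 2] for i in range(len(sequence) - 1))


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a per-mille frequency over replicates."""
    rng = random.Random(seed)
    # Pre-compute (hits, total dipeptides) once per protein.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[dipeptide], sum(counts.values())))
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the real data.
toy_proteome = ["MAAL", "AAG", "MKAAV", "GLLA"]
mean, error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {mean:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual dipeptides keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.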

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski