Amino acid dipeptide frequency for Actinokineospora iranica

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins randomly with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
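As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping dipeptides within each protein sequence and reports them in per mille. The function name `dipeptide_frequencies`, the use of overlapping pairs, and the toy sequences are assumptions made for this example, not the Proteome-pI implementation. Any residue outside the 20 standard ones is mapped to Xaa (written `X` here).

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_frequencies(["AAAL", "ALB"])

# Expectation if all 400 standard dipeptides were equally likely.
uniform_expectation = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %
```

Pairs are counted only within a protein, never across the boundary between two sequences.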

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
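For example, converting a tabulated value to a fraction and comparing it with the uniform expectation of 1/400 could look like the sketch below. The helper names are hypothetical, and the simple above/below comparison is an assumption about how over- and underrepresentation might be judged, not a description of the colouring rule actually used for the table.

```python
UNIFORM_FRACTION = 1 / 400  # 0.0025, i.e. 0.25 % or 2.5 per mille

def per_mille_to_fraction(value):
    """Convert a per-mille value (as tabulated) to a plain fraction."""
    return value * 1e-3

def representation(value_per_mille):
    """Classify a dipeptide against the uniform expectation (assumed rule)."""
    fraction = per_mille_to_fraction(value_per_mille)
    if fraction > UNIFORM_FRACTION:
        return "over"
    if fraction < UNIFORM_FRACTION:
        return "under"
    return "expected"

ala_ala = per_mille_to_fraction(22.581)  # AlaAla value from the table
ala_ala_class = representation(22.581)   # AlaAla
cys_cys_class = representation(0.112)    # CysCys
```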

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 22.581 ± 0.195
AlaCys: 1.116 ± 0.029
AlaAsp: 9.377 ± 0.076
AlaGlu: 9.414 ± 0.08
AlaPhe: 3.582 ± 0.051
AlaGly: 12.695 ± 0.095
AlaHis: 2.958 ± 0.039
AlaIle: 4.193 ± 0.053
AlaLys: 3.019 ± 0.055
AlaLeu: 14.559 ± 0.12
AlaMet: 2.518 ± 0.04
AlaAsn: 2.421 ± 0.038
AlaPro: 6.86 ± 0.073
AlaGln: 3.527 ± 0.048
AlaArg: 10.487 ± 0.085
AlaSer: 5.404 ± 0.066
AlaThr: 7.507 ± 0.073
AlaVal: 12.923 ± 0.1
AlaTrp: 1.852 ± 0.032
AlaTyr: 2.389 ± 0.037
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.21 ± 0.027
CysCys: 0.112 ± 0.009
CysAsp: 0.481 ± 0.017
CysGlu: 0.398 ± 0.014
CysPhe: 0.232 ± 0.01
CysGly: 0.988 ± 0.026
CysHis: 0.186 ± 0.01
CysIle: 0.115 ± 0.008
CysLys: 0.11 ± 0.009
CysLeu: 0.747 ± 0.02
CysMet: 0.102 ± 0.006
CysAsn: 0.115 ± 0.008
CysPro: 0.539 ± 0.016
CysGln: 0.198 ± 0.009
CysArg: 0.615 ± 0.017
CysSer: 0.443 ± 0.017
CysThr: 0.411 ± 0.017
CysVal: 0.758 ± 0.021
CysTrp: 0.135 ± 0.008
CysTyr: 0.161 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.848 ± 0.071
AspCys: 0.415 ± 0.015
AspAsp: 3.87 ± 0.055
AspGlu: 3.873 ± 0.053
AspPhe: 1.599 ± 0.031
AspGly: 6.002 ± 0.067
AspHis: 1.551 ± 0.033
AspIle: 1.965 ± 0.034
AspLys: 1.19 ± 0.027
AspLeu: 7.163 ± 0.076
AspMet: 0.803 ± 0.021
AspAsn: 1.1 ± 0.028
AspPro: 4.7 ± 0.055
AspGln: 1.84 ± 0.035
AspArg: 5.268 ± 0.05
AspSer: 2.699 ± 0.045
AspThr: 3.378 ± 0.045
AspVal: 4.946 ± 0.061
AspTrp: 1.014 ± 0.021
AspTyr: 1.299 ± 0.027
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.205 ± 0.064
GluCys: 0.387 ± 0.015
GluAsp: 2.642 ± 0.031
GluGlu: 2.385 ± 0.046
GluPhe: 1.761 ± 0.035
GluGly: 3.448 ± 0.048
GluHis: 1.732 ± 0.03
GluIle: 2.485 ± 0.039
GluLys: 1.19 ± 0.033
GluLeu: 6.319 ± 0.069
GluMet: 0.877 ± 0.023
GluAsn: 1.065 ± 0.027
GluPro: 3.571 ± 0.048
GluGln: 2.184 ± 0.037
GluArg: 5.186 ± 0.063
GluSer: 2.708 ± 0.046
GluThr: 2.914 ± 0.046
GluVal: 4.986 ± 0.056
GluTrp: 0.789 ± 0.021
GluTyr: 0.983 ± 0.025
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.138 ± 0.053
PheCys: 0.27 ± 0.012
PheAsp: 2.226 ± 0.039
PheGlu: 1.307 ± 0.028
PhePhe: 0.825 ± 0.022
PheGly: 3.149 ± 0.046
PheHis: 0.663 ± 0.02
PheIle: 0.717 ± 0.023
PheLys: 0.415 ± 0.015
PheLeu: 2.544 ± 0.04
PheMet: 0.327 ± 0.012
PheAsn: 0.512 ± 0.017
PhePro: 1.438 ± 0.031
PheGln: 0.648 ± 0.019
PheArg: 1.866 ± 0.032
PheSer: 1.442 ± 0.026
PheThr: 2.036 ± 0.035
PheVal: 2.335 ± 0.035
PheTrp: 0.402 ± 0.015
PheTyr: 0.565 ± 0.018
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.879 ± 0.1
GlyCys: 0.818 ± 0.023
GlyAsp: 5.22 ± 0.052
GlyGlu: 4.944 ± 0.055
GlyPhe: 2.894 ± 0.036
GlyGly: 8.328 ± 0.116
GlyHis: 2.173 ± 0.039
GlyIle: 3.266 ± 0.042
GlyLys: 2.539 ± 0.038
GlyLeu: 9.194 ± 0.088
GlyMet: 1.977 ± 0.033
GlyAsn: 1.777 ± 0.039
GlyPro: 4.8 ± 0.065
GlyGln: 2.822 ± 0.047
GlyArg: 7.011 ± 0.06
GlySer: 4.819 ± 0.062
GlyThr: 5.157 ± 0.065
GlyVal: 8.246 ± 0.072
GlyTrp: 1.636 ± 0.033
GlyTyr: 2.145 ± 0.035
GlyXaa: 0.001 ± 0.001
His
HisAla: 2.725 ± 0.036
HisCys: 0.219 ± 0.01
HisAsp: 1.344 ± 0.032
HisGlu: 1.119 ± 0.026
HisPhe: 0.587 ± 0.017
HisGly: 2.366 ± 0.036
HisHis: 0.656 ± 0.019
HisIle: 0.718 ± 0.019
HisLys: 0.356 ± 0.012
HisLeu: 2.449 ± 0.034
HisMet: 0.329 ± 0.014
HisAsn: 0.441 ± 0.013
HisPro: 1.802 ± 0.033
HisGln: 0.666 ± 0.018
HisArg: 2.107 ± 0.043
HisSer: 1.035 ± 0.029
HisThr: 1.234 ± 0.023
HisVal: 1.76 ± 0.033
HisTrp: 0.384 ± 0.015
HisTyr: 0.502 ± 0.017
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.17 ± 0.056
IleCys: 0.271 ± 0.012
IleAsp: 2.566 ± 0.041
IleGlu: 2.109 ± 0.035
IlePhe: 0.629 ± 0.018
IleGly: 3.573 ± 0.052
IleHis: 0.603 ± 0.019
IleIle: 0.862 ± 0.026
IleLys: 0.681 ± 0.023
IleLeu: 2.358 ± 0.036
IleMet: 0.427 ± 0.017
IleAsn: 0.681 ± 0.018
IlePro: 1.888 ± 0.031
IleGln: 0.721 ± 0.017
IleArg: 2.38 ± 0.038
IleSer: 1.638 ± 0.034
IleThr: 2.449 ± 0.037
IleVal: 2.867 ± 0.037
IleTrp: 0.332 ± 0.013
IleTyr: 0.527 ± 0.017
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.673 ± 0.048
LysCys: 0.116 ± 0.008
LysAsp: 0.964 ± 0.027
LysGlu: 0.872 ± 0.026
LysPhe: 0.481 ± 0.017
LysGly: 1.417 ± 0.037
LysHis: 0.493 ± 0.017
LysIle: 0.905 ± 0.022
LysLys: 0.551 ± 0.02
LysLeu: 2.071 ± 0.035
LysMet: 0.356 ± 0.014
LysAsn: 0.445 ± 0.015
LysPro: 1.486 ± 0.031
LysGln: 0.648 ± 0.019
LysArg: 1.611 ± 0.03
LysSer: 1.141 ± 0.029
LysThr: 1.312 ± 0.032
LysVal: 1.902 ± 0.039
LysTrp: 0.273 ± 0.011
LysTyr: 0.378 ± 0.016
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 16.364 ± 0.142
LeuCys: 0.838 ± 0.026
LeuAsp: 7.08 ± 0.062
LeuGlu: 3.903 ± 0.056
LeuPhe: 2.682 ± 0.046
LeuGly: 9.254 ± 0.078
LeuHis: 2.179 ± 0.04
LeuIle: 3.287 ± 0.047
LeuLys: 1.453 ± 0.035
LeuLeu: 10.896 ± 0.117
LeuMet: 1.36 ± 0.025
LeuAsn: 1.63 ± 0.032
LeuPro: 6.475 ± 0.072
LeuGln: 1.66 ± 0.033
LeuArg: 9.383 ± 0.075
LeuSer: 5.466 ± 0.054
LeuThr: 6.601 ± 0.067
LeuVal: 9.836 ± 0.091
LeuTrp: 1.284 ± 0.03
LeuTyr: 1.515 ± 0.027
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.296 ± 0.038
MetCys: 0.119 ± 0.008
MetAsp: 0.798 ± 0.02
MetGlu: 0.594 ± 0.021
MetPhe: 0.447 ± 0.015
MetGly: 1.21 ± 0.025
MetHis: 0.308 ± 0.013
MetIle: 0.672 ± 0.019
MetLys: 0.321 ± 0.013
MetLeu: 1.613 ± 0.029
MetMet: 0.247 ± 0.013
MetAsn: 0.369 ± 0.015
MetPro: 1.081 ± 0.026
MetGln: 0.339 ± 0.011
MetArg: 1.487 ± 0.029
MetSer: 1.303 ± 0.025
MetThr: 1.592 ± 0.03
MetVal: 1.419 ± 0.029
MetTrp: 0.199 ± 0.01
MetTyr: 0.219 ± 0.012
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.301 ± 0.037
AsnCys: 0.189 ± 0.01
AsnAsp: 0.968 ± 0.023
AsnGlu: 0.769 ± 0.021
AsnPhe: 0.46 ± 0.015
AsnGly: 1.843 ± 0.039
AsnHis: 0.436 ± 0.016
AsnIle: 0.602 ± 0.017
AsnLys: 0.393 ± 0.016
AsnLeu: 1.892 ± 0.033
AsnMet: 0.264 ± 0.012
AsnAsn: 0.425 ± 0.015
AsnPro: 1.573 ± 0.03
AsnGln: 0.585 ± 0.017
AsnArg: 1.521 ± 0.03
AsnSer: 0.948 ± 0.025
AsnThr: 1.146 ± 0.029
AsnVal: 1.349 ± 0.028
AsnTrp: 0.287 ± 0.015
AsnTyr: 0.398 ± 0.015
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.475 ± 0.098
ProCys: 0.363 ± 0.015
ProAsp: 4.879 ± 0.052
ProGlu: 4.085 ± 0.049
ProPhe: 1.591 ± 0.026
ProGly: 6.281 ± 0.064
ProHis: 1.322 ± 0.027
ProIle: 1.657 ± 0.03
ProLys: 1.2 ± 0.029
ProLeu: 5.266 ± 0.068
ProMet: 1.082 ± 0.023
ProAsn: 1.171 ± 0.027
ProPro: 4.054 ± 0.105
ProGln: 1.578 ± 0.045
ProArg: 4.17 ± 0.061
ProSer: 3.01 ± 0.054
ProThr: 3.781 ± 0.056
ProVal: 5.738 ± 0.057
ProTrp: 0.937 ± 0.023
ProTyr: 1.051 ± 0.027
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.753 ± 0.047
GlnCys: 0.182 ± 0.009
GlnAsp: 1.277 ± 0.028
GlnGlu: 1.246 ± 0.025
GlnPhe: 0.666 ± 0.02
GlnGly: 2.056 ± 0.04
GlnHis: 0.614 ± 0.015
GlnIle: 1.04 ± 0.026
GlnLys: 0.496 ± 0.02
GlnLeu: 2.618 ± 0.042
GlnMet: 0.437 ± 0.015
GlnAsn: 0.472 ± 0.016
GlnPro: 1.826 ± 0.056
GlnGln: 0.967 ± 0.03
GlnArg: 2.512 ± 0.036
GlnSer: 1.215 ± 0.025
GlnThr: 1.318 ± 0.028
GlnVal: 2.655 ± 0.038
GlnTrp: 0.496 ± 0.017
GlnTyr: 0.475 ± 0.016
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.774 ± 0.087
ArgCys: 0.658 ± 0.018
ArgAsp: 4.596 ± 0.054
ArgGlu: 4.567 ± 0.048
ArgPhe: 2.574 ± 0.042
ArgGly: 6.104 ± 0.066
ArgHis: 2.058 ± 0.039
ArgIle: 2.899 ± 0.042
ArgLys: 1.858 ± 0.039
ArgLeu: 8.861 ± 0.068
ArgMet: 1.825 ± 0.028
ArgAsn: 1.376 ± 0.026
ArgPro: 4.943 ± 0.062
ArgGln: 2.369 ± 0.037
ArgArg: 7.461 ± 0.083
ArgSer: 3.79 ± 0.053
ArgThr: 4.646 ± 0.047
ArgVal: 6.984 ± 0.062
ArgTrp: 1.524 ± 0.031
ArgTyr: 1.782 ± 0.032
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.94 ± 0.071
SerCys: 0.406 ± 0.016
SerAsp: 2.708 ± 0.045
SerGlu: 2.216 ± 0.036
SerPhe: 1.463 ± 0.029
SerGly: 5.615 ± 0.068
SerHis: 0.893 ± 0.021
SerIle: 1.511 ± 0.028
SerLys: 0.939 ± 0.027
SerLeu: 4.62 ± 0.049
SerMet: 1.008 ± 0.025
SerAsn: 0.829 ± 0.025
SerPro: 3.139 ± 0.045
SerGln: 1.208 ± 0.027
SerArg: 3.652 ± 0.043
SerSer: 2.519 ± 0.052
SerThr: 3.271 ± 0.052
SerVal: 4.362 ± 0.045
SerTrp: 0.836 ± 0.023
SerTyr: 1.005 ± 0.023
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.525 ± 0.08
ThrCys: 0.446 ± 0.016
ThrAsp: 3.553 ± 0.048
ThrGlu: 3.345 ± 0.048
ThrPhe: 1.585 ± 0.031
ThrGly: 6.241 ± 0.072
ThrHis: 1.235 ± 0.027
ThrIle: 1.876 ± 0.038
ThrLys: 1.175 ± 0.029
ThrLeu: 5.603 ± 0.056
ThrMet: 1.023 ± 0.022
ThrAsn: 1.044 ± 0.024
ThrPro: 4.113 ± 0.057
ThrGln: 1.328 ± 0.026
ThrArg: 4.208 ± 0.06
ThrSer: 3.045 ± 0.045
ThrThr: 4.062 ± 0.073
ThrVal: 6.045 ± 0.061
ThrTrp: 0.93 ± 0.024
ThrTyr: 1.088 ± 0.023
ThrXaa: 0.0 ± 0.0
Val
ValAla: 12.557 ± 0.097
ValCys: 0.799 ± 0.02
ValAsp: 6.029 ± 0.058
ValGlu: 5.055 ± 0.053
ValPhe: 2.624 ± 0.038
ValGly: 7.308 ± 0.069
ValHis: 1.916 ± 0.031
ValIle: 3.037 ± 0.041
ValLys: 1.618 ± 0.032
ValLeu: 10.221 ± 0.093
ValMet: 1.203 ± 0.029
ValAsn: 1.784 ± 0.03
ValPro: 5.356 ± 0.064
ValGln: 1.827 ± 0.033
ValArg: 7.407 ± 0.066
ValSer: 4.661 ± 0.054
ValThr: 5.659 ± 0.065
ValVal: 9.281 ± 0.092
ValTrp: 1.143 ± 0.027
ValTyr: 1.485 ± 0.029
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.683 ± 0.032
TrpCys: 0.154 ± 0.01
TrpAsp: 0.743 ± 0.021
TrpGlu: 0.678 ± 0.02
TrpPhe: 0.537 ± 0.018
TrpGly: 1.001 ± 0.025
TrpHis: 0.408 ± 0.015
TrpIle: 0.497 ± 0.015
TrpLys: 0.301 ± 0.014
TrpLeu: 1.914 ± 0.039
TrpMet: 0.28 ± 0.012
TrpAsn: 0.331 ± 0.014
TrpPro: 0.874 ± 0.021
TrpGln: 0.578 ± 0.018
TrpArg: 1.459 ± 0.031
TrpSer: 0.932 ± 0.023
TrpThr: 0.964 ± 0.026
TrpVal: 1.172 ± 0.025
TrpTrp: 0.348 ± 0.015
TrpTyr: 0.261 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.288 ± 0.033
TyrCys: 0.175 ± 0.01
TyrAsp: 1.203 ± 0.027
TyrGlu: 0.94 ± 0.025
TyrPhe: 0.604 ± 0.018
TyrGly: 1.766 ± 0.035
TyrHis: 0.415 ± 0.013
TyrIle: 0.427 ± 0.016
TyrLys: 0.29 ± 0.014
TyrLeu: 2.237 ± 0.04
TyrMet: 0.187 ± 0.009
TyrAsn: 0.361 ± 0.016
TyrPro: 1.173 ± 0.026
TyrGln: 0.629 ± 0.02
TyrArg: 1.858 ± 0.034
TyrSer: 0.923 ± 0.024
TyrThr: 1.031 ± 0.021
TyrVal: 1.402 ± 0.024
TyrTrp: 0.318 ± 0.013
TyrTyr: 0.413 ± 0.017
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.002 ± 0.002
Statistics based on 6124 proteins (1928892 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
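A minimal sketch of protein-level bootstrapping is given below. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resampled frequencies. Both assumptions, along with the function names and the toy sequences, are illustrative and not confirmed by the source.

```python
import random
import statistics

def dipeptide_per_mille(sequences, dipeptide):
    """Frequency of one dipeptide (per mille) over all overlapping pairs."""
    total = sum(max(len(s) - 1, 0) for s in sequences)
    hits = sum(
        s[i:i + 2] == dipeptide
        for s in sequences
        for i in range(len(s) - 1)
    )
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement (assumed procedure).
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical one-letter sequences).
proteins = ["MAALKV", "MGAAR", "MLLAAG", "MKTV"]
point = dipeptide_per_mille(proteins, "AA")
error = bootstrap_error(proteins, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that co-occur within one protein stay together in every replicate.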

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski