Amino acid dipeptide frequency for Amycolatopsis alkalitolerans

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
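As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter sequence encoding, and the use of overlapping windows are assumptions made for this example; the page does not describe how the database computes its values.

```python
from collections import Counter

# Hypothetical helper (not part of Proteome-pI): counts overlapping pairs of
# adjacent residues and returns their frequencies in per mille.
def dipeptide_frequencies(proteins):
    counts = Counter()
    for seq in proteins:
        # Pairs never span two proteins; a protein of length n yields n - 1 pairs.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input in one-letter code (assumed encoding for this sketch).
toy = ["AAG", "AG"]
freqs = dipeptide_frequencies(toy)
```

For the toy input there are three pairs in total (one AlaAla, two AlaGly), so the frequencies sum to 1000‰.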

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
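The uniform baseline can be checked with a few lines of code. This is a minimal sketch: the helper names are hypothetical, and the simple greater-than/less-than comparison is an assumption, since the page does not state the exact rule used for the red and blue marking. The observed values are taken from the table on this page.

```python
# Uniform baseline: every ordered pair of residue types is equally likely.
def uniform_expectation_permille(n_residue_types=20):
    return 1000.0 / (n_residue_types ** 2)

# Assumed classification rule: a plain comparison against the baseline.
def classify(observed_permille, expected_permille):
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"

expected = uniform_expectation_permille()  # 1000 / 400 per mille, i.e. 0.25%

# Three values copied from the table (in per mille).
observed = {"AlaAla": 19.56, "CysCys": 0.103, "GluGlu": 3.057}
labels = {name: classify(value, expected) for name, value in observed.items()}
```

With Xaa counted as a 21st residue type, `uniform_expectation_permille(21)` gives 1000/441, slightly below the 20-residue baseline.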

All values are presented in per mille (‰); multiply them by 10^-3 to obtain fractions.
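The unit conversion is simple arithmetic; a short sketch with hypothetical helper names, using the AlaAla value from the table as input:

```python
# Convert a per mille value to a plain fraction (multiply by 10^-3).
def permille_to_fraction(value_permille):
    return value_permille * 1e-3

# Convert a per mille value to percent (divide by 10).
def permille_to_percent(value_permille):
    return value_permille / 10.0

ala_ala_fraction = permille_to_fraction(19.56)
ala_ala_percent = permille_to_percent(19.56)
```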

For more information, see the sequence space article on Wikipedia.

Residue order for rows (first residue) and columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 19.56 ± 0.126
AlaCys: 1.061 ± 0.023
AlaAsp: 7.499 ± 0.072
AlaGlu: 8.987 ± 0.082
AlaPhe: 3.646 ± 0.046
AlaGly: 13.705 ± 0.104
AlaHis: 2.591 ± 0.039
AlaIle: 4.638 ± 0.054
AlaLys: 3.231 ± 0.046
AlaLeu: 13.657 ± 0.108
AlaMet: 2.643 ± 0.036
AlaAsn: 2.303 ± 0.039
AlaPro: 6.052 ± 0.06
AlaGln: 3.644 ± 0.051
AlaArg: 10.166 ± 0.091
AlaSer: 5.898 ± 0.058
AlaThr: 6.946 ± 0.06
AlaVal: 11.826 ± 0.092
AlaTrp: 1.751 ± 0.031
AlaTyr: 2.349 ± 0.034
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.037 ± 0.022
CysCys: 0.103 ± 0.008
CysAsp: 0.436 ± 0.019
CysGlu: 0.441 ± 0.017
CysPhe: 0.251 ± 0.012
CysGly: 0.974 ± 0.023
CysHis: 0.205 ± 0.008
CysIle: 0.193 ± 0.009
CysLys: 0.103 ± 0.007
CysLeu: 0.747 ± 0.018
CysMet: 0.107 ± 0.007
CysAsn: 0.127 ± 0.008
CysPro: 0.524 ± 0.016
CysGln: 0.199 ± 0.01
CysArg: 0.571 ± 0.018
CysSer: 0.445 ± 0.016
CysThr: 0.489 ± 0.016
CysVal: 0.659 ± 0.019
CysTrp: 0.129 ± 0.007
CysTyr: 0.193 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.003 ± 0.071
AspCys: 0.368 ± 0.014
AspAsp: 3.056 ± 0.04
AspGlu: 4.096 ± 0.045
AspPhe: 1.582 ± 0.033
AspGly: 5.536 ± 0.052
AspHis: 1.252 ± 0.024
AspIle: 1.893 ± 0.033
AspLys: 1.106 ± 0.027
AspLeu: 6.259 ± 0.061
AspMet: 0.763 ± 0.018
AspAsn: 0.94 ± 0.024
AspPro: 4.19 ± 0.053
AspGln: 1.654 ± 0.03
AspArg: 4.604 ± 0.053
AspSer: 2.386 ± 0.036
AspThr: 2.812 ± 0.04
AspVal: 5.024 ± 0.047
AspTrp: 0.893 ± 0.021
AspTyr: 1.212 ± 0.026
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.775 ± 0.072
GluCys: 0.374 ± 0.013
GluAsp: 2.78 ± 0.038
GluGlu: 3.057 ± 0.052
GluPhe: 1.829 ± 0.032
GluGly: 3.791 ± 0.04
GluHis: 1.741 ± 0.031
GluIle: 2.724 ± 0.038
GluLys: 1.408 ± 0.031
GluLeu: 7.515 ± 0.076
GluMet: 0.974 ± 0.022
GluAsn: 1.139 ± 0.024
GluPro: 3.543 ± 0.049
GluGln: 2.29 ± 0.03
GluArg: 5.384 ± 0.056
GluSer: 2.618 ± 0.04
GluThr: 2.943 ± 0.042
GluVal: 4.974 ± 0.048
GluTrp: 0.777 ± 0.019
GluTyr: 1.12 ± 0.027
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.235 ± 0.041
PheCys: 0.298 ± 0.011
PheAsp: 2.117 ± 0.031
PheGlu: 1.552 ± 0.03
PhePhe: 0.95 ± 0.027
PheGly: 3.6 ± 0.054
PheHis: 0.664 ± 0.019
PheIle: 0.811 ± 0.02
PheLys: 0.45 ± 0.016
PheLeu: 2.798 ± 0.044
PheMet: 0.378 ± 0.014
PheAsn: 0.572 ± 0.018
PhePro: 1.444 ± 0.029
PheGln: 0.664 ± 0.017
PheArg: 1.947 ± 0.03
PheSer: 1.578 ± 0.029
PheThr: 2.04 ± 0.035
PheVal: 2.561 ± 0.038
PheTrp: 0.422 ± 0.015
PheTyr: 0.616 ± 0.018
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.674 ± 0.097
GlyCys: 0.826 ± 0.023
GlyAsp: 4.734 ± 0.05
GlyGlu: 5.576 ± 0.057
GlyPhe: 3.193 ± 0.045
GlyGly: 8.497 ± 0.08
GlyHis: 2.18 ± 0.037
GlyIle: 3.92 ± 0.044
GlyLys: 2.617 ± 0.037
GlyLeu: 9.659 ± 0.07
GlyMet: 2.063 ± 0.032
GlyAsn: 1.886 ± 0.039
GlyPro: 4.581 ± 0.05
GlyGln: 2.934 ± 0.039
GlyArg: 7.112 ± 0.07
GlySer: 5.007 ± 0.058
GlyThr: 5.646 ± 0.054
GlyVal: 8.011 ± 0.071
GlyTrp: 1.767 ± 0.031
GlyTyr: 2.505 ± 0.039
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.654 ± 0.04
HisCys: 0.217 ± 0.011
HisAsp: 1.327 ± 0.026
HisGlu: 1.334 ± 0.025
HisPhe: 0.64 ± 0.02
HisGly: 2.346 ± 0.037
HisHis: 0.659 ± 0.021
HisIle: 0.649 ± 0.019
HisLys: 0.346 ± 0.014
HisLeu: 2.405 ± 0.038
HisMet: 0.285 ± 0.013
HisAsn: 0.436 ± 0.015
HisPro: 1.61 ± 0.029
HisGln: 0.649 ± 0.018
HisArg: 1.98 ± 0.034
HisSer: 1.015 ± 0.023
HisThr: 1.167 ± 0.026
HisVal: 1.769 ± 0.027
HisTrp: 0.351 ± 0.013
HisTyr: 0.528 ± 0.018
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.605 ± 0.056
IleCys: 0.334 ± 0.012
IleAsp: 2.525 ± 0.038
IleGlu: 2.35 ± 0.035
IlePhe: 0.902 ± 0.024
IleGly: 4.106 ± 0.052
IleHis: 0.635 ± 0.017
IleIle: 1.115 ± 0.029
IleLys: 0.735 ± 0.021
IleLeu: 2.779 ± 0.035
IleMet: 0.522 ± 0.017
IleAsn: 0.814 ± 0.021
IlePro: 2.053 ± 0.03
IleGln: 0.821 ± 0.019
IleArg: 2.572 ± 0.032
IleSer: 2.022 ± 0.031
IleThr: 2.475 ± 0.045
IleVal: 3.499 ± 0.047
IleTrp: 0.43 ± 0.014
IleTyr: 0.665 ± 0.021
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.783 ± 0.039
LysCys: 0.112 ± 0.008
LysAsp: 1.049 ± 0.024
LysGlu: 0.997 ± 0.025
LysPhe: 0.564 ± 0.016
LysGly: 1.523 ± 0.034
LysHis: 0.499 ± 0.017
LysIle: 1.045 ± 0.026
LysLys: 0.6 ± 0.023
LysLeu: 2.335 ± 0.04
LysMet: 0.414 ± 0.014
LysAsn: 0.453 ± 0.016
LysPro: 1.476 ± 0.029
LysGln: 0.746 ± 0.018
LysArg: 1.636 ± 0.029
LysSer: 1.029 ± 0.024
LysThr: 1.3 ± 0.025
LysVal: 1.99 ± 0.033
LysTrp: 0.288 ± 0.012
LysTyr: 0.459 ± 0.016
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.974 ± 0.112
LeuCys: 0.836 ± 0.019
LeuAsp: 6.846 ± 0.066
LeuGlu: 4.959 ± 0.054
LeuPhe: 2.797 ± 0.042
LeuGly: 9.902 ± 0.076
LeuHis: 2.254 ± 0.034
LeuIle: 3.794 ± 0.051
LeuLys: 1.813 ± 0.033
LeuLeu: 11.162 ± 0.105
LeuMet: 1.528 ± 0.032
LeuAsn: 1.784 ± 0.031
LeuPro: 6.36 ± 0.069
LeuGln: 2.187 ± 0.033
LeuArg: 8.928 ± 0.086
LeuSer: 5.783 ± 0.056
LeuThr: 6.444 ± 0.065
LeuVal: 9.531 ± 0.077
LeuTrp: 1.2 ± 0.027
LeuTyr: 1.723 ± 0.032
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.221 ± 0.03
MetCys: 0.131 ± 0.009
MetAsp: 0.833 ± 0.021
MetGlu: 0.642 ± 0.019
MetPhe: 0.534 ± 0.018
MetGly: 1.275 ± 0.024
MetHis: 0.369 ± 0.013
MetIle: 0.785 ± 0.022
MetLys: 0.407 ± 0.012
MetLeu: 1.87 ± 0.031
MetMet: 0.31 ± 0.012
MetAsn: 0.464 ± 0.015
MetPro: 1.143 ± 0.022
MetGln: 0.413 ± 0.016
MetArg: 1.549 ± 0.029
MetSer: 1.344 ± 0.026
MetThr: 1.563 ± 0.027
MetVal: 1.426 ± 0.026
MetTrp: 0.208 ± 0.011
MetTyr: 0.29 ± 0.012
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.419 ± 0.041
AsnCys: 0.165 ± 0.009
AsnAsp: 0.982 ± 0.022
AsnGlu: 0.92 ± 0.024
AsnPhe: 0.532 ± 0.019
AsnGly: 1.898 ± 0.04
AsnHis: 0.428 ± 0.016
AsnIle: 0.71 ± 0.021
AsnLys: 0.383 ± 0.015
AsnLeu: 2.014 ± 0.03
AsnMet: 0.302 ± 0.012
AsnAsn: 0.474 ± 0.032
AsnPro: 1.482 ± 0.029
AsnGln: 0.592 ± 0.018
AsnArg: 1.368 ± 0.024
AsnSer: 0.982 ± 0.022
AsnThr: 1.107 ± 0.026
AsnVal: 1.581 ± 0.028
AsnTrp: 0.314 ± 0.013
AsnTyr: 0.464 ± 0.016
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.986 ± 0.065
ProCys: 0.351 ± 0.014
ProAsp: 4.043 ± 0.046
ProGlu: 4.127 ± 0.05
ProPhe: 1.62 ± 0.028
ProGly: 6.456 ± 0.065
ProHis: 1.194 ± 0.025
ProIle: 1.701 ± 0.03
ProLys: 1.24 ± 0.026
ProLeu: 5.155 ± 0.052
ProMet: 1.037 ± 0.022
ProAsn: 1.089 ± 0.025
ProPro: 3.364 ± 0.059
ProGln: 1.538 ± 0.033
ProArg: 3.849 ± 0.049
ProSer: 3.02 ± 0.041
ProThr: 2.73 ± 0.04
ProVal: 5.436 ± 0.056
ProTrp: 0.863 ± 0.022
ProTyr: 1.183 ± 0.024
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.85 ± 0.049
GlnCys: 0.197 ± 0.011
GlnAsp: 1.217 ± 0.025
GlnGlu: 1.242 ± 0.028
GlnPhe: 0.771 ± 0.021
GlnGly: 2.028 ± 0.033
GlnHis: 0.704 ± 0.017
GlnIle: 1.201 ± 0.024
GlnLys: 0.554 ± 0.018
GlnLeu: 3.334 ± 0.047
GlnMet: 0.489 ± 0.016
GlnAsn: 0.544 ± 0.016
GlnPro: 1.82 ± 0.04
GlnGln: 1.262 ± 0.034
GlnArg: 2.506 ± 0.035
GlnSer: 1.237 ± 0.027
GlnThr: 1.323 ± 0.028
GlnVal: 2.674 ± 0.038
GlnTrp: 0.455 ± 0.017
GlnTyr: 0.583 ± 0.019
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.658 ± 0.092
ArgCys: 0.618 ± 0.02
ArgAsp: 4.192 ± 0.051
ArgGlu: 5.012 ± 0.054
ArgPhe: 2.6 ± 0.037
ArgGly: 5.817 ± 0.06
ArgHis: 1.964 ± 0.037
ArgIle: 3.369 ± 0.042
ArgLys: 1.841 ± 0.03
ArgLeu: 8.846 ± 0.083
ArgMet: 1.883 ± 0.028
ArgAsn: 1.481 ± 0.027
ArgPro: 4.53 ± 0.056
ArgGln: 2.403 ± 0.038
ArgArg: 7.925 ± 0.089
ArgSer: 3.829 ± 0.046
ArgThr: 4.614 ± 0.05
ArgVal: 6.059 ± 0.056
ArgTrp: 1.424 ± 0.028
ArgTyr: 1.971 ± 0.033
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.539 ± 0.06
SerCys: 0.432 ± 0.017
SerAsp: 2.571 ± 0.042
SerGlu: 2.455 ± 0.035
SerPhe: 1.632 ± 0.028
SerGly: 5.705 ± 0.062
SerHis: 0.916 ± 0.019
SerIle: 1.817 ± 0.031
SerLys: 0.988 ± 0.023
SerLeu: 4.972 ± 0.051
SerMet: 1.213 ± 0.024
SerAsn: 0.963 ± 0.024
SerPro: 3.036 ± 0.037
SerGln: 1.27 ± 0.029
SerArg: 3.727 ± 0.043
SerSer: 2.921 ± 0.053
SerThr: 3.183 ± 0.044
SerVal: 4.357 ± 0.044
SerTrp: 0.942 ± 0.025
SerTyr: 1.186 ± 0.024
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.646 ± 0.063
ThrCys: 0.394 ± 0.014
ThrAsp: 3.188 ± 0.041
ThrGlu: 3.36 ± 0.041
ThrPhe: 1.702 ± 0.031
ThrGly: 6.361 ± 0.061
ThrHis: 1.116 ± 0.023
ThrIle: 2.208 ± 0.041
ThrLys: 1.224 ± 0.027
ThrLeu: 5.471 ± 0.052
ThrMet: 1.088 ± 0.024
ThrAsn: 1.086 ± 0.023
ThrPro: 3.49 ± 0.047
ThrGln: 1.298 ± 0.027
ThrArg: 3.791 ± 0.045
ThrSer: 3.132 ± 0.044
ThrThr: 3.683 ± 0.056
ThrVal: 5.882 ± 0.06
ThrTrp: 0.822 ± 0.021
ThrTyr: 1.201 ± 0.027
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.499 ± 0.084
ValCys: 0.774 ± 0.021
ValAsp: 5.221 ± 0.052
ValGlu: 4.918 ± 0.056
ValPhe: 2.687 ± 0.038
ValGly: 6.756 ± 0.062
ValHis: 2.097 ± 0.032
ValIle: 3.358 ± 0.052
ValLys: 1.644 ± 0.031
ValLeu: 10.441 ± 0.084
ValMet: 1.302 ± 0.026
ValAsn: 1.819 ± 0.031
ValPro: 5.289 ± 0.051
ValGln: 2.109 ± 0.035
ValArg: 7.15 ± 0.07
ValSer: 4.688 ± 0.053
ValThr: 5.627 ± 0.057
ValVal: 8.75 ± 0.084
ValTrp: 1.055 ± 0.025
ValTyr: 1.547 ± 0.028
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.573 ± 0.026
TrpCys: 0.135 ± 0.008
TrpAsp: 0.692 ± 0.02
TrpGlu: 0.619 ± 0.017
TrpPhe: 0.549 ± 0.016
TrpGly: 1.017 ± 0.022
TrpHis: 0.408 ± 0.015
TrpIle: 0.569 ± 0.02
TrpLys: 0.283 ± 0.014
TrpLeu: 1.955 ± 0.038
TrpMet: 0.269 ± 0.01
TrpAsn: 0.329 ± 0.015
TrpPro: 0.795 ± 0.02
TrpGln: 0.618 ± 0.018
TrpArg: 1.403 ± 0.026
TrpSer: 0.86 ± 0.019
TrpThr: 0.947 ± 0.022
TrpVal: 1.063 ± 0.025
TrpTrp: 0.353 ± 0.015
TrpTyr: 0.342 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.463 ± 0.035
TyrCys: 0.205 ± 0.011
TyrAsp: 1.32 ± 0.034
TyrGlu: 1.111 ± 0.024
TyrPhe: 0.717 ± 0.02
TyrGly: 2.096 ± 0.036
TyrHis: 0.49 ± 0.016
TyrIle: 0.458 ± 0.016
TyrLys: 0.328 ± 0.014
TyrLeu: 2.432 ± 0.037
TyrMet: 0.207 ± 0.012
TyrAsn: 0.413 ± 0.017
TyrPro: 1.2 ± 0.022
TyrGln: 0.702 ± 0.019
TyrArg: 1.868 ± 0.034
TyrSer: 1.034 ± 0.023
TyrThr: 1.103 ± 0.027
TyrVal: 1.656 ± 0.031
TyrTrp: 0.336 ± 0.012
TyrTyr: 0.501 ± 0.018
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
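For readers who want to process the listing programmatically, the sketch below parses one table entry of the form `AlaAla: 19.56 ± 0.126` into its parts. The function name and the regular expression are assumptions made for this example; an optional leading number (a repeated copy of the value) is tolerated and ignored, and row labels such as `Ala` are skipped.

```python
import re

# Two three-letter residue codes, a value, "±", and an error estimate.
# An optional number in front of the codes is accepted and discarded.
ENTRY = re.compile(
    r"^\s*(?:\d+(?:\.\d+)?)?"
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*"
    r"(\d+(?:\.\d+)?)\s*±\s*(\d+(?:\.\d+)?)\s*$"
)

# Hypothetical helper: returns (first, second, value, error) or None when
# the line is not a dipeptide entry (for example a row label).
def parse_entry(line):
    match = ENTRY.match(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)

parsed = parse_entry("AlaAla: 19.56 ± 0.126")
```

The returned values remain in per mille, as in the table.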
Statistics based on 6455 proteins (2018279 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
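Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is a minimal illustration under stated assumptions, not the database's actual procedure: the function names, the one-letter toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are all choices made for this example.

```python
import random
import statistics
from collections import Counter

# Frequency (per mille) of one dipeptide, counting overlapping pairs
# within each protein.
def pair_frequency_permille(proteins, pair):
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

# Hypothetical helper: resamples whole proteins with replacement and
# returns the standard deviation of the resampled frequencies.
def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome in one-letter code (assumed data for this sketch).
toy = ["MAAGL", "MAGAA", "MLLAG", "MGAAL"]
point = pair_frequency_permille(toy, "AA")
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that co-occur within one protein stay together in every resample.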

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski