Amino acid dipeptide frequency for Streptomyces spectabilis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
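The calculation described above can be sketched as follows. This is a minimal illustration and not the code used by Proteome-pI: it assumes that dipeptides are counted as overlapping pairs of adjacent residues within each protein, that sequences use one-letter codes (the table uses three-letter codes), and that any non-standard letter is mapped to `X` (Xaa). The names `dipeptide_permille` and `classify` are hypothetical.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids
UNIFORM_EXPECTED = 1000.0 / 400         # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters becomes 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(permille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if permille > UNIFORM_EXPECTED:
        return "over"
    if permille < UNIFORM_EXPECTED:
        return "under"
    return "as expected"


# Toy input, not real proteome data.
toy = ["AAGL", "ALAA"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

Applied to the values in the table, `classify(23.546)` (AlaAla) returns `"over"` and `classify(0.085)` (CysCys) returns `"under"`.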

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
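As a small worked example of this unit conversion, the sketch below converts two values from the table (AlaAla and CysCys) to fractions and percentages and compares them with the uniform expectation of 1/400. The helper names are hypothetical and serve only as illustration.

```python
UNIFORM_EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille = 0.25%


def permille_to_fraction(value_permille):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0


def ratio_to_uniform(value_permille):
    """How many times more (or less) frequent than the uniform expectation."""
    return value_permille / UNIFORM_EXPECTED_PERMILLE


ala_ala = 23.546  # AlaAla, from the table
cys_cys = 0.085   # CysCys, from the table

print(permille_to_fraction(ala_ala))  # roughly 0.0235
print(permille_to_percent(ala_ala))   # roughly 2.35 (percent)
print(ratio_to_uniform(ala_ala))      # roughly 9.4 times the uniform expectation
print(ratio_to_uniform(cys_cys))      # well below 1, i.e. underrepresented
```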

For more information, see the "Sequence space" article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 23.546 ± 0.15
AlaCys: 1.161 ± 0.024
AlaAsp: 9.082 ± 0.072
AlaGlu: 8.994 ± 0.091
AlaPhe: 3.638 ± 0.038
AlaGly: 12.941 ± 0.09
AlaHis: 3.259 ± 0.035
AlaIle: 3.15 ± 0.036
AlaLys: 2.857 ± 0.052
AlaLeu: 15.596 ± 0.12
AlaMet: 2.404 ± 0.033
AlaAsn: 1.734 ± 0.029
AlaPro: 8.0 ± 0.082
AlaGln: 3.685 ± 0.044
AlaArg: 11.396 ± 0.088
AlaSer: 5.774 ± 0.047
AlaThr: 7.023 ± 0.055
AlaVal: 12.762 ± 0.091
AlaTrp: 1.974 ± 0.026
AlaTyr: 2.99 ± 0.034
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.241 ± 0.026
CysCys: 0.085 ± 0.006
CysAsp: 0.472 ± 0.014
CysGlu: 0.389 ± 0.012
CysPhe: 0.233 ± 0.009
CysGly: 1.0 ± 0.023
CysHis: 0.191 ± 0.009
CysIle: 0.121 ± 0.007
CysLys: 0.117 ± 0.007
CysLeu: 0.719 ± 0.017
CysMet: 0.12 ± 0.007
CysAsn: 0.117 ± 0.007
CysPro: 0.485 ± 0.014
CysGln: 0.154 ± 0.007
CysArg: 0.622 ± 0.017
CysSer: 0.41 ± 0.012
CysThr: 0.469 ± 0.014
CysVal: 0.713 ± 0.017
CysTrp: 0.117 ± 0.007
CysTyr: 0.149 ± 0.007
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.383 ± 0.067
AspCys: 0.412 ± 0.013
AspAsp: 3.82 ± 0.05
AspGlu: 3.917 ± 0.046
AspPhe: 1.679 ± 0.027
AspGly: 6.691 ± 0.067
AspHis: 1.429 ± 0.024
AspIle: 1.687 ± 0.026
AspLys: 1.276 ± 0.031
AspLeu: 6.424 ± 0.054
AspMet: 0.764 ± 0.015
AspAsn: 0.857 ± 0.018
AspPro: 4.403 ± 0.043
AspGln: 1.474 ± 0.024
AspArg: 5.081 ± 0.057
AspSer: 2.39 ± 0.032
AspThr: 3.251 ± 0.041
AspVal: 5.083 ± 0.05
AspTrp: 1.005 ± 0.023
AspTyr: 1.056 ± 0.022
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.409 ± 0.069
GluCys: 0.363 ± 0.013
GluAsp: 2.765 ± 0.034
GluGlu: 3.288 ± 0.049
GluPhe: 1.417 ± 0.024
GluGly: 4.445 ± 0.051
GluHis: 1.537 ± 0.025
GluIle: 1.941 ± 0.034
GluLys: 1.352 ± 0.031
GluLeu: 6.896 ± 0.066
GluMet: 0.793 ± 0.017
GluAsn: 0.87 ± 0.023
GluPro: 3.526 ± 0.037
GluGln: 2.105 ± 0.033
GluArg: 6.036 ± 0.058
GluSer: 2.492 ± 0.035
GluThr: 2.657 ± 0.031
GluVal: 4.389 ± 0.049
GluTrp: 0.712 ± 0.018
GluTyr: 1.073 ± 0.019
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.77 ± 0.042
PheCys: 0.262 ± 0.011
PheAsp: 2.019 ± 0.03
PheGlu: 1.387 ± 0.028
PhePhe: 0.848 ± 0.022
PheGly: 3.007 ± 0.039
PheHis: 0.642 ± 0.015
PheIle: 0.661 ± 0.017
PheLys: 0.535 ± 0.017
PheLeu: 2.529 ± 0.039
PheMet: 0.37 ± 0.013
PheAsn: 0.519 ± 0.015
PhePro: 1.385 ± 0.02
PheGln: 0.666 ± 0.016
PheArg: 1.834 ± 0.026
PheSer: 1.353 ± 0.025
PheThr: 1.995 ± 0.031
PheVal: 2.263 ± 0.032
PheTrp: 0.417 ± 0.013
PheTyr: 0.533 ± 0.017
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 12.073 ± 0.083
GlyCys: 0.839 ± 0.019
GlyAsp: 5.436 ± 0.05
GlyGlu: 5.205 ± 0.043
GlyPhe: 2.841 ± 0.038
GlyGly: 9.19 ± 0.081
GlyHis: 2.443 ± 0.032
GlyIle: 3.121 ± 0.042
GlyLys: 2.464 ± 0.049
GlyLeu: 9.587 ± 0.069
GlyMet: 1.878 ± 0.029
GlyAsn: 1.548 ± 0.033
GlyPro: 5.458 ± 0.059
GlyGln: 2.608 ± 0.038
GlyArg: 8.017 ± 0.06
GlySer: 5.133 ± 0.054
GlyThr: 6.252 ± 0.058
GlyVal: 7.867 ± 0.07
GlyTrp: 1.608 ± 0.025
GlyTyr: 2.191 ± 0.033
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.939 ± 0.038
HisCys: 0.232 ± 0.009
HisAsp: 1.384 ± 0.024
HisGlu: 1.254 ± 0.023
HisPhe: 0.616 ± 0.015
HisGly: 2.54 ± 0.033
HisHis: 0.718 ± 0.019
HisIle: 0.641 ± 0.015
HisLys: 0.373 ± 0.013
HisLeu: 2.501 ± 0.033
HisMet: 0.324 ± 0.011
HisAsn: 0.321 ± 0.011
HisPro: 1.844 ± 0.031
HisGln: 0.671 ± 0.016
HisArg: 2.156 ± 0.034
HisSer: 0.978 ± 0.021
HisThr: 1.405 ± 0.026
HisVal: 1.846 ± 0.028
HisTrp: 0.366 ± 0.012
HisTyr: 0.467 ± 0.013
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.415 ± 0.048
IleCys: 0.255 ± 0.011
IleAsp: 1.985 ± 0.028
IleGlu: 1.805 ± 0.026
IlePhe: 0.572 ± 0.017
IleGly: 3.328 ± 0.039
IleHis: 0.546 ± 0.016
IleIle: 0.717 ± 0.02
IleLys: 0.72 ± 0.017
IleLeu: 2.061 ± 0.034
IleMet: 0.384 ± 0.014
IleAsn: 0.592 ± 0.015
IlePro: 1.514 ± 0.026
IleGln: 0.624 ± 0.016
IleArg: 1.974 ± 0.029
IleSer: 1.5 ± 0.023
IleThr: 1.912 ± 0.029
IleVal: 2.405 ± 0.035
IleTrp: 0.321 ± 0.011
IleTyr: 0.442 ± 0.012
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.0 ± 0.054
LysCys: 0.115 ± 0.007
LysAsp: 1.404 ± 0.03
LysGlu: 1.259 ± 0.025
LysPhe: 0.46 ± 0.014
LysGly: 1.932 ± 0.038
LysHis: 0.453 ± 0.016
LysIle: 0.756 ± 0.021
LysLys: 0.975 ± 0.035
LysLeu: 1.955 ± 0.033
LysMet: 0.367 ± 0.012
LysAsn: 0.495 ± 0.017
LysPro: 1.406 ± 0.034
LysGln: 0.672 ± 0.018
LysArg: 1.454 ± 0.027
LysSer: 1.153 ± 0.025
LysThr: 1.194 ± 0.024
LysVal: 1.875 ± 0.037
LysTrp: 0.276 ± 0.011
LysTyr: 0.431 ± 0.013
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.841 ± 0.115
LeuCys: 0.871 ± 0.02
LeuAsp: 6.854 ± 0.064
LeuGlu: 4.457 ± 0.047
LeuPhe: 2.67 ± 0.037
LeuGly: 9.379 ± 0.074
LeuHis: 2.363 ± 0.037
LeuIle: 3.031 ± 0.041
LeuLys: 2.082 ± 0.033
LeuLeu: 11.453 ± 0.089
LeuMet: 1.57 ± 0.025
LeuAsn: 1.603 ± 0.028
LeuPro: 6.625 ± 0.065
LeuGln: 1.978 ± 0.027
LeuArg: 9.254 ± 0.063
LeuSer: 5.155 ± 0.05
LeuThr: 7.095 ± 0.057
LeuVal: 8.999 ± 0.07
LeuTrp: 1.331 ± 0.025
LeuTyr: 1.861 ± 0.029
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.182 ± 0.033
MetCys: 0.131 ± 0.007
MetAsp: 0.874 ± 0.019
MetGlu: 0.698 ± 0.02
MetPhe: 0.421 ± 0.015
MetGly: 1.224 ± 0.026
MetHis: 0.319 ± 0.011
MetIle: 0.562 ± 0.014
MetLys: 0.394 ± 0.015
MetLeu: 1.527 ± 0.026
MetMet: 0.247 ± 0.011
MetAsn: 0.393 ± 0.013
MetPro: 1.035 ± 0.023
MetGln: 0.385 ± 0.012
MetArg: 1.406 ± 0.023
MetSer: 1.203 ± 0.021
MetThr: 1.412 ± 0.024
MetVal: 1.231 ± 0.021
MetTrp: 0.193 ± 0.009
MetTyr: 0.322 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.024 ± 0.029
AsnCys: 0.149 ± 0.008
AsnAsp: 0.895 ± 0.018
AsnGlu: 0.715 ± 0.019
AsnPhe: 0.422 ± 0.014
AsnGly: 1.719 ± 0.026
AsnHis: 0.352 ± 0.012
AsnIle: 0.553 ± 0.015
AsnLys: 0.397 ± 0.014
AsnLeu: 1.482 ± 0.026
AsnMet: 0.26 ± 0.01
AsnAsn: 0.373 ± 0.013
AsnPro: 1.214 ± 0.023
AsnGln: 0.449 ± 0.012
AsnArg: 1.126 ± 0.021
AsnSer: 0.837 ± 0.021
AsnThr: 0.993 ± 0.02
AsnVal: 1.288 ± 0.024
AsnTrp: 0.262 ± 0.01
AsnTyr: 0.379 ± 0.011
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.775 ± 0.079
ProCys: 0.349 ± 0.011
ProAsp: 4.654 ± 0.051
ProGlu: 4.324 ± 0.048
ProPhe: 1.52 ± 0.026
ProGly: 7.014 ± 0.08
ProHis: 1.515 ± 0.026
ProIle: 1.162 ± 0.022
ProLys: 1.252 ± 0.028
ProLeu: 5.717 ± 0.051
ProMet: 0.93 ± 0.019
ProAsn: 0.803 ± 0.018
ProPro: 3.681 ± 0.068
ProGln: 1.789 ± 0.033
ProArg: 4.39 ± 0.05
ProSer: 3.153 ± 0.052
ProThr: 3.071 ± 0.041
ProVal: 5.519 ± 0.051
ProTrp: 0.907 ± 0.021
ProTyr: 1.321 ± 0.028
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.573 ± 0.05
GlnCys: 0.155 ± 0.008
GlnAsp: 1.448 ± 0.028
GlnGlu: 1.466 ± 0.025
GlnPhe: 0.635 ± 0.015
GlnGly: 2.301 ± 0.036
GlnHis: 0.641 ± 0.017
GlnIle: 0.92 ± 0.023
GlnLys: 0.565 ± 0.017
GlnLeu: 2.984 ± 0.036
GlnMet: 0.428 ± 0.013
GlnAsn: 0.448 ± 0.015
GlnPro: 1.536 ± 0.034
GlnGln: 1.136 ± 0.033
GlnArg: 2.421 ± 0.031
GlnSer: 1.123 ± 0.021
GlnThr: 1.194 ± 0.022
GlnVal: 2.239 ± 0.035
GlnTrp: 0.45 ± 0.012
GlnTyr: 0.537 ± 0.014
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 11.268 ± 0.091
ArgCys: 0.638 ± 0.017
ArgAsp: 4.526 ± 0.043
ArgGlu: 4.888 ± 0.052
ArgPhe: 2.364 ± 0.029
ArgGly: 6.418 ± 0.054
ArgHis: 2.27 ± 0.032
ArgIle: 2.829 ± 0.035
ArgLys: 1.626 ± 0.03
ArgLeu: 9.19 ± 0.08
ArgMet: 1.666 ± 0.024
ArgAsn: 1.238 ± 0.023
ArgPro: 5.27 ± 0.053
ArgGln: 2.304 ± 0.028
ArgArg: 7.886 ± 0.075
ArgSer: 3.954 ± 0.04
ArgThr: 5.538 ± 0.053
ArgVal: 6.401 ± 0.052
ArgTrp: 1.395 ± 0.021
ArgTyr: 1.753 ± 0.027
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.612 ± 0.057
SerCys: 0.384 ± 0.014
SerAsp: 2.545 ± 0.037
SerGlu: 2.203 ± 0.033
SerPhe: 1.509 ± 0.025
SerGly: 5.94 ± 0.053
SerHis: 1.014 ± 0.019
SerIle: 1.213 ± 0.023
SerLys: 1.002 ± 0.022
SerLeu: 4.781 ± 0.047
SerMet: 0.972 ± 0.021
SerAsn: 0.787 ± 0.019
SerPro: 3.044 ± 0.044
SerGln: 1.15 ± 0.021
SerArg: 3.54 ± 0.036
SerSer: 2.585 ± 0.04
SerThr: 2.827 ± 0.037
SerVal: 4.174 ± 0.038
SerTrp: 0.879 ± 0.018
SerTyr: 1.153 ± 0.021
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.98 ± 0.067
ThrCys: 0.405 ± 0.013
ThrAsp: 3.468 ± 0.034
ThrGlu: 3.114 ± 0.034
ThrPhe: 1.549 ± 0.026
ThrGly: 6.696 ± 0.053
ThrHis: 1.221 ± 0.019
ThrIle: 1.466 ± 0.027
ThrLys: 1.188 ± 0.024
ThrLeu: 5.666 ± 0.055
ThrMet: 0.855 ± 0.018
ThrAsn: 0.89 ± 0.02
ThrPro: 4.109 ± 0.044
ThrGln: 1.264 ± 0.023
ThrArg: 3.985 ± 0.036
ThrSer: 3.018 ± 0.038
ThrThr: 3.613 ± 0.042
ThrVal: 5.854 ± 0.049
ThrTrp: 0.87 ± 0.021
ThrTyr: 1.374 ± 0.025
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.341 ± 0.08
ValCys: 0.795 ± 0.018
ValAsp: 5.182 ± 0.052
ValGlu: 4.688 ± 0.046
ValPhe: 2.483 ± 0.033
ValGly: 6.67 ± 0.056
ValHis: 1.925 ± 0.029
ValIle: 2.679 ± 0.041
ValLys: 1.713 ± 0.031
ValLeu: 9.627 ± 0.069
ValMet: 1.328 ± 0.022
ValAsn: 1.558 ± 0.023
ValPro: 5.475 ± 0.06
ValGln: 1.931 ± 0.029
ValArg: 7.642 ± 0.064
ValSer: 4.331 ± 0.042
ValThr: 5.642 ± 0.05
ValVal: 8.179 ± 0.073
ValTrp: 1.15 ± 0.021
ValTyr: 1.477 ± 0.027
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.697 ± 0.023
TrpCys: 0.158 ± 0.008
TrpAsp: 0.819 ± 0.018
TrpGlu: 0.713 ± 0.017
TrpPhe: 0.483 ± 0.014
TrpGly: 1.089 ± 0.024
TrpHis: 0.364 ± 0.012
TrpIle: 0.483 ± 0.014
TrpLys: 0.356 ± 0.012
TrpLeu: 1.79 ± 0.029
TrpMet: 0.258 ± 0.01
TrpAsn: 0.386 ± 0.012
TrpPro: 0.811 ± 0.018
TrpGln: 0.616 ± 0.017
TrpArg: 1.411 ± 0.022
TrpSer: 0.922 ± 0.021
TrpThr: 0.958 ± 0.022
TrpVal: 0.956 ± 0.02
TrpTrp: 0.335 ± 0.012
TrpTyr: 0.335 ± 0.011
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.896 ± 0.034
TyrCys: 0.168 ± 0.009
TyrAsp: 1.45 ± 0.025
TyrGlu: 1.302 ± 0.022
TyrPhe: 0.637 ± 0.014
TyrGly: 2.232 ± 0.031
TyrHis: 0.374 ± 0.012
TyrIle: 0.38 ± 0.012
TyrLys: 0.387 ± 0.015
TyrLeu: 1.987 ± 0.029
TyrMet: 0.241 ± 0.009
TyrAsn: 0.357 ± 0.012
TyrPro: 1.024 ± 0.019
TyrGln: 0.539 ± 0.012
TyrArg: 1.791 ± 0.027
TyrSer: 0.849 ± 0.018
TyrThr: 1.084 ± 0.021
TyrVal: 1.776 ± 0.032
TyrTrp: 0.373 ± 0.012
TyrTyr: 0.436 ± 0.015
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7680 proteins (2715784 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
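A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the Proteome-pI implementation: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported ± value corresponds to the standard deviation across replicates (the page does not say which spread measure is used). The function names and the toy sequences are hypothetical.

```python
import random
from statistics import mean, stdev


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter dipeptide in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_permille(sequences, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    # Precompute (occurrences of pair, number of dipeptide positions) per protein.
    per_protein = [(count_pair(s, pair), max(len(s) - 1, 0)) for s in sequences]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy sequences in one-letter code, not real proteome data.
proteins = ["AAGL", "ALAA", "GGLA", "AAAA"]
avg, err = bootstrap_permille(proteins, "AA")
print(f"AA: {avg:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than single dipeptides keeps the correlation between residues of the same protein intact, which is presumably why the error is estimated at the protein level.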

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski