Amino acid dipeptide frequency for Streptomyces pseudovenezuelae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
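As a rough illustration of what such a frequency is, the sketch below counts overlapping adjacent residue pairs within each protein and expresses them in per mille. The function name `dipeptide_per_mille`, the one-letter sequence encoding, and the toy sequences are assumptions made for this example; the page does not describe the exact counting procedure used by Proteome-pI.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs overlap (residues i and i+1) and are counted within
    each protein only, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every adjacent pair of residues
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences in one-letter code)
toy = ["AAGA", "GAV"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

With real data, the sequences would come from the proteome's FASTA file, and the one-letter pairs would be mapped to the three-letter labels used in the table.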

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred in proteins with equal probability, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
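The comparison against the uniform expectation can be sketched as follows. The helper `classify` and the plain greater-than/less-than rule are assumptions for illustration; the page does not state the exact threshold behind the red and blue marks. The three example values are taken from the table.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "as expected"


# Values copied from the table (per mille)
examples = {"AlaAla": 20.171, "CysCys": 0.091, "AlaLeu": 14.352}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```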

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions; for example, 20.171‰ corresponds to 0.020171 (about 2.02%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.171 ± 0.133
AlaCys: 1.025 ± 0.022
AlaAsp: 8.171 ± 0.064
AlaGlu: 8.626 ± 0.084
AlaPhe: 3.525 ± 0.047
AlaGly: 12.518 ± 0.089
AlaHis: 2.922 ± 0.038
AlaIle: 3.436 ± 0.041
AlaLys: 2.9 ± 0.055
AlaLeu: 14.352 ± 0.085
AlaMet: 2.388 ± 0.031
AlaAsn: 1.987 ± 0.033
AlaPro: 6.765 ± 0.067
AlaGln: 3.866 ± 0.044
AlaArg: 10.04 ± 0.093
AlaSer: 5.984 ± 0.045
AlaThr: 7.044 ± 0.055
AlaVal: 12.209 ± 0.08
AlaTrp: 1.874 ± 0.03
AlaTyr: 2.851 ± 0.035
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.083 ± 0.021
CysCys: 0.091 ± 0.007
CysAsp: 0.463 ± 0.014
CysGlu: 0.385 ± 0.015
CysPhe: 0.221 ± 0.009
CysGly: 0.919 ± 0.022
CysHis: 0.189 ± 0.008
CysIle: 0.154 ± 0.008
CysLys: 0.103 ± 0.006
CysLeu: 0.772 ± 0.02
CysMet: 0.118 ± 0.007
CysAsn: 0.136 ± 0.008
CysPro: 0.44 ± 0.015
CysGln: 0.17 ± 0.008
CysArg: 0.572 ± 0.017
CysSer: 0.405 ± 0.012
CysThr: 0.528 ± 0.016
CysVal: 0.687 ± 0.015
CysTrp: 0.126 ± 0.007
CysTyr: 0.154 ± 0.007
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.438 ± 0.059
AspCys: 0.413 ± 0.014
AspAsp: 3.573 ± 0.039
AspGlu: 3.924 ± 0.039
AspPhe: 1.76 ± 0.027
AspGly: 6.357 ± 0.061
AspHis: 1.456 ± 0.022
AspIle: 1.966 ± 0.028
AspLys: 1.25 ± 0.024
AspLeu: 6.319 ± 0.052
AspMet: 0.849 ± 0.017
AspAsn: 1.042 ± 0.023
AspPro: 4.491 ± 0.045
AspGln: 1.625 ± 0.025
AspArg: 4.867 ± 0.047
AspSer: 2.703 ± 0.032
AspThr: 3.406 ± 0.038
AspVal: 4.821 ± 0.041
AspTrp: 1.106 ± 0.021
AspTyr: 1.157 ± 0.026
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.293 ± 0.081
GluCys: 0.364 ± 0.012
GluAsp: 2.912 ± 0.031
GluGlu: 3.554 ± 0.05
GluPhe: 1.496 ± 0.025
GluGly: 4.383 ± 0.045
GluHis: 1.489 ± 0.023
GluIle: 2.237 ± 0.029
GluLys: 1.459 ± 0.026
GluLeu: 6.839 ± 0.061
GluMet: 0.858 ± 0.017
GluAsn: 1.042 ± 0.02
GluPro: 3.305 ± 0.044
GluGln: 2.267 ± 0.031
GluArg: 5.325 ± 0.057
GluSer: 2.65 ± 0.034
GluThr: 2.925 ± 0.034
GluVal: 4.468 ± 0.047
GluTrp: 0.804 ± 0.019
GluTyr: 1.126 ± 0.023
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.662 ± 0.041
PheCys: 0.254 ± 0.009
PheAsp: 1.956 ± 0.025
PheGlu: 1.477 ± 0.026
PhePhe: 0.894 ± 0.02
PheGly: 3.035 ± 0.036
PheHis: 0.64 ± 0.014
PheIle: 0.686 ± 0.017
PheLys: 0.546 ± 0.017
PheLeu: 2.676 ± 0.039
PheMet: 0.424 ± 0.013
PheAsn: 0.607 ± 0.014
PhePro: 1.38 ± 0.022
PheGln: 0.723 ± 0.016
PheArg: 1.863 ± 0.028
PheSer: 1.474 ± 0.027
PheThr: 2.111 ± 0.027
PheVal: 2.356 ± 0.032
PheTrp: 0.449 ± 0.014
PheTyr: 0.608 ± 0.015
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.754 ± 0.076
GlyCys: 0.804 ± 0.019
GlyAsp: 5.181 ± 0.045
GlyGlu: 5.036 ± 0.05
GlyPhe: 2.963 ± 0.033
GlyGly: 8.53 ± 0.08
GlyHis: 2.31 ± 0.032
GlyIle: 3.502 ± 0.041
GlyLys: 2.411 ± 0.036
GlyLeu: 9.427 ± 0.071
GlyMet: 1.933 ± 0.028
GlyAsn: 1.829 ± 0.026
GlyPro: 4.991 ± 0.051
GlyGln: 2.701 ± 0.035
GlyArg: 7.573 ± 0.064
GlySer: 5.5 ± 0.056
GlyThr: 6.633 ± 0.07
GlyVal: 7.595 ± 0.053
GlyTrp: 1.789 ± 0.028
GlyTyr: 2.331 ± 0.029
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.666 ± 0.034
HisCys: 0.213 ± 0.009
HisAsp: 1.383 ± 0.025
HisGlu: 1.247 ± 0.021
HisPhe: 0.658 ± 0.016
HisGly: 2.427 ± 0.031
HisHis: 0.706 ± 0.018
HisIle: 0.696 ± 0.015
HisLys: 0.344 ± 0.011
HisLeu: 2.42 ± 0.033
HisMet: 0.334 ± 0.011
HisAsn: 0.379 ± 0.012
HisPro: 1.822 ± 0.029
HisGln: 0.632 ± 0.016
HisArg: 2.121 ± 0.033
HisSer: 1.02 ± 0.02
HisThr: 1.382 ± 0.024
HisVal: 1.737 ± 0.024
HisTrp: 0.399 ± 0.012
HisTyr: 0.512 ± 0.013
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.728 ± 0.046
IleCys: 0.267 ± 0.012
IleAsp: 2.257 ± 0.027
IleGlu: 1.899 ± 0.032
IlePhe: 0.684 ± 0.015
IleGly: 3.522 ± 0.041
IleHis: 0.607 ± 0.013
IleIle: 0.887 ± 0.022
IleLys: 0.75 ± 0.02
IleLeu: 2.408 ± 0.039
IleMet: 0.457 ± 0.014
IleAsn: 0.698 ± 0.017
IlePro: 1.78 ± 0.028
IleGln: 0.72 ± 0.015
IleArg: 2.25 ± 0.033
IleSer: 1.675 ± 0.026
IleThr: 2.223 ± 0.028
IleVal: 2.739 ± 0.04
IleTrp: 0.403 ± 0.011
IleTyr: 0.54 ± 0.014
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.061 ± 0.046
LysCys: 0.12 ± 0.008
LysAsp: 1.413 ± 0.025
LysGlu: 1.22 ± 0.026
LysPhe: 0.467 ± 0.014
LysGly: 1.922 ± 0.038
LysHis: 0.426 ± 0.013
LysIle: 0.893 ± 0.021
LysLys: 0.902 ± 0.027
LysLeu: 2.07 ± 0.033
LysMet: 0.376 ± 0.013
LysAsn: 0.556 ± 0.016
LysPro: 1.32 ± 0.028
LysGln: 0.738 ± 0.017
LysArg: 1.374 ± 0.022
LysSer: 1.205 ± 0.027
LysThr: 1.28 ± 0.03
LysVal: 2.018 ± 0.033
LysTrp: 0.297 ± 0.01
LysTyr: 0.51 ± 0.016
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.849 ± 0.09
LeuCys: 0.835 ± 0.017
LeuAsp: 6.864 ± 0.05
LeuGlu: 4.659 ± 0.046
LeuPhe: 2.63 ± 0.034
LeuGly: 9.258 ± 0.061
LeuHis: 2.302 ± 0.03
LeuIle: 3.217 ± 0.043
LeuLys: 2.187 ± 0.039
LeuLeu: 11.411 ± 0.092
LeuMet: 1.639 ± 0.026
LeuAsn: 1.735 ± 0.027
LeuPro: 6.413 ± 0.053
LeuGln: 2.29 ± 0.026
LeuArg: 8.588 ± 0.07
LeuSer: 5.4 ± 0.047
LeuThr: 7.154 ± 0.068
LeuVal: 9.019 ± 0.073
LeuTrp: 1.364 ± 0.023
LeuTyr: 1.954 ± 0.03
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.199 ± 0.029
MetCys: 0.144 ± 0.006
MetAsp: 0.876 ± 0.019
MetGlu: 0.719 ± 0.017
MetPhe: 0.459 ± 0.014
MetGly: 1.316 ± 0.028
MetHis: 0.353 ± 0.012
MetIle: 0.658 ± 0.016
MetLys: 0.437 ± 0.012
MetLeu: 1.611 ± 0.029
MetMet: 0.291 ± 0.011
MetAsn: 0.438 ± 0.013
MetPro: 1.094 ± 0.022
MetGln: 0.436 ± 0.013
MetArg: 1.352 ± 0.025
MetSer: 1.308 ± 0.021
MetThr: 1.593 ± 0.022
MetVal: 1.269 ± 0.025
MetTrp: 0.216 ± 0.009
MetTyr: 0.331 ± 0.011
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.243 ± 0.037
AsnCys: 0.17 ± 0.008
AsnAsp: 1.027 ± 0.02
AsnGlu: 0.815 ± 0.018
AsnPhe: 0.511 ± 0.014
AsnGly: 2.027 ± 0.038
AsnHis: 0.415 ± 0.012
AsnIle: 0.674 ± 0.018
AsnLys: 0.449 ± 0.014
AsnLeu: 1.679 ± 0.025
AsnMet: 0.296 ± 0.011
AsnAsn: 0.467 ± 0.015
AsnPro: 1.367 ± 0.022
AsnGln: 0.581 ± 0.017
AsnArg: 1.305 ± 0.022
AsnSer: 1.053 ± 0.024
AsnThr: 1.177 ± 0.024
AsnVal: 1.379 ± 0.028
AsnTrp: 0.324 ± 0.012
AsnTyr: 0.463 ± 0.014
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.144 ± 0.072
ProCys: 0.332 ± 0.013
ProAsp: 4.425 ± 0.041
ProGlu: 4.312 ± 0.05
ProPhe: 1.559 ± 0.024
ProGly: 6.318 ± 0.061
ProHis: 1.386 ± 0.021
ProIle: 1.203 ± 0.024
ProLys: 1.196 ± 0.023
ProLeu: 5.338 ± 0.046
ProMet: 0.962 ± 0.019
ProAsn: 0.895 ± 0.019
ProPro: 3.322 ± 0.051
ProGln: 1.747 ± 0.031
ProArg: 3.803 ± 0.046
ProSer: 3.225 ± 0.041
ProThr: 3.224 ± 0.038
ProVal: 5.485 ± 0.051
ProTrp: 0.909 ± 0.021
ProTyr: 1.475 ± 0.026
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.75 ± 0.042
GlnCys: 0.172 ± 0.009
GlnAsp: 1.5 ± 0.022
GlnGlu: 1.525 ± 0.027
GlnPhe: 0.731 ± 0.017
GlnGly: 2.383 ± 0.038
GlnHis: 0.685 ± 0.015
GlnIle: 1.088 ± 0.021
GlnLys: 0.667 ± 0.018
GlnLeu: 3.116 ± 0.037
GlnMet: 0.492 ± 0.013
GlnAsn: 0.545 ± 0.017
GlnPro: 1.635 ± 0.031
GlnGln: 1.286 ± 0.027
GlnArg: 2.285 ± 0.03
GlnSer: 1.372 ± 0.023
GlnThr: 1.38 ± 0.021
GlnVal: 2.451 ± 0.035
GlnTrp: 0.518 ± 0.014
GlnTyr: 0.655 ± 0.017
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.598 ± 0.08
ArgCys: 0.57 ± 0.015
ArgAsp: 4.187 ± 0.042
ArgGlu: 4.732 ± 0.051
ArgPhe: 2.335 ± 0.032
ArgGly: 5.707 ± 0.049
ArgHis: 2.104 ± 0.03
ArgIle: 3.195 ± 0.036
ArgLys: 1.595 ± 0.027
ArgLeu: 8.814 ± 0.072
ArgMet: 1.655 ± 0.026
ArgAsn: 1.337 ± 0.022
ArgPro: 4.757 ± 0.048
ArgGln: 2.306 ± 0.03
ArgArg: 7.419 ± 0.073
ArgSer: 3.993 ± 0.04
ArgThr: 5.48 ± 0.047
ArgVal: 5.922 ± 0.052
ArgTrp: 1.365 ± 0.022
ArgTyr: 1.799 ± 0.025
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.795 ± 0.061
SerCys: 0.409 ± 0.013
SerAsp: 2.843 ± 0.032
SerGlu: 2.484 ± 0.034
SerPhe: 1.556 ± 0.025
SerGly: 6.151 ± 0.058
SerHis: 1.03 ± 0.02
SerIle: 1.399 ± 0.023
SerLys: 1.075 ± 0.024
SerLeu: 4.95 ± 0.041
SerMet: 1.106 ± 0.02
SerAsn: 0.941 ± 0.026
SerPro: 3.099 ± 0.037
SerGln: 1.308 ± 0.025
SerArg: 3.664 ± 0.043
SerSer: 2.923 ± 0.047
SerThr: 3.196 ± 0.044
SerVal: 4.483 ± 0.042
SerTrp: 0.96 ± 0.019
SerTyr: 1.334 ± 0.023
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.026 ± 0.069
ThrCys: 0.441 ± 0.013
ThrAsp: 3.829 ± 0.036
ThrGlu: 3.321 ± 0.04
ThrPhe: 1.694 ± 0.024
ThrGly: 6.813 ± 0.059
ThrHis: 1.268 ± 0.022
ThrIle: 1.674 ± 0.03
ThrLys: 1.244 ± 0.024
ThrLeu: 5.928 ± 0.051
ThrMet: 0.945 ± 0.022
ThrAsn: 1.102 ± 0.023
ThrPro: 4.111 ± 0.047
ThrGln: 1.409 ± 0.02
ThrArg: 4.028 ± 0.038
ThrSer: 3.288 ± 0.041
ThrThr: 3.985 ± 0.049
ThrVal: 6.368 ± 0.054
ThrTrp: 0.968 ± 0.022
ThrTyr: 1.494 ± 0.022
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.619 ± 0.077
ValCys: 0.768 ± 0.017
ValAsp: 5.1 ± 0.045
ValGlu: 4.816 ± 0.043
ValPhe: 2.47 ± 0.034
ValGly: 6.672 ± 0.054
ValHis: 2.029 ± 0.031
ValIle: 2.831 ± 0.036
ValLys: 1.823 ± 0.036
ValLeu: 9.634 ± 0.076
ValMet: 1.408 ± 0.025
ValAsn: 1.751 ± 0.031
ValPro: 5.292 ± 0.049
ValGln: 2.131 ± 0.032
ValArg: 7.273 ± 0.057
ValSer: 4.491 ± 0.037
ValThr: 5.784 ± 0.05
ValVal: 8.183 ± 0.072
ValTrp: 1.229 ± 0.026
ValTyr: 1.65 ± 0.027
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.721 ± 0.028
TrpCys: 0.148 ± 0.008
TrpAsp: 0.897 ± 0.02
TrpGlu: 0.748 ± 0.017
TrpPhe: 0.536 ± 0.013
TrpGly: 1.097 ± 0.02
TrpHis: 0.39 ± 0.013
TrpIle: 0.591 ± 0.017
TrpLys: 0.389 ± 0.014
TrpLeu: 1.836 ± 0.026
TrpMet: 0.294 ± 0.011
TrpAsn: 0.474 ± 0.015
TrpPro: 0.808 ± 0.018
TrpGln: 0.663 ± 0.017
TrpArg: 1.328 ± 0.026
TrpSer: 1.014 ± 0.02
TrpThr: 1.145 ± 0.025
TrpVal: 1.008 ± 0.021
TrpTrp: 0.36 ± 0.012
TrpTyr: 0.394 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.852 ± 0.033
TyrCys: 0.177 ± 0.009
TyrAsp: 1.662 ± 0.025
TyrGlu: 1.295 ± 0.023
TyrPhe: 0.672 ± 0.017
TyrGly: 2.44 ± 0.032
TyrHis: 0.394 ± 0.014
TyrIle: 0.508 ± 0.014
TyrLys: 0.44 ± 0.013
TyrLeu: 2.167 ± 0.028
TyrMet: 0.275 ± 0.009
TyrAsn: 0.46 ± 0.015
TyrPro: 1.067 ± 0.02
TyrGln: 0.635 ± 0.015
TyrArg: 1.843 ± 0.028
TyrSer: 1.027 ± 0.02
TyrThr: 1.286 ± 0.023
TyrVal: 1.76 ± 0.026
TyrTrp: 0.38 ± 0.013
TyrTyr: 0.504 ± 0.014
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8204 proteins (2728014 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
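A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement 100 times, and the spread of the recomputed frequency serves as the error. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the reported ± value are all assumptions; the page only states that bootstrapping (x100) at the protein level was used.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a pair's per-mille frequency.

    Each resample draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter gives 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences in one-letter code)
toy = ["AAGA", "GAV", "AAAA", "GGAV"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, which matters if composition varies strongly between proteins.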

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski