Amino acid dipepetide frequency for Paenibacillus wynnii

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.771AlaAla: 7.771 ± 0.112
0.639AlaCys: 0.639 ± 0.022
3.804AlaAsp: 3.804 ± 0.051
5.328AlaGlu: 5.328 ± 0.072
3.022AlaPhe: 3.022 ± 0.044
6.254AlaGly: 6.254 ± 0.085
1.258AlaHis: 1.258 ± 0.03
5.188AlaIle: 5.188 ± 0.064
4.009AlaLys: 4.009 ± 0.052
7.921AlaLeu: 7.921 ± 0.081
2.069AlaMet: 2.069 ± 0.041
2.556AlaAsn: 2.556 ± 0.044
2.445AlaPro: 2.445 ± 0.048
2.477AlaGln: 2.477 ± 0.041
3.108AlaArg: 3.108 ± 0.046
4.536AlaSer: 4.536 ± 0.062
3.587AlaThr: 3.587 ± 0.054
6.208AlaVal: 6.208 ± 0.071
0.847AlaTrp: 0.847 ± 0.028
2.499AlaTyr: 2.499 ± 0.044
0.0AlaXaa: 0.0 ± 0.0
Cys
0.503CysAla: 0.503 ± 0.019
0.127CysCys: 0.127 ± 0.01
0.402CysAsp: 0.402 ± 0.017
0.405CysGlu: 0.405 ± 0.017
0.33CysPhe: 0.33 ± 0.016
0.768CysGly: 0.768 ± 0.024
0.201CysHis: 0.201 ± 0.012
0.533CysIle: 0.533 ± 0.019
0.342CysLys: 0.342 ± 0.015
0.757CysLeu: 0.757 ± 0.022
0.203CysMet: 0.203 ± 0.011
0.328CysAsn: 0.328 ± 0.016
0.339CysPro: 0.339 ± 0.017
0.255CysGln: 0.255 ± 0.015
0.46CysArg: 0.46 ± 0.02
0.602CysSer: 0.602 ± 0.02
0.435CysThr: 0.435 ± 0.02
0.448CysVal: 0.448 ± 0.017
0.091CysTrp: 0.091 ± 0.007
0.278CysTyr: 0.278 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
3.465AspAla: 3.465 ± 0.053
0.382AspCys: 0.382 ± 0.017
2.37AspAsp: 2.37 ± 0.042
3.685AspGlu: 3.685 ± 0.052
2.284AspPhe: 2.284 ± 0.04
3.677AspGly: 3.677 ± 0.061
1.092AspHis: 1.092 ± 0.03
3.973AspIle: 3.973 ± 0.054
2.995AspLys: 2.995 ± 0.056
5.158AspLeu: 5.158 ± 0.064
1.365AspMet: 1.365 ± 0.027
2.051AspAsn: 2.051 ± 0.043
2.216AspPro: 2.216 ± 0.041
1.801AspGln: 1.801 ± 0.038
2.547AspArg: 2.547 ± 0.04
3.048AspSer: 3.048 ± 0.049
2.696AspThr: 2.696 ± 0.042
3.346AspVal: 3.346 ± 0.057
0.795AspTrp: 0.795 ± 0.027
2.124AspTyr: 2.124 ± 0.046
0.0AspXaa: 0.0 ± 0.0
Glu
5.583GluAla: 5.583 ± 0.072
0.433GluCys: 0.433 ± 0.019
3.437GluAsp: 3.437 ± 0.056
5.537GluGlu: 5.537 ± 0.08
2.263GluPhe: 2.263 ± 0.043
4.688GluGly: 4.688 ± 0.063
1.529GluHis: 1.529 ± 0.036
4.829GluIle: 4.829 ± 0.059
4.108GluLys: 4.108 ± 0.059
7.142GluLeu: 7.142 ± 0.083
2.143GluMet: 2.143 ± 0.039
2.822GluAsn: 2.822 ± 0.044
2.098GluPro: 2.098 ± 0.042
3.183GluGln: 3.183 ± 0.054
3.651GluArg: 3.651 ± 0.062
3.769GluSer: 3.769 ± 0.05
3.171GluThr: 3.171 ± 0.05
4.703GluVal: 4.703 ± 0.059
0.896GluTrp: 0.896 ± 0.024
2.135GluTyr: 2.135 ± 0.04
0.0GluXaa: 0.0 ± 0.0
Phe
3.026PheAla: 3.026 ± 0.049
0.377PheCys: 0.377 ± 0.014
2.165PheAsp: 2.165 ± 0.036
2.428PheGlu: 2.428 ± 0.044
1.859PhePhe: 1.859 ± 0.047
3.193PheGly: 3.193 ± 0.053
0.886PheHis: 0.886 ± 0.028
3.089PheIle: 3.089 ± 0.049
2.132PheLys: 2.132 ± 0.039
3.985PheLeu: 3.985 ± 0.059
1.219PheMet: 1.219 ± 0.027
1.797PheAsn: 1.797 ± 0.037
1.626PhePro: 1.626 ± 0.034
1.406PheGln: 1.406 ± 0.033
1.968PheArg: 1.968 ± 0.038
3.081PheSer: 3.081 ± 0.05
2.571PheThr: 2.571 ± 0.05
2.828PheVal: 2.828 ± 0.046
0.551PheTrp: 0.551 ± 0.02
1.532PheTyr: 1.532 ± 0.033
0.0PheXaa: 0.0 ± 0.0
Gly
5.213GlyAla: 5.213 ± 0.086
0.701GlyCys: 0.701 ± 0.021
3.57GlyAsp: 3.57 ± 0.056
4.452GlyGlu: 4.452 ± 0.057
3.267GlyPhe: 3.267 ± 0.047
5.535GlyGly: 5.535 ± 0.078
1.454GlyHis: 1.454 ± 0.038
5.988GlyIle: 5.988 ± 0.065
4.518GlyLys: 4.518 ± 0.062
7.45GlyLeu: 7.45 ± 0.078
2.373GlyMet: 2.373 ± 0.041
2.967GlyAsn: 2.967 ± 0.049
1.867GlyPro: 1.867 ± 0.038
2.51GlyGln: 2.51 ± 0.04
3.336GlyArg: 3.336 ± 0.055
4.892GlySer: 4.892 ± 0.071
4.398GlyThr: 4.398 ± 0.063
5.3GlyVal: 5.3 ± 0.068
1.037GlyTrp: 1.037 ± 0.029
2.962GlyTyr: 2.962 ± 0.046
0.0GlyXaa: 0.0 ± 0.0
His
1.339HisAla: 1.339 ± 0.027
0.201HisCys: 0.201 ± 0.011
0.917HisAsp: 0.917 ± 0.027
1.256HisGlu: 1.256 ± 0.029
1.058HisPhe: 1.058 ± 0.027
1.428HisGly: 1.428 ± 0.034
0.525HisHis: 0.525 ± 0.022
1.402HisIle: 1.402 ± 0.027
0.963HisLys: 0.963 ± 0.026
2.136HisLeu: 2.136 ± 0.052
0.607HisMet: 0.607 ± 0.022
0.82HisAsn: 0.82 ± 0.023
1.185HisPro: 1.185 ± 0.028
0.711HisGln: 0.711 ± 0.022
1.019HisArg: 1.019 ± 0.029
1.3HisSer: 1.3 ± 0.031
1.139HisThr: 1.139 ± 0.027
1.31HisVal: 1.31 ± 0.029
0.303HisTrp: 0.303 ± 0.015
0.863HisTyr: 0.863 ± 0.025
0.0HisXaa: 0.0 ± 0.0
Ile
5.734IleAla: 5.734 ± 0.066
0.615IleCys: 0.615 ± 0.019
3.757IleAsp: 3.757 ± 0.05
4.493IleGlu: 4.493 ± 0.052
2.711IlePhe: 2.711 ± 0.055
5.381IleGly: 5.381 ± 0.077
1.582IleHis: 1.582 ± 0.033
5.0IleIle: 5.0 ± 0.066
3.53IleLys: 3.53 ± 0.061
6.654IleLeu: 6.654 ± 0.073
1.812IleMet: 1.812 ± 0.032
2.869IleAsn: 2.869 ± 0.048
3.296IlePro: 3.296 ± 0.048
2.537IleGln: 2.537 ± 0.045
3.493IleArg: 3.493 ± 0.059
5.254IleSer: 5.254 ± 0.068
4.338IleThr: 4.338 ± 0.066
5.193IleVal: 5.193 ± 0.062
0.69IleTrp: 0.69 ± 0.022
2.313IleTyr: 2.313 ± 0.048
0.0IleXaa: 0.0 ± 0.0
Lys
4.267LysAla: 4.267 ± 0.053
0.289LysCys: 0.289 ± 0.015
3.176LysAsp: 3.176 ± 0.054
4.576LysGlu: 4.576 ± 0.059
1.704LysPhe: 1.704 ± 0.033
4.026LysGly: 4.026 ± 0.061
1.034LysHis: 1.034 ± 0.025
3.587LysIle: 3.587 ± 0.052
3.517LysLys: 3.517 ± 0.053
5.669LysLeu: 5.669 ± 0.065
1.691LysMet: 1.691 ± 0.032
2.352LysAsn: 2.352 ± 0.047
2.212LysPro: 2.212 ± 0.035
2.349LysGln: 2.349 ± 0.041
2.72LysArg: 2.72 ± 0.039
3.354LysSer: 3.354 ± 0.049
2.838LysThr: 2.838 ± 0.042
4.042LysVal: 4.042 ± 0.059
0.734LysTrp: 0.734 ± 0.024
1.99LysTyr: 1.99 ± 0.042
0.0LysXaa: 0.0 ± 0.0
Leu
7.62LeuAla: 7.62 ± 0.08
0.85LeuCys: 0.85 ± 0.027
5.212LeuAsp: 5.212 ± 0.071
6.69LeuGlu: 6.69 ± 0.08
4.523LeuPhe: 4.523 ± 0.079
7.0LeuGly: 7.0 ± 0.075
2.107LeuHis: 2.107 ± 0.038
6.994LeuIle: 6.994 ± 0.09
5.705LeuLys: 5.705 ± 0.058
11.295LeuLeu: 11.295 ± 0.11
2.843LeuMet: 2.843 ± 0.047
4.329LeuAsn: 4.329 ± 0.06
4.484LeuPro: 4.484 ± 0.057
4.138LeuGln: 4.138 ± 0.052
4.963LeuArg: 4.963 ± 0.062
7.558LeuSer: 7.558 ± 0.083
5.807LeuThr: 5.807 ± 0.064
6.203LeuVal: 6.203 ± 0.068
1.069LeuTrp: 1.069 ± 0.027
3.25LeuTyr: 3.25 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
2.117MetAla: 2.117 ± 0.035
0.178MetCys: 0.178 ± 0.01
1.702MetAsp: 1.702 ± 0.033
1.986MetGlu: 1.986 ± 0.036
1.06MetPhe: 1.06 ± 0.028
1.918MetGly: 1.918 ± 0.04
0.47MetHis: 0.47 ± 0.019
2.007MetIle: 2.007 ± 0.039
2.09MetLys: 2.09 ± 0.042
2.969MetLeu: 2.969 ± 0.054
0.866MetMet: 0.866 ± 0.027
1.596MetAsn: 1.596 ± 0.032
1.071MetPro: 1.071 ± 0.027
0.956MetGln: 0.956 ± 0.026
1.239MetArg: 1.239 ± 0.032
1.841MetSer: 1.841 ± 0.038
1.634MetThr: 1.634 ± 0.035
1.798MetVal: 1.798 ± 0.035
0.238MetTrp: 0.238 ± 0.013
0.777MetTyr: 0.777 ± 0.022
0.0MetXaa: 0.0 ± 0.0
Asn
2.857AsnAla: 2.857 ± 0.042
0.276AsnCys: 0.276 ± 0.015
2.08AsnAsp: 2.08 ± 0.034
2.747AsnGlu: 2.747 ± 0.041
1.504AsnPhe: 1.504 ± 0.035
3.263AsnGly: 3.263 ± 0.056
0.933AsnHis: 0.933 ± 0.025
2.9AsnIle: 2.9 ± 0.052
2.415AsnLys: 2.415 ± 0.04
3.805AsnLeu: 3.805 ± 0.054
1.077AsnMet: 1.077 ± 0.031
2.005AsnAsn: 2.005 ± 0.042
2.13AsnPro: 2.13 ± 0.036
1.585AsnGln: 1.585 ± 0.034
2.06AsnArg: 2.06 ± 0.04
2.521AsnSer: 2.521 ± 0.053
2.351AsnThr: 2.351 ± 0.044
2.878AsnVal: 2.878 ± 0.043
0.547AsnTrp: 0.547 ± 0.021
1.539AsnTyr: 1.539 ± 0.033
0.0AsnXaa: 0.0 ± 0.0
Pro
2.89ProAla: 2.89 ± 0.049
0.261ProCys: 0.261 ± 0.012
2.382ProAsp: 2.382 ± 0.045
3.348ProGlu: 3.348 ± 0.05
1.83ProPhe: 1.83 ± 0.038
2.812ProGly: 2.812 ± 0.051
0.802ProHis: 0.802 ± 0.023
2.461ProIle: 2.461 ± 0.043
1.888ProLys: 1.888 ± 0.034
3.966ProLeu: 3.966 ± 0.055
0.921ProMet: 0.921 ± 0.027
1.538ProAsn: 1.538 ± 0.033
1.142ProPro: 1.142 ± 0.033
1.457ProGln: 1.457 ± 0.033
1.401ProArg: 1.401 ± 0.031
2.455ProSer: 2.455 ± 0.048
2.023ProThr: 2.023 ± 0.044
3.171ProVal: 3.171 ± 0.046
0.503ProTrp: 0.503 ± 0.017
1.523ProTyr: 1.523 ± 0.035
0.0ProXaa: 0.0 ± 0.0
Gln
2.897GlnAla: 2.897 ± 0.046
0.222GlnCys: 0.222 ± 0.013
1.654GlnAsp: 1.654 ± 0.031
2.7GlnGlu: 2.7 ± 0.043
1.513GlnPhe: 1.513 ± 0.035
2.619GlnGly: 2.619 ± 0.051
0.771GlnHis: 0.771 ± 0.023
2.554GlnIle: 2.554 ± 0.04
2.084GlnLys: 2.084 ± 0.036
3.934GlnLeu: 3.934 ± 0.057
1.141GlnMet: 1.141 ± 0.027
1.415GlnAsn: 1.415 ± 0.029
1.372GlnPro: 1.372 ± 0.033
1.731GlnGln: 1.731 ± 0.042
1.813GlnArg: 1.813 ± 0.038
2.352GlnSer: 2.352 ± 0.048
1.822GlnThr: 1.822 ± 0.034
2.349GlnVal: 2.349 ± 0.041
0.509GlnTrp: 0.509 ± 0.018
1.438GlnTyr: 1.438 ± 0.027
0.0GlnXaa: 0.0 ± 0.0
Arg
2.891ArgAla: 2.891 ± 0.048
0.367ArgCys: 0.367 ± 0.017
2.409ArgAsp: 2.409 ± 0.044
3.54ArgGlu: 3.54 ± 0.053
1.992ArgPhe: 1.992 ± 0.038
2.938ArgGly: 2.938 ± 0.047
1.038ArgHis: 1.038 ± 0.031
3.509ArgIle: 3.509 ± 0.049
3.087ArgLys: 3.087 ± 0.047
4.979ArgLeu: 4.979 ± 0.067
1.557ArgMet: 1.557 ± 0.037
2.131ArgAsn: 2.131 ± 0.037
1.564ArgPro: 1.564 ± 0.037
1.862ArgGln: 1.862 ± 0.04
2.658ArgArg: 2.658 ± 0.048
3.048ArgSer: 3.048 ± 0.048
2.537ArgThr: 2.537 ± 0.045
2.979ArgVal: 2.979 ± 0.048
0.592ArgTrp: 0.592 ± 0.018
1.851ArgTyr: 1.851 ± 0.036
0.0ArgXaa: 0.0 ± 0.0
Ser
4.728SerAla: 4.728 ± 0.061
0.493SerCys: 0.493 ± 0.019
3.147SerAsp: 3.147 ± 0.045
4.052SerGlu: 4.052 ± 0.062
3.098SerPhe: 3.098 ± 0.05
5.556SerGly: 5.556 ± 0.071
1.268SerHis: 1.268 ± 0.032
4.833SerIle: 4.833 ± 0.072
3.532SerLys: 3.532 ± 0.049
6.873SerLeu: 6.873 ± 0.076
1.891SerMet: 1.891 ± 0.039
2.6SerAsn: 2.6 ± 0.046
2.589SerPro: 2.589 ± 0.04
2.149SerGln: 2.149 ± 0.041
3.191SerArg: 3.191 ± 0.048
4.652SerSer: 4.652 ± 0.073
3.436SerThr: 3.436 ± 0.059
4.599SerVal: 4.599 ± 0.066
0.831SerTrp: 0.831 ± 0.027
2.453SerTyr: 2.453 ± 0.045
0.0SerXaa: 0.0 ± 0.0
Thr
4.54ThrAla: 4.54 ± 0.063
0.365ThrCys: 0.365 ± 0.018
2.767ThrAsp: 2.767 ± 0.049
3.367ThrGlu: 3.367 ± 0.048
2.445ThrPhe: 2.445 ± 0.043
4.647ThrGly: 4.647 ± 0.067
1.109ThrHis: 1.109 ± 0.027
3.771ThrIle: 3.771 ± 0.052
2.634ThrLys: 2.634 ± 0.044
5.851ThrLeu: 5.851 ± 0.061
1.335ThrMet: 1.335 ± 0.035
1.987ThrAsn: 1.987 ± 0.04
2.595ThrPro: 2.595 ± 0.044
1.617ThrGln: 1.617 ± 0.035
2.232ThrArg: 2.232 ± 0.039
3.507ThrSer: 3.507 ± 0.055
3.044ThrThr: 3.044 ± 0.064
4.417ThrVal: 4.417 ± 0.061
0.632ThrTrp: 0.632 ± 0.023
1.872ThrTyr: 1.872 ± 0.039
0.0ThrXaa: 0.0 ± 0.0
Val
4.874ValAla: 4.874 ± 0.072
0.601ValCys: 0.601 ± 0.023
3.624ValAsp: 3.624 ± 0.052
4.478ValGlu: 4.478 ± 0.065
3.025ValPhe: 3.025 ± 0.05
4.524ValGly: 4.524 ± 0.051
1.371ValHis: 1.371 ± 0.033
5.36ValIle: 5.36 ± 0.063
4.005ValLys: 4.005 ± 0.064
7.258ValLeu: 7.258 ± 0.074
2.058ValMet: 2.058 ± 0.037
3.006ValAsn: 3.006 ± 0.053
2.722ValPro: 2.722 ± 0.038
2.475ValGln: 2.475 ± 0.041
3.144ValArg: 3.144 ± 0.052
4.905ValSer: 4.905 ± 0.058
4.32ValThr: 4.32 ± 0.059
4.943ValVal: 4.943 ± 0.065
0.801ValTrp: 0.801 ± 0.022
2.3ValTyr: 2.3 ± 0.04
0.0ValXaa: 0.0 ± 0.0
Trp
0.757TrpAla: 0.757 ± 0.027
0.105TrpCys: 0.105 ± 0.008
0.656TrpAsp: 0.656 ± 0.023
0.796TrpGlu: 0.796 ± 0.026
0.533TrpPhe: 0.533 ± 0.019
0.862TrpGly: 0.862 ± 0.024
0.227TrpHis: 0.227 ± 0.012
0.895TrpIle: 0.895 ± 0.024
0.737TrpLys: 0.737 ± 0.023
1.345TrpLeu: 1.345 ± 0.035
0.413TrpMet: 0.413 ± 0.018
0.714TrpAsn: 0.714 ± 0.023
0.321TrpPro: 0.321 ± 0.015
0.459TrpGln: 0.459 ± 0.018
0.593TrpArg: 0.593 ± 0.02
0.85TrpSer: 0.85 ± 0.026
0.649TrpThr: 0.649 ± 0.023
0.792TrpVal: 0.792 ± 0.023
0.173TrpTrp: 0.173 ± 0.011
0.389TrpTyr: 0.389 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.453TyrAla: 2.453 ± 0.048
0.321TyrCys: 0.321 ± 0.016
1.837TyrAsp: 1.837 ± 0.036
2.259TyrGlu: 2.259 ± 0.042
1.697TyrPhe: 1.697 ± 0.033
2.702TyrGly: 2.702 ± 0.048
0.824TyrHis: 0.824 ± 0.024
2.379TyrIle: 2.379 ± 0.04
1.838TyrLys: 1.838 ± 0.039
3.492TyrLeu: 3.492 ± 0.053
0.929TyrMet: 0.929 ± 0.022
1.589TyrAsn: 1.589 ± 0.033
1.503TyrPro: 1.503 ± 0.03
1.233TyrGln: 1.233 ± 0.029
1.933TyrArg: 1.933 ± 0.041
2.446TyrSer: 2.446 ± 0.047
1.939TyrThr: 1.939 ± 0.037
2.287TyrVal: 2.287 ± 0.037
0.428TyrTrp: 0.428 ± 0.019
1.435TyrTyr: 1.435 ± 0.036
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4821 proteins (1490243 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski