Amino acid dipeptide frequency for Paenibacillus sp. JCM 10914

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
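As a rough illustration of what such a calculation could look like, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs of adjacent residues within each protein, never across protein boundaries; the page does not spell out the exact counting procedure. The function name and the toy sequences are hypothetical, and the sketch uses one-letter codes, whereas the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome (one-letter codes), not data from this page.
toy_proteome = ["MALLK", "MLLA"]
freqs = dipeptide_frequencies(toy_proteome)
```

By construction, the frequencies of all observed dipeptides sum to 1000‰.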

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
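The expected value and the red/blue marking can be sketched as follows. This is only an assumption about how the marking works: the page does not state its exact threshold, so the sketch simply compares each value with the uniform expectation of 1000/400 = 2.5‰ and ignores the error estimates. The function and variable names are hypothetical; the four example values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation over the 400 standard dipeptides, in per mille (2.5 = 0.25%).
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # assumed to correspond to red in the table
    if freq_per_mille < expected:
        return "under"   # assumed to correspond to blue in the table
    return "expected"


# Frequencies in per mille, copied from the table.
examples = {"LeuLeu": 10.758, "AlaAla": 7.955, "CysCys": 0.121, "TrpTrp": 0.186}
labels = {pair: classify(value) for pair, value in examples.items()}
```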

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
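For example, the conversion from per mille to a plain fraction or a percentage could be written like this (a trivial sketch; the helper name is hypothetical and the LeuLeu value is taken from the table):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


leu_leu_fraction = per_mille_to_fraction(10.758)  # LeuLeu value from the table
leu_leu_percent = leu_leu_fraction * 100.0        # same quantity expressed in percent
```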

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.955 ± 0.1
AlaCys: 0.682 ± 0.021
AlaAsp: 4.119 ± 0.049
AlaGlu: 5.323 ± 0.068
AlaPhe: 3.139 ± 0.046
AlaGly: 6.395 ± 0.068
AlaHis: 1.456 ± 0.029
AlaIle: 5.281 ± 0.059
AlaLys: 3.784 ± 0.052
AlaLeu: 7.963 ± 0.07
AlaMet: 2.361 ± 0.037
AlaAsn: 2.664 ± 0.044
AlaPro: 2.573 ± 0.047
AlaGln: 2.732 ± 0.049
AlaArg: 3.533 ± 0.044
AlaSer: 4.936 ± 0.059
AlaThr: 3.656 ± 0.072
AlaVal: 6.046 ± 0.061
AlaTrp: 0.985 ± 0.027
AlaTyr: 2.622 ± 0.037
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.482 ± 0.017
CysCys: 0.121 ± 0.008
CysAsp: 0.372 ± 0.016
CysGlu: 0.4 ± 0.017
CysPhe: 0.292 ± 0.013
CysGly: 0.709 ± 0.022
CysHis: 0.213 ± 0.01
CysIle: 0.503 ± 0.019
CysLys: 0.284 ± 0.015
CysLeu: 0.702 ± 0.02
CysMet: 0.24 ± 0.012
CysAsn: 0.249 ± 0.012
CysPro: 0.346 ± 0.014
CysGln: 0.224 ± 0.012
CysArg: 0.454 ± 0.018
CysSer: 0.608 ± 0.02
CysThr: 0.38 ± 0.014
CysVal: 0.433 ± 0.015
CysTrp: 0.115 ± 0.007
CysTyr: 0.254 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.815 ± 0.048
AspCys: 0.347 ± 0.014
AspAsp: 2.534 ± 0.039
AspGlu: 3.909 ± 0.048
AspPhe: 2.073 ± 0.033
AspGly: 4.041 ± 0.051
AspHis: 1.349 ± 0.03
AspIle: 3.79 ± 0.046
AspLys: 2.438 ± 0.036
AspLeu: 4.824 ± 0.062
AspMet: 1.596 ± 0.03
AspAsn: 1.822 ± 0.033
AspPro: 2.437 ± 0.039
AspGln: 2.333 ± 0.036
AspArg: 2.858 ± 0.042
AspSer: 2.841 ± 0.04
AspThr: 2.57 ± 0.041
AspVal: 3.632 ± 0.048
AspTrp: 0.848 ± 0.023
AspTyr: 2.174 ± 0.037
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.802 ± 0.062
GluCys: 0.375 ± 0.017
GluAsp: 3.477 ± 0.052
GluGlu: 5.441 ± 0.073
GluPhe: 2.219 ± 0.04
GluGly: 4.62 ± 0.06
GluHis: 1.658 ± 0.033
GluIle: 4.376 ± 0.058
GluLys: 3.345 ± 0.054
GluLeu: 7.252 ± 0.074
GluMet: 2.176 ± 0.036
GluAsn: 2.307 ± 0.038
GluPro: 2.422 ± 0.041
GluGln: 3.834 ± 0.046
GluArg: 4.152 ± 0.049
GluSer: 3.649 ± 0.045
GluThr: 3.212 ± 0.046
GluVal: 4.601 ± 0.056
GluTrp: 0.996 ± 0.027
GluTyr: 2.093 ± 0.036
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.037 ± 0.042
PheCys: 0.329 ± 0.012
PheAsp: 2.199 ± 0.034
PheGlu: 2.388 ± 0.038
PhePhe: 1.762 ± 0.042
PheGly: 3.063 ± 0.048
PheHis: 0.95 ± 0.025
PheIle: 2.993 ± 0.047
PheLys: 1.891 ± 0.035
PheLeu: 3.704 ± 0.061
PheMet: 1.272 ± 0.029
PheAsn: 1.623 ± 0.036
PhePro: 1.558 ± 0.031
PheGln: 1.576 ± 0.033
PheArg: 2.004 ± 0.039
PheSer: 2.741 ± 0.04
PheThr: 2.443 ± 0.038
PheVal: 2.829 ± 0.046
PheTrp: 0.531 ± 0.021
PheTyr: 1.448 ± 0.029
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.369 ± 0.098
GlyCys: 0.658 ± 0.018
GlyAsp: 3.485 ± 0.046
GlyGlu: 4.632 ± 0.053
GlyPhe: 3.14 ± 0.043
GlyGly: 5.449 ± 0.078
GlyHis: 1.584 ± 0.031
GlyIle: 5.78 ± 0.063
GlyLys: 4.034 ± 0.054
GlyLeu: 7.198 ± 0.074
GlyMet: 2.548 ± 0.043
GlyAsn: 2.823 ± 0.049
GlyPro: 2.143 ± 0.095
GlyGln: 2.776 ± 0.042
GlyArg: 3.538 ± 0.041
GlySer: 4.882 ± 0.074
GlyThr: 4.466 ± 0.06
GlyVal: 5.239 ± 0.079
GlyTrp: 1.08 ± 0.023
GlyTyr: 3.01 ± 0.043
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.715 ± 0.036
HisCys: 0.212 ± 0.013
HisAsp: 1.231 ± 0.028
HisGlu: 1.523 ± 0.03
HisPhe: 1.037 ± 0.023
HisGly: 1.651 ± 0.035
HisHis: 0.716 ± 0.022
HisIle: 1.528 ± 0.036
HisLys: 0.884 ± 0.023
HisLeu: 2.154 ± 0.041
HisMet: 0.654 ± 0.02
HisAsn: 0.822 ± 0.025
HisPro: 1.289 ± 0.029
HisGln: 0.922 ± 0.024
HisArg: 1.236 ± 0.026
HisSer: 1.292 ± 0.026
HisThr: 1.174 ± 0.024
HisVal: 1.595 ± 0.033
HisTrp: 0.335 ± 0.016
HisTyr: 0.995 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.728 ± 0.066
IleCys: 0.605 ± 0.019
IleAsp: 3.635 ± 0.047
IleGlu: 4.384 ± 0.058
IlePhe: 2.456 ± 0.041
IleGly: 5.446 ± 0.069
IleHis: 1.711 ± 0.035
IleIle: 4.453 ± 0.059
IleLys: 2.841 ± 0.042
IleLeu: 6.195 ± 0.068
IleMet: 1.966 ± 0.034
IleAsn: 2.413 ± 0.04
IlePro: 3.301 ± 0.043
IleGln: 2.882 ± 0.041
IleArg: 3.858 ± 0.054
IleSer: 4.688 ± 0.056
IleThr: 4.056 ± 0.057
IleVal: 5.099 ± 0.062
IleTrp: 0.791 ± 0.023
IleTyr: 2.207 ± 0.037
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.686 ± 0.051
LysCys: 0.212 ± 0.012
LysAsp: 2.682 ± 0.043
LysGlu: 3.864 ± 0.052
LysPhe: 1.455 ± 0.028
LysGly: 3.283 ± 0.045
LysHis: 1.127 ± 0.027
LysIle: 2.844 ± 0.043
LysLys: 2.714 ± 0.046
LysLeu: 5.052 ± 0.061
LysMet: 1.487 ± 0.03
LysAsn: 1.892 ± 0.034
LysPro: 2.152 ± 0.037
LysGln: 2.439 ± 0.032
LysArg: 2.748 ± 0.046
LysSer: 2.853 ± 0.043
LysThr: 2.445 ± 0.039
LysVal: 3.29 ± 0.05
LysTrp: 0.685 ± 0.023
LysTyr: 1.641 ± 0.033
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.929 ± 0.081
LeuCys: 0.783 ± 0.022
LeuAsp: 5.259 ± 0.056
LeuGlu: 6.338 ± 0.07
LeuPhe: 4.385 ± 0.062
LeuGly: 6.813 ± 0.074
LeuHis: 2.335 ± 0.04
LeuIle: 6.725 ± 0.081
LeuLys: 4.98 ± 0.056
LeuLeu: 10.758 ± 0.114
LeuMet: 2.8 ± 0.041
LeuAsn: 4.057 ± 0.048
LeuPro: 4.523 ± 0.052
LeuGln: 4.209 ± 0.05
LeuArg: 4.989 ± 0.065
LeuSer: 7.188 ± 0.065
LeuThr: 5.603 ± 0.061
LeuVal: 6.15 ± 0.067
LeuTrp: 1.074 ± 0.028
LeuTyr: 3.259 ± 0.045
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.252 ± 0.035
MetCys: 0.171 ± 0.01
MetAsp: 1.662 ± 0.034
MetGlu: 2.109 ± 0.034
MetPhe: 1.185 ± 0.025
MetGly: 1.842 ± 0.036
MetHis: 0.521 ± 0.016
MetIle: 2.256 ± 0.036
MetLys: 2.08 ± 0.034
MetLeu: 3.155 ± 0.051
MetMet: 1.041 ± 0.024
MetAsn: 1.766 ± 0.03
MetPro: 1.192 ± 0.031
MetGln: 1.108 ± 0.026
MetArg: 1.373 ± 0.035
MetSer: 1.936 ± 0.036
MetThr: 1.833 ± 0.032
MetVal: 1.943 ± 0.037
MetTrp: 0.291 ± 0.011
MetTyr: 0.846 ± 0.024
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.762 ± 0.042
AsnCys: 0.238 ± 0.013
AsnAsp: 1.955 ± 0.034
AsnGlu: 2.635 ± 0.043
AsnPhe: 1.302 ± 0.026
AsnGly: 3.18 ± 0.053
AsnHis: 0.949 ± 0.021
AsnIle: 2.503 ± 0.036
AsnLys: 1.861 ± 0.039
AsnLeu: 3.391 ± 0.052
AsnMet: 1.141 ± 0.026
AsnAsn: 1.567 ± 0.035
AsnPro: 2.088 ± 0.032
AsnGln: 1.72 ± 0.034
AsnArg: 2.192 ± 0.036
AsnSer: 2.199 ± 0.039
AsnThr: 2.127 ± 0.042
AsnVal: 2.659 ± 0.037
AsnTrp: 0.559 ± 0.018
AsnTyr: 1.365 ± 0.033
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.196 ± 0.049
ProCys: 0.24 ± 0.013
ProAsp: 2.679 ± 0.043
ProGlu: 3.366 ± 0.05
ProPhe: 1.794 ± 0.037
ProGly: 3.021 ± 0.045
ProHis: 0.938 ± 0.021
ProIle: 2.656 ± 0.041
ProLys: 1.657 ± 0.034
ProLeu: 4.03 ± 0.053
ProMet: 1.057 ± 0.025
ProAsn: 1.58 ± 0.026
ProPro: 1.295 ± 0.029
ProGln: 1.434 ± 0.028
ProArg: 1.5 ± 0.03
ProSer: 2.711 ± 0.044
ProThr: 2.113 ± 0.069
ProVal: 3.355 ± 0.075
ProTrp: 0.558 ± 0.019
ProTyr: 1.557 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.506 ± 0.045
GlnCys: 0.218 ± 0.011
GlnAsp: 2.078 ± 0.038
GlnGlu: 3.032 ± 0.043
GlnPhe: 1.576 ± 0.03
GlnGly: 2.866 ± 0.044
GlnHis: 1.051 ± 0.027
GlnIle: 2.612 ± 0.041
GlnLys: 1.809 ± 0.033
GlnLeu: 4.36 ± 0.053
GlnMet: 1.219 ± 0.028
GlnAsn: 1.364 ± 0.027
GlnPro: 1.647 ± 0.032
GlnGln: 2.176 ± 0.039
GlnArg: 2.185 ± 0.039
GlnSer: 2.485 ± 0.039
GlnThr: 2.015 ± 0.032
GlnVal: 2.728 ± 0.039
GlnTrp: 0.601 ± 0.019
GlnTyr: 1.461 ± 0.03
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.27 ± 0.044
ArgCys: 0.402 ± 0.018
ArgAsp: 2.527 ± 0.04
ArgGlu: 3.773 ± 0.053
ArgPhe: 2.241 ± 0.038
ArgGly: 3.026 ± 0.047
ArgHis: 1.195 ± 0.026
ArgIle: 3.89 ± 0.043
ArgLys: 2.871 ± 0.044
ArgLeu: 5.369 ± 0.065
ArgMet: 1.848 ± 0.032
ArgAsn: 2.086 ± 0.035
ArgPro: 1.713 ± 0.029
ArgGln: 2.181 ± 0.034
ArgArg: 2.852 ± 0.049
ArgSer: 3.331 ± 0.045
ArgThr: 2.82 ± 0.039
ArgVal: 3.195 ± 0.043
ArgTrp: 0.761 ± 0.024
ArgTyr: 2.07 ± 0.035
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.674 ± 0.053
SerCys: 0.441 ± 0.018
SerAsp: 3.133 ± 0.043
SerGlu: 3.951 ± 0.045
SerPhe: 2.917 ± 0.038
SerGly: 5.466 ± 0.063
SerHis: 1.322 ± 0.028
SerIle: 4.529 ± 0.053
SerLys: 3.037 ± 0.041
SerLeu: 6.553 ± 0.068
SerMet: 2.005 ± 0.033
SerAsn: 2.403 ± 0.043
SerPro: 2.54 ± 0.042
SerGln: 2.162 ± 0.039
SerArg: 3.272 ± 0.044
SerSer: 4.481 ± 0.063
SerThr: 3.402 ± 0.068
SerVal: 4.567 ± 0.049
SerTrp: 0.913 ± 0.026
SerTyr: 2.355 ± 0.038
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.519 ± 0.06
ThrCys: 0.359 ± 0.015
ThrAsp: 2.888 ± 0.043
ThrGlu: 3.412 ± 0.054
ThrPhe: 2.24 ± 0.036
ThrGly: 4.922 ± 0.287
ThrHis: 1.105 ± 0.027
ThrIle: 3.788 ± 0.046
ThrLys: 2.261 ± 0.041
ThrLeu: 5.5 ± 0.066
ThrMet: 1.512 ± 0.029
ThrAsn: 1.928 ± 0.036
ThrPro: 2.511 ± 0.043
ThrGln: 1.65 ± 0.029
ThrArg: 2.332 ± 0.033
ThrSer: 3.467 ± 0.05
ThrThr: 2.974 ± 0.044
ThrVal: 4.37 ± 0.053
ThrTrp: 0.707 ± 0.021
ThrTyr: 1.962 ± 0.039
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.015 ± 0.063
ValCys: 0.581 ± 0.019
ValAsp: 3.616 ± 0.048
ValGlu: 4.374 ± 0.059
ValPhe: 2.874 ± 0.051
ValGly: 4.478 ± 0.059
ValHis: 1.632 ± 0.028
ValIle: 4.99 ± 0.056
ValLys: 3.53 ± 0.053
ValLeu: 7.163 ± 0.068
ValMet: 2.156 ± 0.036
ValAsn: 2.859 ± 0.046
ValPro: 2.977 ± 0.043
ValGln: 2.758 ± 0.039
ValArg: 3.442 ± 0.047
ValSer: 4.777 ± 0.051
ValThr: 4.488 ± 0.085
ValVal: 5.033 ± 0.06
ValTrp: 0.873 ± 0.023
ValTyr: 2.403 ± 0.039
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.829 ± 0.022
TrpCys: 0.112 ± 0.008
TrpAsp: 0.72 ± 0.021
TrpGlu: 0.795 ± 0.023
TrpPhe: 0.628 ± 0.019
TrpGly: 0.87 ± 0.019
TrpHis: 0.281 ± 0.014
TrpIle: 1.006 ± 0.022
TrpLys: 0.731 ± 0.021
TrpLeu: 1.479 ± 0.029
TrpMet: 0.507 ± 0.017
TrpAsn: 0.77 ± 0.023
TrpPro: 0.397 ± 0.015
TrpGln: 0.503 ± 0.015
TrpArg: 0.683 ± 0.02
TrpSer: 0.927 ± 0.025
TrpThr: 0.691 ± 0.019
TrpVal: 0.831 ± 0.023
TrpTrp: 0.186 ± 0.01
TrpTyr: 0.432 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.663 ± 0.038
TyrCys: 0.297 ± 0.012
TyrAsp: 1.981 ± 0.035
TyrGlu: 2.361 ± 0.041
TyrPhe: 1.622 ± 0.035
TyrGly: 2.69 ± 0.039
TyrHis: 0.869 ± 0.025
TyrIle: 2.213 ± 0.037
TyrLys: 1.521 ± 0.03
TyrLeu: 3.354 ± 0.045
TyrMet: 0.975 ± 0.026
TyrAsn: 1.399 ± 0.034
TyrPro: 1.635 ± 0.031
TyrGln: 1.37 ± 0.031
TyrArg: 2.222 ± 0.032
TyrSer: 2.129 ± 0.033
TyrThr: 1.939 ± 0.033
TyrVal: 2.427 ± 0.038
TyrTrp: 0.487 ± 0.016
TyrTyr: 1.431 ± 0.031
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6080 proteins (1730134 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
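A protein-level bootstrap of this kind could be sketched as below. The page only states that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement and reporting the standard deviation of the resampled frequencies as the error are assumptions made here for illustration. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of the frequency over bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not data from this page.
toy = ["MALLK", "MLLA", "MKKLLV", "MAAGL"]
mean, err = bootstrap_error(toy, "LL")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the correlations between neighbouring residues within a protein are preserved in every resample.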

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski