Amino acid dipepetide frequency for Paenibacillus dendritiformis C454

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.84AlaAla: 10.84 ± 0.138
0.986AlaCys: 0.986 ± 0.026
4.968AlaAsp: 4.968 ± 0.058
6.589AlaGlu: 6.589 ± 0.091
3.442AlaPhe: 3.442 ± 0.053
7.625AlaGly: 7.625 ± 0.088
1.709AlaHis: 1.709 ± 0.034
5.518AlaIle: 5.518 ± 0.066
4.204AlaLys: 4.204 ± 0.056
9.07AlaLeu: 9.07 ± 0.089
2.638AlaMet: 2.638 ± 0.037
2.629AlaAsn: 2.629 ± 0.043
3.111AlaPro: 3.111 ± 0.055
3.008AlaGln: 3.008 ± 0.044
4.556AlaArg: 4.556 ± 0.062
5.33AlaSer: 5.33 ± 0.063
3.403AlaThr: 3.403 ± 0.043
7.016AlaVal: 7.016 ± 0.072
1.212AlaTrp: 1.212 ± 0.026
2.959AlaTyr: 2.959 ± 0.04
0.0AlaXaa: 0.0 ± 0.0
Cys
0.687CysAla: 0.687 ± 0.021
0.124CysCys: 0.124 ± 0.009
0.441CysAsp: 0.441 ± 0.019
0.478CysGlu: 0.478 ± 0.016
0.342CysPhe: 0.342 ± 0.014
0.899CysGly: 0.899 ± 0.026
0.213CysHis: 0.213 ± 0.012
0.55CysIle: 0.55 ± 0.017
0.337CysLys: 0.337 ± 0.015
0.819CysLeu: 0.819 ± 0.023
0.248CysMet: 0.248 ± 0.011
0.273CysAsn: 0.273 ± 0.015
0.43CysPro: 0.43 ± 0.02
0.255CysGln: 0.255 ± 0.015
0.594CysArg: 0.594 ± 0.018
0.636CysSer: 0.636 ± 0.017
0.441CysThr: 0.441 ± 0.017
0.522CysVal: 0.522 ± 0.017
0.113CysTrp: 0.113 ± 0.008
0.314CysTyr: 0.314 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
4.402AspAla: 4.402 ± 0.055
0.417AspCys: 0.417 ± 0.016
2.498AspAsp: 2.498 ± 0.048
3.936AspGlu: 3.936 ± 0.06
1.911AspPhe: 1.911 ± 0.036
4.194AspGly: 4.194 ± 0.058
1.065AspHis: 1.065 ± 0.027
3.714AspIle: 3.714 ± 0.048
2.696AspLys: 2.696 ± 0.043
4.403AspLeu: 4.403 ± 0.05
1.647AspMet: 1.647 ± 0.033
1.71AspAsn: 1.71 ± 0.035
2.456AspPro: 2.456 ± 0.042
1.722AspGln: 1.722 ± 0.032
3.117AspArg: 3.117 ± 0.045
2.715AspSer: 2.715 ± 0.04
2.538AspThr: 2.538 ± 0.037
3.647AspVal: 3.647 ± 0.055
0.805AspTrp: 0.805 ± 0.022
2.088AspTyr: 2.088 ± 0.033
0.0AspXaa: 0.0 ± 0.0
Glu
7.437GluAla: 7.437 ± 0.096
0.466GluCys: 0.466 ± 0.017
3.189GluAsp: 3.189 ± 0.051
5.791GluGlu: 5.791 ± 0.076
2.163GluPhe: 2.163 ± 0.033
4.638GluGly: 4.638 ± 0.05
1.668GluHis: 1.668 ± 0.032
3.716GluIle: 3.716 ± 0.054
3.647GluLys: 3.647 ± 0.058
7.055GluLeu: 7.055 ± 0.081
2.169GluMet: 2.169 ± 0.041
2.068GluAsn: 2.068 ± 0.037
2.771GluPro: 2.771 ± 0.046
4.002GluGln: 4.002 ± 0.058
5.327GluArg: 5.327 ± 0.076
3.134GluSer: 3.134 ± 0.042
3.075GluThr: 3.075 ± 0.046
4.286GluVal: 4.286 ± 0.056
1.09GluTrp: 1.09 ± 0.028
1.996GluTyr: 1.996 ± 0.04
0.0GluXaa: 0.0 ± 0.0
Phe
3.489PheAla: 3.489 ± 0.053
0.391PheCys: 0.391 ± 0.017
2.24PheAsp: 2.24 ± 0.042
2.347PheGlu: 2.347 ± 0.042
1.812PhePhe: 1.812 ± 0.042
3.201PheGly: 3.201 ± 0.042
1.01PheHis: 1.01 ± 0.023
2.774PheIle: 2.774 ± 0.043
1.611PheLys: 1.611 ± 0.034
3.711PheLeu: 3.711 ± 0.061
1.148PheMet: 1.148 ± 0.029
1.369PheAsn: 1.369 ± 0.032
1.576PhePro: 1.576 ± 0.027
1.395PheGln: 1.395 ± 0.028
2.248PheArg: 2.248 ± 0.041
2.518PheSer: 2.518 ± 0.044
2.321PheThr: 2.321 ± 0.042
2.92PheVal: 2.92 ± 0.045
0.496PheTrp: 0.496 ± 0.017
1.442PheTyr: 1.442 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
6.143GlyAla: 6.143 ± 0.075
0.803GlyCys: 0.803 ± 0.024
3.46GlyAsp: 3.46 ± 0.051
4.862GlyGlu: 4.862 ± 0.063
3.114GlyPhe: 3.114 ± 0.046
5.602GlyGly: 5.602 ± 0.069
1.602GlyHis: 1.602 ± 0.035
5.435GlyIle: 5.435 ± 0.06
4.167GlyLys: 4.167 ± 0.055
7.288GlyLeu: 7.288 ± 0.071
2.656GlyMet: 2.656 ± 0.04
2.353GlyAsn: 2.353 ± 0.051
2.175GlyPro: 2.175 ± 0.039
2.762GlyGln: 2.762 ± 0.035
4.141GlyArg: 4.141 ± 0.06
4.609GlySer: 4.609 ± 0.052
4.525GlyThr: 4.525 ± 0.058
5.077GlyVal: 5.077 ± 0.06
1.174GlyTrp: 1.174 ± 0.028
2.897GlyTyr: 2.897 ± 0.038
0.0GlyXaa: 0.0 ± 0.0
His
1.935HisAla: 1.935 ± 0.038
0.234HisCys: 0.234 ± 0.013
1.155HisAsp: 1.155 ± 0.026
1.361HisGlu: 1.361 ± 0.032
0.954HisPhe: 0.954 ± 0.026
1.734HisGly: 1.734 ± 0.034
0.683HisHis: 0.683 ± 0.021
1.612HisIle: 1.612 ± 0.056
0.79HisLys: 0.79 ± 0.022
2.174HisLeu: 2.174 ± 0.042
0.628HisMet: 0.628 ± 0.018
0.638HisAsn: 0.638 ± 0.02
1.423HisPro: 1.423 ± 0.029
0.879HisGln: 0.879 ± 0.027
1.381HisArg: 1.381 ± 0.03
1.28HisSer: 1.28 ± 0.029
1.104HisThr: 1.104 ± 0.024
1.561HisVal: 1.561 ± 0.032
0.314HisTrp: 0.314 ± 0.014
0.897HisTyr: 0.897 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
6.345IleAla: 6.345 ± 0.066
0.62IleCys: 0.62 ± 0.019
3.567IleAsp: 3.567 ± 0.043
4.295IleGlu: 4.295 ± 0.046
2.249IlePhe: 2.249 ± 0.041
5.515IleGly: 5.515 ± 0.06
1.554IleHis: 1.554 ± 0.031
3.975IleIle: 3.975 ± 0.056
2.378IleLys: 2.378 ± 0.044
5.3IleLeu: 5.3 ± 0.068
1.582IleMet: 1.582 ± 0.034
1.969IleAsn: 1.969 ± 0.035
3.072IlePro: 3.072 ± 0.039
2.438IleGln: 2.438 ± 0.037
4.122IleArg: 4.122 ± 0.05
3.737IleSer: 3.737 ± 0.051
3.344IleThr: 3.344 ± 0.048
5.206IleVal: 5.206 ± 0.058
0.684IleTrp: 0.684 ± 0.019
2.023IleTyr: 2.023 ± 0.039
0.0IleXaa: 0.0 ± 0.0
Lys
3.922LysAla: 3.922 ± 0.053
0.254LysCys: 0.254 ± 0.012
2.532LysAsp: 2.532 ± 0.044
4.103LysGlu: 4.103 ± 0.054
1.303LysPhe: 1.303 ± 0.029
3.096LysGly: 3.096 ± 0.044
1.099LysHis: 1.099 ± 0.025
2.596LysIle: 2.596 ± 0.041
3.003LysLys: 3.003 ± 0.056
5.004LysLeu: 5.004 ± 0.065
1.506LysMet: 1.506 ± 0.032
1.682LysAsn: 1.682 ± 0.036
2.285LysPro: 2.285 ± 0.04
2.504LysGln: 2.504 ± 0.04
3.055LysArg: 3.055 ± 0.044
2.396LysSer: 2.396 ± 0.042
2.522LysThr: 2.522 ± 0.042
3.008LysVal: 3.008 ± 0.058
0.645LysTrp: 0.645 ± 0.019
1.649LysTyr: 1.649 ± 0.035
0.0LysXaa: 0.0 ± 0.0
Leu
9.187LeuAla: 9.187 ± 0.097
0.925LeuCys: 0.925 ± 0.024
5.196LeuAsp: 5.196 ± 0.056
6.257LeuGlu: 6.257 ± 0.074
4.55LeuPhe: 4.55 ± 0.066
6.667LeuGly: 6.667 ± 0.063
2.429LeuHis: 2.429 ± 0.045
6.062LeuIle: 6.062 ± 0.068
4.604LeuLys: 4.604 ± 0.052
11.234LeuLeu: 11.234 ± 0.121
2.653LeuMet: 2.653 ± 0.045
3.453LeuAsn: 3.453 ± 0.043
4.729LeuPro: 4.729 ± 0.064
4.116LeuGln: 4.116 ± 0.056
5.673LeuArg: 5.673 ± 0.059
6.429LeuSer: 6.429 ± 0.058
5.338LeuThr: 5.338 ± 0.06
6.208LeuVal: 6.208 ± 0.072
1.065LeuTrp: 1.065 ± 0.027
3.17LeuTyr: 3.17 ± 0.041
0.0LeuXaa: 0.0 ± 0.0
Met
2.524MetAla: 2.524 ± 0.039
0.194MetCys: 0.194 ± 0.01
1.493MetAsp: 1.493 ± 0.03
2.142MetGlu: 2.142 ± 0.038
1.038MetPhe: 1.038 ± 0.025
1.752MetGly: 1.752 ± 0.034
0.534MetHis: 0.534 ± 0.016
1.901MetIle: 1.901 ± 0.034
2.108MetLys: 2.108 ± 0.041
3.215MetLeu: 3.215 ± 0.045
1.042MetMet: 1.042 ± 0.027
1.498MetAsn: 1.498 ± 0.031
1.242MetPro: 1.242 ± 0.028
1.12MetGln: 1.12 ± 0.024
1.577MetArg: 1.577 ± 0.031
1.845MetSer: 1.845 ± 0.034
1.864MetThr: 1.864 ± 0.033
1.712MetVal: 1.712 ± 0.037
0.253MetTrp: 0.253 ± 0.011
0.821MetTyr: 0.821 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
2.641AsnAla: 2.641 ± 0.043
0.251AsnCys: 0.251 ± 0.013
1.659AsnAsp: 1.659 ± 0.032
2.365AsnGlu: 2.365 ± 0.041
1.076AsnPhe: 1.076 ± 0.03
2.993AsnGly: 2.993 ± 0.059
0.759AsnHis: 0.759 ± 0.021
2.153AsnIle: 2.153 ± 0.035
1.719AsnLys: 1.719 ± 0.037
2.864AsnLeu: 2.864 ± 0.033
0.987AsnMet: 0.987 ± 0.02
1.242AsnAsn: 1.242 ± 0.03
1.893AsnPro: 1.893 ± 0.031
1.32AsnGln: 1.32 ± 0.03
2.189AsnArg: 2.189 ± 0.031
1.696AsnSer: 1.696 ± 0.035
1.622AsnThr: 1.622 ± 0.032
2.383AsnVal: 2.383 ± 0.04
0.469AsnTrp: 0.469 ± 0.018
1.171AsnTyr: 1.171 ± 0.024
0.0AsnXaa: 0.0 ± 0.0
Pro
4.101ProAla: 4.101 ± 0.063
0.288ProCys: 0.288 ± 0.014
2.996ProAsp: 2.996 ± 0.048
3.713ProGlu: 3.713 ± 0.05
1.928ProPhe: 1.928 ± 0.031
3.242ProGly: 3.242 ± 0.052
1.057ProHis: 1.057 ± 0.025
2.392ProIle: 2.392 ± 0.037
1.669ProLys: 1.669 ± 0.033
4.314ProLeu: 4.314 ± 0.046
1.058ProMet: 1.058 ± 0.025
1.517ProAsn: 1.517 ± 0.033
1.562ProPro: 1.562 ± 0.035
1.533ProGln: 1.533 ± 0.031
1.679ProArg: 1.679 ± 0.03
2.526ProSer: 2.526 ± 0.042
1.756ProThr: 1.756 ± 0.037
3.377ProVal: 3.377 ± 0.048
0.55ProTrp: 0.55 ± 0.02
1.628ProTyr: 1.628 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
4.207GlnAla: 4.207 ± 0.057
0.279GlnCys: 0.279 ± 0.012
1.852GlnAsp: 1.852 ± 0.034
2.919GlnGlu: 2.919 ± 0.042
1.592GlnPhe: 1.592 ± 0.032
2.673GlnGly: 2.673 ± 0.039
0.907GlnHis: 0.907 ± 0.023
2.186GlnIle: 2.186 ± 0.031
1.665GlnLys: 1.665 ± 0.032
4.246GlnLeu: 4.246 ± 0.061
1.218GlnMet: 1.218 ± 0.029
1.04GlnAsn: 1.04 ± 0.026
1.719GlnPro: 1.719 ± 0.035
1.876GlnGln: 1.876 ± 0.038
2.188GlnArg: 2.188 ± 0.041
2.102GlnSer: 2.102 ± 0.033
1.745GlnThr: 1.745 ± 0.034
2.597GlnVal: 2.597 ± 0.039
0.626GlnTrp: 0.626 ± 0.021
1.406GlnTyr: 1.406 ± 0.03
0.0GlnXaa: 0.0 ± 0.0
Arg
4.136ArgAla: 4.136 ± 0.053
0.503ArgCys: 0.503 ± 0.021
2.862ArgAsp: 2.862 ± 0.044
4.318ArgGlu: 4.318 ± 0.061
2.5ArgPhe: 2.5 ± 0.041
3.433ArgGly: 3.433 ± 0.053
1.57ArgHis: 1.57 ± 0.038
4.218ArgIle: 4.218 ± 0.053
2.965ArgLys: 2.965 ± 0.045
6.357ArgLeu: 6.357 ± 0.072
1.976ArgMet: 1.976 ± 0.033
2.011ArgAsn: 2.011 ± 0.032
2.182ArgPro: 2.182 ± 0.037
2.614ArgGln: 2.614 ± 0.046
3.632ArgArg: 3.632 ± 0.058
3.343ArgSer: 3.343 ± 0.044
3.053ArgThr: 3.053 ± 0.044
3.482ArgVal: 3.482 ± 0.048
0.866ArgTrp: 0.866 ± 0.021
2.14ArgTyr: 2.14 ± 0.04
0.0ArgXaa: 0.0 ± 0.0
Ser
4.878SerAla: 4.878 ± 0.053
0.495SerCys: 0.495 ± 0.017
2.855SerAsp: 2.855 ± 0.04
3.393SerGlu: 3.393 ± 0.047
2.848SerPhe: 2.848 ± 0.042
5.337SerGly: 5.337 ± 0.056
1.178SerHis: 1.178 ± 0.028
3.971SerIle: 3.971 ± 0.049
2.607SerLys: 2.607 ± 0.042
5.886SerLeu: 5.886 ± 0.06
1.727SerMet: 1.727 ± 0.028
1.841SerAsn: 1.841 ± 0.037
2.488SerPro: 2.488 ± 0.045
1.767SerGln: 1.767 ± 0.033
3.291SerArg: 3.291 ± 0.047
3.688SerSer: 3.688 ± 0.058
2.711SerThr: 2.711 ± 0.041
4.128SerVal: 4.128 ± 0.049
0.814SerTrp: 0.814 ± 0.023
2.099SerTyr: 2.099 ± 0.036
0.0SerXaa: 0.0 ± 0.0
Thr
4.75ThrAla: 4.75 ± 0.054
0.382ThrCys: 0.382 ± 0.014
2.75ThrAsp: 2.75 ± 0.047
3.257ThrGlu: 3.257 ± 0.047
2.206ThrPhe: 2.206 ± 0.035
4.389ThrGly: 4.389 ± 0.053
0.971ThrHis: 0.971 ± 0.025
3.38ThrIle: 3.38 ± 0.05
2.14ThrLys: 2.14 ± 0.038
5.136ThrLeu: 5.136 ± 0.054
1.457ThrMet: 1.457 ± 0.027
1.67ThrAsn: 1.67 ± 0.038
2.405ThrPro: 2.405 ± 0.038
1.356ThrGln: 1.356 ± 0.026
2.33ThrArg: 2.33 ± 0.037
2.871ThrSer: 2.871 ± 0.046
2.442ThrThr: 2.442 ± 0.045
4.283ThrVal: 4.283 ± 0.058
0.662ThrTrp: 0.662 ± 0.022
1.876ThrTyr: 1.876 ± 0.037
0.0ThrXaa: 0.0 ± 0.0
Val
5.4ValAla: 5.4 ± 0.068
0.68ValCys: 0.68 ± 0.02
3.406ValAsp: 3.406 ± 0.043
4.341ValGlu: 4.341 ± 0.051
2.869ValPhe: 2.869 ± 0.05
4.244ValGly: 4.244 ± 0.055
1.634ValHis: 1.634 ± 0.032
4.778ValIle: 4.778 ± 0.061
3.623ValLys: 3.623 ± 0.053
7.213ValLeu: 7.213 ± 0.063
2.05ValMet: 2.05 ± 0.034
2.616ValAsn: 2.616 ± 0.04
3.322ValPro: 3.322 ± 0.046
2.673ValGln: 2.673 ± 0.039
4.023ValArg: 4.023 ± 0.05
4.335ValSer: 4.335 ± 0.051
4.325ValThr: 4.325 ± 0.051
4.646ValVal: 4.646 ± 0.053
0.848ValTrp: 0.848 ± 0.023
2.293ValTyr: 2.293 ± 0.038
0.0ValXaa: 0.0 ± 0.0
Trp
0.911TrpAla: 0.911 ± 0.023
0.114TrpCys: 0.114 ± 0.007
0.701TrpAsp: 0.701 ± 0.02
0.843TrpGlu: 0.843 ± 0.024
0.587TrpPhe: 0.587 ± 0.019
0.864TrpGly: 0.864 ± 0.022
0.306TrpHis: 0.306 ± 0.012
0.908TrpIle: 0.908 ± 0.025
0.75TrpLys: 0.75 ± 0.021
1.549TrpLeu: 1.549 ± 0.034
0.489TrpMet: 0.489 ± 0.017
0.657TrpAsn: 0.657 ± 0.021
0.419TrpPro: 0.419 ± 0.018
0.503TrpGln: 0.503 ± 0.017
0.698TrpArg: 0.698 ± 0.018
0.859TrpSer: 0.859 ± 0.028
0.773TrpThr: 0.773 ± 0.02
0.799TrpVal: 0.799 ± 0.022
0.189TrpTrp: 0.189 ± 0.009
0.418TrpTyr: 0.418 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.875TyrAla: 2.875 ± 0.042
0.309TyrCys: 0.309 ± 0.015
1.862TyrAsp: 1.862 ± 0.044
2.378TyrGlu: 2.378 ± 0.041
1.531TyrPhe: 1.531 ± 0.03
2.749TyrGly: 2.749 ± 0.047
0.789TyrHis: 0.789 ± 0.022
2.136TyrIle: 2.136 ± 0.039
1.479TyrLys: 1.479 ± 0.03
3.11TyrLeu: 3.11 ± 0.042
0.997TyrMet: 0.997 ± 0.023
1.217TyrAsn: 1.217 ± 0.028
1.63TyrPro: 1.63 ± 0.033
1.197TyrGln: 1.197 ± 0.028
2.34TyrArg: 2.34 ± 0.038
1.953TyrSer: 1.953 ± 0.039
1.808TyrThr: 1.808 ± 0.035
2.462TyrVal: 2.462 ± 0.036
0.464TyrTrp: 0.464 ± 0.017
1.351TyrTyr: 1.351 ± 0.029
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5660 proteins (1785742 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski