Amino acid dipepetide frequency for Effusibacillus lacus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.37AlaAla: 8.37 ± 0.114
0.769AlaCys: 0.769 ± 0.026
4.208AlaAsp: 4.208 ± 0.066
5.632AlaGlu: 5.632 ± 0.087
3.216AlaPhe: 3.216 ± 0.064
7.237AlaGly: 7.237 ± 0.094
1.479AlaHis: 1.479 ± 0.039
5.673AlaIle: 5.673 ± 0.087
4.86AlaLys: 4.86 ± 0.081
8.238AlaLeu: 8.238 ± 0.087
2.444AlaMet: 2.444 ± 0.049
2.839AlaAsn: 2.839 ± 0.051
2.602AlaPro: 2.602 ± 0.049
2.694AlaGln: 2.694 ± 0.051
4.191AlaArg: 4.191 ± 0.069
4.273AlaSer: 4.273 ± 0.068
3.803AlaThr: 3.803 ± 0.064
6.963AlaVal: 6.963 ± 0.085
0.811AlaTrp: 0.811 ± 0.03
2.373AlaTyr: 2.373 ± 0.046
0.0AlaXaa: 0.0 ± 0.0
Cys
0.547CysAla: 0.547 ± 0.023
0.108CysCys: 0.108 ± 0.01
0.429CysAsp: 0.429 ± 0.02
0.476CysGlu: 0.476 ± 0.02
0.298CysPhe: 0.298 ± 0.015
0.845CysGly: 0.845 ± 0.032
0.237CysHis: 0.237 ± 0.02
0.483CysIle: 0.483 ± 0.022
0.408CysLys: 0.408 ± 0.019
0.728CysLeu: 0.728 ± 0.027
0.214CysMet: 0.214 ± 0.013
0.297CysAsn: 0.297 ± 0.017
0.475CysPro: 0.475 ± 0.024
0.25CysGln: 0.25 ± 0.016
0.53CysArg: 0.53 ± 0.023
0.524CysSer: 0.524 ± 0.022
0.43CysThr: 0.43 ± 0.023
0.512CysVal: 0.512 ± 0.025
0.082CysTrp: 0.082 ± 0.01
0.248CysTyr: 0.248 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
3.577AspAla: 3.577 ± 0.054
0.44AspCys: 0.44 ± 0.022
2.091AspAsp: 2.091 ± 0.054
3.442AspGlu: 3.442 ± 0.061
2.042AspPhe: 2.042 ± 0.041
3.489AspGly: 3.489 ± 0.067
1.043AspHis: 1.043 ± 0.032
3.451AspIle: 3.451 ± 0.063
2.847AspLys: 2.847 ± 0.063
5.483AspLeu: 5.483 ± 0.075
1.356AspMet: 1.356 ± 0.04
1.464AspAsn: 1.464 ± 0.04
2.663AspPro: 2.663 ± 0.052
1.745AspGln: 1.745 ± 0.042
3.173AspArg: 3.173 ± 0.058
2.573AspSer: 2.573 ± 0.049
2.356AspThr: 2.356 ± 0.05
3.75AspVal: 3.75 ± 0.063
0.784AspTrp: 0.784 ± 0.029
1.638AspTyr: 1.638 ± 0.041
0.0AspXaa: 0.0 ± 0.0
Glu
5.82GluAla: 5.82 ± 0.085
0.417GluCys: 0.417 ± 0.02
3.255GluAsp: 3.255 ± 0.061
5.732GluGlu: 5.732 ± 0.089
2.46GluPhe: 2.46 ± 0.047
4.499GluGly: 4.499 ± 0.069
1.461GluHis: 1.461 ± 0.031
5.012GluIle: 5.012 ± 0.074
4.715GluLys: 4.715 ± 0.086
6.986GluLeu: 6.986 ± 0.092
2.127GluMet: 2.127 ± 0.047
2.466GluAsn: 2.466 ± 0.051
2.341GluPro: 2.341 ± 0.048
3.478GluGln: 3.478 ± 0.065
4.272GluArg: 4.272 ± 0.075
3.204GluSer: 3.204 ± 0.056
3.719GluThr: 3.719 ± 0.061
5.13GluVal: 5.13 ± 0.077
0.917GluTrp: 0.917 ± 0.033
2.029GluTyr: 2.029 ± 0.045
0.0GluXaa: 0.0 ± 0.0
Phe
3.453PheAla: 3.453 ± 0.07
0.375PheCys: 0.375 ± 0.019
2.145PheAsp: 2.145 ± 0.047
2.489PheGlu: 2.489 ± 0.052
1.742PhePhe: 1.742 ± 0.047
3.352PheGly: 3.352 ± 0.07
0.942PheHis: 0.942 ± 0.029
2.253PheIle: 2.253 ± 0.054
1.689PheLys: 1.689 ± 0.043
4.318PheLeu: 4.318 ± 0.086
0.938PheMet: 0.938 ± 0.034
1.298PheAsn: 1.298 ± 0.035
1.792PhePro: 1.792 ± 0.043
1.423PheGln: 1.423 ± 0.035
2.063PheArg: 2.063 ± 0.042
2.453PheSer: 2.453 ± 0.053
2.11PheThr: 2.11 ± 0.044
3.132PheVal: 3.132 ± 0.069
0.464PheTrp: 0.464 ± 0.021
1.357PheTyr: 1.357 ± 0.039
0.0PheXaa: 0.0 ± 0.0
Gly
5.792GlyAla: 5.792 ± 0.081
0.792GlyCys: 0.792 ± 0.032
3.578GlyAsp: 3.578 ± 0.066
4.757GlyGlu: 4.757 ± 0.065
3.337GlyPhe: 3.337 ± 0.057
5.726GlyGly: 5.726 ± 0.081
1.649GlyHis: 1.649 ± 0.043
6.394GlyIle: 6.394 ± 0.081
5.028GlyLys: 5.028 ± 0.085
7.5GlyLeu: 7.5 ± 0.1
2.43GlyMet: 2.43 ± 0.048
2.697GlyAsn: 2.697 ± 0.058
2.262GlyPro: 2.262 ± 0.046
2.816GlyGln: 2.816 ± 0.054
3.934GlyArg: 3.934 ± 0.064
4.427GlySer: 4.427 ± 0.066
4.65GlyThr: 4.65 ± 0.074
5.97GlyVal: 5.97 ± 0.078
1.008GlyTrp: 1.008 ± 0.034
2.778GlyTyr: 2.778 ± 0.055
0.0GlyXaa: 0.0 ± 0.0
His
1.542HisAla: 1.542 ± 0.039
0.203HisCys: 0.203 ± 0.014
0.918HisAsp: 0.918 ± 0.029
1.277HisGlu: 1.277 ± 0.034
0.977HisPhe: 0.977 ± 0.033
1.648HisGly: 1.648 ± 0.04
0.578HisHis: 0.578 ± 0.032
1.253HisIle: 1.253 ± 0.031
0.993HisLys: 0.993 ± 0.032
2.156HisLeu: 2.156 ± 0.047
0.533HisMet: 0.533 ± 0.021
0.674HisAsn: 0.674 ± 0.029
1.41HisPro: 1.41 ± 0.034
0.715HisGln: 0.715 ± 0.026
1.133HisArg: 1.133 ± 0.029
1.192HisSer: 1.192 ± 0.032
1.082HisThr: 1.082 ± 0.031
1.538HisVal: 1.538 ± 0.038
0.271HisTrp: 0.271 ± 0.018
0.749HisTyr: 0.749 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
5.907IleAla: 5.907 ± 0.083
0.59IleCys: 0.59 ± 0.024
3.543IleAsp: 3.543 ± 0.062
4.535IleGlu: 4.535 ± 0.062
2.336IlePhe: 2.336 ± 0.051
5.753IleGly: 5.753 ± 0.084
1.568IleHis: 1.568 ± 0.042
3.512IleIle: 3.512 ± 0.073
3.199IleLys: 3.199 ± 0.058
6.682IleLeu: 6.682 ± 0.091
1.454IleMet: 1.454 ± 0.043
2.127IleAsn: 2.127 ± 0.053
3.533IlePro: 3.533 ± 0.065
2.585IleGln: 2.585 ± 0.051
4.361IleArg: 4.361 ± 0.071
3.864IleSer: 3.864 ± 0.065
3.585IleThr: 3.585 ± 0.058
5.304IleVal: 5.304 ± 0.069
0.682IleTrp: 0.682 ± 0.026
1.81IleTyr: 1.81 ± 0.04
0.0IleXaa: 0.0 ± 0.0
Lys
4.483LysAla: 4.483 ± 0.066
0.32LysCys: 0.32 ± 0.019
2.884LysAsp: 2.884 ± 0.056
4.873LysGlu: 4.873 ± 0.071
1.697LysPhe: 1.697 ± 0.036
4.084LysGly: 4.084 ± 0.057
1.056LysHis: 1.056 ± 0.033
3.642LysIle: 3.642 ± 0.068
3.742LysLys: 3.742 ± 0.075
5.068LysLeu: 5.068 ± 0.073
1.761LysMet: 1.761 ± 0.038
2.192LysAsn: 2.192 ± 0.044
2.425LysPro: 2.425 ± 0.048
2.599LysGln: 2.599 ± 0.049
3.122LysArg: 3.122 ± 0.054
2.605LysSer: 2.605 ± 0.058
2.873LysThr: 2.873 ± 0.055
4.207LysVal: 4.207 ± 0.065
0.713LysTrp: 0.713 ± 0.025
1.715LysTyr: 1.715 ± 0.037
0.0LysXaa: 0.0 ± 0.0
Leu
9.529LeuAla: 9.529 ± 0.099
0.727LeuCys: 0.727 ± 0.028
4.91LeuAsp: 4.91 ± 0.069
6.853LeuGlu: 6.853 ± 0.097
4.209LeuPhe: 4.209 ± 0.084
7.531LeuGly: 7.531 ± 0.101
2.063LeuHis: 2.063 ± 0.041
6.278LeuIle: 6.278 ± 0.082
5.307LeuLys: 5.307 ± 0.069
10.718LeuLeu: 10.718 ± 0.142
2.523LeuMet: 2.523 ± 0.044
3.443LeuAsn: 3.443 ± 0.053
4.54LeuPro: 4.54 ± 0.071
4.209LeuGln: 4.209 ± 0.074
5.249LeuArg: 5.249 ± 0.079
6.175LeuSer: 6.175 ± 0.073
5.557LeuThr: 5.557 ± 0.071
7.431LeuVal: 7.431 ± 0.084
0.979LeuTrp: 0.979 ± 0.032
2.858LeuTyr: 2.858 ± 0.049
0.0LeuXaa: 0.0 ± 0.0
Met
2.531MetAla: 2.531 ± 0.045
0.165MetCys: 0.165 ± 0.012
1.357MetAsp: 1.357 ± 0.037
2.094MetGlu: 2.094 ± 0.049
0.929MetPhe: 0.929 ± 0.033
2.132MetGly: 2.132 ± 0.05
0.488MetHis: 0.488 ± 0.023
1.842MetIle: 1.842 ± 0.042
1.819MetLys: 1.819 ± 0.041
2.529MetLeu: 2.529 ± 0.056
0.804MetMet: 0.804 ± 0.026
1.26MetAsn: 1.26 ± 0.038
1.092MetPro: 1.092 ± 0.035
1.072MetGln: 1.072 ± 0.03
1.331MetArg: 1.331 ± 0.035
1.595MetSer: 1.595 ± 0.036
1.612MetThr: 1.612 ± 0.038
1.932MetVal: 1.932 ± 0.044
0.255MetTrp: 0.255 ± 0.017
0.695MetTyr: 0.695 ± 0.027
0.0MetXaa: 0.0 ± 0.0
Asn
2.578AsnAla: 2.578 ± 0.058
0.348AsnCys: 0.348 ± 0.019
1.579AsnAsp: 1.579 ± 0.041
2.135AsnGlu: 2.135 ± 0.047
1.165AsnPhe: 1.165 ± 0.034
3.084AsnGly: 3.084 ± 0.056
0.738AsnHis: 0.738 ± 0.025
2.095AsnIle: 2.095 ± 0.044
1.823AsnLys: 1.823 ± 0.043
3.728AsnLeu: 3.728 ± 0.062
0.926AsnMet: 0.926 ± 0.027
1.184AsnAsn: 1.184 ± 0.035
2.326AsnPro: 2.326 ± 0.055
1.542AsnGln: 1.542 ± 0.042
2.394AsnArg: 2.394 ± 0.048
1.882AsnSer: 1.882 ± 0.045
1.686AsnThr: 1.686 ± 0.043
2.575AsnVal: 2.575 ± 0.051
0.529AsnTrp: 0.529 ± 0.023
1.055AsnTyr: 1.055 ± 0.034
0.0AsnXaa: 0.0 ± 0.0
Pro
3.298ProAla: 3.298 ± 0.064
0.312ProCys: 0.312 ± 0.017
2.785ProAsp: 2.785 ± 0.053
3.678ProGlu: 3.678 ± 0.054
1.902ProPhe: 1.902 ± 0.047
3.578ProGly: 3.578 ± 0.058
0.965ProHis: 0.965 ± 0.028
2.638ProIle: 2.638 ± 0.049
2.178ProLys: 2.178 ± 0.047
3.923ProLeu: 3.923 ± 0.057
1.003ProMet: 1.003 ± 0.031
1.608ProAsn: 1.608 ± 0.04
1.483ProPro: 1.483 ± 0.036
1.439ProGln: 1.439 ± 0.038
1.764ProArg: 1.764 ± 0.046
2.243ProSer: 2.243 ± 0.042
2.093ProThr: 2.093 ± 0.047
4.079ProVal: 4.079 ± 0.07
0.501ProTrp: 0.501 ± 0.022
1.498ProTyr: 1.498 ± 0.038
0.0ProXaa: 0.0 ± 0.0
Gln
3.48GlnAla: 3.48 ± 0.063
0.23GlnCys: 0.23 ± 0.015
1.686GlnAsp: 1.686 ± 0.043
2.911GlnGlu: 2.911 ± 0.062
1.445GlnPhe: 1.445 ± 0.04
2.594GlnGly: 2.594 ± 0.046
0.721GlnHis: 0.721 ± 0.025
2.806GlnIle: 2.806 ± 0.058
2.193GlnLys: 2.193 ± 0.053
3.851GlnLeu: 3.851 ± 0.066
1.225GlnMet: 1.225 ± 0.037
1.423GlnAsn: 1.423 ± 0.03
1.559GlnPro: 1.559 ± 0.043
1.872GlnGln: 1.872 ± 0.048
1.912GlnArg: 1.912 ± 0.044
2.103GlnSer: 2.103 ± 0.054
2.236GlnThr: 2.236 ± 0.045
3.035GlnVal: 3.035 ± 0.06
0.414GlnTrp: 0.414 ± 0.018
1.215GlnTyr: 1.215 ± 0.032
0.0GlnXaa: 0.0 ± 0.0
Arg
3.457ArgAla: 3.457 ± 0.061
0.395ArgCys: 0.395 ± 0.024
2.665ArgAsp: 2.665 ± 0.042
4.491ArgGlu: 4.491 ± 0.06
2.433ArgPhe: 2.433 ± 0.052
3.29ArgGly: 3.29 ± 0.06
1.123ArgHis: 1.123 ± 0.03
4.388ArgIle: 4.388 ± 0.074
3.556ArgLys: 3.556 ± 0.063
5.546ArgLeu: 5.546 ± 0.078
1.734ArgMet: 1.734 ± 0.04
2.242ArgAsn: 2.242 ± 0.046
2.075ArgPro: 2.075 ± 0.046
2.35ArgGln: 2.35 ± 0.05
2.961ArgArg: 2.961 ± 0.049
2.89ArgSer: 2.89 ± 0.055
2.744ArgThr: 2.744 ± 0.052
3.92ArgVal: 3.92 ± 0.056
0.677ArgTrp: 0.677 ± 0.024
1.868ArgTyr: 1.868 ± 0.045
0.0ArgXaa: 0.0 ± 0.0
Ser
4.101SerAla: 4.101 ± 0.072
0.448SerCys: 0.448 ± 0.024
2.651SerAsp: 2.651 ± 0.053
3.345SerGlu: 3.345 ± 0.061
2.637SerPhe: 2.637 ± 0.05
4.864SerGly: 4.864 ± 0.077
1.205SerHis: 1.205 ± 0.033
3.753SerIle: 3.753 ± 0.069
2.757SerLys: 2.757 ± 0.056
5.956SerLeu: 5.956 ± 0.088
1.534SerMet: 1.534 ± 0.041
1.862SerAsn: 1.862 ± 0.041
2.341SerPro: 2.341 ± 0.049
1.949SerGln: 1.949 ± 0.046
3.046SerArg: 3.046 ± 0.058
3.112SerSer: 3.112 ± 0.059
2.763SerThr: 2.763 ± 0.056
4.332SerVal: 4.332 ± 0.07
0.626SerTrp: 0.626 ± 0.032
1.69SerTyr: 1.69 ± 0.039
0.0SerXaa: 0.0 ± 0.0
Thr
4.584ThrAla: 4.584 ± 0.078
0.4ThrCys: 0.4 ± 0.022
2.616ThrAsp: 2.616 ± 0.05
3.25ThrGlu: 3.25 ± 0.058
2.152ThrPhe: 2.152 ± 0.044
5.147ThrGly: 5.147 ± 0.061
1.048ThrHis: 1.048 ± 0.031
3.62ThrIle: 3.62 ± 0.063
2.42ThrLys: 2.42 ± 0.051
5.125ThrLeu: 5.125 ± 0.074
1.311ThrMet: 1.311 ± 0.037
1.787ThrAsn: 1.787 ± 0.044
2.518ThrPro: 2.518 ± 0.051
1.518ThrGln: 1.518 ± 0.042
2.56ThrArg: 2.56 ± 0.049
2.865ThrSer: 2.865 ± 0.058
2.687ThrThr: 2.687 ± 0.051
4.81ThrVal: 4.81 ± 0.067
0.568ThrTrp: 0.568 ± 0.027
1.599ThrTyr: 1.599 ± 0.038
0.0ThrXaa: 0.0 ± 0.0
Val
6.586ValAla: 6.586 ± 0.092
0.703ValCys: 0.703 ± 0.029
3.913ValAsp: 3.913 ± 0.061
5.246ValGlu: 5.246 ± 0.071
3.054ValPhe: 3.054 ± 0.059
5.251ValGly: 5.251 ± 0.08
1.534ValHis: 1.534 ± 0.04
5.202ValIle: 5.202 ± 0.074
4.256ValLys: 4.256 ± 0.063
7.957ValLeu: 7.957 ± 0.09
2.11ValMet: 2.11 ± 0.044
2.844ValAsn: 2.844 ± 0.057
3.653ValPro: 3.653 ± 0.057
2.88ValGln: 2.88 ± 0.054
4.228ValArg: 4.228 ± 0.062
4.613ValSer: 4.613 ± 0.068
4.496ValThr: 4.496 ± 0.067
6.2ValVal: 6.2 ± 0.086
0.855ValTrp: 0.855 ± 0.031
2.231ValTyr: 2.231 ± 0.049
0.0ValXaa: 0.0 ± 0.0
Trp
0.836TrpAla: 0.836 ± 0.026
0.084TrpCys: 0.084 ± 0.008
0.567TrpAsp: 0.567 ± 0.024
0.768TrpGlu: 0.768 ± 0.026
0.497TrpPhe: 0.497 ± 0.022
0.805TrpGly: 0.805 ± 0.026
0.201TrpHis: 0.201 ± 0.014
0.869TrpIle: 0.869 ± 0.031
0.742TrpLys: 0.742 ± 0.027
1.306TrpLeu: 1.306 ± 0.04
0.395TrpMet: 0.395 ± 0.024
0.583TrpAsn: 0.583 ± 0.022
0.405TrpPro: 0.405 ± 0.021
0.494TrpGln: 0.494 ± 0.022
0.625TrpArg: 0.625 ± 0.025
0.623TrpSer: 0.623 ± 0.027
0.599TrpThr: 0.599 ± 0.026
0.785TrpVal: 0.785 ± 0.029
0.181TrpTrp: 0.181 ± 0.012
0.351TrpTyr: 0.351 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.204TyrAla: 2.204 ± 0.048
0.297TyrCys: 0.297 ± 0.019
1.627TyrAsp: 1.627 ± 0.044
2.057TyrGlu: 2.057 ± 0.046
1.259TyrPhe: 1.259 ± 0.038
2.617TyrGly: 2.617 ± 0.052
0.742TyrHis: 0.742 ± 0.026
1.766TyrIle: 1.766 ± 0.042
1.532TyrLys: 1.532 ± 0.034
3.291TyrLeu: 3.291 ± 0.055
0.711TyrMet: 0.711 ± 0.026
1.079TyrAsn: 1.079 ± 0.029
1.475TyrPro: 1.475 ± 0.039
1.282TyrGln: 1.282 ± 0.036
1.956TyrArg: 1.956 ± 0.044
1.755TyrSer: 1.755 ± 0.035
1.505TyrThr: 1.505 ± 0.038
2.207TyrVal: 2.207 ± 0.044
0.397TyrTrp: 0.397 ± 0.019
1.084TyrTyr: 1.084 ± 0.032
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3692 proteins (1079243 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski