Amino acid dipeptide frequency for Sporomusa silvacetica DSM 10669

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
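The comparison described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein, that any non-standard residue is mapped to Xaa (written here as "X"), and that "expected" means the uniform value of 1/400.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
EXPECTED_UNIFORM = 1000.0 / 400     # 2.5 per mille, i.e. 0.25%


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille.

    Assumption: overlapping pairs are counted within each protein,
    and residues outside the standard 20 are mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(per_mille, expected=EXPECTED_UNIFORM):
    """Label a frequency relative to the uniform expectation."""
    if per_mille > expected:
        return "over"    # would be marked red
    if per_mille < expected:
        return "under"   # would be marked blue
    return "expected"


if __name__ == "__main__":
    # Two values taken from the table: AlaAla and CysCys.
    print(classify(9.839), classify(0.283))
```

With this baseline, AlaAla (9.839‰) falls in the overrepresented group and CysCys (0.283‰) in the underrepresented one.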

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
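As a small worked example of the unit conversion, the sketch below turns a table value into a fraction and a percentage and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
PER_MILLE = 1e-3  # one per mille expressed as a fraction


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * PER_MILLE


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1% = 10 per mille)."""
    return value / 10.0


if __name__ == "__main__":
    ala_ala = 9.839                     # AlaAla frequency from the table, in per mille
    uniform = 1000.0 / 400              # uniform expectation, 2.5 per mille
    print(f"fraction: {per_mille_to_fraction(ala_ala):.6f}")
    print(f"percent:  {per_mille_to_percent(ala_ala):.4f}")
    print(f"ratio to uniform expectation: {ala_ala / uniform:.2f}")
```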

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.839AlaAla: 9.839 ± 0.117
1.172AlaCys: 1.172 ± 0.033
4.534AlaAsp: 4.534 ± 0.052
5.631AlaGlu: 5.631 ± 0.068
3.04AlaPhe: 3.04 ± 0.047
7.376AlaGly: 7.376 ± 0.073
1.339AlaHis: 1.339 ± 0.027
6.855AlaIle: 6.855 ± 0.064
5.483AlaLys: 5.483 ± 0.067
8.235AlaLeu: 8.235 ± 0.077
2.569AlaMet: 2.569 ± 0.044
3.411AlaAsn: 3.411 ± 0.047
2.537AlaPro: 2.537 ± 0.041
3.207AlaGln: 3.207 ± 0.045
3.583AlaArg: 3.583 ± 0.05
4.583AlaSer: 4.583 ± 0.064
4.649AlaThr: 4.649 ± 0.061
7.292AlaVal: 7.292 ± 0.071
0.775AlaTrp: 0.775 ± 0.02
2.639AlaTyr: 2.639 ± 0.047
0.0AlaXaa: 0.0 ± 0.0
Cys
0.929CysAla: 0.929 ± 0.027
0.283CysCys: 0.283 ± 0.015
0.607CysAsp: 0.607 ± 0.021
0.693CysGlu: 0.693 ± 0.021
0.539CysPhe: 0.539 ± 0.019
1.438CysGly: 1.438 ± 0.037
0.332CysHis: 0.332 ± 0.017
0.896CysIle: 0.896 ± 0.025
0.685CysLys: 0.685 ± 0.02
1.236CysLeu: 1.236 ± 0.029
0.362CysMet: 0.362 ± 0.014
0.531CysAsn: 0.531 ± 0.02
0.754CysPro: 0.754 ± 0.026
0.607CysGln: 0.607 ± 0.018
0.693CysArg: 0.693 ± 0.02
0.873CysSer: 0.873 ± 0.024
0.653CysThr: 0.653 ± 0.021
0.838CysVal: 0.838 ± 0.024
0.133CysTrp: 0.133 ± 0.009
0.441CysTyr: 0.441 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
3.708AspAla: 3.708 ± 0.051
0.721AspCys: 0.721 ± 0.024
2.234AspAsp: 2.234 ± 0.041
3.162AspGlu: 3.162 ± 0.05
2.102AspPhe: 2.102 ± 0.038
3.42AspGly: 3.42 ± 0.041
0.808AspHis: 0.808 ± 0.021
4.44AspIle: 4.44 ± 0.058
3.425AspLys: 3.425 ± 0.051
4.338AspLeu: 4.338 ± 0.057
1.444AspMet: 1.444 ± 0.03
2.244AspAsn: 2.244 ± 0.039
1.81AspPro: 1.81 ± 0.034
1.53AspGln: 1.53 ± 0.032
2.2AspArg: 2.2 ± 0.041
2.726AspSer: 2.726 ± 0.046
2.738AspThr: 2.738 ± 0.044
3.55AspVal: 3.55 ± 0.056
0.592AspTrp: 0.592 ± 0.019
1.976AspTyr: 1.976 ± 0.036
0.0AspXaa: 0.0 ± 0.0
Glu
5.259GluAla: 5.259 ± 0.06
0.717GluCys: 0.717 ± 0.02
2.604GluAsp: 2.604 ± 0.048
4.227GluGlu: 4.227 ± 0.066
2.398GluPhe: 2.398 ± 0.038
3.353GluGly: 3.353 ± 0.047
1.21GluHis: 1.21 ± 0.03
4.986GluIle: 4.986 ± 0.069
4.604GluLys: 4.604 ± 0.064
6.595GluLeu: 6.595 ± 0.063
1.827GluMet: 1.827 ± 0.039
2.767GluAsn: 2.767 ± 0.046
1.834GluPro: 1.834 ± 0.03
3.222GluGln: 3.222 ± 0.051
3.17GluArg: 3.17 ± 0.045
2.724GluSer: 2.724 ± 0.043
3.212GluThr: 3.212 ± 0.046
4.368GluVal: 4.368 ± 0.049
0.602GluTrp: 0.602 ± 0.02
2.214GluTyr: 2.214 ± 0.041
0.0GluXaa: 0.0 ± 0.0
Phe
3.349PheAla: 3.349 ± 0.044
0.609PheCys: 0.609 ± 0.021
2.158PheAsp: 2.158 ± 0.034
2.03PheGlu: 2.03 ± 0.035
1.692PhePhe: 1.692 ± 0.036
3.165PheGly: 3.165 ± 0.045
0.693PheHis: 0.693 ± 0.021
2.889PheIle: 2.889 ± 0.042
1.961PheLys: 1.961 ± 0.034
3.613PheLeu: 3.613 ± 0.052
1.025PheMet: 1.025 ± 0.029
1.654PheAsn: 1.654 ± 0.032
1.464PhePro: 1.464 ± 0.032
1.111PheGln: 1.111 ± 0.026
1.586PheArg: 1.586 ± 0.032
2.715PheSer: 2.715 ± 0.046
2.348PheThr: 2.348 ± 0.037
2.66PheVal: 2.66 ± 0.049
0.458PheTrp: 0.458 ± 0.019
1.34PheTyr: 1.34 ± 0.029
0.0PheXaa: 0.0 ± 0.0
Gly
5.777GlyAla: 5.777 ± 0.062
1.264GlyCys: 1.264 ± 0.028
3.332GlyAsp: 3.332 ± 0.055
4.008GlyGlu: 4.008 ± 0.046
3.108GlyPhe: 3.108 ± 0.051
5.447GlyGly: 5.447 ± 0.076
1.414GlyHis: 1.414 ± 0.032
6.328GlyIle: 6.328 ± 0.074
4.911GlyLys: 4.911 ± 0.059
7.17GlyLeu: 7.17 ± 0.074
2.345GlyMet: 2.345 ± 0.038
3.02GlyAsn: 3.02 ± 0.054
1.902GlyPro: 1.902 ± 0.033
2.873GlyGln: 2.873 ± 0.049
3.292GlyArg: 3.292 ± 0.049
4.204GlySer: 4.204 ± 0.063
4.277GlyThr: 4.277 ± 0.078
5.461GlyVal: 5.461 ± 0.058
0.785GlyTrp: 0.785 ± 0.022
2.916GlyTyr: 2.916 ± 0.042
0.0GlyXaa: 0.0 ± 0.0
His
1.312HisAla: 1.312 ± 0.029
0.324HisCys: 0.324 ± 0.014
0.902HisAsp: 0.902 ± 0.025
1.024HisGlu: 1.024 ± 0.025
0.742HisPhe: 0.742 ± 0.021
1.369HisGly: 1.369 ± 0.026
0.458HisHis: 0.458 ± 0.018
1.398HisIle: 1.398 ± 0.028
0.992HisLys: 0.992 ± 0.024
1.742HisLeu: 1.742 ± 0.038
0.475HisMet: 0.475 ± 0.017
0.825HisAsn: 0.825 ± 0.021
0.934HisPro: 0.934 ± 0.025
0.623HisGln: 0.623 ± 0.021
0.808HisArg: 0.808 ± 0.025
1.059HisSer: 1.059 ± 0.024
1.024HisThr: 1.024 ± 0.027
1.17HisVal: 1.17 ± 0.025
0.221HisTrp: 0.221 ± 0.01
0.707HisTyr: 0.707 ± 0.021
0.0HisXaa: 0.0 ± 0.0
Ile
7.151IleAla: 7.151 ± 0.074
1.011IleCys: 1.011 ± 0.024
4.199IleAsp: 4.199 ± 0.053
4.63IleGlu: 4.63 ± 0.059
2.654IlePhe: 2.654 ± 0.043
5.745IleGly: 5.745 ± 0.07
1.328IleHis: 1.328 ± 0.031
5.944IleIle: 5.944 ± 0.077
4.542IleLys: 4.542 ± 0.055
6.606IleLeu: 6.606 ± 0.072
1.954IleMet: 1.954 ± 0.036
3.485IleAsn: 3.485 ± 0.046
3.462IlePro: 3.462 ± 0.048
2.34IleGln: 2.34 ± 0.041
3.258IleArg: 3.258 ± 0.043
4.763IleSer: 4.763 ± 0.058
4.764IleThr: 4.764 ± 0.064
5.663IleVal: 5.663 ± 0.069
0.615IleTrp: 0.615 ± 0.019
2.17IleTyr: 2.17 ± 0.039
0.0IleXaa: 0.0 ± 0.0
Lys
5.119LysAla: 5.119 ± 0.058
0.678LysCys: 0.678 ± 0.02
2.909LysAsp: 2.909 ± 0.041
4.301LysGlu: 4.301 ± 0.061
1.893LysPhe: 1.893 ± 0.032
3.565LysGly: 3.565 ± 0.052
0.993LysHis: 0.993 ± 0.022
4.586LysIle: 4.586 ± 0.049
4.034LysLys: 4.034 ± 0.062
5.815LysLeu: 5.815 ± 0.066
1.883LysMet: 1.883 ± 0.037
2.89LysAsn: 2.89 ± 0.044
2.426LysPro: 2.426 ± 0.035
2.647LysGln: 2.647 ± 0.04
2.673LysArg: 2.673 ± 0.034
3.032LysSer: 3.032 ± 0.052
3.394LysThr: 3.394 ± 0.048
4.55LysVal: 4.55 ± 0.058
0.57LysTrp: 0.57 ± 0.021
2.159LysTyr: 2.159 ± 0.035
0.0LysXaa: 0.0 ± 0.0
Leu
9.811LeuAla: 9.811 ± 0.094
1.249LeuCys: 1.249 ± 0.03
4.587LeuAsp: 4.587 ± 0.047
5.697LeuGlu: 5.697 ± 0.069
3.789LeuPhe: 3.789 ± 0.056
7.164LeuGly: 7.164 ± 0.073
1.713LeuHis: 1.713 ± 0.034
6.364LeuIle: 6.364 ± 0.065
5.618LeuLys: 5.618 ± 0.057
9.693LeuLeu: 9.693 ± 0.095
2.279LeuMet: 2.279 ± 0.041
3.923LeuAsn: 3.923 ± 0.051
4.493LeuPro: 4.493 ± 0.057
3.355LeuGln: 3.355 ± 0.044
4.178LeuArg: 4.178 ± 0.053
6.184LeuSer: 6.184 ± 0.058
6.15LeuThr: 6.15 ± 0.064
6.907LeuVal: 6.907 ± 0.073
0.886LeuTrp: 0.886 ± 0.027
2.773LeuTyr: 2.773 ± 0.053
0.0LeuXaa: 0.0 ± 0.0
Met
2.722MetAla: 2.722 ± 0.045
0.292MetCys: 0.292 ± 0.015
1.32MetAsp: 1.32 ± 0.029
1.682MetGlu: 1.682 ± 0.032
0.9MetPhe: 0.9 ± 0.024
1.938MetGly: 1.938 ± 0.031
0.423MetHis: 0.423 ± 0.015
1.846MetIle: 1.846 ± 0.03
1.702MetLys: 1.702 ± 0.031
2.816MetLeu: 2.816 ± 0.043
0.689MetMet: 0.689 ± 0.021
1.259MetAsn: 1.259 ± 0.025
1.267MetPro: 1.267 ± 0.028
1.041MetGln: 1.041 ± 0.024
1.202MetArg: 1.202 ± 0.025
1.748MetSer: 1.748 ± 0.034
1.558MetThr: 1.558 ± 0.032
2.037MetVal: 2.037 ± 0.037
0.204MetTrp: 0.204 ± 0.012
0.672MetTyr: 0.672 ± 0.023
0.0MetXaa: 0.0 ± 0.0
Asn
3.151AsnAla: 3.151 ± 0.054
0.613AsnCys: 0.613 ± 0.021
1.991AsnAsp: 1.991 ± 0.038
2.343AsnGlu: 2.343 ± 0.035
1.662AsnPhe: 1.662 ± 0.036
2.91AsnGly: 2.91 ± 0.049
0.81AsnHis: 0.81 ± 0.022
3.56AsnIle: 3.56 ± 0.047
2.687AsnLys: 2.687 ± 0.043
4.12AsnLeu: 4.12 ± 0.052
1.151AsnMet: 1.151 ± 0.026
1.985AsnAsn: 1.985 ± 0.038
2.156AsnPro: 2.156 ± 0.041
1.659AsnGln: 1.659 ± 0.034
1.973AsnArg: 1.973 ± 0.033
2.527AsnSer: 2.527 ± 0.047
2.367AsnThr: 2.367 ± 0.037
2.792AsnVal: 2.792 ± 0.046
0.508AsnTrp: 0.508 ± 0.015
1.595AsnTyr: 1.595 ± 0.037
0.0AsnXaa: 0.0 ± 0.0
Pro
3.402ProAla: 3.402 ± 0.048
0.464ProCys: 0.464 ± 0.019
2.362ProAsp: 2.362 ± 0.038
3.192ProGlu: 3.192 ± 0.047
1.61ProPhe: 1.61 ± 0.03
3.158ProGly: 3.158 ± 0.05
0.766ProHis: 0.766 ± 0.02
2.629ProIle: 2.629 ± 0.039
1.928ProLys: 1.928 ± 0.033
3.724ProLeu: 3.724 ± 0.047
0.902ProMet: 0.902 ± 0.023
1.465ProAsn: 1.465 ± 0.031
1.395ProPro: 1.395 ± 0.03
1.499ProGln: 1.499 ± 0.035
1.396ProArg: 1.396 ± 0.031
1.96ProSer: 1.96 ± 0.032
1.993ProThr: 1.993 ± 0.036
3.447ProVal: 3.447 ± 0.05
0.423ProTrp: 0.423 ± 0.017
1.35ProTyr: 1.35 ± 0.027
0.0ProXaa: 0.0 ± 0.0
Gln
4.002GlnAla: 4.002 ± 0.054
0.456GlnCys: 0.456 ± 0.016
1.833GlnAsp: 1.833 ± 0.035
2.75GlnGlu: 2.75 ± 0.05
1.28GlnPhe: 1.28 ± 0.026
2.694GlnGly: 2.694 ± 0.048
0.725GlnHis: 0.725 ± 0.019
2.644GlnIle: 2.644 ± 0.042
2.159GlnLys: 2.159 ± 0.039
3.769GlnLeu: 3.769 ± 0.051
1.066GlnMet: 1.066 ± 0.028
1.507GlnAsn: 1.507 ± 0.029
1.51GlnPro: 1.51 ± 0.029
2.031GlnGln: 2.031 ± 0.044
1.606GlnArg: 1.606 ± 0.031
2.05GlnSer: 2.05 ± 0.039
2.077GlnThr: 2.077 ± 0.042
2.804GlnVal: 2.804 ± 0.043
0.343GlnTrp: 0.343 ± 0.015
1.333GlnTyr: 1.333 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
2.986ArgAla: 2.986 ± 0.045
0.568ArgCys: 0.568 ± 0.023
2.057ArgAsp: 2.057 ± 0.04
3.181ArgGlu: 3.181 ± 0.049
1.772ArgPhe: 1.772 ± 0.031
2.722ArgGly: 2.722 ± 0.049
0.918ArgHis: 0.918 ± 0.026
3.347ArgIle: 3.347 ± 0.048
2.693ArgLys: 2.693 ± 0.039
4.676ArgLeu: 4.676 ± 0.066
1.329ArgMet: 1.329 ± 0.029
1.821ArgAsn: 1.821 ± 0.031
1.654ArgPro: 1.654 ± 0.033
2.375ArgGln: 2.375 ± 0.043
2.247ArgArg: 2.247 ± 0.043
2.217ArgSer: 2.217 ± 0.034
2.192ArgThr: 2.192 ± 0.037
3.026ArgVal: 3.026 ± 0.042
0.47ArgTrp: 0.47 ± 0.016
1.631ArgTyr: 1.631 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
4.644SerAla: 4.644 ± 0.064
0.811SerCys: 0.811 ± 0.024
2.723SerAsp: 2.723 ± 0.044
3.166SerGlu: 3.166 ± 0.05
2.457SerPhe: 2.457 ± 0.042
4.873SerGly: 4.873 ± 0.071
1.101SerHis: 1.101 ± 0.028
4.245SerIle: 4.245 ± 0.054
3.168SerLys: 3.168 ± 0.044
5.76SerLeu: 5.76 ± 0.062
1.571SerMet: 1.571 ± 0.034
2.246SerAsn: 2.246 ± 0.04
2.219SerPro: 2.219 ± 0.039
2.321SerGln: 2.321 ± 0.037
2.603SerArg: 2.603 ± 0.044
3.591SerSer: 3.591 ± 0.068
3.103SerThr: 3.103 ± 0.062
3.995SerVal: 3.995 ± 0.054
0.65SerTrp: 0.65 ± 0.021
1.991SerTyr: 1.991 ± 0.039
0.0SerXaa: 0.0 ± 0.0
Thr
5.522ThrAla: 5.522 ± 0.063
0.662ThrCys: 0.662 ± 0.021
2.805ThrAsp: 2.805 ± 0.045
3.206ThrGlu: 3.206 ± 0.046
2.005ThrPhe: 2.005 ± 0.039
5.066ThrGly: 5.066 ± 0.068
0.969ThrHis: 0.969 ± 0.025
4.628ThrIle: 4.628 ± 0.057
3.034ThrLys: 3.034 ± 0.045
5.215ThrLeu: 5.215 ± 0.056
1.473ThrMet: 1.473 ± 0.037
2.336ThrAsn: 2.336 ± 0.039
2.56ThrPro: 2.56 ± 0.041
1.78ThrGln: 1.78 ± 0.034
2.079ThrArg: 2.079 ± 0.033
3.072ThrSer: 3.072 ± 0.064
3.533ThrThr: 3.533 ± 0.066
4.683ThrVal: 4.683 ± 0.062
0.531ThrTrp: 0.531 ± 0.021
1.73ThrTyr: 1.73 ± 0.036
0.0ThrXaa: 0.0 ± 0.0
Val
6.864ValAla: 6.864 ± 0.072
0.966ValCys: 0.966 ± 0.026
3.752ValAsp: 3.752 ± 0.054
4.452ValGlu: 4.452 ± 0.064
2.976ValPhe: 2.976 ± 0.038
5.096ValGly: 5.096 ± 0.066
1.179ValHis: 1.179 ± 0.024
5.778ValIle: 5.778 ± 0.065
4.237ValLys: 4.237 ± 0.058
7.125ValLeu: 7.125 ± 0.073
1.981ValMet: 1.981 ± 0.032
3.199ValAsn: 3.199 ± 0.046
2.866ValPro: 2.866 ± 0.045
2.383ValGln: 2.383 ± 0.036
3.147ValArg: 3.147 ± 0.044
4.547ValSer: 4.547 ± 0.058
4.553ValThr: 4.553 ± 0.062
5.882ValVal: 5.882 ± 0.074
0.662ValTrp: 0.662 ± 0.019
2.202ValTyr: 2.202 ± 0.036
0.0ValXaa: 0.0 ± 0.0
Trp
0.692TrpAla: 0.692 ± 0.021
0.123TrpCys: 0.123 ± 0.009
0.479TrpAsp: 0.479 ± 0.019
0.631TrpGlu: 0.631 ± 0.019
0.408TrpPhe: 0.408 ± 0.015
0.77TrpGly: 0.77 ± 0.023
0.225TrpHis: 0.225 ± 0.012
0.598TrpIle: 0.598 ± 0.02
0.475TrpLys: 0.475 ± 0.018
1.22TrpLeu: 1.22 ± 0.031
0.255TrpMet: 0.255 ± 0.013
0.45TrpAsn: 0.45 ± 0.018
0.365TrpPro: 0.365 ± 0.015
0.587TrpGln: 0.587 ± 0.021
0.537TrpArg: 0.537 ± 0.019
0.54TrpSer: 0.54 ± 0.017
0.457TrpThr: 0.457 ± 0.019
0.673TrpVal: 0.673 ± 0.02
0.162TrpTrp: 0.162 ± 0.011
0.344TrpTyr: 0.344 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.512TyrAla: 2.512 ± 0.035
0.542TyrCys: 0.542 ± 0.019
1.778TyrAsp: 1.778 ± 0.037
1.886TyrGlu: 1.886 ± 0.034
1.495TyrPhe: 1.495 ± 0.033
2.565TyrGly: 2.565 ± 0.042
0.705TyrHis: 0.705 ± 0.021
2.32TyrIle: 2.32 ± 0.036
1.779TyrLys: 1.779 ± 0.036
3.346TyrLeu: 3.346 ± 0.054
0.737TyrMet: 0.737 ± 0.021
1.541TyrAsn: 1.541 ± 0.032
1.456TyrPro: 1.456 ± 0.034
1.445TyrGln: 1.445 ± 0.029
1.652TyrArg: 1.652 ± 0.037
2.124TyrSer: 2.124 ± 0.043
1.849TyrThr: 1.849 ± 0.04
2.049TyrVal: 2.049 ± 0.043
0.402TyrTrp: 0.402 ± 0.017
1.367TyrTyr: 1.367 ± 0.035
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5636 proteins (1641199 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
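One way such an error estimate could be obtained is sketched below. It is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each of 100 replicates, and the standard deviation across replicates is reported as the error. The function names, the fixed seed, and the use of overlapping pairs are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_dipeptide_error(sequences, pair, n_boot=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide frequency.

    Proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    # Precompute, per protein, the count of the target pair and of all pairs.
    per_protein = []
    for seq in sequences:
        counts = pair_counts(seq)
        per_protein.append((counts[pair], sum(counts.values())))

    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MAALAAK", "MKLAAGA", "MCCWYAA", "MAAAAAL"]  # made-up sequences
    mean, error = bootstrap_dipeptide_error(toy_proteome, "AA")
    print(f"AA: {mean:.3f} ± {error:.3f} per mille")
```

Resampling at the protein level keeps dipeptides from the same protein together, so the estimate reflects variation between proteins rather than treating every dipeptide as independent.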

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski