Amino acid dipeptide frequency for Pseudescherichia vulneris NBRC 102420

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
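A minimal sketch of how such a frequency could be computed is given below. It assumes overlapping pairs counted within each protein (no pair spans two proteins) and normalisation by the total number of pairs; the page does not state these details, so they are assumptions. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes of the table.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille (‰).

    Assumption: overlapping pairs (residues i and i+1) are counted within
    each protein, and the counts are divided by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(f"{pair}: {freqs[pair]:.1f}")
```

With real data, `proteins` would be the list of protein sequences of the proteome, e.g. read from a FASTA file.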

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
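The arithmetic behind the expected value can be sketched as follows. The example assumes that "expected" means the uniform frequency of 1/400 and that any value above or below it counts as over- or underrepresented; the page does not give the exact rule used for colouring, so the function `classify` is only an illustration. The three sample values are taken from the table.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected frequency (in ‰) of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(freq_per_mille, expected):
    """Label a dipeptide relative to the expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()          # 20 x 20 = 400 dipeptides
expected_with_xaa = uniform_expectation_per_mille(21)  # 21 x 21 = 441 dipeptides

# Values copied from the table (in ‰).
sample = {"AlaAla": 11.069, "TrpTrp": 0.236, "ProAsp": 2.905}
labels = {name: classify(value, expected) for name, value in sample.items()}
print(expected, round(expected_with_xaa, 3), labels)
```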

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
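As a small illustration of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1% = 10‰)."""
    return value / 10.0


ala_ala = 11.069  # AlaAla frequency from the table, in ‰
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # the same value in percent
```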

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.069 ± 0.138
AlaCys: 1.093 ± 0.033
AlaAsp: 5.276 ± 0.073
AlaGlu: 5.911 ± 0.083
AlaPhe: 3.664 ± 0.063
AlaGly: 7.918 ± 0.08
AlaHis: 1.891 ± 0.043
AlaIle: 5.87 ± 0.071
AlaLys: 3.855 ± 0.058
AlaLeu: 12.3 ± 0.123
AlaMet: 3.023 ± 0.057
AlaAsn: 3.25 ± 0.059
AlaPro: 3.763 ± 0.064
AlaGln: 4.869 ± 0.066
AlaArg: 5.782 ± 0.075
AlaSer: 5.765 ± 0.08
AlaThr: 4.984 ± 0.068
AlaVal: 7.067 ± 0.091
AlaTrp: 1.746 ± 0.042
AlaTyr: 2.081 ± 0.04
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.94 ± 0.034
CysCys: 0.146 ± 0.011
CysAsp: 0.59 ± 0.025
CysGlu: 0.569 ± 0.022
CysPhe: 0.405 ± 0.017
CysGly: 0.972 ± 0.026
CysHis: 0.285 ± 0.016
CysIle: 0.529 ± 0.022
CysLys: 0.286 ± 0.014
CysLeu: 0.97 ± 0.025
CysMet: 0.233 ± 0.013
CysAsn: 0.324 ± 0.017
CysPro: 0.469 ± 0.021
CysGln: 0.42 ± 0.018
CysArg: 0.622 ± 0.021
CysSer: 0.615 ± 0.022
CysThr: 0.495 ± 0.022
CysVal: 0.707 ± 0.022
CysTrp: 0.177 ± 0.013
CysTyr: 0.31 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.592 ± 0.063
AspCys: 0.479 ± 0.017
AspAsp: 3.025 ± 0.057
AspGlu: 3.724 ± 0.061
AspPhe: 2.062 ± 0.036
AspGly: 3.947 ± 0.061
AspHis: 0.95 ± 0.029
AspIle: 3.363 ± 0.048
AspLys: 2.38 ± 0.048
AspLeu: 4.895 ± 0.061
AspMet: 1.344 ± 0.034
AspAsn: 2.2 ± 0.044
AspPro: 2.444 ± 0.045
AspGln: 1.663 ± 0.038
AspArg: 2.981 ± 0.05
AspSer: 2.846 ± 0.048
AspThr: 2.638 ± 0.048
AspVal: 3.821 ± 0.054
AspTrp: 0.818 ± 0.023
AspTyr: 1.887 ± 0.035
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.707 ± 0.082
GluCys: 0.459 ± 0.02
GluAsp: 2.359 ± 0.049
GluGlu: 3.465 ± 0.063
GluPhe: 1.881 ± 0.04
GluGly: 3.783 ± 0.057
GluHis: 1.352 ± 0.032
GluIle: 3.294 ± 0.051
GluLys: 3.12 ± 0.056
GluLeu: 5.808 ± 0.066
GluMet: 1.764 ± 0.037
GluAsn: 2.252 ± 0.042
GluPro: 2.115 ± 0.044
GluGln: 3.352 ± 0.06
GluArg: 3.75 ± 0.067
GluSer: 3.067 ± 0.043
GluThr: 3.023 ± 0.052
GluVal: 3.82 ± 0.057
GluTrp: 0.812 ± 0.026
GluTyr: 1.47 ± 0.039
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.793 ± 0.06
PheCys: 0.512 ± 0.019
PheAsp: 2.41 ± 0.048
PheGlu: 1.71 ± 0.04
PhePhe: 1.611 ± 0.039
PheGly: 3.177 ± 0.058
PheHis: 0.794 ± 0.027
PheIle: 2.475 ± 0.049
PheLys: 1.282 ± 0.032
PheLeu: 3.17 ± 0.059
PheMet: 0.98 ± 0.028
PheAsn: 1.719 ± 0.04
PhePro: 1.542 ± 0.037
PheGln: 1.163 ± 0.032
PheArg: 1.902 ± 0.03
PheSer: 3.005 ± 0.045
PheThr: 2.474 ± 0.046
PheVal: 2.461 ± 0.046
PheTrp: 0.622 ± 0.024
PheTyr: 1.179 ± 0.027
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.284 ± 0.075
GlyCys: 0.981 ± 0.03
GlyAsp: 3.865 ± 0.064
GlyGlu: 4.75 ± 0.061
GlyPhe: 3.245 ± 0.055
GlyGly: 5.517 ± 0.078
GlyHis: 1.635 ± 0.039
GlyIle: 4.886 ± 0.065
GlyLys: 3.842 ± 0.056
GlyLeu: 7.569 ± 0.107
GlyMet: 2.386 ± 0.048
GlyAsn: 2.691 ± 0.054
GlyPro: 2.098 ± 0.038
GlyGln: 3.056 ± 0.05
GlyArg: 3.933 ± 0.058
GlySer: 4.386 ± 0.065
GlyThr: 3.97 ± 0.055
GlyVal: 5.909 ± 0.065
GlyTrp: 1.343 ± 0.033
GlyTyr: 2.587 ± 0.045
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.009 ± 0.041
HisCys: 0.271 ± 0.014
HisAsp: 1.206 ± 0.033
HisGlu: 1.122 ± 0.029
HisPhe: 1.02 ± 0.027
HisGly: 1.763 ± 0.041
HisHis: 0.769 ± 0.029
HisIle: 1.231 ± 0.031
HisLys: 0.693 ± 0.027
HisLeu: 2.192 ± 0.041
HisMet: 0.536 ± 0.02
HisAsn: 0.821 ± 0.027
HisPro: 1.265 ± 0.032
HisGln: 1.058 ± 0.027
HisArg: 1.24 ± 0.031
HisSer: 1.216 ± 0.031
HisThr: 1.084 ± 0.031
HisVal: 1.268 ± 0.033
HisTrp: 0.4 ± 0.017
HisTyr: 0.875 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.45 ± 0.081
IleCys: 0.63 ± 0.024
IleAsp: 3.601 ± 0.046
IleGlu: 3.198 ± 0.054
IlePhe: 1.983 ± 0.044
IleGly: 4.452 ± 0.074
IleHis: 1.11 ± 0.028
IleIle: 3.118 ± 0.056
IleLys: 2.289 ± 0.052
IleLeu: 4.768 ± 0.076
IleMet: 1.208 ± 0.032
IleAsn: 2.598 ± 0.05
IlePro: 2.615 ± 0.049
IleGln: 1.805 ± 0.036
IleArg: 2.933 ± 0.048
IleSer: 3.541 ± 0.058
IleThr: 3.484 ± 0.055
IleVal: 4.095 ± 0.059
IleTrp: 0.683 ± 0.022
IleTyr: 1.512 ± 0.036
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.309 ± 0.067
LysCys: 0.262 ± 0.016
LysAsp: 1.956 ± 0.04
LysGlu: 2.393 ± 0.053
LysPhe: 1.107 ± 0.033
LysGly: 2.812 ± 0.048
LysHis: 0.849 ± 0.026
LysIle: 2.21 ± 0.05
LysLys: 2.068 ± 0.045
LysLeu: 4.058 ± 0.062
LysMet: 1.208 ± 0.033
LysAsn: 1.557 ± 0.036
LysPro: 2.125 ± 0.045
LysGln: 1.886 ± 0.038
LysArg: 2.545 ± 0.048
LysSer: 2.286 ± 0.043
LysThr: 2.458 ± 0.047
LysVal: 2.979 ± 0.048
LysTrp: 0.443 ± 0.017
LysTyr: 1.113 ± 0.031
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.915 ± 0.129
LeuCys: 1.208 ± 0.034
LeuAsp: 5.48 ± 0.066
LeuGlu: 5.37 ± 0.074
LeuPhe: 4.234 ± 0.062
LeuGly: 7.368 ± 0.095
LeuHis: 2.266 ± 0.041
LeuIle: 5.721 ± 0.074
LeuLys: 4.464 ± 0.061
LeuLeu: 11.962 ± 0.151
LeuMet: 2.907 ± 0.05
LeuAsn: 4.283 ± 0.063
LeuPro: 5.747 ± 0.067
LeuGln: 3.931 ± 0.056
LeuArg: 6.196 ± 0.087
LeuSer: 7.199 ± 0.084
LeuThr: 6.47 ± 0.076
LeuVal: 7.229 ± 0.085
LeuTrp: 1.471 ± 0.041
LeuTyr: 2.587 ± 0.038
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.924 ± 0.053
MetCys: 0.189 ± 0.012
MetAsp: 1.219 ± 0.03
MetGlu: 1.23 ± 0.032
MetPhe: 0.856 ± 0.023
MetGly: 1.861 ± 0.041
MetHis: 0.531 ± 0.019
MetIle: 1.415 ± 0.034
MetLys: 1.325 ± 0.031
MetLeu: 3.135 ± 0.054
MetMet: 0.895 ± 0.032
MetAsn: 1.048 ± 0.029
MetPro: 1.379 ± 0.032
MetGln: 1.305 ± 0.033
MetArg: 1.483 ± 0.036
MetSer: 1.829 ± 0.038
MetThr: 1.772 ± 0.036
MetVal: 1.973 ± 0.041
MetTrp: 0.251 ± 0.013
MetTyr: 0.515 ± 0.019
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.707 ± 0.056
AsnCys: 0.303 ± 0.015
AsnAsp: 2.108 ± 0.04
AsnGlu: 1.901 ± 0.04
AsnPhe: 1.306 ± 0.035
AsnGly: 3.104 ± 0.054
AsnHis: 0.784 ± 0.026
AsnIle: 2.181 ± 0.041
AsnLys: 1.473 ± 0.037
AsnLeu: 3.504 ± 0.057
AsnMet: 0.914 ± 0.027
AsnAsn: 1.519 ± 0.038
AsnPro: 2.044 ± 0.04
AsnGln: 1.552 ± 0.033
AsnArg: 2.052 ± 0.036
AsnSer: 2.043 ± 0.04
AsnThr: 1.956 ± 0.042
AsnVal: 2.681 ± 0.052
AsnTrp: 0.545 ± 0.021
AsnTyr: 1.15 ± 0.03
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.898 ± 0.076
ProCys: 0.348 ± 0.017
ProAsp: 2.905 ± 0.049
ProGlu: 3.485 ± 0.06
ProPhe: 1.809 ± 0.036
ProGly: 3.557 ± 0.063
ProHis: 1.04 ± 0.031
ProIle: 1.877 ± 0.041
ProLys: 1.489 ± 0.037
ProLeu: 5.204 ± 0.069
ProMet: 1.079 ± 0.028
ProAsn: 1.26 ± 0.027
ProPro: 1.76 ± 0.043
ProGln: 2.242 ± 0.046
ProArg: 2.052 ± 0.039
ProSer: 2.119 ± 0.039
ProThr: 2.138 ± 0.04
ProVal: 3.908 ± 0.058
ProTrp: 0.783 ± 0.028
ProTyr: 1.182 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.901 ± 0.079
GlnCys: 0.346 ± 0.014
GlnAsp: 1.867 ± 0.038
GlnGlu: 2.211 ± 0.048
GlnPhe: 1.459 ± 0.036
GlnGly: 3.132 ± 0.046
GlnHis: 1.175 ± 0.025
GlnIle: 2.25 ± 0.046
GlnLys: 1.759 ± 0.04
GlnLeu: 4.959 ± 0.072
GlnMet: 1.191 ± 0.032
GlnAsn: 1.431 ± 0.033
GlnPro: 2.276 ± 0.054
GlnGln: 3.286 ± 0.077
GlnArg: 3.099 ± 0.059
GlnSer: 2.384 ± 0.044
GlnThr: 2.312 ± 0.047
GlnVal: 2.949 ± 0.048
GlnTrp: 0.647 ± 0.022
GlnTyr: 1.159 ± 0.032
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.825 ± 0.062
ArgCys: 0.568 ± 0.02
ArgAsp: 3.155 ± 0.056
ArgGlu: 3.748 ± 0.062
ArgPhe: 2.586 ± 0.039
ArgGly: 3.546 ± 0.058
ArgHis: 1.522 ± 0.036
ArgIle: 3.335 ± 0.055
ArgLys: 2.203 ± 0.042
ArgLeu: 6.741 ± 0.087
ArgMet: 1.596 ± 0.035
ArgAsn: 2.015 ± 0.041
ArgPro: 2.337 ± 0.045
ArgGln: 3.063 ± 0.052
ArgArg: 3.549 ± 0.069
ArgSer: 2.821 ± 0.046
ArgThr: 2.643 ± 0.05
ArgVal: 3.964 ± 0.048
ArgTrp: 1.074 ± 0.032
ArgTyr: 2.196 ± 0.043
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.816 ± 0.077
SerCys: 0.516 ± 0.022
SerAsp: 3.109 ± 0.054
SerGlu: 3.266 ± 0.05
SerPhe: 2.231 ± 0.043
SerGly: 5.307 ± 0.069
SerHis: 1.361 ± 0.036
SerIle: 2.937 ± 0.05
SerLys: 1.949 ± 0.039
SerLeu: 6.64 ± 0.079
SerMet: 1.457 ± 0.033
SerAsn: 1.933 ± 0.039
SerPro: 2.618 ± 0.054
SerGln: 2.536 ± 0.049
SerArg: 3.49 ± 0.062
SerSer: 3.415 ± 0.068
SerThr: 3.092 ± 0.056
SerVal: 4.391 ± 0.059
SerTrp: 0.979 ± 0.026
SerTyr: 1.601 ± 0.038
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.155 ± 0.07
ThrCys: 0.481 ± 0.02
ThrAsp: 2.751 ± 0.058
ThrGlu: 2.641 ± 0.049
ThrPhe: 2.153 ± 0.047
ThrGly: 4.622 ± 0.059
ThrHis: 1.255 ± 0.029
ThrIle: 2.898 ± 0.056
ThrLys: 1.538 ± 0.035
ThrLeu: 7.656 ± 0.088
ThrMet: 1.14 ± 0.029
ThrAsn: 1.548 ± 0.034
ThrPro: 3.355 ± 0.059
ThrGln: 2.311 ± 0.049
ThrArg: 3.105 ± 0.054
ThrSer: 2.911 ± 0.046
ThrThr: 2.953 ± 0.056
ThrVal: 4.094 ± 0.088
ThrTrp: 0.842 ± 0.026
ThrTyr: 1.243 ± 0.035
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.294 ± 0.081
ValCys: 0.72 ± 0.028
ValAsp: 3.815 ± 0.059
ValGlu: 4.072 ± 0.061
ValPhe: 2.527 ± 0.048
ValGly: 4.926 ± 0.066
ValHis: 1.299 ± 0.032
ValIle: 4.343 ± 0.066
ValLys: 3.177 ± 0.051
ValLeu: 7.307 ± 0.087
ValMet: 2.249 ± 0.049
ValAsn: 2.864 ± 0.053
ValPro: 3.145 ± 0.053
ValGln: 2.489 ± 0.051
ValArg: 3.779 ± 0.056
ValSer: 4.612 ± 0.053
ValThr: 4.564 ± 0.075
ValVal: 5.651 ± 0.071
ValTrp: 1.011 ± 0.029
ValTyr: 1.769 ± 0.038
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.066 ± 0.028
TrpCys: 0.182 ± 0.013
TrpAsp: 0.676 ± 0.027
TrpGlu: 0.624 ± 0.021
TrpPhe: 0.721 ± 0.025
TrpGly: 0.923 ± 0.033
TrpHis: 0.476 ± 0.02
TrpIle: 0.753 ± 0.024
TrpLys: 0.502 ± 0.021
TrpLeu: 2.312 ± 0.055
TrpMet: 0.426 ± 0.019
TrpAsn: 0.508 ± 0.019
TrpPro: 0.693 ± 0.023
TrpGln: 1.224 ± 0.038
TrpArg: 1.126 ± 0.03
TrpSer: 0.834 ± 0.028
TrpThr: 0.615 ± 0.023
TrpVal: 0.951 ± 0.028
TrpTrp: 0.236 ± 0.014
TrpTyr: 0.46 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.523 ± 0.046
TyrCys: 0.373 ± 0.017
TyrAsp: 1.684 ± 0.038
TyrGlu: 1.264 ± 0.033
TyrPhe: 1.124 ± 0.03
TyrGly: 2.246 ± 0.046
TyrHis: 0.695 ± 0.024
TyrIle: 1.385 ± 0.032
TyrLys: 0.928 ± 0.022
TyrLeu: 2.857 ± 0.048
TyrMet: 0.597 ± 0.019
TyrAsn: 1.002 ± 0.032
TyrPro: 1.348 ± 0.034
TyrGln: 1.582 ± 0.037
TyrArg: 1.867 ± 0.039
TyrSer: 1.75 ± 0.042
TyrThr: 1.531 ± 0.033
TyrVal: 1.696 ± 0.034
TyrTrp: 0.424 ± 0.017
TyrTyr: 0.912 ± 0.029
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4196 proteins (1303881 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
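A protein-level bootstrap of this kind could look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. Reporting the standard deviation of the resampled estimates is an assumption, as is the counting scheme (overlapping pairs within each protein); the page does not describe either. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in ‰.

    Proteins, not individual residues, are resampled with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKKLA", "MKA", "AAKK", "MKMK"]
mean, err = bootstrap_error(toy, "MK")
print(f"MK: {mean:.1f} ± {err:.1f}")
```

Resampling at the protein level keeps the residues of one protein together, so the estimate reflects variation between proteins rather than between individual positions.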

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: UniProt
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski