Amino acid dipeptide frequency for Marinobacter sp. R17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
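The uniform baseline described above can be sketched in a few lines of Python. This is only an illustration: the page does not state the exact rule used for colouring, so the plain comparison against 2.5‰ (ignoring the error estimates) and the names `EXPECTED_PER_MILLE` and `classify` are assumptions. The example values are copied from the table.

```python
# Uniform baseline: each of the 400 standard dipeptides equally likely.
N_STANDARD = 20
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25 %


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform baseline (assumed rule)."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


# A few per-mille values taken from the table.
examples = {"AlaAla": 10.521, "LeuLeu": 10.663, "CysCys": 0.126, "TrpTrp": 0.206}
labels = {pair: classify(value) for pair, value in examples.items()}

for pair, label in labels.items():
    print(pair, examples[pair], label)
```

With the 21-letter alphabet (including Xaa) the same calculation would use 441 instead of 400 possibilities.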

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
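As a rough illustration of how such per-mille values can be obtained, the following Python sketch counts dipeptides in a list of protein sequences. It assumes overlapping windows that do not cross protein boundaries and one-letter residue codes; the page does not describe the counting procedure, so these choices, the function name `dipeptide_per_mille`, and the toy sequences are assumptions.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2, kept within each protein.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
toy = ["MALLA", "ALA"]
freqs = dipeptide_per_mille(toy)

# Converting a per-mille value back to a plain fraction.
fraction_al = freqs["AL"] * 1e-3
print(freqs, fraction_al)
```

The frequencies over all observed dipeptides sum to 1000‰, which is a convenient sanity check.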

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 10.521 ± 0.108 | 1.039 ± 0.029 | 6.393 ± 0.073 | 6.381 ± 0.081 | 3.778 ± 0.063 | 8.631 ± 0.097 | 2.025 ± 0.039 | 5.448 ± 0.076 | 3.001 ± 0.061 | 11.618 ± 0.124 | 2.928 ± 0.047 | 2.788 ± 0.048 | 4.191 ± 0.069 | 3.648 ± 0.054 | 6.832 ± 0.08 | 5.891 ± 0.075 | 4.935 ± 0.071 | 7.308 ± 0.092 | 1.387 ± 0.037 | 2.513 ± 0.047 | 0.0 ± 0.0
Cys | 0.833 ± 0.023 | 0.126 ± 0.01 | 0.603 ± 0.022 | 0.555 ± 0.018 | 0.346 ± 0.017 | 0.92 ± 0.028 | 0.303 ± 0.016 | 0.437 ± 0.019 | 0.258 ± 0.014 | 0.989 ± 0.026 | 0.162 ± 0.011 | 0.26 ± 0.014 | 0.504 ± 0.019 | 0.379 ± 0.017 | 0.645 ± 0.024 | 0.51 ± 0.019 | 0.433 ± 0.02 | 0.644 ± 0.022 | 0.133 ± 0.01 | 0.254 ± 0.013 | 0.0 ± 0.0
Asp | 6.251 ± 0.074 | 0.538 ± 0.021 | 4.218 ± 0.07 | 4.159 ± 0.064 | 2.306 ± 0.045 | 5.008 ± 0.085 | 1.496 ± 0.035 | 3.317 ± 0.051 | 2.196 ± 0.053 | 6.142 ± 0.072 | 1.535 ± 0.034 | 2.048 ± 0.044 | 3.088 ± 0.047 | 2.551 ± 0.047 | 4.311 ± 0.062 | 3.18 ± 0.058 | 3.175 ± 0.051 | 4.471 ± 0.07 | 1.116 ± 0.032 | 2.017 ± 0.042 | 0.0 ± 0.0
Glu | 6.971 ± 0.091 | 0.436 ± 0.019 | 3.461 ± 0.058 | 3.708 ± 0.071 | 1.886 ± 0.038 | 4.212 ± 0.061 | 1.571 ± 0.038 | 3.083 ± 0.056 | 2.741 ± 0.055 | 6.19 ± 0.087 | 1.535 ± 0.038 | 1.983 ± 0.041 | 2.875 ± 0.053 | 3.241 ± 0.056 | 4.807 ± 0.076 | 3.463 ± 0.054 | 3.456 ± 0.052 | 3.984 ± 0.06 | 0.764 ± 0.024 | 1.372 ± 0.031 | 0.0 ± 0.0
Phe | 3.418 ± 0.057 | 0.44 ± 0.017 | 2.544 ± 0.045 | 2.283 ± 0.045 | 1.522 ± 0.04 | 3.241 ± 0.052 | 0.815 ± 0.021 | 1.898 ± 0.038 | 1.141 ± 0.027 | 3.575 ± 0.07 | 0.909 ± 0.028 | 1.352 ± 0.035 | 1.551 ± 0.039 | 1.235 ± 0.029 | 2.433 ± 0.046 | 2.516 ± 0.045 | 1.991 ± 0.043 | 2.574 ± 0.048 | 0.562 ± 0.024 | 1.105 ± 0.028 | 0.0 ± 0.0
Gly | 6.857 ± 0.091 | 0.944 ± 0.028 | 4.828 ± 0.063 | 4.944 ± 0.068 | 3.323 ± 0.05 | 5.898 ± 0.086 | 1.982 ± 0.043 | 4.59 ± 0.071 | 3.023 ± 0.054 | 8.561 ± 0.111 | 2.297 ± 0.05 | 2.315 ± 0.047 | 2.703 ± 0.044 | 3.493 ± 0.053 | 4.868 ± 0.065 | 4.278 ± 0.067 | 3.909 ± 0.058 | 5.72 ± 0.082 | 1.25 ± 0.035 | 2.545 ± 0.044 | 0.0 ± 0.0
His | 2.012 ± 0.039 | 0.33 ± 0.016 | 1.332 ± 0.036 | 1.206 ± 0.033 | 1.048 ± 0.03 | 1.813 ± 0.043 | 0.751 ± 0.027 | 1.139 ± 0.028 | 0.668 ± 0.021 | 2.446 ± 0.041 | 0.537 ± 0.022 | 0.656 ± 0.023 | 1.497 ± 0.035 | 0.958 ± 0.029 | 1.65 ± 0.045 | 1.197 ± 0.029 | 1.104 ± 0.033 | 1.444 ± 0.039 | 0.461 ± 0.021 | 0.825 ± 0.023 | 0.0 ± 0.0
Ile | 5.488 ± 0.081 | 0.499 ± 0.019 | 3.597 ± 0.066 | 3.567 ± 0.055 | 1.637 ± 0.039 | 4.344 ± 0.065 | 1.193 ± 0.028 | 2.347 ± 0.048 | 1.729 ± 0.038 | 4.702 ± 0.072 | 1.045 ± 0.03 | 1.905 ± 0.044 | 2.473 ± 0.048 | 1.92 ± 0.045 | 3.645 ± 0.061 | 2.954 ± 0.051 | 2.798 ± 0.055 | 3.59 ± 0.054 | 0.596 ± 0.021 | 1.245 ± 0.028 | 0.0 ± 0.0
Lys | 3.957 ± 0.061 | 0.171 ± 0.013 | 2.006 ± 0.042 | 1.918 ± 0.045 | 0.84 ± 0.028 | 2.629 ± 0.056 | 0.709 ± 0.023 | 1.483 ± 0.034 | 1.49 ± 0.044 | 3.515 ± 0.057 | 0.793 ± 0.024 | 0.95 ± 0.03 | 1.964 ± 0.05 | 1.578 ± 0.037 | 2.456 ± 0.045 | 1.864 ± 0.042 | 2.068 ± 0.047 | 2.568 ± 0.049 | 0.353 ± 0.018 | 0.736 ± 0.024 | 0.0 ± 0.0
Leu | 11.942 ± 0.13 | 1.029 ± 0.025 | 6.797 ± 0.082 | 6.432 ± 0.074 | 3.999 ± 0.072 | 7.942 ± 0.087 | 2.183 ± 0.042 | 5.448 ± 0.076 | 4.071 ± 0.065 | 10.663 ± 0.132 | 2.743 ± 0.051 | 3.419 ± 0.048 | 5.381 ± 0.076 | 3.871 ± 0.061 | 6.663 ± 0.092 | 6.827 ± 0.078 | 5.95 ± 0.075 | 7.472 ± 0.09 | 1.263 ± 0.037 | 2.373 ± 0.045 | 0.0 ± 0.0
Met | 3.085 ± 0.051 | 0.161 ± 0.012 | 1.496 ± 0.034 | 1.304 ± 0.033 | 0.668 ± 0.026 | 1.963 ± 0.039 | 0.506 ± 0.019 | 1.243 ± 0.032 | 1.037 ± 0.027 | 2.547 ± 0.043 | 0.687 ± 0.024 | 0.852 ± 0.028 | 1.391 ± 0.031 | 0.983 ± 0.029 | 1.507 ± 0.034 | 1.698 ± 0.035 | 1.799 ± 0.041 | 1.731 ± 0.036 | 0.169 ± 0.012 | 0.4 ± 0.017 | 0.0 ± 0.0
Asn | 3.119 ± 0.052 | 0.281 ± 0.016 | 1.8 ± 0.039 | 1.728 ± 0.033 | 1.007 ± 0.033 | 2.592 ± 0.051 | 0.693 ± 0.022 | 1.472 ± 0.034 | 0.901 ± 0.028 | 3.273 ± 0.052 | 0.718 ± 0.022 | 0.981 ± 0.03 | 1.982 ± 0.039 | 1.237 ± 0.03 | 2.324 ± 0.045 | 1.466 ± 0.049 | 1.633 ± 0.038 | 2.12 ± 0.046 | 0.449 ± 0.021 | 0.834 ± 0.026 | 0.0 ± 0.0
Pro | 4.906 ± 0.066 | 0.321 ± 0.017 | 3.948 ± 0.06 | 4.17 ± 0.066 | 1.785 ± 0.038 | 3.981 ± 0.049 | 0.999 ± 0.026 | 2.155 ± 0.041 | 1.475 ± 0.043 | 4.804 ± 0.058 | 1.194 ± 0.033 | 1.35 ± 0.036 | 1.926 ± 0.043 | 1.72 ± 0.04 | 2.429 ± 0.046 | 2.564 ± 0.044 | 2.259 ± 0.038 | 4.16 ± 0.061 | 0.694 ± 0.022 | 1.289 ± 0.027 | 0.0 ± 0.0
Gln | 4.762 ± 0.072 | 0.332 ± 0.017 | 2.042 ± 0.039 | 2.018 ± 0.044 | 1.383 ± 0.031 | 2.964 ± 0.049 | 0.946 ± 0.028 | 1.905 ± 0.037 | 1.408 ± 0.039 | 4.325 ± 0.068 | 1.006 ± 0.023 | 1.045 ± 0.03 | 2.141 ± 0.044 | 2.14 ± 0.054 | 2.987 ± 0.061 | 2.443 ± 0.041 | 2.164 ± 0.04 | 3.06 ± 0.055 | 0.638 ± 0.02 | 0.986 ± 0.028 | 0.0 ± 0.0
Arg | 5.659 ± 0.076 | 0.616 ± 0.02 | 4.106 ± 0.051 | 4.525 ± 0.064 | 2.948 ± 0.052 | 3.903 ± 0.051 | 1.899 ± 0.044 | 3.805 ± 0.053 | 2.505 ± 0.048 | 7.83 ± 0.084 | 1.759 ± 0.034 | 2.092 ± 0.038 | 2.925 ± 0.051 | 3.383 ± 0.058 | 4.933 ± 0.078 | 3.428 ± 0.053 | 3.019 ± 0.052 | 4.707 ± 0.06 | 1.085 ± 0.03 | 2.228 ± 0.049 | 0.0 ± 0.0
Ser | 5.579 ± 0.061 | 0.432 ± 0.019 | 3.693 ± 0.064 | 3.44 ± 0.052 | 2.122 ± 0.041 | 5.326 ± 0.077 | 1.324 ± 0.031 | 2.881 ± 0.043 | 1.663 ± 0.034 | 6.401 ± 0.064 | 1.498 ± 0.036 | 1.676 ± 0.048 | 2.844 ± 0.046 | 2.253 ± 0.038 | 3.831 ± 0.053 | 3.317 ± 0.061 | 2.848 ± 0.053 | 4.069 ± 0.056 | 0.731 ± 0.024 | 1.37 ± 0.035 | 0.0 ± 0.0
Thr | 5.188 ± 0.063 | 0.472 ± 0.02 | 3.25 ± 0.06 | 2.882 ± 0.05 | 1.965 ± 0.042 | 4.754 ± 0.07 | 1.165 ± 0.035 | 2.523 ± 0.054 | 1.155 ± 0.03 | 6.576 ± 0.081 | 1.044 ± 0.029 | 1.382 ± 0.037 | 3.237 ± 0.046 | 1.813 ± 0.041 | 3.401 ± 0.062 | 2.792 ± 0.045 | 2.839 ± 0.054 | 4.116 ± 0.062 | 0.712 ± 0.024 | 1.408 ± 0.039 | 0.0 ± 0.0
Val | 7.373 ± 0.087 | 0.701 ± 0.024 | 4.581 ± 0.059 | 4.436 ± 0.06 | 2.787 ± 0.054 | 5.158 ± 0.078 | 1.463 ± 0.033 | 4.239 ± 0.068 | 2.378 ± 0.045 | 7.376 ± 0.091 | 1.958 ± 0.037 | 2.33 ± 0.042 | 3.383 ± 0.045 | 2.244 ± 0.047 | 4.453 ± 0.056 | 4.575 ± 0.057 | 4.287 ± 0.062 | 5.613 ± 0.078 | 0.858 ± 0.026 | 1.779 ± 0.038 | 0.0 ± 0.0
Trp | 1.047 ± 0.029 | 0.152 ± 0.01 | 0.681 ± 0.024 | 0.616 ± 0.023 | 0.604 ± 0.026 | 0.9 ± 0.028 | 0.408 ± 0.018 | 0.718 ± 0.026 | 0.43 ± 0.019 | 2.04 ± 0.046 | 0.379 ± 0.017 | 0.426 ± 0.017 | 0.668 ± 0.026 | 0.826 ± 0.029 | 0.967 ± 0.025 | 0.837 ± 0.028 | 0.625 ± 0.024 | 0.931 ± 0.028 | 0.206 ± 0.014 | 0.379 ± 0.017 | 0.0 ± 0.0
Tyr | 2.287 ± 0.045 | 0.275 ± 0.014 | 1.742 ± 0.048 | 1.466 ± 0.04 | 1.151 ± 0.031 | 2.15 ± 0.04 | 0.644 ± 0.024 | 1.144 ± 0.028 | 0.782 ± 0.023 | 2.894 ± 0.05 | 0.5 ± 0.021 | 0.799 ± 0.025 | 1.407 ± 0.033 | 1.218 ± 0.032 | 2.215 ± 0.049 | 1.498 ± 0.037 | 1.383 ± 0.034 | 1.69 ± 0.037 | 0.411 ± 0.018 | 0.819 ± 0.026 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 4075 proteins (1336245 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
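A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is taken as the error. The source only states "bootstrapping (×100) at the protein level", so using the standard deviation as the reported error, overlapping dipeptide counting, and the names `pair_per_mille` and `bootstrap_error` are all assumptions.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real proteome data.
toy = ["MALLA", "ALA", "GGA", "LLAL"]
print(pair_per_mille(toy, "AL"), bootstrap_error(toy, "AL"))
```

Resampling proteins rather than individual residues keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.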

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski