Amino acid dipeptide frequency for Oceanisphaera avium

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
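The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by the database: it assumes that dipeptides are counted as overlapping adjacent pairs within each protein (never across protein boundaries), that any residue outside the 20 standard one-letter codes is mapped to X (Xaa), and that "expected" means the uniform value of 2.5‰ (0.25%). The names `dipeptide_per_mille` and `classify` are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (same order as the table).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything that is not a standard residue to X (Xaa).
        cleaned = [aa if aa in AMINO_ACIDS else "X" for aa in seq.upper()]
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


if __name__ == "__main__":
    toy_proteome = ["MKLLA", "MALLV"]  # made-up sequences, for illustration only
    for dp, freq in sorted(dipeptide_per_mille(toy_proteome).items()):
        print(f"{dp}: {freq:.1f} per mille ({classify(freq)})")
```

Multiplying a per mille value by 10⁻³ gives the fraction, so the AlaAla value of 10.111‰ in the table corresponds to a fraction of about 0.0101.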

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Within each group the second residue runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 10.111 ± 0.136
AlaCys: 1.134 ± 0.038
AlaAsp: 5.058 ± 0.079
AlaGlu: 6.252 ± 0.112
AlaPhe: 3.413 ± 0.064
AlaGly: 7.168 ± 0.114
AlaHis: 2.441 ± 0.057
AlaIle: 6.055 ± 0.089
AlaLys: 4.99 ± 0.085
AlaLeu: 12.583 ± 0.157
AlaMet: 2.807 ± 0.067
AlaAsn: 3.753 ± 0.07
AlaPro: 4.106 ± 0.096
AlaGln: 5.559 ± 0.097
AlaArg: 5.52 ± 0.092
AlaSer: 5.809 ± 0.081
AlaThr: 4.745 ± 0.082
AlaVal: 6.196 ± 0.105
AlaTrp: 1.304 ± 0.04
AlaTyr: 2.414 ± 0.048
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.926 ± 0.039
CysCys: 0.145 ± 0.014
CysAsp: 0.514 ± 0.027
CysGlu: 0.581 ± 0.027
CysPhe: 0.389 ± 0.022
CysGly: 0.87 ± 0.032
CysHis: 0.394 ± 0.024
CysIle: 0.515 ± 0.028
CysLys: 0.341 ± 0.022
CysLeu: 1.026 ± 0.037
CysMet: 0.207 ± 0.018
CysAsn: 0.282 ± 0.019
CysPro: 0.523 ± 0.025
CysGln: 0.553 ± 0.03
CysArg: 0.479 ± 0.025
CysSer: 0.612 ± 0.027
CysThr: 0.453 ± 0.025
CysVal: 0.661 ± 0.032
CysTrp: 0.139 ± 0.013
CysTyr: 0.349 ± 0.02
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.859 ± 0.099
AspCys: 0.489 ± 0.024
AspAsp: 2.611 ± 0.065
AspGlu: 3.622 ± 0.077
AspPhe: 2.344 ± 0.055
AspGly: 3.398 ± 0.071
AspHis: 1.082 ± 0.035
AspIle: 3.613 ± 0.072
AspLys: 2.782 ± 0.061
AspLeu: 5.167 ± 0.078
AspMet: 1.474 ± 0.043
AspAsn: 2.126 ± 0.055
AspPro: 1.987 ± 0.058
AspGln: 1.789 ± 0.045
AspArg: 2.095 ± 0.058
AspSer: 2.934 ± 0.063
AspThr: 2.778 ± 0.066
AspVal: 3.109 ± 0.063
AspTrp: 0.848 ± 0.033
AspTyr: 1.925 ± 0.05
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.572 ± 0.089
GluCys: 0.464 ± 0.026
GluAsp: 2.559 ± 0.073
GluGlu: 3.153 ± 0.088
GluPhe: 2.102 ± 0.047
GluGly: 3.421 ± 0.069
GluHis: 1.906 ± 0.047
GluIle: 2.934 ± 0.061
GluLys: 2.571 ± 0.062
GluLeu: 7.769 ± 0.113
GluMet: 1.395 ± 0.043
GluAsn: 1.791 ± 0.055
GluPro: 2.318 ± 0.064
GluGln: 5.486 ± 0.11
GluArg: 3.853 ± 0.075
GluSer: 2.67 ± 0.068
GluThr: 2.505 ± 0.057
GluVal: 4.254 ± 0.075
GluTrp: 0.623 ± 0.03
GluTyr: 1.544 ± 0.041
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.626 ± 0.067
PheCys: 0.472 ± 0.028
PheAsp: 2.317 ± 0.05
PheGlu: 2.123 ± 0.059
PhePhe: 1.521 ± 0.048
PheGly: 3.036 ± 0.069
PheHis: 0.76 ± 0.033
PheIle: 2.639 ± 0.067
PheLys: 1.687 ± 0.05
PheLeu: 3.166 ± 0.066
PheMet: 1.005 ± 0.037
PheAsn: 1.86 ± 0.051
PhePro: 1.313 ± 0.04
PheGln: 1.037 ± 0.036
PheArg: 1.536 ± 0.048
PheSer: 3.203 ± 0.075
PheThr: 2.096 ± 0.051
PheVal: 2.34 ± 0.064
PheTrp: 0.548 ± 0.026
PheTyr: 1.262 ± 0.039
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.198 ± 0.102
GlyCys: 0.903 ± 0.038
GlyAsp: 3.378 ± 0.065
GlyGlu: 4.578 ± 0.089
GlyPhe: 3.178 ± 0.061
GlyGly: 4.89 ± 0.096
GlyHis: 1.809 ± 0.047
GlyIle: 4.42 ± 0.092
GlyLys: 3.424 ± 0.061
GlyLeu: 8.149 ± 0.119
GlyMet: 2.004 ± 0.053
GlyAsn: 2.156 ± 0.058
GlyPro: 2.13 ± 0.056
GlyGln: 3.537 ± 0.077
GlyArg: 3.68 ± 0.08
GlySer: 3.778 ± 0.07
GlyThr: 3.261 ± 0.072
GlyVal: 5.271 ± 0.09
GlyTrp: 0.998 ± 0.036
GlyTyr: 2.244 ± 0.057
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.21 ± 0.052
HisCys: 0.4 ± 0.022
HisAsp: 1.119 ± 0.041
HisGlu: 1.153 ± 0.044
HisPhe: 1.155 ± 0.041
HisGly: 1.758 ± 0.049
HisHis: 0.864 ± 0.036
HisIle: 1.469 ± 0.039
HisLys: 0.963 ± 0.033
HisLeu: 2.808 ± 0.056
HisMet: 0.498 ± 0.027
HisAsn: 0.909 ± 0.036
HisPro: 1.347 ± 0.039
HisGln: 1.513 ± 0.046
HisArg: 1.205 ± 0.038
HisSer: 1.567 ± 0.045
HisThr: 1.321 ± 0.039
HisVal: 1.258 ± 0.041
HisTrp: 0.536 ± 0.026
HisTyr: 1.033 ± 0.039
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.128 ± 0.102
IleCys: 0.594 ± 0.028
IleAsp: 3.408 ± 0.076
IleGlu: 3.696 ± 0.073
IlePhe: 1.868 ± 0.05
IleGly: 4.309 ± 0.082
IleHis: 1.094 ± 0.045
IleIle: 3.356 ± 0.077
IleLys: 2.885 ± 0.055
IleLeu: 5.047 ± 0.087
IleMet: 1.366 ± 0.041
IleAsn: 2.698 ± 0.057
IlePro: 2.26 ± 0.063
IleGln: 1.96 ± 0.049
IleArg: 2.793 ± 0.061
IleSer: 3.975 ± 0.068
IleThr: 3.368 ± 0.057
IleVal: 3.468 ± 0.073
IleTrp: 0.608 ± 0.027
IleTyr: 1.479 ± 0.041
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.539 ± 0.08
LysCys: 0.228 ± 0.016
LysAsp: 2.339 ± 0.065
LysGlu: 2.747 ± 0.06
LysPhe: 1.099 ± 0.036
LysGly: 3.008 ± 0.065
LysHis: 0.987 ± 0.034
LysIle: 2.201 ± 0.055
LysLys: 2.067 ± 0.061
LysLeu: 4.679 ± 0.09
LysMet: 1.064 ± 0.037
LysAsn: 1.526 ± 0.046
LysPro: 2.118 ± 0.058
LysGln: 2.343 ± 0.057
LysArg: 2.62 ± 0.065
LysSer: 2.301 ± 0.064
LysThr: 2.157 ± 0.052
LysVal: 3.566 ± 0.073
LysTrp: 0.424 ± 0.023
LysTyr: 0.959 ± 0.036
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 14.084 ± 0.167
LeuCys: 1.193 ± 0.042
LeuAsp: 6.246 ± 0.084
LeuGlu: 6.225 ± 0.102
LeuPhe: 4.106 ± 0.088
LeuGly: 8.067 ± 0.126
LeuHis: 2.526 ± 0.058
LeuIle: 6.098 ± 0.099
LeuLys: 4.777 ± 0.082
LeuLeu: 13.855 ± 0.232
LeuMet: 2.96 ± 0.061
LeuAsn: 4.587 ± 0.079
LeuPro: 5.482 ± 0.094
LeuGln: 4.631 ± 0.08
LeuArg: 5.65 ± 0.091
LeuSer: 8.703 ± 0.13
LeuThr: 6.689 ± 0.096
LeuVal: 8.131 ± 0.116
LeuTrp: 1.346 ± 0.049
LeuTyr: 2.753 ± 0.062
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.014 ± 0.072
MetCys: 0.162 ± 0.014
MetAsp: 1.15 ± 0.042
MetGlu: 1.19 ± 0.044
MetPhe: 0.715 ± 0.031
MetGly: 1.78 ± 0.051
MetHis: 0.483 ± 0.024
MetIle: 1.258 ± 0.036
MetLys: 1.095 ± 0.036
MetLeu: 2.885 ± 0.062
MetMet: 0.746 ± 0.032
MetAsn: 0.943 ± 0.033
MetPro: 1.249 ± 0.043
MetGln: 1.199 ± 0.038
MetArg: 1.228 ± 0.04
MetSer: 1.905 ± 0.052
MetThr: 1.509 ± 0.039
MetVal: 1.757 ± 0.047
MetTrp: 0.223 ± 0.016
MetTyr: 0.455 ± 0.026
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.427 ± 0.065
AsnCys: 0.299 ± 0.021
AsnAsp: 1.888 ± 0.056
AsnGlu: 2.167 ± 0.054
AsnPhe: 1.214 ± 0.04
AsnGly: 2.442 ± 0.062
AsnHis: 0.87 ± 0.034
AsnIle: 2.295 ± 0.054
AsnLys: 1.869 ± 0.05
AsnLeu: 3.73 ± 0.069
AsnMet: 0.909 ± 0.039
AsnAsn: 1.673 ± 0.056
AsnPro: 1.88 ± 0.052
AsnGln: 1.908 ± 0.051
AsnArg: 1.726 ± 0.046
AsnSer: 2.066 ± 0.052
AsnThr: 1.997 ± 0.054
AsnVal: 2.078 ± 0.055
AsnTrp: 0.538 ± 0.025
AsnTyr: 1.022 ± 0.036
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.111 ± 0.081
ProCys: 0.36 ± 0.021
ProAsp: 2.255 ± 0.053
ProGlu: 3.22 ± 0.065
ProPhe: 1.722 ± 0.05
ProGly: 2.705 ± 0.066
ProHis: 1.118 ± 0.041
ProIle: 2.325 ± 0.061
ProLys: 1.899 ± 0.057
ProLeu: 5.319 ± 0.094
ProMet: 0.99 ± 0.035
ProAsn: 1.712 ± 0.051
ProPro: 1.487 ± 0.05
ProGln: 2.007 ± 0.046
ProArg: 1.75 ± 0.051
ProSer: 2.763 ± 0.07
ProThr: 2.145 ± 0.053
ProVal: 3.166 ± 0.066
ProTrp: 0.64 ± 0.026
ProTyr: 1.238 ± 0.038
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 6.475 ± 0.093
GlnCys: 0.403 ± 0.021
GlnAsp: 2.53 ± 0.064
GlnGlu: 2.743 ± 0.066
GlnPhe: 1.646 ± 0.047
GlnGly: 4.047 ± 0.085
GlnHis: 1.595 ± 0.045
GlnIle: 2.186 ± 0.054
GlnLys: 1.457 ± 0.049
GlnLeu: 7.012 ± 0.118
GlnMet: 1.018 ± 0.036
GlnAsn: 1.12 ± 0.035
GlnPro: 2.236 ± 0.054
GlnGln: 4.076 ± 0.125
GlnArg: 3.015 ± 0.073
GlnSer: 2.608 ± 0.065
GlnThr: 2.28 ± 0.053
GlnVal: 4.12 ± 0.072
GlnTrp: 0.816 ± 0.034
GlnTyr: 1.235 ± 0.044
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.668 ± 0.082
ArgCys: 0.502 ± 0.027
ArgAsp: 2.539 ± 0.065
ArgGlu: 3.272 ± 0.072
ArgPhe: 2.542 ± 0.058
ArgGly: 2.88 ± 0.066
ArgHis: 1.622 ± 0.051
ArgIle: 3.118 ± 0.068
ArgLys: 2.014 ± 0.053
ArgLeu: 6.844 ± 0.122
ArgMet: 1.265 ± 0.044
ArgAsn: 1.746 ± 0.047
ArgPro: 2.063 ± 0.051
ArgGln: 3.038 ± 0.065
ArgArg: 3.117 ± 0.071
ArgSer: 2.649 ± 0.06
ArgThr: 2.283 ± 0.063
ArgVal: 3.47 ± 0.071
ArgTrp: 0.769 ± 0.031
ArgTyr: 1.857 ± 0.052
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.305 ± 0.087
SerCys: 0.568 ± 0.028
SerAsp: 3.099 ± 0.07
SerGlu: 3.636 ± 0.066
SerPhe: 2.463 ± 0.056
SerGly: 4.727 ± 0.079
SerHis: 1.72 ± 0.049
SerIle: 2.965 ± 0.066
SerLys: 2.215 ± 0.056
SerLeu: 7.396 ± 0.107
SerMet: 1.413 ± 0.04
SerAsn: 1.857 ± 0.055
SerPro: 2.665 ± 0.057
SerGln: 3.536 ± 0.074
SerArg: 3.34 ± 0.062
SerSer: 3.774 ± 0.073
SerThr: 2.763 ± 0.066
SerVal: 4.034 ± 0.078
SerTrp: 0.854 ± 0.035
SerTyr: 1.647 ± 0.051
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.613 ± 0.077
ThrCys: 0.419 ± 0.024
ThrAsp: 2.618 ± 0.057
ThrGlu: 2.75 ± 0.069
ThrPhe: 1.696 ± 0.043
ThrGly: 3.936 ± 0.076
ThrHis: 1.257 ± 0.042
ThrIle: 2.704 ± 0.059
ThrLys: 1.801 ± 0.044
ThrLeu: 7.124 ± 0.093
ThrMet: 1.067 ± 0.037
ThrAsn: 1.595 ± 0.047
ThrPro: 3.035 ± 0.053
ThrGln: 2.564 ± 0.065
ThrArg: 2.629 ± 0.057
ThrSer: 2.74 ± 0.065
ThrThr: 2.528 ± 0.068
ThrVal: 3.414 ± 0.077
ThrTrp: 0.658 ± 0.029
ThrTyr: 1.067 ± 0.035
ThrXaa: 0.0 ± 0.0

Val
ValAla: 7.301 ± 0.112
ValCys: 0.691 ± 0.029
ValAsp: 3.507 ± 0.072
ValGlu: 3.942 ± 0.066
ValPhe: 2.599 ± 0.06
ValGly: 4.718 ± 0.096
ValHis: 1.367 ± 0.044
ValIle: 4.335 ± 0.081
ValLys: 2.863 ± 0.067
ValLeu: 7.663 ± 0.11
ValMet: 1.886 ± 0.044
ValAsn: 2.498 ± 0.067
ValPro: 2.739 ± 0.06
ValGln: 2.414 ± 0.06
ValArg: 3.448 ± 0.065
ValSer: 4.589 ± 0.075
ValThr: 3.726 ± 0.072
ValVal: 5.126 ± 0.086
ValTrp: 0.814 ± 0.031
ValTyr: 1.715 ± 0.045
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.975 ± 0.035
TrpCys: 0.177 ± 0.015
TrpAsp: 0.483 ± 0.027
TrpGlu: 0.482 ± 0.023
TrpPhe: 0.572 ± 0.024
TrpGly: 0.764 ± 0.039
TrpHis: 0.439 ± 0.028
TrpIle: 0.581 ± 0.026
TrpLys: 0.287 ± 0.019
TrpLeu: 2.498 ± 0.066
TrpMet: 0.295 ± 0.019
TrpAsn: 0.334 ± 0.02
TrpPro: 0.651 ± 0.03
TrpGln: 1.397 ± 0.049
TrpArg: 0.864 ± 0.03
TrpSer: 0.707 ± 0.034
TrpThr: 0.453 ± 0.024
TrpVal: 0.815 ± 0.032
TrpTrp: 0.217 ± 0.018
TrpTyr: 0.331 ± 0.021
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.279 ± 0.054
TyrCys: 0.357 ± 0.022
TyrAsp: 1.416 ± 0.043
TyrGlu: 1.358 ± 0.039
TyrPhe: 1.204 ± 0.042
TyrGly: 2.059 ± 0.053
TyrHis: 0.753 ± 0.035
TyrIle: 1.292 ± 0.043
TyrLys: 0.985 ± 0.041
TyrLeu: 3.386 ± 0.063
TyrMet: 0.576 ± 0.026
TyrAsn: 0.83 ± 0.033
TyrPro: 1.339 ± 0.042
TyrGln: 2.076 ± 0.059
TyrArg: 1.711 ± 0.048
TyrSer: 1.624 ± 0.044
TyrThr: 1.161 ± 0.042
TyrVal: 1.707 ± 0.045
TyrTrp: 0.416 ± 0.027
TyrTyr: 0.904 ± 0.035
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2484 proteins (797316 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
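A protein-level bootstrap of this kind might look like the sketch below. It is an assumption-laden illustration rather than the database's actual procedure: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 resamples, and that the reported error is the standard deviation of those estimates. The function names `pair_counts` and `bootstrap_dipeptide` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_dipeptide(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: proteins (not individual residues) are the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    # Pre-compute per-protein (hits, total pairs) once; resampling reuses them.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[dipeptide], sum(counts.values())))

    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKLLA", "MALLV", "LLLKA", "AVKLM"]  # made-up sequences
    mean, err = bootstrap_dipeptide(toy_proteome, "LL")
    print(f"LeuLeu: {mean:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins keeps the pairs within a protein together, so the spread reflects variation between proteins rather than between individual residue pairs.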

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski