Amino acid dipeptide frequency for Oceanibaculum pacificum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random and with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
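The counting described above can be sketched in a few lines of Python. This is only an illustration, not the pipeline used by the database: it assumes one-letter amino acid codes, counts overlapping pairs within each protein (never across protein boundaries), and maps every non-standard letter to X (Xaa). The function name `dipeptide_permille` and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

# Frequency each dipeptide would have if all 400 were equally likely, in per mille.
UNIFORM_EXPECTATION = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted inside each protein, and any
    letter outside the 20 standard ones is treated as X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
toy = ["MAALAA", "AAGL"]
freqs = dipeptide_permille(toy)

# Dipeptides above the uniform expectation would be the "red" cells,
# those below it the "blue" ones.
over = sorted(p for p, f in freqs.items() if f > UNIFORM_EXPECTATION)
```

On a real proteome the same function would be fed the sequences from a FASTA file; the comparison against `UNIFORM_EXPECTATION` mirrors the 0.25% reference value mentioned in the text.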

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
18.272AlaAla: 18.272 ± 0.217
1.091AlaCys: 1.091 ± 0.037
7.268AlaAsp: 7.268 ± 0.105
8.224AlaGlu: 8.224 ± 0.102
4.134AlaPhe: 4.134 ± 0.07
11.463AlaGly: 11.463 ± 0.133
2.233AlaHis: 2.233 ± 0.051
6.49AlaIle: 6.49 ± 0.07
3.922AlaLys: 3.922 ± 0.064
15.138AlaLeu: 15.138 ± 0.16
3.705AlaMet: 3.705 ± 0.057
2.783AlaAsn: 2.783 ± 0.056
5.631AlaPro: 5.631 ± 0.092
4.083AlaGln: 4.083 ± 0.071
8.479AlaArg: 8.479 ± 0.106
5.603AlaSer: 5.603 ± 0.071
5.831AlaThr: 5.831 ± 0.072
8.936AlaVal: 8.936 ± 0.103
1.433AlaTrp: 1.433 ± 0.038
2.844AlaTyr: 2.844 ± 0.046
0.0AlaXaa: 0.0 ± 0.0
Cys
0.905CysAla: 0.905 ± 0.029
0.095CysCys: 0.095 ± 0.011
0.52CysAsp: 0.52 ± 0.024
0.371CysGlu: 0.371 ± 0.017
0.364CysPhe: 0.364 ± 0.018
0.929CysGly: 0.929 ± 0.029
0.238CysHis: 0.238 ± 0.017
0.367CysIle: 0.367 ± 0.017
0.171CysLys: 0.171 ± 0.012
0.868CysLeu: 0.868 ± 0.032
0.147CysMet: 0.147 ± 0.011
0.182CysAsn: 0.182 ± 0.014
0.471CysPro: 0.471 ± 0.022
0.263CysGln: 0.263 ± 0.016
0.651CysArg: 0.651 ± 0.026
0.364CysSer: 0.364 ± 0.018
0.411CysThr: 0.411 ± 0.02
0.599CysVal: 0.599 ± 0.024
0.113CysTrp: 0.113 ± 0.011
0.205CysTyr: 0.205 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
6.757AspAla: 6.757 ± 0.091
0.475AspCys: 0.475 ± 0.022
2.831AspAsp: 2.831 ± 0.052
3.131AspGlu: 3.131 ± 0.05
2.238AspPhe: 2.238 ± 0.049
5.281AspGly: 5.281 ± 0.091
1.118AspHis: 1.118 ± 0.03
3.433AspIle: 3.433 ± 0.055
1.787AspLys: 1.787 ± 0.044
5.969AspLeu: 5.969 ± 0.085
1.488AspMet: 1.488 ± 0.039
1.217AspAsn: 1.217 ± 0.038
3.546AspPro: 3.546 ± 0.063
1.629AspGln: 1.629 ± 0.04
4.549AspArg: 4.549 ± 0.069
2.62AspSer: 2.62 ± 0.059
2.573AspThr: 2.573 ± 0.056
3.804AspVal: 3.804 ± 0.061
1.022AspTrp: 1.022 ± 0.025
1.573AspTyr: 1.573 ± 0.042
0.0AspXaa: 0.0 ± 0.0
Glu
8.088GluAla: 8.088 ± 0.11
0.349GluCys: 0.349 ± 0.02
2.771GluAsp: 2.771 ± 0.054
3.329GluGlu: 3.329 ± 0.066
1.616GluPhe: 1.616 ± 0.042
4.513GluGly: 4.513 ± 0.067
1.099GluHis: 1.099 ± 0.034
3.515GluIle: 3.515 ± 0.06
2.394GluLys: 2.394 ± 0.054
5.212GluLeu: 5.212 ± 0.073
1.76GluMet: 1.76 ± 0.04
1.475GluAsn: 1.475 ± 0.036
2.622GluPro: 2.622 ± 0.051
2.283GluGln: 2.283 ± 0.044
4.906GluArg: 4.906 ± 0.09
2.509GluSer: 2.509 ± 0.048
3.462GluThr: 3.462 ± 0.052
3.711GluVal: 3.711 ± 0.058
0.609GluTrp: 0.609 ± 0.023
0.996GluTyr: 0.996 ± 0.035
0.0GluXaa: 0.0 ± 0.0
Phe
4.387PheAla: 4.387 ± 0.071
0.383PheCys: 0.383 ± 0.019
2.407PheAsp: 2.407 ± 0.047
1.935PheGlu: 1.935 ± 0.043
1.412PhePhe: 1.412 ± 0.044
3.573PheGly: 3.573 ± 0.056
0.699PheHis: 0.699 ± 0.024
1.638PheIle: 1.638 ± 0.038
0.962PheLys: 0.962 ± 0.032
3.65PheLeu: 3.65 ± 0.062
0.77PheMet: 0.77 ± 0.025
0.939PheAsn: 0.939 ± 0.029
1.604PhePro: 1.604 ± 0.04
1.152PheGln: 1.152 ± 0.032
2.236PheArg: 2.236 ± 0.045
1.893PheSer: 1.893 ± 0.042
1.972PheThr: 1.972 ± 0.046
2.6PheVal: 2.6 ± 0.056
0.539PheTrp: 0.539 ± 0.021
0.928PheTyr: 0.928 ± 0.026
0.0PheXaa: 0.0 ± 0.0
Gly
9.429GlyAla: 9.429 ± 0.105
0.875GlyCys: 0.875 ± 0.029
4.85GlyAsp: 4.85 ± 0.071
4.825GlyGlu: 4.825 ± 0.067
3.701GlyPhe: 3.701 ± 0.059
7.847GlyGly: 7.847 ± 0.129
1.838GlyHis: 1.838 ± 0.042
4.864GlyIle: 4.864 ± 0.07
3.343GlyLys: 3.343 ± 0.063
9.594GlyLeu: 9.594 ± 0.103
2.579GlyMet: 2.579 ± 0.05
2.079GlyAsn: 2.079 ± 0.049
3.699GlyPro: 3.699 ± 0.057
3.058GlyGln: 3.058 ± 0.054
6.284GlyArg: 6.284 ± 0.094
4.396GlySer: 4.396 ± 0.074
4.388GlyThr: 4.388 ± 0.078
6.456GlyVal: 6.456 ± 0.08
1.42GlyTrp: 1.42 ± 0.036
2.454GlyTyr: 2.454 ± 0.047
0.0GlyXaa: 0.0 ± 0.0
His
2.304HisAla: 2.304 ± 0.052
0.221HisCys: 0.221 ± 0.015
1.126HisAsp: 1.126 ± 0.036
0.974HisGlu: 0.974 ± 0.028
0.783HisPhe: 0.783 ± 0.027
1.79HisGly: 1.79 ± 0.051
0.537HisHis: 0.537 ± 0.025
0.944HisIle: 0.944 ± 0.03
0.494HisLys: 0.494 ± 0.021
2.116HisLeu: 2.116 ± 0.042
0.473HisMet: 0.473 ± 0.022
0.476HisAsn: 0.476 ± 0.021
1.374HisPro: 1.374 ± 0.042
0.567HisGln: 0.567 ± 0.021
1.428HisArg: 1.428 ± 0.039
0.793HisSer: 0.793 ± 0.028
0.789HisThr: 0.789 ± 0.028
1.338HisVal: 1.338 ± 0.035
0.32HisTrp: 0.32 ± 0.017
0.565HisTyr: 0.565 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
7.36IleAla: 7.36 ± 0.085
0.496IleCys: 0.496 ± 0.021
3.89IleAsp: 3.89 ± 0.066
3.602IleGlu: 3.602 ± 0.058
1.762IlePhe: 1.762 ± 0.049
5.292IleGly: 5.292 ± 0.079
0.92IleHis: 0.92 ± 0.029
2.074IleIle: 2.074 ± 0.048
1.293IleLys: 1.293 ± 0.036
5.009IleLeu: 5.009 ± 0.065
0.999IleMet: 0.999 ± 0.032
1.259IleAsn: 1.259 ± 0.034
2.339IlePro: 2.339 ± 0.046
1.288IleGln: 1.288 ± 0.03
3.357IleArg: 3.357 ± 0.052
2.558IleSer: 2.558 ± 0.048
2.394IleThr: 2.394 ± 0.062
4.431IleVal: 4.431 ± 0.067
0.577IleTrp: 0.577 ± 0.024
1.118IleTyr: 1.118 ± 0.034
0.0IleXaa: 0.0 ± 0.0
Lys
4.171LysAla: 4.171 ± 0.068
0.144LysCys: 0.144 ± 0.01
1.671LysAsp: 1.671 ± 0.042
1.697LysGlu: 1.697 ± 0.052
0.855LysPhe: 0.855 ± 0.03
2.731LysGly: 2.731 ± 0.064
0.579LysHis: 0.579 ± 0.024
1.673LysIle: 1.673 ± 0.04
1.274LysLys: 1.274 ± 0.045
3.513LysLeu: 3.513 ± 0.063
0.773LysMet: 0.773 ± 0.029
0.774LysAsn: 0.774 ± 0.029
2.059LysPro: 2.059 ± 0.042
1.164LysGln: 1.164 ± 0.032
2.308LysArg: 2.308 ± 0.048
1.588LysSer: 1.588 ± 0.043
1.736LysThr: 1.736 ± 0.044
2.377LysVal: 2.377 ± 0.056
0.296LysTrp: 0.296 ± 0.017
0.607LysTyr: 0.607 ± 0.024
0.0LysXaa: 0.0 ± 0.0
Leu
14.902LeuAla: 14.902 ± 0.171
0.887LeuCys: 0.887 ± 0.029
6.257LeuAsp: 6.257 ± 0.09
5.686LeuGlu: 5.686 ± 0.071
3.799LeuPhe: 3.799 ± 0.071
8.908LeuGly: 8.908 ± 0.102
1.953LeuHis: 1.953 ± 0.048
5.314LeuIle: 5.314 ± 0.074
3.429LeuLys: 3.429 ± 0.056
11.3LeuLeu: 11.3 ± 0.149
2.586LeuMet: 2.586 ± 0.048
2.629LeuAsn: 2.629 ± 0.049
6.271LeuPro: 6.271 ± 0.08
3.073LeuGln: 3.073 ± 0.05
7.38LeuArg: 7.38 ± 0.096
6.167LeuSer: 6.167 ± 0.09
6.04LeuThr: 6.04 ± 0.088
7.568LeuVal: 7.568 ± 0.092
1.216LeuTrp: 1.216 ± 0.036
2.428LeuTyr: 2.428 ± 0.047
0.0LeuXaa: 0.0 ± 0.0
Met
3.501MetAla: 3.501 ± 0.064
0.136MetCys: 0.136 ± 0.011
1.197MetAsp: 1.197 ± 0.037
1.217MetGlu: 1.217 ± 0.032
0.693MetPhe: 0.693 ± 0.027
1.992MetGly: 1.992 ± 0.051
0.427MetHis: 0.427 ± 0.022
1.319MetIle: 1.319 ± 0.034
0.986MetLys: 0.986 ± 0.03
2.809MetLeu: 2.809 ± 0.054
0.693MetMet: 0.693 ± 0.028
0.71MetAsn: 0.71 ± 0.025
1.582MetPro: 1.582 ± 0.037
0.912MetGln: 0.912 ± 0.028
1.822MetArg: 1.822 ± 0.038
1.607MetSer: 1.607 ± 0.039
1.759MetThr: 1.759 ± 0.039
1.854MetVal: 1.854 ± 0.037
0.189MetTrp: 0.189 ± 0.014
0.37MetTyr: 0.37 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
2.949AsnAla: 2.949 ± 0.058
0.224AsnCys: 0.224 ± 0.014
1.319AsnAsp: 1.319 ± 0.044
1.169AsnGlu: 1.169 ± 0.039
0.923AsnPhe: 0.923 ± 0.032
2.177AsnGly: 2.177 ± 0.052
0.472AsnHis: 0.472 ± 0.02
1.317AsnIle: 1.317 ± 0.039
0.709AsnLys: 0.709 ± 0.027
2.57AsnLeu: 2.57 ± 0.059
0.589AsnMet: 0.589 ± 0.024
0.683AsnAsn: 0.683 ± 0.028
1.781AsnPro: 1.781 ± 0.047
0.756AsnGln: 0.756 ± 0.025
1.79AsnArg: 1.79 ± 0.035
1.117AsnSer: 1.117 ± 0.036
1.134AsnThr: 1.134 ± 0.037
1.701AsnVal: 1.701 ± 0.043
0.379AsnTrp: 0.379 ± 0.02
0.601AsnTyr: 0.601 ± 0.025
0.0AsnXaa: 0.0 ± 0.0
Pro
7.218ProAla: 7.218 ± 0.11
0.308ProCys: 0.308 ± 0.018
3.61ProAsp: 3.61 ± 0.063
3.65ProGlu: 3.65 ± 0.06
1.97ProPhe: 1.97 ± 0.042
4.817ProGly: 4.817 ± 0.08
0.988ProHis: 0.988 ± 0.031
2.364ProIle: 2.364 ± 0.048
1.681ProLys: 1.681 ± 0.046
5.121ProLeu: 5.121 ± 0.079
1.317ProMet: 1.317 ± 0.033
1.271ProAsn: 1.271 ± 0.029
2.827ProPro: 2.827 ± 0.066
1.625ProGln: 1.625 ± 0.043
2.865ProArg: 2.865 ± 0.044
2.516ProSer: 2.516 ± 0.04
2.645ProThr: 2.645 ± 0.043
4.416ProVal: 4.416 ± 0.065
0.67ProTrp: 0.67 ± 0.028
1.293ProTyr: 1.293 ± 0.035
0.0ProXaa: 0.0 ± 0.0
Gln
4.774GlnAla: 4.774 ± 0.072
0.198GlnCys: 0.198 ± 0.015
1.5GlnAsp: 1.5 ± 0.04
1.639GlnGlu: 1.639 ± 0.042
0.967GlnPhe: 0.967 ± 0.028
2.771GlnGly: 2.771 ± 0.054
0.687GlnHis: 0.687 ± 0.026
1.813GlnIle: 1.813 ± 0.034
1.066GlnLys: 1.066 ± 0.032
2.824GlnLeu: 2.824 ± 0.046
0.957GlnMet: 0.957 ± 0.024
0.817GlnAsn: 0.817 ± 0.03
2.058GlnPro: 2.058 ± 0.047
1.452GlnGln: 1.452 ± 0.047
2.665GlnArg: 2.665 ± 0.054
1.582GlnSer: 1.582 ± 0.039
1.635GlnThr: 1.635 ± 0.048
2.373GlnVal: 2.373 ± 0.043
0.321GlnTrp: 0.321 ± 0.018
0.58GlnTyr: 0.58 ± 0.026
0.0GlnXaa: 0.0 ± 0.0
Arg
7.713ArgAla: 7.713 ± 0.087
0.581ArgCys: 0.581 ± 0.026
4.18ArgAsp: 4.18 ± 0.062
4.023ArgGlu: 4.023 ± 0.056
2.791ArgPhe: 2.791 ± 0.05
4.823ArgGly: 4.823 ± 0.074
1.702ArgHis: 1.702 ± 0.04
4.069ArgIle: 4.069 ± 0.051
2.301ArgLys: 2.301 ± 0.05
8.899ArgLeu: 8.899 ± 0.118
1.844ArgMet: 1.844 ± 0.042
1.729ArgAsn: 1.729 ± 0.039
3.793ArgPro: 3.793 ± 0.063
2.982ArgGln: 2.982 ± 0.053
6.159ArgArg: 6.159 ± 0.097
3.005ArgSer: 3.005 ± 0.054
3.081ArgThr: 3.081 ± 0.05
4.44ArgVal: 4.44 ± 0.071
0.95ArgTrp: 0.95 ± 0.03
1.844ArgTyr: 1.844 ± 0.044
0.0ArgXaa: 0.0 ± 0.0
Ser
5.683SerAla: 5.683 ± 0.069
0.397SerCys: 0.397 ± 0.021
2.6SerAsp: 2.6 ± 0.055
2.39SerGlu: 2.39 ± 0.046
2.143SerPhe: 2.143 ± 0.046
5.027SerGly: 5.027 ± 0.081
0.967SerHis: 0.967 ± 0.028
2.53SerIle: 2.53 ± 0.046
1.351SerLys: 1.351 ± 0.037
5.342SerLeu: 5.342 ± 0.075
1.215SerMet: 1.215 ± 0.032
1.215SerAsn: 1.215 ± 0.04
2.53SerPro: 2.53 ± 0.054
1.514SerGln: 1.514 ± 0.038
3.193SerArg: 3.193 ± 0.047
2.351SerSer: 2.351 ± 0.054
2.421SerThr: 2.421 ± 0.051
3.722SerVal: 3.722 ± 0.059
0.704SerTrp: 0.704 ± 0.026
1.323SerTyr: 1.323 ± 0.036
0.0SerXaa: 0.0 ± 0.0
Thr
6.183ThrAla: 6.183 ± 0.078
0.389ThrCys: 0.389 ± 0.019
2.781ThrAsp: 2.781 ± 0.064
2.75ThrGlu: 2.75 ± 0.051
1.568ThrPhe: 1.568 ± 0.036
5.266ThrGly: 5.266 ± 0.07
0.945ThrHis: 0.945 ± 0.029
2.776ThrIle: 2.776 ± 0.067
1.316ThrLys: 1.316 ± 0.035
6.131ThrLeu: 6.131 ± 0.078
1.13ThrMet: 1.13 ± 0.036
1.173ThrAsn: 1.173 ± 0.037
3.322ThrPro: 3.322 ± 0.046
1.481ThrGln: 1.481 ± 0.038
3.008ThrArg: 3.008 ± 0.055
2.242ThrSer: 2.242 ± 0.046
2.407ThrThr: 2.407 ± 0.06
4.406ThrVal: 4.406 ± 0.067
0.517ThrTrp: 0.517 ± 0.021
1.195ThrTyr: 1.195 ± 0.038
0.0ThrXaa: 0.0 ± 0.0
Val
9.144ValAla: 9.144 ± 0.099
0.622ValCys: 0.622 ± 0.023
4.1ValAsp: 4.1 ± 0.062
4.734ValGlu: 4.734 ± 0.07
2.588ValPhe: 2.588 ± 0.054
5.656ValGly: 5.656 ± 0.075
1.19ValHis: 1.19 ± 0.037
3.932ValIle: 3.932 ± 0.066
2.404ValLys: 2.404 ± 0.049
7.575ValLeu: 7.575 ± 0.088
1.887ValMet: 1.887 ± 0.041
1.988ValAsn: 1.988 ± 0.047
3.812ValPro: 3.812 ± 0.062
2.098ValGln: 2.098 ± 0.046
4.611ValArg: 4.611 ± 0.057
3.864ValSer: 3.864 ± 0.068
4.557ValThr: 4.557 ± 0.069
5.369ValVal: 5.369 ± 0.084
0.846ValTrp: 0.846 ± 0.031
1.525ValTyr: 1.525 ± 0.037
0.0ValXaa: 0.0 ± 0.0
Trp
1.184TrpAla: 1.184 ± 0.033
0.124TrpCys: 0.124 ± 0.01
0.644TrpAsp: 0.644 ± 0.024
0.56TrpGlu: 0.56 ± 0.022
0.462TrpPhe: 0.462 ± 0.018
0.894TrpGly: 0.894 ± 0.029
0.325TrpHis: 0.325 ± 0.016
0.598TrpIle: 0.598 ± 0.026
0.422TrpLys: 0.422 ± 0.02
1.616TrpLeu: 1.616 ± 0.042
0.344TrpMet: 0.344 ± 0.018
0.382TrpAsn: 0.382 ± 0.019
0.716TrpPro: 0.716 ± 0.024
0.578TrpGln: 0.578 ± 0.019
1.214TrpArg: 1.214 ± 0.036
0.672TrpSer: 0.672 ± 0.024
0.63TrpThr: 0.63 ± 0.025
0.832TrpVal: 0.832 ± 0.029
0.199TrpTrp: 0.199 ± 0.013
0.285TrpTyr: 0.285 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.638TyrAla: 2.638 ± 0.045
0.239TyrCys: 0.239 ± 0.014
1.52TyrAsp: 1.52 ± 0.039
1.315TyrGlu: 1.315 ± 0.038
0.91TyrPhe: 0.91 ± 0.026
2.229TyrGly: 2.229 ± 0.048
0.494TyrHis: 0.494 ± 0.018
0.991TyrIle: 0.991 ± 0.028
0.689TyrLys: 0.689 ± 0.027
2.535TyrLeu: 2.535 ± 0.047
0.521TyrMet: 0.521 ± 0.02
0.576TyrAsn: 0.576 ± 0.023
1.137TyrPro: 1.137 ± 0.034
0.72TyrGln: 0.72 ± 0.024
2.002TyrArg: 2.002 ± 0.038
1.172TyrSer: 1.172 ± 0.033
1.117TyrThr: 1.117 ± 0.036
1.569TyrVal: 1.569 ± 0.041
0.36TyrTrp: 0.36 ± 0.019
0.653TyrTyr: 0.653 ± 0.024
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3560 proteins (1140145 amino acids)
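For readers who want to work with the listing above programmatically, the following sketch parses entries of the form `AlaAla: 18.272 ± 0.217` and converts the per mille values to fractions. The regular expression, the helper names and the choice to ignore everything before the dipeptide name are assumptions made for this illustration; the CSV file offered by the database is likely a more convenient source.

```python
import re

# Two three-letter residue names, then "value ± error" (both in per mille).
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)


def parse_entry(line):
    """Return (first, second, value, error) or None if the line is not an entry."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# A few lines copied from the listing; "Ala" is a row label and is skipped.
rows = ["Ala", "18.272AlaAla: 18.272 ± 0.217", "0.095CysCys: 0.095 ± 0.011"]
parsed = [entry for entry in map(parse_entry, rows) if entry is not None]

# Ratio to the 2.5 per mille uniform expectation for each parsed dipeptide.
enrichment = {first + second: value / 2.5 for first, second, value, _ in parsed}
```

Here AlaAla comes out several times above the uniform expectation and CysCys far below it, which is the kind of deviation the red and blue markings are meant to highlight.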

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
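A protein-level bootstrap of this kind can be sketched as follows. The page does not spell out the exact procedure, so several points are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread is reported as the sample standard deviation. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`.

    Each round draws len(proteins) proteins with replacement, so a protein
    is always kept or dropped as a whole (resampling at the protein level).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[pair] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
toy = ["MAALAA", "AAGL", "MKLVAA", "GGAL"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual dipeptides keeps the correlations within a protein intact, which is presumably why the error is estimated at that level.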

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski