Amino acid dipepetide frequency for Pseudooceanicola marinus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.419AlaAla: 17.419 ± 0.163
1.113AlaCys: 1.113 ± 0.031
6.861AlaAsp: 6.861 ± 0.093
9.429AlaGlu: 9.429 ± 0.109
4.284AlaPhe: 4.284 ± 0.06
11.417AlaGly: 11.417 ± 0.113
2.323AlaHis: 2.323 ± 0.046
5.412AlaIle: 5.412 ± 0.07
3.268AlaLys: 3.268 ± 0.056
14.859AlaLeu: 14.859 ± 0.142
3.812AlaMet: 3.812 ± 0.054
2.39AlaAsn: 2.39 ± 0.038
6.827AlaPro: 6.827 ± 0.098
4.54AlaGln: 4.54 ± 0.065
9.979AlaArg: 9.979 ± 0.116
5.599AlaSer: 5.599 ± 0.069
6.122AlaThr: 6.122 ± 0.074
8.252AlaVal: 8.252 ± 0.081
1.603AlaTrp: 1.603 ± 0.036
2.458AlaTyr: 2.458 ± 0.04
0.0AlaXaa: 0.0 ± 0.0
Cys
1.098CysAla: 1.098 ± 0.028
0.112CysCys: 0.112 ± 0.01
0.625CysAsp: 0.625 ± 0.022
0.422CysGlu: 0.422 ± 0.018
0.307CysPhe: 0.307 ± 0.015
0.921CysGly: 0.921 ± 0.028
0.264CysHis: 0.264 ± 0.016
0.357CysIle: 0.357 ± 0.016
0.185CysLys: 0.185 ± 0.012
0.927CysLeu: 0.927 ± 0.028
0.155CysMet: 0.155 ± 0.012
0.19CysAsn: 0.19 ± 0.013
0.511CysPro: 0.511 ± 0.023
0.239CysGln: 0.239 ± 0.013
0.595CysArg: 0.595 ± 0.02
0.415CysSer: 0.415 ± 0.021
0.424CysThr: 0.424 ± 0.018
0.562CysVal: 0.562 ± 0.022
0.118CysTrp: 0.118 ± 0.01
0.197CysTyr: 0.197 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
6.909AspAla: 6.909 ± 0.088
0.499AspCys: 0.499 ± 0.022
3.273AspAsp: 3.273 ± 0.083
3.625AspGlu: 3.625 ± 0.059
2.268AspPhe: 2.268 ± 0.045
5.579AspGly: 5.579 ± 0.087
1.382AspHis: 1.382 ± 0.034
2.726AspIle: 2.726 ± 0.05
1.376AspLys: 1.376 ± 0.037
6.582AspLeu: 6.582 ± 0.082
1.755AspMet: 1.755 ± 0.032
1.176AspAsn: 1.176 ± 0.034
4.021AspPro: 4.021 ± 0.067
1.892AspGln: 1.892 ± 0.04
4.667AspArg: 4.667 ± 0.064
2.384AspSer: 2.384 ± 0.049
2.842AspThr: 2.842 ± 0.059
3.918AspVal: 3.918 ± 0.062
1.187AspTrp: 1.187 ± 0.028
1.607AspTyr: 1.607 ± 0.039
0.0AspXaa: 0.0 ± 0.0
Glu
9.649GluAla: 9.649 ± 0.099
0.316GluCys: 0.316 ± 0.014
3.768GluAsp: 3.768 ± 0.063
4.524GluGlu: 4.524 ± 0.065
1.615GluPhe: 1.615 ± 0.033
5.682GluGly: 5.682 ± 0.078
1.068GluHis: 1.068 ± 0.03
3.673GluIle: 3.673 ± 0.054
2.071GluLys: 2.071 ± 0.05
5.512GluLeu: 5.512 ± 0.069
2.054GluMet: 2.054 ± 0.043
1.581GluAsn: 1.581 ± 0.038
2.49GluPro: 2.49 ± 0.051
1.927GluGln: 1.927 ± 0.042
4.079GluArg: 4.079 ± 0.065
2.15GluSer: 2.15 ± 0.042
4.055GluThr: 4.055 ± 0.051
4.818GluVal: 4.818 ± 0.062
0.662GluTrp: 0.662 ± 0.021
1.013GluTyr: 1.013 ± 0.031
0.0GluXaa: 0.0 ± 0.0
Phe
4.311PheAla: 4.311 ± 0.057
0.397PheCys: 0.397 ± 0.017
2.738PheAsp: 2.738 ± 0.048
2.044PheGlu: 2.044 ± 0.04
1.359PhePhe: 1.359 ± 0.043
3.595PheGly: 3.595 ± 0.06
0.759PheHis: 0.759 ± 0.025
1.486PheIle: 1.486 ± 0.032
0.741PheLys: 0.741 ± 0.027
3.455PheLeu: 3.455 ± 0.067
0.827PheMet: 0.827 ± 0.027
0.923PheAsn: 0.923 ± 0.029
1.549PhePro: 1.549 ± 0.035
0.937PheGln: 0.937 ± 0.025
2.275PheArg: 2.275 ± 0.038
1.969PheSer: 1.969 ± 0.044
2.066PheThr: 2.066 ± 0.04
2.475PheVal: 2.475 ± 0.041
0.574PheTrp: 0.574 ± 0.022
0.87PheTyr: 0.87 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
10.687GlyAla: 10.687 ± 0.105
0.844GlyCys: 0.844 ± 0.024
4.802GlyAsp: 4.802 ± 0.102
5.199GlyGlu: 5.199 ± 0.073
3.53GlyPhe: 3.53 ± 0.049
7.796GlyGly: 7.796 ± 0.129
1.935GlyHis: 1.935 ± 0.037
4.313GlyIle: 4.313 ± 0.064
2.762GlyLys: 2.762 ± 0.056
9.833GlyLeu: 9.833 ± 0.104
2.727GlyMet: 2.727 ± 0.05
1.986GlyAsn: 1.986 ± 0.047
4.106GlyPro: 4.106 ± 0.056
3.312GlyGln: 3.312 ± 0.059
6.191GlyArg: 6.191 ± 0.076
4.267GlySer: 4.267 ± 0.062
4.874GlyThr: 4.874 ± 0.065
6.337GlyVal: 6.337 ± 0.074
1.443GlyTrp: 1.443 ± 0.034
2.339GlyTyr: 2.339 ± 0.039
0.0GlyXaa: 0.0 ± 0.0
His
2.313HisAla: 2.313 ± 0.051
0.234HisCys: 0.234 ± 0.014
1.278HisAsp: 1.278 ± 0.036
1.068HisGlu: 1.068 ± 0.03
0.845HisPhe: 0.845 ± 0.023
1.923HisGly: 1.923 ± 0.045
0.555HisHis: 0.555 ± 0.024
0.816HisIle: 0.816 ± 0.026
0.39HisLys: 0.39 ± 0.016
2.153HisLeu: 2.153 ± 0.042
0.543HisMet: 0.543 ± 0.022
0.404HisAsn: 0.404 ± 0.019
1.417HisPro: 1.417 ± 0.033
0.571HisGln: 0.571 ± 0.024
1.303HisArg: 1.303 ± 0.033
0.858HisSer: 0.858 ± 0.027
0.744HisThr: 0.744 ± 0.024
1.566HisVal: 1.566 ± 0.036
0.352HisTrp: 0.352 ± 0.017
0.534HisTyr: 0.534 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
6.519IleAla: 6.519 ± 0.07
0.586IleCys: 0.586 ± 0.023
3.145IleAsp: 3.145 ± 0.054
3.295IleGlu: 3.295 ± 0.047
1.731IlePhe: 1.731 ± 0.044
4.461IleGly: 4.461 ± 0.068
0.847IleHis: 0.847 ± 0.026
1.825IleIle: 1.825 ± 0.047
1.052IleLys: 1.052 ± 0.035
4.807IleLeu: 4.807 ± 0.062
0.97IleMet: 0.97 ± 0.03
1.153IleAsn: 1.153 ± 0.032
2.243IlePro: 2.243 ± 0.042
1.059IleGln: 1.059 ± 0.027
3.238IleArg: 3.238 ± 0.051
2.839IleSer: 2.839 ± 0.045
2.751IleThr: 2.751 ± 0.048
3.309IleVal: 3.309 ± 0.062
0.64IleTrp: 0.64 ± 0.025
1.076IleTyr: 1.076 ± 0.029
0.0IleXaa: 0.0 ± 0.0
Lys
3.323LysAla: 3.323 ± 0.058
0.161LysCys: 0.161 ± 0.01
1.514LysAsp: 1.514 ± 0.028
1.483LysGlu: 1.483 ± 0.036
0.76LysPhe: 0.76 ± 0.022
2.414LysGly: 2.414 ± 0.042
0.506LysHis: 0.506 ± 0.02
1.448LysIle: 1.448 ± 0.035
1.062LysLys: 1.062 ± 0.032
2.699LysLeu: 2.699 ± 0.052
0.769LysMet: 0.769 ± 0.025
0.635LysAsn: 0.635 ± 0.02
1.62LysPro: 1.62 ± 0.047
0.741LysGln: 0.741 ± 0.025
1.961LysArg: 1.961 ± 0.046
1.473LysSer: 1.473 ± 0.037
1.658LysThr: 1.658 ± 0.038
2.149LysVal: 2.149 ± 0.041
0.332LysTrp: 0.332 ± 0.015
0.566LysTyr: 0.566 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
14.183LeuAla: 14.183 ± 0.128
0.971LeuCys: 0.971 ± 0.033
6.401LeuAsp: 6.401 ± 0.067
5.612LeuGlu: 5.612 ± 0.072
3.46LeuPhe: 3.46 ± 0.057
9.046LeuGly: 9.046 ± 0.105
1.945LeuHis: 1.945 ± 0.04
4.856LeuIle: 4.856 ± 0.061
3.051LeuLys: 3.051 ± 0.053
9.874LeuLeu: 9.874 ± 0.115
2.726LeuMet: 2.726 ± 0.049
2.401LeuAsn: 2.401 ± 0.049
6.223LeuPro: 6.223 ± 0.076
2.793LeuGln: 2.793 ± 0.048
7.621LeuArg: 7.621 ± 0.082
6.986LeuSer: 6.986 ± 0.084
6.342LeuThr: 6.342 ± 0.081
7.133LeuVal: 7.133 ± 0.082
1.338LeuTrp: 1.338 ± 0.038
1.977LeuTyr: 1.977 ± 0.039
0.0LeuXaa: 0.0 ± 0.0
Met
3.563MetAla: 3.563 ± 0.053
0.186MetCys: 0.186 ± 0.011
1.429MetAsp: 1.429 ± 0.034
1.448MetGlu: 1.448 ± 0.036
0.768MetPhe: 0.768 ± 0.025
2.292MetGly: 2.292 ± 0.045
0.414MetHis: 0.414 ± 0.019
1.496MetIle: 1.496 ± 0.038
1.018MetLys: 1.018 ± 0.028
2.677MetLeu: 2.677 ± 0.05
0.776MetMet: 0.776 ± 0.026
0.79MetAsn: 0.79 ± 0.025
1.546MetPro: 1.546 ± 0.037
1.023MetGln: 1.023 ± 0.029
1.979MetArg: 1.979 ± 0.038
1.824MetSer: 1.824 ± 0.031
2.232MetThr: 2.232 ± 0.045
1.837MetVal: 1.837 ± 0.04
0.228MetTrp: 0.228 ± 0.013
0.319MetTyr: 0.319 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
2.82AsnAla: 2.82 ± 0.049
0.223AsnCys: 0.223 ± 0.014
1.286AsnAsp: 1.286 ± 0.038
1.071AsnGlu: 1.071 ± 0.028
0.842AsnPhe: 0.842 ± 0.026
2.09AsnGly: 2.09 ± 0.048
0.439AsnHis: 0.439 ± 0.021
1.212AsnIle: 1.212 ± 0.03
0.497AsnLys: 0.497 ± 0.022
2.353AsnLeu: 2.353 ± 0.043
0.587AsnMet: 0.587 ± 0.019
0.526AsnAsn: 0.526 ± 0.022
1.642AsnPro: 1.642 ± 0.036
0.623AsnGln: 0.623 ± 0.021
1.628AsnArg: 1.628 ± 0.037
1.094AsnSer: 1.094 ± 0.031
1.141AsnThr: 1.141 ± 0.031
1.653AsnVal: 1.653 ± 0.038
0.386AsnTrp: 0.386 ± 0.018
0.62AsnTyr: 0.62 ± 0.024
0.0AsnXaa: 0.0 ± 0.0
Pro
6.657ProAla: 6.657 ± 0.096
0.385ProCys: 0.385 ± 0.019
3.97ProAsp: 3.97 ± 0.062
4.929ProGlu: 4.929 ± 0.066
1.905ProPhe: 1.905 ± 0.038
5.057ProGly: 5.057 ± 0.072
1.081ProHis: 1.081 ± 0.031
2.008ProIle: 2.008 ± 0.043
1.443ProLys: 1.443 ± 0.034
5.197ProLeu: 5.197 ± 0.076
1.376ProMet: 1.376 ± 0.035
1.116ProAsn: 1.116 ± 0.03
2.501ProPro: 2.501 ± 0.061
1.76ProGln: 1.76 ± 0.041
3.189ProArg: 3.189 ± 0.052
2.521ProSer: 2.521 ± 0.046
2.398ProThr: 2.398 ± 0.048
4.399ProVal: 4.399 ± 0.069
0.72ProTrp: 0.72 ± 0.024
1.189ProTyr: 1.189 ± 0.031
0.0ProXaa: 0.0 ± 0.0
Gln
4.245GlnAla: 4.245 ± 0.066
0.198GlnCys: 0.198 ± 0.011
1.726GlnAsp: 1.726 ± 0.037
1.659GlnGlu: 1.659 ± 0.035
0.94GlnPhe: 0.94 ± 0.028
2.763GlnGly: 2.763 ± 0.041
0.561GlnHis: 0.561 ± 0.02
1.878GlnIle: 1.878 ± 0.038
0.953GlnLys: 0.953 ± 0.029
2.855GlnLeu: 2.855 ± 0.046
1.056GlnMet: 1.056 ± 0.03
0.757GlnAsn: 0.757 ± 0.025
1.7GlnPro: 1.7 ± 0.037
1.172GlnGln: 1.172 ± 0.034
2.133GlnArg: 2.133 ± 0.04
1.634GlnSer: 1.634 ± 0.034
1.68GlnThr: 1.68 ± 0.035
2.531GlnVal: 2.531 ± 0.043
0.363GlnTrp: 0.363 ± 0.017
0.568GlnTyr: 0.568 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
9.16ArgAla: 9.16 ± 0.099
0.492ArgCys: 0.492 ± 0.018
4.39ArgAsp: 4.39 ± 0.064
4.091ArgGlu: 4.091 ± 0.064
2.584ArgPhe: 2.584 ± 0.048
5.039ArgGly: 5.039 ± 0.063
1.621ArgHis: 1.621 ± 0.036
3.796ArgIle: 3.796 ± 0.048
2.034ArgLys: 2.034 ± 0.043
7.985ArgLeu: 7.985 ± 0.106
2.051ArgMet: 2.051 ± 0.041
1.681ArgAsn: 1.681 ± 0.036
3.754ArgPro: 3.754 ± 0.065
2.424ArgGln: 2.424 ± 0.049
5.559ArgArg: 5.559 ± 0.078
3.288ArgSer: 3.288 ± 0.057
2.851ArgThr: 2.851 ± 0.047
4.716ArgVal: 4.716 ± 0.062
0.863ArgTrp: 0.863 ± 0.026
1.587ArgTyr: 1.587 ± 0.042
0.0ArgXaa: 0.0 ± 0.0
Ser
5.919SerAla: 5.919 ± 0.078
0.444SerCys: 0.444 ± 0.02
3.181SerAsp: 3.181 ± 0.056
2.889SerGlu: 2.889 ± 0.054
2.137SerPhe: 2.137 ± 0.044
5.427SerGly: 5.427 ± 0.082
1.058SerHis: 1.058 ± 0.031
2.27SerIle: 2.27 ± 0.041
1.258SerLys: 1.258 ± 0.036
5.163SerLeu: 5.163 ± 0.075
1.371SerMet: 1.371 ± 0.033
1.178SerAsn: 1.178 ± 0.03
2.586SerPro: 2.586 ± 0.052
1.538SerGln: 1.538 ± 0.033
3.307SerArg: 3.307 ± 0.056
2.608SerSer: 2.608 ± 0.05
2.49SerThr: 2.49 ± 0.047
3.659SerVal: 3.659 ± 0.059
0.735SerTrp: 0.735 ± 0.024
1.275SerTyr: 1.275 ± 0.031
0.0SerXaa: 0.0 ± 0.0
Thr
6.315ThrAla: 6.315 ± 0.07
0.484ThrCys: 0.484 ± 0.02
3.012ThrAsp: 3.012 ± 0.053
3.294ThrGlu: 3.294 ± 0.051
2.002ThrPhe: 2.002 ± 0.042
5.551ThrGly: 5.551 ± 0.079
1.083ThrHis: 1.083 ± 0.029
2.449ThrIle: 2.449 ± 0.044
1.216ThrLys: 1.216 ± 0.031
6.255ThrLeu: 6.255 ± 0.077
1.281ThrMet: 1.281 ± 0.031
1.133ThrAsn: 1.133 ± 0.033
3.619ThrPro: 3.619 ± 0.058
1.512ThrGln: 1.512 ± 0.034
3.738ThrArg: 3.738 ± 0.055
2.647ThrSer: 2.647 ± 0.05
2.75ThrThr: 2.75 ± 0.048
3.902ThrVal: 3.902 ± 0.06
0.733ThrTrp: 0.733 ± 0.026
1.196ThrTyr: 1.196 ± 0.033
0.0ThrXaa: 0.0 ± 0.0
Val
8.958ValAla: 8.958 ± 0.095
0.638ValCys: 0.638 ± 0.023
3.915ValAsp: 3.915 ± 0.065
4.623ValGlu: 4.623 ± 0.061
2.656ValPhe: 2.656 ± 0.048
5.091ValGly: 5.091 ± 0.071
1.174ValHis: 1.174 ± 0.028
3.937ValIle: 3.937 ± 0.062
1.993ValLys: 1.993 ± 0.045
7.826ValLeu: 7.826 ± 0.098
2.068ValMet: 2.068 ± 0.045
1.793ValAsn: 1.793 ± 0.034
3.646ValPro: 3.646 ± 0.061
2.044ValGln: 2.044 ± 0.038
3.944ValArg: 3.944 ± 0.053
4.082ValSer: 4.082 ± 0.059
4.938ValThr: 4.938 ± 0.068
5.491ValVal: 5.491 ± 0.07
0.855ValTrp: 0.855 ± 0.024
1.388ValTyr: 1.388 ± 0.034
0.0ValXaa: 0.0 ± 0.0
Trp
1.413TrpAla: 1.413 ± 0.037
0.118TrpCys: 0.118 ± 0.01
0.755TrpAsp: 0.755 ± 0.025
0.677TrpGlu: 0.677 ± 0.023
0.515TrpPhe: 0.515 ± 0.02
1.018TrpGly: 1.018 ± 0.032
0.351TrpHis: 0.351 ± 0.018
0.722TrpIle: 0.722 ± 0.026
0.426TrpLys: 0.426 ± 0.018
1.688TrpLeu: 1.688 ± 0.041
0.412TrpMet: 0.412 ± 0.016
0.385TrpAsn: 0.385 ± 0.018
0.716TrpPro: 0.716 ± 0.024
0.603TrpGln: 0.603 ± 0.024
1.05TrpArg: 1.05 ± 0.029
0.747TrpSer: 0.747 ± 0.025
0.732TrpThr: 0.732 ± 0.024
0.889TrpVal: 0.889 ± 0.028
0.217TrpTrp: 0.217 ± 0.013
0.266TrpTyr: 0.266 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.507TyrAla: 2.507 ± 0.049
0.225TyrCys: 0.225 ± 0.012
1.599TyrAsp: 1.599 ± 0.04
1.323TyrGlu: 1.323 ± 0.04
0.845TyrPhe: 0.845 ± 0.023
2.118TyrGly: 2.118 ± 0.048
0.504TyrHis: 0.504 ± 0.019
0.86TyrIle: 0.86 ± 0.024
0.476TyrLys: 0.476 ± 0.018
2.241TyrLeu: 2.241 ± 0.04
0.527TyrMet: 0.527 ± 0.018
0.535TyrAsn: 0.535 ± 0.019
1.086TyrPro: 1.086 ± 0.031
0.66TyrGln: 0.66 ± 0.025
1.527TyrArg: 1.527 ± 0.033
1.109TyrSer: 1.109 ± 0.035
1.084TyrThr: 1.084 ± 0.032
1.467TyrVal: 1.467 ± 0.038
0.351TyrTrp: 0.351 ± 0.017
0.589TyrTyr: 0.589 ± 0.024
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4205 proteins (1317918 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski