Amino acid dipepetide frequency for Algoriphagus aquimarinus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.082AlaAla: 5.082 ± 0.075
0.549AlaCys: 0.549 ± 0.017
3.837AlaAsp: 3.837 ± 0.054
4.727AlaGlu: 4.727 ± 0.063
3.556AlaPhe: 3.556 ± 0.048
5.296AlaGly: 5.296 ± 0.074
1.196AlaHis: 1.196 ± 0.024
5.202AlaIle: 5.202 ± 0.065
4.552AlaLys: 4.552 ± 0.064
6.364AlaLeu: 6.364 ± 0.078
1.845AlaMet: 1.845 ± 0.041
3.279AlaAsn: 3.279 ± 0.048
2.306AlaPro: 2.306 ± 0.052
2.569AlaGln: 2.569 ± 0.039
2.289AlaArg: 2.289 ± 0.042
4.642AlaSer: 4.642 ± 0.064
3.509AlaThr: 3.509 ± 0.057
4.223AlaVal: 4.223 ± 0.063
0.8AlaTrp: 0.8 ± 0.025
2.513AlaTyr: 2.513 ± 0.038
0.0AlaXaa: 0.0 ± 0.0
Cys
0.46CysAla: 0.46 ± 0.017
0.075CysCys: 0.075 ± 0.007
0.383CysAsp: 0.383 ± 0.021
0.42CysGlu: 0.42 ± 0.02
0.354CysPhe: 0.354 ± 0.013
0.548CysGly: 0.548 ± 0.024
0.177CysHis: 0.177 ± 0.015
0.44CysIle: 0.44 ± 0.016
0.36CysLys: 0.36 ± 0.016
0.584CysLeu: 0.584 ± 0.021
0.139CysMet: 0.139 ± 0.009
0.272CysAsn: 0.272 ± 0.015
0.292CysPro: 0.292 ± 0.017
0.245CysGln: 0.245 ± 0.013
0.214CysArg: 0.214 ± 0.011
0.533CysSer: 0.533 ± 0.019
0.37CysThr: 0.37 ± 0.014
0.391CysVal: 0.391 ± 0.016
0.071CysTrp: 0.071 ± 0.008
0.248CysTyr: 0.248 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
3.464AspAla: 3.464 ± 0.047
0.348AspCys: 0.348 ± 0.017
2.567AspAsp: 2.567 ± 0.05
3.907AspGlu: 3.907 ± 0.059
3.699AspPhe: 3.699 ± 0.054
4.131AspGly: 4.131 ± 0.071
1.024AspHis: 1.024 ± 0.027
3.694AspIle: 3.694 ± 0.044
3.55AspLys: 3.55 ± 0.048
5.932AspLeu: 5.932 ± 0.062
1.288AspMet: 1.288 ± 0.033
2.377AspAsn: 2.377 ± 0.047
2.575AspPro: 2.575 ± 0.046
2.3AspGln: 2.3 ± 0.042
2.305AspArg: 2.305 ± 0.046
3.467AspSer: 3.467 ± 0.05
2.305AspThr: 2.305 ± 0.034
3.166AspVal: 3.166 ± 0.05
0.935AspTrp: 0.935 ± 0.024
2.553AspTyr: 2.553 ± 0.047
0.0AspXaa: 0.0 ± 0.0
Glu
4.687GluAla: 4.687 ± 0.065
0.284GluCys: 0.284 ± 0.014
3.544GluAsp: 3.544 ± 0.048
5.302GluGlu: 5.302 ± 0.075
3.268GluPhe: 3.268 ± 0.047
4.309GluGly: 4.309 ± 0.055
1.003GluHis: 1.003 ± 0.027
5.568GluIle: 5.568 ± 0.071
5.698GluLys: 5.698 ± 0.082
6.922GluLeu: 6.922 ± 0.071
1.902GluMet: 1.902 ± 0.037
4.102GluAsn: 4.102 ± 0.056
1.747GluPro: 1.747 ± 0.04
2.286GluGln: 2.286 ± 0.041
2.625GluArg: 2.625 ± 0.039
4.09GluSer: 4.09 ± 0.057
3.272GluThr: 3.272 ± 0.057
4.885GluVal: 4.885 ± 0.065
0.789GluTrp: 0.789 ± 0.023
2.307GluTyr: 2.307 ± 0.038
0.0GluXaa: 0.0 ± 0.0
Phe
3.431PheAla: 3.431 ± 0.048
0.381PheCys: 0.381 ± 0.017
3.247PheAsp: 3.247 ± 0.044
3.519PheGlu: 3.519 ± 0.051
2.776PhePhe: 2.776 ± 0.047
3.844PheGly: 3.844 ± 0.061
0.912PheHis: 0.912 ± 0.027
3.375PheIle: 3.375 ± 0.053
2.994PheLys: 2.994 ± 0.046
5.033PheLeu: 5.033 ± 0.072
1.144PheMet: 1.144 ± 0.026
2.608PheAsn: 2.608 ± 0.042
2.015PhePro: 2.015 ± 0.039
1.908PheGln: 1.908 ± 0.035
1.96PheArg: 1.96 ± 0.038
4.127PheSer: 4.127 ± 0.058
3.097PheThr: 3.097 ± 0.058
2.921PheVal: 2.921 ± 0.045
0.717PheTrp: 0.717 ± 0.024
2.042PheTyr: 2.042 ± 0.037
0.0PheXaa: 0.0 ± 0.0
Gly
4.625GlyAla: 4.625 ± 0.065
0.502GlyCys: 0.502 ± 0.022
3.752GlyAsp: 3.752 ± 0.063
4.47GlyGlu: 4.47 ± 0.06
3.928GlyPhe: 3.928 ± 0.057
5.033GlyGly: 5.033 ± 0.091
1.205GlyHis: 1.205 ± 0.034
5.571GlyIle: 5.571 ± 0.065
5.334GlyLys: 5.334 ± 0.065
6.664GlyLeu: 6.664 ± 0.07
1.992GlyMet: 1.992 ± 0.041
3.696GlyAsn: 3.696 ± 0.063
1.713GlyPro: 1.713 ± 0.036
2.263GlyGln: 2.263 ± 0.039
2.567GlyArg: 2.567 ± 0.052
4.465GlySer: 4.465 ± 0.073
3.935GlyThr: 3.935 ± 0.073
4.817GlyVal: 4.817 ± 0.069
0.933GlyTrp: 0.933 ± 0.025
2.949GlyTyr: 2.949 ± 0.04
0.0GlyXaa: 0.0 ± 0.0
His
1.114HisAla: 1.114 ± 0.032
0.156HisCys: 0.156 ± 0.01
0.818HisAsp: 0.818 ± 0.023
1.114HisGlu: 1.114 ± 0.027
1.097HisPhe: 1.097 ± 0.031
1.218HisGly: 1.218 ± 0.029
0.467HisHis: 0.467 ± 0.018
1.154HisIle: 1.154 ± 0.029
0.918HisLys: 0.918 ± 0.026
1.882HisLeu: 1.882 ± 0.039
0.399HisMet: 0.399 ± 0.015
0.719HisAsn: 0.719 ± 0.021
1.005HisPro: 1.005 ± 0.028
0.74HisGln: 0.74 ± 0.021
0.715HisArg: 0.715 ± 0.019
1.128HisSer: 1.128 ± 0.028
0.899HisThr: 0.899 ± 0.025
0.973HisVal: 0.973 ± 0.028
0.263HisTrp: 0.263 ± 0.015
0.728HisTyr: 0.728 ± 0.021
0.0HisXaa: 0.0 ± 0.0
Ile
5.261IleAla: 5.261 ± 0.063
0.594IleCys: 0.594 ± 0.023
4.113IleAsp: 4.113 ± 0.049
4.679IleGlu: 4.679 ± 0.063
3.438IlePhe: 3.438 ± 0.061
5.077IleGly: 5.077 ± 0.074
1.386IleHis: 1.386 ± 0.034
4.611IleIle: 4.611 ± 0.063
4.523IleLys: 4.523 ± 0.061
7.101IleLeu: 7.101 ± 0.087
1.37IleMet: 1.37 ± 0.033
3.689IleAsn: 3.689 ± 0.058
3.419IlePro: 3.419 ± 0.049
2.866IleGln: 2.866 ± 0.047
2.957IleArg: 2.957 ± 0.044
5.739IleSer: 5.739 ± 0.065
4.145IleThr: 4.145 ± 0.059
4.114IleVal: 4.114 ± 0.059
0.838IleTrp: 0.838 ± 0.025
2.573IleTyr: 2.573 ± 0.043
0.0IleXaa: 0.0 ± 0.0
Lys
4.853LysAla: 4.853 ± 0.064
0.266LysCys: 0.266 ± 0.013
3.848LysAsp: 3.848 ± 0.06
5.124LysGlu: 5.124 ± 0.072
2.696LysPhe: 2.696 ± 0.044
4.484LysGly: 4.484 ± 0.056
1.101LysHis: 1.101 ± 0.026
5.063LysIle: 5.063 ± 0.065
5.056LysLys: 5.056 ± 0.074
6.195LysLeu: 6.195 ± 0.074
1.916LysMet: 1.916 ± 0.04
3.75LysAsn: 3.75 ± 0.05
2.524LysPro: 2.524 ± 0.042
2.036LysGln: 2.036 ± 0.037
2.511LysArg: 2.511 ± 0.047
4.709LysSer: 4.709 ± 0.063
3.614LysThr: 3.614 ± 0.048
4.58LysVal: 4.58 ± 0.055
0.801LysTrp: 0.801 ± 0.023
2.522LysTyr: 2.522 ± 0.044
0.0LysXaa: 0.0 ± 0.0
Leu
7.087LeuAla: 7.087 ± 0.082
0.678LeuCys: 0.678 ± 0.025
5.795LeuAsp: 5.795 ± 0.067
6.715LeuGlu: 6.715 ± 0.075
4.804LeuPhe: 4.804 ± 0.068
6.715LeuGly: 6.715 ± 0.067
1.557LeuHis: 1.557 ± 0.036
7.242LeuIle: 7.242 ± 0.086
6.713LeuLys: 6.713 ± 0.075
9.251LeuLeu: 9.251 ± 0.105
2.322LeuMet: 2.322 ± 0.044
4.935LeuAsn: 4.935 ± 0.064
4.045LeuPro: 4.045 ± 0.059
3.206LeuGln: 3.206 ± 0.048
3.572LeuArg: 3.572 ± 0.052
7.031LeuSer: 7.031 ± 0.064
5.352LeuThr: 5.352 ± 0.075
6.149LeuVal: 6.149 ± 0.07
0.972LeuTrp: 0.972 ± 0.027
3.063LeuTyr: 3.063 ± 0.05
0.0LeuXaa: 0.0 ± 0.0
Met
1.886MetAla: 1.886 ± 0.041
0.123MetCys: 0.123 ± 0.01
1.488MetAsp: 1.488 ± 0.03
1.715MetGlu: 1.715 ± 0.037
0.761MetPhe: 0.761 ± 0.024
1.75MetGly: 1.75 ± 0.034
0.417MetHis: 0.417 ± 0.016
1.824MetIle: 1.824 ± 0.036
2.161MetLys: 2.161 ± 0.038
2.177MetLeu: 2.177 ± 0.042
0.688MetMet: 0.688 ± 0.021
1.413MetAsn: 1.413 ± 0.029
1.042MetPro: 1.042 ± 0.024
0.84MetGln: 0.84 ± 0.027
0.972MetArg: 0.972 ± 0.024
1.514MetSer: 1.514 ± 0.029
1.291MetThr: 1.291 ± 0.029
1.523MetVal: 1.523 ± 0.033
0.195MetTrp: 0.195 ± 0.011
0.647MetTyr: 0.647 ± 0.02
0.0MetXaa: 0.0 ± 0.0
Asn
3.254AsnAla: 3.254 ± 0.042
0.327AsnCys: 0.327 ± 0.016
2.439AsnAsp: 2.439 ± 0.046
3.097AsnGlu: 3.097 ± 0.046
2.739AsnPhe: 2.739 ± 0.043
3.645AsnGly: 3.645 ± 0.059
0.919AsnHis: 0.919 ± 0.026
3.317AsnIle: 3.317 ± 0.051
2.76AsnLys: 2.76 ± 0.044
5.21AsnLeu: 5.21 ± 0.064
1.173AsnMet: 1.173 ± 0.026
2.308AsnAsn: 2.308 ± 0.051
2.95AsnPro: 2.95 ± 0.046
2.271AsnGln: 2.271 ± 0.047
2.102AsnArg: 2.102 ± 0.04
3.821AsnSer: 3.821 ± 0.051
2.766AsnThr: 2.766 ± 0.046
2.819AsnVal: 2.819 ± 0.047
0.819AsnTrp: 0.819 ± 0.026
2.34AsnTyr: 2.34 ± 0.04
0.0AsnXaa: 0.0 ± 0.0
Pro
2.828ProAla: 2.828 ± 0.048
0.176ProCys: 0.176 ± 0.01
2.558ProAsp: 2.558 ± 0.041
3.488ProGlu: 3.488 ± 0.052
2.093ProPhe: 2.093 ± 0.035
2.671ProGly: 2.671 ± 0.045
0.676ProHis: 0.676 ± 0.021
2.792ProIle: 2.792 ± 0.049
2.449ProLys: 2.449 ± 0.047
3.328ProLeu: 3.328 ± 0.046
0.913ProMet: 0.913 ± 0.025
2.088ProAsn: 2.088 ± 0.041
0.94ProPro: 0.94 ± 0.026
1.204ProGln: 1.204 ± 0.028
1.136ProArg: 1.136 ± 0.029
2.55ProSer: 2.55 ± 0.041
2.161ProThr: 2.161 ± 0.045
2.694ProVal: 2.694 ± 0.041
0.465ProTrp: 0.465 ± 0.019
1.426ProTyr: 1.426 ± 0.03
0.0ProXaa: 0.0 ± 0.0
Gln
2.477GlnAla: 2.477 ± 0.045
0.167GlnCys: 0.167 ± 0.01
1.85GlnAsp: 1.85 ± 0.031
2.616GlnGlu: 2.616 ± 0.044
1.672GlnPhe: 1.672 ± 0.036
2.156GlnGly: 2.156 ± 0.042
0.598GlnHis: 0.598 ± 0.025
2.769GlnIle: 2.769 ± 0.045
2.664GlnLys: 2.664 ± 0.045
3.689GlnLeu: 3.689 ± 0.061
0.89GlnMet: 0.89 ± 0.025
2.025GlnAsn: 2.025 ± 0.039
1.193GlnPro: 1.193 ± 0.027
1.313GlnGln: 1.313 ± 0.033
1.409GlnArg: 1.409 ± 0.031
2.356GlnSer: 2.356 ± 0.041
1.934GlnThr: 1.934 ± 0.037
2.575GlnVal: 2.575 ± 0.041
0.377GlnTrp: 0.377 ± 0.016
1.201GlnTyr: 1.201 ± 0.029
0.0GlnXaa: 0.0 ± 0.0
Arg
2.323ArgAla: 2.323 ± 0.043
0.173ArgCys: 0.173 ± 0.011
2.021ArgAsp: 2.021 ± 0.038
2.663ArgGlu: 2.663 ± 0.044
2.079ArgPhe: 2.079 ± 0.037
2.287ArgGly: 2.287 ± 0.039
0.588ArgHis: 0.588 ± 0.022
3.041ArgIle: 3.041 ± 0.048
2.902ArgLys: 2.902 ± 0.044
3.603ArgLeu: 3.603 ± 0.052
1.122ArgMet: 1.122 ± 0.032
2.097ArgAsn: 2.097 ± 0.044
1.333ArgPro: 1.333 ± 0.028
1.229ArgGln: 1.229 ± 0.03
1.492ArgArg: 1.492 ± 0.035
2.294ArgSer: 2.294 ± 0.043
2.006ArgThr: 2.006 ± 0.036
2.436ArgVal: 2.436 ± 0.039
0.481ArgTrp: 0.481 ± 0.017
1.541ArgTyr: 1.541 ± 0.036
0.0ArgXaa: 0.0 ± 0.0
Ser
4.26SerAla: 4.26 ± 0.059
0.64SerCys: 0.64 ± 0.02
3.778SerAsp: 3.778 ± 0.053
4.46SerGlu: 4.46 ± 0.057
3.91SerPhe: 3.91 ± 0.058
5.313SerGly: 5.313 ± 0.075
1.191SerHis: 1.191 ± 0.03
5.114SerIle: 5.114 ± 0.063
4.627SerLys: 4.627 ± 0.059
6.828SerLeu: 6.828 ± 0.073
1.565SerMet: 1.565 ± 0.031
3.39SerAsn: 3.39 ± 0.051
2.683SerPro: 2.683 ± 0.048
2.572SerGln: 2.572 ± 0.044
2.61SerArg: 2.61 ± 0.046
4.994SerSer: 4.994 ± 0.067
3.683SerThr: 3.683 ± 0.048
4.108SerVal: 4.108 ± 0.061
0.862SerTrp: 0.862 ± 0.026
2.715SerTyr: 2.715 ± 0.047
0.0SerXaa: 0.0 ± 0.0
Thr
3.742ThrAla: 3.742 ± 0.064
0.348ThrCys: 0.348 ± 0.018
3.04ThrAsp: 3.04 ± 0.058
3.333ThrGlu: 3.333 ± 0.047
2.868ThrPhe: 2.868 ± 0.054
4.264ThrGly: 4.264 ± 0.072
0.972ThrHis: 0.972 ± 0.022
3.974ThrIle: 3.974 ± 0.05
3.022ThrLys: 3.022 ± 0.045
5.221ThrLeu: 5.221 ± 0.063
1.031ThrMet: 1.031 ± 0.028
2.502ThrAsn: 2.502 ± 0.049
2.468ThrPro: 2.468 ± 0.046
1.907ThrGln: 1.907 ± 0.034
1.744ThrArg: 1.744 ± 0.036
3.716ThrSer: 3.716 ± 0.052
2.905ThrThr: 2.905 ± 0.058
3.497ThrVal: 3.497 ± 0.073
0.73ThrTrp: 0.73 ± 0.024
2.185ThrTyr: 2.185 ± 0.05
0.0ThrXaa: 0.0 ± 0.0
Val
4.352ValAla: 4.352 ± 0.062
0.465ValCys: 0.465 ± 0.021
3.572ValAsp: 3.572 ± 0.057
4.07ValGlu: 4.07 ± 0.06
3.335ValPhe: 3.335 ± 0.054
4.23ValGly: 4.23 ± 0.068
1.048ValHis: 1.048 ± 0.029
4.687ValIle: 4.687 ± 0.062
4.234ValLys: 4.234 ± 0.059
6.116ValLeu: 6.116 ± 0.07
1.54ValMet: 1.54 ± 0.031
3.293ValAsn: 3.293 ± 0.057
2.367ValPro: 2.367 ± 0.04
2.025ValGln: 2.025 ± 0.039
2.331ValArg: 2.331 ± 0.044
4.572ValSer: 4.572 ± 0.058
3.469ValThr: 3.469 ± 0.068
4.0ValVal: 4.0 ± 0.06
0.732ValTrp: 0.732 ± 0.021
2.23ValTyr: 2.23 ± 0.04
0.0ValXaa: 0.0 ± 0.0
Trp
0.786TrpAla: 0.786 ± 0.025
0.075TrpCys: 0.075 ± 0.007
0.771TrpAsp: 0.771 ± 0.023
0.853TrpGlu: 0.853 ± 0.026
0.629TrpPhe: 0.629 ± 0.022
0.826TrpGly: 0.826 ± 0.026
0.256TrpHis: 0.256 ± 0.014
0.903TrpIle: 0.903 ± 0.025
0.957TrpLys: 0.957 ± 0.025
1.115TrpLeu: 1.115 ± 0.029
0.432TrpMet: 0.432 ± 0.018
0.67TrpAsn: 0.67 ± 0.022
0.403TrpPro: 0.403 ± 0.02
0.44TrpGln: 0.44 ± 0.016
0.508TrpArg: 0.508 ± 0.019
0.762TrpSer: 0.762 ± 0.023
0.666TrpThr: 0.666 ± 0.023
0.802TrpVal: 0.802 ± 0.025
0.187TrpTrp: 0.187 ± 0.013
0.449TrpTyr: 0.449 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.366TyrAla: 2.366 ± 0.042
0.251TyrCys: 0.251 ± 0.013
2.169TyrAsp: 2.169 ± 0.043
2.316TyrGlu: 2.316 ± 0.042
2.35TyrPhe: 2.35 ± 0.04
2.616TyrGly: 2.616 ± 0.043
0.813TyrHis: 0.813 ± 0.025
2.15TyrIle: 2.15 ± 0.04
2.175TyrLys: 2.175 ± 0.034
3.989TyrLeu: 3.989 ± 0.058
0.756TyrMet: 0.756 ± 0.024
1.862TyrAsn: 1.862 ± 0.038
1.616TyrPro: 1.616 ± 0.036
1.709TyrGln: 1.709 ± 0.035
1.692TyrArg: 1.692 ± 0.033
2.797TyrSer: 2.797 ± 0.048
2.088TyrThr: 2.088 ± 0.044
1.994TyrVal: 1.994 ± 0.037
0.524TyrTrp: 0.524 ± 0.017
1.654TyrTyr: 1.654 ± 0.037
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4362 proteins (1512798 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski