Amino acid dipeptide frequency for Roseobacter cerasinus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency: the frequency of each ordered pair of adjacent amino acids.
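As a rough illustration of what such a calculation might look like, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the use of one-letter codes, and the toy sequences are assumptions made for this example; the page does not describe the exact counting procedure used for the table.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: overlapping windows of length 2, counted within each
    protein only (a pair never spans two proteins).
    """
    counts = Counter()
    for seq in sequences:
        # Each pair of neighbouring residues is counted once.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not data from this proteome.
freqs = dipeptide_per_mille(["MAAL", "AAG"])
print(freqs)
```

With these two toy sequences there are five dipeptides in total (MA, AA, AL, AA, AG), so AA accounts for two of the five.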

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
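The uniform baseline can be checked with a few lines of arithmetic. The sketch below computes the expected value for 20 amino acids and compares three values copied from the table against it. The `classify` helper is hypothetical, and the page does not say whether its colouring uses exactly this threshold or some tolerance around it.

```python
ALPHABET_SIZE = 20  # set to 21 to include Xaa (441 possible dipeptides)

# Uniform expectation: 1/400 = 0.25 % = 2.5 per mille.
expected_per_mille = 1000.0 / ALPHABET_SIZE ** 2

# Three values copied from the table (per mille).
observed = {"AlaAla": 15.877, "AlaLys": 3.676, "CysCys": 0.118}


def classify(value, expected=expected_per_mille):
    """Label a frequency relative to the uniform baseline (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


labels = {pair: classify(value) for pair, value in observed.items()}
print(expected_per_mille, labels)
```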

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
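For example, the conversion for the AlaAla entry could be written as follows; the helper name `per_mille_to_fraction` is only illustrative.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


ala_ala_fraction = per_mille_to_fraction(15.877)  # AlaAla value from the table
ala_ala_percent = ala_ala_fraction * 100.0
print(ala_ala_fraction, ala_ala_percent)
```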

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.877 ± 0.162
AlaCys: 1.174 ± 0.034
AlaAsp: 7.077 ± 0.087
AlaGlu: 8.18 ± 0.084
AlaPhe: 4.444 ± 0.062
AlaGly: 9.912 ± 0.094
AlaHis: 2.344 ± 0.044
AlaIle: 5.639 ± 0.073
AlaLys: 3.676 ± 0.069
AlaLeu: 13.75 ± 0.137
AlaMet: 3.694 ± 0.06
AlaAsn: 2.597 ± 0.047
AlaPro: 5.732 ± 0.079
AlaGln: 5.037 ± 0.064
AlaArg: 8.456 ± 0.089
AlaSer: 5.688 ± 0.065
AlaThr: 6.046 ± 0.072
AlaVal: 8.408 ± 0.086
AlaTrp: 1.405 ± 0.034
AlaTyr: 2.512 ± 0.048
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.174 ± 0.032
CysCys: 0.118 ± 0.009
CysAsp: 0.638 ± 0.021
CysGlu: 0.473 ± 0.02
CysPhe: 0.359 ± 0.015
CysGly: 0.987 ± 0.033
CysHis: 0.271 ± 0.016
CysIle: 0.434 ± 0.018
CysLys: 0.228 ± 0.015
CysLeu: 0.917 ± 0.027
CysMet: 0.187 ± 0.011
CysAsn: 0.225 ± 0.012
CysPro: 0.484 ± 0.021
CysGln: 0.261 ± 0.014
CysArg: 0.541 ± 0.021
CysSer: 0.459 ± 0.021
CysThr: 0.503 ± 0.021
CysVal: 0.687 ± 0.024
CysTrp: 0.104 ± 0.008
CysTyr: 0.216 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.583 ± 0.087
AspCys: 0.494 ± 0.021
AspAsp: 3.538 ± 0.086
AspGlu: 3.399 ± 0.067
AspPhe: 2.342 ± 0.045
AspGly: 5.721 ± 0.126
AspHis: 1.456 ± 0.036
AspIle: 3.347 ± 0.053
AspLys: 1.624 ± 0.032
AspLeu: 6.613 ± 0.087
AspMet: 1.691 ± 0.038
AspAsn: 1.339 ± 0.038
AspPro: 3.6 ± 0.055
AspGln: 2.243 ± 0.044
AspArg: 4.094 ± 0.047
AspSer: 2.177 ± 0.048
AspThr: 3.286 ± 0.091
AspVal: 4.776 ± 0.062
AspTrp: 1.139 ± 0.029
AspTyr: 1.548 ± 0.035
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.94 ± 0.087
GluCys: 0.337 ± 0.014
GluAsp: 3.535 ± 0.057
GluGlu: 3.44 ± 0.06
GluPhe: 1.849 ± 0.041
GluGly: 4.588 ± 0.068
GluHis: 1.156 ± 0.031
GluIle: 3.612 ± 0.055
GluLys: 2.026 ± 0.046
GluLeu: 4.972 ± 0.073
GluMet: 1.828 ± 0.036
GluAsn: 1.741 ± 0.035
GluPro: 2.432 ± 0.057
GluGln: 2.137 ± 0.047
GluArg: 3.984 ± 0.058
GluSer: 2.013 ± 0.04
GluThr: 4.111 ± 0.06
GluVal: 4.441 ± 0.064
GluTrp: 0.634 ± 0.024
GluTyr: 1.002 ± 0.033
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.605 ± 0.054
PheCys: 0.452 ± 0.016
PheAsp: 3.071 ± 0.047
PheGlu: 2.506 ± 0.044
PhePhe: 1.546 ± 0.038
PheGly: 3.8 ± 0.053
PheHis: 0.776 ± 0.025
PheIle: 1.717 ± 0.039
PheLys: 1.028 ± 0.025
PheLeu: 3.401 ± 0.059
PheMet: 0.869 ± 0.028
PheAsn: 1.077 ± 0.033
PhePro: 1.491 ± 0.033
PheGln: 1.198 ± 0.03
PheArg: 2.013 ± 0.047
PheSer: 2.218 ± 0.044
PheThr: 2.056 ± 0.043
PheVal: 2.793 ± 0.051
PheTrp: 0.579 ± 0.026
PheTyr: 0.931 ± 0.031
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.707 ± 0.107
GlyCys: 0.901 ± 0.029
GlyAsp: 4.769 ± 0.099
GlyGlu: 4.385 ± 0.067
GlyPhe: 3.769 ± 0.058
GlyGly: 7.282 ± 0.15
GlyHis: 1.883 ± 0.044
GlyIle: 4.241 ± 0.062
GlyLys: 2.875 ± 0.045
GlyLeu: 9.148 ± 0.091
GlyMet: 2.411 ± 0.047
GlyAsn: 2.093 ± 0.074
GlyPro: 3.679 ± 0.052
GlyGln: 3.359 ± 0.056
GlyArg: 5.385 ± 0.065
GlySer: 4.291 ± 0.084
GlyThr: 4.835 ± 0.072
GlyVal: 6.438 ± 0.078
GlyTrp: 1.412 ± 0.04
GlyTyr: 2.323 ± 0.044
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.364 ± 0.041
HisCys: 0.248 ± 0.014
HisAsp: 1.285 ± 0.03
HisGlu: 1.062 ± 0.028
HisPhe: 0.835 ± 0.025
HisGly: 1.863 ± 0.04
HisHis: 0.548 ± 0.021
HisIle: 1.035 ± 0.028
HisLys: 0.555 ± 0.022
HisLeu: 2.261 ± 0.043
HisMet: 0.614 ± 0.022
HisAsn: 0.468 ± 0.018
HisPro: 1.335 ± 0.037
HisGln: 0.638 ± 0.024
HisArg: 1.345 ± 0.036
HisSer: 0.984 ± 0.026
HisThr: 0.926 ± 0.027
HisVal: 1.533 ± 0.034
HisTrp: 0.356 ± 0.018
HisTyr: 0.57 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.782 ± 0.082
IleCys: 0.69 ± 0.023
IleAsp: 3.602 ± 0.061
IleGlu: 3.431 ± 0.056
IlePhe: 1.909 ± 0.043
IleGly: 4.781 ± 0.066
IleHis: 0.923 ± 0.027
IleIle: 2.28 ± 0.048
IleLys: 1.527 ± 0.034
IleLeu: 4.626 ± 0.067
IleMet: 1.146 ± 0.031
IleAsn: 1.533 ± 0.036
IlePro: 2.31 ± 0.042
IleGln: 1.262 ± 0.034
IleArg: 3.035 ± 0.05
IleSer: 3.156 ± 0.053
IleThr: 3.141 ± 0.057
IleVal: 3.76 ± 0.061
IleTrp: 0.777 ± 0.025
IleTyr: 1.213 ± 0.03
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.792 ± 0.061
LysCys: 0.193 ± 0.012
LysAsp: 1.808 ± 0.041
LysGlu: 1.536 ± 0.038
LysPhe: 0.869 ± 0.023
LysGly: 2.544 ± 0.049
LysHis: 0.626 ± 0.024
LysIle: 1.67 ± 0.037
LysLys: 1.109 ± 0.039
LysLeu: 2.872 ± 0.053
LysMet: 0.857 ± 0.025
LysAsn: 0.772 ± 0.024
LysPro: 1.749 ± 0.042
LysGln: 0.956 ± 0.026
LysArg: 2.171 ± 0.043
LysSer: 1.811 ± 0.039
LysThr: 2.021 ± 0.046
LysVal: 2.191 ± 0.048
LysTrp: 0.388 ± 0.015
LysTyr: 0.622 ± 0.021
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.473 ± 0.107
LeuCys: 1.025 ± 0.029
LeuAsp: 6.173 ± 0.08
LeuGlu: 5.35 ± 0.068
LeuPhe: 3.585 ± 0.058
LeuGly: 8.232 ± 0.09
LeuHis: 1.934 ± 0.043
LeuIle: 5.41 ± 0.076
LeuLys: 3.135 ± 0.056
LeuLeu: 9.021 ± 0.134
LeuMet: 2.694 ± 0.048
LeuAsn: 2.74 ± 0.047
LeuPro: 5.531 ± 0.075
LeuGln: 3.184 ± 0.049
LeuArg: 7.048 ± 0.082
LeuSer: 6.706 ± 0.08
LeuThr: 6.127 ± 0.078
LeuVal: 6.649 ± 0.07
LeuTrp: 1.306 ± 0.033
LeuTyr: 1.994 ± 0.041
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.492 ± 0.055
MetCys: 0.207 ± 0.012
MetAsp: 1.473 ± 0.034
MetGlu: 1.306 ± 0.033
MetPhe: 0.806 ± 0.027
MetGly: 2.273 ± 0.049
MetHis: 0.502 ± 0.019
MetIle: 1.642 ± 0.031
MetLys: 1.002 ± 0.028
MetLeu: 2.666 ± 0.046
MetMet: 0.835 ± 0.028
MetAsn: 0.8 ± 0.023
MetPro: 1.465 ± 0.034
MetGln: 1.141 ± 0.027
MetArg: 1.863 ± 0.041
MetSer: 1.867 ± 0.036
MetThr: 2.076 ± 0.042
MetVal: 1.853 ± 0.04
MetTrp: 0.27 ± 0.015
MetTyr: 0.364 ± 0.017
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.104 ± 0.054
AsnCys: 0.264 ± 0.016
AsnAsp: 1.573 ± 0.053
AsnGlu: 1.183 ± 0.032
AsnPhe: 0.988 ± 0.03
AsnGly: 2.345 ± 0.059
AsnHis: 0.507 ± 0.022
AsnIle: 1.447 ± 0.039
AsnLys: 0.648 ± 0.024
AsnLeu: 2.405 ± 0.044
AsnMet: 0.727 ± 0.022
AsnAsn: 0.709 ± 0.032
AsnPro: 1.766 ± 0.037
AsnGln: 0.77 ± 0.024
AsnArg: 1.726 ± 0.035
AsnSer: 1.201 ± 0.03
AsnThr: 1.485 ± 0.04
AsnVal: 1.824 ± 0.042
AsnTrp: 0.455 ± 0.021
AsnTyr: 0.663 ± 0.024
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.349 ± 0.075
ProCys: 0.373 ± 0.021
ProAsp: 3.997 ± 0.056
ProGlu: 4.045 ± 0.066
ProPhe: 1.963 ± 0.045
ProGly: 4.313 ± 0.065
ProHis: 1.081 ± 0.03
ProIle: 2.3 ± 0.046
ProLys: 1.694 ± 0.045
ProLeu: 4.607 ± 0.06
ProMet: 1.328 ± 0.033
ProAsn: 1.356 ± 0.033
ProPro: 2.277 ± 0.049
ProGln: 1.838 ± 0.037
ProArg: 2.703 ± 0.052
ProSer: 2.519 ± 0.049
ProThr: 2.555 ± 0.045
ProVal: 4.06 ± 0.059
ProTrp: 0.7 ± 0.024
ProTyr: 1.094 ± 0.029
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.43 ± 0.069
GlnCys: 0.216 ± 0.014
GlnAsp: 1.893 ± 0.037
GlnGlu: 1.747 ± 0.041
GlnPhe: 1.15 ± 0.03
GlnGly: 2.723 ± 0.05
GlnHis: 0.683 ± 0.024
GlnIle: 2.269 ± 0.041
GlnLys: 1.056 ± 0.029
GlnLeu: 3.192 ± 0.052
GlnMet: 1.183 ± 0.028
GlnAsn: 0.998 ± 0.024
GlnPro: 1.785 ± 0.044
GlnGln: 1.274 ± 0.035
GlnArg: 2.445 ± 0.055
GlnSer: 2.151 ± 0.042
GlnThr: 2.25 ± 0.043
GlnVal: 2.561 ± 0.04
GlnTrp: 0.376 ± 0.018
GlnTyr: 0.612 ± 0.023
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.921 ± 0.083
ArgCys: 0.5 ± 0.021
ArgAsp: 4.134 ± 0.064
ArgGlu: 3.475 ± 0.053
ArgPhe: 2.698 ± 0.047
ArgGly: 4.437 ± 0.07
ArgHis: 1.538 ± 0.038
ArgIle: 3.597 ± 0.063
ArgLys: 2.217 ± 0.043
ArgLeu: 6.896 ± 0.084
ArgMet: 1.929 ± 0.046
ArgAsn: 1.644 ± 0.035
ArgPro: 3.119 ± 0.046
ArgGln: 2.419 ± 0.052
ArgArg: 4.695 ± 0.074
ArgSer: 3.382 ± 0.057
ArgThr: 3.053 ± 0.053
ArgVal: 4.636 ± 0.061
ArgTrp: 0.914 ± 0.028
ArgTyr: 1.576 ± 0.038
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.872 ± 0.075
SerCys: 0.458 ± 0.019
SerAsp: 3.588 ± 0.062
SerGlu: 2.887 ± 0.044
SerPhe: 2.401 ± 0.042
SerGly: 5.425 ± 0.087
SerHis: 1.119 ± 0.026
SerIle: 2.582 ± 0.05
SerLys: 1.631 ± 0.038
SerLeu: 5.137 ± 0.06
SerMet: 1.494 ± 0.036
SerAsn: 1.443 ± 0.042
SerPro: 2.49 ± 0.04
SerGln: 1.675 ± 0.039
SerArg: 3.093 ± 0.05
SerSer: 2.673 ± 0.059
SerThr: 2.742 ± 0.053
SerVal: 3.864 ± 0.061
SerTrp: 0.709 ± 0.026
SerTyr: 1.393 ± 0.03
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.602 ± 0.074
ThrCys: 0.519 ± 0.021
ThrAsp: 3.312 ± 0.062
ThrGlu: 3.02 ± 0.056
ThrPhe: 2.095 ± 0.051
ThrGly: 5.416 ± 0.084
ThrHis: 1.232 ± 0.032
ThrIle: 2.746 ± 0.062
ThrLys: 1.47 ± 0.037
ThrLeu: 6.381 ± 0.072
ThrMet: 1.3 ± 0.032
ThrAsn: 1.373 ± 0.037
ThrPro: 3.465 ± 0.054
ThrGln: 1.888 ± 0.038
ThrArg: 3.556 ± 0.057
ThrSer: 3.023 ± 0.053
ThrThr: 2.996 ± 0.077
ThrVal: 4.364 ± 0.071
ThrTrp: 0.733 ± 0.024
ThrTyr: 1.333 ± 0.036
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.657 ± 0.09
ValCys: 0.697 ± 0.023
ValAsp: 4.156 ± 0.059
ValGlu: 4.431 ± 0.059
ValPhe: 3.002 ± 0.047
ValGly: 5.533 ± 0.066
ValHis: 1.346 ± 0.031
ValIle: 4.262 ± 0.065
ValLys: 2.091 ± 0.044
ValLeu: 7.611 ± 0.085
ValMet: 2.165 ± 0.041
ValAsn: 1.895 ± 0.045
ValPro: 3.657 ± 0.057
ValGln: 2.35 ± 0.039
ValArg: 4.041 ± 0.057
ValSer: 4.404 ± 0.061
ValThr: 4.709 ± 0.07
ValVal: 5.791 ± 0.08
ValTrp: 0.956 ± 0.027
ValTyr: 1.43 ± 0.031
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.362 ± 0.035
TrpCys: 0.144 ± 0.009
TrpAsp: 0.769 ± 0.024
TrpGlu: 0.648 ± 0.025
TrpPhe: 0.606 ± 0.022
TrpGly: 0.959 ± 0.034
TrpHis: 0.34 ± 0.016
TrpIle: 0.713 ± 0.024
TrpLys: 0.399 ± 0.016
TrpLeu: 1.6 ± 0.045
TrpMet: 0.452 ± 0.02
TrpAsn: 0.414 ± 0.017
TrpPro: 0.71 ± 0.023
TrpGln: 0.604 ± 0.021
TrpArg: 1.039 ± 0.029
TrpSer: 0.829 ± 0.026
TrpThr: 0.748 ± 0.027
TrpVal: 0.951 ± 0.028
TrpTrp: 0.214 ± 0.013
TrpTyr: 0.301 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.562 ± 0.048
TyrCys: 0.255 ± 0.016
TyrAsp: 1.62 ± 0.032
TyrGlu: 1.274 ± 0.032
TyrPhe: 0.914 ± 0.025
TyrGly: 2.048 ± 0.037
TyrHis: 0.559 ± 0.021
TyrIle: 0.943 ± 0.025
TyrLys: 0.564 ± 0.024
TyrLeu: 2.24 ± 0.045
TyrMet: 0.499 ± 0.019
TyrAsn: 0.574 ± 0.019
TyrPro: 1.074 ± 0.031
TyrGln: 0.76 ± 0.024
TyrArg: 1.548 ± 0.036
TyrSer: 1.125 ± 0.032
TyrThr: 1.157 ± 0.031
TyrVal: 1.605 ± 0.037
TyrTrp: 0.375 ± 0.017
TyrTyr: 0.593 ± 0.022
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4343 proteins (1330952 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
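A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. The function names, the fixed seed, the toy sequences, and the choice of the standard deviation as the error measure are assumptions for illustration; the page states only that bootstrapping (×100) at the protein level was used.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille frequency.

    Resampling unit: whole proteins, drawn with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not data from this page.
toy = ["MAAL", "AAG", "MKV"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f}")
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the estimate reflects variation between proteins.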

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski