Amino acid dipeptide frequency for Lewinellaceae bacterium SD302

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
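
As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter toy sequences, and the choice not to count pairs across protein boundaries are assumptions made for this example; the page does not describe the exact procedure used by the database.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted inside each protein only, never across
    the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy proteome (one-letter codes), not data from this page.
toy = ["MAAL", "ALLG"]
freqs = dipeptide_per_mille(toy)
```

Here `freqs` maps each observed pair (for example `"AL"`) to its share of all counted pairs, and the values sum to 1000.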

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red, and those that are under-represented are marked in blue in the table.

All values are presented in per mille (‰), and therefore need to be multiplied by 10^-3 to obtain fractions.
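
The uniform expectation and the per mille units can be checked with a few lines of arithmetic. The sketch below uses three values copied from the table (LeuLeu, AlaAla, TrpCys); the helper `classify` and its plain "above or below the uniform expectation" rule are assumptions, since the page does not state the exact threshold behind the red and blue marking.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
expected_per_mille = 1000.0 / N_STANDARD ** 2

def classify(observed, expected=expected_per_mille):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"

# Values taken from the table, in per mille.
examples = {"LeuLeu": 10.832, "AlaAla": 7.521, "TrpCys": 0.111}
labels = {pair: classify(value) for pair, value in examples.items()}

# Per mille to fraction: multiply by 10^-3.
fractions = {pair: value * 1e-3 for pair, value in examples.items()}
```

With these inputs LeuLeu and AlaAla fall above the 2.5‰ expectation and TrpCys falls below it.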

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.521 ± 0.09
AlaCys: 0.812 ± 0.031
AlaAsp: 5.125 ± 0.065
AlaGlu: 5.341 ± 0.071
AlaPhe: 3.399 ± 0.055
AlaGly: 7.114 ± 0.087
AlaHis: 1.241 ± 0.031
AlaIle: 4.7 ± 0.06
AlaLys: 3.414 ± 0.064
AlaLeu: 6.976 ± 0.089
AlaMet: 1.686 ± 0.036
AlaAsn: 3.587 ± 0.055
AlaPro: 3.034 ± 0.056
AlaGln: 2.764 ± 0.046
AlaArg: 3.823 ± 0.062
AlaSer: 4.424 ± 0.052
AlaThr: 5.189 ± 0.088
AlaVal: 4.786 ± 0.062
AlaTrp: 0.835 ± 0.026
AlaTyr: 2.792 ± 0.047
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.699 ± 0.027
CysCys: 0.147 ± 0.013
CysAsp: 0.699 ± 0.04
CysGlu: 0.589 ± 0.027
CysPhe: 0.537 ± 0.028
CysGly: 0.909 ± 0.039
CysHis: 0.204 ± 0.013
CysIle: 0.481 ± 0.015
CysLys: 0.274 ± 0.013
CysLeu: 0.915 ± 0.025
CysMet: 0.153 ± 0.011
CysAsn: 0.496 ± 0.025
CysPro: 0.571 ± 0.038
CysGln: 0.375 ± 0.017
CysArg: 0.45 ± 0.02
CysSer: 0.739 ± 0.036
CysThr: 0.584 ± 0.033
CysVal: 0.574 ± 0.026
CysTrp: 0.126 ± 0.01
CysTyr: 0.333 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.584 ± 0.092
AspCys: 0.78 ± 0.04
AspAsp: 3.834 ± 0.111
AspGlu: 4.272 ± 0.052
AspPhe: 3.613 ± 0.062
AspGly: 5.703 ± 0.135
AspHis: 1.18 ± 0.031
AspIle: 3.484 ± 0.062
AspLys: 2.311 ± 0.052
AspLeu: 6.529 ± 0.078
AspMet: 1.078 ± 0.028
AspAsn: 3.059 ± 0.068
AspPro: 3.112 ± 0.061
AspGln: 2.392 ± 0.038
AspArg: 3.406 ± 0.055
AspSer: 3.331 ± 0.054
AspThr: 3.147 ± 0.08
AspVal: 3.843 ± 0.068
AspTrp: 0.969 ± 0.027
AspTyr: 2.834 ± 0.051
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.299 ± 0.085
GluCys: 0.483 ± 0.023
GluAsp: 3.943 ± 0.068
GluGlu: 4.687 ± 0.078
GluPhe: 2.672 ± 0.043
GluGly: 4.225 ± 0.061
GluHis: 1.018 ± 0.029
GluIle: 4.379 ± 0.062
GluLys: 3.327 ± 0.068
GluLeu: 7.437 ± 0.089
GluMet: 1.631 ± 0.035
GluAsn: 3.268 ± 0.046
GluPro: 2.321 ± 0.042
GluGln: 2.681 ± 0.049
GluArg: 3.669 ± 0.061
GluSer: 3.274 ± 0.048
GluThr: 3.238 ± 0.052
GluVal: 4.731 ± 0.052
GluTrp: 0.85 ± 0.027
GluTyr: 2.172 ± 0.039
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.444 ± 0.053
PheCys: 0.485 ± 0.017
PheAsp: 3.248 ± 0.059
PheGlu: 2.784 ± 0.04
PhePhe: 2.3 ± 0.044
PheGly: 3.888 ± 0.057
PheHis: 0.768 ± 0.025
PheIle: 2.604 ± 0.044
PheLys: 1.721 ± 0.042
PheLeu: 4.419 ± 0.065
PheMet: 0.834 ± 0.023
PheAsn: 2.457 ± 0.051
PhePro: 1.914 ± 0.038
PheGln: 1.609 ± 0.032
PheArg: 2.403 ± 0.053
PheSer: 3.652 ± 0.058
PheThr: 3.272 ± 0.06
PheVal: 2.954 ± 0.049
PheTrp: 0.579 ± 0.021
PheTyr: 1.798 ± 0.041
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.115 ± 0.069
GlyCys: 1.085 ± 0.053
GlyAsp: 4.749 ± 0.097
GlyGlu: 5.039 ± 0.063
GlyPhe: 3.638 ± 0.052
GlyGly: 6.514 ± 0.139
GlyHis: 1.31 ± 0.034
GlyIle: 4.654 ± 0.064
GlyLys: 3.751 ± 0.082
GlyLeu: 7.369 ± 0.079
GlyMet: 1.825 ± 0.04
GlyAsn: 3.97 ± 0.084
GlyPro: 2.608 ± 0.051
GlyGln: 3.151 ± 0.052
GlyArg: 3.959 ± 0.065
GlySer: 4.923 ± 0.071
GlyThr: 4.619 ± 0.083
GlyVal: 4.869 ± 0.064
GlyTrp: 1.005 ± 0.029
GlyTyr: 2.829 ± 0.049
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.14 ± 0.032
HisCys: 0.227 ± 0.013
HisAsp: 0.961 ± 0.027
HisGlu: 0.996 ± 0.031
HisPhe: 1.009 ± 0.027
HisGly: 1.179 ± 0.033
HisHis: 0.505 ± 0.022
HisIle: 0.829 ± 0.024
HisLys: 0.576 ± 0.019
HisLeu: 2.027 ± 0.043
HisMet: 0.255 ± 0.012
HisAsn: 0.666 ± 0.025
HisPro: 1.173 ± 0.031
HisGln: 0.696 ± 0.023
HisArg: 1.092 ± 0.03
HisSer: 0.909 ± 0.026
HisThr: 0.82 ± 0.024
HisVal: 0.941 ± 0.024
HisTrp: 0.281 ± 0.014
HisTyr: 0.887 ± 0.027
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.62 ± 0.066
IleCys: 0.705 ± 0.024
IleAsp: 4.408 ± 0.064
IleGlu: 4.076 ± 0.062
IlePhe: 2.725 ± 0.052
IleGly: 4.607 ± 0.071
IleHis: 1.088 ± 0.029
IleIle: 3.643 ± 0.062
IleLys: 2.505 ± 0.05
IleLeu: 5.286 ± 0.073
IleMet: 1.137 ± 0.03
IleAsn: 3.106 ± 0.048
IlePro: 2.636 ± 0.047
IleGln: 1.944 ± 0.038
IleArg: 3.043 ± 0.047
IleSer: 4.196 ± 0.06
IleThr: 3.911 ± 0.081
IleVal: 3.864 ± 0.054
IleTrp: 0.657 ± 0.019
IleTyr: 2.251 ± 0.039
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.424 ± 0.071
LysCys: 0.211 ± 0.013
LysAsp: 2.264 ± 0.051
LysGlu: 2.91 ± 0.057
LysPhe: 1.571 ± 0.036
LysGly: 2.665 ± 0.056
LysHis: 0.725 ± 0.025
LysIle: 2.916 ± 0.059
LysLys: 2.586 ± 0.069
LysLeu: 4.417 ± 0.081
LysMet: 1.252 ± 0.034
LysAsn: 1.897 ± 0.041
LysPro: 1.673 ± 0.039
LysGln: 1.571 ± 0.043
LysArg: 2.359 ± 0.051
LysSer: 2.511 ± 0.051
LysThr: 2.359 ± 0.05
LysVal: 3.029 ± 0.055
LysTrp: 0.483 ± 0.02
LysTyr: 1.511 ± 0.043
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.523 ± 0.091
LeuCys: 0.844 ± 0.024
LeuAsp: 6.185 ± 0.069
LeuGlu: 6.418 ± 0.073
LeuPhe: 4.5 ± 0.066
LeuGly: 6.723 ± 0.076
LeuHis: 1.678 ± 0.038
LeuIle: 5.842 ± 0.066
LeuLys: 4.385 ± 0.079
LeuLeu: 10.832 ± 0.139
LeuMet: 1.866 ± 0.039
LeuAsn: 4.898 ± 0.065
LeuPro: 5.13 ± 0.066
LeuGln: 3.316 ± 0.05
LeuArg: 5.728 ± 0.084
LeuSer: 6.991 ± 0.078
LeuThr: 6.11 ± 0.09
LeuVal: 6.121 ± 0.067
LeuTrp: 0.951 ± 0.028
LeuTyr: 3.217 ± 0.062
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.776 ± 0.042
MetCys: 0.115 ± 0.009
MetAsp: 1.276 ± 0.031
MetGlu: 1.197 ± 0.03
MetPhe: 0.604 ± 0.02
MetGly: 1.453 ± 0.032
MetHis: 0.378 ± 0.016
MetIle: 1.333 ± 0.027
MetLys: 1.155 ± 0.031
MetLeu: 1.987 ± 0.037
MetMet: 0.507 ± 0.018
MetAsn: 0.995 ± 0.025
MetPro: 1.028 ± 0.027
MetGln: 0.805 ± 0.023
MetArg: 1.125 ± 0.025
MetSer: 1.291 ± 0.03
MetThr: 1.245 ± 0.031
MetVal: 1.342 ± 0.027
MetTrp: 0.15 ± 0.01
MetTyr: 0.513 ± 0.018
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.33 ± 0.052
AsnCys: 0.653 ± 0.043
AsnAsp: 3.188 ± 0.058
AsnGlu: 2.955 ± 0.051
AsnPhe: 2.457 ± 0.042
AsnGly: 4.552 ± 0.086
AsnHis: 0.792 ± 0.022
AsnIle: 2.734 ± 0.049
AsnLys: 1.697 ± 0.036
AsnLeu: 4.542 ± 0.057
AsnMet: 0.842 ± 0.019
AsnAsn: 2.516 ± 0.055
AsnPro: 2.688 ± 0.05
AsnGln: 1.834 ± 0.037
AsnArg: 2.441 ± 0.046
AsnSer: 2.858 ± 0.047
AsnThr: 2.652 ± 0.054
AsnVal: 3.02 ± 0.046
AsnTrp: 0.82 ± 0.026
AsnTyr: 2.185 ± 0.039
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.069 ± 0.063
ProCys: 0.329 ± 0.023
ProAsp: 3.354 ± 0.063
ProGlu: 3.638 ± 0.049
ProPhe: 2.042 ± 0.042
ProGly: 3.702 ± 0.064
ProHis: 0.701 ± 0.022
ProIle: 2.404 ± 0.042
ProLys: 1.591 ± 0.036
ProLeu: 3.922 ± 0.051
ProMet: 0.736 ± 0.019
ProAsn: 2.275 ± 0.043
ProPro: 1.907 ± 0.068
ProGln: 1.429 ± 0.031
ProArg: 1.683 ± 0.035
ProSer: 2.382 ± 0.045
ProThr: 2.72 ± 0.05
ProVal: 3.365 ± 0.049
ProTrp: 0.468 ± 0.017
ProTyr: 1.482 ± 0.034
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.776 ± 0.04
GlnCys: 0.235 ± 0.014
GlnAsp: 1.997 ± 0.039
GlnGlu: 2.307 ± 0.04
GlnPhe: 1.597 ± 0.037
GlnGly: 2.263 ± 0.047
GlnHis: 0.647 ± 0.022
GlnIle: 2.142 ± 0.037
GlnLys: 1.484 ± 0.037
GlnLeu: 4.658 ± 0.069
GlnMet: 0.836 ± 0.025
GlnAsn: 1.593 ± 0.032
GlnPro: 1.658 ± 0.032
GlnGln: 1.816 ± 0.043
GlnArg: 2.218 ± 0.044
GlnSer: 2.261 ± 0.041
GlnThr: 2.045 ± 0.037
GlnVal: 2.389 ± 0.043
GlnTrp: 0.437 ± 0.017
GlnTyr: 1.251 ± 0.029
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.606 ± 0.061
ArgCys: 0.359 ± 0.016
ArgAsp: 2.809 ± 0.051
ArgGlu: 3.455 ± 0.064
ArgPhe: 2.669 ± 0.049
ArgGly: 3.03 ± 0.055
ArgHis: 1.015 ± 0.028
ArgIle: 3.59 ± 0.048
ArgLys: 2.821 ± 0.057
ArgLeu: 5.597 ± 0.085
ArgMet: 1.356 ± 0.032
ArgAsn: 2.551 ± 0.043
ArgPro: 2.23 ± 0.042
ArgGln: 2.226 ± 0.046
ArgArg: 3.199 ± 0.059
ArgSer: 3.178 ± 0.049
ArgThr: 2.482 ± 0.039
ArgVal: 3.177 ± 0.049
ArgTrp: 0.715 ± 0.024
ArgTyr: 2.281 ± 0.05
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.648 ± 0.059
SerCys: 0.757 ± 0.032
SerAsp: 3.592 ± 0.062
SerGlu: 3.495 ± 0.049
SerPhe: 3.164 ± 0.043
SerGly: 5.646 ± 0.077
SerHis: 0.893 ± 0.027
SerIle: 4.075 ± 0.059
SerLys: 2.401 ± 0.049
SerLeu: 6.365 ± 0.071
SerMet: 1.197 ± 0.028
SerAsn: 2.899 ± 0.049
SerPro: 2.863 ± 0.048
SerGln: 2.043 ± 0.037
SerArg: 2.93 ± 0.054
SerSer: 4.115 ± 0.063
SerThr: 3.566 ± 0.053
SerVal: 4.166 ± 0.065
SerTrp: 0.896 ± 0.027
SerTyr: 2.354 ± 0.042
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.914 ± 0.083
ThrCys: 0.524 ± 0.029
ThrAsp: 4.218 ± 0.099
ThrGlu: 3.665 ± 0.051
ThrPhe: 2.928 ± 0.059
ThrGly: 4.867 ± 0.073
ThrHis: 0.911 ± 0.025
ThrIle: 4.135 ± 0.088
ThrLys: 1.926 ± 0.042
ThrLeu: 5.68 ± 0.085
ThrMet: 0.974 ± 0.027
ThrAsn: 2.666 ± 0.049
ThrPro: 2.874 ± 0.059
ThrGln: 1.662 ± 0.038
ThrArg: 2.253 ± 0.045
ThrSer: 3.45 ± 0.052
ThrThr: 3.793 ± 0.081
ThrVal: 4.659 ± 0.114
ThrTrp: 0.52 ± 0.021
ThrTyr: 2.422 ± 0.063
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.556 ± 0.069
ValCys: 0.616 ± 0.025
ValAsp: 4.449 ± 0.055
ValGlu: 4.38 ± 0.058
ValPhe: 3.056 ± 0.049
ValGly: 4.357 ± 0.064
ValHis: 1.095 ± 0.031
ValIle: 4.244 ± 0.057
ValLys: 2.637 ± 0.048
ValLeu: 5.771 ± 0.064
ValMet: 1.313 ± 0.031
ValAsn: 3.503 ± 0.057
ValPro: 2.662 ± 0.045
ValGln: 1.983 ± 0.036
ValArg: 3.214 ± 0.059
ValSer: 4.516 ± 0.061
ValThr: 4.4 ± 0.118
ValVal: 4.602 ± 0.066
ValTrp: 0.715 ± 0.025
ValTyr: 2.382 ± 0.041
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.778 ± 0.026
TrpCys: 0.111 ± 0.009
TrpAsp: 0.659 ± 0.023
TrpGlu: 0.721 ± 0.022
TrpPhe: 0.52 ± 0.02
TrpGly: 0.815 ± 0.024
TrpHis: 0.23 ± 0.013
TrpIle: 0.617 ± 0.021
TrpLys: 0.56 ± 0.021
TrpLeu: 1.423 ± 0.037
TrpMet: 0.311 ± 0.015
TrpAsn: 0.621 ± 0.02
TrpPro: 0.442 ± 0.016
TrpGln: 0.589 ± 0.019
TrpArg: 0.78 ± 0.027
TrpSer: 0.886 ± 0.032
TrpThr: 0.733 ± 0.024
TrpVal: 0.713 ± 0.023
TrpTrp: 0.256 ± 0.015
TrpTyr: 0.447 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.741 ± 0.042
TyrCys: 0.376 ± 0.019
TyrAsp: 2.504 ± 0.044
TyrGlu: 2.382 ± 0.038
TyrPhe: 2.132 ± 0.046
TyrGly: 2.772 ± 0.05
TyrHis: 0.787 ± 0.025
TyrIle: 1.601 ± 0.03
TyrLys: 1.212 ± 0.035
TyrLeu: 4.106 ± 0.055
TyrMet: 0.483 ± 0.018
TyrAsn: 1.732 ± 0.042
TyrPro: 1.674 ± 0.038
TyrGln: 1.725 ± 0.034
TyrArg: 2.571 ± 0.046
TyrSer: 2.181 ± 0.041
TyrThr: 2.155 ± 0.054
TyrVal: 2.308 ± 0.04
TyrTrp: 0.502 ± 0.019
TyrTyr: 1.639 ± 0.035
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4052 proteins (1596819 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
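
A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the spread of those estimates is reported as the error. Only "bootstrapping (x100) at the protein level" comes from this page; the function name `bootstrap_error`, the use of the sample standard deviation as the error, and the fixed seed are assumptions.

```python
import random
from collections import Counter
from statistics import stdev

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: proteins are resampled with replacement and the error is
    the standard deviation of the replicate frequencies.
    """
    rng = random.Random(seed)
    # Count overlapping pairs once per protein so each replicate is cheap.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in sequences
    ]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)

# Hypothetical toy proteome (one-letter codes), not data from this page.
toy = ["MAALLG", "ALAL", "GGLLA", "MKLLV"]
err = bootstrap_error(toy, "LL")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs of one protein together, so the estimate reflects variation between proteins.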

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski