Amino acid dipepetide frequency for Companilactobacillus farciminis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.884AlaAla: 4.884 ± 0.102
0.326AlaCys: 0.326 ± 0.023
4.143AlaAsp: 4.143 ± 0.088
3.382AlaGlu: 3.382 ± 0.08
2.914AlaPhe: 2.914 ± 0.069
4.629AlaGly: 4.629 ± 0.101
1.14AlaHis: 1.14 ± 0.043
5.534AlaIle: 5.534 ± 0.114
5.362AlaLys: 5.362 ± 0.116
6.368AlaLeu: 6.368 ± 0.098
1.859AlaMet: 1.859 ± 0.057
3.458AlaAsn: 3.458 ± 0.074
1.896AlaPro: 1.896 ± 0.059
2.782AlaGln: 2.782 ± 0.066
2.113AlaArg: 2.113 ± 0.056
3.962AlaSer: 3.962 ± 0.074
4.163AlaThr: 4.163 ± 0.083
4.7AlaVal: 4.7 ± 0.097
0.596AlaTrp: 0.596 ± 0.028
2.22AlaTyr: 2.22 ± 0.055
0.0AlaXaa: 0.0 ± 0.0
Cys
0.345CysAla: 0.345 ± 0.024
0.038CysCys: 0.038 ± 0.008
0.312CysAsp: 0.312 ± 0.022
0.204CysGlu: 0.204 ± 0.018
0.223CysPhe: 0.223 ± 0.018
0.414CysGly: 0.414 ± 0.023
0.132CysHis: 0.132 ± 0.013
0.296CysIle: 0.296 ± 0.021
0.209CysLys: 0.209 ± 0.017
0.477CysLeu: 0.477 ± 0.024
0.123CysMet: 0.123 ± 0.013
0.162CysAsn: 0.162 ± 0.016
0.203CysPro: 0.203 ± 0.018
0.184CysGln: 0.184 ± 0.015
0.155CysArg: 0.155 ± 0.015
0.303CysSer: 0.303 ± 0.02
0.228CysThr: 0.228 ± 0.02
0.304CysVal: 0.304 ± 0.019
0.051CysTrp: 0.051 ± 0.01
0.145CysTyr: 0.145 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
3.759AspAla: 3.759 ± 0.088
0.262AspCys: 0.262 ± 0.024
4.362AspAsp: 4.362 ± 0.104
4.347AspGlu: 4.347 ± 0.098
3.287AspPhe: 3.287 ± 0.071
3.89AspGly: 3.89 ± 0.089
1.254AspHis: 1.254 ± 0.041
4.555AspIle: 4.555 ± 0.1
5.192AspLys: 5.192 ± 0.095
6.11AspLeu: 6.11 ± 0.099
1.616AspMet: 1.616 ± 0.054
3.645AspAsn: 3.645 ± 0.085
2.238AspPro: 2.238 ± 0.067
2.568AspGln: 2.568 ± 0.061
1.955AspArg: 1.955 ± 0.057
3.811AspSer: 3.811 ± 0.091
3.313AspThr: 3.313 ± 0.085
4.143AspVal: 4.143 ± 0.097
0.709AspTrp: 0.709 ± 0.031
2.794AspTyr: 2.794 ± 0.065
0.0AspXaa: 0.0 ± 0.0
Glu
3.374GluAla: 3.374 ± 0.083
0.177GluCys: 0.177 ± 0.017
3.423GluAsp: 3.423 ± 0.082
3.426GluGlu: 3.426 ± 0.076
2.419GluPhe: 2.419 ± 0.061
2.601GluGly: 2.601 ± 0.076
1.138GluHis: 1.138 ± 0.047
4.566GluIle: 4.566 ± 0.09
5.101GluLys: 5.101 ± 0.082
5.592GluLeu: 5.592 ± 0.102
1.698GluMet: 1.698 ± 0.049
3.6GluAsn: 3.6 ± 0.083
1.543GluPro: 1.543 ± 0.049
2.319GluGln: 2.319 ± 0.063
1.869GluArg: 1.869 ± 0.056
2.911GluSer: 2.911 ± 0.069
3.02GluThr: 3.02 ± 0.067
3.642GluVal: 3.642 ± 0.081
0.53GluTrp: 0.53 ± 0.031
2.071GluTyr: 2.071 ± 0.056
0.0GluXaa: 0.0 ± 0.0
Phe
2.846PheAla: 2.846 ± 0.068
0.236PheCys: 0.236 ± 0.017
3.179PheAsp: 3.179 ± 0.064
2.365PheGlu: 2.365 ± 0.063
2.204PhePhe: 2.204 ± 0.075
3.348PheGly: 3.348 ± 0.08
0.797PheHis: 0.797 ± 0.033
3.639PheIle: 3.639 ± 0.089
2.923PheLys: 2.923 ± 0.062
4.197PheLeu: 4.197 ± 0.099
1.106PheMet: 1.106 ± 0.036
2.655PheAsn: 2.655 ± 0.06
1.452PhePro: 1.452 ± 0.049
1.44PheGln: 1.44 ± 0.04
1.239PheArg: 1.239 ± 0.042
3.317PheSer: 3.317 ± 0.076
2.633PheThr: 2.633 ± 0.056
3.32PheVal: 3.32 ± 0.071
0.542PheTrp: 0.542 ± 0.033
1.8PheTyr: 1.8 ± 0.053
0.0PheXaa: 0.0 ± 0.0
Gly
4.226GlyAla: 4.226 ± 0.082
0.346GlyCys: 0.346 ± 0.019
3.524GlyAsp: 3.524 ± 0.081
2.981GlyGlu: 2.981 ± 0.067
3.126GlyPhe: 3.126 ± 0.078
4.059GlyGly: 4.059 ± 0.108
1.261GlyHis: 1.261 ± 0.046
6.0GlyIle: 6.0 ± 0.094
4.794GlyLys: 4.794 ± 0.081
6.214GlyLeu: 6.214 ± 0.105
1.826GlyMet: 1.826 ± 0.058
3.161GlyAsn: 3.161 ± 0.081
1.482GlyPro: 1.482 ± 0.048
2.484GlyGln: 2.484 ± 0.066
2.119GlyArg: 2.119 ± 0.059
4.253GlySer: 4.253 ± 0.077
4.211GlyThr: 4.211 ± 0.09
4.704GlyVal: 4.704 ± 0.089
0.597GlyTrp: 0.597 ± 0.032
2.749GlyTyr: 2.749 ± 0.063
0.0GlyXaa: 0.0 ± 0.0
His
1.132HisAla: 1.132 ± 0.042
0.096HisCys: 0.096 ± 0.011
1.135HisAsp: 1.135 ± 0.039
0.977HisGlu: 0.977 ± 0.035
0.943HisPhe: 0.943 ± 0.042
1.298HisGly: 1.298 ± 0.044
0.564HisHis: 0.564 ± 0.028
1.362HisIle: 1.362 ± 0.048
1.1HisLys: 1.1 ± 0.044
1.836HisLeu: 1.836 ± 0.059
0.462HisMet: 0.462 ± 0.029
1.069HisAsn: 1.069 ± 0.043
0.871HisPro: 0.871 ± 0.038
0.839HisGln: 0.839 ± 0.032
0.738HisArg: 0.738 ± 0.032
1.11HisSer: 1.11 ± 0.043
0.935HisThr: 0.935 ± 0.036
1.223HisVal: 1.223 ± 0.048
0.212HisTrp: 0.212 ± 0.018
0.793HisTyr: 0.793 ± 0.032
0.0HisXaa: 0.0 ± 0.0
Ile
5.889IleAla: 5.889 ± 0.095
0.507IleCys: 0.507 ± 0.028
5.236IleAsp: 5.236 ± 0.088
4.143IleGlu: 4.143 ± 0.085
3.768IlePhe: 3.768 ± 0.093
5.494IleGly: 5.494 ± 0.102
1.335IleHis: 1.335 ± 0.048
6.16IleIle: 6.16 ± 0.112
5.771IleLys: 5.771 ± 0.095
7.349IleLeu: 7.349 ± 0.134
1.975IleMet: 1.975 ± 0.058
4.627IleAsn: 4.627 ± 0.09
3.046IlePro: 3.046 ± 0.078
2.714IleGln: 2.714 ± 0.073
2.309IleArg: 2.309 ± 0.064
5.542IleSer: 5.542 ± 0.094
4.756IleThr: 4.756 ± 0.083
5.795IleVal: 5.795 ± 0.103
0.643IleTrp: 0.643 ± 0.029
2.697IleTyr: 2.697 ± 0.07
0.0IleXaa: 0.0 ± 0.0
Lys
4.784LysAla: 4.784 ± 0.091
0.174LysCys: 0.174 ± 0.018
5.066LysAsp: 5.066 ± 0.088
4.889LysGlu: 4.889 ± 0.099
2.913LysPhe: 2.913 ± 0.064
3.55LysGly: 3.55 ± 0.081
1.371LysHis: 1.371 ± 0.051
6.23LysIle: 6.23 ± 0.092
7.094LysLys: 7.094 ± 0.121
6.742LysLeu: 6.742 ± 0.103
2.593LysMet: 2.593 ± 0.059
5.205LysAsn: 5.205 ± 0.095
2.306LysPro: 2.306 ± 0.061
3.062LysGln: 3.062 ± 0.071
2.843LysArg: 2.843 ± 0.062
4.31LysSer: 4.31 ± 0.097
4.674LysThr: 4.674 ± 0.085
5.062LysVal: 5.062 ± 0.094
0.643LysTrp: 0.643 ± 0.03
3.392LysTyr: 3.392 ± 0.074
0.0LysXaa: 0.0 ± 0.0
Leu
6.776LeuAla: 6.776 ± 0.11
0.499LeuCys: 0.499 ± 0.031
5.871LeuAsp: 5.871 ± 0.109
4.707LeuGlu: 4.707 ± 0.081
3.981LeuPhe: 3.981 ± 0.088
6.378LeuGly: 6.378 ± 0.117
1.6LeuHis: 1.6 ± 0.046
7.395LeuIle: 7.395 ± 0.153
7.217LeuLys: 7.217 ± 0.104
8.714LeuLeu: 8.714 ± 0.144
2.469LeuMet: 2.469 ± 0.058
5.671LeuAsn: 5.671 ± 0.109
3.836LeuPro: 3.836 ± 0.079
3.362LeuGln: 3.362 ± 0.083
3.091LeuArg: 3.091 ± 0.08
6.652LeuSer: 6.652 ± 0.108
5.881LeuThr: 5.881 ± 0.095
6.494LeuVal: 6.494 ± 0.117
0.755LeuTrp: 0.755 ± 0.034
2.813LeuTyr: 2.813 ± 0.064
0.0LeuXaa: 0.0 ± 0.0
Met
2.024MetAla: 2.024 ± 0.058
0.123MetCys: 0.123 ± 0.013
1.556MetAsp: 1.556 ± 0.05
1.316MetGlu: 1.316 ± 0.049
1.074MetPhe: 1.074 ± 0.042
1.791MetGly: 1.791 ± 0.046
0.464MetHis: 0.464 ± 0.025
2.203MetIle: 2.203 ± 0.056
2.122MetLys: 2.122 ± 0.052
2.275MetLeu: 2.275 ± 0.066
0.835MetMet: 0.835 ± 0.034
1.677MetAsn: 1.677 ± 0.054
1.098MetPro: 1.098 ± 0.04
1.046MetGln: 1.046 ± 0.035
0.932MetArg: 0.932 ± 0.038
1.898MetSer: 1.898 ± 0.055
1.819MetThr: 1.819 ± 0.047
1.68MetVal: 1.68 ± 0.05
0.199MetTrp: 0.199 ± 0.019
0.729MetTyr: 0.729 ± 0.029
0.0MetXaa: 0.0 ± 0.0
Asn
3.384AsnAla: 3.384 ± 0.07
0.299AsnCys: 0.299 ± 0.023
3.771AsnAsp: 3.771 ± 0.083
3.219AsnGlu: 3.219 ± 0.075
2.558AsnPhe: 2.558 ± 0.063
3.987AsnGly: 3.987 ± 0.1
1.309AsnHis: 1.309 ± 0.046
4.268AsnIle: 4.268 ± 0.087
4.376AsnLys: 4.376 ± 0.091
5.416AsnLeu: 5.416 ± 0.099
1.362AsnMet: 1.362 ± 0.042
3.877AsnAsn: 3.877 ± 0.119
2.39AsnPro: 2.39 ± 0.059
2.622AsnGln: 2.622 ± 0.069
2.035AsnArg: 2.035 ± 0.059
3.803AsnSer: 3.803 ± 0.105
2.874AsnThr: 2.874 ± 0.08
3.675AsnVal: 3.675 ± 0.069
0.654AsnTrp: 0.654 ± 0.034
2.513AsnTyr: 2.513 ± 0.062
0.0AsnXaa: 0.0 ± 0.0
Pro
2.116ProAla: 2.116 ± 0.066
0.129ProCys: 0.129 ± 0.011
2.381ProAsp: 2.381 ± 0.066
2.58ProGlu: 2.58 ± 0.063
1.665ProPhe: 1.665 ± 0.059
1.948ProGly: 1.948 ± 0.061
0.581ProHis: 0.581 ± 0.027
2.735ProIle: 2.735 ± 0.071
2.551ProLys: 2.551 ± 0.063
2.923ProLeu: 2.923 ± 0.057
0.851ProMet: 0.851 ± 0.033
2.056ProAsn: 2.056 ± 0.064
0.497ProPro: 0.497 ± 0.028
1.326ProGln: 1.326 ± 0.041
0.977ProArg: 0.977 ± 0.045
2.023ProSer: 2.023 ± 0.06
2.226ProThr: 2.226 ± 0.07
2.59ProVal: 2.59 ± 0.067
0.348ProTrp: 0.348 ± 0.021
1.298ProTyr: 1.298 ± 0.043
0.0ProXaa: 0.0 ± 0.0
Gln
2.92GlnAla: 2.92 ± 0.079
0.1GlnCys: 0.1 ± 0.012
1.993GlnAsp: 1.993 ± 0.06
2.343GlnGlu: 2.343 ± 0.067
1.51GlnPhe: 1.51 ± 0.051
1.906GlnGly: 1.906 ± 0.05
0.677GlnHis: 0.677 ± 0.033
3.207GlnIle: 3.207 ± 0.069
3.498GlnLys: 3.498 ± 0.079
3.869GlnLeu: 3.869 ± 0.087
1.177GlnMet: 1.177 ± 0.052
2.41GlnAsn: 2.41 ± 0.064
1.236GlnPro: 1.236 ± 0.038
1.851GlnGln: 1.851 ± 0.056
1.625GlnArg: 1.625 ± 0.052
2.098GlnSer: 2.098 ± 0.059
2.49GlnThr: 2.49 ± 0.065
2.684GlnVal: 2.684 ± 0.07
0.351GlnTrp: 0.351 ± 0.024
1.371GlnTyr: 1.371 ± 0.043
0.0GlnXaa: 0.0 ± 0.0
Arg
2.053ArgAla: 2.053 ± 0.061
0.141ArgCys: 0.141 ± 0.015
2.04ArgAsp: 2.04 ± 0.063
1.951ArgGlu: 1.951 ± 0.056
1.532ArgPhe: 1.532 ± 0.049
1.891ArgGly: 1.891 ± 0.053
0.713ArgHis: 0.713 ± 0.035
2.591ArgIle: 2.591 ± 0.061
2.601ArgLys: 2.601 ± 0.073
3.284ArgLeu: 3.284 ± 0.079
1.006ArgMet: 1.006 ± 0.038
1.974ArgAsn: 1.974 ± 0.055
1.083ArgPro: 1.083 ± 0.047
1.419ArgGln: 1.419 ± 0.048
1.588ArgArg: 1.588 ± 0.055
1.785ArgSer: 1.785 ± 0.051
1.697ArgThr: 1.697 ± 0.052
2.322ArgVal: 2.322 ± 0.06
0.271ArgTrp: 0.271 ± 0.018
1.42ArgTyr: 1.42 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
4.013SerAla: 4.013 ± 0.086
0.204SerCys: 0.204 ± 0.019
4.401SerAsp: 4.401 ± 0.098
3.501SerGlu: 3.501 ± 0.075
3.132SerPhe: 3.132 ± 0.071
4.665SerGly: 4.665 ± 0.083
1.13SerHis: 1.13 ± 0.04
4.971SerIle: 4.971 ± 0.092
4.679SerLys: 4.679 ± 0.078
5.987SerLeu: 5.987 ± 0.085
1.596SerMet: 1.596 ± 0.059
3.484SerAsn: 3.484 ± 0.096
1.884SerPro: 1.884 ± 0.06
2.487SerGln: 2.487 ± 0.065
2.132SerArg: 2.132 ± 0.056
4.329SerSer: 4.329 ± 0.12
3.79SerThr: 3.79 ± 0.087
4.221SerVal: 4.221 ± 0.079
0.701SerTrp: 0.701 ± 0.034
2.419SerTyr: 2.419 ± 0.061
0.0SerXaa: 0.0 ± 0.0
Thr
4.071ThrAla: 4.071 ± 0.097
0.216ThrCys: 0.216 ± 0.017
3.877ThrAsp: 3.877 ± 0.096
2.89ThrGlu: 2.89 ± 0.073
2.553ThrPhe: 2.553 ± 0.06
4.34ThrGly: 4.34 ± 0.091
0.987ThrHis: 0.987 ± 0.041
5.12ThrIle: 5.12 ± 0.091
4.491ThrLys: 4.491 ± 0.088
5.288ThrLeu: 5.288 ± 0.102
1.39ThrMet: 1.39 ± 0.045
3.487ThrAsn: 3.487 ± 0.093
2.523ThrPro: 2.523 ± 0.061
1.914ThrGln: 1.914 ± 0.061
1.698ThrArg: 1.698 ± 0.05
3.846ThrSer: 3.846 ± 0.083
3.943ThrThr: 3.943 ± 0.137
4.378ThrVal: 4.378 ± 0.104
0.584ThrTrp: 0.584 ± 0.027
2.155ThrTyr: 2.155 ± 0.059
0.0ThrXaa: 0.0 ± 0.0
Val
5.124ValAla: 5.124 ± 0.099
0.4ValCys: 0.4 ± 0.024
4.578ValAsp: 4.578 ± 0.09
3.635ValGlu: 3.635 ± 0.079
2.885ValPhe: 2.885 ± 0.073
4.878ValGly: 4.878 ± 0.099
1.123ValHis: 1.123 ± 0.041
5.645ValIle: 5.645 ± 0.113
4.932ValLys: 4.932 ± 0.089
6.389ValLeu: 6.389 ± 0.096
1.748ValMet: 1.748 ± 0.052
3.558ValAsn: 3.558 ± 0.075
2.59ValPro: 2.59 ± 0.069
2.294ValGln: 2.294 ± 0.067
2.02ValArg: 2.02 ± 0.063
4.698ValSer: 4.698 ± 0.083
4.497ValThr: 4.497 ± 0.099
4.94ValVal: 4.94 ± 0.1
0.588ValTrp: 0.588 ± 0.03
2.32ValTyr: 2.32 ± 0.06
0.0ValXaa: 0.0 ± 0.0
Trp
0.541TrpAla: 0.541 ± 0.029
0.049TrpCys: 0.049 ± 0.007
0.532TrpAsp: 0.532 ± 0.027
0.399TrpGlu: 0.399 ± 0.027
0.512TrpPhe: 0.512 ± 0.031
0.585TrpGly: 0.585 ± 0.03
0.239TrpHis: 0.239 ± 0.02
0.83TrpIle: 0.83 ± 0.037
0.591TrpLys: 0.591 ± 0.032
1.068TrpLeu: 1.068 ± 0.044
0.275TrpMet: 0.275 ± 0.019
0.59TrpAsn: 0.59 ± 0.028
0.246TrpPro: 0.246 ± 0.02
0.439TrpGln: 0.439 ± 0.023
0.339TrpArg: 0.339 ± 0.022
0.603TrpSer: 0.603 ± 0.028
0.562TrpThr: 0.562 ± 0.032
0.577TrpVal: 0.577 ± 0.031
0.146TrpTrp: 0.146 ± 0.017
0.413TrpTyr: 0.413 ± 0.029
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.171TyrAla: 2.171 ± 0.05
0.185TyrCys: 0.185 ± 0.016
2.429TyrAsp: 2.429 ± 0.056
1.767TyrGlu: 1.767 ± 0.06
2.04TyrPhe: 2.04 ± 0.053
2.465TyrGly: 2.465 ± 0.059
0.881TyrHis: 0.881 ± 0.036
2.451TyrIle: 2.451 ± 0.063
2.301TyrLys: 2.301 ± 0.057
4.063TyrLeu: 4.063 ± 0.094
0.893TyrMet: 0.893 ± 0.034
2.023TyrAsn: 2.023 ± 0.054
1.38TyrPro: 1.38 ± 0.043
2.162TyrGln: 2.162 ± 0.066
1.585TyrArg: 1.585 ± 0.056
2.469TyrSer: 2.469 ± 0.066
2.04TyrThr: 2.04 ± 0.063
2.388TyrVal: 2.388 ± 0.066
0.417TyrTrp: 0.417 ± 0.025
1.678TyrTyr: 1.678 ± 0.068
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2217 proteins (690053 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski