Amino acid dipepetide frequency for Ilyobacter polytropus (strain ATCC 51220 / DSM 2926 / LMG 16218 / CuHBu1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.403AlaAla: 4.403 ± 0.1
0.698AlaCys: 0.698 ± 0.033
2.783AlaAsp: 2.783 ± 0.06
4.027AlaGlu: 4.027 ± 0.071
2.676AlaPhe: 2.676 ± 0.055
4.684AlaGly: 4.684 ± 0.092
0.891AlaHis: 0.891 ± 0.033
5.078AlaIle: 5.078 ± 0.082
4.58AlaLys: 4.58 ± 0.083
5.778AlaLeu: 5.778 ± 0.101
1.85AlaMet: 1.85 ± 0.05
2.08AlaAsn: 2.08 ± 0.051
1.503AlaPro: 1.503 ± 0.047
1.324AlaGln: 1.324 ± 0.036
2.143AlaArg: 2.143 ± 0.047
3.454AlaSer: 3.454 ± 0.062
2.867AlaThr: 2.867 ± 0.065
4.718AlaVal: 4.718 ± 0.085
0.386AlaTrp: 0.386 ± 0.025
2.094AlaTyr: 2.094 ± 0.051
0.0AlaXaa: 0.0 ± 0.0
Cys
0.522CysAla: 0.522 ± 0.025
0.123CysCys: 0.123 ± 0.012
0.508CysAsp: 0.508 ± 0.025
0.659CysGlu: 0.659 ± 0.031
0.368CysPhe: 0.368 ± 0.021
1.013CysGly: 1.013 ± 0.042
0.255CysHis: 0.255 ± 0.016
0.79CysIle: 0.79 ± 0.032
0.686CysLys: 0.686 ± 0.033
0.664CysLeu: 0.664 ± 0.026
0.238CysMet: 0.238 ± 0.016
0.419CysAsn: 0.419 ± 0.02
0.481CysPro: 0.481 ± 0.027
0.236CysGln: 0.236 ± 0.017
0.372CysArg: 0.372 ± 0.022
0.646CysSer: 0.646 ± 0.029
0.505CysThr: 0.505 ± 0.025
0.583CysVal: 0.583 ± 0.027
0.056CysTrp: 0.056 ± 0.009
0.307CysTyr: 0.307 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
2.749AspAla: 2.749 ± 0.066
0.442AspCys: 0.442 ± 0.022
2.568AspAsp: 2.568 ± 0.062
4.469AspGlu: 4.469 ± 0.069
2.928AspPhe: 2.928 ± 0.057
3.606AspGly: 3.606 ± 0.073
0.806AspHis: 0.806 ± 0.032
5.785AspIle: 5.785 ± 0.08
4.877AspLys: 4.877 ± 0.073
5.294AspLeu: 5.294 ± 0.063
1.645AspMet: 1.645 ± 0.043
2.379AspAsn: 2.379 ± 0.055
1.649AspPro: 1.649 ± 0.047
0.981AspGln: 0.981 ± 0.033
2.141AspArg: 2.141 ± 0.053
3.219AspSer: 3.219 ± 0.073
2.526AspThr: 2.526 ± 0.05
3.505AspVal: 3.505 ± 0.06
0.411AspTrp: 0.411 ± 0.024
2.657AspTyr: 2.657 ± 0.054
0.0AspXaa: 0.0 ± 0.0
Glu
4.572GluAla: 4.572 ± 0.091
0.578GluCys: 0.578 ± 0.024
4.574GluAsp: 4.574 ± 0.077
7.435GluGlu: 7.435 ± 0.123
3.477GluPhe: 3.477 ± 0.062
4.572GluGly: 4.572 ± 0.083
0.979GluHis: 0.979 ± 0.033
8.121GluIle: 8.121 ± 0.1
10.271GluLys: 10.271 ± 0.146
7.24GluLeu: 7.24 ± 0.118
2.195GluMet: 2.195 ± 0.054
5.565GluAsn: 5.565 ± 0.086
1.32GluPro: 1.32 ± 0.04
1.293GluGln: 1.293 ± 0.042
2.944GluArg: 2.944 ± 0.064
3.927GluSer: 3.927 ± 0.067
3.625GluThr: 3.625 ± 0.063
5.346GluVal: 5.346 ± 0.092
0.501GluTrp: 0.501 ± 0.025
3.132GluTyr: 3.132 ± 0.067
0.0GluXaa: 0.0 ± 0.0
Phe
2.367PheAla: 2.367 ± 0.054
0.38PheCys: 0.38 ± 0.02
2.634PheAsp: 2.634 ± 0.051
3.181PheGlu: 3.181 ± 0.064
2.479PhePhe: 2.479 ± 0.065
3.392PheGly: 3.392 ± 0.078
0.673PheHis: 0.673 ± 0.026
4.356PheIle: 4.356 ± 0.092
3.72PheLys: 3.72 ± 0.078
4.49PheLeu: 4.49 ± 0.075
1.369PheMet: 1.369 ± 0.04
2.195PheAsn: 2.195 ± 0.052
1.392PhePro: 1.392 ± 0.042
1.171PheGln: 1.171 ± 0.038
1.526PheArg: 1.526 ± 0.047
3.581PheSer: 3.581 ± 0.077
2.37PheThr: 2.37 ± 0.051
2.756PheVal: 2.756 ± 0.064
0.347PheTrp: 0.347 ± 0.023
1.926PheTyr: 1.926 ± 0.054
0.0PheXaa: 0.0 ± 0.0
Gly
4.628GlyAla: 4.628 ± 0.098
0.853GlyCys: 0.853 ± 0.033
3.885GlyAsp: 3.885 ± 0.074
5.311GlyGlu: 5.311 ± 0.087
3.201GlyPhe: 3.201 ± 0.064
5.222GlyGly: 5.222 ± 0.109
1.178GlyHis: 1.178 ± 0.036
7.365GlyIle: 7.365 ± 0.119
6.364GlyLys: 6.364 ± 0.097
5.75GlyLeu: 5.75 ± 0.099
2.247GlyMet: 2.247 ± 0.057
3.069GlyAsn: 3.069 ± 0.063
1.417GlyPro: 1.417 ± 0.048
1.423GlyGln: 1.423 ± 0.042
2.475GlyArg: 2.475 ± 0.051
3.921GlySer: 3.921 ± 0.071
3.539GlyThr: 3.539 ± 0.073
5.435GlyVal: 5.435 ± 0.091
0.571GlyTrp: 0.571 ± 0.032
3.063GlyTyr: 3.063 ± 0.061
0.0GlyXaa: 0.0 ± 0.0
His
0.721HisAla: 0.721 ± 0.031
0.2HisCys: 0.2 ± 0.017
0.711HisAsp: 0.711 ± 0.028
0.986HisGlu: 0.986 ± 0.034
0.702HisPhe: 0.702 ± 0.028
1.128HisGly: 1.128 ± 0.038
0.368HisHis: 0.368 ± 0.022
1.304HisIle: 1.304 ± 0.044
1.144HisLys: 1.144 ± 0.035
1.359HisLeu: 1.359 ± 0.041
0.425HisMet: 0.425 ± 0.025
0.704HisAsn: 0.704 ± 0.029
0.745HisPro: 0.745 ± 0.03
0.378HisGln: 0.378 ± 0.023
0.622HisArg: 0.622 ± 0.028
0.945HisSer: 0.945 ± 0.033
0.743HisThr: 0.743 ± 0.028
0.816HisVal: 0.816 ± 0.031
0.113HisTrp: 0.113 ± 0.012
0.616HisTyr: 0.616 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
5.431IleAla: 5.431 ± 0.096
0.833IleCys: 0.833 ± 0.031
5.158IleAsp: 5.158 ± 0.08
7.429IleGlu: 7.429 ± 0.11
4.63IlePhe: 4.63 ± 0.092
6.193IleGly: 6.193 ± 0.091
1.228IleHis: 1.228 ± 0.036
7.893IleIle: 7.893 ± 0.114
8.418IleLys: 8.418 ± 0.122
8.968IleLeu: 8.968 ± 0.115
2.435IleMet: 2.435 ± 0.057
4.53IleAsn: 4.53 ± 0.076
3.242IlePro: 3.242 ± 0.067
1.898IleGln: 1.898 ± 0.047
3.026IleArg: 3.026 ± 0.059
6.526IleSer: 6.526 ± 0.109
4.502IleThr: 4.502 ± 0.071
5.344IleVal: 5.344 ± 0.079
0.528IleTrp: 0.528 ± 0.027
3.241IleTyr: 3.241 ± 0.065
0.0IleXaa: 0.0 ± 0.0
Lys
4.877LysAla: 4.877 ± 0.079
0.744LysCys: 0.744 ± 0.031
5.384LysAsp: 5.384 ± 0.081
9.006LysGlu: 9.006 ± 0.116
3.603LysPhe: 3.603 ± 0.067
5.549LysGly: 5.549 ± 0.088
1.095LysHis: 1.095 ± 0.04
9.195LysIle: 9.195 ± 0.119
10.211LysLys: 10.211 ± 0.128
7.802LysLeu: 7.802 ± 0.1
2.709LysMet: 2.709 ± 0.052
6.608LysAsn: 6.608 ± 0.111
2.088LysPro: 2.088 ± 0.046
1.684LysGln: 1.684 ± 0.049
3.547LysArg: 3.547 ± 0.066
5.346LysSer: 5.346 ± 0.09
4.283LysThr: 4.283 ± 0.071
5.692LysVal: 5.692 ± 0.088
0.639LysTrp: 0.639 ± 0.024
3.917LysTyr: 3.917 ± 0.062
0.0LysXaa: 0.0 ± 0.0
Leu
5.352LeuAla: 5.352 ± 0.081
0.792LeuCys: 0.792 ± 0.032
5.271LeuAsp: 5.271 ± 0.092
8.125LeuGlu: 8.125 ± 0.113
3.918LeuPhe: 3.918 ± 0.085
6.887LeuGly: 6.887 ± 0.115
1.198LeuHis: 1.198 ± 0.039
7.346LeuIle: 7.346 ± 0.104
9.297LeuLys: 9.297 ± 0.11
7.915LeuLeu: 7.915 ± 0.112
2.565LeuMet: 2.565 ± 0.058
4.63LeuAsn: 4.63 ± 0.084
2.885LeuPro: 2.885 ± 0.069
1.879LeuGln: 1.879 ± 0.049
3.265LeuArg: 3.265 ± 0.061
6.319LeuSer: 6.319 ± 0.092
4.338LeuThr: 4.338 ± 0.082
5.466LeuVal: 5.466 ± 0.089
0.585LeuTrp: 0.585 ± 0.025
3.005LeuTyr: 3.005 ± 0.064
0.0LeuXaa: 0.0 ± 0.0
Met
2.14MetAla: 2.14 ± 0.051
0.188MetCys: 0.188 ± 0.013
1.551MetAsp: 1.551 ± 0.041
2.42MetGlu: 2.42 ± 0.056
0.985MetPhe: 0.985 ± 0.037
2.318MetGly: 2.318 ± 0.06
0.359MetHis: 0.359 ± 0.022
2.246MetIle: 2.246 ± 0.055
3.083MetLys: 3.083 ± 0.058
2.296MetLeu: 2.296 ± 0.045
0.795MetMet: 0.795 ± 0.032
1.424MetAsn: 1.424 ± 0.042
0.88MetPro: 0.88 ± 0.035
0.542MetGln: 0.542 ± 0.024
1.138MetArg: 1.138 ± 0.036
1.696MetSer: 1.696 ± 0.043
1.422MetThr: 1.422 ± 0.047
1.828MetVal: 1.828 ± 0.048
0.172MetTrp: 0.172 ± 0.014
0.734MetTyr: 0.734 ± 0.028
0.0MetXaa: 0.0 ± 0.0
Asn
2.299AsnAla: 2.299 ± 0.053
0.507AsnCys: 0.507 ± 0.024
2.233AsnAsp: 2.233 ± 0.057
3.332AsnGlu: 3.332 ± 0.07
2.623AsnPhe: 2.623 ± 0.052
3.219AsnGly: 3.219 ± 0.064
0.75AsnHis: 0.75 ± 0.029
5.6AsnIle: 5.6 ± 0.088
4.602AsnLys: 4.602 ± 0.074
5.188AsnLeu: 5.188 ± 0.085
1.455AsnMet: 1.455 ± 0.042
2.672AsnAsn: 2.672 ± 0.057
2.194AsnPro: 2.194 ± 0.048
1.12AsnGln: 1.12 ± 0.035
1.951AsnArg: 1.951 ± 0.044
3.258AsnSer: 3.258 ± 0.069
2.283AsnThr: 2.283 ± 0.059
2.834AsnVal: 2.834 ± 0.053
0.393AsnTrp: 0.393 ± 0.022
2.364AsnTyr: 2.364 ± 0.057
0.0AsnXaa: 0.0 ± 0.0
Pro
1.747ProAla: 1.747 ± 0.052
0.303ProCys: 0.303 ± 0.02
1.612ProAsp: 1.612 ± 0.049
2.775ProGlu: 2.775 ± 0.06
1.475ProPhe: 1.475 ± 0.043
2.211ProGly: 2.211 ± 0.059
0.534ProHis: 0.534 ± 0.028
2.297ProIle: 2.297 ± 0.055
2.375ProLys: 2.375 ± 0.055
2.647ProLeu: 2.647 ± 0.051
0.79ProMet: 0.79 ± 0.024
1.236ProAsn: 1.236 ± 0.036
0.641ProPro: 0.641 ± 0.029
0.716ProGln: 0.716 ± 0.03
0.911ProArg: 0.911 ± 0.034
1.65ProSer: 1.65 ± 0.042
1.368ProThr: 1.368 ± 0.041
2.446ProVal: 2.446 ± 0.055
0.266ProTrp: 0.266 ± 0.017
1.234ProTyr: 1.234 ± 0.041
0.0ProXaa: 0.0 ± 0.0
Gln
1.239GlnAla: 1.239 ± 0.042
0.231GlnCys: 0.231 ± 0.016
1.049GlnAsp: 1.049 ± 0.036
1.887GlnGlu: 1.887 ± 0.052
0.801GlnPhe: 0.801 ± 0.033
1.629GlnGly: 1.629 ± 0.044
0.302GlnHis: 0.302 ± 0.019
1.787GlnIle: 1.787 ± 0.049
1.967GlnLys: 1.967 ± 0.044
1.867GlnLeu: 1.867 ± 0.048
0.727GlnMet: 0.727 ± 0.035
1.212GlnAsn: 1.212 ± 0.043
0.514GlnPro: 0.514 ± 0.026
0.464GlnGln: 0.464 ± 0.027
0.911GlnArg: 0.911 ± 0.028
1.186GlnSer: 1.186 ± 0.039
0.874GlnThr: 0.874 ± 0.031
1.405GlnVal: 1.405 ± 0.036
0.215GlnTrp: 0.215 ± 0.016
0.754GlnTyr: 0.754 ± 0.027
0.0GlnXaa: 0.0 ± 0.0
Arg
1.937ArgAla: 1.937 ± 0.058
0.388ArgCys: 0.388 ± 0.025
2.257ArgAsp: 2.257 ± 0.057
3.744ArgGlu: 3.744 ± 0.073
1.475ArgPhe: 1.475 ± 0.04
2.474ArgGly: 2.474 ± 0.058
0.545ArgHis: 0.545 ± 0.027
3.189ArgIle: 3.189 ± 0.068
3.548ArgLys: 3.548 ± 0.074
3.026ArgLeu: 3.026 ± 0.065
1.018ArgMet: 1.018 ± 0.033
1.883ArgAsn: 1.883 ± 0.049
0.919ArgPro: 0.919 ± 0.033
0.821ArgGln: 0.821 ± 0.027
1.483ArgArg: 1.483 ± 0.046
1.905ArgSer: 1.905 ± 0.049
1.528ArgThr: 1.528 ± 0.038
2.583ArgVal: 2.583 ± 0.063
0.289ArgTrp: 0.289 ± 0.019
1.546ArgTyr: 1.546 ± 0.049
0.0ArgXaa: 0.0 ± 0.0
Ser
3.304SerAla: 3.304 ± 0.07
0.584SerCys: 0.584 ± 0.03
3.336SerAsp: 3.336 ± 0.072
4.765SerGlu: 4.765 ± 0.08
3.171SerPhe: 3.171 ± 0.061
4.743SerGly: 4.743 ± 0.076
0.989SerHis: 0.989 ± 0.031
5.522SerIle: 5.522 ± 0.085
5.686SerLys: 5.686 ± 0.084
5.895SerLeu: 5.895 ± 0.095
1.619SerMet: 1.619 ± 0.047
2.749SerAsn: 2.749 ± 0.06
1.876SerPro: 1.876 ± 0.055
1.672SerGln: 1.672 ± 0.051
2.353SerArg: 2.353 ± 0.059
4.072SerSer: 4.072 ± 0.075
2.875SerThr: 2.875 ± 0.058
3.98SerVal: 3.98 ± 0.068
0.443SerTrp: 0.443 ± 0.025
2.587SerTyr: 2.587 ± 0.063
0.0SerXaa: 0.0 ± 0.0
Thr
3.181ThrAla: 3.181 ± 0.069
0.469ThrCys: 0.469 ± 0.025
2.574ThrAsp: 2.574 ± 0.055
3.43ThrGlu: 3.43 ± 0.07
2.123ThrPhe: 2.123 ± 0.047
4.43ThrGly: 4.43 ± 0.07
0.841ThrHis: 0.841 ± 0.034
3.95ThrIle: 3.95 ± 0.064
3.467ThrLys: 3.467 ± 0.057
4.681ThrLeu: 4.681 ± 0.071
1.154ThrMet: 1.154 ± 0.038
1.997ThrAsn: 1.997 ± 0.046
1.936ThrPro: 1.936 ± 0.043
0.986ThrGln: 0.986 ± 0.036
1.572ThrArg: 1.572 ± 0.05
3.048ThrSer: 3.048 ± 0.06
2.557ThrThr: 2.557 ± 0.059
3.416ThrVal: 3.416 ± 0.068
0.324ThrTrp: 0.324 ± 0.02
1.764ThrTyr: 1.764 ± 0.05
0.0ThrXaa: 0.0 ± 0.0
Val
4.281ValAla: 4.281 ± 0.071
0.669ValCys: 0.669 ± 0.029
3.786ValAsp: 3.786 ± 0.056
5.206ValGlu: 5.206 ± 0.084
3.126ValPhe: 3.126 ± 0.062
4.392ValGly: 4.392 ± 0.086
0.959ValHis: 0.959 ± 0.032
5.815ValIle: 5.815 ± 0.085
5.727ValLys: 5.727 ± 0.077
5.795ValLeu: 5.795 ± 0.09
1.779ValMet: 1.779 ± 0.049
2.971ValAsn: 2.971 ± 0.065
2.032ValPro: 2.032 ± 0.049
1.305ValGln: 1.305 ± 0.044
2.218ValArg: 2.218 ± 0.053
4.313ValSer: 4.313 ± 0.079
3.458ValThr: 3.458 ± 0.071
4.79ValVal: 4.79 ± 0.086
0.413ValTrp: 0.413 ± 0.023
2.383ValTyr: 2.383 ± 0.056
0.0ValXaa: 0.0 ± 0.0
Trp
0.379TrpAla: 0.379 ± 0.022
0.089TrpCys: 0.089 ± 0.011
0.466TrpAsp: 0.466 ± 0.025
0.614TrpGlu: 0.614 ± 0.029
0.308TrpPhe: 0.308 ± 0.02
0.568TrpGly: 0.568 ± 0.027
0.12TrpHis: 0.12 ± 0.012
0.599TrpIle: 0.599 ± 0.028
0.59TrpLys: 0.59 ± 0.031
0.54TrpLeu: 0.54 ± 0.026
0.198TrpMet: 0.198 ± 0.014
0.426TrpAsn: 0.426 ± 0.024
0.163TrpPro: 0.163 ± 0.014
0.185TrpGln: 0.185 ± 0.015
0.239TrpArg: 0.239 ± 0.018
0.457TrpSer: 0.457 ± 0.026
0.294TrpThr: 0.294 ± 0.017
0.419TrpVal: 0.419 ± 0.022
0.09TrpTrp: 0.09 ± 0.011
0.28TrpTyr: 0.28 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.888TyrAla: 1.888 ± 0.045
0.357TyrCys: 0.357 ± 0.022
2.297TyrAsp: 2.297 ± 0.056
2.866TyrGlu: 2.866 ± 0.063
2.241TyrPhe: 2.241 ± 0.053
2.688TyrGly: 2.688 ± 0.053
0.658TyrHis: 0.658 ± 0.029
3.219TyrIle: 3.219 ± 0.067
3.368TyrLys: 3.368 ± 0.066
3.84TyrLeu: 3.84 ± 0.084
1.003TyrMet: 1.003 ± 0.033
2.124TyrAsn: 2.124 ± 0.058
1.353TyrPro: 1.353 ± 0.04
0.947TyrGln: 0.947 ± 0.034
1.72TyrArg: 1.72 ± 0.046
2.753TyrSer: 2.753 ± 0.059
1.944TyrThr: 1.944 ± 0.054
2.056TyrVal: 2.056 ± 0.044
0.283TyrTrp: 0.283 ± 0.02
1.753TyrTyr: 1.753 ± 0.055
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2859 proteins (873442 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski