Amino acid dipeptide frequency for Balneolaceae bacterium YR4-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
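As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping adjacent residue pairs within each protein and normalizes by the total number of pairs. The function name `dipeptide_per_mille`, the one-letter residue codes, and the toy sequences are assumptions made for this example; the exact counting and normalization used by Proteome-pI may differ.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs (0,1), (1,2), ... counted within one protein only,
        # so no pair spans the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up sequences in one-letter code, not taken from the YR4-1 proteome.
toy_proteome = ["MKLLA", "MLLK"]
freqs = dipeptide_per_mille(toy_proteome)
```

Here `freqs["LL"]` is the share of Leu-Leu pairs among all seven pairs in the toy input, expressed in per mille like the table values.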

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
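The comparison against the uniform expectation can be sketched as follows. The threshold of 1/400 comes from the text; the `classify` helper and the strict greater-than or less-than rule are assumptions for illustration, since the page does not state whether the error margin or the 441-pair alphabet enters the colouring.

```python
STANDARD_AMINO_ACIDS = 20

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
expected_per_mille = 1000.0 / (STANDARD_AMINO_ACIDS * STANDARD_AMINO_ACIDS)


def classify(observed_per_mille, expected=expected_per_mille):
    """Hypothetical colouring rule: red above the expectation, blue below it."""
    if observed_per_mille > expected:
        return "red"
    if observed_per_mille < expected:
        return "blue"
    return "neutral"


# A few values copied from the table (per mille).
observed = {"LeuLeu": 9.231, "AlaAla": 5.147, "CysCys": 0.094, "TrpTrp": 0.212}
colours = {pair: classify(value) for pair, value in observed.items()}
```

Under this rule LeuLeu and AlaAla come out as overrepresented, while CysCys and TrpTrp fall well below the 2.5‰ expectation.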

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
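A minimal sketch of the unit conversion, using the LeuLeu value from the table as input; the helper names are made up for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


leu_leu = 9.231  # LeuLeu frequency from the table, in per mille
fraction = per_mille_to_fraction(leu_leu)  # about 0.009231
percent = per_mille_to_percent(leu_leu)    # about 0.9231
```

So a table entry of 9.231 means that a little under 1% of all dipeptides in the proteome are Leu-Leu.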

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰), listed as dipeptide: value ± error and grouped by the first residue.
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.147 ± 0.074
AlaCys: 0.541 ± 0.023
AlaAsp: 3.83 ± 0.062
AlaGlu: 5.018 ± 0.072
AlaPhe: 3.226 ± 0.055
AlaGly: 5.385 ± 0.082
AlaHis: 1.242 ± 0.038
AlaIle: 4.977 ± 0.075
AlaLys: 3.497 ± 0.059
AlaLeu: 6.757 ± 0.098
AlaMet: 1.836 ± 0.043
AlaAsn: 2.629 ± 0.053
AlaPro: 2.121 ± 0.041
AlaGln: 2.391 ± 0.052
AlaArg: 3.182 ± 0.058
AlaSer: 4.248 ± 0.068
AlaThr: 3.439 ± 0.062
AlaVal: 4.547 ± 0.069
AlaTrp: 0.781 ± 0.026
AlaTyr: 2.515 ± 0.046
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.413 ± 0.021
CysCys: 0.094 ± 0.011
CysAsp: 0.377 ± 0.019
CysGlu: 0.381 ± 0.018
CysPhe: 0.3 ± 0.018
CysGly: 0.573 ± 0.023
CysHis: 0.188 ± 0.015
CysIle: 0.434 ± 0.02
CysLys: 0.306 ± 0.017
CysLeu: 0.576 ± 0.022
CysMet: 0.132 ± 0.011
CysAsn: 0.291 ± 0.015
CysPro: 0.318 ± 0.022
CysGln: 0.166 ± 0.013
CysArg: 0.289 ± 0.017
CysSer: 0.48 ± 0.019
CysThr: 0.363 ± 0.019
CysVal: 0.385 ± 0.018
CysTrp: 0.068 ± 0.008
CysTyr: 0.208 ± 0.014
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.662 ± 0.054
AspCys: 0.314 ± 0.018
AspAsp: 3.365 ± 0.065
AspGlu: 4.889 ± 0.079
AspPhe: 3.185 ± 0.056
AspGly: 4.132 ± 0.082
AspHis: 1.242 ± 0.038
AspIle: 4.497 ± 0.067
AspLys: 3.289 ± 0.063
AspLeu: 5.885 ± 0.082
AspMet: 1.35 ± 0.036
AspAsn: 2.542 ± 0.051
AspPro: 2.635 ± 0.055
AspGln: 2.083 ± 0.041
AspArg: 3.112 ± 0.051
AspSer: 3.856 ± 0.07
AspThr: 3.065 ± 0.049
AspVal: 3.557 ± 0.062
AspTrp: 0.824 ± 0.029
AspTyr: 2.559 ± 0.05
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.399 ± 0.079
GluCys: 0.324 ± 0.018
GluAsp: 4.53 ± 0.064
GluGlu: 7.243 ± 0.126
GluPhe: 2.874 ± 0.055
GluGly: 4.833 ± 0.065
GluHis: 1.54 ± 0.04
GluIle: 5.465 ± 0.062
GluLys: 5.21 ± 0.091
GluLeu: 7.352 ± 0.097
GluMet: 2.092 ± 0.045
GluAsn: 4.168 ± 0.068
GluPro: 2.494 ± 0.044
GluGln: 3.44 ± 0.063
GluArg: 3.577 ± 0.065
GluSer: 4.456 ± 0.067
GluThr: 3.87 ± 0.058
GluVal: 4.94 ± 0.076
GluTrp: 0.969 ± 0.034
GluTyr: 2.589 ± 0.049
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.919 ± 0.058
PheCys: 0.319 ± 0.017
PheAsp: 3.099 ± 0.065
PheGlu: 3.516 ± 0.063
PhePhe: 2.246 ± 0.046
PheGly: 3.543 ± 0.061
PheHis: 0.846 ± 0.03
PheIle: 3.173 ± 0.056
PheLys: 2.44 ± 0.05
PheLeu: 4.193 ± 0.078
PheMet: 1.082 ± 0.033
PheAsn: 2.366 ± 0.056
PhePro: 1.576 ± 0.041
PheGln: 1.503 ± 0.037
PheArg: 2.162 ± 0.053
PheSer: 3.689 ± 0.069
PheThr: 2.747 ± 0.049
PheVal: 2.703 ± 0.053
PheTrp: 0.681 ± 0.027
PheTyr: 1.761 ± 0.041
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.45 ± 0.066
GlyCys: 0.603 ± 0.028
GlyAsp: 3.862 ± 0.07
GlyGlu: 4.85 ± 0.073
GlyPhe: 3.507 ± 0.055
GlyGly: 5.197 ± 0.099
GlyHis: 1.308 ± 0.038
GlyIle: 5.794 ± 0.071
GlyLys: 4.125 ± 0.067
GlyLeu: 6.624 ± 0.089
GlyMet: 1.992 ± 0.053
GlyAsn: 3.433 ± 0.06
GlyPro: 1.927 ± 0.046
GlyGln: 2.166 ± 0.047
GlyArg: 2.973 ± 0.057
GlySer: 4.98 ± 0.079
GlyThr: 4.343 ± 0.074
GlyVal: 4.475 ± 0.07
GlyTrp: 1.041 ± 0.029
GlyTyr: 3.026 ± 0.055
GlyXaa: 0.001 ± 0.001

His
HisAla: 1.149 ± 0.027
HisCys: 0.171 ± 0.013
HisAsp: 1.025 ± 0.033
HisGlu: 1.252 ± 0.032
HisPhe: 1.061 ± 0.031
HisGly: 1.311 ± 0.039
HisHis: 0.547 ± 0.024
HisIle: 1.337 ± 0.031
HisLys: 0.983 ± 0.032
HisLeu: 1.857 ± 0.04
HisMet: 0.408 ± 0.02
HisAsn: 0.842 ± 0.026
HisPro: 1.116 ± 0.031
HisGln: 0.599 ± 0.021
HisArg: 1.018 ± 0.034
HisSer: 1.239 ± 0.036
HisThr: 0.959 ± 0.034
HisVal: 1.15 ± 0.039
HisTrp: 0.272 ± 0.015
HisTyr: 0.806 ± 0.03
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.125 ± 0.074
IleCys: 0.535 ± 0.022
IleAsp: 4.535 ± 0.061
IleGlu: 5.421 ± 0.078
IlePhe: 3.084 ± 0.062
IleGly: 4.959 ± 0.075
IleHis: 1.404 ± 0.039
IleIle: 4.806 ± 0.072
IleLys: 3.706 ± 0.06
IleLeu: 6.201 ± 0.099
IleMet: 1.401 ± 0.044
IleAsn: 3.475 ± 0.061
IlePro: 3.213 ± 0.054
IleGln: 2.406 ± 0.045
IleArg: 3.486 ± 0.06
IleSer: 5.381 ± 0.072
IleThr: 4.212 ± 0.064
IleVal: 4.078 ± 0.059
IleTrp: 0.808 ± 0.028
IleTyr: 2.448 ± 0.048
IleXaa: 0.001 ± 0.001

Lys
LysAla: 4.082 ± 0.07
LysCys: 0.27 ± 0.015
LysAsp: 3.323 ± 0.06
LysGlu: 5.358 ± 0.09
LysPhe: 1.905 ± 0.039
LysGly: 3.491 ± 0.057
LysHis: 1.103 ± 0.031
LysIle: 3.656 ± 0.063
LysLys: 4.383 ± 0.074
LysLeu: 5.258 ± 0.082
LysMet: 1.43 ± 0.037
LysAsn: 3.013 ± 0.056
LysPro: 2.303 ± 0.048
LysGln: 2.226 ± 0.044
LysArg: 2.763 ± 0.059
LysSer: 3.599 ± 0.061
LysThr: 3.038 ± 0.054
LysVal: 3.787 ± 0.066
LysTrp: 0.602 ± 0.02
LysTyr: 2.021 ± 0.043
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.635 ± 0.09
LeuCys: 0.617 ± 0.025
LeuAsp: 5.594 ± 0.088
LeuGlu: 6.888 ± 0.103
LeuPhe: 4.667 ± 0.086
LeuGly: 6.361 ± 0.076
LeuHis: 1.673 ± 0.037
LeuIle: 6.345 ± 0.081
LeuLys: 5.923 ± 0.072
LeuLeu: 9.231 ± 0.138
LeuMet: 2.209 ± 0.045
LeuAsn: 4.849 ± 0.065
LeuPro: 3.944 ± 0.067
LeuGln: 3.623 ± 0.065
LeuArg: 4.151 ± 0.065
LeuSer: 7.078 ± 0.099
LeuThr: 5.089 ± 0.071
LeuVal: 5.758 ± 0.081
LeuTrp: 1.023 ± 0.036
LeuTyr: 3.106 ± 0.061
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.968 ± 0.042
MetCys: 0.095 ± 0.01
MetAsp: 1.506 ± 0.04
MetGlu: 1.789 ± 0.045
MetPhe: 0.76 ± 0.03
MetGly: 1.796 ± 0.045
MetHis: 0.448 ± 0.024
MetIle: 1.656 ± 0.042
MetLys: 1.747 ± 0.041
MetLeu: 2.247 ± 0.051
MetMet: 0.672 ± 0.03
MetAsn: 1.339 ± 0.033
MetPro: 1.08 ± 0.029
MetGln: 0.961 ± 0.035
MetArg: 1.051 ± 0.03
MetSer: 1.617 ± 0.038
MetThr: 1.178 ± 0.036
MetVal: 1.587 ± 0.04
MetTrp: 0.192 ± 0.012
MetTyr: 0.647 ± 0.025
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.082 ± 0.068
AsnCys: 0.299 ± 0.018
AsnAsp: 2.574 ± 0.049
AsnGlu: 3.398 ± 0.061
AsnPhe: 2.201 ± 0.056
AsnGly: 3.382 ± 0.068
AsnHis: 0.932 ± 0.03
AsnIle: 3.656 ± 0.061
AsnLys: 2.579 ± 0.051
AsnLeu: 4.629 ± 0.07
AsnMet: 1.26 ± 0.033
AsnAsn: 2.583 ± 0.056
AsnPro: 2.618 ± 0.053
AsnGln: 1.766 ± 0.045
AsnArg: 2.697 ± 0.053
AsnSer: 3.124 ± 0.053
AsnThr: 2.554 ± 0.05
AsnVal: 2.791 ± 0.052
AsnTrp: 0.659 ± 0.027
AsnTyr: 2.06 ± 0.046
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.475 ± 0.051
ProCys: 0.187 ± 0.013
ProAsp: 3.162 ± 0.061
ProGlu: 3.806 ± 0.06
ProPhe: 1.987 ± 0.04
ProGly: 2.935 ± 0.056
ProHis: 0.723 ± 0.028
ProIle: 2.595 ± 0.048
ProLys: 1.968 ± 0.041
ProLeu: 3.322 ± 0.059
ProMet: 0.809 ± 0.028
ProAsn: 1.802 ± 0.038
ProPro: 1.297 ± 0.047
ProGln: 1.258 ± 0.035
ProArg: 1.355 ± 0.036
ProSer: 2.384 ± 0.045
ProThr: 1.858 ± 0.04
ProVal: 2.967 ± 0.052
ProTrp: 0.458 ± 0.021
ProTyr: 1.469 ± 0.036
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.532 ± 0.056
GlnCys: 0.167 ± 0.012
GlnAsp: 1.947 ± 0.039
GlnGlu: 2.834 ± 0.057
GlnPhe: 1.592 ± 0.041
GlnGly: 2.09 ± 0.043
GlnHis: 0.639 ± 0.022
GlnIle: 2.365 ± 0.047
GlnLys: 2.388 ± 0.048
GlnLeu: 3.45 ± 0.066
GlnMet: 0.937 ± 0.036
GlnAsn: 2.04 ± 0.05
GlnPro: 1.394 ± 0.038
GlnGln: 1.965 ± 0.059
GlnArg: 1.779 ± 0.045
GlnSer: 2.409 ± 0.052
GlnThr: 1.911 ± 0.045
GlnVal: 2.138 ± 0.043
GlnTrp: 0.446 ± 0.021
GlnTyr: 1.282 ± 0.036
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.873 ± 0.052
ArgCys: 0.259 ± 0.017
ArgAsp: 2.706 ± 0.056
ArgGlu: 3.82 ± 0.059
ArgPhe: 2.571 ± 0.051
ArgGly: 2.69 ± 0.053
ArgHis: 0.886 ± 0.03
ArgIle: 3.461 ± 0.057
ArgLys: 3.017 ± 0.061
ArgLeu: 4.388 ± 0.074
ArgMet: 1.195 ± 0.034
ArgAsn: 2.49 ± 0.049
ArgPro: 1.593 ± 0.034
ArgGln: 1.779 ± 0.04
ArgArg: 2.236 ± 0.048
ArgSer: 3.029 ± 0.057
ArgThr: 2.313 ± 0.049
ArgVal: 2.962 ± 0.064
ArgTrp: 0.607 ± 0.024
ArgTyr: 2.073 ± 0.049
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.193 ± 0.057
SerCys: 0.47 ± 0.022
SerAsp: 4.14 ± 0.069
SerGlu: 4.913 ± 0.075
SerPhe: 3.344 ± 0.065
SerGly: 5.49 ± 0.081
SerHis: 1.178 ± 0.034
SerIle: 4.879 ± 0.076
SerLys: 3.537 ± 0.058
SerLeu: 6.557 ± 0.083
SerMet: 1.66 ± 0.038
SerAsn: 3.102 ± 0.056
SerPro: 2.357 ± 0.049
SerGln: 2.128 ± 0.044
SerArg: 3.153 ± 0.055
SerSer: 4.702 ± 0.077
SerThr: 3.616 ± 0.06
SerVal: 4.421 ± 0.062
SerTrp: 0.898 ± 0.029
SerTyr: 2.701 ± 0.054
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.682 ± 0.064
ThrCys: 0.31 ± 0.018
ThrAsp: 3.454 ± 0.061
ThrGlu: 3.747 ± 0.057
ThrPhe: 2.722 ± 0.055
ThrGly: 4.782 ± 0.071
ThrHis: 0.968 ± 0.034
ThrIle: 3.917 ± 0.065
ThrLys: 2.414 ± 0.052
ThrLeu: 5.341 ± 0.068
ThrMet: 1.173 ± 0.034
ThrAsn: 2.281 ± 0.047
ThrPro: 2.252 ± 0.047
ThrGln: 1.716 ± 0.045
ThrArg: 2.202 ± 0.046
ThrSer: 3.278 ± 0.057
ThrThr: 2.87 ± 0.054
ThrVal: 3.955 ± 0.061
ThrTrp: 0.643 ± 0.029
ThrTyr: 2.081 ± 0.042
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.415 ± 0.07
ValCys: 0.407 ± 0.023
ValAsp: 3.925 ± 0.06
ValGlu: 4.698 ± 0.069
ValPhe: 2.896 ± 0.053
ValGly: 4.209 ± 0.065
ValHis: 1.138 ± 0.031
ValIle: 4.622 ± 0.072
ValLys: 3.387 ± 0.057
ValLeu: 5.929 ± 0.079
ValMet: 1.563 ± 0.041
ValAsn: 3.1 ± 0.06
ValPro: 2.621 ± 0.049
ValGln: 2.16 ± 0.045
ValArg: 2.864 ± 0.059
ValSer: 4.443 ± 0.072
ValThr: 3.809 ± 0.059
ValVal: 4.051 ± 0.075
ValTrp: 0.701 ± 0.026
ValTyr: 2.167 ± 0.048
ValXaa: 0.001 ± 0.001

Trp
TrpAla: 0.777 ± 0.027
TrpCys: 0.092 ± 0.009
TrpAsp: 0.744 ± 0.029
TrpGlu: 0.837 ± 0.031
TrpPhe: 0.583 ± 0.024
TrpGly: 0.836 ± 0.03
TrpHis: 0.301 ± 0.018
TrpIle: 0.893 ± 0.033
TrpLys: 0.788 ± 0.033
TrpLeu: 1.217 ± 0.039
TrpMet: 0.366 ± 0.018
TrpAsn: 0.641 ± 0.023
TrpPro: 0.362 ± 0.019
TrpGln: 0.51 ± 0.023
TrpArg: 0.546 ± 0.022
TrpSer: 0.831 ± 0.03
TrpThr: 0.593 ± 0.027
TrpVal: 0.819 ± 0.034
TrpTrp: 0.212 ± 0.014
TrpTyr: 0.466 ± 0.022
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.33 ± 0.042
TyrCys: 0.267 ± 0.016
TyrAsp: 2.344 ± 0.05
TyrGlu: 2.706 ± 0.053
TyrPhe: 1.854 ± 0.045
TyrGly: 2.682 ± 0.048
TyrHis: 0.798 ± 0.028
TyrIle: 2.155 ± 0.044
TyrLys: 1.922 ± 0.044
TyrLeu: 3.75 ± 0.068
TyrMet: 0.769 ± 0.031
TyrAsn: 1.859 ± 0.053
TyrPro: 1.594 ± 0.035
TyrGln: 1.457 ± 0.034
TyrArg: 2.352 ± 0.055
TyrSer: 2.618 ± 0.05
TyrThr: 1.96 ± 0.041
TyrVal: 2.036 ± 0.043
TyrTrp: 0.53 ± 0.024
TyrTyr: 1.552 ± 0.043
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.001 ± 0.001
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.001 ± 0.001
XaaArg: 0.001 ± 0.001
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3201 proteins (1117832 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
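A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates serves as the error. The function names, the fixed seed, the use of the standard deviation as the reported error, and the toy sequences are all assumptions for this example; the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    # Precompute (hits, total pairs) per protein so each resample is cheap.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[pair], sum(counts.values())))
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Made-up sequences in one-letter code, not taken from the YR4-1 proteome.
toy = ["MKLLA", "MLLK", "AKLLLG", "GGAK"]
mean, err = bootstrap_error(toy, "LL")
```

Resampling at the protein level keeps the residues of each protein together, so correlations within a protein are reflected in the error estimate.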

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski