Amino acid dipepetide frequency for Acetatifactor muris

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.058AlaAla: 8.058 ± 0.102
1.15AlaCys: 1.15 ± 0.029
4.738AlaAsp: 4.738 ± 0.059
6.568AlaGlu: 6.568 ± 0.076
3.142AlaPhe: 3.142 ± 0.05
6.967AlaGly: 6.967 ± 0.084
1.11AlaHis: 1.11 ± 0.027
4.196AlaIle: 4.196 ± 0.057
3.868AlaLys: 3.868 ± 0.053
7.107AlaLeu: 7.107 ± 0.083
2.283AlaMet: 2.283 ± 0.037
2.332AlaAsn: 2.332 ± 0.044
2.12AlaPro: 2.12 ± 0.046
2.447AlaGln: 2.447 ± 0.051
3.477AlaArg: 3.477 ± 0.046
3.647AlaSer: 3.647 ± 0.056
2.788AlaThr: 2.788 ± 0.052
6.538AlaVal: 6.538 ± 0.068
0.791AlaTrp: 0.791 ± 0.019
2.912AlaTyr: 2.912 ± 0.042
0.0AlaXaa: 0.0 ± 0.0
Cys
1.039CysAla: 1.039 ± 0.026
0.335CysCys: 0.335 ± 0.015
0.869CysAsp: 0.869 ± 0.023
0.96CysGlu: 0.96 ± 0.026
0.74CysPhe: 0.74 ± 0.023
1.675CysGly: 1.675 ± 0.038
0.36CysHis: 0.36 ± 0.014
1.135CysIle: 1.135 ± 0.03
0.763CysLys: 0.763 ± 0.021
1.397CysLeu: 1.397 ± 0.032
0.521CysMet: 0.521 ± 0.017
0.602CysAsn: 0.602 ± 0.018
0.666CysPro: 0.666 ± 0.025
0.405CysGln: 0.405 ± 0.018
1.137CysArg: 1.137 ± 0.026
0.879CysSer: 0.879 ± 0.023
0.748CysThr: 0.748 ± 0.021
1.067CysVal: 1.067 ± 0.023
0.162CysTrp: 0.162 ± 0.009
0.662CysTyr: 0.662 ± 0.024
0.0CysXaa: 0.0 ± 0.0
Asp
3.89AspAla: 3.89 ± 0.053
0.907AspCys: 0.907 ± 0.021
2.496AspAsp: 2.496 ± 0.044
4.226AspGlu: 4.226 ± 0.048
2.742AspPhe: 2.742 ± 0.041
4.647AspGly: 4.647 ± 0.061
0.695AspHis: 0.695 ± 0.019
4.206AspIle: 4.206 ± 0.058
3.116AspLys: 3.116 ± 0.052
4.181AspLeu: 4.181 ± 0.048
1.916AspMet: 1.916 ± 0.035
2.206AspAsn: 2.206 ± 0.039
1.558AspPro: 1.558 ± 0.033
1.094AspGln: 1.094 ± 0.024
3.319AspArg: 3.319 ± 0.048
3.281AspSer: 3.281 ± 0.038
3.184AspThr: 3.184 ± 0.044
3.383AspVal: 3.383 ± 0.049
0.735AspTrp: 0.735 ± 0.022
2.764AspTyr: 2.764 ± 0.042
0.0AspXaa: 0.0 ± 0.0
Glu
6.196GluAla: 6.196 ± 0.072
0.947GluCys: 0.947 ± 0.027
4.45GluAsp: 4.45 ± 0.057
8.125GluGlu: 8.125 ± 0.102
2.54GluPhe: 2.54 ± 0.038
5.001GluGly: 5.001 ± 0.061
1.35GluHis: 1.35 ± 0.026
5.561GluIle: 5.561 ± 0.06
6.819GluLys: 6.819 ± 0.073
7.11GluLeu: 7.11 ± 0.067
2.572GluMet: 2.572 ± 0.038
4.286GluAsn: 4.286 ± 0.046
2.121GluPro: 2.121 ± 0.045
3.332GluGln: 3.332 ± 0.051
4.379GluArg: 4.379 ± 0.063
3.88GluSer: 3.88 ± 0.128
3.878GluThr: 3.878 ± 0.053
4.306GluVal: 4.306 ± 0.057
0.909GluTrp: 0.909 ± 0.025
3.52GluTyr: 3.52 ± 0.055
0.0GluXaa: 0.0 ± 0.0
Phe
2.975PheAla: 2.975 ± 0.044
0.867PheCys: 0.867 ± 0.023
2.389PheAsp: 2.389 ± 0.042
2.656PheGlu: 2.656 ± 0.041
1.911PhePhe: 1.911 ± 0.039
2.965PheGly: 2.965 ± 0.053
0.86PheHis: 0.86 ± 0.022
2.456PheIle: 2.456 ± 0.039
1.577PheLys: 1.577 ± 0.029
4.25PheLeu: 4.25 ± 0.054
1.098PheMet: 1.098 ± 0.026
1.381PheAsn: 1.381 ± 0.027
1.468PhePro: 1.468 ± 0.028
1.442PheGln: 1.442 ± 0.03
2.345PheArg: 2.345 ± 0.034
3.041PheSer: 3.041 ± 0.044
2.274PheThr: 2.274 ± 0.039
2.673PheVal: 2.673 ± 0.042
0.514PheTrp: 0.514 ± 0.019
1.888PheTyr: 1.888 ± 0.033
0.0PheXaa: 0.0 ± 0.0
Gly
4.9GlyAla: 4.9 ± 0.075
1.3GlyCys: 1.3 ± 0.028
3.595GlyAsp: 3.595 ± 0.055
5.748GlyGlu: 5.748 ± 0.068
3.106GlyPhe: 3.106 ± 0.045
5.165GlyGly: 5.165 ± 0.071
1.261GlyHis: 1.261 ± 0.03
5.868GlyIle: 5.868 ± 0.061
5.372GlyLys: 5.372 ± 0.056
5.92GlyLeu: 5.92 ± 0.063
2.589GlyMet: 2.589 ± 0.036
3.447GlyAsn: 3.447 ± 0.05
1.284GlyPro: 1.284 ± 0.037
2.398GlyGln: 2.398 ± 0.038
4.251GlyArg: 4.251 ± 0.054
4.123GlySer: 4.123 ± 0.058
4.067GlyThr: 4.067 ± 0.059
4.404GlyVal: 4.404 ± 0.055
0.871GlyTrp: 0.871 ± 0.023
3.426GlyTyr: 3.426 ± 0.046
0.0GlyXaa: 0.0 ± 0.0
His
1.056HisAla: 1.056 ± 0.027
0.326HisCys: 0.326 ± 0.014
0.904HisAsp: 0.904 ± 0.027
1.049HisGlu: 1.049 ± 0.025
0.901HisPhe: 0.901 ± 0.025
1.238HisGly: 1.238 ± 0.029
0.385HisHis: 0.385 ± 0.019
1.335HisIle: 1.335 ± 0.026
0.856HisLys: 0.856 ± 0.023
1.491HisLeu: 1.491 ± 0.029
0.539HisMet: 0.539 ± 0.02
0.713HisAsn: 0.713 ± 0.022
0.846HisPro: 0.846 ± 0.026
0.521HisGln: 0.521 ± 0.017
1.05HisArg: 1.05 ± 0.026
1.015HisSer: 1.015 ± 0.021
0.92HisThr: 0.92 ± 0.023
1.0HisVal: 1.0 ± 0.027
0.217HisTrp: 0.217 ± 0.011
0.814HisTyr: 0.814 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
5.016IleAla: 5.016 ± 0.059
1.317IleCys: 1.317 ± 0.029
3.497IleAsp: 3.497 ± 0.05
4.138IleGlu: 4.138 ± 0.056
2.78IlePhe: 2.78 ± 0.046
4.238IleGly: 4.238 ± 0.058
1.275IleHis: 1.275 ± 0.026
4.156IleIle: 4.156 ± 0.061
3.199IleLys: 3.199 ± 0.048
6.982IleLeu: 6.982 ± 0.071
1.76IleMet: 1.76 ± 0.032
2.493IleAsn: 2.493 ± 0.04
3.003IlePro: 3.003 ± 0.042
2.168IleGln: 2.168 ± 0.035
4.102IleArg: 4.102 ± 0.052
4.477IleSer: 4.477 ± 0.054
3.665IleThr: 3.665 ± 0.04
4.277IleVal: 4.277 ± 0.061
0.731IleTrp: 0.731 ± 0.02
2.787IleTyr: 2.787 ± 0.045
0.0IleXaa: 0.0 ± 0.0
Lys
4.926LysAla: 4.926 ± 0.066
0.809LysCys: 0.809 ± 0.023
3.083LysAsp: 3.083 ± 0.048
5.749LysGlu: 5.749 ± 0.066
1.723LysPhe: 1.723 ± 0.033
3.967LysGly: 3.967 ± 0.054
0.906LysHis: 0.906 ± 0.025
4.053LysIle: 4.053 ± 0.054
4.977LysLys: 4.977 ± 0.068
4.942LysLeu: 4.942 ± 0.062
1.884LysMet: 1.884 ± 0.037
2.902LysAsn: 2.902 ± 0.043
1.893LysPro: 1.893 ± 0.032
2.152LysGln: 2.152 ± 0.036
3.262LysArg: 3.262 ± 0.047
3.142LysSer: 3.142 ± 0.044
3.166LysThr: 3.166 ± 0.044
3.77LysVal: 3.77 ± 0.05
0.687LysTrp: 0.687 ± 0.021
2.665LysTyr: 2.665 ± 0.045
0.0LysXaa: 0.0 ± 0.0
Leu
6.985LeuAla: 6.985 ± 0.078
1.679LeuCys: 1.679 ± 0.035
4.846LeuAsp: 4.846 ± 0.057
6.884LeuGlu: 6.884 ± 0.075
4.045LeuPhe: 4.045 ± 0.061
5.529LeuGly: 5.529 ± 0.06
1.586LeuHis: 1.586 ± 0.033
5.373LeuIle: 5.373 ± 0.061
5.693LeuLys: 5.693 ± 0.063
9.347LeuLeu: 9.347 ± 0.093
2.608LeuMet: 2.608 ± 0.037
3.772LeuAsn: 3.772 ± 0.043
3.856LeuPro: 3.856 ± 0.053
3.204LeuGln: 3.204 ± 0.049
4.51LeuArg: 4.51 ± 0.051
6.294LeuSer: 6.294 ± 0.072
5.307LeuThr: 5.307 ± 0.068
5.073LeuVal: 5.073 ± 0.06
1.011LeuTrp: 1.011 ± 0.029
3.91LeuTyr: 3.91 ± 0.056
0.0LeuXaa: 0.0 ± 0.0
Met
2.534MetAla: 2.534 ± 0.041
0.404MetCys: 0.404 ± 0.017
1.758MetAsp: 1.758 ± 0.032
2.922MetGlu: 2.922 ± 0.048
0.931MetPhe: 0.931 ± 0.021
2.08MetGly: 2.08 ± 0.036
0.432MetHis: 0.432 ± 0.015
1.878MetIle: 1.878 ± 0.042
2.271MetLys: 2.271 ± 0.039
2.854MetLeu: 2.854 ± 0.045
0.882MetMet: 0.882 ± 0.021
1.451MetAsn: 1.451 ± 0.033
1.119MetPro: 1.119 ± 0.026
1.11MetGln: 1.11 ± 0.025
1.516MetArg: 1.516 ± 0.029
1.666MetSer: 1.666 ± 0.032
1.682MetThr: 1.682 ± 0.032
1.814MetVal: 1.814 ± 0.033
0.298MetTrp: 0.298 ± 0.014
0.943MetTyr: 0.943 ± 0.023
0.0MetXaa: 0.0 ± 0.0
Asn
3.208AsnAla: 3.208 ± 0.045
0.671AsnCys: 0.671 ± 0.019
1.974AsnAsp: 1.974 ± 0.035
2.646AsnGlu: 2.646 ± 0.04
1.515AsnPhe: 1.515 ± 0.03
3.597AsnGly: 3.597 ± 0.047
0.769AsnHis: 0.769 ± 0.023
3.031AsnIle: 3.031 ± 0.048
1.949AsnLys: 1.949 ± 0.036
3.732AsnLeu: 3.732 ± 0.047
1.277AsnMet: 1.277 ± 0.028
1.703AsnAsn: 1.703 ± 0.036
1.982AsnPro: 1.982 ± 0.028
1.378AsnGln: 1.378 ± 0.033
2.461AsnArg: 2.461 ± 0.041
2.345AsnSer: 2.345 ± 0.042
2.173AsnThr: 2.173 ± 0.039
2.766AsnVal: 2.766 ± 0.044
0.447AsnTrp: 0.447 ± 0.017
1.823AsnTyr: 1.823 ± 0.035
0.0AsnXaa: 0.0 ± 0.0
Pro
2.619ProAla: 2.619 ± 0.04
0.502ProCys: 0.502 ± 0.017
2.516ProAsp: 2.516 ± 0.039
3.951ProGlu: 3.951 ± 0.059
1.548ProPhe: 1.548 ± 0.031
2.647ProGly: 2.647 ± 0.043
0.583ProHis: 0.583 ± 0.021
1.696ProIle: 1.696 ± 0.03
1.652ProLys: 1.652 ± 0.033
2.75ProLeu: 2.75 ± 0.042
0.883ProMet: 0.883 ± 0.022
1.076ProAsn: 1.076 ± 0.025
0.912ProPro: 0.912 ± 0.027
1.115ProGln: 1.115 ± 0.022
1.261ProArg: 1.261 ± 0.028
1.642ProSer: 1.642 ± 0.036
1.439ProThr: 1.439 ± 0.033
3.136ProVal: 3.136 ± 0.041
0.368ProTrp: 0.368 ± 0.015
1.48ProTyr: 1.48 ± 0.03
0.0ProXaa: 0.0 ± 0.0
Gln
2.699GlnAla: 2.699 ± 0.044
0.388GlnCys: 0.388 ± 0.014
1.616GlnAsp: 1.616 ± 0.031
3.086GlnGlu: 3.086 ± 0.046
1.193GlnPhe: 1.193 ± 0.025
2.137GlnGly: 2.137 ± 0.045
0.488GlnHis: 0.488 ± 0.017
2.408GlnIle: 2.408 ± 0.036
2.54GlnLys: 2.54 ± 0.041
2.842GlnLeu: 2.842 ± 0.041
1.17GlnMet: 1.17 ± 0.026
1.68GlnAsn: 1.68 ± 0.033
1.016GlnPro: 1.016 ± 0.028
1.246GlnGln: 1.246 ± 0.027
1.728GlnArg: 1.728 ± 0.034
1.744GlnSer: 1.744 ± 0.035
1.812GlnThr: 1.812 ± 0.029
2.08GlnVal: 2.08 ± 0.037
0.4GlnTrp: 0.4 ± 0.014
1.474GlnTyr: 1.474 ± 0.032
0.0GlnXaa: 0.0 ± 0.0
Arg
3.358ArgAla: 3.358 ± 0.048
0.707ArgCys: 0.707 ± 0.021
2.778ArgAsp: 2.778 ± 0.051
5.45ArgGlu: 5.45 ± 0.072
2.186ArgPhe: 2.186 ± 0.035
3.17ArgGly: 3.17 ± 0.045
1.002ArgHis: 1.002 ± 0.024
3.893ArgIle: 3.893 ± 0.05
4.196ArgLys: 4.196 ± 0.055
4.969ArgLeu: 4.969 ± 0.056
1.862ArgMet: 1.862 ± 0.038
2.414ArgAsn: 2.414 ± 0.04
1.638ArgPro: 1.638 ± 0.029
2.445ArgGln: 2.445 ± 0.043
3.472ArgArg: 3.472 ± 0.052
2.452ArgSer: 2.452 ± 0.04
2.736ArgThr: 2.736 ± 0.042
2.906ArgVal: 2.906 ± 0.048
0.574ArgTrp: 0.574 ± 0.019
2.364ArgTyr: 2.364 ± 0.041
0.0ArgXaa: 0.0 ± 0.0
Ser
4.404SerAla: 4.404 ± 0.059
0.897SerCys: 0.897 ± 0.023
3.153SerAsp: 3.153 ± 0.05
4.118SerGlu: 4.118 ± 0.125
2.694SerPhe: 2.694 ± 0.04
5.22SerGly: 5.22 ± 0.066
1.042SerHis: 1.042 ± 0.023
3.561SerIle: 3.561 ± 0.05
2.54SerLys: 2.54 ± 0.04
5.259SerLeu: 5.259 ± 0.061
1.769SerMet: 1.769 ± 0.033
2.041SerAsn: 2.041 ± 0.036
1.917SerPro: 1.917 ± 0.036
1.811SerGln: 1.811 ± 0.038
3.271SerArg: 3.271 ± 0.037
3.287SerSer: 3.287 ± 0.055
2.723SerThr: 2.723 ± 0.12
4.322SerVal: 4.322 ± 0.06
0.658SerTrp: 0.658 ± 0.019
2.375SerTyr: 2.375 ± 0.039
0.0SerXaa: 0.0 ± 0.0
Thr
4.639ThrAla: 4.639 ± 0.059
0.666ThrCys: 0.666 ± 0.021
3.167ThrAsp: 3.167 ± 0.045
4.277ThrGlu: 4.277 ± 0.047
2.059ThrPhe: 2.059 ± 0.035
4.823ThrGly: 4.823 ± 0.065
0.857ThrHis: 0.857 ± 0.022
3.247ThrIle: 3.247 ± 0.041
2.414ThrLys: 2.414 ± 0.035
4.723ThrLeu: 4.723 ± 0.056
1.366ThrMet: 1.366 ± 0.031
1.68ThrAsn: 1.68 ± 0.03
2.063ThrPro: 2.063 ± 0.04
1.438ThrGln: 1.438 ± 0.029
2.18ThrArg: 2.18 ± 0.043
2.662ThrSer: 2.662 ± 0.107
2.3ThrThr: 2.3 ± 0.04
4.374ThrVal: 4.374 ± 0.063
0.524ThrTrp: 0.524 ± 0.018
1.97ThrTyr: 1.97 ± 0.031
0.0ThrXaa: 0.0 ± 0.0
Val
4.165ValAla: 4.165 ± 0.06
1.274ValCys: 1.274 ± 0.026
3.479ValAsp: 3.479 ± 0.05
4.66ValGlu: 4.66 ± 0.061
2.833ValPhe: 2.833 ± 0.044
3.898ValGly: 3.898 ± 0.058
1.06ValHis: 1.06 ± 0.024
4.495ValIle: 4.495 ± 0.056
3.994ValLys: 3.994 ± 0.06
6.447ValLeu: 6.447 ± 0.072
2.015ValMet: 2.015 ± 0.04
2.787ValAsn: 2.787 ± 0.043
2.462ValPro: 2.462 ± 0.04
1.962ValGln: 1.962 ± 0.036
3.6ValArg: 3.6 ± 0.047
4.495ValSer: 4.495 ± 0.072
3.89ValThr: 3.89 ± 0.056
4.161ValVal: 4.161 ± 0.065
0.8ValTrp: 0.8 ± 0.023
2.778ValTyr: 2.778 ± 0.039
0.0ValXaa: 0.0 ± 0.0
Trp
0.66TrpAla: 0.66 ± 0.022
0.22TrpCys: 0.22 ± 0.011
0.64TrpAsp: 0.64 ± 0.02
0.93TrpGlu: 0.93 ± 0.024
0.485TrpPhe: 0.485 ± 0.017
0.826TrpGly: 0.826 ± 0.023
0.238TrpHis: 0.238 ± 0.011
0.709TrpIle: 0.709 ± 0.02
0.858TrpLys: 0.858 ± 0.025
1.114TrpLeu: 1.114 ± 0.025
0.365TrpMet: 0.365 ± 0.016
0.67TrpAsn: 0.67 ± 0.019
0.255TrpPro: 0.255 ± 0.011
0.498TrpGln: 0.498 ± 0.016
0.554TrpArg: 0.554 ± 0.017
0.54TrpSer: 0.54 ± 0.021
0.491TrpThr: 0.491 ± 0.019
0.59TrpVal: 0.59 ± 0.018
0.16TrpTrp: 0.16 ± 0.01
0.545TrpTyr: 0.545 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.914TyrAla: 2.914 ± 0.042
0.758TyrCys: 0.758 ± 0.023
2.601TyrAsp: 2.601 ± 0.044
3.138TyrGlu: 3.138 ± 0.044
1.958TyrPhe: 1.958 ± 0.032
3.305TyrGly: 3.305 ± 0.047
0.918TyrHis: 0.918 ± 0.02
2.803TyrIle: 2.803 ± 0.042
2.004TyrLys: 2.004 ± 0.037
4.093TyrLeu: 4.093 ± 0.051
1.184TyrMet: 1.184 ± 0.025
1.811TyrAsn: 1.811 ± 0.03
1.595TyrPro: 1.595 ± 0.038
1.579TyrGln: 1.579 ± 0.025
2.698TyrArg: 2.698 ± 0.041
2.449TyrSer: 2.449 ± 0.045
2.186TyrThr: 2.186 ± 0.039
2.611TyrVal: 2.611 ± 0.038
0.493TyrTrp: 0.493 ± 0.018
1.994TyrTyr: 1.994 ± 0.035
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5687 proteins (1754436 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski