Amino acid dipeptide frequency for Fibrobacter sp. UWB8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
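As a minimal sketch of how such frequencies could be computed (not necessarily how Proteome-pI does it), the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical. One-letter codes are used, and it is assumed that anything outside the 20 standard residues maps to X (Xaa).

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Pairs never span two proteins; non-standard residues become 'X'.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical): pairs are MA, AA, AL and AA, AK.
freqs = dipeptide_frequencies(["MAAL", "AAK"])
```

Counting overlapping pairs means a protein of length n contributes n - 1 dipeptides, so the frequencies of all observed pairs sum to 1000‰.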

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
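The uniform expectation and the over/under labelling can be illustrated with a short Python sketch. It assumes that the colouring compares each value with the uniform 1/400 baseline, which the page implies but does not state explicitly. The helper `classify` is hypothetical, and the three sample values are taken from the table on this page.

```python
N_STANDARD = 20

# Uniform expectation: each of the 20 * 20 dipeptides is equally likely.
expected_permille = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille = 0.25 %

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"    # marked in red on the page
    if observed_permille < expected:
        return "under"   # marked in blue on the page
    return "expected"

# Values in per mille, copied from the table.
samples = {"AlaAla": 8.12, "CysCys": 0.216, "LeuLeu": 7.663}
labels = {name: classify(value) for name, value in samples.items()}
```

With 21 symbols (including Xaa) the baseline would instead be 1000/441, roughly 2.27‰; which of the two the page uses for colouring is not stated.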

All values are given in per mille (‰); multiply by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 8.12 ± 0.14 | 1.335 ± 0.038 | 4.901 ± 0.068 | 5.973 ± 0.097 | 3.785 ± 0.072 | 6.427 ± 0.089 | 1.337 ± 0.032 | 4.859 ± 0.069 | 5.888 ± 0.094 | 7.949 ± 0.108 | 2.328 ± 0.052 | 3.428 ± 0.055 | 3.212 ± 0.08 | 2.63 ± 0.062 | 3.433 ± 0.061 | 5.065 ± 0.075 | 4.359 ± 0.07 | 5.88 ± 0.091 | 0.929 ± 0.033 | 2.868 ± 0.058 | 0.0 ± 0.0 |
| Cys | 1.111 ± 0.036 | 0.216 ± 0.016 | 0.93 ± 0.038 | 0.848 ± 0.032 | 0.57 ± 0.023 | 1.188 ± 0.04 | 0.247 ± 0.017 | 0.799 ± 0.027 | 0.801 ± 0.031 | 0.987 ± 0.031 | 0.28 ± 0.017 | 0.578 ± 0.025 | 0.693 ± 0.028 | 0.282 ± 0.018 | 0.542 ± 0.025 | 1.015 ± 0.04 | 0.688 ± 0.029 | 0.953 ± 0.036 | 0.127 ± 0.011 | 0.542 ± 0.023 | 0.0 ± 0.0 |
| Asp | 4.828 ± 0.072 | 0.775 ± 0.028 | 3.489 ± 0.071 | 4.114 ± 0.07 | 3.096 ± 0.059 | 4.865 ± 0.083 | 0.914 ± 0.036 | 3.66 ± 0.057 | 3.663 ± 0.065 | 4.865 ± 0.066 | 1.561 ± 0.037 | 2.515 ± 0.054 | 2.404 ± 0.064 | 1.245 ± 0.038 | 2.39 ± 0.05 | 5.19 ± 0.122 | 3.226 ± 0.073 | 3.884 ± 0.061 | 0.858 ± 0.032 | 2.675 ± 0.054 | 0.0 ± 0.0 |
| Glu | 4.94 ± 0.085 | 0.793 ± 0.031 | 3.525 ± 0.066 | 4.337 ± 0.081 | 2.911 ± 0.059 | 4.309 ± 0.069 | 1.197 ± 0.04 | 4.41 ± 0.063 | 5.245 ± 0.081 | 5.552 ± 0.086 | 1.887 ± 0.043 | 3.9 ± 0.057 | 2.195 ± 0.056 | 2.104 ± 0.043 | 3.108 ± 0.06 | 4.228 ± 0.062 | 3.543 ± 0.065 | 3.743 ± 0.063 | 0.95 ± 0.032 | 2.488 ± 0.057 | 0.0 ± 0.0 |
| Phe | 4.238 ± 0.07 | 0.724 ± 0.028 | 3.113 ± 0.054 | 2.988 ± 0.052 | 2.308 ± 0.057 | 3.567 ± 0.065 | 0.783 ± 0.027 | 2.378 ± 0.059 | 2.877 ± 0.054 | 3.838 ± 0.074 | 1.162 ± 0.032 | 2.018 ± 0.042 | 1.647 ± 0.046 | 1.111 ± 0.034 | 1.969 ± 0.04 | 3.096 ± 0.053 | 2.485 ± 0.058 | 3.439 ± 0.056 | 0.682 ± 0.029 | 1.818 ± 0.042 | 0.0 ± 0.0 |
| Gly | 5.551 ± 0.09 | 1.066 ± 0.042 | 4.363 ± 0.082 | 4.607 ± 0.07 | 3.672 ± 0.06 | 5.324 ± 0.109 | 1.352 ± 0.038 | 4.872 ± 0.08 | 5.591 ± 0.084 | 6.014 ± 0.087 | 1.993 ± 0.041 | 3.534 ± 0.076 | 1.692 ± 0.046 | 1.792 ± 0.045 | 3.109 ± 0.062 | 4.628 ± 0.085 | 4.151 ± 0.077 | 5.236 ± 0.083 | 1.061 ± 0.042 | 2.942 ± 0.063 | 0.0 ± 0.0 |
| His | 1.305 ± 0.036 | 0.286 ± 0.018 | 0.914 ± 0.027 | 1.029 ± 0.028 | 1.037 ± 0.031 | 1.219 ± 0.034 | 0.473 ± 0.023 | 1.108 ± 0.035 | 1.039 ± 0.033 | 1.618 ± 0.04 | 0.356 ± 0.02 | 0.748 ± 0.03 | 0.96 ± 0.036 | 0.542 ± 0.023 | 0.805 ± 0.028 | 1.037 ± 0.032 | 0.809 ± 0.029 | 1.172 ± 0.032 | 0.291 ± 0.016 | 0.814 ± 0.027 | 0.0 ± 0.0 |
| Ile | 5.616 ± 0.081 | 0.876 ± 0.033 | 3.911 ± 0.062 | 3.826 ± 0.064 | 2.501 ± 0.053 | 4.065 ± 0.079 | 1.128 ± 0.035 | 3.036 ± 0.069 | 3.553 ± 0.069 | 5.231 ± 0.082 | 1.205 ± 0.037 | 2.263 ± 0.045 | 3.009 ± 0.057 | 1.893 ± 0.042 | 2.914 ± 0.053 | 3.908 ± 0.069 | 3.04 ± 0.055 | 4.334 ± 0.075 | 0.638 ± 0.029 | 2.121 ± 0.047 | 0.0 ± 0.0 |
| Lys | 5.698 ± 0.089 | 0.719 ± 0.03 | 4.29 ± 0.073 | 4.39 ± 0.075 | 2.808 ± 0.05 | 4.363 ± 0.066 | 1.046 ± 0.032 | 4.353 ± 0.068 | 5.444 ± 0.104 | 5.416 ± 0.079 | 2.128 ± 0.043 | 3.749 ± 0.067 | 2.43 ± 0.062 | 2.095 ± 0.05 | 2.88 ± 0.054 | 4.11 ± 0.072 | 3.693 ± 0.063 | 4.578 ± 0.071 | 0.781 ± 0.028 | 2.347 ± 0.051 | 0.0 ± 0.0 |
| Leu | 7.534 ± 0.096 | 1.221 ± 0.032 | 5.459 ± 0.082 | 5.432 ± 0.073 | 4.07 ± 0.069 | 6.039 ± 0.079 | 1.534 ± 0.04 | 4.325 ± 0.079 | 6.047 ± 0.079 | 7.663 ± 0.115 | 2.165 ± 0.045 | 4.005 ± 0.06 | 3.874 ± 0.064 | 2.935 ± 0.057 | 4.217 ± 0.078 | 5.889 ± 0.09 | 4.637 ± 0.069 | 5.765 ± 0.081 | 1.057 ± 0.036 | 3.123 ± 0.058 | 0.0 ± 0.0 |
| Met | 2.358 ± 0.054 | 0.215 ± 0.014 | 1.623 ± 0.039 | 1.618 ± 0.04 | 1.043 ± 0.034 | 1.962 ± 0.044 | 0.437 ± 0.021 | 1.294 ± 0.034 | 1.923 ± 0.036 | 2.303 ± 0.045 | 0.705 ± 0.028 | 1.361 ± 0.041 | 1.17 ± 0.034 | 0.956 ± 0.029 | 1.255 ± 0.036 | 1.619 ± 0.037 | 1.469 ± 0.035 | 1.804 ± 0.042 | 0.258 ± 0.015 | 0.692 ± 0.028 | 0.0 ± 0.0 |
| Asn | 3.954 ± 0.059 | 0.592 ± 0.026 | 2.647 ± 0.054 | 2.773 ± 0.049 | 2.143 ± 0.053 | 3.954 ± 0.075 | 0.875 ± 0.031 | 2.799 ± 0.05 | 2.701 ± 0.056 | 4.037 ± 0.057 | 1.198 ± 0.031 | 1.974 ± 0.055 | 2.321 ± 0.054 | 1.3 ± 0.038 | 2.134 ± 0.043 | 2.828 ± 0.061 | 2.158 ± 0.052 | 3.238 ± 0.067 | 0.674 ± 0.025 | 1.871 ± 0.048 | 0.0 ± 0.0 |
| Pro | 3.667 ± 0.097 | 0.449 ± 0.02 | 2.564 ± 0.065 | 3.497 ± 0.067 | 1.783 ± 0.043 | 2.545 ± 0.063 | 0.683 ± 0.027 | 2.096 ± 0.046 | 2.482 ± 0.047 | 3.24 ± 0.055 | 0.986 ± 0.031 | 1.692 ± 0.044 | 1.12 ± 0.042 | 1.271 ± 0.037 | 1.44 ± 0.037 | 2.268 ± 0.051 | 2.014 ± 0.05 | 3.187 ± 0.061 | 0.516 ± 0.022 | 1.386 ± 0.038 | 0.0 ± 0.0 |
| Gln | 2.366 ± 0.056 | 0.29 ± 0.015 | 1.496 ± 0.041 | 1.934 ± 0.051 | 1.333 ± 0.038 | 2.131 ± 0.045 | 0.443 ± 0.02 | 1.97 ± 0.043 | 2.373 ± 0.05 | 2.322 ± 0.049 | 0.979 ± 0.03 | 1.604 ± 0.04 | 0.998 ± 0.033 | 1.002 ± 0.033 | 1.172 ± 0.032 | 1.728 ± 0.041 | 1.573 ± 0.041 | 2.119 ± 0.048 | 0.405 ± 0.021 | 1.07 ± 0.036 | 0.0 ± 0.0 |
| Arg | 3.094 ± 0.053 | 0.632 ± 0.026 | 2.709 ± 0.051 | 3.273 ± 0.067 | 2.221 ± 0.048 | 2.774 ± 0.056 | 0.851 ± 0.028 | 3.104 ± 0.068 | 2.956 ± 0.059 | 4.133 ± 0.069 | 1.274 ± 0.032 | 2.131 ± 0.044 | 1.55 ± 0.044 | 1.289 ± 0.034 | 2.158 ± 0.059 | 2.442 ± 0.054 | 2.203 ± 0.047 | 2.973 ± 0.058 | 0.609 ± 0.025 | 1.991 ± 0.04 | 0.0 ± 0.0 |
| Ser | 5.671 ± 0.076 | 0.819 ± 0.03 | 3.819 ± 0.065 | 4.086 ± 0.061 | 3.065 ± 0.058 | 5.084 ± 0.094 | 1.016 ± 0.03 | 4.076 ± 0.075 | 3.842 ± 0.064 | 6.051 ± 0.088 | 1.582 ± 0.036 | 2.625 ± 0.061 | 2.245 ± 0.048 | 1.692 ± 0.043 | 2.794 ± 0.052 | 6.544 ± 0.248 | 3.539 ± 0.064 | 5.119 ± 0.079 | 0.949 ± 0.036 | 2.348 ± 0.065 | 0.0 ± 0.0 |
| Thr | 4.557 ± 0.079 | 0.616 ± 0.027 | 3.131 ± 0.056 | 3.114 ± 0.063 | 2.434 ± 0.049 | 4.143 ± 0.078 | 0.952 ± 0.033 | 3.433 ± 0.063 | 2.866 ± 0.057 | 5.51 ± 0.084 | 1.215 ± 0.035 | 2.066 ± 0.052 | 2.659 ± 0.055 | 1.43 ± 0.038 | 2.1 ± 0.047 | 3.168 ± 0.061 | 3.095 ± 0.067 | 4.263 ± 0.081 | 0.704 ± 0.028 | 2.046 ± 0.051 | 0.0 ± 0.0 |
| Val | 6.208 ± 0.078 | 1.081 ± 0.039 | 4.169 ± 0.073 | 4.559 ± 0.065 | 3.11 ± 0.06 | 4.808 ± 0.106 | 1.197 ± 0.031 | 3.789 ± 0.069 | 4.707 ± 0.076 | 6.052 ± 0.087 | 1.689 ± 0.042 | 3.259 ± 0.06 | 2.896 ± 0.06 | 2.127 ± 0.046 | 3.311 ± 0.059 | 4.796 ± 0.073 | 3.809 ± 0.079 | 5.194 ± 0.093 | 0.867 ± 0.03 | 2.592 ± 0.06 | 0.0 ± 0.0 |
| Trp | 0.874 ± 0.034 | 0.159 ± 0.013 | 0.768 ± 0.03 | 0.741 ± 0.024 | 0.602 ± 0.026 | 0.887 ± 0.028 | 0.347 ± 0.017 | 0.78 ± 0.031 | 0.881 ± 0.03 | 1.088 ± 0.036 | 0.441 ± 0.021 | 0.861 ± 0.033 | 0.367 ± 0.017 | 0.452 ± 0.022 | 0.591 ± 0.023 | 0.861 ± 0.028 | 0.807 ± 0.029 | 0.859 ± 0.032 | 0.179 ± 0.014 | 0.525 ± 0.023 | 0.0 ± 0.0 |
| Tyr | 3.017 ± 0.055 | 0.532 ± 0.026 | 2.395 ± 0.055 | 2.225 ± 0.055 | 1.749 ± 0.043 | 2.896 ± 0.065 | 0.748 ± 0.03 | 1.93 ± 0.041 | 2.437 ± 0.059 | 3.12 ± 0.061 | 0.93 ± 0.029 | 1.86 ± 0.053 | 1.447 ± 0.04 | 1.16 ± 0.033 | 2.047 ± 0.045 | 2.549 ± 0.054 | 2.205 ± 0.051 | 2.478 ± 0.06 | 0.534 ± 0.024 | 1.765 ± 0.052 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 2821 proteins (1063467 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
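A protein-level bootstrap of this kind might look like the Python sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the reported error is taken to be the standard deviation across resamples. The page does not say which spread statistic Proteome-pI uses. The names `pair_counts` and `bootstrap_error` and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev

def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Mean and standard deviation (per mille) of one dipeptide's frequency
    over n_boot resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not residues, so within-protein structure is kept.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)

# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MAALK", "AAKAA", "MKLLA"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
```

Resampling at the protein level rather than the residue level lets proteins with unusual composition widen the error; this may be why a repeat-prone pair such as SerSer (6.544 ± 0.248) has a larger error than other dipeptides of similar frequency.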

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski