Amino acid dipeptide frequency for Candidatus Ornithobacterium hominis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions (for example, 4.138‰ = 0.004138).
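The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the database: it assumes that dipeptides are counted as overlapping windows within each protein, that frequencies are normalised by the total number of dipeptides, and that the reference level for over- and underrepresentation is the uniform expectation of 2.5‰. The names `dipeptide_permille`, `classify` and `EXPECTED_PERMILLE` are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); any other symbol
# is treated as X (Xaa), the 21st "unknown" residue.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping windows of length 2, counted per protein so
    that no dipeptide spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

With these assumptions, `classify(4.138)` (the AlaAla value in the table) returns "over" and `classify(0.529)` (AlaCys) returns "under".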

For more information, see the sequence space article on Wikipedia.

Each group below is headed by the first residue; entries give the dipeptide, its frequency (‰) and the estimated error, with second residues in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.138 ± 0.114
AlaCys: 0.529 ± 0.031
AlaAsp: 3.281 ± 0.074
AlaGlu: 4.961 ± 0.113
AlaPhe: 3.039 ± 0.081
AlaGly: 4.201 ± 0.106
AlaHis: 1.255 ± 0.049
AlaIle: 4.819 ± 0.121
AlaLys: 5.112 ± 0.096
AlaLeu: 6.174 ± 0.118
AlaMet: 1.588 ± 0.052
AlaAsn: 3.248 ± 0.085
AlaPro: 1.776 ± 0.06
AlaGln: 3.124 ± 0.064
AlaArg: 2.204 ± 0.059
AlaSer: 3.612 ± 0.086
AlaThr: 3.031 ± 0.069
AlaVal: 3.881 ± 0.098
AlaTrp: 0.675 ± 0.034
AlaTyr: 2.56 ± 0.07
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.434 ± 0.03
CysCys: 0.085 ± 0.012
CysAsp: 0.299 ± 0.024
CysGlu: 0.425 ± 0.024
CysPhe: 0.382 ± 0.025
CysGly: 0.533 ± 0.033
CysHis: 0.191 ± 0.021
CysIle: 0.519 ± 0.035
CysLys: 0.375 ± 0.029
CysLeu: 0.594 ± 0.033
CysMet: 0.141 ± 0.016
CysAsn: 0.319 ± 0.026
CysPro: 0.349 ± 0.034
CysGln: 0.245 ± 0.022
CysArg: 0.257 ± 0.024
CysSer: 0.444 ± 0.029
CysThr: 0.363 ± 0.031
CysVal: 0.422 ± 0.028
CysTrp: 0.069 ± 0.012
CysTyr: 0.238 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.244 ± 0.076
AspCys: 0.319 ± 0.023
AspAsp: 2.425 ± 0.075
AspGlu: 4.463 ± 0.092
AspPhe: 4.034 ± 0.088
AspGly: 3.036 ± 0.088
AspHis: 0.677 ± 0.035
AspIle: 3.767 ± 0.093
AspLys: 4.046 ± 0.093
AspLeu: 5.749 ± 0.111
AspMet: 1.0 ± 0.042
AspAsn: 2.534 ± 0.073
AspPro: 1.557 ± 0.061
AspGln: 1.397 ± 0.05
AspArg: 1.849 ± 0.062
AspSer: 2.854 ± 0.085
AspThr: 2.168 ± 0.068
AspVal: 3.166 ± 0.083
AspTrp: 0.792 ± 0.042
AspTyr: 2.77 ± 0.071
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.445 ± 0.102
GluCys: 0.316 ± 0.023
GluAsp: 3.517 ± 0.083
GluGlu: 5.942 ± 0.145
GluPhe: 3.388 ± 0.087
GluGly: 3.515 ± 0.083
GluHis: 1.186 ± 0.049
GluIle: 7.268 ± 0.123
GluLys: 7.867 ± 0.149
GluLeu: 6.429 ± 0.121
GluMet: 1.786 ± 0.061
GluAsn: 6.223 ± 0.114
GluPro: 1.68 ± 0.063
GluGln: 2.491 ± 0.073
GluArg: 2.92 ± 0.08
GluSer: 3.367 ± 0.071
GluThr: 3.237 ± 0.078
GluVal: 4.156 ± 0.087
GluTrp: 0.698 ± 0.034
GluTyr: 2.415 ± 0.067
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.953 ± 0.075
PheCys: 0.444 ± 0.028
PheAsp: 3.149 ± 0.075
PheGlu: 3.513 ± 0.083
PhePhe: 3.076 ± 0.089
PheGly: 3.529 ± 0.082
PheHis: 1.081 ± 0.042
PheIle: 4.26 ± 0.118
PheLys: 3.678 ± 0.097
PheLeu: 5.461 ± 0.128
PheMet: 1.196 ± 0.042
PheAsn: 3.154 ± 0.086
PhePro: 1.85 ± 0.057
PheGln: 1.857 ± 0.065
PheArg: 1.817 ± 0.056
PheSer: 4.239 ± 0.097
PheThr: 2.871 ± 0.072
PheVal: 2.861 ± 0.08
PheTrp: 0.644 ± 0.034
PheTyr: 2.395 ± 0.067
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.996 ± 0.108
GlyCys: 0.493 ± 0.034
GlyAsp: 3.114 ± 0.089
GlyGlu: 4.324 ± 0.088
GlyPhe: 3.567 ± 0.084
GlyGly: 4.504 ± 0.109
GlyHis: 1.094 ± 0.047
GlyIle: 5.282 ± 0.113
GlyLys: 5.404 ± 0.103
GlyLeu: 5.707 ± 0.11
GlyMet: 1.524 ± 0.054
GlyAsn: 3.602 ± 0.108
GlyPro: 1.168 ± 0.047
GlyGln: 1.864 ± 0.059
GlyArg: 2.3 ± 0.082
GlySer: 3.637 ± 0.089
GlyThr: 3.185 ± 0.077
GlyVal: 4.055 ± 0.097
GlyTrp: 0.722 ± 0.042
GlyTyr: 2.73 ± 0.078
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.064 ± 0.044
HisCys: 0.156 ± 0.017
HisAsp: 0.816 ± 0.036
HisGlu: 1.09 ± 0.049
HisPhe: 1.255 ± 0.053
HisGly: 1.095 ± 0.042
HisHis: 0.552 ± 0.035
HisIle: 1.305 ± 0.047
HisLys: 1.111 ± 0.048
HisLeu: 2.111 ± 0.06
HisMet: 0.354 ± 0.023
HisAsn: 0.847 ± 0.043
HisPro: 0.981 ± 0.046
HisGln: 1.17 ± 0.053
HisArg: 0.809 ± 0.041
HisSer: 1.187 ± 0.046
HisThr: 0.88 ± 0.044
HisVal: 0.925 ± 0.04
HisTrp: 0.267 ± 0.02
HisTyr: 0.899 ± 0.036
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.135 ± 0.106
IleCys: 0.588 ± 0.035
IleAsp: 4.541 ± 0.094
IleGlu: 5.817 ± 0.113
IlePhe: 4.411 ± 0.091
IleGly: 4.688 ± 0.112
IleHis: 1.595 ± 0.057
IleIle: 6.159 ± 0.134
IleLys: 6.022 ± 0.106
IleLeu: 7.797 ± 0.162
IleMet: 1.323 ± 0.06
IleAsn: 4.635 ± 0.094
IlePro: 3.454 ± 0.081
IleGln: 3.69 ± 0.08
IleArg: 2.696 ± 0.073
IleSer: 5.808 ± 0.111
IleThr: 3.893 ± 0.083
IleVal: 3.968 ± 0.11
IleTrp: 0.651 ± 0.036
IleTyr: 2.998 ± 0.078
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.174 ± 0.087
LysCys: 0.345 ± 0.025
LysAsp: 4.312 ± 0.094
LysGlu: 6.693 ± 0.131
LysPhe: 3.604 ± 0.084
LysGly: 4.631 ± 0.098
LysHis: 1.295 ± 0.052
LysIle: 7.891 ± 0.126
LysLys: 8.15 ± 0.145
LysLeu: 6.527 ± 0.114
LysMet: 2.014 ± 0.061
LysAsn: 6.862 ± 0.106
LysPro: 2.59 ± 0.07
LysGln: 2.543 ± 0.067
LysArg: 2.791 ± 0.069
LysSer: 4.772 ± 0.092
LysThr: 4.199 ± 0.094
LysVal: 4.317 ± 0.092
LysTrp: 0.684 ± 0.037
LysTyr: 3.072 ± 0.075
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.266 ± 0.143
LeuCys: 0.562 ± 0.032
LeuAsp: 4.883 ± 0.091
LeuGlu: 6.579 ± 0.124
LeuPhe: 4.787 ± 0.114
LeuGly: 6.195 ± 0.14
LeuHis: 1.802 ± 0.06
LeuIle: 7.12 ± 0.131
LeuLys: 8.412 ± 0.138
LeuLeu: 8.509 ± 0.179
LeuMet: 2.185 ± 0.058
LeuAsn: 6.417 ± 0.108
LeuPro: 3.796 ± 0.075
LeuGln: 3.624 ± 0.082
LeuArg: 3.404 ± 0.072
LeuSer: 6.761 ± 0.104
LeuThr: 4.345 ± 0.092
LeuVal: 5.01 ± 0.102
LeuTrp: 0.835 ± 0.042
LeuTyr: 3.178 ± 0.069
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.562 ± 0.061
MetCys: 0.111 ± 0.013
MetAsp: 0.996 ± 0.049
MetGlu: 1.462 ± 0.056
MetPhe: 0.802 ± 0.04
MetGly: 1.432 ± 0.057
MetHis: 0.399 ± 0.028
MetIle: 1.606 ± 0.051
MetLys: 2.272 ± 0.061
MetLeu: 1.974 ± 0.061
MetMet: 0.606 ± 0.035
MetAsn: 1.458 ± 0.052
MetPro: 0.871 ± 0.039
MetGln: 0.908 ± 0.039
MetArg: 1.038 ± 0.048
MetSer: 1.269 ± 0.044
MetThr: 0.988 ± 0.038
MetVal: 1.212 ± 0.051
MetTrp: 0.191 ± 0.021
MetTyr: 0.627 ± 0.035
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.845 ± 0.092
AsnCys: 0.377 ± 0.027
AsnAsp: 2.859 ± 0.084
AsnGlu: 3.973 ± 0.088
AsnPhe: 3.866 ± 0.103
AsnGly: 3.774 ± 0.09
AsnHis: 1.331 ± 0.051
AsnIle: 5.015 ± 0.098
AsnLys: 4.31 ± 0.106
AsnLeu: 6.391 ± 0.12
AsnMet: 1.121 ± 0.044
AsnAsn: 3.479 ± 0.105
AsnPro: 2.982 ± 0.067
AsnGln: 3.47 ± 0.091
AsnArg: 2.191 ± 0.07
AsnSer: 3.571 ± 0.091
AsnThr: 2.876 ± 0.079
AsnVal: 3.218 ± 0.076
AsnTrp: 0.755 ± 0.037
AsnTyr: 2.958 ± 0.088
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.003 ± 0.072
ProCys: 0.26 ± 0.021
ProAsp: 1.842 ± 0.06
ProGlu: 2.973 ± 0.078
ProPhe: 1.833 ± 0.062
ProGly: 1.885 ± 0.064
ProHis: 0.79 ± 0.045
ProIle: 2.638 ± 0.068
ProLys: 2.736 ± 0.081
ProLeu: 3.032 ± 0.071
ProMet: 0.795 ± 0.034
ProAsn: 2.128 ± 0.059
ProPro: 0.922 ± 0.051
ProGln: 1.613 ± 0.051
ProArg: 1.128 ± 0.044
ProSer: 2.196 ± 0.067
ProThr: 1.857 ± 0.058
ProVal: 2.253 ± 0.06
ProTrp: 0.328 ± 0.025
ProTyr: 1.541 ± 0.055
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.599 ± 0.074
GlnCys: 0.22 ± 0.018
GlnAsp: 1.847 ± 0.052
GlnGlu: 3.1 ± 0.076
GlnPhe: 1.687 ± 0.054
GlnGly: 2.092 ± 0.06
GlnHis: 0.752 ± 0.038
GlnIle: 3.21 ± 0.076
GlnLys: 4.109 ± 0.097
GlnLeu: 3.671 ± 0.079
GlnMet: 1.078 ± 0.045
GlnAsn: 3.015 ± 0.077
GlnPro: 1.33 ± 0.047
GlnGln: 1.882 ± 0.073
GlnArg: 1.68 ± 0.054
GlnSer: 2.222 ± 0.069
GlnThr: 1.812 ± 0.052
GlnVal: 2.199 ± 0.059
GlnTrp: 0.469 ± 0.028
GlnTyr: 1.409 ± 0.052
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.284 ± 0.061
ArgCys: 0.222 ± 0.02
ArgAsp: 1.873 ± 0.057
ArgGlu: 2.727 ± 0.066
ArgPhe: 2.008 ± 0.061
ArgGly: 2.194 ± 0.068
ArgHis: 0.627 ± 0.038
ArgIle: 2.868 ± 0.068
ArgLys: 3.093 ± 0.073
ArgLeu: 3.586 ± 0.084
ArgMet: 0.962 ± 0.039
ArgAsn: 2.468 ± 0.064
ArgPro: 1.165 ± 0.048
ArgGln: 1.265 ± 0.051
ArgArg: 1.545 ± 0.059
ArgSer: 1.838 ± 0.054
ArgThr: 1.573 ± 0.056
ArgVal: 2.196 ± 0.061
ArgTrp: 0.434 ± 0.026
ArgTyr: 1.611 ± 0.062
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.991 ± 0.083
SerCys: 0.594 ± 0.035
SerAsp: 3.295 ± 0.086
SerGlu: 3.958 ± 0.085
SerPhe: 3.626 ± 0.085
SerGly: 4.277 ± 0.099
SerHis: 1.118 ± 0.043
SerIle: 4.966 ± 0.105
SerLys: 4.515 ± 0.093
SerLeu: 6.218 ± 0.132
SerMet: 1.153 ± 0.052
SerAsn: 3.005 ± 0.076
SerPro: 2.328 ± 0.071
SerGln: 2.421 ± 0.068
SerArg: 2.062 ± 0.067
SerSer: 4.031 ± 0.098
SerThr: 3.104 ± 0.078
SerVal: 3.788 ± 0.078
SerTrp: 0.686 ± 0.039
SerTyr: 2.572 ± 0.073
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.177 ± 0.074
ThrCys: 0.304 ± 0.024
ThrAsp: 2.609 ± 0.069
ThrGlu: 3.418 ± 0.074
ThrPhe: 2.388 ± 0.066
ThrGly: 3.576 ± 0.078
ThrHis: 1.01 ± 0.044
ThrIle: 3.426 ± 0.074
ThrLys: 3.308 ± 0.064
ThrLeu: 4.64 ± 0.093
ThrMet: 0.831 ± 0.034
ThrAsn: 2.597 ± 0.075
ThrPro: 2.328 ± 0.064
ThrGln: 2.201 ± 0.065
ThrArg: 1.64 ± 0.052
ThrSer: 3.079 ± 0.07
ThrThr: 2.493 ± 0.067
ThrVal: 2.562 ± 0.073
ThrTrp: 0.545 ± 0.032
ThrTyr: 2.022 ± 0.059
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.748 ± 0.108
ValCys: 0.398 ± 0.026
ValAsp: 3.361 ± 0.092
ValGlu: 4.162 ± 0.1
ValPhe: 3.097 ± 0.083
ValGly: 3.786 ± 0.086
ValHis: 0.944 ± 0.045
ValIle: 4.383 ± 0.088
ValLys: 4.249 ± 0.078
ValLeu: 5.136 ± 0.101
ValMet: 1.199 ± 0.045
ValAsn: 3.274 ± 0.084
ValPro: 1.979 ± 0.071
ValGln: 1.949 ± 0.055
ValArg: 2.05 ± 0.054
ValSer: 3.673 ± 0.088
ValThr: 2.694 ± 0.067
ValVal: 3.583 ± 0.1
ValTrp: 0.514 ± 0.031
ValTyr: 2.199 ± 0.061
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.703 ± 0.035
TrpCys: 0.099 ± 0.014
TrpAsp: 0.608 ± 0.028
TrpGlu: 0.765 ± 0.034
TrpPhe: 0.575 ± 0.03
TrpGly: 0.684 ± 0.037
TrpHis: 0.203 ± 0.021
TrpIle: 0.651 ± 0.04
TrpLys: 0.771 ± 0.037
TrpLeu: 0.993 ± 0.044
TrpMet: 0.273 ± 0.021
TrpAsn: 0.668 ± 0.033
TrpPro: 0.241 ± 0.022
TrpGln: 0.545 ± 0.031
TrpArg: 0.46 ± 0.034
TrpSer: 0.554 ± 0.033
TrpThr: 0.455 ± 0.028
TrpVal: 0.776 ± 0.039
TrpTrp: 0.163 ± 0.017
TrpTyr: 0.429 ± 0.03
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.449 ± 0.071
TyrCys: 0.259 ± 0.021
TyrAsp: 2.224 ± 0.075
TyrGlu: 2.5 ± 0.072
TyrPhe: 2.604 ± 0.087
TyrGly: 2.645 ± 0.072
TyrHis: 0.974 ± 0.046
TyrIle: 2.717 ± 0.069
TyrLys: 2.927 ± 0.086
TyrLeu: 3.972 ± 0.087
TyrMet: 0.653 ± 0.033
TyrAsn: 2.428 ± 0.082
TyrPro: 1.408 ± 0.052
TyrGln: 2.121 ± 0.073
TyrArg: 1.698 ± 0.051
TyrSer: 2.592 ± 0.077
TyrThr: 2.133 ± 0.066
TyrVal: 1.831 ± 0.058
TyrTrp: 0.493 ± 0.033
TyrTyr: 1.941 ± 0.078
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1744 proteins (576101 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
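One way such an error estimate could be obtained is sketched below. It is a minimal illustration, not the database's actual procedure: whole proteins are resampled with replacement (the "protein level"), the dipeptide frequency is recomputed for each of 100 resamples, and the spread of those values is reported. Taking the sample standard deviation as the error, counting overlapping dipeptides, and the function name `bootstrap_error` with its parameters are all assumptions.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Proteins, not individual residues, are the resampling unit.
    Assumption: the error is the standard deviation over the resamples.
    """
    rng = random.Random(seed)
    # Count overlapping dipeptides once per protein, then reuse the counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(counts[dipeptide] for counts in sample)
        total = sum(sum(counts.values()) for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)
```

Resampling whole proteins keeps the dipeptides of one sequence together, so the estimate reflects variation between proteins rather than treating every dipeptide as an independent observation.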

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski