Amino acid dipeptide frequency for Fusobacterium sp. CAG:439

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
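As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name, the use of one-letter codes, the mapping of non-standard letters to X, and the choice not to count pairs across protein boundaries are all assumptions made for illustration; the page does not describe the exact procedure used by Proteome-pI.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Hypothetical helper: overlapping adjacent pairs are counted within each
    protein, and any non-standard letter is treated as 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # zip pairs residue i with residue i + 1, so pairs overlap.
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    for pair, freq in sorted(dipeptide_frequencies(["MKKLAA", "MKLA"]).items()):
        print(pair, round(freq, 1))
```

Dipeptides that never occur are simply absent from the returned dictionary, so a caller that needs the full 400 (or 441) entries would have to fill in zeros.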

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
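The expectation above can be checked with a few lines of arithmetic. This is a minimal sketch: the function names are hypothetical, and comparing the plain value against the uniform expectation (ignoring the ± error) is an assumption, since the page does not state the exact rule used for the colouring.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected frequency (per mille) if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected_permille:
        return "over"
    if freq_permille < expected_permille:
        return "under"
    return "as expected"


if __name__ == "__main__":
    expected = uniform_expectation_permille(20)  # 1000 / 400
    # Two values taken from the table: AlaAla and CysCys.
    for name, value in [("AlaAla", 5.289), ("CysCys", 0.245)]:
        print(name, value, classify(value, expected))
```

With 21 symbols (including Xaa) the same function gives 1000/441, roughly 2.27‰, instead of 2.5‰.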

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Values in ‰ (frequency ± error), grouped by first residue; second residues in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.289 ± 0.131
AlaCys: 1.008 ± 0.046
AlaAsp: 4.014 ± 0.086
AlaGlu: 4.864 ± 0.1
AlaPhe: 2.892 ± 0.067
AlaGly: 4.852 ± 0.099
AlaHis: 1.021 ± 0.039
AlaIle: 5.076 ± 0.109
AlaLys: 6.105 ± 0.117
AlaLeu: 6.684 ± 0.121
AlaMet: 1.877 ± 0.061
AlaAsn: 3.411 ± 0.078
AlaPro: 1.952 ± 0.061
AlaGln: 2.653 ± 0.074
AlaArg: 2.492 ± 0.073
AlaSer: 3.997 ± 0.088
AlaThr: 2.619 ± 0.082
AlaVal: 4.995 ± 0.09
AlaTrp: 0.389 ± 0.025
AlaTyr: 2.534 ± 0.07
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.985 ± 0.038
CysCys: 0.245 ± 0.021
CysAsp: 0.874 ± 0.043
CysGlu: 0.946 ± 0.041
CysPhe: 0.715 ± 0.033
CysGly: 1.23 ± 0.05
CysHis: 0.223 ± 0.017
CysIle: 0.955 ± 0.045
CysLys: 1.013 ± 0.041
CysLeu: 0.949 ± 0.039
CysMet: 0.304 ± 0.024
CysAsn: 0.634 ± 0.035
CysPro: 0.637 ± 0.04
CysGln: 0.343 ± 0.026
CysArg: 0.456 ± 0.029
CysSer: 0.869 ± 0.038
CysThr: 0.663 ± 0.034
CysVal: 0.808 ± 0.04
CysTrp: 0.112 ± 0.017
CysTyr: 0.492 ± 0.027
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.805 ± 0.076
AspCys: 0.877 ± 0.042
AspAsp: 3.363 ± 0.085
AspGlu: 4.6 ± 0.107
AspPhe: 3.251 ± 0.075
AspGly: 3.767 ± 0.091
AspHis: 0.474 ± 0.026
AspIle: 5.537 ± 0.107
AspLys: 5.679 ± 0.089
AspLeu: 5.115 ± 0.11
AspMet: 1.483 ± 0.044
AspAsn: 3.589 ± 0.088
AspPro: 1.383 ± 0.052
AspGln: 0.85 ± 0.039
AspArg: 1.874 ± 0.06
AspSer: 2.228 ± 0.056
AspThr: 2.979 ± 0.127
AspVal: 3.728 ± 0.075
AspTrp: 0.548 ± 0.03
AspTyr: 3.135 ± 0.068
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.515 ± 0.111
GluCys: 0.797 ± 0.037
GluAsp: 3.603 ± 0.083
GluGlu: 5.044 ± 0.102
GluPhe: 3.262 ± 0.076
GluGly: 3.146 ± 0.077
GluHis: 1.144 ± 0.043
GluIle: 6.233 ± 0.116
GluLys: 7.358 ± 0.127
GluLeu: 6.643 ± 0.11
GluMet: 1.832 ± 0.06
GluAsn: 5.29 ± 0.093
GluPro: 1.559 ± 0.057
GluGln: 2.509 ± 0.069
GluArg: 2.569 ± 0.071
GluSer: 2.369 ± 0.06
GluThr: 3.572 ± 0.073
GluVal: 3.89 ± 0.08
GluTrp: 0.481 ± 0.03
GluTyr: 3.032 ± 0.077
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.265 ± 0.074
PheCys: 0.613 ± 0.031
PheAsp: 3.11 ± 0.084
PheGlu: 3.252 ± 0.074
PhePhe: 1.893 ± 0.064
PheGly: 2.825 ± 0.067
PheHis: 0.654 ± 0.029
PheIle: 3.619 ± 0.098
PheLys: 3.858 ± 0.083
PheLeu: 3.766 ± 0.089
PheMet: 1.153 ± 0.043
PheAsn: 2.786 ± 0.08
PhePro: 1.255 ± 0.045
PheGln: 1.209 ± 0.044
PheArg: 1.404 ± 0.049
PheSer: 3.093 ± 0.076
PheThr: 2.481 ± 0.062
PheVal: 2.837 ± 0.071
PheTrp: 0.365 ± 0.024
PheTyr: 1.902 ± 0.059
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.224 ± 0.111
GlyCys: 1.108 ± 0.045
GlyAsp: 3.045 ± 0.068
GlyGlu: 3.605 ± 0.079
GlyPhe: 2.979 ± 0.065
GlyGly: 4.057 ± 0.132
GlyHis: 1.063 ± 0.041
GlyIle: 5.339 ± 0.101
GlyLys: 5.262 ± 0.094
GlyLeu: 5.278 ± 0.103
GlyMet: 1.631 ± 0.054
GlyAsn: 3.066 ± 0.09
GlyPro: 1.138 ± 0.047
GlyGln: 1.785 ± 0.056
GlyArg: 2.294 ± 0.069
GlySer: 3.497 ± 0.096
GlyThr: 3.519 ± 0.093
GlyVal: 3.987 ± 0.086
GlyTrp: 0.568 ± 0.034
GlyTyr: 2.564 ± 0.064
GlyXaa: 0.005 ± 0.003

His
HisAla: 0.904 ± 0.037
HisCys: 0.247 ± 0.02
HisAsp: 0.785 ± 0.038
HisGlu: 0.938 ± 0.044
HisPhe: 0.719 ± 0.029
HisGly: 0.922 ± 0.04
HisHis: 0.303 ± 0.024
HisIle: 1.256 ± 0.043
HisLys: 1.358 ± 0.05
HisLeu: 1.289 ± 0.041
HisMet: 0.301 ± 0.02
HisAsn: 0.994 ± 0.042
HisPro: 0.802 ± 0.034
HisGln: 0.487 ± 0.028
HisArg: 0.535 ± 0.03
HisSer: 0.935 ± 0.04
HisThr: 0.819 ± 0.038
HisVal: 0.755 ± 0.034
HisTrp: 0.148 ± 0.019
HisTyr: 0.655 ± 0.032
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.96 ± 0.106
IleCys: 1.066 ± 0.046
IleAsp: 5.023 ± 0.093
IleGlu: 5.733 ± 0.094
IlePhe: 3.503 ± 0.094
IleGly: 4.771 ± 0.099
IleHis: 1.069 ± 0.037
IleIle: 6.55 ± 0.12
IleLys: 7.277 ± 0.106
IleLeu: 7.246 ± 0.125
IleMet: 1.826 ± 0.054
IleAsn: 5.435 ± 0.106
IlePro: 3.062 ± 0.071
IleGln: 2.428 ± 0.057
IleArg: 2.834 ± 0.07
IleSer: 5.407 ± 0.102
IleThr: 4.611 ± 0.082
IleVal: 4.429 ± 0.09
IleTrp: 0.432 ± 0.028
IleTyr: 2.981 ± 0.067
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.699 ± 0.13
LysCys: 0.993 ± 0.044
LysAsp: 5.306 ± 0.095
LysGlu: 6.923 ± 0.116
LysPhe: 3.686 ± 0.084
LysGly: 4.393 ± 0.076
LysHis: 1.403 ± 0.046
LysIle: 7.722 ± 0.127
LysLys: 8.023 ± 0.133
LysLeu: 7.851 ± 0.105
LysMet: 2.31 ± 0.068
LysAsn: 6.509 ± 0.108
LysPro: 2.931 ± 0.078
LysGln: 3.08 ± 0.073
LysArg: 3.187 ± 0.072
LysSer: 4.989 ± 0.082
LysThr: 5.271 ± 0.097
LysVal: 4.764 ± 0.081
LysTrp: 0.587 ± 0.032
LysTyr: 4.118 ± 0.095
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.326 ± 0.114
LeuCys: 1.111 ± 0.049
LeuAsp: 4.936 ± 0.092
LeuGlu: 5.896 ± 0.111
LeuPhe: 3.752 ± 0.101
LeuGly: 5.424 ± 0.09
LeuHis: 1.458 ± 0.052
LeuIle: 7.061 ± 0.119
LeuLys: 8.556 ± 0.122
LeuLeu: 7.275 ± 0.127
LeuMet: 2.238 ± 0.063
LeuAsn: 5.88 ± 0.095
LeuPro: 3.377 ± 0.071
LeuGln: 2.871 ± 0.07
LeuArg: 3.193 ± 0.075
LeuSer: 6.139 ± 0.097
LeuThr: 5.226 ± 0.107
LeuVal: 4.546 ± 0.104
LeuTrp: 0.52 ± 0.032
LeuTyr: 3.077 ± 0.073
LeuXaa: 0.005 ± 0.003

Met
MetAla: 1.806 ± 0.054
MetCys: 0.27 ± 0.023
MetAsp: 1.33 ± 0.048
MetGlu: 1.344 ± 0.051
MetPhe: 1.116 ± 0.042
MetGly: 1.431 ± 0.052
MetHis: 0.414 ± 0.023
MetIle: 1.71 ± 0.054
MetLys: 2.218 ± 0.059
MetLeu: 2.253 ± 0.06
MetMet: 0.651 ± 0.035
MetAsn: 1.603 ± 0.057
MetPro: 1.18 ± 0.046
MetGln: 1.127 ± 0.05
MetArg: 0.921 ± 0.038
MetSer: 1.693 ± 0.055
MetThr: 1.645 ± 0.059
MetVal: 1.273 ± 0.044
MetTrp: 0.158 ± 0.017
MetTyr: 0.861 ± 0.039
MetXaa: 0.002 ± 0.001

Asn
AsnAla: 4.087 ± 0.098
AsnCys: 0.826 ± 0.042
AsnAsp: 3.212 ± 0.076
AsnGlu: 3.608 ± 0.062
AsnPhe: 3.041 ± 0.069
AsnGly: 4.203 ± 0.119
AsnHis: 0.844 ± 0.03
AsnIle: 5.769 ± 0.117
AsnLys: 5.492 ± 0.115
AsnLeu: 5.637 ± 0.113
AsnMet: 1.531 ± 0.047
AsnAsn: 4.295 ± 0.118
AsnPro: 2.65 ± 0.068
AsnGln: 1.785 ± 0.053
AsnArg: 1.918 ± 0.063
AsnSer: 4.256 ± 0.111
AsnThr: 3.162 ± 0.098
AsnVal: 3.433 ± 0.069
AsnTrp: 0.632 ± 0.036
AsnTyr: 2.903 ± 0.071
AsnXaa: 0.002 ± 0.002

Pro
ProAla: 2.358 ± 0.069
ProCys: 0.493 ± 0.029
ProAsp: 2.508 ± 0.064
ProGlu: 2.884 ± 0.076
ProPhe: 1.459 ± 0.054
ProGly: 1.331 ± 0.049
ProHis: 0.595 ± 0.032
ProIle: 2.061 ± 0.052
ProLys: 2.43 ± 0.062
ProLeu: 2.669 ± 0.065
ProMet: 0.726 ± 0.034
ProAsn: 1.985 ± 0.062
ProPro: 0.985 ± 0.05
ProGln: 1.443 ± 0.055
ProArg: 0.89 ± 0.034
ProSer: 2.091 ± 0.052
ProThr: 1.401 ± 0.056
ProVal: 2.786 ± 0.075
ProTrp: 0.208 ± 0.021
ProTyr: 1.467 ± 0.052
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.47 ± 0.068
GlnCys: 0.325 ± 0.024
GlnAsp: 1.618 ± 0.049
GlnGlu: 2.264 ± 0.066
GlnPhe: 1.183 ± 0.042
GlnGly: 1.626 ± 0.047
GlnHis: 0.448 ± 0.025
GlnIle: 2.765 ± 0.068
GlnLys: 3.198 ± 0.084
GlnLeu: 2.665 ± 0.07
GlnMet: 0.891 ± 0.033
GlnAsn: 2.411 ± 0.061
GlnPro: 0.922 ± 0.046
GlnGln: 1.35 ± 0.062
GlnArg: 1.256 ± 0.04
GlnSer: 1.788 ± 0.053
GlnThr: 1.907 ± 0.055
GlnVal: 1.738 ± 0.053
GlnTrp: 0.233 ± 0.018
GlnTyr: 1.311 ± 0.045
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.258 ± 0.067
ArgCys: 0.432 ± 0.028
ArgAsp: 1.949 ± 0.058
ArgGlu: 2.684 ± 0.068
ArgPhe: 1.668 ± 0.048
ArgGly: 1.941 ± 0.061
ArgHis: 0.607 ± 0.035
ArgIle: 2.929 ± 0.067
ArgLys: 3.138 ± 0.079
ArgLeu: 3.338 ± 0.086
ArgMet: 0.861 ± 0.041
ArgAsn: 2.108 ± 0.059
ArgPro: 1.099 ± 0.044
ArgGln: 1.291 ± 0.041
ArgArg: 1.35 ± 0.048
ArgSer: 1.668 ± 0.055
ArgThr: 1.751 ± 0.058
ArgVal: 2.094 ± 0.062
ArgTrp: 0.268 ± 0.02
ArgTyr: 1.442 ± 0.053
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.029 ± 0.088
SerCys: 0.707 ± 0.033
SerAsp: 3.636 ± 0.076
SerGlu: 3.831 ± 0.086
SerPhe: 2.737 ± 0.071
SerGly: 4.287 ± 0.093
SerHis: 0.955 ± 0.034
SerIle: 4.538 ± 0.084
SerLys: 4.866 ± 0.082
SerLeu: 5.154 ± 0.09
SerMet: 1.445 ± 0.048
SerAsn: 3.311 ± 0.079
SerPro: 1.859 ± 0.059
SerGln: 2.199 ± 0.056
SerArg: 2.083 ± 0.054
SerSer: 4.028 ± 0.088
SerThr: 3.026 ± 0.075
SerVal: 3.855 ± 0.067
SerTrp: 0.46 ± 0.024
SerTyr: 2.409 ± 0.066
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.011 ± 0.091
ThrCys: 0.657 ± 0.037
ThrAsp: 3.505 ± 0.131
ThrGlu: 3.304 ± 0.081
ThrPhe: 2.338 ± 0.066
ThrGly: 3.725 ± 0.078
ThrHis: 0.805 ± 0.037
ThrIle: 3.951 ± 0.086
ThrLys: 4.042 ± 0.079
ThrLeu: 5.209 ± 0.103
ThrMet: 1.122 ± 0.04
ThrAsn: 2.806 ± 0.078
ThrPro: 2.233 ± 0.07
ThrGln: 1.678 ± 0.049
ThrArg: 1.791 ± 0.05
ThrSer: 3.269 ± 0.083
ThrThr: 2.795 ± 0.077
ThrVal: 3.801 ± 0.074
ThrTrp: 0.297 ± 0.026
ThrTyr: 2.093 ± 0.058
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.78 ± 0.086
ValCys: 0.935 ± 0.043
ValAsp: 3.497 ± 0.07
ValGlu: 3.998 ± 0.093
ValPhe: 2.692 ± 0.071
ValGly: 3.12 ± 0.075
ValHis: 0.88 ± 0.034
ValIle: 5.002 ± 0.111
ValLys: 5.673 ± 0.092
ValLeu: 5.532 ± 0.095
ValMet: 1.589 ± 0.049
ValAsn: 3.634 ± 0.085
ValPro: 2.091 ± 0.059
ValGln: 1.884 ± 0.064
ValArg: 2.058 ± 0.062
ValSer: 3.951 ± 0.079
ValThr: 3.418 ± 0.083
ValVal: 3.772 ± 0.087
ValTrp: 0.414 ± 0.027
ValTyr: 2.261 ± 0.065
ValXaa: 0.003 ± 0.002

Trp
TrpAla: 0.485 ± 0.027
TrpCys: 0.105 ± 0.013
TrpAsp: 0.432 ± 0.024
TrpGlu: 0.549 ± 0.032
TrpPhe: 0.354 ± 0.023
TrpGly: 0.571 ± 0.029
TrpHis: 0.169 ± 0.015
TrpIle: 0.484 ± 0.03
TrpLys: 0.51 ± 0.029
TrpLeu: 0.755 ± 0.032
TrpMet: 0.203 ± 0.019
TrpAsn: 0.414 ± 0.027
TrpPro: 0.133 ± 0.016
TrpGln: 0.264 ± 0.024
TrpArg: 0.25 ± 0.021
TrpSer: 0.39 ± 0.023
TrpThr: 0.331 ± 0.021
TrpVal: 0.448 ± 0.029
TrpTrp: 0.092 ± 0.013
TrpTyr: 0.303 ± 0.025
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.467 ± 0.062
TyrCys: 0.635 ± 0.035
TyrAsp: 2.52 ± 0.075
TyrGlu: 2.581 ± 0.067
TyrPhe: 2.1 ± 0.064
TyrGly: 2.392 ± 0.067
TyrHis: 0.626 ± 0.03
TyrIle: 3.085 ± 0.076
TyrLys: 3.689 ± 0.083
TyrLeu: 3.556 ± 0.081
TyrMet: 1.028 ± 0.041
TyrAsn: 3.121 ± 0.084
TyrPro: 1.523 ± 0.062
TyrGln: 1.133 ± 0.045
TyrArg: 1.522 ± 0.048
TyrSer: 2.757 ± 0.07
TyrThr: 2.233 ± 0.068
TyrVal: 2.239 ± 0.058
TyrTrp: 0.331 ± 0.022
TyrTyr: 1.801 ± 0.06
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.002 ± 0.001
XaaCys: 0.003 ± 0.002
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.003 ± 0.002
XaaLeu: 0.003 ± 0.002
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.002 ± 0.002
XaaSer: 0.0 ± 0.0
XaaThr: 0.002 ± 0.002
XaaVal: 0.002 ± 0.001
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.023 ± 0.008

Statistics based on 2311 proteins (640807 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
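A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency of a dipeptide is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the one-letter codes, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions for illustration only; the page states only that bootstrapping was done 100 times at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the per mille error of one dipeptide frequency.

    Hypothetical helper: proteins (not individual residues) are the
    resampling unit, matching "at the protein level" in the note above.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumed error definition: sample standard deviation of the estimates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLAAK", "MKLAKK", "MAAAKL", "MKKKKL"]
    print(round(bootstrap_error(toy, "KK"), 3))
```

Resampling whole proteins keeps the dipeptides of one protein together, so the estimate reflects variation between proteins and not just between individual residue pairs.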

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski