Amino acid dipeptide frequency for Faecalibacterium sp. CAG:82

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
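As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the toy sequences are assumptions made for this example; the exact counting procedure used by Proteome-pI is not described on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Hypothetical helper: pairs are counted with an overlapping window of
    length 2 and never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (one-letter codes), not real proteome data.
toy_proteome = ["MAAL", "MAL"]
freqs = dipeptide_frequencies(toy_proteome)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Because every dipeptide is normalised by the same total, the per mille values over all observed pairs add up to 1000.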

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
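The baseline described above can be written out as a short calculation. The sketch below assumes a uniform expectation over 20 (or 21) residue types and a simple greater-than or less-than comparison; the helper names are hypothetical, and the exact rule the site uses for colouring cells is not stated on this page.

```python
def uniform_expectation_permille(n_residue_types=20):
    """Expected frequency (per mille) if all dipeptides were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if observed_permille > expected_permille:
        return "overrepresented"
    if observed_permille < expected_permille:
        return "underrepresented"
    return "as expected"


expected_20 = uniform_expectation_permille(20)  # 400 possible dipeptides
expected_21 = uniform_expectation_permille(21)  # 441 when Xaa is included

# Two values taken from the table on this page.
print("AlaAla:", classify(14.751, expected_20))
print("TrpTrp:", classify(0.128, expected_20))
```

A uniform baseline ignores the fact that single amino acids differ in abundance, so a frequent residue such as Ala will tend to appear overrepresented in many pairs.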

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
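To make the unit conversion concrete, the sketch below turns a per mille value into a fraction and then into an approximate dipeptide count. It uses the protein and amino acid totals reported at the bottom of this page, and it assumes that a protein of length n contributes n - 1 dipeptides; that assumption, and the helper name, are not taken from the source.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3


# Totals as reported in the statistics line of this page.
n_proteins = 1856
n_amino_acids = 587560

# Assumption: each protein of length n yields n - 1 overlapping dipeptides.
n_dipeptides = n_amino_acids - n_proteins

ala_ala_fraction = permille_to_fraction(14.751)  # AlaAla value from the table
estimated_ala_ala_count = ala_ala_fraction * n_dipeptides

print(ala_ala_fraction)
print(round(estimated_ala_ala_count))
```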

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

| First / second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 14.751 ± 0.277 | 1.767 ± 0.05 | 5.969 ± 0.122 | 7.49 ± 0.138 | 3.571 ± 0.079 | 8.425 ± 0.146 | 1.801 ± 0.062 | 5.152 ± 0.109 | 5.16 ± 0.105 | 11.384 ± 0.197 | 2.868 ± 0.076 | 2.977 ± 0.071 | 3.746 ± 0.092 | 3.994 ± 0.103 | 5.482 ± 0.124 | 5.167 ± 0.114 | 4.025 ± 0.091 | 8.624 ± 0.133 | 0.865 ± 0.039 | 2.71 ± 0.08 | 0.0 ± 0.0 |
| Cys | 1.86 ± 0.055 | 0.369 ± 0.028 | 0.963 ± 0.035 | 1.035 ± 0.045 | 0.655 ± 0.037 | 1.937 ± 0.068 | 0.357 ± 0.024 | 1.004 ± 0.041 | 0.788 ± 0.04 | 1.396 ± 0.05 | 0.47 ± 0.028 | 0.567 ± 0.033 | 0.873 ± 0.044 | 0.466 ± 0.029 | 0.909 ± 0.043 | 0.933 ± 0.045 | 1.135 ± 0.061 | 1.316 ± 0.051 | 0.179 ± 0.016 | 0.568 ± 0.026 | 0.0 ± 0.0 |
| Asp | 5.848 ± 0.124 | 0.922 ± 0.04 | 2.769 ± 0.09 | 4.042 ± 0.095 | 2.301 ± 0.07 | 4.692 ± 0.095 | 1.077 ± 0.047 | 3.009 ± 0.061 | 2.383 ± 0.078 | 4.912 ± 0.105 | 1.242 ± 0.043 | 1.632 ± 0.054 | 2.687 ± 0.075 | 1.455 ± 0.056 | 2.35 ± 0.069 | 2.682 ± 0.079 | 3.144 ± 0.069 | 3.843 ± 0.087 | 0.64 ± 0.033 | 2.264 ± 0.075 | 0.0 ± 0.0 |
| Glu | 6.678 ± 0.122 | 0.815 ± 0.037 | 3.441 ± 0.07 | 5.072 ± 0.125 | 1.932 ± 0.068 | 4.335 ± 0.098 | 1.375 ± 0.052 | 3.462 ± 0.077 | 4.318 ± 0.091 | 6.609 ± 0.133 | 2.146 ± 0.058 | 3.035 ± 0.065 | 2.279 ± 0.074 | 3.155 ± 0.081 | 3.273 ± 0.092 | 2.904 ± 0.07 | 3.566 ± 0.086 | 4.095 ± 0.089 | 0.568 ± 0.028 | 2.218 ± 0.069 | 0.0 ± 0.0 |
| Phe | 3.858 ± 0.072 | 0.848 ± 0.039 | 2.286 ± 0.058 | 2.1 ± 0.068 | 1.433 ± 0.053 | 3.242 ± 0.085 | 0.771 ± 0.037 | 1.813 ± 0.064 | 1.396 ± 0.052 | 3.705 ± 0.105 | 0.842 ± 0.042 | 1.207 ± 0.051 | 1.384 ± 0.045 | 1.093 ± 0.051 | 1.729 ± 0.054 | 2.347 ± 0.074 | 2.367 ± 0.073 | 2.723 ± 0.071 | 0.468 ± 0.032 | 1.377 ± 0.052 | 0.0 ± 0.0 |
| Gly | 7.177 ± 0.115 | 1.693 ± 0.056 | 3.783 ± 0.08 | 4.718 ± 0.09 | 3.147 ± 0.069 | 6.459 ± 0.144 | 1.564 ± 0.064 | 5.014 ± 0.106 | 4.68 ± 0.095 | 7.375 ± 0.135 | 2.589 ± 0.064 | 2.429 ± 0.063 | 2.104 ± 0.062 | 2.675 ± 0.069 | 3.971 ± 0.097 | 4.604 ± 0.111 | 4.728 ± 0.089 | 6.408 ± 0.101 | 0.979 ± 0.049 | 3.011 ± 0.07 | 0.0 ± 0.0 |
| His | 1.61 ± 0.053 | 0.458 ± 0.034 | 1.014 ± 0.05 | 0.979 ± 0.045 | 0.873 ± 0.035 | 1.602 ± 0.051 | 0.521 ± 0.039 | 1.162 ± 0.048 | 0.905 ± 0.039 | 1.879 ± 0.064 | 0.465 ± 0.026 | 0.637 ± 0.033 | 1.154 ± 0.045 | 0.609 ± 0.034 | 0.938 ± 0.043 | 1.036 ± 0.047 | 1.251 ± 0.053 | 1.147 ± 0.042 | 0.235 ± 0.022 | 0.791 ± 0.036 | 0.0 ± 0.0 |
| Ile | 5.215 ± 0.091 | 1.013 ± 0.038 | 2.961 ± 0.071 | 3.036 ± 0.072 | 1.942 ± 0.068 | 4.093 ± 0.092 | 1.059 ± 0.044 | 2.766 ± 0.08 | 2.221 ± 0.076 | 5.564 ± 0.113 | 1.265 ± 0.056 | 1.642 ± 0.058 | 2.613 ± 0.068 | 1.729 ± 0.055 | 2.795 ± 0.07 | 3.181 ± 0.073 | 3.361 ± 0.078 | 4.051 ± 0.094 | 0.46 ± 0.028 | 1.663 ± 0.052 | 0.0 ± 0.0 |
| Lys | 5.344 ± 0.103 | 0.582 ± 0.033 | 2.577 ± 0.067 | 3.492 ± 0.089 | 1.372 ± 0.049 | 3.436 ± 0.089 | 0.885 ± 0.039 | 2.716 ± 0.073 | 3.414 ± 0.093 | 4.817 ± 0.093 | 1.602 ± 0.06 | 2.168 ± 0.059 | 2.173 ± 0.07 | 2.015 ± 0.067 | 2.313 ± 0.07 | 2.175 ± 0.066 | 3.087 ± 0.078 | 3.169 ± 0.084 | 0.448 ± 0.024 | 1.714 ± 0.055 | 0.0 ± 0.0 |
| Leu | 10.826 ± 0.169 | 2.276 ± 0.069 | 5.858 ± 0.097 | 5.805 ± 0.112 | 3.761 ± 0.099 | 7.458 ± 0.137 | 2.019 ± 0.059 | 4.48 ± 0.096 | 4.459 ± 0.098 | 10.624 ± 0.203 | 2.616 ± 0.064 | 3.494 ± 0.082 | 5.08 ± 0.096 | 3.123 ± 0.083 | 5.496 ± 0.123 | 5.851 ± 0.112 | 6.258 ± 0.111 | 6.913 ± 0.129 | 1.074 ± 0.049 | 3.315 ± 0.084 | 0.0 ± 0.0 |
| Met | 3.084 ± 0.07 | 0.279 ± 0.021 | 1.576 ± 0.048 | 1.961 ± 0.062 | 0.841 ± 0.039 | 2.235 ± 0.065 | 0.458 ± 0.029 | 1.356 ± 0.048 | 1.666 ± 0.049 | 2.703 ± 0.077 | 0.807 ± 0.042 | 1.157 ± 0.051 | 1.29 ± 0.053 | 1.093 ± 0.043 | 1.273 ± 0.051 | 1.619 ± 0.043 | 1.671 ± 0.053 | 1.918 ± 0.062 | 0.182 ± 0.019 | 0.669 ± 0.035 | 0.0 ± 0.0 |
| Asn | 3.603 ± 0.085 | 0.562 ± 0.032 | 1.702 ± 0.056 | 2.034 ± 0.071 | 1.317 ± 0.044 | 3.137 ± 0.087 | 0.671 ± 0.031 | 2.187 ± 0.056 | 1.481 ± 0.052 | 3.241 ± 0.076 | 0.921 ± 0.038 | 1.125 ± 0.048 | 1.753 ± 0.057 | 1.055 ± 0.042 | 1.576 ± 0.053 | 1.649 ± 0.05 | 2.044 ± 0.066 | 2.519 ± 0.066 | 0.357 ± 0.027 | 1.345 ± 0.051 | 0.0 ± 0.0 |
| Pro | 4.835 ± 0.109 | 0.633 ± 0.034 | 2.728 ± 0.073 | 4.064 ± 0.098 | 1.545 ± 0.052 | 3.608 ± 0.072 | 0.788 ± 0.039 | 1.925 ± 0.061 | 1.819 ± 0.061 | 3.843 ± 0.086 | 1.101 ± 0.042 | 1.29 ± 0.047 | 1.202 ± 0.051 | 1.448 ± 0.05 | 1.746 ± 0.058 | 1.928 ± 0.062 | 2.156 ± 0.068 | 3.516 ± 0.083 | 0.429 ± 0.024 | 1.453 ± 0.049 | 0.0 ± 0.0 |
| Gln | 3.334 ± 0.085 | 0.563 ± 0.032 | 1.496 ± 0.05 | 2.134 ± 0.065 | 1.239 ± 0.045 | 2.366 ± 0.064 | 0.647 ± 0.036 | 1.932 ± 0.061 | 2.284 ± 0.066 | 3.894 ± 0.093 | 1.174 ± 0.046 | 1.571 ± 0.057 | 1.51 ± 0.057 | 1.755 ± 0.061 | 1.967 ± 0.069 | 1.877 ± 0.062 | 1.882 ± 0.065 | 2.33 ± 0.061 | 0.449 ± 0.03 | 1.275 ± 0.052 | 0.0 ± 0.0 |
| Arg | 5.057 ± 0.125 | 0.911 ± 0.037 | 2.463 ± 0.067 | 3.462 ± 0.095 | 2.042 ± 0.061 | 3.355 ± 0.075 | 0.951 ± 0.045 | 2.524 ± 0.069 | 2.721 ± 0.067 | 4.89 ± 0.113 | 1.659 ± 0.055 | 1.683 ± 0.053 | 2.029 ± 0.067 | 2.163 ± 0.075 | 3.404 ± 0.096 | 2.412 ± 0.073 | 2.774 ± 0.076 | 3.76 ± 0.081 | 0.584 ± 0.036 | 1.813 ± 0.057 | 0.0 ± 0.0 |
| Ser | 5.645 ± 0.112 | 0.924 ± 0.052 | 2.655 ± 0.077 | 2.978 ± 0.076 | 2.214 ± 0.072 | 4.994 ± 0.106 | 0.991 ± 0.045 | 3.064 ± 0.084 | 2.158 ± 0.062 | 4.965 ± 0.096 | 1.542 ± 0.049 | 1.636 ± 0.045 | 1.949 ± 0.056 | 1.656 ± 0.051 | 2.771 ± 0.079 | 3.404 ± 0.123 | 3.186 ± 0.076 | 4.095 ± 0.087 | 0.528 ± 0.033 | 1.899 ± 0.06 | 0.0 ± 0.0 |
| Thr | 6.512 ± 0.128 | 0.848 ± 0.04 | 3.004 ± 0.081 | 3.407 ± 0.084 | 2.053 ± 0.063 | 4.885 ± 0.096 | 1.004 ± 0.043 | 3.142 ± 0.073 | 2.231 ± 0.068 | 6.243 ± 0.107 | 1.472 ± 0.063 | 1.704 ± 0.061 | 3.074 ± 0.083 | 1.651 ± 0.052 | 2.71 ± 0.069 | 2.754 ± 0.074 | 3.232 ± 0.089 | 4.949 ± 0.115 | 0.504 ± 0.029 | 1.707 ± 0.056 | 0.0 ± 0.0 |
| Val | 6.621 ± 0.109 | 1.532 ± 0.053 | 4.001 ± 0.078 | 4.857 ± 0.095 | 2.953 ± 0.09 | 5.259 ± 0.104 | 1.311 ± 0.051 | 3.821 ± 0.095 | 3.338 ± 0.091 | 8.261 ± 0.15 | 1.908 ± 0.061 | 2.48 ± 0.058 | 3.346 ± 0.075 | 2.728 ± 0.069 | 3.646 ± 0.084 | 4.428 ± 0.098 | 4.417 ± 0.092 | 5.674 ± 0.127 | 0.783 ± 0.04 | 2.418 ± 0.07 | 0.002 ± 0.002 |
| Trp | 0.861 ± 0.044 | 0.179 ± 0.018 | 0.553 ± 0.031 | 0.638 ± 0.035 | 0.385 ± 0.03 | 0.768 ± 0.04 | 0.214 ± 0.02 | 0.357 ± 0.024 | 0.575 ± 0.029 | 1.174 ± 0.049 | 0.359 ± 0.026 | 0.488 ± 0.032 | 0.402 ± 0.026 | 0.533 ± 0.033 | 0.478 ± 0.03 | 0.483 ± 0.029 | 0.495 ± 0.029 | 0.723 ± 0.036 | 0.128 ± 0.014 | 0.398 ± 0.032 | 0.0 ± 0.0 |
| Tyr | 3.206 ± 0.082 | 0.608 ± 0.036 | 2.095 ± 0.073 | 1.971 ± 0.064 | 1.413 ± 0.049 | 2.824 ± 0.069 | 0.803 ± 0.039 | 1.743 ± 0.067 | 1.501 ± 0.055 | 3.309 ± 0.077 | 0.788 ± 0.04 | 1.356 ± 0.047 | 1.414 ± 0.05 | 1.282 ± 0.04 | 1.83 ± 0.06 | 1.818 ± 0.054 | 2.303 ± 0.07 | 2.01 ± 0.065 | 0.335 ± 0.027 | 1.411 ± 0.06 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.002 ± 0.002 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.003 ± 0.004 |

Statistics based on 1856 proteins (587560 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
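A minimal sketch of protein-level bootstrapping is given below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the fixed seed, the toy sequences, and the choice of the sample standard deviation as the error measure are all assumptions for illustration; the page does not specify which statistic is reported after the ± sign.

```python
import random
import statistics


def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumed procedure: draw len(proteins) proteins with replacement,
    recompute the frequency, repeat n_rounds times, and return the sample
    standard deviation of the replicate estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (one-letter codes), not real proteome data.
toy_proteome = ["MAAL", "MAL", "MKKA", "AAAA"]
print(dipeptide_permille(toy_proteome, "AA"))
print(bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within a protein together, so the estimate reflects variation between proteins.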

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski