Amino acid dipeptide frequency for Dialister sp. CAG:588

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
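The calculation described above can be sketched in Python. This is only an illustrative sketch, not the code used to build this table: it assumes sequences are given as one-letter strings (the table uses three-letter codes), that dipeptides are counted as overlapping adjacent residue pairs that never span two proteins, and that the red/blue marking compares each value against the uniform expectation of 2.5‰. The names `dipeptide_per_mille`, `classify`, and `UNIFORM_PER_MILLE` are hypothetical.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1000 / 400 = 2.5 per mille (0.25 %).
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences.

    Assumption: overlapping adjacent pairs are counted, and a pair never
    spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the expected value (assumed threshold)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input, not real proteome data.
toy = ["MKLLA", "LLK"]
freqs = dipeptide_per_mille(toy)
```

Under these assumptions, `classify(6.479)` (the AlaAla value in the table) gives "over" and `classify(0.168)` (CysCys) gives "under".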

All values are presented in per mille (‰); multiply them by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.479 ± 0.199
AlaCys: 0.954 ± 0.058
AlaAsp: 3.919 ± 0.115
AlaGlu: 4.87 ± 0.135
AlaPhe: 3.11 ± 0.11
AlaGly: 5.573 ± 0.138
AlaHis: 1.338 ± 0.058
AlaIle: 6.11 ± 0.146
AlaLys: 4.701 ± 0.128
AlaLeu: 7.141 ± 0.168
AlaMet: 2.224 ± 0.091
AlaAsn: 2.729 ± 0.094
AlaPro: 1.985 ± 0.082
AlaGln: 2.38 ± 0.099
AlaArg: 3.116 ± 0.098
AlaSer: 4.043 ± 0.114
AlaThr: 3.733 ± 0.112
AlaVal: 5.765 ± 0.162
AlaTrp: 0.57 ± 0.041
AlaTyr: 2.557 ± 0.092
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.815 ± 0.054
CysCys: 0.168 ± 0.025
CysAsp: 0.659 ± 0.049
CysGlu: 0.641 ± 0.046
CysPhe: 0.588 ± 0.042
CysGly: 1.119 ± 0.06
CysHis: 0.292 ± 0.033
CysIle: 1.175 ± 0.064
CysLys: 0.715 ± 0.055
CysLeu: 1.093 ± 0.06
CysMet: 0.366 ± 0.034
CysAsn: 0.532 ± 0.034
CysPro: 0.49 ± 0.035
CysGln: 0.325 ± 0.031
CysArg: 0.511 ± 0.04
CysSer: 0.729 ± 0.049
CysThr: 0.608 ± 0.042
CysVal: 0.753 ± 0.045
CysTrp: 0.095 ± 0.017
CysTyr: 0.467 ± 0.045
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.594 ± 0.121
AspCys: 0.585 ± 0.048
AspAsp: 2.236 ± 0.097
AspGlu: 3.316 ± 0.11
AspPhe: 2.398 ± 0.082
AspGly: 3.437 ± 0.114
AspHis: 0.759 ± 0.054
AspIle: 4.749 ± 0.113
AspLys: 4.016 ± 0.136
AspLeu: 4.294 ± 0.12
AspMet: 1.778 ± 0.07
AspAsn: 2.138 ± 0.084
AspPro: 1.409 ± 0.058
AspGln: 0.986 ± 0.05
AspArg: 2.073 ± 0.084
AspSer: 2.997 ± 0.102
AspThr: 2.891 ± 0.095
AspVal: 3.792 ± 0.132
AspTrp: 0.52 ± 0.037
AspTyr: 1.943 ± 0.087
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.961 ± 0.136
GluCys: 0.667 ± 0.05
GluAsp: 3.322 ± 0.121
GluGlu: 6.476 ± 0.173
GluPhe: 2.253 ± 0.091
GluGly: 4.261 ± 0.13
GluHis: 1.394 ± 0.056
GluIle: 5.401 ± 0.123
GluLys: 6.078 ± 0.165
GluLeu: 6.479 ± 0.152
GluMet: 2.354 ± 0.095
GluAsn: 3.641 ± 0.095
GluPro: 1.571 ± 0.06
GluGln: 2.841 ± 0.11
GluArg: 3.086 ± 0.098
GluSer: 3.786 ± 0.101
GluThr: 3.245 ± 0.095
GluVal: 4.4 ± 0.115
GluTrp: 0.644 ± 0.045
GluTyr: 2.256 ± 0.079
GluXaa: 0.003 ± 0.003
Phe
PheAla: 2.554 ± 0.084
PheCys: 0.608 ± 0.042
PheAsp: 2.144 ± 0.077
PheGlu: 2.129 ± 0.072
PhePhe: 1.964 ± 0.09
PheGly: 2.847 ± 0.107
PheHis: 0.883 ± 0.058
PheIle: 3.984 ± 0.137
PheLys: 2.357 ± 0.087
PheLeu: 3.981 ± 0.136
PheMet: 1.323 ± 0.071
PheAsn: 1.872 ± 0.088
PhePro: 1.418 ± 0.07
PheGln: 1.037 ± 0.054
PheArg: 1.654 ± 0.074
PheSer: 3.021 ± 0.102
PheThr: 2.398 ± 0.083
PheVal: 2.354 ± 0.102
PheTrp: 0.47 ± 0.042
PheTyr: 1.698 ± 0.076
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.059 ± 0.145
GlyCys: 0.907 ± 0.059
GlyAsp: 3.485 ± 0.093
GlyGlu: 4.27 ± 0.122
GlyPhe: 2.794 ± 0.099
GlyGly: 4.781 ± 0.132
GlyHis: 1.533 ± 0.078
GlyIle: 6.916 ± 0.175
GlyLys: 5.342 ± 0.142
GlyLeu: 5.965 ± 0.151
GlyMet: 2.203 ± 0.093
GlyAsn: 3.062 ± 0.12
GlyPro: 1.415 ± 0.071
GlyGln: 2.283 ± 0.081
GlyArg: 3.133 ± 0.121
GlySer: 3.783 ± 0.114
GlyThr: 4.155 ± 0.126
GlyVal: 5.112 ± 0.14
GlyTrp: 0.623 ± 0.044
GlyTyr: 2.932 ± 0.084
GlyXaa: 0.003 ± 0.003
His
HisAla: 1.462 ± 0.067
HisCys: 0.263 ± 0.026
HisAsp: 0.91 ± 0.05
HisGlu: 1.084 ± 0.065
HisPhe: 0.898 ± 0.049
HisGly: 1.444 ± 0.067
HisHis: 0.591 ± 0.051
HisIle: 2.159 ± 0.089
HisLys: 1.308 ± 0.064
HisLeu: 2.011 ± 0.082
HisMet: 0.732 ± 0.05
HisAsn: 0.853 ± 0.051
HisPro: 0.995 ± 0.058
HisGln: 0.685 ± 0.046
HisArg: 0.907 ± 0.052
HisSer: 1.264 ± 0.067
HisThr: 1.161 ± 0.051
HisVal: 1.568 ± 0.074
HisTrp: 0.233 ± 0.028
HisTyr: 0.815 ± 0.044
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.624 ± 0.138
IleCys: 1.184 ± 0.068
IleAsp: 4.341 ± 0.118
IleGlu: 5.277 ± 0.133
IlePhe: 3.03 ± 0.114
IleGly: 6.14 ± 0.137
IleHis: 1.987 ± 0.079
IleIle: 6.656 ± 0.193
IleLys: 5.528 ± 0.127
IleLeu: 7.968 ± 0.199
IleMet: 2.227 ± 0.073
IleAsn: 3.399 ± 0.121
IlePro: 3.866 ± 0.117
IleGln: 2.699 ± 0.107
IleArg: 3.815 ± 0.117
IleSer: 6.06 ± 0.131
IleThr: 4.911 ± 0.118
IleVal: 5.546 ± 0.151
IleTrp: 0.732 ± 0.054
IleTyr: 2.897 ± 0.107
IleXaa: 0.003 ± 0.003
Lys
LysAla: 4.988 ± 0.161
LysCys: 0.526 ± 0.04
LysAsp: 3.742 ± 0.12
LysGlu: 7.454 ± 0.16
LysPhe: 1.719 ± 0.071
LysGly: 4.362 ± 0.119
LysHis: 1.376 ± 0.061
LysIle: 5.327 ± 0.144
LysLys: 6.506 ± 0.145
LysLeu: 5.404 ± 0.129
LysMet: 2.259 ± 0.082
LysAsn: 3.966 ± 0.11
LysPro: 2.02 ± 0.079
LysGln: 3.062 ± 0.096
LysArg: 3.151 ± 0.105
LysSer: 3.892 ± 0.133
LysThr: 3.641 ± 0.098
LysVal: 4.728 ± 0.126
LysTrp: 0.644 ± 0.044
LysTyr: 2.41 ± 0.079
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.061 ± 0.153
LeuCys: 1.276 ± 0.061
LeuAsp: 4.524 ± 0.125
LeuGlu: 5.759 ± 0.133
LeuPhe: 4.164 ± 0.128
LeuGly: 6.526 ± 0.154
LeuHis: 2.2 ± 0.094
LeuIle: 6.807 ± 0.163
LeuLys: 5.924 ± 0.145
LeuLeu: 9.264 ± 0.213
LeuMet: 2.569 ± 0.075
LeuAsn: 3.423 ± 0.102
LeuPro: 3.771 ± 0.103
LeuGln: 3.237 ± 0.109
LeuArg: 3.703 ± 0.112
LeuSer: 6.925 ± 0.171
LeuThr: 5.342 ± 0.133
LeuVal: 5.629 ± 0.135
LeuTrp: 0.883 ± 0.053
LeuTyr: 3.237 ± 0.092
LeuXaa: 0.006 ± 0.004
Met
MetAla: 2.436 ± 0.084
MetCys: 0.254 ± 0.027
MetAsp: 1.468 ± 0.063
MetGlu: 2.191 ± 0.083
MetPhe: 0.977 ± 0.061
MetGly: 2.12 ± 0.088
MetHis: 0.712 ± 0.043
MetIle: 2.309 ± 0.093
MetLys: 2.552 ± 0.089
MetLeu: 2.395 ± 0.082
MetMet: 0.851 ± 0.051
MetAsn: 1.663 ± 0.062
MetPro: 1.102 ± 0.058
MetGln: 1.267 ± 0.057
MetArg: 1.223 ± 0.069
MetSer: 1.905 ± 0.083
MetThr: 1.775 ± 0.065
MetVal: 2.005 ± 0.08
MetTrp: 0.189 ± 0.026
MetTyr: 0.93 ± 0.05
MetXaa: 0.003 ± 0.003
Asn
AsnAla: 2.959 ± 0.101
AsnCys: 0.475 ± 0.038
AsnAsp: 2.014 ± 0.077
AsnGlu: 2.811 ± 0.102
AsnPhe: 1.524 ± 0.063
AsnGly: 3.051 ± 0.116
AsnHis: 1.087 ± 0.07
AsnIle: 3.863 ± 0.146
AsnLys: 3.75 ± 0.12
AsnLeu: 3.736 ± 0.109
AsnMet: 1.226 ± 0.058
AsnAsn: 2.041 ± 0.089
AsnPro: 2.026 ± 0.081
AsnGln: 1.595 ± 0.066
AsnArg: 1.979 ± 0.079
AsnSer: 2.398 ± 0.077
AsnThr: 2.487 ± 0.082
AsnVal: 2.918 ± 0.089
AsnTrp: 0.443 ± 0.035
AsnTyr: 1.494 ± 0.062
AsnXaa: 0.003 ± 0.003
Pro
ProAla: 2.221 ± 0.075
ProCys: 0.431 ± 0.038
ProAsp: 1.781 ± 0.073
ProGlu: 2.622 ± 0.094
ProPhe: 1.801 ± 0.07
ProGly: 2.15 ± 0.091
ProHis: 0.756 ± 0.048
ProIle: 2.935 ± 0.094
ProLys: 1.831 ± 0.076
ProLeu: 3.269 ± 0.106
ProMet: 0.954 ± 0.057
ProAsn: 1.311 ± 0.063
ProPro: 0.803 ± 0.052
ProGln: 1.037 ± 0.061
ProArg: 1.028 ± 0.051
ProSer: 1.846 ± 0.065
ProThr: 1.922 ± 0.077
ProVal: 2.98 ± 0.113
ProTrp: 0.334 ± 0.032
ProTyr: 1.397 ± 0.06
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.67 ± 0.104
GlnCys: 0.328 ± 0.028
GlnAsp: 1.459 ± 0.063
GlnGlu: 2.516 ± 0.094
GlnPhe: 1.187 ± 0.059
GlnGly: 2.274 ± 0.082
GlnHis: 0.611 ± 0.042
GlnIle: 2.85 ± 0.084
GlnLys: 2.469 ± 0.098
GlnLeu: 3.24 ± 0.112
GlnMet: 1.102 ± 0.052
GlnAsn: 1.471 ± 0.063
GlnPro: 0.874 ± 0.052
GlnGln: 1.329 ± 0.066
GlnArg: 1.639 ± 0.079
GlnSer: 1.79 ± 0.072
GlnThr: 1.648 ± 0.07
GlnVal: 2.342 ± 0.083
GlnTrp: 0.357 ± 0.03
GlnTyr: 1.166 ± 0.057
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.829 ± 0.09
ArgCys: 0.511 ± 0.044
ArgAsp: 2.002 ± 0.095
ArgGlu: 3.31 ± 0.113
ArgPhe: 1.863 ± 0.063
ArgGly: 2.711 ± 0.089
ArgHis: 0.848 ± 0.046
ArgIle: 3.688 ± 0.114
ArgLys: 3.443 ± 0.097
ArgLeu: 3.993 ± 0.103
ArgMet: 1.373 ± 0.067
ArgAsn: 2.209 ± 0.073
ArgPro: 1.255 ± 0.072
ArgGln: 1.68 ± 0.071
ArgArg: 2.126 ± 0.093
ArgSer: 2.088 ± 0.081
ArgThr: 2.126 ± 0.088
ArgVal: 2.841 ± 0.101
ArgTrp: 0.428 ± 0.037
ArgTyr: 1.542 ± 0.076
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.238 ± 0.122
SerCys: 0.8 ± 0.049
SerAsp: 3.027 ± 0.091
SerGlu: 3.535 ± 0.112
SerPhe: 3.059 ± 0.092
SerGly: 4.368 ± 0.113
SerHis: 1.355 ± 0.069
SerIle: 5.525 ± 0.123
SerLys: 3.688 ± 0.115
SerLeu: 6.172 ± 0.155
SerMet: 1.908 ± 0.079
SerAsn: 2.312 ± 0.086
SerPro: 1.931 ± 0.075
SerGln: 1.731 ± 0.073
SerArg: 2.487 ± 0.086
SerSer: 3.975 ± 0.129
SerThr: 3.172 ± 0.105
SerVal: 4.3 ± 0.122
SerTrp: 0.679 ± 0.044
SerTyr: 2.584 ± 0.086
SerXaa: 0.006 ± 0.004
Thr
ThrAla: 4.241 ± 0.118
ThrCys: 0.694 ± 0.054
ThrAsp: 2.779 ± 0.086
ThrGlu: 3.31 ± 0.104
ThrPhe: 2.132 ± 0.081
ThrGly: 4.427 ± 0.122
ThrHis: 1.158 ± 0.056
ThrIle: 4.769 ± 0.129
ThrLys: 3.375 ± 0.097
ThrLeu: 5.387 ± 0.126
ThrMet: 1.562 ± 0.067
ThrAsn: 2.097 ± 0.077
ThrPro: 2.233 ± 0.086
ThrGln: 1.485 ± 0.074
ThrArg: 2.197 ± 0.077
ThrSer: 3.095 ± 0.109
ThrThr: 3.092 ± 0.099
ThrVal: 4.235 ± 0.125
ThrTrp: 0.588 ± 0.039
ThrTyr: 2.176 ± 0.079
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.124 ± 0.121
ValCys: 0.942 ± 0.054
ValAsp: 3.644 ± 0.11
ValGlu: 4.53 ± 0.126
ValPhe: 3.024 ± 0.096
ValGly: 4.672 ± 0.134
ValHis: 1.488 ± 0.071
ValIle: 5.679 ± 0.152
ValLys: 4.303 ± 0.109
ValLeu: 6.37 ± 0.127
ValMet: 1.934 ± 0.075
ValAsn: 2.944 ± 0.096
ValPro: 2.557 ± 0.086
ValGln: 2.014 ± 0.081
ValArg: 2.962 ± 0.103
ValSer: 4.69 ± 0.119
ValThr: 4.205 ± 0.115
ValVal: 5.091 ± 0.142
ValTrp: 0.659 ± 0.042
ValTyr: 2.419 ± 0.082
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.602 ± 0.045
TrpCys: 0.118 ± 0.017
TrpAsp: 0.475 ± 0.034
TrpGlu: 0.667 ± 0.049
TrpPhe: 0.461 ± 0.036
TrpGly: 0.75 ± 0.044
TrpHis: 0.171 ± 0.025
TrpIle: 0.889 ± 0.054
TrpLys: 0.871 ± 0.048
TrpLeu: 0.797 ± 0.052
TrpMet: 0.313 ± 0.034
TrpAsn: 0.585 ± 0.039
TrpPro: 0.263 ± 0.03
TrpGln: 0.369 ± 0.032
TrpArg: 0.348 ± 0.03
TrpSer: 0.532 ± 0.038
TrpThr: 0.428 ± 0.036
TrpVal: 0.487 ± 0.041
TrpTrp: 0.13 ± 0.022
TrpTyr: 0.39 ± 0.039
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.374 ± 0.086
TyrCys: 0.449 ± 0.041
TyrAsp: 1.982 ± 0.085
TyrGlu: 2.348 ± 0.099
TyrPhe: 1.745 ± 0.078
TyrGly: 2.743 ± 0.084
TyrHis: 0.803 ± 0.056
TyrIle: 3.095 ± 0.099
TyrLys: 2.416 ± 0.096
TyrLeu: 3.272 ± 0.104
TyrMet: 1.066 ± 0.05
TyrAsn: 1.633 ± 0.064
TyrPro: 1.426 ± 0.053
TyrGln: 1.279 ± 0.06
TyrArg: 1.728 ± 0.076
TyrSer: 2.035 ± 0.082
TyrThr: 2.088 ± 0.078
TyrVal: 2.404 ± 0.094
TyrTrp: 0.425 ± 0.038
TyrTyr: 1.485 ± 0.071
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.003 ± 0.003
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.003 ± 0.003
XaaPhe: 0.003 ± 0.003
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.003 ± 0.003
XaaLys: 0.003 ± 0.003
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.003 ± 0.003
XaaSer: 0.0 ± 0.0
XaaThr: 0.003 ± 0.003
XaaVal: 0.003 ± 0.003
XaaTrp: 0.003 ± 0.003
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.038 ± 0.015
Statistics based on 1171 proteins (338624 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
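A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is an illustration under stated assumptions, not the database's actual code: whole proteins are resampled with replacement, the dipeptide frequency (in ‰) is recomputed for each of 100 replicates, and the spread is reported as the sample standard deviation of the replicates. Whether the ± values in the table are standard deviations is an assumption, and the names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement,
    keeping the number of proteins fixed.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
mean_ll, err_ll = bootstrap_error(["MKLLA", "LLK", "AAAA", "MKV"], "LL")
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between pairs within the same protein, which is presumably why the error is estimated at the protein level.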

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski