Amino acid dipeptide frequency for Firmicutes bacterium CAG:345

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
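The counting and the comparison against the uniform expectation can be sketched as follows. This is only an illustration, not the code used to build the database: it assumes overlapping two-residue windows that never span two proteins, one-letter residue codes, and that every non-standard letter is folded into X (Xaa). The function names and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard residues as one-letter codes; anything else becomes "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
# Uniform expectation over 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping windows.

    Windows never span two proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency the way the table colours it: red above, blue below."""
    if freq_per_mille > expected:
        return "red"
    if freq_per_mille < expected:
        return "blue"
    return "neutral"


# Hypothetical toy input, not taken from the proteome.
toy_proteins = ["MKKLLIA", "MSSKK"]
freqs = dipeptide_per_mille(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 1), classify(value))
```

With values from the table, `classify(2.91)` (AlaAla) gives "red" and `classify(0.769)` (AlaCys) gives "blue".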

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
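As a small worked example of the unit conversion, the sketch below turns a per-mille value into a fraction and a percentage and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical; the AlaAla value is the one reported in the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value / 10.0


uniform = 1000.0 / 400   # uniform expectation for 400 dipeptides, in per mille
ala_ala = 2.91           # AlaAla frequency from the table, in per mille

print(per_mille_to_percent(uniform))             # uniform expectation as a percentage
print(round(per_mille_to_fraction(ala_ala), 5))  # AlaAla as a fraction
print(round(ala_ala / uniform, 3))               # AlaAla relative to the expectation
```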

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error); rows give the first residue of the dipeptide, columns the second.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 2.91 ± 0.111 | 0.769 ± 0.048 | 2.884 ± 0.085 | 3.058 ± 0.095 | 2.845 ± 0.105 | 3.016 ± 0.102 | 0.838 ± 0.044 | 5.188 ± 0.126 | 4.376 ± 0.107 | 5.546 ± 0.112 | 1.336 ± 0.061 | 2.957 ± 0.082 | 1.287 ± 0.051 | 1.419 ± 0.071 | 1.741 ± 0.071 | 3.616 ± 0.087 | 2.877 ± 0.098 | 3.095 ± 0.091 | 0.309 ± 0.022 | 2.347 ± 0.076 | 0.0 ± 0.0 |
| Cys | 0.598 ± 0.042 | 0.199 ± 0.022 | 0.679 ± 0.038 | 0.573 ± 0.035 | 0.767 ± 0.041 | 0.95 ± 0.046 | 0.277 ± 0.028 | 0.881 ± 0.044 | 0.687 ± 0.039 | 1.22 ± 0.051 | 0.175 ± 0.02 | 0.718 ± 0.037 | 0.512 ± 0.032 | 0.307 ± 0.023 | 0.368 ± 0.031 | 1.009 ± 0.048 | 0.622 ± 0.035 | 0.504 ± 0.029 | 0.114 ± 0.017 | 0.724 ± 0.043 | 0.0 ± 0.0 |
| Asp | 2.78 ± 0.097 | 0.628 ± 0.037 | 3.65 ± 0.083 | 5.204 ± 0.116 | 3.463 ± 0.093 | 3.689 ± 0.101 | 0.862 ± 0.041 | 5.306 ± 0.105 | 5.078 ± 0.11 | 5.397 ± 0.118 | 1.151 ± 0.052 | 3.911 ± 0.099 | 1.615 ± 0.058 | 1.184 ± 0.052 | 1.641 ± 0.061 | 3.892 ± 0.103 | 2.607 ± 0.086 | 3.429 ± 0.085 | 0.362 ± 0.024 | 3.49 ± 0.078 | 0.0 ± 0.0 |
| Glu | 3.516 ± 0.098 | 0.6 ± 0.039 | 4.14 ± 0.095 | 5.993 ± 0.123 | 2.869 ± 0.087 | 3.361 ± 0.092 | 0.974 ± 0.047 | 6.42 ± 0.134 | 7.841 ± 0.162 | 5.794 ± 0.114 | 1.401 ± 0.055 | 5.757 ± 0.118 | 1.318 ± 0.057 | 1.647 ± 0.055 | 1.916 ± 0.067 | 3.136 ± 0.088 | 3.052 ± 0.081 | 4.022 ± 0.111 | 0.348 ± 0.029 | 3.386 ± 0.083 | 0.002 ± 0.002 |
| Phe | 2.8 ± 0.087 | 0.614 ± 0.038 | 3.512 ± 0.095 | 3.278 ± 0.082 | 2.843 ± 0.094 | 2.953 ± 0.084 | 0.738 ± 0.043 | 4.529 ± 0.142 | 4.016 ± 0.093 | 4.84 ± 0.136 | 0.954 ± 0.042 | 3.294 ± 0.091 | 1.485 ± 0.057 | 1.118 ± 0.05 | 1.243 ± 0.052 | 4.342 ± 0.111 | 2.603 ± 0.085 | 3.199 ± 0.079 | 0.291 ± 0.027 | 2.688 ± 0.093 | 0.0 ± 0.0 |
| Gly | 3.016 ± 0.1 | 0.797 ± 0.042 | 2.973 ± 0.082 | 3.557 ± 0.097 | 2.837 ± 0.075 | 3.418 ± 0.098 | 1.037 ± 0.047 | 5.259 ± 0.119 | 5.153 ± 0.107 | 4.911 ± 0.099 | 1.224 ± 0.051 | 3.368 ± 0.099 | 1.041 ± 0.051 | 1.491 ± 0.053 | 1.806 ± 0.08 | 3.437 ± 0.099 | 3.719 ± 0.117 | 3.738 ± 0.103 | 0.464 ± 0.035 | 3.081 ± 0.085 | 0.0 ± 0.0 |
| His | 0.714 ± 0.048 | 0.189 ± 0.018 | 0.901 ± 0.039 | 0.899 ± 0.051 | 0.911 ± 0.045 | 0.901 ± 0.047 | 0.368 ± 0.03 | 1.314 ± 0.06 | 0.962 ± 0.046 | 1.56 ± 0.065 | 0.281 ± 0.026 | 0.887 ± 0.051 | 0.667 ± 0.042 | 0.482 ± 0.034 | 0.504 ± 0.033 | 0.956 ± 0.05 | 0.655 ± 0.037 | 0.905 ± 0.041 | 0.092 ± 0.014 | 0.828 ± 0.039 | 0.0 ± 0.0 |
| Ile | 5.035 ± 0.106 | 1.279 ± 0.056 | 6.176 ± 0.121 | 6.107 ± 0.124 | 4.673 ± 0.135 | 5.255 ± 0.12 | 1.212 ± 0.05 | 8.338 ± 0.175 | 7.784 ± 0.143 | 8.275 ± 0.161 | 1.773 ± 0.079 | 6.176 ± 0.139 | 3.054 ± 0.081 | 1.932 ± 0.061 | 2.556 ± 0.082 | 7.475 ± 0.153 | 4.691 ± 0.111 | 5.503 ± 0.117 | 0.431 ± 0.035 | 4.541 ± 0.092 | 0.0 ± 0.0 |
| Lys | 4.555 ± 0.102 | 0.754 ± 0.041 | 5.912 ± 0.135 | 7.699 ± 0.152 | 3.148 ± 0.084 | 3.778 ± 0.094 | 1.082 ± 0.047 | 8.549 ± 0.151 | 9.017 ± 0.177 | 6.672 ± 0.146 | 2.459 ± 0.074 | 7.272 ± 0.153 | 1.985 ± 0.078 | 2.068 ± 0.072 | 2.92 ± 0.079 | 4.661 ± 0.107 | 4.752 ± 0.09 | 5.088 ± 0.128 | 0.49 ± 0.033 | 4.541 ± 0.109 | 0.0 ± 0.0 |
| Leu | 5.279 ± 0.105 | 1.035 ± 0.047 | 5.478 ± 0.123 | 5.55 ± 0.135 | 4.921 ± 0.151 | 5.332 ± 0.134 | 1.245 ± 0.055 | 8.163 ± 0.187 | 8.659 ± 0.168 | 8.529 ± 0.178 | 1.865 ± 0.076 | 6.613 ± 0.122 | 3.113 ± 0.08 | 2.149 ± 0.074 | 2.662 ± 0.085 | 7.303 ± 0.13 | 5.261 ± 0.12 | 5.454 ± 0.113 | 0.504 ± 0.038 | 3.786 ± 0.101 | 0.0 ± 0.0 |
| Met | 1.322 ± 0.056 | 0.205 ± 0.02 | 1.192 ± 0.05 | 1.306 ± 0.054 | 0.899 ± 0.049 | 1.043 ± 0.054 | 0.311 ± 0.025 | 1.916 ± 0.067 | 2.351 ± 0.071 | 1.765 ± 0.065 | 0.565 ± 0.032 | 1.523 ± 0.06 | 0.785 ± 0.039 | 0.551 ± 0.034 | 0.756 ± 0.035 | 1.326 ± 0.054 | 1.204 ± 0.047 | 1.238 ± 0.048 | 0.13 ± 0.018 | 0.71 ± 0.039 | 0.0 ± 0.0 |
| Asn | 3.258 ± 0.084 | 0.817 ± 0.049 | 3.689 ± 0.104 | 4.358 ± 0.109 | 3.494 ± 0.099 | 4.037 ± 0.113 | 1.055 ± 0.046 | 6.745 ± 0.158 | 5.885 ± 0.135 | 6.333 ± 0.122 | 1.509 ± 0.061 | 4.929 ± 0.132 | 2.097 ± 0.058 | 1.637 ± 0.059 | 1.763 ± 0.066 | 4.803 ± 0.131 | 3.152 ± 0.093 | 3.821 ± 0.098 | 0.419 ± 0.029 | 4.094 ± 0.103 | 0.0 ± 0.0 |
| Pro | 1.293 ± 0.051 | 0.346 ± 0.031 | 1.552 ± 0.057 | 2.119 ± 0.074 | 1.708 ± 0.057 | 1.436 ± 0.064 | 0.478 ± 0.034 | 2.461 ± 0.077 | 1.901 ± 0.058 | 2.849 ± 0.084 | 0.541 ± 0.036 | 1.659 ± 0.06 | 0.474 ± 0.039 | 0.779 ± 0.046 | 0.732 ± 0.039 | 2.379 ± 0.07 | 1.755 ± 0.066 | 1.832 ± 0.058 | 0.216 ± 0.02 | 1.596 ± 0.055 | 0.0 ± 0.0 |
| Gln | 1.328 ± 0.054 | 0.23 ± 0.023 | 1.44 ± 0.058 | 1.769 ± 0.06 | 1.064 ± 0.045 | 1.342 ± 0.061 | 0.301 ± 0.026 | 2.322 ± 0.077 | 2.548 ± 0.081 | 1.863 ± 0.062 | 0.614 ± 0.035 | 1.962 ± 0.074 | 0.578 ± 0.032 | 0.698 ± 0.045 | 0.785 ± 0.044 | 1.348 ± 0.062 | 1.277 ± 0.046 | 1.373 ± 0.056 | 0.167 ± 0.02 | 1.23 ± 0.054 | 0.0 ± 0.0 |
| Arg | 1.521 ± 0.065 | 0.37 ± 0.031 | 1.417 ± 0.06 | 1.708 ± 0.06 | 1.515 ± 0.061 | 1.55 ± 0.062 | 0.592 ± 0.04 | 2.8 ± 0.078 | 2.875 ± 0.09 | 2.896 ± 0.083 | 0.724 ± 0.038 | 1.82 ± 0.064 | 0.885 ± 0.045 | 0.887 ± 0.039 | 1.32 ± 0.061 | 1.672 ± 0.062 | 1.462 ± 0.057 | 1.806 ± 0.066 | 0.226 ± 0.024 | 1.539 ± 0.058 | 0.0 ± 0.0 |
| Ser | 3.416 ± 0.098 | 0.986 ± 0.041 | 3.549 ± 0.098 | 3.927 ± 0.092 | 4.281 ± 0.102 | 4.138 ± 0.095 | 0.94 ± 0.044 | 6.572 ± 0.128 | 5.464 ± 0.113 | 7.488 ± 0.147 | 1.371 ± 0.054 | 4.626 ± 0.124 | 1.788 ± 0.061 | 1.753 ± 0.065 | 1.993 ± 0.061 | 8.563 ± 0.413 | 4.305 ± 0.141 | 3.597 ± 0.095 | 0.584 ± 0.037 | 3.904 ± 0.113 | 0.0 ± 0.0 |
| Thr | 2.857 ± 0.086 | 0.571 ± 0.04 | 2.873 ± 0.079 | 2.853 ± 0.084 | 3.008 ± 0.079 | 3.237 ± 0.097 | 0.87 ± 0.048 | 5.602 ± 0.131 | 3.88 ± 0.091 | 5.401 ± 0.113 | 0.996 ± 0.046 | 3.266 ± 0.093 | 1.91 ± 0.073 | 1.027 ± 0.044 | 1.507 ± 0.059 | 4.472 ± 0.151 | 3.18 ± 0.136 | 2.88 ± 0.104 | 0.37 ± 0.028 | 2.902 ± 0.095 | 0.0 ± 0.0 |
| Val | 3.4 ± 0.11 | 0.728 ± 0.042 | 3.75 ± 0.106 | 3.978 ± 0.111 | 2.902 ± 0.079 | 3.571 ± 0.095 | 0.781 ± 0.039 | 5.216 ± 0.116 | 4.496 ± 0.094 | 5.582 ± 0.104 | 1.22 ± 0.054 | 3.311 ± 0.097 | 1.812 ± 0.069 | 1.31 ± 0.053 | 1.578 ± 0.065 | 4.472 ± 0.111 | 3.624 ± 0.126 | 3.73 ± 0.111 | 0.303 ± 0.027 | 2.552 ± 0.072 | 0.0 ± 0.0 |
| Trp | 0.348 ± 0.029 | 0.075 ± 0.012 | 0.323 ± 0.027 | 0.307 ± 0.025 | 0.291 ± 0.026 | 0.395 ± 0.032 | 0.128 ± 0.017 | 0.557 ± 0.035 | 0.508 ± 0.035 | 0.604 ± 0.041 | 0.153 ± 0.02 | 0.484 ± 0.035 | 0.14 ± 0.017 | 0.157 ± 0.02 | 0.195 ± 0.022 | 0.449 ± 0.034 | 0.329 ± 0.023 | 0.323 ± 0.023 | 0.094 ± 0.017 | 0.374 ± 0.033 | 0.0 ± 0.0 |
| Tyr | 2.467 ± 0.071 | 0.689 ± 0.045 | 3.248 ± 0.095 | 3.252 ± 0.085 | 2.902 ± 0.095 | 2.924 ± 0.072 | 0.885 ± 0.046 | 4.13 ± 0.098 | 3.921 ± 0.107 | 5.379 ± 0.133 | 0.783 ± 0.042 | 3.361 ± 0.086 | 1.562 ± 0.058 | 1.643 ± 0.06 | 1.598 ± 0.058 | 3.935 ± 0.098 | 2.534 ± 0.086 | 2.778 ± 0.087 | 0.323 ± 0.026 | 2.967 ± 0.091 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.002 ± 0.002 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 1462 proteins (491748 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
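A protein-level bootstrap of this kind could look roughly like the sketch below. The page states only that 100 resamples were drawn at the protein level; everything else here is an assumption made for illustration: that whole proteins are resampled with replacement, that the reported error is the standard deviation across resamples, and that dipeptides are counted in overlapping windows within each protein. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the spread of one dipeptide's per-mille frequency.

    Whole proteins are resampled with replacement (protein-level bootstrap).
    Returns (mean, standard deviation) over the resamples; treating the
    standard deviation as "the error" is an assumption of this sketch.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the proteome.
toy_proteins = ["MKKLLIA", "MSSKK", "MAKKA", "MLLSS"]
mean, err = bootstrap_error(toy_proteins, "KK")
print(f"KK: {mean:.1f} ± {err:.1f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps residues from the same protein together, so a few long, repetitive proteins show up as a larger error for the dipeptides they dominate.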

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski