Amino acid dipeptide frequency for Clostridium bartlettii CAG:1329

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
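As an illustration of what such a calculation might look like, here is a minimal sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein and never across protein boundaries; the page does not state the exact counting procedure. The function name `dipeptide_permille`, the toy sequences, and the use of one-letter codes are assumptions made for this example.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each
    protein; pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # zip pairs every residue with its successor in the same protein
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy proteome in one-letter code (not taken from this page)
toy = ["MKKLA", "AKK"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Because every pair is normalized by the total number of pairs, the frequencies of all observed dipeptides sum to 1000 ‰.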

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
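The arithmetic behind this baseline can be sketched as follows. The sketch assumes that "more than expected" means above the uniform expectation of 1/400 (or 1/441 with Xaa); the page does not spell out the exact threshold used for colouring, and the function names are hypothetical.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"   # would be marked red
    if observed_permille < expected_permille:
        return "under"  # would be marked blue
    return "as expected"

expected = uniform_expectation_permille(20)   # 1000 / 400 = 2.5
print(expected)
print(classify(3.538, expected))  # AlaAla value from the table
print(classify(0.83, expected))   # AlaCys value from the table
```

With 21 symbols the baseline drops to 1000/441, roughly 2.27 ‰, so a few dipeptides close to the threshold could change class depending on which alphabet is assumed.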

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
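A short sketch of the unit conversion, using the AlaAla value from the table as input. The helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala = 3.538  # AlaAla, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```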

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.538 ± 0.086
AlaCys: 0.83 ± 0.036
AlaAsp: 2.752 ± 0.069
AlaGlu: 3.137 ± 0.078
AlaPhe: 2.318 ± 0.06
AlaGly: 3.766 ± 0.084
AlaHis: 0.81 ± 0.03
AlaIle: 5.643 ± 0.088
AlaLys: 5.102 ± 0.101
AlaLeu: 5.581 ± 0.103
AlaMet: 1.801 ± 0.048
AlaAsn: 2.925 ± 0.061
AlaPro: 1.356 ± 0.037
AlaGln: 1.687 ± 0.045
AlaArg: 1.867 ± 0.046
AlaSer: 3.341 ± 0.068
AlaThr: 3.096 ± 0.065
AlaVal: 4.047 ± 0.081
AlaTrp: 0.328 ± 0.02
AlaTyr: 2.175 ± 0.051
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.79 ± 0.034
CysCys: 0.21 ± 0.02
CysAsp: 0.873 ± 0.034
CysGlu: 1.004 ± 0.034
CysPhe: 0.486 ± 0.024
CysGly: 1.309 ± 0.05
CysHis: 0.225 ± 0.018
CysIle: 1.228 ± 0.043
CysLys: 1.158 ± 0.046
CysLeu: 1.014 ± 0.038
CysMet: 0.354 ± 0.023
CysAsn: 0.752 ± 0.032
CysPro: 0.555 ± 0.027
CysGln: 0.334 ± 0.02
CysArg: 0.411 ± 0.023
CysSer: 0.872 ± 0.034
CysThr: 0.647 ± 0.027
CysVal: 0.834 ± 0.032
CysTrp: 0.071 ± 0.009
CysTyr: 0.484 ± 0.026
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.108 ± 0.062
AspCys: 0.721 ± 0.033
AspAsp: 3.577 ± 0.082
AspGlu: 5.481 ± 0.089
AspPhe: 2.939 ± 0.066
AspGly: 3.526 ± 0.072
AspHis: 0.623 ± 0.03
AspIle: 6.72 ± 0.098
AspLys: 6.016 ± 0.096
AspLeu: 5.768 ± 0.086
AspMet: 1.673 ± 0.044
AspAsn: 3.601 ± 0.074
AspPro: 1.327 ± 0.041
AspGln: 0.97 ± 0.034
AspArg: 1.899 ± 0.059
AspSer: 3.251 ± 0.063
AspThr: 2.847 ± 0.064
AspVal: 3.894 ± 0.074
AspTrp: 0.355 ± 0.024
AspTyr: 2.909 ± 0.064
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.974 ± 0.078
GluCys: 0.786 ± 0.029
GluAsp: 4.942 ± 0.103
GluGlu: 6.54 ± 0.109
GluPhe: 3.12 ± 0.064
GluGly: 4.007 ± 0.068
GluHis: 0.926 ± 0.034
GluIle: 7.156 ± 0.108
GluLys: 7.36 ± 0.099
GluLeu: 6.49 ± 0.101
GluMet: 1.967 ± 0.053
GluAsn: 5.643 ± 0.099
GluPro: 1.397 ± 0.049
GluGln: 2.002 ± 0.059
GluArg: 2.324 ± 0.062
GluSer: 3.494 ± 0.062
GluThr: 3.027 ± 0.059
GluVal: 4.939 ± 0.088
GluTrp: 0.334 ± 0.02
GluTyr: 3.17 ± 0.067
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.359 ± 0.054
PheCys: 0.587 ± 0.024
PheAsp: 2.79 ± 0.069
PheGlu: 3.027 ± 0.067
PhePhe: 1.867 ± 0.065
PheGly: 2.851 ± 0.069
PheHis: 0.469 ± 0.022
PheIle: 4.123 ± 0.087
PheLys: 3.617 ± 0.06
PheLeu: 3.812 ± 0.082
PheMet: 1.265 ± 0.039
PheAsn: 2.652 ± 0.064
PhePro: 1.027 ± 0.037
PheGln: 0.764 ± 0.03
PheArg: 1.269 ± 0.039
PheSer: 2.767 ± 0.056
PheThr: 2.244 ± 0.054
PheVal: 2.891 ± 0.065
PheTrp: 0.253 ± 0.017
PheTyr: 1.74 ± 0.052
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.03 ± 0.09
GlyCys: 1.179 ± 0.049
GlyAsp: 3.349 ± 0.061
GlyGlu: 4.016 ± 0.079
GlyPhe: 3.02 ± 0.064
GlyGly: 4.132 ± 0.086
GlyHis: 1.02 ± 0.042
GlyIle: 6.519 ± 0.099
GlyLys: 5.289 ± 0.091
GlyLeu: 5.46 ± 0.08
GlyMet: 1.787 ± 0.047
GlyAsn: 3.145 ± 0.065
GlyPro: 1.208 ± 0.039
GlyGln: 1.599 ± 0.047
GlyArg: 2.08 ± 0.055
GlySer: 3.605 ± 0.075
GlyThr: 3.409 ± 0.076
GlyVal: 4.735 ± 0.078
GlyTrp: 0.426 ± 0.023
GlyTyr: 3.12 ± 0.061
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.736 ± 0.031
HisCys: 0.212 ± 0.019
HisAsp: 0.762 ± 0.031
HisGlu: 0.879 ± 0.03
HisPhe: 0.598 ± 0.025
HisGly: 0.928 ± 0.041
HisHis: 0.253 ± 0.02
HisIle: 1.31 ± 0.039
HisLys: 1.092 ± 0.032
HisLeu: 1.1 ± 0.04
HisMet: 0.355 ± 0.02
HisAsn: 0.81 ± 0.031
HisPro: 0.61 ± 0.027
HisGln: 0.338 ± 0.022
HisArg: 0.493 ± 0.024
HisSer: 0.799 ± 0.034
HisThr: 0.709 ± 0.031
HisVal: 0.758 ± 0.033
HisTrp: 0.112 ± 0.012
HisTyr: 0.544 ± 0.027
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.61 ± 0.103
IleCys: 1.478 ± 0.046
IleAsp: 6.454 ± 0.098
IleGlu: 7.07 ± 0.108
IlePhe: 3.951 ± 0.085
IleGly: 6.143 ± 0.095
IleHis: 1.189 ± 0.04
IleIle: 8.631 ± 0.125
IleLys: 8.81 ± 0.118
IleLeu: 8.847 ± 0.117
IleMet: 2.243 ± 0.058
IleAsn: 6.007 ± 0.107
IlePro: 3.09 ± 0.061
IleGln: 2.34 ± 0.057
IleArg: 2.745 ± 0.058
IleSer: 6.524 ± 0.103
IleThr: 4.601 ± 0.078
IleVal: 6.331 ± 0.103
IleTrp: 0.428 ± 0.025
IleTyr: 3.717 ± 0.08
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.628 ± 0.085
LysCys: 1.047 ± 0.043
LysAsp: 6.221 ± 0.103
LysGlu: 8.404 ± 0.122
LysPhe: 3.333 ± 0.06
LysGly: 4.715 ± 0.085
LysHis: 1.159 ± 0.043
LysIle: 8.507 ± 0.135
LysLys: 8.425 ± 0.117
LysLeu: 7.444 ± 0.096
LysMet: 2.565 ± 0.057
LysAsn: 6.664 ± 0.114
LysPro: 2.043 ± 0.051
LysGln: 2.541 ± 0.056
LysArg: 2.901 ± 0.067
LysSer: 5.399 ± 0.089
LysThr: 4.381 ± 0.077
LysVal: 5.926 ± 0.1
LysTrp: 0.511 ± 0.023
LysTyr: 4.495 ± 0.072
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.222 ± 0.079
LeuCys: 1.208 ± 0.04
LeuAsp: 5.962 ± 0.083
LeuGlu: 6.548 ± 0.1
LeuPhe: 3.695 ± 0.086
LeuGly: 6.127 ± 0.097
LeuHis: 1.102 ± 0.034
LeuIle: 7.58 ± 0.122
LeuLys: 8.189 ± 0.118
LeuLeu: 7.236 ± 0.112
LeuMet: 2.159 ± 0.061
LeuAsn: 6.004 ± 0.089
LeuPro: 2.592 ± 0.056
LeuGln: 2.229 ± 0.049
LeuArg: 2.822 ± 0.061
LeuSer: 6.185 ± 0.11
LeuThr: 4.306 ± 0.079
LeuVal: 5.604 ± 0.086
LeuTrp: 0.468 ± 0.024
LeuTyr: 3.26 ± 0.054
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.854 ± 0.051
MetCys: 0.338 ± 0.018
MetAsp: 1.69 ± 0.046
MetGlu: 1.813 ± 0.043
MetPhe: 1.01 ± 0.037
MetGly: 1.907 ± 0.049
MetHis: 0.368 ± 0.021
MetIle: 2.348 ± 0.057
MetLys: 2.477 ± 0.054
MetLeu: 2.188 ± 0.05
MetMet: 0.785 ± 0.035
MetAsn: 1.661 ± 0.046
MetPro: 0.921 ± 0.034
MetGln: 0.75 ± 0.032
MetArg: 0.897 ± 0.035
MetSer: 1.706 ± 0.047
MetThr: 1.441 ± 0.036
MetVal: 1.637 ± 0.055
MetTrp: 0.155 ± 0.014
MetTyr: 0.873 ± 0.037
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.996 ± 0.068
AsnCys: 0.822 ± 0.041
AsnAsp: 3.225 ± 0.061
AsnGlu: 4.237 ± 0.082
AsnPhe: 2.509 ± 0.06
AsnGly: 3.457 ± 0.078
AsnHis: 0.824 ± 0.034
AsnIle: 6.928 ± 0.108
AsnLys: 6.373 ± 0.11
AsnLeu: 5.878 ± 0.091
AsnMet: 1.792 ± 0.044
AsnAsn: 4.461 ± 0.111
AsnPro: 2.19 ± 0.065
AsnGln: 1.79 ± 0.048
AsnArg: 1.904 ± 0.046
AsnSer: 3.741 ± 0.08
AsnThr: 3.191 ± 0.074
AsnVal: 3.563 ± 0.067
AsnTrp: 0.412 ± 0.02
AsnTyr: 2.9 ± 0.066
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.408 ± 0.046
ProCys: 0.341 ± 0.023
ProAsp: 1.434 ± 0.048
ProGlu: 2.038 ± 0.061
ProPhe: 1.197 ± 0.035
ProGly: 1.749 ± 0.054
ProHis: 0.465 ± 0.027
ProIle: 2.581 ± 0.059
ProLys: 2.231 ± 0.051
ProLeu: 2.203 ± 0.053
ProMet: 0.708 ± 0.028
ProAsn: 1.581 ± 0.051
ProPro: 0.497 ± 0.029
ProGln: 0.864 ± 0.028
ProArg: 0.789 ± 0.03
ProSer: 1.68 ± 0.046
ProThr: 1.619 ± 0.043
ProVal: 2.047 ± 0.057
ProTrp: 0.199 ± 0.016
ProTyr: 1.214 ± 0.039
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.48 ± 0.044
GlnCys: 0.27 ± 0.022
GlnAsp: 1.497 ± 0.04
GlnGlu: 1.911 ± 0.049
GlnPhe: 0.943 ± 0.032
GlnGly: 1.511 ± 0.046
GlnHis: 0.341 ± 0.02
GlnIle: 2.525 ± 0.056
GlnLys: 2.441 ± 0.058
GlnLeu: 2.086 ± 0.051
GlnMet: 0.743 ± 0.029
GlnAsn: 1.938 ± 0.053
GlnPro: 0.615 ± 0.029
GlnGln: 0.818 ± 0.035
GlnArg: 0.967 ± 0.039
GlnSer: 1.42 ± 0.045
GlnThr: 1.229 ± 0.041
GlnVal: 1.646 ± 0.04
GlnTrp: 0.143 ± 0.013
GlnTyr: 1.146 ± 0.039
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.793 ± 0.048
ArgCys: 0.447 ± 0.026
ArgAsp: 1.989 ± 0.051
ArgGlu: 2.653 ± 0.059
ArgPhe: 1.371 ± 0.042
ArgGly: 1.947 ± 0.053
ArgHis: 0.468 ± 0.027
ArgIle: 2.796 ± 0.051
ArgLys: 2.876 ± 0.065
ArgLeu: 2.736 ± 0.059
ArgMet: 0.892 ± 0.034
ArgAsn: 1.921 ± 0.051
ArgPro: 0.835 ± 0.03
ArgGln: 0.959 ± 0.037
ArgArg: 1.345 ± 0.04
ArgSer: 1.435 ± 0.045
ArgThr: 1.515 ± 0.039
ArgVal: 2.209 ± 0.056
ArgTrp: 0.189 ± 0.015
ArgTyr: 1.397 ± 0.041
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.097 ± 0.063
SerCys: 0.777 ± 0.035
SerAsp: 3.469 ± 0.067
SerGlu: 3.855 ± 0.069
SerPhe: 2.722 ± 0.056
SerGly: 3.957 ± 0.069
SerHis: 0.903 ± 0.035
SerIle: 6.067 ± 0.097
SerLys: 5.768 ± 0.089
SerLeu: 5.535 ± 0.082
SerMet: 1.622 ± 0.045
SerAsn: 3.806 ± 0.085
SerPro: 1.494 ± 0.047
SerGln: 1.761 ± 0.047
SerArg: 1.979 ± 0.048
SerSer: 4.18 ± 0.106
SerThr: 3.225 ± 0.071
SerVal: 3.761 ± 0.079
SerTrp: 0.403 ± 0.025
SerTyr: 2.739 ± 0.069
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.833 ± 0.068
ThrCys: 0.634 ± 0.029
ThrAsp: 2.66 ± 0.063
ThrGlu: 2.901 ± 0.063
ThrPhe: 2.126 ± 0.055
ThrGly: 3.629 ± 0.077
ThrHis: 0.773 ± 0.029
ThrIle: 4.777 ± 0.076
ThrLys: 4.124 ± 0.08
ThrLeu: 4.72 ± 0.081
ThrMet: 1.232 ± 0.043
ThrAsn: 2.798 ± 0.071
ThrPro: 1.775 ± 0.051
ThrGln: 1.326 ± 0.04
ThrArg: 1.489 ± 0.044
ThrSer: 3.391 ± 0.09
ThrThr: 2.91 ± 0.075
ThrVal: 3.64 ± 0.075
ThrTrp: 0.328 ± 0.021
ThrTyr: 2.167 ± 0.051
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.227 ± 0.079
ValCys: 1.047 ± 0.042
ValAsp: 4.36 ± 0.072
ValGlu: 4.727 ± 0.088
ValPhe: 2.925 ± 0.059
ValGly: 4.578 ± 0.087
ValHis: 0.829 ± 0.031
ValIle: 5.868 ± 0.085
ValLys: 5.452 ± 0.087
ValLeu: 6.115 ± 0.087
ValMet: 1.649 ± 0.042
ValAsn: 3.589 ± 0.068
ValPro: 1.925 ± 0.054
ValGln: 1.469 ± 0.046
ValArg: 2.036 ± 0.052
ValSer: 4.343 ± 0.083
ValThr: 3.285 ± 0.069
ValVal: 4.954 ± 0.09
ValTrp: 0.356 ± 0.018
ValTyr: 2.536 ± 0.061
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.338 ± 0.024
TrpCys: 0.074 ± 0.008
TrpAsp: 0.386 ± 0.02
TrpGlu: 0.353 ± 0.021
TrpPhe: 0.276 ± 0.021
TrpGly: 0.404 ± 0.023
TrpHis: 0.1 ± 0.012
TrpIle: 0.594 ± 0.028
TrpLys: 0.41 ± 0.022
TrpLeu: 0.481 ± 0.023
TrpMet: 0.183 ± 0.014
TrpAsn: 0.338 ± 0.02
TrpPro: 0.12 ± 0.012
TrpGln: 0.18 ± 0.016
TrpArg: 0.187 ± 0.016
TrpSer: 0.33 ± 0.021
TrpThr: 0.294 ± 0.022
TrpVal: 0.413 ± 0.027
TrpTrp: 0.063 ± 0.008
TrpTyr: 0.28 ± 0.022
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.079 ± 0.047
TyrCys: 0.602 ± 0.027
TyrAsp: 2.81 ± 0.061
TyrGlu: 3.004 ± 0.074
TyrPhe: 1.966 ± 0.048
TyrGly: 2.485 ± 0.056
TyrHis: 0.553 ± 0.029
TyrIle: 4.31 ± 0.089
TyrLys: 4.099 ± 0.075
TyrLeu: 3.806 ± 0.063
TyrMet: 1.045 ± 0.038
TyrAsn: 2.896 ± 0.069
TyrPro: 1.266 ± 0.047
TyrGln: 0.952 ± 0.037
TyrArg: 1.372 ± 0.043
TyrSer: 2.654 ± 0.057
TyrThr: 2.254 ± 0.049
TyrVal: 2.441 ± 0.066
TyrTrp: 0.273 ± 0.019
TyrTyr: 1.981 ± 0.062
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2686 proteins (839455 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
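A protein-level bootstrap of this kind could be sketched as below. This is not the database's actual code: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates. The function names, the toy sequences, and the fixed seed are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the per-mille error of one dipeptide frequency.

    Assumption: proteins (not residues) are the resampling unit, and
    the error is the standard deviation over the bootstrap replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # draw as many proteins as in the original set, with replacement
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)

# Hypothetical toy proteome in one-letter code
toy = ["MKKLA", "AKK", "GGKKA", "MLLAG"]
print(bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps the pairs from one protein together, so the estimate reflects variation between proteins.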

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski