Amino acid dipepetide frequency for Alistipes putredinis CAG:67

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.384AlaAla: 9.384 ± 0.138
1.044AlaCys: 1.044 ± 0.046
5.414AlaAsp: 5.414 ± 0.104
6.374AlaGlu: 6.374 ± 0.135
3.293AlaPhe: 3.293 ± 0.073
6.859AlaGly: 6.859 ± 0.116
1.37AlaHis: 1.37 ± 0.046
4.957AlaIle: 4.957 ± 0.107
3.971AlaLys: 3.971 ± 0.105
8.174AlaLeu: 8.174 ± 0.121
2.164AlaMet: 2.164 ± 0.067
2.525AlaAsn: 2.525 ± 0.069
3.195AlaPro: 3.195 ± 0.074
2.978AlaGln: 2.978 ± 0.068
5.219AlaArg: 5.219 ± 0.119
4.674AlaSer: 4.674 ± 0.098
4.411AlaThr: 4.411 ± 0.105
6.678AlaVal: 6.678 ± 0.126
0.852AlaTrp: 0.852 ± 0.039
2.72AlaTyr: 2.72 ± 0.074
0.0AlaXaa: 0.0 ± 0.0
Cys
0.955CysAla: 0.955 ± 0.041
0.263CysCys: 0.263 ± 0.022
0.721CysAsp: 0.721 ± 0.035
0.703CysGlu: 0.703 ± 0.034
0.533CysPhe: 0.533 ± 0.029
1.274CysGly: 1.274 ± 0.054
0.27CysHis: 0.27 ± 0.021
0.827CysIle: 0.827 ± 0.038
0.558CysLys: 0.558 ± 0.033
1.013CysLeu: 1.013 ± 0.039
0.266CysMet: 0.266 ± 0.022
0.452CysAsn: 0.452 ± 0.028
0.63CysPro: 0.63 ± 0.039
0.295CysGln: 0.295 ± 0.021
1.028CysArg: 1.028 ± 0.048
0.86CysSer: 0.86 ± 0.034
0.761CysThr: 0.761 ± 0.038
0.774CysVal: 0.774 ± 0.041
0.136CysTrp: 0.136 ± 0.017
0.47CysTyr: 0.47 ± 0.032
0.0CysXaa: 0.0 ± 0.0
Asp
4.555AspAla: 4.555 ± 0.093
0.723AspCys: 0.723 ± 0.04
2.925AspAsp: 2.925 ± 0.077
3.966AspGlu: 3.966 ± 0.088
2.897AspPhe: 2.897 ± 0.066
4.18AspGly: 4.18 ± 0.087
0.963AspHis: 0.963 ± 0.05
3.582AspIle: 3.582 ± 0.083
3.104AspLys: 3.104 ± 0.082
4.901AspLeu: 4.901 ± 0.108
1.377AspMet: 1.377 ± 0.046
2.158AspAsn: 2.158 ± 0.059
2.558AspPro: 2.558 ± 0.067
1.329AspGln: 1.329 ± 0.048
3.746AspArg: 3.746 ± 0.099
3.018AspSer: 3.018 ± 0.085
2.973AspThr: 2.973 ± 0.08
3.577AspVal: 3.577 ± 0.08
0.73AspTrp: 0.73 ± 0.034
2.636AspTyr: 2.636 ± 0.068
0.0AspXaa: 0.0 ± 0.0
Glu
5.96GluAla: 5.96 ± 0.119
0.695GluCys: 0.695 ± 0.038
2.879GluAsp: 2.879 ± 0.073
5.159GluGlu: 5.159 ± 0.1
2.666GluPhe: 2.666 ± 0.064
4.648GluGly: 4.648 ± 0.095
1.276GluHis: 1.276 ± 0.049
4.759GluIle: 4.759 ± 0.11
4.393GluLys: 4.393 ± 0.098
6.342GluLeu: 6.342 ± 0.112
1.991GluMet: 1.991 ± 0.06
3.005GluAsn: 3.005 ± 0.075
2.265GluPro: 2.265 ± 0.066
2.737GluGln: 2.737 ± 0.067
4.486GluArg: 4.486 ± 0.103
3.283GluSer: 3.283 ± 0.083
3.205GluThr: 3.205 ± 0.072
4.56GluVal: 4.56 ± 0.097
0.948GluTrp: 0.948 ± 0.04
2.614GluTyr: 2.614 ± 0.066
0.0GluXaa: 0.0 ± 0.0
Phe
3.662PheAla: 3.662 ± 0.076
0.629PheCys: 0.629 ± 0.03
3.053PheAsp: 3.053 ± 0.074
2.591PheGlu: 2.591 ± 0.066
2.027PhePhe: 2.027 ± 0.067
3.678PheGly: 3.678 ± 0.083
0.726PheHis: 0.726 ± 0.037
2.548PheIle: 2.548 ± 0.068
1.696PheLys: 1.696 ± 0.054
3.622PheLeu: 3.622 ± 0.086
1.049PheMet: 1.049 ± 0.046
1.716PheAsn: 1.716 ± 0.062
1.666PhePro: 1.666 ± 0.052
1.079PheGln: 1.079 ± 0.039
2.783PheArg: 2.783 ± 0.074
2.962PheSer: 2.962 ± 0.075
2.681PheThr: 2.681 ± 0.07
2.929PheVal: 2.929 ± 0.075
0.511PheTrp: 0.511 ± 0.029
1.678PheTyr: 1.678 ± 0.057
0.002PheXaa: 0.002 ± 0.001
Gly
5.747GlyAla: 5.747 ± 0.11
1.112GlyCys: 1.112 ± 0.048
3.88GlyAsp: 3.88 ± 0.083
4.842GlyGlu: 4.842 ± 0.089
3.38GlyPhe: 3.38 ± 0.078
5.588GlyGly: 5.588 ± 0.112
1.493GlyHis: 1.493 ± 0.054
5.45GlyIle: 5.45 ± 0.109
4.188GlyLys: 4.188 ± 0.089
6.179GlyLeu: 6.179 ± 0.122
2.143GlyMet: 2.143 ± 0.069
2.811GlyAsn: 2.811 ± 0.069
1.671GlyPro: 1.671 ± 0.054
2.23GlyGln: 2.23 ± 0.057
4.689GlyArg: 4.689 ± 0.1
4.153GlySer: 4.153 ± 0.086
4.213GlyThr: 4.213 ± 0.084
5.56GlyVal: 5.56 ± 0.102
0.912GlyTrp: 0.912 ± 0.042
3.187GlyTyr: 3.187 ± 0.083
0.005GlyXaa: 0.005 ± 0.003
His
1.324HisAla: 1.324 ± 0.049
0.278HisCys: 0.278 ± 0.024
0.998HisAsp: 0.998 ± 0.045
0.989HisGlu: 0.989 ± 0.039
0.872HisPhe: 0.872 ± 0.035
1.264HisGly: 1.264 ± 0.048
0.424HisHis: 0.424 ± 0.031
1.183HisIle: 1.183 ± 0.051
0.917HisLys: 0.917 ± 0.045
1.709HisLeu: 1.709 ± 0.057
0.369HisMet: 0.369 ± 0.024
0.803HisAsn: 0.803 ± 0.033
1.092HisPro: 1.092 ± 0.047
0.452HisGln: 0.452 ± 0.028
1.279HisArg: 1.279 ± 0.053
1.074HisSer: 1.074 ± 0.049
1.13HisThr: 1.13 ± 0.043
1.023HisVal: 1.023 ± 0.041
0.182HisTrp: 0.182 ± 0.019
0.781HisTyr: 0.781 ± 0.036
0.0HisXaa: 0.0 ± 0.0
Ile
6.24IleAla: 6.24 ± 0.115
0.852IleCys: 0.852 ± 0.043
4.527IleAsp: 4.527 ± 0.092
4.588IleGlu: 4.588 ± 0.095
2.581IlePhe: 2.581 ± 0.076
4.842IleGly: 4.842 ± 0.096
1.168IleHis: 1.168 ± 0.044
3.447IleIle: 3.447 ± 0.089
2.654IleLys: 2.654 ± 0.08
5.608IleLeu: 5.608 ± 0.095
1.266IleMet: 1.266 ± 0.05
2.386IleAsn: 2.386 ± 0.075
3.227IlePro: 3.227 ± 0.078
1.762IleGln: 1.762 ± 0.055
4.246IleArg: 4.246 ± 0.1
3.738IleSer: 3.738 ± 0.085
3.556IleThr: 3.556 ± 0.072
4.972IleVal: 4.972 ± 0.099
0.551IleTrp: 0.551 ± 0.03
2.358IleTyr: 2.358 ± 0.068
0.0IleXaa: 0.0 ± 0.0
Lys
4.281LysAla: 4.281 ± 0.092
0.465LysCys: 0.465 ± 0.031
2.426LysAsp: 2.426 ± 0.075
3.73LysGlu: 3.73 ± 0.097
1.934LysPhe: 1.934 ± 0.064
3.519LysGly: 3.519 ± 0.084
0.875LysHis: 0.875 ± 0.04
3.869LysIle: 3.869 ± 0.096
3.225LysLys: 3.225 ± 0.09
4.396LysLeu: 4.396 ± 0.101
1.875LysMet: 1.875 ± 0.057
2.381LysAsn: 2.381 ± 0.068
1.986LysPro: 1.986 ± 0.063
1.855LysGln: 1.855 ± 0.062
2.952LysArg: 2.952 ± 0.073
2.69LysSer: 2.69 ± 0.074
3.011LysThr: 3.011 ± 0.074
3.384LysVal: 3.384 ± 0.077
0.544LysTrp: 0.544 ± 0.027
2.159LysTyr: 2.159 ± 0.057
0.0LysXaa: 0.0 ± 0.0
Leu
8.098LeuAla: 8.098 ± 0.136
1.43LeuCys: 1.43 ± 0.047
4.985LeuAsp: 4.985 ± 0.095
5.384LeuGlu: 5.384 ± 0.107
4.251LeuPhe: 4.251 ± 0.091
6.407LeuGly: 6.407 ± 0.126
1.865LeuHis: 1.865 ± 0.064
5.204LeuIle: 5.204 ± 0.093
4.815LeuLys: 4.815 ± 0.092
9.408LeuLeu: 9.408 ± 0.191
2.254LeuMet: 2.254 ± 0.062
3.374LeuAsn: 3.374 ± 0.081
4.297LeuPro: 4.297 ± 0.083
3.079LeuGln: 3.079 ± 0.068
6.104LeuArg: 6.104 ± 0.119
6.064LeuSer: 6.064 ± 0.127
5.331LeuThr: 5.331 ± 0.089
5.748LeuVal: 5.748 ± 0.103
1.011LeuTrp: 1.011 ± 0.043
3.213LeuTyr: 3.213 ± 0.078
0.0LeuXaa: 0.0 ± 0.0
Met
2.229MetAla: 2.229 ± 0.063
0.23MetCys: 0.23 ± 0.019
1.236MetAsp: 1.236 ± 0.046
1.676MetGlu: 1.676 ± 0.063
0.953MetPhe: 0.953 ± 0.041
1.714MetGly: 1.714 ± 0.061
0.414MetHis: 0.414 ± 0.026
1.605MetIle: 1.605 ± 0.058
1.941MetLys: 1.941 ± 0.053
2.527MetLeu: 2.527 ± 0.069
0.675MetMet: 0.675 ± 0.035
1.14MetAsn: 1.14 ± 0.042
1.281MetPro: 1.281 ± 0.047
1.023MetGln: 1.023 ± 0.041
1.56MetArg: 1.56 ± 0.05
1.62MetSer: 1.62 ± 0.054
1.623MetThr: 1.623 ± 0.055
1.463MetVal: 1.463 ± 0.048
0.268MetTrp: 0.268 ± 0.022
0.697MetTyr: 0.697 ± 0.033
0.0MetXaa: 0.0 ± 0.0
Asn
2.977AsnAla: 2.977 ± 0.079
0.486AsnCys: 0.486 ± 0.032
2.09AsnAsp: 2.09 ± 0.061
2.103AsnGlu: 2.103 ± 0.055
1.666AsnPhe: 1.666 ± 0.054
3.015AsnGly: 3.015 ± 0.072
0.685AsnHis: 0.685 ± 0.038
2.757AsnIle: 2.757 ± 0.068
2.02AsnLys: 2.02 ± 0.068
3.577AsnLeu: 3.577 ± 0.096
1.023AsnMet: 1.023 ± 0.046
1.673AsnAsn: 1.673 ± 0.054
2.287AsnPro: 2.287 ± 0.063
1.057AsnGln: 1.057 ± 0.045
2.417AsnArg: 2.417 ± 0.065
1.966AsnSer: 1.966 ± 0.052
2.126AsnThr: 2.126 ± 0.062
2.348AsnVal: 2.348 ± 0.066
0.425AsnTrp: 0.425 ± 0.027
1.694AsnTyr: 1.694 ± 0.056
0.0AsnXaa: 0.0 ± 0.0
Pro
3.918ProAla: 3.918 ± 0.077
0.516ProCys: 0.516 ± 0.033
2.94ProAsp: 2.94 ± 0.075
3.998ProGlu: 3.998 ± 0.098
1.739ProPhe: 1.739 ± 0.054
2.813ProGly: 2.813 ± 0.07
0.748ProHis: 0.748 ± 0.039
2.254ProIle: 2.254 ± 0.064
1.843ProLys: 1.843 ± 0.056
3.553ProLeu: 3.553 ± 0.079
0.98ProMet: 0.98 ± 0.037
1.36ProAsn: 1.36 ± 0.055
1.133ProPro: 1.133 ± 0.041
1.572ProGln: 1.572 ± 0.05
2.04ProArg: 2.04 ± 0.073
2.426ProSer: 2.426 ± 0.063
2.214ProThr: 2.214 ± 0.058
3.428ProVal: 3.428 ± 0.08
0.377ProTrp: 0.377 ± 0.023
1.628ProTyr: 1.628 ± 0.055
0.002ProXaa: 0.002 ± 0.002
Gln
2.735GlnAla: 2.735 ± 0.067
0.261GlnCys: 0.261 ± 0.022
1.362GlnAsp: 1.362 ± 0.045
2.068GlnGlu: 2.068 ± 0.071
1.224GlnPhe: 1.224 ± 0.045
2.055GlnGly: 2.055 ± 0.057
0.604GlnHis: 0.604 ± 0.031
2.335GlnIle: 2.335 ± 0.072
1.747GlnLys: 1.747 ± 0.056
2.929GlnLeu: 2.929 ± 0.073
1.071GlnMet: 1.071 ± 0.039
1.395GlnAsn: 1.395 ± 0.047
1.33GlnPro: 1.33 ± 0.047
1.415GlnGln: 1.415 ± 0.053
2.09GlnArg: 2.09 ± 0.06
1.754GlnSer: 1.754 ± 0.054
2.042GlnThr: 2.042 ± 0.059
2.153GlnVal: 2.153 ± 0.069
0.361GlnTrp: 0.361 ± 0.024
1.276GlnTyr: 1.276 ± 0.052
0.0GlnXaa: 0.0 ± 0.0
Arg
4.239ArgAla: 4.239 ± 0.09
0.687ArgCys: 0.687 ± 0.034
2.962ArgAsp: 2.962 ± 0.068
4.699ArgGlu: 4.699 ± 0.106
2.753ArgPhe: 2.753 ± 0.069
3.528ArgGly: 3.528 ± 0.082
1.35ArgHis: 1.35 ± 0.049
5.02ArgIle: 5.02 ± 0.109
3.548ArgLys: 3.548 ± 0.087
5.955ArgLeu: 5.955 ± 0.122
1.996ArgMet: 1.996 ± 0.066
2.745ArgAsn: 2.745 ± 0.073
2.469ArgPro: 2.469 ± 0.06
2.484ArgGln: 2.484 ± 0.074
4.777ArgArg: 4.777 ± 0.123
3.278ArgSer: 3.278 ± 0.082
3.606ArgThr: 3.606 ± 0.084
3.534ArgVal: 3.534 ± 0.089
0.779ArgTrp: 0.779 ± 0.037
2.772ArgTyr: 2.772 ± 0.064
0.0ArgXaa: 0.0 ± 0.0
Ser
4.658SerAla: 4.658 ± 0.083
0.712SerCys: 0.712 ± 0.037
3.395SerAsp: 3.395 ± 0.085
3.505SerGlu: 3.505 ± 0.083
2.763SerPhe: 2.763 ± 0.082
4.893SerGly: 4.893 ± 0.104
0.97SerHis: 0.97 ± 0.043
3.503SerIle: 3.503 ± 0.075
2.712SerLys: 2.712 ± 0.058
5.54SerLeu: 5.54 ± 0.115
1.355SerMet: 1.355 ± 0.046
2.014SerAsn: 2.014 ± 0.063
2.421SerPro: 2.421 ± 0.063
1.761SerGln: 1.761 ± 0.051
3.246SerArg: 3.246 ± 0.079
3.389SerSer: 3.389 ± 0.079
2.815SerThr: 2.815 ± 0.078
4.34SerVal: 4.34 ± 0.103
0.619SerTrp: 0.619 ± 0.032
2.361SerTyr: 2.361 ± 0.071
0.0SerXaa: 0.0 ± 0.0
Thr
5.057ThrAla: 5.057 ± 0.118
0.551ThrCys: 0.551 ± 0.032
3.313ThrAsp: 3.313 ± 0.078
3.531ThrGlu: 3.531 ± 0.075
2.494ThrPhe: 2.494 ± 0.075
4.486ThrGly: 4.486 ± 0.087
0.953ThrHis: 0.953 ± 0.04
3.751ThrIle: 3.751 ± 0.085
2.217ThrLys: 2.217 ± 0.069
5.79ThrLeu: 5.79 ± 0.091
1.181ThrMet: 1.181 ± 0.053
1.766ThrAsn: 1.766 ± 0.054
3.268ThrPro: 3.268 ± 0.075
1.666ThrGln: 1.666 ± 0.051
2.82ThrArg: 2.82 ± 0.07
2.982ThrSer: 2.982 ± 0.074
3.324ThrThr: 3.324 ± 0.088
4.461ThrVal: 4.461 ± 0.087
0.541ThrTrp: 0.541 ± 0.035
2.029ThrTyr: 2.029 ± 0.065
0.0ThrXaa: 0.0 ± 0.0
Val
6.356ValAla: 6.356 ± 0.12
1.084ValCys: 1.084 ± 0.052
3.738ValAsp: 3.738 ± 0.094
4.83ValGlu: 4.83 ± 0.1
2.863ValPhe: 2.863 ± 0.064
4.916ValGly: 4.916 ± 0.114
1.117ValHis: 1.117 ± 0.049
4.398ValIle: 4.398 ± 0.096
3.652ValLys: 3.652 ± 0.08
6.385ValLeu: 6.385 ± 0.115
1.633ValMet: 1.633 ± 0.064
2.507ValAsn: 2.507 ± 0.068
2.805ValPro: 2.805 ± 0.063
1.929ValGln: 1.929 ± 0.055
4.345ValArg: 4.345 ± 0.09
4.233ValSer: 4.233 ± 0.086
4.147ValThr: 4.147 ± 0.101
5.677ValVal: 5.677 ± 0.109
0.826ValTrp: 0.826 ± 0.038
2.287ValTyr: 2.287 ± 0.065
0.002ValXaa: 0.002 ± 0.002
Trp
0.793TrpAla: 0.793 ± 0.038
0.19TrpCys: 0.19 ± 0.016
0.548TrpAsp: 0.548 ± 0.032
0.675TrpGlu: 0.675 ± 0.035
0.566TrpPhe: 0.566 ± 0.034
0.837TrpGly: 0.837 ± 0.038
0.227TrpHis: 0.227 ± 0.019
0.812TrpIle: 0.812 ± 0.04
0.531TrpLys: 0.531 ± 0.035
1.104TrpLeu: 1.104 ± 0.043
0.341TrpMet: 0.341 ± 0.021
0.5TrpAsn: 0.5 ± 0.029
0.238TrpPro: 0.238 ± 0.021
0.468TrpGln: 0.468 ± 0.03
0.703TrpArg: 0.703 ± 0.036
0.677TrpSer: 0.677 ± 0.036
0.612TrpThr: 0.612 ± 0.036
0.733TrpVal: 0.733 ± 0.034
0.192TrpTrp: 0.192 ± 0.017
0.417TrpTyr: 0.417 ± 0.027
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.093TyrAla: 3.093 ± 0.074
0.581TyrCys: 0.581 ± 0.034
2.507TyrAsp: 2.507 ± 0.062
2.459TyrGlu: 2.459 ± 0.057
1.832TyrPhe: 1.832 ± 0.054
2.917TyrGly: 2.917 ± 0.071
0.645TyrHis: 0.645 ± 0.033
2.308TyrIle: 2.308 ± 0.062
1.825TyrLys: 1.825 ± 0.058
3.612TyrLeu: 3.612 ± 0.079
0.865TyrMet: 0.865 ± 0.043
1.684TyrAsn: 1.684 ± 0.061
1.66TyrPro: 1.66 ± 0.061
1.036TyrGln: 1.036 ± 0.043
2.647TyrArg: 2.647 ± 0.07
2.126TyrSer: 2.126 ± 0.06
2.398TyrThr: 2.398 ± 0.067
2.383TyrVal: 2.383 ± 0.066
0.399TyrTrp: 0.399 ± 0.027
1.701TyrTyr: 1.701 ± 0.066
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.003XaaPro: 0.003 ± 0.002
0.0XaaGln: 0.0 ± 0.0
0.002XaaArg: 0.002 ± 0.002
0.003XaaSer: 0.003 ± 0.002
0.0XaaThr: 0.0 ± 0.0
0.002XaaVal: 0.002 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.008XaaXaa: 0.008 ± 0.004
Statistics based on 1913 proteins (604354 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski