Amino acid dipeptide frequency for Peptostreptococcus anaerobius CAG:621

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
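The counting described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the page does not say how the counts were produced, so the sketch assumes overlapping pairs counted within each protein (never across protein boundaries), one-letter sequence input, and every non-standard letter mapped to X (Xaa). The function name `dipeptide_frequencies` and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the standard alphabet becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs are taken within one protein only, never across two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()} if total else {}


toy = ["MKKLLA", "MKLA"]  # hypothetical sequences, not taken from this proteome
freqs = dipeptide_frequencies(toy)
uniform = 1 / len(STANDARD) ** 2  # 1/400, the expectation under uniform usage

for pair, f in sorted(freqs.items()):
    print(pair, f"{f:.3f}", "above" if f > uniform else "at or below", "uniform expectation")
```

On a real proteome the input list would hold one sequence per protein, for example as parsed from a FASTA file.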

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
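As a small worked example of the unit conversion and of the comparison with the uniform expectation (1/400 = 2.5‰), consider the sketch below. The three values are copied from the table; the helper names and the simple "above or below 2.5‰" rule are assumptions made for illustration, since the page does not state the exact threshold used for the red and blue marking.

```python
EXPECTED_PER_MILLE = 1000 / 400  # uniform expectation: 2.5 per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values taken from the table for this proteome (in per mille).
sample = {"LysLys": 8.401, "AlaAla": 4.057, "TrpTrp": 0.055}

for name, value in sample.items():
    print(name, per_mille_to_fraction(value), classify(value))
```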

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.057 ± 0.109 | 0.829 ± 0.044 | 3.625 ± 0.117 | 3.382 ± 0.09 | 2.277 ± 0.074 | 4.585 ± 0.102 | 0.885 ± 0.04 | 5.894 ± 0.123 | 5.335 ± 0.122 | 5.276 ± 0.113 | 2.027 ± 0.065 | 3.02 ± 0.089 | 1.503 ± 0.061 | 1.688 ± 0.069 | 2.836 ± 0.076 | 3.833 ± 0.099 | 3.079 ± 0.079 | 4.314 ± 0.096 | 0.375 ± 0.031 | 2.35 ± 0.07 | 0.0 ± 0.0
Cys | 0.6 ± 0.037 | 0.169 ± 0.023 | 0.704 ± 0.036 | 0.651 ± 0.033 | 0.408 ± 0.03 | 1.081 ± 0.048 | 0.254 ± 0.023 | 0.863 ± 0.044 | 0.783 ± 0.045 | 0.947 ± 0.049 | 0.351 ± 0.027 | 0.432 ± 0.034 | 0.491 ± 0.036 | 0.302 ± 0.023 | 0.383 ± 0.028 | 0.789 ± 0.039 | 0.469 ± 0.031 | 0.682 ± 0.038 | 0.075 ± 0.012 | 0.309 ± 0.025 | 0.0 ± 0.0
Asp | 3.448 ± 0.098 | 0.664 ± 0.034 | 3.731 ± 0.087 | 4.85 ± 0.12 | 3.058 ± 0.084 | 4.09 ± 0.122 | 0.934 ± 0.037 | 6.747 ± 0.135 | 6.565 ± 0.133 | 6.367 ± 0.138 | 2.321 ± 0.075 | 2.946 ± 0.078 | 1.857 ± 0.077 | 1.8 ± 0.062 | 3.062 ± 0.078 | 3.976 ± 0.092 | 3.047 ± 0.086 | 4.064 ± 0.084 | 0.388 ± 0.028 | 3.369 ± 0.078 | 0.0 ± 0.0
Glu | 4.405 ± 0.105 | 0.574 ± 0.038 | 5.32 ± 0.118 | 6.251 ± 0.11 | 2.935 ± 0.073 | 4.108 ± 0.097 | 0.826 ± 0.038 | 6.172 ± 0.116 | 6.878 ± 0.116 | 5.97 ± 0.114 | 2.166 ± 0.07 | 4.353 ± 0.099 | 1.445 ± 0.052 | 1.372 ± 0.058 | 2.83 ± 0.08 | 3.779 ± 0.089 | 2.744 ± 0.073 | 4.826 ± 0.119 | 0.351 ± 0.025 | 2.876 ± 0.071 | 0.0 ± 0.0
Phe | 2.42 ± 0.077 | 0.498 ± 0.031 | 3.045 ± 0.085 | 2.904 ± 0.087 | 1.64 ± 0.071 | 2.737 ± 0.089 | 0.423 ± 0.03 | 3.507 ± 0.086 | 3.202 ± 0.071 | 3.325 ± 0.086 | 1.273 ± 0.055 | 2.056 ± 0.069 | 1.057 ± 0.044 | 0.693 ± 0.039 | 1.414 ± 0.052 | 2.819 ± 0.087 | 2.188 ± 0.071 | 2.97 ± 0.077 | 0.296 ± 0.027 | 1.65 ± 0.056 | 0.0 ± 0.0
Gly | 4.364 ± 0.104 | 0.91 ± 0.056 | 3.807 ± 0.1 | 4.018 ± 0.092 | 2.922 ± 0.087 | 4.677 ± 0.112 | 1.241 ± 0.049 | 6.413 ± 0.127 | 5.662 ± 0.135 | 6.462 ± 0.127 | 2.148 ± 0.073 | 2.867 ± 0.083 | 1.596 ± 0.058 | 2.144 ± 0.066 | 2.74 ± 0.085 | 3.985 ± 0.101 | 3.209 ± 0.101 | 5.078 ± 0.099 | 0.491 ± 0.035 | 3.194 ± 0.107 | 0.0 ± 0.0
His | 0.774 ± 0.034 | 0.158 ± 0.017 | 0.853 ± 0.043 | 0.89 ± 0.046 | 0.552 ± 0.028 | 0.982 ± 0.043 | 0.32 ± 0.03 | 1.401 ± 0.05 | 1.061 ± 0.042 | 1.177 ± 0.045 | 0.456 ± 0.029 | 0.644 ± 0.032 | 0.614 ± 0.033 | 0.362 ± 0.03 | 0.64 ± 0.035 | 0.903 ± 0.046 | 0.789 ± 0.035 | 0.813 ± 0.036 | 0.074 ± 0.013 | 0.533 ± 0.037 | 0.0 ± 0.0
Ile | 5.596 ± 0.114 | 1.043 ± 0.045 | 6.762 ± 0.117 | 6.582 ± 0.135 | 3.605 ± 0.105 | 6.266 ± 0.12 | 1.124 ± 0.05 | 7.123 ± 0.155 | 7.395 ± 0.121 | 7.794 ± 0.161 | 2.385 ± 0.075 | 4.715 ± 0.107 | 2.869 ± 0.081 | 1.846 ± 0.061 | 3.485 ± 0.086 | 6.319 ± 0.146 | 3.66 ± 0.081 | 6.324 ± 0.122 | 0.425 ± 0.028 | 3.272 ± 0.081 | 0.0 ± 0.0
Lys | 5.021 ± 0.132 | 0.66 ± 0.038 | 6.38 ± 0.159 | 7.007 ± 0.122 | 3.009 ± 0.08 | 4.811 ± 0.097 | 1.133 ± 0.046 | 7.415 ± 0.127 | 8.401 ± 0.158 | 7.373 ± 0.13 | 2.685 ± 0.069 | 5.734 ± 0.116 | 2.299 ± 0.097 | 1.889 ± 0.06 | 3.261 ± 0.085 | 5.307 ± 0.105 | 4.121 ± 0.116 | 5.547 ± 0.131 | 0.511 ± 0.036 | 3.842 ± 0.082 | 0.0 ± 0.0
Leu | 6.161 ± 0.128 | 0.881 ± 0.039 | 6.455 ± 0.132 | 6.751 ± 0.141 | 3.288 ± 0.091 | 6.402 ± 0.122 | 1.019 ± 0.042 | 6.617 ± 0.156 | 6.968 ± 0.127 | 7.167 ± 0.165 | 2.652 ± 0.069 | 4.43 ± 0.097 | 2.705 ± 0.084 | 1.517 ± 0.059 | 3.468 ± 0.088 | 6.194 ± 0.135 | 4.061 ± 0.085 | 6.972 ± 0.11 | 0.427 ± 0.028 | 3.018 ± 0.083 | 0.0 ± 0.0
Met | 2.236 ± 0.067 | 0.298 ± 0.023 | 2.273 ± 0.07 | 1.97 ± 0.066 | 1.096 ± 0.04 | 2.234 ± 0.074 | 0.348 ± 0.025 | 2.394 ± 0.069 | 2.61 ± 0.075 | 2.405 ± 0.07 | 0.962 ± 0.044 | 1.56 ± 0.062 | 0.925 ± 0.044 | 0.69 ± 0.041 | 1.153 ± 0.048 | 2.148 ± 0.063 | 1.707 ± 0.066 | 2.337 ± 0.077 | 0.177 ± 0.016 | 1.041 ± 0.047 | 0.0 ± 0.0
Asn | 2.67 ± 0.081 | 0.497 ± 0.033 | 2.832 ± 0.074 | 3.204 ± 0.079 | 2.017 ± 0.062 | 3.183 ± 0.092 | 0.754 ± 0.03 | 5.536 ± 0.108 | 4.947 ± 0.115 | 4.601 ± 0.098 | 1.736 ± 0.064 | 2.828 ± 0.086 | 2.192 ± 0.067 | 1.377 ± 0.055 | 2.211 ± 0.067 | 3.141 ± 0.084 | 2.634 ± 0.082 | 2.974 ± 0.076 | 0.357 ± 0.024 | 2.111 ± 0.074 | 0.0 ± 0.0
Pro | 1.753 ± 0.065 | 0.327 ± 0.023 | 1.977 ± 0.078 | 2.332 ± 0.074 | 1.195 ± 0.047 | 1.929 ± 0.069 | 0.48 ± 0.029 | 2.632 ± 0.071 | 2.293 ± 0.078 | 2.236 ± 0.068 | 0.874 ± 0.041 | 1.504 ± 0.067 | 0.566 ± 0.029 | 0.885 ± 0.046 | 1.08 ± 0.042 | 1.944 ± 0.06 | 1.526 ± 0.072 | 2.341 ± 0.074 | 0.18 ± 0.019 | 1.054 ± 0.043 | 0.0 ± 0.0
Gln | 1.8 ± 0.068 | 0.191 ± 0.017 | 1.571 ± 0.053 | 1.848 ± 0.061 | 0.853 ± 0.041 | 1.574 ± 0.057 | 0.303 ± 0.021 | 2.063 ± 0.057 | 2.054 ± 0.062 | 2.041 ± 0.064 | 0.831 ± 0.038 | 1.251 ± 0.049 | 0.653 ± 0.04 | 0.528 ± 0.033 | 1.102 ± 0.044 | 1.28 ± 0.05 | 1.21 ± 0.05 | 1.898 ± 0.066 | 0.182 ± 0.018 | 0.883 ± 0.04 | 0.0 ± 0.0
Arg | 2.396 ± 0.07 | 0.388 ± 0.03 | 2.698 ± 0.072 | 3.145 ± 0.086 | 1.672 ± 0.054 | 2.628 ± 0.076 | 0.682 ± 0.035 | 3.548 ± 0.089 | 3.239 ± 0.092 | 3.893 ± 0.098 | 1.214 ± 0.052 | 2.023 ± 0.062 | 1.328 ± 0.049 | 1.322 ± 0.055 | 1.865 ± 0.064 | 2.336 ± 0.069 | 1.896 ± 0.062 | 3.114 ± 0.071 | 0.223 ± 0.024 | 1.753 ± 0.058 | 0.0 ± 0.0
Ser | 3.369 ± 0.085 | 0.653 ± 0.039 | 3.873 ± 0.091 | 3.54 ± 0.087 | 2.685 ± 0.071 | 4.666 ± 0.101 | 1.048 ± 0.051 | 6.306 ± 0.126 | 5.811 ± 0.105 | 6.174 ± 0.136 | 2.028 ± 0.063 | 3.172 ± 0.077 | 1.85 ± 0.068 | 1.951 ± 0.061 | 2.823 ± 0.082 | 4.607 ± 0.13 | 3.06 ± 0.084 | 4.296 ± 0.103 | 0.383 ± 0.024 | 2.735 ± 0.075 | 0.0 ± 0.0
Thr | 3.005 ± 0.104 | 0.506 ± 0.034 | 2.841 ± 0.094 | 2.595 ± 0.1 | 1.826 ± 0.074 | 4.031 ± 0.096 | 0.714 ± 0.032 | 4.335 ± 0.093 | 3.634 ± 0.105 | 3.73 ± 0.092 | 1.24 ± 0.042 | 2.459 ± 0.069 | 1.679 ± 0.086 | 1.085 ± 0.044 | 2.223 ± 0.066 | 3.566 ± 0.091 | 2.529 ± 0.113 | 3.342 ± 0.111 | 0.316 ± 0.026 | 1.909 ± 0.066 | 0.0 ± 0.0
Val | 4.732 ± 0.107 | 0.89 ± 0.044 | 5.282 ± 0.091 | 5.159 ± 0.113 | 3.119 ± 0.073 | 4.936 ± 0.121 | 0.855 ± 0.046 | 5.512 ± 0.115 | 5.309 ± 0.114 | 6.209 ± 0.132 | 1.856 ± 0.075 | 3.402 ± 0.086 | 2.03 ± 0.063 | 1.4 ± 0.047 | 2.65 ± 0.076 | 4.862 ± 0.108 | 3.314 ± 0.113 | 5.302 ± 0.111 | 0.395 ± 0.029 | 2.832 ± 0.072 | 0.0 ± 0.0
Trp | 0.348 ± 0.022 | 0.081 ± 0.015 | 0.425 ± 0.036 | 0.322 ± 0.027 | 0.25 ± 0.021 | 0.43 ± 0.03 | 0.097 ± 0.013 | 0.515 ± 0.033 | 0.465 ± 0.033 | 0.495 ± 0.033 | 0.204 ± 0.02 | 0.326 ± 0.024 | 0.166 ± 0.018 | 0.177 ± 0.017 | 0.193 ± 0.02 | 0.397 ± 0.025 | 0.32 ± 0.023 | 0.405 ± 0.029 | 0.055 ± 0.008 | 0.274 ± 0.021 | 0.0 ± 0.0
Tyr | 2.014 ± 0.054 | 0.526 ± 0.033 | 2.832 ± 0.091 | 2.782 ± 0.09 | 1.71 ± 0.057 | 2.58 ± 0.077 | 0.557 ± 0.034 | 3.597 ± 0.092 | 3.79 ± 0.088 | 3.513 ± 0.078 | 1.153 ± 0.043 | 2.08 ± 0.069 | 1.282 ± 0.046 | 1.089 ± 0.044 | 1.942 ± 0.056 | 2.845 ± 0.073 | 2.001 ± 0.063 | 2.448 ± 0.078 | 0.263 ± 0.021 | 1.887 ± 0.063 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 1713 proteins (543760 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
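A protein-level bootstrap of this kind might look like the sketch below. It is a rough illustration rather than the database's actual procedure: it assumes that whole proteins are resampled with replacement 100 times, that the dipeptide frequency is recomputed on each resample, and that the reported ± value is the standard deviation across resamples. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_per_mille(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's frequency in per mille."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteome = ["MKKLLAKK", "MSTAVLG", "MKKIEELLKK", "MGSWY"]  # hypothetical
mean_kk, err_kk = bootstrap_per_mille(toy_proteome, "KK")
print(f"LysLys: {mean_kk:.3f} ± {err_kk:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimated error reflects variation between proteins.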

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski