Amino acid dipepetide frequency for Dolosigranulum pigrum ATCC 51524

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.131AlaAla: 5.131 ± 0.121
0.46AlaCys: 0.46 ± 0.03
4.671AlaAsp: 4.671 ± 0.106
5.836AlaGlu: 5.836 ± 0.124
2.775AlaPhe: 2.775 ± 0.071
5.293AlaGly: 5.293 ± 0.136
1.532AlaHis: 1.532 ± 0.061
5.97AlaIle: 5.97 ± 0.099
4.111AlaLys: 4.111 ± 0.107
7.362AlaLeu: 7.362 ± 0.136
1.881AlaMet: 1.881 ± 0.067
2.983AlaAsn: 2.983 ± 0.08
2.129AlaPro: 2.129 ± 0.072
3.314AlaGln: 3.314 ± 0.087
2.946AlaArg: 2.946 ± 0.099
4.241AlaSer: 4.241 ± 0.079
4.377AlaThr: 4.377 ± 0.086
5.144AlaVal: 5.144 ± 0.098
0.54AlaTrp: 0.54 ± 0.034
2.536AlaTyr: 2.536 ± 0.082
0.0AlaXaa: 0.0 ± 0.0
Cys
0.309CysAla: 0.309 ± 0.026
0.076CysCys: 0.076 ± 0.014
0.296CysAsp: 0.296 ± 0.027
0.298CysGlu: 0.298 ± 0.026
0.199CysPhe: 0.199 ± 0.018
0.543CysGly: 0.543 ± 0.034
0.16CysHis: 0.16 ± 0.018
0.303CysIle: 0.303 ± 0.025
0.216CysLys: 0.216 ± 0.021
0.471CysLeu: 0.471 ± 0.031
0.119CysMet: 0.119 ± 0.013
0.195CysAsn: 0.195 ± 0.022
0.251CysPro: 0.251 ± 0.023
0.259CysGln: 0.259 ± 0.023
0.225CysArg: 0.225 ± 0.021
0.329CysSer: 0.329 ± 0.025
0.246CysThr: 0.246 ± 0.022
0.301CysVal: 0.301 ± 0.024
0.048CysTrp: 0.048 ± 0.011
0.192CysTyr: 0.192 ± 0.021
0.0CysXaa: 0.0 ± 0.0
Asp
3.919AspAla: 3.919 ± 0.1
0.29AspCys: 0.29 ± 0.026
3.497AspAsp: 3.497 ± 0.106
5.529AspGlu: 5.529 ± 0.119
2.544AspPhe: 2.544 ± 0.07
4.035AspGly: 4.035 ± 0.132
1.478AspHis: 1.478 ± 0.056
4.915AspIle: 4.915 ± 0.103
3.714AspLys: 3.714 ± 0.098
5.348AspLeu: 5.348 ± 0.111
1.664AspMet: 1.664 ± 0.053
2.447AspAsn: 2.447 ± 0.073
1.988AspPro: 1.988 ± 0.063
2.991AspGln: 2.991 ± 0.079
2.561AspArg: 2.561 ± 0.073
3.005AspSer: 3.005 ± 0.079
2.996AspThr: 2.996 ± 0.077
4.518AspVal: 4.518 ± 0.098
0.59AspTrp: 0.59 ± 0.038
2.864AspTyr: 2.864 ± 0.091
0.0AspXaa: 0.0 ± 0.0
Glu
6.921GluAla: 6.921 ± 0.134
0.285GluCys: 0.285 ± 0.028
4.416GluAsp: 4.416 ± 0.1
6.792GluGlu: 6.792 ± 0.155
2.598GluPhe: 2.598 ± 0.083
4.094GluGly: 4.094 ± 0.113
1.669GluHis: 1.669 ± 0.057
4.989GluIle: 4.989 ± 0.108
4.838GluLys: 4.838 ± 0.136
7.956GluLeu: 7.956 ± 0.141
2.207GluMet: 2.207 ± 0.071
3.143GluAsn: 3.143 ± 0.083
2.215GluPro: 2.215 ± 0.063
4.379GluGln: 4.379 ± 0.113
3.856GluArg: 3.856 ± 0.088
3.86GluSer: 3.86 ± 0.095
4.349GluThr: 4.349 ± 0.112
5.473GluVal: 5.473 ± 0.107
0.677GluTrp: 0.677 ± 0.037
2.311GluTyr: 2.311 ± 0.06
0.0GluXaa: 0.0 ± 0.0
Phe
2.613PheAla: 2.613 ± 0.07
0.212PheCys: 0.212 ± 0.022
2.717PheAsp: 2.717 ± 0.071
2.605PheGlu: 2.605 ± 0.066
1.982PhePhe: 1.982 ± 0.08
3.046PheGly: 3.046 ± 0.09
0.668PheHis: 0.668 ± 0.037
3.402PheIle: 3.402 ± 0.102
2.419PheLys: 2.419 ± 0.086
3.761PheLeu: 3.761 ± 0.088
1.044PheMet: 1.044 ± 0.047
2.168PheAsn: 2.168 ± 0.07
1.433PhePro: 1.433 ± 0.051
1.446PheGln: 1.446 ± 0.051
1.463PheArg: 1.463 ± 0.057
3.104PheSer: 3.104 ± 0.084
2.337PheThr: 2.337 ± 0.063
2.89PheVal: 2.89 ± 0.076
0.367PheTrp: 0.367 ± 0.031
1.632PheTyr: 1.632 ± 0.062
0.0PheXaa: 0.0 ± 0.0
Gly
4.853GlyAla: 4.853 ± 0.124
0.38GlyCys: 0.38 ± 0.027
3.607GlyAsp: 3.607 ± 0.099
4.742GlyGlu: 4.742 ± 0.118
2.951GlyPhe: 2.951 ± 0.074
4.693GlyGly: 4.693 ± 0.129
1.468GlyHis: 1.468 ± 0.057
5.44GlyIle: 5.44 ± 0.132
3.953GlyLys: 3.953 ± 0.11
6.541GlyLeu: 6.541 ± 0.139
1.935GlyMet: 1.935 ± 0.067
2.423GlyAsn: 2.423 ± 0.082
1.556GlyPro: 1.556 ± 0.055
3.158GlyGln: 3.158 ± 0.08
2.747GlyArg: 2.747 ± 0.084
3.638GlySer: 3.638 ± 0.078
4.066GlyThr: 4.066 ± 0.096
5.051GlyVal: 5.051 ± 0.123
0.569GlyTrp: 0.569 ± 0.038
2.635GlyTyr: 2.635 ± 0.075
0.0GlyXaa: 0.0 ± 0.0
His
1.353HisAla: 1.353 ± 0.054
0.127HisCys: 0.127 ± 0.014
1.28HisAsp: 1.28 ± 0.049
1.5HisGlu: 1.5 ± 0.044
0.914HisPhe: 0.914 ± 0.04
1.383HisGly: 1.383 ± 0.047
0.683HisHis: 0.683 ± 0.037
1.716HisIle: 1.716 ± 0.064
1.027HisLys: 1.027 ± 0.044
2.188HisLeu: 2.188 ± 0.074
0.542HisMet: 0.542 ± 0.036
0.912HisAsn: 0.912 ± 0.042
1.107HisPro: 1.107 ± 0.046
1.04HisGln: 1.04 ± 0.051
0.895HisArg: 0.895 ± 0.047
1.176HisSer: 1.176 ± 0.051
1.118HisThr: 1.118 ± 0.053
1.491HisVal: 1.491 ± 0.05
0.207HisTrp: 0.207 ± 0.021
1.053HisTyr: 1.053 ± 0.042
0.0HisXaa: 0.0 ± 0.0
Ile
6.02IleAla: 6.02 ± 0.103
0.475IleCys: 0.475 ± 0.036
5.268IleAsp: 5.268 ± 0.112
5.97IleGlu: 5.97 ± 0.102
3.216IlePhe: 3.216 ± 0.095
5.492IleGly: 5.492 ± 0.138
1.465IleHis: 1.465 ± 0.053
6.19IleIle: 6.19 ± 0.159
3.966IleLys: 3.966 ± 0.089
6.882IleLeu: 6.882 ± 0.157
1.766IleMet: 1.766 ± 0.064
3.461IleAsn: 3.461 ± 0.094
2.894IlePro: 2.894 ± 0.08
3.184IleGln: 3.184 ± 0.072
2.879IleArg: 2.879 ± 0.071
4.639IleSer: 4.639 ± 0.102
4.23IleThr: 4.23 ± 0.111
5.428IleVal: 5.428 ± 0.101
0.556IleTrp: 0.556 ± 0.035
2.68IleTyr: 2.68 ± 0.075
0.0IleXaa: 0.0 ± 0.0
Lys
4.062LysAla: 4.062 ± 0.101
0.175LysCys: 0.175 ± 0.021
3.313LysAsp: 3.313 ± 0.103
5.222LysGlu: 5.222 ± 0.134
1.835LysPhe: 1.835 ± 0.062
3.136LysGly: 3.136 ± 0.09
1.273LysHis: 1.273 ± 0.053
3.742LysIle: 3.742 ± 0.09
4.293LysLys: 4.293 ± 0.115
5.103LysLeu: 5.103 ± 0.096
1.602LysMet: 1.602 ± 0.056
2.667LysAsn: 2.667 ± 0.098
1.937LysPro: 1.937 ± 0.085
3.478LysGln: 3.478 ± 0.094
3.004LysArg: 3.004 ± 0.084
3.059LysSer: 3.059 ± 0.09
3.18LysThr: 3.18 ± 0.076
3.688LysVal: 3.688 ± 0.088
0.512LysTrp: 0.512 ± 0.033
1.893LysTyr: 1.893 ± 0.063
0.0LysXaa: 0.0 ± 0.0
Leu
7.87LeuAla: 7.87 ± 0.152
0.488LeuCys: 0.488 ± 0.031
5.977LeuAsp: 5.977 ± 0.099
6.792LeuGlu: 6.792 ± 0.113
4.323LeuPhe: 4.323 ± 0.118
6.257LeuGly: 6.257 ± 0.135
1.785LeuHis: 1.785 ± 0.061
7.444LeuIle: 7.444 ± 0.157
5.553LeuLys: 5.553 ± 0.105
9.323LeuLeu: 9.323 ± 0.187
2.641LeuMet: 2.641 ± 0.079
4.466LeuAsn: 4.466 ± 0.097
3.674LeuPro: 3.674 ± 0.094
3.573LeuGln: 3.573 ± 0.086
3.58LeuArg: 3.58 ± 0.084
6.949LeuSer: 6.949 ± 0.136
6.316LeuThr: 6.316 ± 0.122
6.1LeuVal: 6.1 ± 0.107
0.69LeuTrp: 0.69 ± 0.032
3.177LeuTyr: 3.177 ± 0.092
0.0LeuXaa: 0.0 ± 0.0
Met
1.79MetAla: 1.79 ± 0.064
0.1MetCys: 0.1 ± 0.014
1.472MetAsp: 1.472 ± 0.058
1.515MetGlu: 1.515 ± 0.05
0.849MetPhe: 0.849 ± 0.038
1.846MetGly: 1.846 ± 0.072
0.478MetHis: 0.478 ± 0.029
2.226MetIle: 2.226 ± 0.069
1.917MetLys: 1.917 ± 0.056
2.326MetLeu: 2.326 ± 0.077
0.891MetMet: 0.891 ± 0.043
1.513MetAsn: 1.513 ± 0.058
0.88MetPro: 0.88 ± 0.04
0.951MetGln: 0.951 ± 0.043
1.068MetArg: 1.068 ± 0.044
1.874MetSer: 1.874 ± 0.06
1.906MetThr: 1.906 ± 0.062
1.623MetVal: 1.623 ± 0.062
0.182MetTrp: 0.182 ± 0.02
0.722MetTyr: 0.722 ± 0.034
0.0MetXaa: 0.0 ± 0.0
Asn
2.622AsnAla: 2.622 ± 0.075
0.268AsnCys: 0.268 ± 0.026
2.408AsnAsp: 2.408 ± 0.067
3.143AsnGlu: 3.143 ± 0.087
1.706AsnPhe: 1.706 ± 0.062
2.858AsnGly: 2.858 ± 0.086
1.109AsnHis: 1.109 ± 0.046
3.789AsnIle: 3.789 ± 0.098
2.689AsnLys: 2.689 ± 0.098
3.927AsnLeu: 3.927 ± 0.084
1.152AsnMet: 1.152 ± 0.049
2.194AsnAsn: 2.194 ± 0.08
1.907AsnPro: 1.907 ± 0.066
2.326AsnGln: 2.326 ± 0.073
2.06AsnArg: 2.06 ± 0.083
2.129AsnSer: 2.129 ± 0.064
2.192AsnThr: 2.192 ± 0.063
2.741AsnVal: 2.741 ± 0.073
0.484AsnTrp: 0.484 ± 0.029
1.785AsnTyr: 1.785 ± 0.059
0.0AsnXaa: 0.0 ± 0.0
Pro
2.561ProAla: 2.561 ± 0.078
0.128ProCys: 0.128 ± 0.015
2.376ProAsp: 2.376 ± 0.073
3.227ProGlu: 3.227 ± 0.103
1.608ProPhe: 1.608 ± 0.063
2.192ProGly: 2.192 ± 0.084
0.806ProHis: 0.806 ± 0.037
2.648ProIle: 2.648 ± 0.072
1.822ProLys: 1.822 ± 0.064
3.069ProLeu: 3.069 ± 0.08
0.683ProMet: 0.683 ± 0.036
1.692ProAsn: 1.692 ± 0.066
0.748ProPro: 0.748 ± 0.034
1.288ProGln: 1.288 ± 0.053
1.057ProArg: 1.057 ± 0.05
2.03ProSer: 2.03 ± 0.061
2.389ProThr: 2.389 ± 0.064
2.501ProVal: 2.501 ± 0.073
0.29ProTrp: 0.29 ± 0.024
1.373ProTyr: 1.373 ± 0.061
0.0ProXaa: 0.0 ± 0.0
Gln
4.332GlnAla: 4.332 ± 0.108
0.166GlnCys: 0.166 ± 0.018
2.216GlnAsp: 2.216 ± 0.067
3.443GlnGlu: 3.443 ± 0.09
1.863GlnPhe: 1.863 ± 0.068
2.358GlnGly: 2.358 ± 0.07
1.018GlnHis: 1.018 ± 0.043
2.603GlnIle: 2.603 ± 0.063
2.458GlnLys: 2.458 ± 0.085
5.564GlnLeu: 5.564 ± 0.127
1.176GlnMet: 1.176 ± 0.042
1.593GlnAsn: 1.593 ± 0.054
1.755GlnPro: 1.755 ± 0.081
2.695GlnGln: 2.695 ± 0.099
1.785GlnArg: 1.785 ± 0.062
2.844GlnSer: 2.844 ± 0.086
2.663GlnThr: 2.663 ± 0.079
3.179GlnVal: 3.179 ± 0.086
0.419GlnTrp: 0.419 ± 0.032
1.747GlnTyr: 1.747 ± 0.064
0.0GlnXaa: 0.0 ± 0.0
Arg
2.659ArgAla: 2.659 ± 0.075
0.218ArgCys: 0.218 ± 0.021
2.389ArgAsp: 2.389 ± 0.075
3.603ArgGlu: 3.603 ± 0.099
1.747ArgPhe: 1.747 ± 0.059
2.522ArgGly: 2.522 ± 0.072
0.906ArgHis: 0.906 ± 0.043
3.035ArgIle: 3.035 ± 0.084
2.449ArgLys: 2.449 ± 0.08
4.156ArgLeu: 4.156 ± 0.088
1.126ArgMet: 1.126 ± 0.043
1.573ArgAsn: 1.573 ± 0.056
1.347ArgPro: 1.347 ± 0.046
2.434ArgGln: 2.434 ± 0.065
2.012ArgArg: 2.012 ± 0.064
2.241ArgSer: 2.241 ± 0.075
2.097ArgThr: 2.097 ± 0.068
2.842ArgVal: 2.842 ± 0.078
0.346ArgTrp: 0.346 ± 0.026
1.621ArgTyr: 1.621 ± 0.056
0.0ArgXaa: 0.0 ± 0.0
Ser
3.923SerAla: 3.923 ± 0.088
0.259SerCys: 0.259 ± 0.025
3.683SerAsp: 3.683 ± 0.104
4.356SerGlu: 4.356 ± 0.092
2.68SerPhe: 2.68 ± 0.081
4.688SerGly: 4.688 ± 0.097
1.314SerHis: 1.314 ± 0.05
4.533SerIle: 4.533 ± 0.111
2.951SerLys: 2.951 ± 0.074
5.78SerLeu: 5.78 ± 0.107
1.409SerMet: 1.409 ± 0.048
2.563SerAsn: 2.563 ± 0.069
1.978SerPro: 1.978 ± 0.061
2.425SerGln: 2.425 ± 0.057
2.451SerArg: 2.451 ± 0.069
3.61SerSer: 3.61 ± 0.084
3.435SerThr: 3.435 ± 0.081
4.09SerVal: 4.09 ± 0.088
0.534SerTrp: 0.534 ± 0.034
2.248SerTyr: 2.248 ± 0.067
0.0SerXaa: 0.0 ± 0.0
Thr
4.129ThrAla: 4.129 ± 0.098
0.275ThrCys: 0.275 ± 0.024
3.811ThrAsp: 3.811 ± 0.092
4.17ThrGlu: 4.17 ± 0.095
2.518ThrPhe: 2.518 ± 0.068
4.005ThrGly: 4.005 ± 0.091
1.364ThrHis: 1.364 ± 0.052
4.963ThrIle: 4.963 ± 0.097
2.769ThrLys: 2.769 ± 0.071
5.892ThrLeu: 5.892 ± 0.117
1.29ThrMet: 1.29 ± 0.052
2.339ThrAsn: 2.339 ± 0.074
2.594ThrPro: 2.594 ± 0.091
2.071ThrGln: 2.071 ± 0.068
1.902ThrArg: 1.902 ± 0.066
3.29ThrSer: 3.29 ± 0.077
3.607ThrThr: 3.607 ± 0.1
4.617ThrVal: 4.617 ± 0.106
0.441ThrTrp: 0.441 ± 0.033
2.213ThrTyr: 2.213 ± 0.064
0.0ThrXaa: 0.0 ± 0.0
Val
5.32ValAla: 5.32 ± 0.099
0.337ValCys: 0.337 ± 0.027
4.693ValAsp: 4.693 ± 0.102
5.08ValGlu: 5.08 ± 0.12
2.795ValPhe: 2.795 ± 0.079
5.012ValGly: 5.012 ± 0.114
1.347ValHis: 1.347 ± 0.056
5.404ValIle: 5.404 ± 0.131
3.713ValLys: 3.713 ± 0.086
6.482ValLeu: 6.482 ± 0.135
1.9ValMet: 1.9 ± 0.059
2.998ValAsn: 2.998 ± 0.081
2.492ValPro: 2.492 ± 0.071
2.592ValGln: 2.592 ± 0.072
2.713ValArg: 2.713 ± 0.08
4.381ValSer: 4.381 ± 0.095
4.303ValThr: 4.303 ± 0.099
5.162ValVal: 5.162 ± 0.11
0.584ValTrp: 0.584 ± 0.04
2.369ValTyr: 2.369 ± 0.062
0.0ValXaa: 0.0 ± 0.0
Trp
0.497TrpAla: 0.497 ± 0.033
0.056TrpCys: 0.056 ± 0.01
0.43TrpAsp: 0.43 ± 0.026
0.581TrpGlu: 0.581 ± 0.035
0.404TrpPhe: 0.404 ± 0.027
0.553TrpGly: 0.553 ± 0.035
0.203TrpHis: 0.203 ± 0.019
0.625TrpIle: 0.625 ± 0.035
0.439TrpLys: 0.439 ± 0.027
1.105TrpLeu: 1.105 ± 0.044
0.242TrpMet: 0.242 ± 0.024
0.381TrpAsn: 0.381 ± 0.028
0.223TrpPro: 0.223 ± 0.02
0.478TrpGln: 0.478 ± 0.033
0.376TrpArg: 0.376 ± 0.028
0.489TrpSer: 0.489 ± 0.028
0.462TrpThr: 0.462 ± 0.031
0.515TrpVal: 0.515 ± 0.033
0.119TrpTrp: 0.119 ± 0.014
0.296TrpTyr: 0.296 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.345TyrAla: 2.345 ± 0.076
0.264TyrCys: 0.264 ± 0.024
2.371TyrAsp: 2.371 ± 0.072
2.624TyrGlu: 2.624 ± 0.075
1.805TyrPhe: 1.805 ± 0.058
2.402TyrGly: 2.402 ± 0.076
0.986TyrHis: 0.986 ± 0.043
2.723TyrIle: 2.723 ± 0.082
1.876TyrLys: 1.876 ± 0.062
3.742TyrLeu: 3.742 ± 0.097
0.858TyrMet: 0.858 ± 0.041
1.779TyrAsn: 1.779 ± 0.061
1.381TyrPro: 1.381 ± 0.051
1.794TyrGln: 1.794 ± 0.066
1.744TyrArg: 1.744 ± 0.06
2.021TyrSer: 2.021 ± 0.068
1.991TyrThr: 1.991 ± 0.065
2.322TyrVal: 2.322 ± 0.066
0.318TyrTrp: 0.318 ± 0.023
1.591TyrTyr: 1.591 ± 0.062
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1691 proteins (537359 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski