Amino acid dipeptide frequency for Acidaminococcus sp. CAG:917

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
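A minimal sketch of how such a frequency could be computed is shown here. It assumes that dipeptides are counted as overlapping pairs of adjacent residues within each protein and that any residue outside the 20 standard ones is mapped to X (Xaa); the counting procedure actually used by the database is not stated on this page. The function name `dipeptide_permille`, the one-letter residue codes, and the toy sequences are illustrative only.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the proteome.
freqs = dipeptide_permille(["MKKLA", "AKKU"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Pairs are never formed across protein boundaries in this sketch, so a proteome with N residues in P proteins (each at least one residue long) yields N - P dipeptide positions.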

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
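The uniform expectation can be written out as a short calculation. The sketch below assumes the baseline is simply 1 divided by the number of possible dipeptides; the exact threshold the page uses for its red and blue colouring is not stated, so `classify` is a hypothetical helper that only compares a value against that baseline.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)

def classify(observed_permille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed baseline)."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"

print(expected_permille())    # 2.5 per mille, i.e. 0.25%
print(expected_permille(21))  # slightly lower when Xaa is counted as a 21st residue

# Two values copied from the table on this page.
print("LeuLys", classify(8.595))
print("TrpTrp", classify(0.101))
```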

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
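As a small worked example of the unit conversion, the following sketch turns a per mille value into a fraction and into a percentage. The helper names are illustrative; the AlaAla value is copied from the table on this page.

```python
def permille_to_fraction(value_permille):
    """Per mille -> fraction of all dipeptides (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0

ala_ala = 5.358  # AlaAla, in per mille
print(permille_to_fraction(ala_ala))  # about 0.005358
print(permille_to_percent(ala_ala))   # about 0.5358
```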

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.358 ± 0.167
AlaCys: 1.068 ± 0.059
AlaAsp: 4.823 ± 0.121
AlaGlu: 5.201 ± 0.132
AlaPhe: 3.529 ± 0.12
AlaGly: 4.604 ± 0.128
AlaHis: 0.995 ± 0.058
AlaIle: 5.594 ± 0.129
AlaLys: 6.411 ± 0.172
AlaLeu: 7.474 ± 0.178
AlaMet: 1.964 ± 0.069
AlaAsn: 3.09 ± 0.106
AlaPro: 2.068 ± 0.086
AlaGln: 2.131 ± 0.084
AlaArg: 2.468 ± 0.086
AlaSer: 4.486 ± 0.129
AlaThr: 3.912 ± 0.129
AlaVal: 6.811 ± 0.133
AlaTrp: 0.333 ± 0.032
AlaTyr: 2.714 ± 0.076
AlaXaa: 0.01 ± 0.005
Cys
CysAla: 1.196 ± 0.069
CysCys: 0.304 ± 0.031
CysAsp: 1.369 ± 0.064
CysGlu: 0.981 ± 0.057
CysPhe: 0.771 ± 0.043
CysGly: 1.526 ± 0.068
CysHis: 0.28 ± 0.027
CysIle: 0.94 ± 0.047
CysLys: 0.894 ± 0.044
CysLeu: 1.328 ± 0.065
CysMet: 0.335 ± 0.029
CysAsn: 0.658 ± 0.046
CysPro: 0.783 ± 0.051
CysGln: 0.313 ± 0.026
CysArg: 0.537 ± 0.038
CysSer: 0.995 ± 0.062
CysThr: 0.665 ± 0.039
CysVal: 1.051 ± 0.065
CysTrp: 0.072 ± 0.013
CysTyr: 0.566 ± 0.042
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.929 ± 0.11
AspCys: 0.983 ± 0.052
AspAsp: 3.531 ± 0.109
AspGlu: 5.245 ± 0.12
AspPhe: 3.625 ± 0.097
AspGly: 5.218 ± 0.181
AspHis: 0.523 ± 0.04
AspIle: 4.818 ± 0.105
AspLys: 5.505 ± 0.125
AspLeu: 4.401 ± 0.117
AspMet: 1.726 ± 0.062
AspAsn: 2.651 ± 0.085
AspPro: 1.2 ± 0.06
AspGln: 0.721 ± 0.043
AspArg: 2.082 ± 0.075
AspSer: 3.936 ± 0.112
AspThr: 3.196 ± 0.098
AspVal: 4.459 ± 0.11
AspTrp: 0.378 ± 0.032
AspTyr: 3.032 ± 0.101
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.045 ± 0.121
GluCys: 0.94 ± 0.055
GluAsp: 3.384 ± 0.109
GluGlu: 5.216 ± 0.163
GluPhe: 3.01 ± 0.083
GluGly: 4.404 ± 0.117
GluHis: 0.87 ± 0.044
GluIle: 5.862 ± 0.118
GluLys: 7.187 ± 0.176
GluLeu: 5.751 ± 0.128
GluMet: 1.933 ± 0.08
GluAsn: 4.37 ± 0.101
GluPro: 1.543 ± 0.071
GluGln: 2.169 ± 0.082
GluArg: 3.02 ± 0.103
GluSer: 3.497 ± 0.094
GluThr: 3.524 ± 0.104
GluVal: 4.548 ± 0.099
GluTrp: 0.494 ± 0.04
GluTyr: 2.832 ± 0.08
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.601 ± 0.093
PheCys: 0.921 ± 0.046
PheAsp: 3.664 ± 0.092
PheGlu: 3.249 ± 0.081
PhePhe: 2.082 ± 0.079
PheGly: 3.415 ± 0.095
PheHis: 0.494 ± 0.036
PheIle: 3.326 ± 0.097
PheLys: 3.919 ± 0.104
PheLeu: 3.673 ± 0.116
PheMet: 1.085 ± 0.051
PheAsn: 2.309 ± 0.076
PhePro: 1.239 ± 0.058
PheGln: 0.682 ± 0.038
PheArg: 1.302 ± 0.06
PheSer: 3.584 ± 0.088
PheThr: 2.497 ± 0.094
PheVal: 3.507 ± 0.097
PheTrp: 0.335 ± 0.028
PheTyr: 1.945 ± 0.081
PheXaa: 0.005 ± 0.003
Gly
GlyAla: 5.054 ± 0.145
GlyCys: 1.082 ± 0.069
GlyAsp: 4.249 ± 0.115
GlyGlu: 5.088 ± 0.121
GlyPhe: 3.182 ± 0.087
GlyGly: 5.177 ± 0.152
GlyHis: 0.918 ± 0.055
GlyIle: 5.462 ± 0.13
GlyLys: 6.057 ± 0.124
GlyLeu: 5.418 ± 0.121
GlyMet: 1.764 ± 0.076
GlyAsn: 2.912 ± 0.132
GlyPro: 0.974 ± 0.049
GlyGln: 1.837 ± 0.074
GlyArg: 2.601 ± 0.084
GlySer: 3.753 ± 0.107
GlyThr: 3.688 ± 0.127
GlyVal: 5.611 ± 0.125
GlyTrp: 0.501 ± 0.039
GlyTyr: 3.317 ± 0.092
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.851 ± 0.047
HisCys: 0.251 ± 0.03
HisAsp: 0.764 ± 0.042
HisGlu: 0.653 ± 0.043
HisPhe: 0.668 ± 0.044
HisGly: 0.993 ± 0.055
HisHis: 0.258 ± 0.026
HisIle: 1.094 ± 0.055
HisLys: 0.836 ± 0.049
HisLeu: 1.101 ± 0.065
HisMet: 0.26 ± 0.023
HisAsn: 0.651 ± 0.046
HisPro: 0.653 ± 0.041
HisGln: 0.294 ± 0.025
HisArg: 0.559 ± 0.039
HisSer: 0.901 ± 0.044
HisThr: 0.766 ± 0.043
HisVal: 0.699 ± 0.042
HisTrp: 0.072 ± 0.011
HisTyr: 0.523 ± 0.043
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.356 ± 0.148
IleCys: 1.333 ± 0.055
IleAsp: 4.958 ± 0.124
IleGlu: 5.428 ± 0.107
IlePhe: 3.17 ± 0.096
IleGly: 4.895 ± 0.123
IleHis: 0.728 ± 0.045
IleIle: 5.293 ± 0.136
IleLys: 6.006 ± 0.123
IleLeu: 6.267 ± 0.134
IleMet: 1.74 ± 0.074
IleAsn: 3.476 ± 0.102
IlePro: 2.577 ± 0.089
IleGln: 1.222 ± 0.056
IleArg: 2.374 ± 0.069
IleSer: 5.5 ± 0.136
IleThr: 3.866 ± 0.105
IleVal: 5.688 ± 0.134
IleTrp: 0.448 ± 0.033
IleTyr: 2.75 ± 0.085
IleXaa: 0.002 ± 0.002
Lys
LysAla: 6.508 ± 0.146
LysCys: 1.13 ± 0.07
LysAsp: 4.642 ± 0.117
LysGlu: 6.585 ± 0.147
LysPhe: 3.145 ± 0.1
LysGly: 5.115 ± 0.132
LysHis: 0.967 ± 0.044
LysIle: 6.766 ± 0.159
LysLys: 8.118 ± 0.215
LysLeu: 6.681 ± 0.16
LysMet: 2.456 ± 0.088
LysAsn: 4.888 ± 0.119
LysPro: 2.307 ± 0.086
LysGln: 2.512 ± 0.09
LysArg: 3.439 ± 0.111
LysSer: 4.917 ± 0.111
LysThr: 4.924 ± 0.121
LysVal: 5.245 ± 0.129
LysTrp: 0.521 ± 0.032
LysTyr: 3.29 ± 0.084
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.409 ± 0.132
LeuCys: 1.634 ± 0.08
LeuAsp: 5.387 ± 0.146
LeuGlu: 5.201 ± 0.137
LeuPhe: 3.873 ± 0.124
LeuGly: 5.585 ± 0.131
LeuHis: 1.138 ± 0.06
LeuIle: 6.262 ± 0.154
LeuLys: 8.595 ± 0.187
LeuLeu: 7.496 ± 0.184
LeuMet: 1.894 ± 0.066
LeuAsn: 4.517 ± 0.113
LeuPro: 3.329 ± 0.09
LeuGln: 1.699 ± 0.064
LeuArg: 2.977 ± 0.092
LeuSer: 7.568 ± 0.146
LeuThr: 5.307 ± 0.131
LeuVal: 5.529 ± 0.123
LeuTrp: 0.55 ± 0.036
LeuTyr: 3.232 ± 0.1
LeuXaa: 0.005 ± 0.003
Met
MetAla: 1.894 ± 0.089
MetCys: 0.325 ± 0.027
MetAsp: 1.232 ± 0.06
MetGlu: 1.408 ± 0.061
MetPhe: 1.085 ± 0.055
MetGly: 1.692 ± 0.079
MetHis: 0.33 ± 0.027
MetIle: 1.697 ± 0.069
MetLys: 1.967 ± 0.072
MetLeu: 2.442 ± 0.086
MetMet: 0.513 ± 0.04
MetAsn: 1.155 ± 0.051
MetPro: 1.2 ± 0.054
MetGln: 0.87 ± 0.044
MetArg: 1.065 ± 0.049
MetSer: 1.788 ± 0.066
MetThr: 1.653 ± 0.059
MetVal: 1.306 ± 0.052
MetTrp: 0.159 ± 0.02
MetTyr: 0.672 ± 0.041
MetXaa: 0.002 ± 0.002
Asn
AsnAla: 3.777 ± 0.107
AsnCys: 0.747 ± 0.046
AsnAsp: 2.449 ± 0.093
AsnGlu: 2.955 ± 0.088
AsnPhe: 2.152 ± 0.078
AsnGly: 3.755 ± 0.125
AsnHis: 0.636 ± 0.043
AsnIle: 3.502 ± 0.104
AsnLys: 3.374 ± 0.1
AsnLeu: 4.527 ± 0.113
AsnMet: 1.282 ± 0.05
AsnAsn: 2.051 ± 0.086
AsnPro: 2.104 ± 0.074
AsnGln: 1.027 ± 0.051
AsnArg: 1.579 ± 0.063
AsnSer: 3.037 ± 0.108
AsnThr: 2.379 ± 0.087
AsnVal: 3.427 ± 0.108
AsnTrp: 0.352 ± 0.03
AsnTyr: 2.123 ± 0.12
AsnXaa: 0.005 ± 0.003
Pro
ProAla: 1.909 ± 0.067
ProCys: 0.48 ± 0.042
ProAsp: 2.136 ± 0.078
ProGlu: 2.34 ± 0.081
ProPhe: 1.738 ± 0.077
ProGly: 1.208 ± 0.057
ProHis: 0.525 ± 0.04
ProIle: 2.037 ± 0.071
ProLys: 2.439 ± 0.08
ProLeu: 2.849 ± 0.086
ProMet: 0.721 ± 0.046
ProAsn: 1.246 ± 0.067
ProPro: 0.844 ± 0.047
ProGln: 1.007 ± 0.048
ProArg: 0.841 ± 0.048
ProSer: 2.357 ± 0.077
ProThr: 1.909 ± 0.07
ProVal: 2.473 ± 0.089
ProTrp: 0.234 ± 0.024
ProTyr: 1.528 ± 0.062
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.658 ± 0.075
GlnCys: 0.268 ± 0.026
GlnAsp: 0.993 ± 0.053
GlnGlu: 1.244 ± 0.065
GlnPhe: 1.082 ± 0.055
GlnGly: 1.574 ± 0.065
GlnHis: 0.27 ± 0.029
GlnIle: 2.184 ± 0.076
GlnLys: 2.567 ± 0.09
GlnLeu: 1.998 ± 0.07
GlnMet: 0.67 ± 0.04
GlnAsn: 1.711 ± 0.07
GlnPro: 0.74 ± 0.043
GlnGln: 0.677 ± 0.05
GlnArg: 1.106 ± 0.055
GlnSer: 1.661 ± 0.07
GlnThr: 1.591 ± 0.065
GlnVal: 1.302 ± 0.055
GlnTrp: 0.161 ± 0.022
GlnTyr: 0.926 ± 0.048
GlnXaa: 0.005 ± 0.004
Arg
ArgAla: 2.569 ± 0.088
ArgCys: 0.523 ± 0.039
ArgAsp: 1.772 ± 0.073
ArgGlu: 2.579 ± 0.088
ArgPhe: 1.817 ± 0.068
ArgGly: 2.23 ± 0.089
ArgHis: 0.709 ± 0.041
ArgIle: 2.728 ± 0.085
ArgLys: 2.646 ± 0.085
ArgLeu: 3.652 ± 0.107
ArgMet: 0.827 ± 0.041
ArgAsn: 1.417 ± 0.059
ArgPro: 1.077 ± 0.057
ArgGln: 1.371 ± 0.067
ArgArg: 1.651 ± 0.08
ArgSer: 1.846 ± 0.066
ArgThr: 1.752 ± 0.065
ArgVal: 2.463 ± 0.075
ArgTrp: 0.246 ± 0.025
ArgTyr: 1.697 ± 0.064
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.05 ± 0.121
SerCys: 0.839 ± 0.051
SerAsp: 4.987 ± 0.113
SerGlu: 4.71 ± 0.117
SerPhe: 3.237 ± 0.108
SerGly: 5.091 ± 0.144
SerHis: 0.892 ± 0.052
SerIle: 4.298 ± 0.103
SerLys: 4.567 ± 0.109
SerLeu: 6.522 ± 0.144
SerMet: 1.403 ± 0.057
SerAsn: 2.625 ± 0.102
SerPro: 2.029 ± 0.084
SerGln: 1.731 ± 0.067
SerArg: 2.15 ± 0.08
SerSer: 4.37 ± 0.135
SerThr: 3.085 ± 0.101
SerVal: 5.541 ± 0.136
SerTrp: 0.359 ± 0.031
SerTyr: 2.577 ± 0.09
SerXaa: 0.002 ± 0.002
Thr
ThrAla: 4.705 ± 0.111
ThrCys: 0.559 ± 0.032
ThrAsp: 3.358 ± 0.103
ThrGlu: 3.439 ± 0.099
ThrPhe: 2.499 ± 0.102
ThrGly: 4.03 ± 0.12
ThrHis: 0.827 ± 0.045
ThrIle: 3.637 ± 0.112
ThrLys: 3.615 ± 0.101
ThrLeu: 5.317 ± 0.126
ThrMet: 1.147 ± 0.049
ThrAsn: 2.099 ± 0.091
ThrPro: 2.217 ± 0.085
ThrGln: 1.376 ± 0.064
ThrArg: 1.692 ± 0.057
ThrSer: 3.439 ± 0.131
ThrThr: 2.794 ± 0.119
ThrVal: 5.344 ± 0.122
ThrTrp: 0.248 ± 0.029
ThrTyr: 2.049 ± 0.118
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.662 ± 0.146
ValCys: 1.398 ± 0.075
ValAsp: 4.213 ± 0.105
ValGlu: 5.076 ± 0.112
ValPhe: 3.794 ± 0.104
ValGly: 4.633 ± 0.114
ValHis: 0.848 ± 0.048
ValIle: 5.122 ± 0.121
ValLys: 6.153 ± 0.13
ValLeu: 7.171 ± 0.12
ValMet: 1.752 ± 0.067
ValAsn: 3.186 ± 0.087
ValPro: 2.398 ± 0.071
ValGln: 1.504 ± 0.052
ValArg: 2.381 ± 0.083
ValSer: 5.175 ± 0.145
ValThr: 4.245 ± 0.121
ValVal: 5.488 ± 0.136
ValTrp: 0.46 ± 0.036
ValTyr: 3.015 ± 0.11
ValXaa: 0.002 ± 0.002
Trp
TrpAla: 0.513 ± 0.04
TrpCys: 0.116 ± 0.018
TrpAsp: 0.403 ± 0.03
TrpGlu: 0.403 ± 0.033
TrpPhe: 0.304 ± 0.025
TrpGly: 0.547 ± 0.036
TrpHis: 0.128 ± 0.016
TrpIle: 0.374 ± 0.025
TrpLys: 0.436 ± 0.04
TrpLeu: 0.55 ± 0.035
TrpMet: 0.135 ± 0.019
TrpAsn: 0.357 ± 0.029
TrpPro: 0.087 ± 0.017
TrpGln: 0.251 ± 0.024
TrpArg: 0.239 ± 0.026
TrpSer: 0.364 ± 0.028
TrpThr: 0.335 ± 0.027
TrpVal: 0.349 ± 0.032
TrpTrp: 0.101 ± 0.017
TrpTyr: 0.284 ± 0.032
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.998 ± 0.094
TyrCys: 0.665 ± 0.045
TyrAsp: 2.849 ± 0.098
TyrGlu: 2.593 ± 0.077
TyrPhe: 2.07 ± 0.068
TyrGly: 2.948 ± 0.085
TyrHis: 0.513 ± 0.032
TyrIle: 3.08 ± 0.096
TyrLys: 2.859 ± 0.08
TyrLeu: 3.567 ± 0.087
TyrMet: 0.846 ± 0.044
TyrAsn: 1.873 ± 0.086
TyrPro: 1.451 ± 0.069
TyrGln: 0.991 ± 0.049
TyrArg: 1.502 ± 0.063
TyrSer: 2.888 ± 0.1
TyrThr: 2.304 ± 0.121
TyrVal: 2.827 ± 0.153
TyrTrp: 0.248 ± 0.027
TyrTyr: 1.825 ± 0.11
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.002 ± 0.002
XaaCys: 0.0 ± 0.0
XaaAsp: 0.002 ± 0.002
XaaGlu: 0.005 ± 0.003
XaaPhe: 0.0 ± 0.0
XaaGly: 0.007 ± 0.004
XaaHis: 0.0 ± 0.0
XaaIle: 0.002 ± 0.002
XaaLys: 0.0 ± 0.0
XaaLeu: 0.002 ± 0.002
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.002 ± 0.003
XaaArg: 0.007 ± 0.004
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.007 ± 0.003
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.058 ± 0.016
Statistics based on 1276 proteins (414890 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
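A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each resample, and that the reported error is the standard deviation across resamples; the page does not say which spread statistic is used. The function names, one-letter residue codes, and toy sequences are illustrative only.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter gives 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input, not taken from the proteome.
toy_proteins = ["MKKLAKK", "MAAKL", "MKLLKK"]
mean_kk, err_kk = bootstrap_error(toy_proteins, "KK")
print(f"KK: {mean_kk:.3f} ± {err_kk:.3f}")
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the spread reflects variation between proteins.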

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski