Amino acid dipeptide frequency for Streptomyces sp. SID7803

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred in proteins uniformly at random, each would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions. On this scale the uniform expectation of 0.25% corresponds to 2.5‰.
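The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the code used by Proteome-pI: the function names `dipeptide_permille` and `classify`, the one-letter alphabet, the toy sequences, and the choice to normalise by the total number of adjacent pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (assumed alphabet for this sketch).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    # Assumption: normalise by the number of all observed pairs.
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(STANDARD, repeat=2)
    }


def classify(permille, expected=1000.0 / 400):
    """Compare a value with the uniform expectation (2.5 per mille)."""
    if permille > expected:
        return "over"
    if permille < expected:
        return "under"
    return "expected"


# Toy input for illustration only, not Streptomyces data.
toy = ["MAAALG", "MAAG"]
freqs = dipeptide_permille(toy)
```

With values from the table, `classify(15.063)` (AlaAla) gives `"over"` and `classify(0.993)` (AlaCys) gives `"under"`; multiplying a table value by `1e-3` converts it to a plain fraction.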

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 15.063 ± 0.395 | 0.993 ± 0.087 | 6.873 ± 0.221 | 7.233 ± 0.232 | 3.159 ± 0.181 | 10.27 ± 0.253 | 2.432 ± 0.142 | 3.57 ± 0.159 | 3.332 ± 0.191 | 11.054 ± 0.345 | 2.871 ± 0.149 | 1.922 ± 0.11 | 5.534 ± 0.216 | 3.577 ± 0.152 | 9.147 ± 0.295 | 6.556 ± 0.212 | 6.383 ± 0.236 | 9.435 ± 0.258 | 1.655 ± 0.118 | 2.735 ± 0.128 | 0.0 ± 0.0 |
| Cys | 1.18 ± 0.094 | 0.266 ± 0.049 | 0.59 ± 0.063 | 0.324 ± 0.046 | 0.237 ± 0.045 | 1.065 ± 0.09 | 0.23 ± 0.036 | 0.223 ± 0.039 | 0.158 ± 0.03 | 0.828 ± 0.082 | 0.202 ± 0.04 | 0.166 ± 0.031 | 0.799 ± 0.094 | 0.237 ± 0.039 | 1.044 ± 0.079 | 0.813 ± 0.075 | 0.727 ± 0.076 | 0.784 ± 0.076 | 0.173 ± 0.034 | 0.137 ± 0.028 | 0.0 ± 0.0 |
| Asp | 6.175 ± 0.212 | 0.511 ± 0.062 | 3.886 ± 0.167 | 3.814 ± 0.177 | 1.936 ± 0.136 | 5.707 ± 0.2 | 1.504 ± 0.116 | 2.274 ± 0.142 | 1.713 ± 0.12 | 5.858 ± 0.192 | 0.907 ± 0.073 | 1.123 ± 0.101 | 3.67 ± 0.156 | 1.821 ± 0.133 | 5.045 ± 0.177 | 3.203 ± 0.136 | 3.483 ± 0.146 | 4.692 ± 0.177 | 0.914 ± 0.076 | 1.166 ± 0.106 | 0.0 ± 0.0 |
| Glu | 6.499 ± 0.239 | 0.446 ± 0.072 | 3.181 ± 0.146 | 3.541 ± 0.166 | 1.662 ± 0.117 | 3.901 ± 0.182 | 1.353 ± 0.095 | 2.368 ± 0.129 | 2.238 ± 0.137 | 6.549 ± 0.234 | 1.101 ± 0.103 | 1.432 ± 0.108 | 2.764 ± 0.138 | 2.584 ± 0.133 | 5.39 ± 0.223 | 2.67 ± 0.138 | 3.023 ± 0.15 | 4.347 ± 0.164 | 0.727 ± 0.071 | 1.252 ± 0.089 | 0.0 ± 0.0 |
| Phe | 3.498 ± 0.169 | 0.252 ± 0.044 | 2.195 ± 0.119 | 1.756 ± 0.107 | 1.065 ± 0.097 | 3.174 ± 0.166 | 0.77 ± 0.075 | 0.957 ± 0.083 | 0.806 ± 0.077 | 2.476 ± 0.143 | 0.497 ± 0.065 | 0.72 ± 0.075 | 1.252 ± 0.112 | 0.684 ± 0.065 | 1.972 ± 0.105 | 1.785 ± 0.11 | 1.814 ± 0.123 | 2.238 ± 0.138 | 0.475 ± 0.056 | 0.626 ± 0.062 | 0.0 ± 0.0 |
| Gly | 8.06 ± 0.237 | 1.072 ± 0.09 | 4.505 ± 0.191 | 4.599 ± 0.173 | 2.792 ± 0.155 | 11.695 ± 0.334 | 2.476 ± 0.128 | 3.246 ± 0.139 | 2.915 ± 0.176 | 7.866 ± 0.219 | 2.238 ± 0.13 | 1.993 ± 0.131 | 4.232 ± 0.178 | 2.785 ± 0.138 | 7.679 ± 0.255 | 5.693 ± 0.229 | 5.606 ± 0.219 | 6.117 ± 0.204 | 1.36 ± 0.099 | 2.368 ± 0.123 | 0.0 ± 0.0 |
| His | 2.562 ± 0.142 | 0.23 ± 0.043 | 1.353 ± 0.095 | 1.339 ± 0.093 | 0.777 ± 0.079 | 2.267 ± 0.135 | 0.799 ± 0.081 | 0.813 ± 0.079 | 0.561 ± 0.058 | 2.361 ± 0.135 | 0.475 ± 0.06 | 0.432 ± 0.052 | 1.72 ± 0.128 | 0.799 ± 0.077 | 2.627 ± 0.173 | 1.123 ± 0.088 | 1.303 ± 0.093 | 1.518 ± 0.1 | 0.381 ± 0.053 | 0.583 ± 0.062 | 0.0 ± 0.0 |
| Ile | 4.548 ± 0.185 | 0.381 ± 0.052 | 2.562 ± 0.155 | 2.425 ± 0.147 | 0.77 ± 0.084 | 3.382 ± 0.167 | 0.626 ± 0.063 | 1.044 ± 0.088 | 1.058 ± 0.101 | 2.792 ± 0.126 | 0.54 ± 0.076 | 0.943 ± 0.084 | 1.734 ± 0.111 | 1.029 ± 0.081 | 2.562 ± 0.139 | 2.274 ± 0.111 | 2.281 ± 0.12 | 3.001 ± 0.149 | 0.533 ± 0.076 | 0.698 ± 0.073 | 0.0 ± 0.0 |
| Lys | 3.274 ± 0.182 | 0.194 ± 0.035 | 1.619 ± 0.102 | 1.547 ± 0.102 | 0.655 ± 0.065 | 2.317 ± 0.152 | 0.727 ± 0.071 | 1.324 ± 0.108 | 1.432 ± 0.131 | 2.9 ± 0.159 | 0.734 ± 0.08 | 0.727 ± 0.083 | 1.727 ± 0.115 | 1.065 ± 0.087 | 1.957 ± 0.119 | 1.684 ± 0.131 | 1.655 ± 0.121 | 2.461 ± 0.161 | 0.367 ± 0.058 | 0.641 ± 0.062 | 0.0 ± 0.0 |
| Leu | 11.234 ± 0.329 | 0.9 ± 0.078 | 6.643 ± 0.22 | 4.721 ± 0.188 | 2.735 ± 0.133 | 7.528 ± 0.243 | 2.267 ± 0.122 | 3.21 ± 0.174 | 2.555 ± 0.135 | 9.917 ± 0.305 | 1.67 ± 0.11 | 2.029 ± 0.131 | 5.498 ± 0.178 | 2.49 ± 0.132 | 7.895 ± 0.242 | 5.671 ± 0.189 | 6.074 ± 0.238 | 7.808 ± 0.299 | 1.072 ± 0.093 | 1.806 ± 0.108 | 0.0 ± 0.0 |
| Met | 2.555 ± 0.131 | 0.158 ± 0.037 | 1.0 ± 0.077 | 0.979 ± 0.088 | 0.612 ± 0.06 | 1.209 ± 0.092 | 0.525 ± 0.062 | 0.806 ± 0.072 | 0.597 ± 0.076 | 1.828 ± 0.109 | 0.425 ± 0.053 | 0.669 ± 0.079 | 1.245 ± 0.097 | 0.533 ± 0.059 | 1.706 ± 0.108 | 1.979 ± 0.125 | 1.814 ± 0.104 | 1.497 ± 0.108 | 0.288 ± 0.04 | 0.41 ± 0.059 | 0.0 ± 0.0 |
| Asn | 2.411 ± 0.118 | 0.259 ± 0.047 | 1.202 ± 0.097 | 1.094 ± 0.092 | 0.727 ± 0.07 | 2.289 ± 0.137 | 0.576 ± 0.07 | 0.885 ± 0.078 | 0.734 ± 0.069 | 1.907 ± 0.113 | 0.381 ± 0.047 | 0.662 ± 0.082 | 1.367 ± 0.101 | 0.605 ± 0.065 | 1.706 ± 0.108 | 1.375 ± 0.111 | 1.331 ± 0.104 | 1.698 ± 0.117 | 0.345 ± 0.058 | 0.497 ± 0.064 | 0.0 ± 0.0 |
| Pro | 7.096 ± 0.254 | 0.641 ± 0.067 | 3.865 ± 0.185 | 3.67 ± 0.165 | 1.267 ± 0.092 | 5.664 ± 0.213 | 1.303 ± 0.103 | 1.418 ± 0.109 | 1.281 ± 0.095 | 4.239 ± 0.177 | 1.173 ± 0.096 | 1.058 ± 0.104 | 8.01 ± 0.343 | 1.713 ± 0.122 | 4.62 ± 0.213 | 3.728 ± 0.178 | 3.332 ± 0.152 | 4.721 ± 0.187 | 0.849 ± 0.09 | 1.231 ± 0.092 | 0.0 ± 0.0 |
| Gln | 3.929 ± 0.166 | 0.23 ± 0.039 | 1.72 ± 0.113 | 1.806 ± 0.113 | 0.842 ± 0.069 | 2.13 ± 0.13 | 0.82 ± 0.083 | 1.231 ± 0.097 | 0.907 ± 0.087 | 3.346 ± 0.15 | 0.705 ± 0.076 | 0.655 ± 0.077 | 1.605 ± 0.106 | 1.598 ± 0.136 | 2.533 ± 0.146 | 1.367 ± 0.107 | 1.483 ± 0.095 | 2.612 ± 0.14 | 0.533 ± 0.064 | 0.648 ± 0.067 | 0.0 ± 0.0 |
| Arg | 8.442 ± 0.25 | 0.943 ± 0.101 | 4.224 ± 0.169 | 4.786 ± 0.177 | 2.332 ± 0.129 | 5.829 ± 0.225 | 2.361 ± 0.145 | 3.454 ± 0.157 | 2.202 ± 0.13 | 8.118 ± 0.257 | 2.094 ± 0.121 | 1.763 ± 0.108 | 5.75 ± 0.219 | 2.8 ± 0.142 | 9.946 ± 0.328 | 6.052 ± 0.23 | 5.757 ± 0.224 | 5.527 ± 0.212 | 1.245 ± 0.09 | 1.821 ± 0.123 | 0.0 ± 0.0 |
| Ser | 6.981 ± 0.226 | 0.885 ± 0.084 | 3.274 ± 0.144 | 3.066 ± 0.167 | 1.698 ± 0.114 | 6.499 ± 0.249 | 1.418 ± 0.114 | 1.842 ± 0.121 | 1.605 ± 0.109 | 4.764 ± 0.18 | 1.375 ± 0.091 | 1.49 ± 0.101 | 3.908 ± 0.199 | 1.432 ± 0.109 | 4.966 ± 0.222 | 4.541 ± 0.259 | 4.469 ± 0.219 | 4.555 ± 0.198 | 1.202 ± 0.099 | 1.454 ± 0.092 | 0.0 ± 0.0 |
| Thr | 7.7 ± 0.226 | 0.727 ± 0.064 | 3.454 ± 0.164 | 3.476 ± 0.183 | 1.857 ± 0.127 | 6.196 ± 0.252 | 1.238 ± 0.091 | 2.209 ± 0.144 | 1.634 ± 0.12 | 5.318 ± 0.208 | 1.115 ± 0.094 | 1.411 ± 0.105 | 3.929 ± 0.196 | 1.331 ± 0.109 | 4.419 ± 0.214 | 4.21 ± 0.165 | 4.361 ± 0.2 | 5.563 ± 0.233 | 0.972 ± 0.09 | 1.59 ± 0.12 | 0.0 ± 0.0 |
| Val | 8.456 ± 0.245 | 0.676 ± 0.074 | 4.764 ± 0.163 | 4.807 ± 0.214 | 2.641 ± 0.138 | 5.412 ± 0.209 | 2.008 ± 0.116 | 3.087 ± 0.142 | 2.202 ± 0.126 | 8.125 ± 0.297 | 1.698 ± 0.108 | 1.9 ± 0.128 | 4.34 ± 0.187 | 2.281 ± 0.127 | 6.894 ± 0.237 | 4.275 ± 0.177 | 5.419 ± 0.195 | 7.053 ± 0.264 | 0.95 ± 0.094 | 1.533 ± 0.101 | 0.0 ± 0.0 |
| Trp | 1.454 ± 0.11 | 0.23 ± 0.043 | 0.856 ± 0.079 | 0.461 ± 0.05 | 0.525 ± 0.072 | 0.741 ± 0.07 | 0.317 ± 0.056 | 0.655 ± 0.068 | 0.446 ± 0.055 | 1.367 ± 0.117 | 0.317 ± 0.044 | 0.432 ± 0.056 | 0.842 ± 0.083 | 0.712 ± 0.084 | 1.461 ± 0.108 | 1.187 ± 0.104 | 0.986 ± 0.089 | 1.008 ± 0.084 | 0.353 ± 0.05 | 0.403 ± 0.053 | 0.0 ± 0.0 |
| Tyr | 2.677 ± 0.141 | 0.187 ± 0.04 | 1.634 ± 0.103 | 1.583 ± 0.105 | 0.72 ± 0.066 | 2.022 ± 0.12 | 0.273 ± 0.043 | 0.569 ± 0.064 | 0.633 ± 0.065 | 2.109 ± 0.128 | 0.309 ± 0.047 | 0.525 ± 0.067 | 0.957 ± 0.078 | 0.626 ± 0.058 | 1.878 ± 0.124 | 1.238 ± 0.095 | 1.411 ± 0.108 | 1.886 ± 0.103 | 0.36 ± 0.054 | 0.432 ± 0.058 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 1189 proteins (138954 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
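A protein-level bootstrap of this kind can be sketched as follows. The page states only that the error was estimated by bootstrapping (×100) at the protein level; the rest of this example is an assumption, including the helper names `pair_counts` and `bootstrap_error`, the use of the standard deviation across replicates as the reported error, and the fixed random seed.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so residues from one
    protein always stay together (resampling at the protein level).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the spread of the replicates is taken as the error.
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input for illustration only, not Streptomyces data.
mean_aa, err_aa = bootstrap_error(["MAAALG", "MAAG", "MKKL"], "AA")
```

Resampling whole proteins rather than individual residues keeps the within-protein correlation of dipeptides intact, which is presumably why the error is estimated at that level.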

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski