Amino acid dipepetide frequency for Bacillus phage AR9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.157AlaAla: 1.157 ± 0.204
0.213AlaCys: 0.213 ± 0.06
2.247AlaAsp: 2.247 ± 0.215
2.14AlaGlu: 2.14 ± 0.157
1.848AlaPhe: 1.848 ± 0.142
2.021AlaGly: 2.021 ± 0.336
0.412AlaHis: 0.412 ± 0.089
3.762AlaIle: 3.762 ± 0.284
3.536AlaLys: 3.536 ± 0.355
3.284AlaLeu: 3.284 ± 0.256
0.851AlaMet: 0.851 ± 0.107
2.911AlaAsn: 2.911 ± 0.197
0.718AlaPro: 0.718 ± 0.117
0.744AlaGln: 0.744 ± 0.108
1.396AlaArg: 1.396 ± 0.128
2.194AlaSer: 2.194 ± 0.251
2.287AlaThr: 2.287 ± 0.231
2.26AlaVal: 2.26 ± 0.164
0.279AlaTrp: 0.279 ± 0.059
1.422AlaTyr: 1.422 ± 0.144
0.0AlaXaa: 0.0 ± 0.0
Cys
0.292CysAla: 0.292 ± 0.069
0.053CysCys: 0.053 ± 0.026
0.452CysAsp: 0.452 ± 0.094
0.439CysGlu: 0.439 ± 0.074
0.306CysPhe: 0.306 ± 0.069
0.532CysGly: 0.532 ± 0.085
0.146CysHis: 0.146 ± 0.049
0.532CysIle: 0.532 ± 0.083
0.758CysLys: 0.758 ± 0.122
0.585CysLeu: 0.585 ± 0.086
0.093CysMet: 0.093 ± 0.037
0.824CysAsn: 0.824 ± 0.155
0.332CysPro: 0.332 ± 0.082
0.173CysGln: 0.173 ± 0.049
0.306CysArg: 0.306 ± 0.075
0.412CysSer: 0.412 ± 0.079
0.439CysThr: 0.439 ± 0.071
0.253CysVal: 0.253 ± 0.06
0.053CysTrp: 0.053 ± 0.027
0.439CysTyr: 0.439 ± 0.082
0.0CysXaa: 0.0 ± 0.0
Asp
2.247AspAla: 2.247 ± 0.153
0.545AspCys: 0.545 ± 0.096
4.374AspAsp: 4.374 ± 0.375
6.009AspGlu: 6.009 ± 0.348
3.417AspPhe: 3.417 ± 0.229
3.363AspGly: 3.363 ± 0.222
0.798AspHis: 0.798 ± 0.119
7.498AspIle: 7.498 ± 0.31
6.102AspLys: 6.102 ± 0.279
6.102AspLeu: 6.102 ± 0.34
1.29AspMet: 1.29 ± 0.124
5.291AspAsn: 5.291 ± 0.274
2.18AspPro: 2.18 ± 0.194
1.462AspGln: 1.462 ± 0.127
2.101AspArg: 2.101 ± 0.176
4.4AspSer: 4.4 ± 0.244
3.417AspThr: 3.417 ± 0.216
3.324AspVal: 3.324 ± 0.222
0.665AspTrp: 0.665 ± 0.105
3.815AspTyr: 3.815 ± 0.238
0.0AspXaa: 0.0 ± 0.0
Glu
2.446GluAla: 2.446 ± 0.196
0.532GluCys: 0.532 ± 0.102
4.919GluAsp: 4.919 ± 0.286
7.498GluGlu: 7.498 ± 0.453
3.935GluPhe: 3.935 ± 0.249
3.204GluGly: 3.204 ± 0.205
1.103GluHis: 1.103 ± 0.122
7.325GluIle: 7.325 ± 0.382
7.91GluLys: 7.91 ± 0.402
7.325GluLeu: 7.325 ± 0.432
2.127GluMet: 2.127 ± 0.163
5.637GluAsn: 5.637 ± 0.272
1.143GluPro: 1.143 ± 0.148
2.101GluGln: 2.101 ± 0.17
2.832GluArg: 2.832 ± 0.235
4.733GluSer: 4.733 ± 0.278
3.457GluThr: 3.457 ± 0.2
4.135GluVal: 4.135 ± 0.285
0.572GluTrp: 0.572 ± 0.092
4.347GluTyr: 4.347 ± 0.312
0.0GluXaa: 0.0 ± 0.0
Phe
1.609PheAla: 1.609 ± 0.137
0.319PheCys: 0.319 ± 0.064
3.403PheAsp: 3.403 ± 0.213
2.951PheGlu: 2.951 ± 0.22
1.742PhePhe: 1.742 ± 0.149
2.247PheGly: 2.247 ± 0.256
0.811PheHis: 0.811 ± 0.128
4.56PheIle: 4.56 ± 0.291
4.547PheLys: 4.547 ± 0.278
4.321PheLeu: 4.321 ± 0.235
1.223PheMet: 1.223 ± 0.14
4.281PheAsn: 4.281 ± 0.23
1.13PhePro: 1.13 ± 0.123
1.024PheGln: 1.024 ± 0.113
1.422PheArg: 1.422 ± 0.121
4.121PheSer: 4.121 ± 0.201
2.38PheThr: 2.38 ± 0.197
1.875PheVal: 1.875 ± 0.143
0.279PheTrp: 0.279 ± 0.06
2.233PheTyr: 2.233 ± 0.195
0.0PheXaa: 0.0 ± 0.0
Gly
1.702GlyAla: 1.702 ± 0.252
0.266GlyCys: 0.266 ± 0.072
2.539GlyAsp: 2.539 ± 0.197
3.297GlyGlu: 3.297 ± 0.334
2.273GlyPhe: 2.273 ± 0.196
2.194GlyGly: 2.194 ± 0.432
0.731GlyHis: 0.731 ± 0.081
4.347GlyIle: 4.347 ± 0.323
5.584GlyLys: 5.584 ± 0.306
3.842GlyLeu: 3.842 ± 0.275
1.143GlyMet: 1.143 ± 0.135
3.802GlyAsn: 3.802 ± 0.253
0.292GlyPro: 0.292 ± 0.061
1.316GlyGln: 1.316 ± 0.153
1.914GlyArg: 1.914 ± 0.243
3.164GlySer: 3.164 ± 0.289
2.659GlyThr: 2.659 ± 0.19
2.712GlyVal: 2.712 ± 0.23
0.319GlyTrp: 0.319 ± 0.059
2.38GlyTyr: 2.38 ± 0.182
0.0GlyXaa: 0.0 ± 0.0
His
0.598HisAla: 0.598 ± 0.099
0.133HisCys: 0.133 ± 0.039
0.877HisAsp: 0.877 ± 0.111
0.851HisGlu: 0.851 ± 0.11
0.944HisPhe: 0.944 ± 0.115
0.545HisGly: 0.545 ± 0.098
0.292HisHis: 0.292 ± 0.066
1.502HisIle: 1.502 ± 0.184
1.17HisLys: 1.17 ± 0.108
1.383HisLeu: 1.383 ± 0.152
0.346HisMet: 0.346 ± 0.066
1.024HisAsn: 1.024 ± 0.112
0.572HisPro: 0.572 ± 0.106
0.266HisGln: 0.266 ± 0.048
0.598HisArg: 0.598 ± 0.094
0.957HisSer: 0.957 ± 0.109
0.545HisThr: 0.545 ± 0.084
0.771HisVal: 0.771 ± 0.111
0.066HisTrp: 0.066 ± 0.031
0.798HisTyr: 0.798 ± 0.113
0.0HisXaa: 0.0 ± 0.0
Ile
3.403IleAla: 3.403 ± 0.244
0.718IleCys: 0.718 ± 0.1
7.671IleAsp: 7.671 ± 0.33
7.551IleGlu: 7.551 ± 0.35
3.576IlePhe: 3.576 ± 0.248
4.228IleGly: 4.228 ± 0.307
1.17IleHis: 1.17 ± 0.126
8.827IleIle: 8.827 ± 0.477
9.293IleLys: 9.293 ± 0.402
7.325IleLeu: 7.325 ± 0.372
1.821IleMet: 1.821 ± 0.14
8.242IleAsn: 8.242 ± 0.419
2.925IlePro: 2.925 ± 0.246
2.632IleGln: 2.632 ± 0.206
3.736IleArg: 3.736 ± 0.206
6.993IleSer: 6.993 ± 0.295
5.437IleThr: 5.437 ± 0.292
4.573IleVal: 4.573 ± 0.274
0.518IleTrp: 0.518 ± 0.078
3.988IleTyr: 3.988 ± 0.258
0.0IleXaa: 0.0 ± 0.0
Lys
3.297LysAla: 3.297 ± 0.272
0.811LysCys: 0.811 ± 0.144
7.179LysAsp: 7.179 ± 0.294
9.306LysGlu: 9.306 ± 0.444
4.374LysPhe: 4.374 ± 0.229
4.719LysGly: 4.719 ± 0.27
1.29LysHis: 1.29 ± 0.157
8.562LysIle: 8.562 ± 0.377
9.359LysLys: 9.359 ± 0.462
8.11LysLeu: 8.11 ± 0.379
2.327LysMet: 2.327 ± 0.195
7.697LysAsn: 7.697 ± 0.358
2.313LysPro: 2.313 ± 0.231
2.366LysGln: 2.366 ± 0.185
4.4LysArg: 4.4 ± 0.287
5.491LysSer: 5.491 ± 0.298
4.945LysThr: 4.945 ± 0.221
5.318LysVal: 5.318 ± 0.3
0.784LysTrp: 0.784 ± 0.123
4.906LysTyr: 4.906 ± 0.324
0.0LysXaa: 0.0 ± 0.0
Leu
3.377LeuAla: 3.377 ± 0.217
0.585LeuCys: 0.585 ± 0.078
6.049LeuAsp: 6.049 ± 0.299
6.74LeuGlu: 6.74 ± 0.345
3.948LeuPhe: 3.948 ± 0.255
3.882LeuGly: 3.882 ± 0.278
1.117LeuHis: 1.117 ± 0.114
7.206LeuIle: 7.206 ± 0.371
8.389LeuLys: 8.389 ± 0.372
7.91LeuLeu: 7.91 ± 0.396
2.247LeuMet: 2.247 ± 0.172
7.538LeuAsn: 7.538 ± 0.335
2.406LeuPro: 2.406 ± 0.178
1.954LeuGln: 1.954 ± 0.164
2.925LeuArg: 2.925 ± 0.207
6.9LeuSer: 6.9 ± 0.396
4.387LeuThr: 4.387 ± 0.223
4.547LeuVal: 4.547 ± 0.252
0.465LeuTrp: 0.465 ± 0.079
3.842LeuTyr: 3.842 ± 0.265
0.0LeuXaa: 0.0 ± 0.0
Met
1.037MetAla: 1.037 ± 0.133
0.12MetCys: 0.12 ± 0.042
1.728MetAsp: 1.728 ± 0.143
1.502MetGlu: 1.502 ± 0.172
0.904MetPhe: 0.904 ± 0.109
1.01MetGly: 1.01 ± 0.127
0.279MetHis: 0.279 ± 0.072
1.888MetIle: 1.888 ± 0.165
2.553MetLys: 2.553 ± 0.21
1.595MetLeu: 1.595 ± 0.158
0.399MetMet: 0.399 ± 0.074
1.795MetAsn: 1.795 ± 0.161
0.625MetPro: 0.625 ± 0.081
0.678MetGln: 0.678 ± 0.091
0.838MetArg: 0.838 ± 0.097
1.861MetSer: 1.861 ± 0.164
1.369MetThr: 1.369 ± 0.127
1.13MetVal: 1.13 ± 0.13
0.12MetTrp: 0.12 ± 0.033
0.891MetTyr: 0.891 ± 0.109
0.0MetXaa: 0.0 ± 0.0
Asn
2.699AsnAla: 2.699 ± 0.272
0.678AsnCys: 0.678 ± 0.103
5.597AsnAsp: 5.597 ± 0.299
5.943AsnGlu: 5.943 ± 0.329
3.643AsnPhe: 3.643 ± 0.249
4.347AsnGly: 4.347 ± 0.311
1.183AsnHis: 1.183 ± 0.121
8.402AsnIle: 8.402 ± 0.372
8.615AsnLys: 8.615 ± 0.371
6.767AsnLeu: 6.767 ± 0.321
1.542AsnMet: 1.542 ± 0.16
7.591AsnAsn: 7.591 ± 0.381
2.273AsnPro: 2.273 ± 0.207
2.167AsnGln: 2.167 ± 0.185
2.632AsnArg: 2.632 ± 0.225
5.916AsnSer: 5.916 ± 0.334
4.493AsnThr: 4.493 ± 0.263
3.443AsnVal: 3.443 ± 0.215
0.479AsnTrp: 0.479 ± 0.08
4.174AsnTyr: 4.174 ± 0.257
0.0AsnXaa: 0.0 ± 0.0
Pro
0.877ProAla: 0.877 ± 0.122
0.146ProCys: 0.146 ± 0.047
1.901ProAsp: 1.901 ± 0.185
1.728ProGlu: 1.728 ± 0.167
1.303ProPhe: 1.303 ± 0.131
0.346ProGly: 0.346 ± 0.07
0.332ProHis: 0.332 ± 0.064
2.446ProIle: 2.446 ± 0.231
2.074ProLys: 2.074 ± 0.187
2.22ProLeu: 2.22 ± 0.169
0.625ProMet: 0.625 ± 0.108
2.154ProAsn: 2.154 ± 0.192
0.558ProPro: 0.558 ± 0.103
0.638ProGln: 0.638 ± 0.097
0.718ProArg: 0.718 ± 0.096
2.207ProSer: 2.207 ± 0.188
1.648ProThr: 1.648 ± 0.178
1.555ProVal: 1.555 ± 0.179
0.133ProTrp: 0.133 ± 0.04
1.462ProTyr: 1.462 ± 0.148
0.0ProXaa: 0.0 ± 0.0
Gln
1.09GlnAla: 1.09 ± 0.129
0.146GlnCys: 0.146 ± 0.044
1.29GlnAsp: 1.29 ± 0.117
2.061GlnGlu: 2.061 ± 0.227
1.25GlnPhe: 1.25 ± 0.137
1.276GlnGly: 1.276 ± 0.142
0.372GlnHis: 0.372 ± 0.073
2.154GlnIle: 2.154 ± 0.147
2.553GlnLys: 2.553 ± 0.188
2.619GlnLeu: 2.619 ± 0.168
0.558GlnMet: 0.558 ± 0.089
1.941GlnAsn: 1.941 ± 0.175
0.651GlnPro: 0.651 ± 0.1
0.824GlnGln: 0.824 ± 0.151
0.851GlnArg: 0.851 ± 0.106
1.489GlnSer: 1.489 ± 0.161
1.409GlnThr: 1.409 ± 0.147
1.329GlnVal: 1.329 ± 0.136
0.292GlnTrp: 0.292 ± 0.07
1.489GlnTyr: 1.489 ± 0.139
0.0GlnXaa: 0.0 ± 0.0
Arg
1.25ArgAla: 1.25 ± 0.132
0.386ArgCys: 0.386 ± 0.102
2.659ArgAsp: 2.659 ± 0.207
2.739ArgGlu: 2.739 ± 0.221
1.768ArgPhe: 1.768 ± 0.134
1.835ArgGly: 1.835 ± 0.194
0.479ArgHis: 0.479 ± 0.085
3.536ArgIle: 3.536 ± 0.253
3.762ArgLys: 3.762 ± 0.262
3.018ArgLeu: 3.018 ± 0.25
1.196ArgMet: 1.196 ± 0.139
2.965ArgAsn: 2.965 ± 0.249
0.891ArgPro: 0.891 ± 0.096
1.024ArgGln: 1.024 ± 0.111
1.555ArgArg: 1.555 ± 0.189
2.22ArgSer: 2.22 ± 0.203
1.529ArgThr: 1.529 ± 0.15
1.875ArgVal: 1.875 ± 0.159
0.319ArgTrp: 0.319 ± 0.07
1.755ArgTyr: 1.755 ± 0.155
0.0ArgXaa: 0.0 ± 0.0
Ser
2.566SerAla: 2.566 ± 0.322
0.412SerCys: 0.412 ± 0.079
4.799SerAsp: 4.799 ± 0.27
4.906SerGlu: 4.906 ± 0.25
3.297SerPhe: 3.297 ± 0.185
3.47SerGly: 3.47 ± 0.348
1.01SerHis: 1.01 ± 0.12
6.847SerIle: 6.847 ± 0.329
6.634SerLys: 6.634 ± 0.324
6.302SerLeu: 6.302 ± 0.263
1.343SerMet: 1.343 ± 0.145
5.69SerAsn: 5.69 ± 0.291
1.875SerPro: 1.875 ± 0.154
1.968SerGln: 1.968 ± 0.152
2.526SerArg: 2.526 ± 0.202
4.972SerSer: 4.972 ± 0.461
3.909SerThr: 3.909 ± 0.272
3.869SerVal: 3.869 ± 0.249
0.505SerTrp: 0.505 ± 0.092
3.284SerTyr: 3.284 ± 0.242
0.0SerXaa: 0.0 ± 0.0
Thr
1.981ThrAla: 1.981 ± 0.226
0.266ThrCys: 0.266 ± 0.053
3.789ThrAsp: 3.789 ± 0.351
4.135ThrGlu: 4.135 ± 0.257
2.606ThrPhe: 2.606 ± 0.172
2.685ThrGly: 2.685 ± 0.307
0.931ThrHis: 0.931 ± 0.092
5.318ThrIle: 5.318 ± 0.293
4.879ThrLys: 4.879 ± 0.248
4.906ThrLeu: 4.906 ± 0.224
0.957ThrMet: 0.957 ± 0.108
3.975ThrAsn: 3.975 ± 0.21
1.409ThrPro: 1.409 ± 0.164
1.05ThrGln: 1.05 ± 0.117
1.821ThrArg: 1.821 ± 0.18
3.802ThrSer: 3.802 ± 0.244
3.417ThrThr: 3.417 ± 0.251
3.55ThrVal: 3.55 ± 0.267
0.386ThrTrp: 0.386 ± 0.078
2.646ThrTyr: 2.646 ± 0.209
0.0ThrXaa: 0.0 ± 0.0
Val
2.021ValAla: 2.021 ± 0.184
0.505ValCys: 0.505 ± 0.092
3.098ValAsp: 3.098 ± 0.195
3.776ValGlu: 3.776 ± 0.204
2.313ValPhe: 2.313 ± 0.19
1.914ValGly: 1.914 ± 0.185
0.811ValHis: 0.811 ± 0.108
4.813ValIle: 4.813 ± 0.244
5.065ValLys: 5.065 ± 0.265
4.387ValLeu: 4.387 ± 0.272
1.064ValMet: 1.064 ± 0.119
4.148ValAsn: 4.148 ± 0.214
1.369ValPro: 1.369 ± 0.117
1.569ValGln: 1.569 ± 0.172
2.021ValArg: 2.021 ± 0.149
4.294ValSer: 4.294 ± 0.267
3.191ValThr: 3.191 ± 0.219
2.845ValVal: 2.845 ± 0.225
0.266ValTrp: 0.266 ± 0.058
2.446ValTyr: 2.446 ± 0.173
0.0ValXaa: 0.0 ± 0.0
Trp
0.213TrpAla: 0.213 ± 0.053
0.08TrpCys: 0.08 ± 0.037
0.492TrpAsp: 0.492 ± 0.078
0.665TrpGlu: 0.665 ± 0.097
0.213TrpPhe: 0.213 ± 0.055
0.306TrpGly: 0.306 ± 0.065
0.146TrpHis: 0.146 ± 0.037
0.479TrpIle: 0.479 ± 0.079
0.691TrpLys: 0.691 ± 0.098
0.651TrpLeu: 0.651 ± 0.089
0.093TrpMet: 0.093 ± 0.033
0.678TrpAsn: 0.678 ± 0.191
0.0TrpPro: 0.0 ± 0.0
0.173TrpGln: 0.173 ± 0.055
0.213TrpArg: 0.213 ± 0.051
0.518TrpSer: 0.518 ± 0.075
0.452TrpThr: 0.452 ± 0.092
0.386TrpVal: 0.386 ± 0.076
0.12TrpTrp: 0.12 ± 0.047
0.412TrpTyr: 0.412 ± 0.115
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.821TyrAla: 1.821 ± 0.176
0.505TyrCys: 0.505 ± 0.073
3.337TyrAsp: 3.337 ± 0.238
3.044TyrGlu: 3.044 ± 0.192
2.818TyrPhe: 2.818 ± 0.266
2.353TyrGly: 2.353 ± 0.19
0.824TyrHis: 0.824 ± 0.121
4.626TyrIle: 4.626 ± 0.316
4.081TyrLys: 4.081 ± 0.263
3.762TyrLeu: 3.762 ± 0.284
1.064TyrMet: 1.064 ± 0.121
4.4TyrAsn: 4.4 ± 0.293
1.409TyrPro: 1.409 ± 0.175
1.516TyrGln: 1.516 ± 0.156
1.954TyrArg: 1.954 ± 0.207
3.55TyrSer: 3.55 ± 0.223
3.137TyrThr: 3.137 ± 0.251
2.167TyrVal: 2.167 ± 0.183
0.359TyrTrp: 0.359 ± 0.075
2.739TyrTyr: 2.739 ± 0.242
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 291 proteins (75221 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski