Amino acid dipepetide frequency for Bacillus phage phiAGATE

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.409AlaAla: 5.409 ± 0.701
0.258AlaCys: 0.258 ± 0.083
4.261AlaAsp: 4.261 ± 0.35
5.198AlaGlu: 5.198 ± 0.48
2.248AlaPhe: 2.248 ± 0.221
3.981AlaGly: 3.981 ± 0.388
0.937AlaHis: 0.937 ± 0.176
4.215AlaIle: 4.215 ± 0.303
4.355AlaLys: 4.355 ± 0.353
5.456AlaLeu: 5.456 ± 0.41
1.569AlaMet: 1.569 ± 0.196
2.974AlaAsn: 2.974 ± 0.247
1.873AlaPro: 1.873 ± 0.387
2.482AlaGln: 2.482 ± 0.318
2.763AlaArg: 2.763 ± 0.232
3.653AlaSer: 3.653 ± 0.337
4.051AlaThr: 4.051 ± 0.478
5.104AlaVal: 5.104 ± 0.354
0.656AlaTrp: 0.656 ± 0.115
2.576AlaTyr: 2.576 ± 0.292
0.0AlaXaa: 0.0 ± 0.0
Cys
0.211CysAla: 0.211 ± 0.065
0.07CysCys: 0.07 ± 0.043
0.304CysAsp: 0.304 ± 0.072
0.492CysGlu: 0.492 ± 0.113
0.304CysPhe: 0.304 ± 0.084
0.492CysGly: 0.492 ± 0.122
0.164CysHis: 0.164 ± 0.059
0.211CysIle: 0.211 ± 0.072
0.632CysLys: 0.632 ± 0.131
0.726CysLeu: 0.726 ± 0.115
0.211CysMet: 0.211 ± 0.057
0.328CysAsn: 0.328 ± 0.092
0.468CysPro: 0.468 ± 0.103
0.117CysGln: 0.117 ± 0.065
0.328CysArg: 0.328 ± 0.087
0.562CysSer: 0.562 ± 0.11
0.304CysThr: 0.304 ± 0.097
0.468CysVal: 0.468 ± 0.1
0.14CysTrp: 0.14 ± 0.062
0.375CysTyr: 0.375 ± 0.1
0.0CysXaa: 0.0 ± 0.0
Asp
3.887AspAla: 3.887 ± 0.393
0.562AspCys: 0.562 ± 0.121
3.7AspAsp: 3.7 ± 0.344
4.8AspGlu: 4.8 ± 0.426
3.489AspPhe: 3.489 ± 0.309
3.91AspGly: 3.91 ± 0.423
0.913AspHis: 0.913 ± 0.139
5.058AspIle: 5.058 ± 0.303
4.613AspLys: 4.613 ± 0.339
5.034AspLeu: 5.034 ± 0.379
2.107AspMet: 2.107 ± 0.204
3.301AspAsn: 3.301 ± 0.309
1.92AspPro: 1.92 ± 0.205
1.99AspGln: 1.99 ± 0.178
3.184AspArg: 3.184 ± 0.279
3.372AspSer: 3.372 ± 0.315
3.278AspThr: 3.278 ± 0.27
4.987AspVal: 4.987 ± 0.329
0.562AspTrp: 0.562 ± 0.132
3.325AspTyr: 3.325 ± 0.261
0.0AspXaa: 0.0 ± 0.0
Glu
5.081GluAla: 5.081 ± 0.387
0.679GluCys: 0.679 ± 0.136
5.011GluAsp: 5.011 ± 0.432
9.319GluGlu: 9.319 ± 0.726
3.208GluPhe: 3.208 ± 0.27
4.777GluGly: 4.777 ± 0.351
1.499GluHis: 1.499 ± 0.196
5.034GluIle: 5.034 ± 0.398
6.299GluLys: 6.299 ± 0.509
7.984GluLeu: 7.984 ± 0.542
2.435GluMet: 2.435 ± 0.265
4.074GluAsn: 4.074 ± 0.308
1.709GluPro: 1.709 ± 0.213
3.114GluGln: 3.114 ± 0.28
3.419GluArg: 3.419 ± 0.32
4.051GluSer: 4.051 ± 0.305
3.255GluThr: 3.255 ± 0.256
6.369GluVal: 6.369 ± 0.414
1.124GluTrp: 1.124 ± 0.201
3.091GluTyr: 3.091 ± 0.352
0.0GluXaa: 0.0 ± 0.0
Phe
1.897PheAla: 1.897 ± 0.186
0.351PheCys: 0.351 ± 0.077
2.997PheAsp: 2.997 ± 0.282
2.997PheGlu: 2.997 ± 0.241
1.522PhePhe: 1.522 ± 0.225
1.967PheGly: 1.967 ± 0.22
0.937PheHis: 0.937 ± 0.162
2.88PheIle: 2.88 ± 0.281
3.114PheLys: 3.114 ± 0.261
3.067PheLeu: 3.067 ± 0.359
0.82PheMet: 0.82 ± 0.145
2.459PheAsn: 2.459 ± 0.242
1.147PhePro: 1.147 ± 0.172
1.335PheGln: 1.335 ± 0.188
1.218PheArg: 1.218 ± 0.168
3.278PheSer: 3.278 ± 0.331
2.88PheThr: 2.88 ± 0.248
2.599PheVal: 2.599 ± 0.324
0.281PheTrp: 0.281 ± 0.079
1.873PheTyr: 1.873 ± 0.26
0.0PheXaa: 0.0 ± 0.0
Gly
4.098GlyAla: 4.098 ± 0.545
0.375GlyCys: 0.375 ± 0.094
3.465GlyAsp: 3.465 ± 0.309
3.957GlyGlu: 3.957 ± 0.338
2.435GlyPhe: 2.435 ± 0.229
4.941GlyGly: 4.941 ± 0.725
1.288GlyHis: 1.288 ± 0.181
3.7GlyIle: 3.7 ± 0.411
4.472GlyLys: 4.472 ± 0.373
4.144GlyLeu: 4.144 ± 0.333
1.803GlyMet: 1.803 ± 0.253
3.746GlyAsn: 3.746 ± 0.312
0.281GlyPro: 0.281 ± 0.083
2.201GlyGln: 2.201 ± 0.284
2.576GlyArg: 2.576 ± 0.249
3.934GlySer: 3.934 ± 0.346
3.77GlyThr: 3.77 ± 0.367
5.151GlyVal: 5.151 ± 0.396
0.773GlyTrp: 0.773 ± 0.151
2.857GlyTyr: 2.857 ± 0.214
0.0GlyXaa: 0.0 ± 0.0
His
0.937HisAla: 0.937 ± 0.154
0.07HisCys: 0.07 ± 0.037
1.194HisAsp: 1.194 ± 0.147
0.796HisGlu: 0.796 ± 0.137
0.749HisPhe: 0.749 ± 0.152
0.96HisGly: 0.96 ± 0.152
0.562HisHis: 0.562 ± 0.099
1.967HisIle: 1.967 ± 0.203
1.499HisLys: 1.499 ± 0.208
1.78HisLeu: 1.78 ± 0.24
0.375HisMet: 0.375 ± 0.103
1.054HisAsn: 1.054 ± 0.149
0.702HisPro: 0.702 ± 0.123
0.375HisGln: 0.375 ± 0.077
0.913HisArg: 0.913 ± 0.184
1.452HisSer: 1.452 ± 0.2
1.007HisThr: 1.007 ± 0.14
1.335HisVal: 1.335 ± 0.17
0.375HisTrp: 0.375 ± 0.097
0.937HisTyr: 0.937 ± 0.142
0.0HisXaa: 0.0 ± 0.0
Ile
4.472IleAla: 4.472 ± 0.369
0.585IleCys: 0.585 ± 0.127
4.355IleAsp: 4.355 ± 0.307
5.175IleGlu: 5.175 ± 0.342
2.107IlePhe: 2.107 ± 0.232
3.536IleGly: 3.536 ± 0.35
1.264IleHis: 1.264 ± 0.187
3.559IleIle: 3.559 ± 0.334
4.823IleLys: 4.823 ± 0.306
4.753IleLeu: 4.753 ± 0.376
1.428IleMet: 1.428 ± 0.165
3.419IleAsn: 3.419 ± 0.292
2.201IlePro: 2.201 ± 0.227
2.318IleGln: 2.318 ± 0.213
3.114IleArg: 3.114 ± 0.229
4.215IleSer: 4.215 ± 0.331
4.191IleThr: 4.191 ± 0.304
4.004IleVal: 4.004 ± 0.309
0.421IleTrp: 0.421 ± 0.092
2.412IleTyr: 2.412 ± 0.274
0.0IleXaa: 0.0 ± 0.0
Lys
4.542LysAla: 4.542 ± 0.357
0.445LysCys: 0.445 ± 0.099
5.034LysAsp: 5.034 ± 0.364
7.75LysGlu: 7.75 ± 0.575
2.763LysPhe: 2.763 ± 0.322
4.496LysGly: 4.496 ± 0.306
1.803LysHis: 1.803 ± 0.212
4.402LysIle: 4.402 ± 0.325
6.603LysLys: 6.603 ± 0.511
5.69LysLeu: 5.69 ± 0.375
1.943LysMet: 1.943 ± 0.204
3.91LysAsn: 3.91 ± 0.254
2.154LysPro: 2.154 ± 0.245
2.763LysGln: 2.763 ± 0.275
3.7LysArg: 3.7 ± 0.308
4.449LysSer: 4.449 ± 0.353
4.074LysThr: 4.074 ± 0.322
4.777LysVal: 4.777 ± 0.344
0.702LysTrp: 0.702 ± 0.149
2.833LysTyr: 2.833 ± 0.254
0.0LysXaa: 0.0 ± 0.0
Leu
5.432LeuAla: 5.432 ± 0.353
0.585LeuCys: 0.585 ± 0.108
5.362LeuAsp: 5.362 ± 0.403
6.603LeuGlu: 6.603 ± 0.47
3.114LeuPhe: 3.114 ± 0.327
4.683LeuGly: 4.683 ± 0.39
1.616LeuHis: 1.616 ± 0.17
4.66LeuIle: 4.66 ± 0.326
6.299LeuLys: 6.299 ± 0.322
5.666LeuLeu: 5.666 ± 0.412
2.131LeuMet: 2.131 ± 0.196
4.777LeuAsn: 4.777 ± 0.348
3.278LeuPro: 3.278 ± 0.311
3.442LeuGln: 3.442 ± 0.375
4.215LeuArg: 4.215 ± 0.372
5.854LeuSer: 5.854 ± 0.472
5.69LeuThr: 5.69 ± 0.366
5.175LeuVal: 5.175 ± 0.407
0.656LeuTrp: 0.656 ± 0.122
3.676LeuTyr: 3.676 ± 0.339
0.0LeuXaa: 0.0 ± 0.0
Met
1.709MetAla: 1.709 ± 0.245
0.14MetCys: 0.14 ± 0.057
1.381MetAsp: 1.381 ± 0.196
1.943MetGlu: 1.943 ± 0.204
1.007MetPhe: 1.007 ± 0.153
1.147MetGly: 1.147 ± 0.17
0.351MetHis: 0.351 ± 0.08
1.569MetIle: 1.569 ± 0.183
2.505MetLys: 2.505 ± 0.25
2.178MetLeu: 2.178 ± 0.222
0.726MetMet: 0.726 ± 0.146
1.616MetAsn: 1.616 ± 0.176
0.96MetPro: 0.96 ± 0.131
0.96MetGln: 0.96 ± 0.152
1.241MetArg: 1.241 ± 0.15
1.826MetSer: 1.826 ± 0.236
1.967MetThr: 1.967 ± 0.188
1.288MetVal: 1.288 ± 0.17
0.281MetTrp: 0.281 ± 0.085
1.03MetTyr: 1.03 ± 0.181
0.0MetXaa: 0.0 ± 0.0
Asn
2.95AsnAla: 2.95 ± 0.327
0.351AsnCys: 0.351 ± 0.08
3.559AsnAsp: 3.559 ± 0.307
3.442AsnGlu: 3.442 ± 0.283
1.78AsnPhe: 1.78 ± 0.248
3.863AsnGly: 3.863 ± 0.358
1.007AsnHis: 1.007 ± 0.165
3.582AsnIle: 3.582 ± 0.337
3.863AsnLys: 3.863 ± 0.239
4.472AsnLeu: 4.472 ± 0.312
1.264AsnMet: 1.264 ± 0.22
3.301AsnAsn: 3.301 ± 0.337
2.763AsnPro: 2.763 ± 0.261
2.248AsnGln: 2.248 ± 0.245
2.576AsnArg: 2.576 ± 0.278
3.934AsnSer: 3.934 ± 0.366
3.208AsnThr: 3.208 ± 0.233
3.489AsnVal: 3.489 ± 0.338
0.562AsnTrp: 0.562 ± 0.116
1.92AsnTyr: 1.92 ± 0.214
0.0AsnXaa: 0.0 ± 0.0
Pro
2.271ProAla: 2.271 ± 0.415
0.117ProCys: 0.117 ± 0.048
1.756ProAsp: 1.756 ± 0.217
2.178ProGlu: 2.178 ± 0.218
1.428ProPhe: 1.428 ± 0.191
0.796ProGly: 0.796 ± 0.131
0.749ProHis: 0.749 ± 0.14
1.826ProIle: 1.826 ± 0.176
2.178ProLys: 2.178 ± 0.22
3.021ProLeu: 3.021 ± 0.269
0.749ProMet: 0.749 ± 0.143
1.943ProAsn: 1.943 ± 0.236
1.171ProPro: 1.171 ± 0.289
1.241ProGln: 1.241 ± 0.215
1.264ProArg: 1.264 ± 0.169
2.693ProSer: 2.693 ± 0.283
2.459ProThr: 2.459 ± 0.294
1.756ProVal: 1.756 ± 0.192
0.117ProTrp: 0.117 ± 0.051
1.522ProTyr: 1.522 ± 0.237
0.0ProXaa: 0.0 ± 0.0
Gln
2.669GlnAla: 2.669 ± 0.263
0.117GlnCys: 0.117 ± 0.058
2.154GlnAsp: 2.154 ± 0.24
3.676GlnGlu: 3.676 ± 0.281
1.522GlnPhe: 1.522 ± 0.162
1.99GlnGly: 1.99 ± 0.291
0.702GlnHis: 0.702 ± 0.139
1.967GlnIle: 1.967 ± 0.223
2.552GlnLys: 2.552 ± 0.262
3.746GlnLeu: 3.746 ± 0.326
1.171GlnMet: 1.171 ± 0.151
1.569GlnAsn: 1.569 ± 0.189
1.171GlnPro: 1.171 ± 0.28
2.107GlnGln: 2.107 ± 0.321
1.311GlnArg: 1.311 ± 0.208
2.435GlnSer: 2.435 ± 0.259
1.92GlnThr: 1.92 ± 0.228
2.622GlnVal: 2.622 ± 0.246
0.421GlnTrp: 0.421 ± 0.099
1.452GlnTyr: 1.452 ± 0.155
0.0GlnXaa: 0.0 ± 0.0
Arg
2.693ArgAla: 2.693 ± 0.24
0.351ArgCys: 0.351 ± 0.096
2.95ArgAsp: 2.95 ± 0.257
4.168ArgGlu: 4.168 ± 0.378
1.99ArgPhe: 1.99 ± 0.182
2.857ArgGly: 2.857 ± 0.307
0.773ArgHis: 0.773 ± 0.15
2.833ArgIle: 2.833 ± 0.27
3.84ArgLys: 3.84 ± 0.292
4.215ArgLeu: 4.215 ± 0.421
1.428ArgMet: 1.428 ± 0.171
2.318ArgAsn: 2.318 ± 0.24
1.077ArgPro: 1.077 ± 0.174
1.756ArgGln: 1.756 ± 0.166
2.224ArgArg: 2.224 ± 0.237
2.646ArgSer: 2.646 ± 0.26
2.412ArgThr: 2.412 ± 0.241
3.559ArgVal: 3.559 ± 0.276
0.445ArgTrp: 0.445 ± 0.098
2.084ArgTyr: 2.084 ± 0.22
0.0ArgXaa: 0.0 ± 0.0
Ser
4.144SerAla: 4.144 ± 0.34
0.515SerCys: 0.515 ± 0.121
3.91SerAsp: 3.91 ± 0.362
4.706SerGlu: 4.706 ± 0.335
3.138SerPhe: 3.138 ± 0.242
4.027SerGly: 4.027 ± 0.445
1.194SerHis: 1.194 ± 0.183
3.934SerIle: 3.934 ± 0.317
5.058SerLys: 5.058 ± 0.324
5.69SerLeu: 5.69 ± 0.334
1.826SerMet: 1.826 ± 0.182
3.489SerAsn: 3.489 ± 0.383
2.318SerPro: 2.318 ± 0.282
2.412SerGln: 2.412 ± 0.192
3.489SerArg: 3.489 ± 0.314
5.502SerSer: 5.502 ± 0.522
4.191SerThr: 4.191 ± 0.363
3.676SerVal: 3.676 ± 0.394
0.773SerTrp: 0.773 ± 0.137
2.388SerTyr: 2.388 ± 0.214
0.0SerXaa: 0.0 ± 0.0
Thr
3.7ThrAla: 3.7 ± 0.346
0.398ThrCys: 0.398 ± 0.084
3.793ThrAsp: 3.793 ± 0.265
4.66ThrGlu: 4.66 ± 0.297
2.388ThrPhe: 2.388 ± 0.301
4.425ThrGly: 4.425 ± 0.413
0.82ThrHis: 0.82 ± 0.12
3.512ThrIle: 3.512 ± 0.373
3.7ThrLys: 3.7 ± 0.27
4.917ThrLeu: 4.917 ± 0.341
1.124ThrMet: 1.124 ± 0.179
3.231ThrAsn: 3.231 ± 0.316
2.154ThrPro: 2.154 ± 0.297
2.435ThrGln: 2.435 ± 0.278
2.95ThrArg: 2.95 ± 0.287
3.981ThrSer: 3.981 ± 0.386
4.144ThrThr: 4.144 ± 0.346
5.526ThrVal: 5.526 ± 0.441
0.398ThrTrp: 0.398 ± 0.092
2.693ThrTyr: 2.693 ± 0.237
0.0ThrXaa: 0.0 ± 0.0
Val
4.519ValAla: 4.519 ± 0.364
0.539ValCys: 0.539 ± 0.121
4.987ValAsp: 4.987 ± 0.343
5.011ValGlu: 5.011 ± 0.376
2.693ValPhe: 2.693 ± 0.308
3.887ValGly: 3.887 ± 0.368
1.358ValHis: 1.358 ± 0.179
4.144ValIle: 4.144 ± 0.362
4.964ValLys: 4.964 ± 0.323
5.737ValLeu: 5.737 ± 0.503
1.592ValMet: 1.592 ± 0.232
3.465ValAsn: 3.465 ± 0.32
2.646ValPro: 2.646 ± 0.231
2.318ValGln: 2.318 ± 0.203
3.278ValArg: 3.278 ± 0.312
5.198ValSer: 5.198 ± 0.345
5.034ValThr: 5.034 ± 0.465
4.87ValVal: 4.87 ± 0.398
0.539ValTrp: 0.539 ± 0.131
3.325ValTyr: 3.325 ± 0.235
0.0ValXaa: 0.0 ± 0.0
Trp
0.656TrpAla: 0.656 ± 0.111
0.047TrpCys: 0.047 ± 0.033
0.773TrpAsp: 0.773 ± 0.146
0.96TrpGlu: 0.96 ± 0.14
0.398TrpPhe: 0.398 ± 0.086
0.562TrpGly: 0.562 ± 0.125
0.258TrpHis: 0.258 ± 0.079
0.539TrpIle: 0.539 ± 0.112
0.796TrpLys: 0.796 ± 0.119
0.679TrpLeu: 0.679 ± 0.125
0.187TrpMet: 0.187 ± 0.071
0.679TrpAsn: 0.679 ± 0.127
0.0TrpPro: 0.0 ± 0.0
0.258TrpGln: 0.258 ± 0.064
0.258TrpArg: 0.258 ± 0.067
0.492TrpSer: 0.492 ± 0.106
0.702TrpThr: 0.702 ± 0.13
0.796TrpVal: 0.796 ± 0.149
0.234TrpTrp: 0.234 ± 0.087
0.515TrpTyr: 0.515 ± 0.116
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.74TyrAla: 2.74 ± 0.217
0.351TyrCys: 0.351 ± 0.083
3.044TyrAsp: 3.044 ± 0.268
3.7TyrGlu: 3.7 ± 0.263
1.358TyrPhe: 1.358 ± 0.176
2.669TyrGly: 2.669 ± 0.269
0.843TyrHis: 0.843 ± 0.155
2.763TyrIle: 2.763 ± 0.267
2.669TyrLys: 2.669 ± 0.237
3.957TyrLeu: 3.957 ± 0.268
0.913TyrMet: 0.913 ± 0.166
2.482TyrAsn: 2.482 ± 0.291
1.241TyrPro: 1.241 ± 0.167
1.405TyrGln: 1.405 ± 0.147
2.599TyrArg: 2.599 ± 0.272
2.857TyrSer: 2.857 ± 0.322
2.412TyrThr: 2.412 ± 0.2
2.552TyrVal: 2.552 ± 0.248
0.328TyrTrp: 0.328 ± 0.084
1.639TyrTyr: 1.639 ± 0.216
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 204 proteins (42709 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski