Amino acid dipepetide frequency for Synechococcus phage ACG-2014g

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.272AlaAla: 6.272 ± 0.518
0.647AlaCys: 0.647 ± 0.151
4.564AlaAsp: 4.564 ± 0.375
4.115AlaGlu: 4.115 ± 0.321
2.785AlaPhe: 2.785 ± 0.257
6.451AlaGly: 6.451 ± 0.49
0.916AlaHis: 0.916 ± 0.15
4.043AlaIle: 4.043 ± 0.258
3.666AlaLys: 3.666 ± 0.495
4.6AlaLeu: 4.6 ± 0.342
1.456AlaMet: 1.456 ± 0.208
4.097AlaAsn: 4.097 ± 0.344
2.588AlaPro: 2.588 ± 0.242
2.174AlaGln: 2.174 ± 0.176
2.498AlaArg: 2.498 ± 0.196
5.337AlaSer: 5.337 ± 0.397
5.93AlaThr: 5.93 ± 0.636
4.169AlaVal: 4.169 ± 0.351
0.503AlaTrp: 0.503 ± 0.089
2.138AlaTyr: 2.138 ± 0.174
0.0AlaXaa: 0.0 ± 0.0
Cys
0.629CysAla: 0.629 ± 0.112
0.054CysCys: 0.054 ± 0.042
0.557CysAsp: 0.557 ± 0.111
0.665CysGlu: 0.665 ± 0.116
0.521CysPhe: 0.521 ± 0.106
0.557CysGly: 0.557 ± 0.115
0.341CysHis: 0.341 ± 0.092
0.593CysIle: 0.593 ± 0.142
0.593CysLys: 0.593 ± 0.12
0.683CysLeu: 0.683 ± 0.118
0.323CysMet: 0.323 ± 0.091
0.413CysAsn: 0.413 ± 0.094
0.323CysPro: 0.323 ± 0.089
0.234CysGln: 0.234 ± 0.078
0.341CysArg: 0.341 ± 0.096
0.503CysSer: 0.503 ± 0.092
0.575CysThr: 0.575 ± 0.113
0.665CysVal: 0.665 ± 0.126
0.072CysTrp: 0.072 ± 0.035
0.305CysTyr: 0.305 ± 0.07
0.0CysXaa: 0.0 ± 0.0
Asp
5.014AspAla: 5.014 ± 0.296
0.557AspCys: 0.557 ± 0.122
4.618AspAsp: 4.618 ± 0.446
4.115AspGlu: 4.115 ± 0.283
3.145AspPhe: 3.145 ± 0.248
6.379AspGly: 6.379 ± 0.556
0.665AspHis: 0.665 ± 0.108
4.079AspIle: 4.079 ± 0.288
3.091AspLys: 3.091 ± 0.327
4.457AspLeu: 4.457 ± 0.275
1.438AspMet: 1.438 ± 0.21
3.882AspAsn: 3.882 ± 0.34
2.803AspPro: 2.803 ± 0.237
1.725AspGln: 1.725 ± 0.173
2.372AspArg: 2.372 ± 0.278
4.403AspSer: 4.403 ± 0.308
4.582AspThr: 4.582 ± 0.349
4.528AspVal: 4.528 ± 0.342
0.881AspTrp: 0.881 ± 0.124
3.001AspTyr: 3.001 ± 0.231
0.0AspXaa: 0.0 ± 0.0
Glu
3.253GluAla: 3.253 ± 0.262
0.845GluCys: 0.845 ± 0.157
3.81GluAsp: 3.81 ± 0.325
4.744GluGlu: 4.744 ± 0.572
3.145GluPhe: 3.145 ± 0.254
3.953GluGly: 3.953 ± 0.222
0.773GluHis: 0.773 ± 0.153
4.241GluIle: 4.241 ± 0.361
3.396GluLys: 3.396 ± 0.413
5.032GluLeu: 5.032 ± 0.365
1.563GluMet: 1.563 ± 0.283
3.648GluAsn: 3.648 ± 0.236
1.851GluPro: 1.851 ± 0.169
2.354GluGln: 2.354 ± 0.275
2.767GluArg: 2.767 ± 0.321
4.007GluSer: 4.007 ± 0.266
4.061GluThr: 4.061 ± 0.334
4.618GluVal: 4.618 ± 0.272
0.988GluTrp: 0.988 ± 0.151
2.731GluTyr: 2.731 ± 0.221
0.0GluXaa: 0.0 ± 0.0
Phe
2.749PheAla: 2.749 ± 0.275
0.539PheCys: 0.539 ± 0.12
3.612PheAsp: 3.612 ± 0.228
2.624PheGlu: 2.624 ± 0.271
2.12PhePhe: 2.12 ± 0.221
3.289PheGly: 3.289 ± 0.229
0.701PheHis: 0.701 ± 0.143
2.588PheIle: 2.588 ± 0.228
2.085PheLys: 2.085 ± 0.216
3.181PheLeu: 3.181 ± 0.333
0.845PheMet: 0.845 ± 0.165
2.965PheAsn: 2.965 ± 0.218
1.815PhePro: 1.815 ± 0.208
1.581PheGln: 1.581 ± 0.165
1.617PheArg: 1.617 ± 0.161
3.342PheSer: 3.342 ± 0.228
3.324PheThr: 3.324 ± 0.407
2.66PheVal: 2.66 ± 0.281
0.377PheTrp: 0.377 ± 0.079
1.635PheTyr: 1.635 ± 0.148
0.0PheXaa: 0.0 ± 0.0
Gly
6.487GlyAla: 6.487 ± 0.509
0.611GlyCys: 0.611 ± 0.111
5.445GlyAsp: 5.445 ± 0.525
4.259GlyGlu: 4.259 ± 0.276
3.181GlyPhe: 3.181 ± 0.262
7.781GlyGly: 7.781 ± 0.965
0.881GlyHis: 0.881 ± 0.131
4.331GlyIle: 4.331 ± 0.324
3.953GlyLys: 3.953 ± 0.406
4.69GlyLeu: 4.69 ± 0.289
1.635GlyMet: 1.635 ± 0.239
4.798GlyAsn: 4.798 ± 0.539
1.923GlyPro: 1.923 ± 0.185
2.66GlyGln: 2.66 ± 0.213
2.983GlyArg: 2.983 ± 0.264
6.721GlySer: 6.721 ± 0.747
6.918GlyThr: 6.918 ± 0.748
5.481GlyVal: 5.481 ± 0.419
1.132GlyTrp: 1.132 ± 0.159
3.36GlyTyr: 3.36 ± 0.231
0.0GlyXaa: 0.0 ± 0.0
His
0.755HisAla: 0.755 ± 0.118
0.162HisCys: 0.162 ± 0.059
0.791HisAsp: 0.791 ± 0.137
0.916HisGlu: 0.916 ± 0.158
0.934HisPhe: 0.934 ± 0.163
1.06HisGly: 1.06 ± 0.141
0.449HisHis: 0.449 ± 0.103
0.845HisIle: 0.845 ± 0.126
0.737HisLys: 0.737 ± 0.14
0.988HisLeu: 0.988 ± 0.165
0.288HisMet: 0.288 ± 0.078
0.665HisAsn: 0.665 ± 0.114
0.845HisPro: 0.845 ± 0.15
0.395HisGln: 0.395 ± 0.08
0.791HisArg: 0.791 ± 0.136
1.006HisSer: 1.006 ± 0.146
1.096HisThr: 1.096 ± 0.14
1.042HisVal: 1.042 ± 0.142
0.288HisTrp: 0.288 ± 0.078
0.773HisTyr: 0.773 ± 0.132
0.0HisXaa: 0.0 ± 0.0
Ile
4.726IleAla: 4.726 ± 0.333
0.737IleCys: 0.737 ± 0.127
4.528IleAsp: 4.528 ± 0.266
4.421IleGlu: 4.421 ± 0.287
2.66IlePhe: 2.66 ± 0.209
4.349IleGly: 4.349 ± 0.368
0.593IleHis: 0.593 ± 0.088
4.007IleIle: 4.007 ± 0.318
3.378IleLys: 3.378 ± 0.267
4.385IleLeu: 4.385 ± 0.318
1.078IleMet: 1.078 ± 0.184
4.295IleAsn: 4.295 ± 0.285
2.731IlePro: 2.731 ± 0.262
2.498IleGln: 2.498 ± 0.233
2.606IleArg: 2.606 ± 0.22
4.762IleSer: 4.762 ± 0.505
5.499IleThr: 5.499 ± 0.584
4.205IleVal: 4.205 ± 0.343
0.683IleTrp: 0.683 ± 0.112
2.031IleTyr: 2.031 ± 0.195
0.0IleXaa: 0.0 ± 0.0
Lys
3.199LysAla: 3.199 ± 0.41
0.503LysCys: 0.503 ± 0.089
3.091LysAsp: 3.091 ± 0.31
3.81LysGlu: 3.81 ± 0.558
2.498LysPhe: 2.498 ± 0.226
3.576LysGly: 3.576 ± 0.381
1.078LysHis: 1.078 ± 0.196
3.864LysIle: 3.864 ± 0.319
4.51LysLys: 4.51 ± 0.704
4.618LysLeu: 4.618 ± 0.36
1.15LysMet: 1.15 ± 0.198
2.875LysAsn: 2.875 ± 0.279
1.887LysPro: 1.887 ± 0.265
1.905LysGln: 1.905 ± 0.209
2.462LysArg: 2.462 ± 0.283
3.702LysSer: 3.702 ± 0.295
3.414LysThr: 3.414 ± 0.232
3.792LysVal: 3.792 ± 0.275
0.719LysTrp: 0.719 ± 0.152
2.696LysTyr: 2.696 ± 0.369
0.0LysXaa: 0.0 ± 0.0
Leu
4.043LeuAla: 4.043 ± 0.297
0.647LeuCys: 0.647 ± 0.133
5.427LeuAsp: 5.427 ± 0.388
4.331LeuGlu: 4.331 ± 0.336
2.947LeuPhe: 2.947 ± 0.2
4.762LeuGly: 4.762 ± 0.309
1.474LeuHis: 1.474 ± 0.206
4.277LeuIle: 4.277 ± 0.285
4.654LeuLys: 4.654 ± 0.464
4.978LeuLeu: 4.978 ± 0.362
1.33LeuMet: 1.33 ± 0.212
4.69LeuAsn: 4.69 ± 0.272
2.839LeuPro: 2.839 ± 0.238
2.893LeuGln: 2.893 ± 0.267
3.199LeuArg: 3.199 ± 0.26
5.517LeuSer: 5.517 ± 0.296
5.786LeuThr: 5.786 ± 0.507
4.475LeuVal: 4.475 ± 0.253
0.665LeuTrp: 0.665 ± 0.127
3.181LeuTyr: 3.181 ± 0.295
0.0LeuXaa: 0.0 ± 0.0
Met
1.42MetAla: 1.42 ± 0.212
0.18MetCys: 0.18 ± 0.061
1.168MetAsp: 1.168 ± 0.218
1.15MetGlu: 1.15 ± 0.211
0.845MetPhe: 0.845 ± 0.16
1.402MetGly: 1.402 ± 0.233
0.395MetHis: 0.395 ± 0.105
1.042MetIle: 1.042 ± 0.185
1.527MetLys: 1.527 ± 0.277
1.599MetLeu: 1.599 ± 0.241
0.467MetMet: 0.467 ± 0.109
1.276MetAsn: 1.276 ± 0.214
0.934MetPro: 0.934 ± 0.163
1.006MetGln: 1.006 ± 0.178
0.934MetArg: 0.934 ± 0.179
1.509MetSer: 1.509 ± 0.207
1.581MetThr: 1.581 ± 0.207
0.863MetVal: 0.863 ± 0.136
0.305MetTrp: 0.305 ± 0.077
0.539MetTyr: 0.539 ± 0.116
0.0MetXaa: 0.0 ± 0.0
Asn
4.295AsnAla: 4.295 ± 0.364
0.449AsnCys: 0.449 ± 0.08
3.001AsnAsp: 3.001 ± 0.23
3.666AsnGlu: 3.666 ± 0.265
2.821AsnPhe: 2.821 ± 0.219
4.51AsnGly: 4.51 ± 0.399
0.827AsnHis: 0.827 ± 0.121
4.582AsnIle: 4.582 ± 0.278
3.037AsnLys: 3.037 ± 0.244
4.546AsnLeu: 4.546 ± 0.273
0.899AsnMet: 0.899 ± 0.16
3.468AsnAsn: 3.468 ± 0.379
3.163AsnPro: 3.163 ± 0.259
1.941AsnGln: 1.941 ± 0.178
2.228AsnArg: 2.228 ± 0.164
4.133AsnSer: 4.133 ± 0.409
4.097AsnThr: 4.097 ± 0.394
4.439AsnVal: 4.439 ± 0.339
0.575AsnTrp: 0.575 ± 0.114
2.767AsnTyr: 2.767 ± 0.216
0.0AsnXaa: 0.0 ± 0.0
Pro
2.426ProAla: 2.426 ± 0.192
0.216ProCys: 0.216 ± 0.054
2.731ProAsp: 2.731 ± 0.243
2.66ProGlu: 2.66 ± 0.241
1.617ProPhe: 1.617 ± 0.191
3.342ProGly: 3.342 ± 0.347
0.737ProHis: 0.737 ± 0.145
2.408ProIle: 2.408 ± 0.313
2.138ProLys: 2.138 ± 0.274
2.156ProLeu: 2.156 ± 0.23
0.647ProMet: 0.647 ± 0.138
1.995ProAsn: 1.995 ± 0.177
1.869ProPro: 1.869 ± 0.206
1.132ProGln: 1.132 ± 0.119
1.617ProArg: 1.617 ± 0.168
3.396ProSer: 3.396 ± 0.285
3.378ProThr: 3.378 ± 0.257
2.462ProVal: 2.462 ± 0.254
0.557ProTrp: 0.557 ± 0.12
1.635ProTyr: 1.635 ± 0.163
0.0ProXaa: 0.0 ± 0.0
Gln
2.013GlnAla: 2.013 ± 0.224
0.305GlnCys: 0.305 ± 0.085
2.156GlnAsp: 2.156 ± 0.177
2.3GlnGlu: 2.3 ± 0.286
1.509GlnPhe: 1.509 ± 0.164
2.336GlnGly: 2.336 ± 0.199
0.611GlnHis: 0.611 ± 0.1
2.534GlnIle: 2.534 ± 0.248
2.336GlnLys: 2.336 ± 0.315
2.947GlnLeu: 2.947 ± 0.217
1.078GlnMet: 1.078 ± 0.217
1.563GlnAsn: 1.563 ± 0.175
1.456GlnPro: 1.456 ± 0.14
1.527GlnGln: 1.527 ± 0.18
1.294GlnArg: 1.294 ± 0.127
2.516GlnSer: 2.516 ± 0.233
2.192GlnThr: 2.192 ± 0.194
2.713GlnVal: 2.713 ± 0.226
0.449GlnTrp: 0.449 ± 0.094
1.851GlnTyr: 1.851 ± 0.199
0.0GlnXaa: 0.0 ± 0.0
Arg
2.857ArgAla: 2.857 ± 0.218
0.341ArgCys: 0.341 ± 0.092
2.408ArgAsp: 2.408 ± 0.245
2.156ArgGlu: 2.156 ± 0.272
1.563ArgPhe: 1.563 ± 0.149
2.785ArgGly: 2.785 ± 0.263
0.773ArgHis: 0.773 ± 0.146
2.767ArgIle: 2.767 ± 0.257
2.498ArgLys: 2.498 ± 0.33
3.271ArgLeu: 3.271 ± 0.211
0.97ArgMet: 0.97 ± 0.19
2.12ArgAsn: 2.12 ± 0.191
1.24ArgPro: 1.24 ± 0.149
1.779ArgGln: 1.779 ± 0.183
1.851ArgArg: 1.851 ± 0.32
2.336ArgSer: 2.336 ± 0.21
2.678ArgThr: 2.678 ± 0.381
3.073ArgVal: 3.073 ± 0.193
0.359ArgTrp: 0.359 ± 0.083
2.156ArgTyr: 2.156 ± 0.217
0.0ArgXaa: 0.0 ± 0.0
Ser
5.445SerAla: 5.445 ± 0.42
0.431SerCys: 0.431 ± 0.104
4.115SerAsp: 4.115 ± 0.305
3.882SerGlu: 3.882 ± 0.298
3.253SerPhe: 3.253 ± 0.221
7.637SerGly: 7.637 ± 0.832
0.863SerHis: 0.863 ± 0.11
4.708SerIle: 4.708 ± 0.466
3.828SerLys: 3.828 ± 0.317
5.517SerLeu: 5.517 ± 0.284
1.312SerMet: 1.312 ± 0.207
4.564SerAsn: 4.564 ± 0.278
2.749SerPro: 2.749 ± 0.295
2.57SerGln: 2.57 ± 0.202
2.552SerArg: 2.552 ± 0.308
5.93SerSer: 5.93 ± 0.529
5.427SerThr: 5.427 ± 0.388
5.086SerVal: 5.086 ± 0.463
0.755SerTrp: 0.755 ± 0.095
3.109SerTyr: 3.109 ± 0.216
0.0SerXaa: 0.0 ± 0.0
Thr
5.93ThrAla: 5.93 ± 0.643
0.575ThrCys: 0.575 ± 0.11
4.726ThrAsp: 4.726 ± 0.472
4.421ThrGlu: 4.421 ± 0.331
3.36ThrPhe: 3.36 ± 0.4
7.26ThrGly: 7.26 ± 0.742
0.934ThrHis: 0.934 ± 0.141
5.032ThrIle: 5.032 ± 0.622
3.217ThrLys: 3.217 ± 0.285
6.254ThrLeu: 6.254 ± 0.389
1.06ThrMet: 1.06 ± 0.161
3.917ThrAsn: 3.917 ± 0.472
3.055ThrPro: 3.055 ± 0.273
2.839ThrGln: 2.839 ± 0.226
2.39ThrArg: 2.39 ± 0.185
5.086ThrSer: 5.086 ± 0.452
6.343ThrThr: 6.343 ± 0.618
6.002ThrVal: 6.002 ± 0.674
0.701ThrTrp: 0.701 ± 0.108
3.091ThrTyr: 3.091 ± 0.25
0.0ThrXaa: 0.0 ± 0.0
Val
4.313ValAla: 4.313 ± 0.29
0.539ValCys: 0.539 ± 0.112
4.906ValAsp: 4.906 ± 0.36
4.259ValGlu: 4.259 ± 0.241
2.749ValPhe: 2.749 ± 0.214
4.708ValGly: 4.708 ± 0.491
0.791ValHis: 0.791 ± 0.141
4.708ValIle: 4.708 ± 0.307
3.522ValLys: 3.522 ± 0.307
4.618ValLeu: 4.618 ± 0.216
1.33ValMet: 1.33 ± 0.184
4.493ValAsn: 4.493 ± 0.365
3.127ValPro: 3.127 ± 0.196
2.444ValGln: 2.444 ± 0.223
2.696ValArg: 2.696 ± 0.201
5.84ValSer: 5.84 ± 0.382
5.858ValThr: 5.858 ± 0.62
4.636ValVal: 4.636 ± 0.298
0.467ValTrp: 0.467 ± 0.079
2.318ValTyr: 2.318 ± 0.222
0.0ValXaa: 0.0 ± 0.0
Trp
0.827TrpAla: 0.827 ± 0.111
0.108TrpCys: 0.108 ± 0.048
0.647TrpAsp: 0.647 ± 0.118
0.755TrpGlu: 0.755 ± 0.146
0.467TrpPhe: 0.467 ± 0.104
0.629TrpGly: 0.629 ± 0.109
0.323TrpHis: 0.323 ± 0.085
0.575TrpIle: 0.575 ± 0.094
0.629TrpLys: 0.629 ± 0.11
0.647TrpLeu: 0.647 ± 0.134
0.377TrpMet: 0.377 ± 0.099
0.737TrpAsn: 0.737 ± 0.137
0.216TrpPro: 0.216 ± 0.066
0.449TrpGln: 0.449 ± 0.085
0.593TrpArg: 0.593 ± 0.088
0.934TrpSer: 0.934 ± 0.128
0.773TrpThr: 0.773 ± 0.128
0.809TrpVal: 0.809 ± 0.126
0.126TrpTrp: 0.126 ± 0.045
0.341TrpTyr: 0.341 ± 0.069
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.318TyrAla: 2.318 ± 0.179
0.503TyrCys: 0.503 ± 0.087
3.432TyrAsp: 3.432 ± 0.289
2.534TyrGlu: 2.534 ± 0.26
1.527TyrPhe: 1.527 ± 0.196
2.462TyrGly: 2.462 ± 0.215
0.701TyrHis: 0.701 ± 0.162
2.947TyrIle: 2.947 ± 0.24
2.552TyrLys: 2.552 ± 0.249
2.947TyrLeu: 2.947 ± 0.216
0.881TyrMet: 0.881 ± 0.184
2.983TyrAsn: 2.983 ± 0.21
1.707TyrPro: 1.707 ± 0.18
1.581TyrGln: 1.581 ± 0.188
2.192TyrArg: 2.192 ± 0.235
2.749TyrSer: 2.749 ± 0.246
2.678TyrThr: 2.678 ± 0.37
2.606TyrVal: 2.606 ± 0.254
0.359TyrTrp: 0.359 ± 0.081
1.995TyrTyr: 1.995 ± 0.183
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 216 proteins (55649 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski