Amino acid dipeptide frequency for Synechococcus phage ACG-2014b

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly (with equal probability) in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
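The expected value quoted above follows from simple arithmetic: 1/400 = 0.25%, i.e. 2.5 per mille, or 1/441 (about 2.27 per mille) when Xaa is included. The following is a minimal sketch of that calculation and of a naive over/under labelling. The function names are hypothetical, and the exact rule the page uses to colour cells is not stated, so the plain comparison against the uniform expectation is an assumption.

```python
def expected_uniform_per_mille(alphabet_size: int = 20) -> float:
    """Expected dipeptide frequency (per mille) if every pair were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille: float, alphabet_size: int = 20) -> str:
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a simple greater/less comparison; the page's actual
    colouring threshold is not documented here.
    """
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: GlyGly (7.438) and CysCys (0.055).
print(expected_uniform_per_mille())  # 20 standard amino acids
print(classify(7.438))
print(classify(0.055))
```

With the 20-letter alphabet, GlyGly (7.438‰) lies above the 2.5‰ expectation and CysCys (0.055‰) lies below it.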

All values are presented in per mille (‰); to obtain fractions, multiply them by 10⁻³.

For more information, see the sequence space article on Wikipedia.
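As an illustration of how such per mille values can be obtained, here is a minimal sketch. It assumes one-letter amino acid codes, overlapping adjacent pairs counted within each protein (never across protein boundaries), and normalisation by the total number of dipeptides. The page does not state these details, so they are assumptions, and the function names are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 inside a single protein.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# Toy input (not real data): "AAG" gives AA and AG, "GA" gives GA.
freqs = dipeptide_per_mille(["AAG", "GA"])
print(freqs)
print(per_mille_to_fraction(6.086))  # AlaAla value from the table
```

In the toy input each of the three observed dipeptides accounts for one third of all pairs, so the per mille values sum to 1000.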

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 6.086 ± 0.46 | 0.512 ± 0.113 | 4.569 ± 0.337 | 4.295 ± 0.352 | 2.65 ± 0.254 | 6.36 ± 0.485 | 0.914 ± 0.152 | 3.984 ± 0.292 | 3.6 ± 0.375 | 4.752 ± 0.338 | 1.298 ± 0.188 | 3.948 ± 0.34 | 2.394 ± 0.218 | 2.54 ± 0.212 | 2.65 ± 0.245 | 5.464 ± 0.468 | 5.428 ± 0.537 | 4.258 ± 0.372 | 0.475 ± 0.097 | 2.01 ± 0.203 | 0.0 ± 0.0 |
| Cys | 0.548 ± 0.107 | 0.055 ± 0.037 | 0.621 ± 0.143 | 0.658 ± 0.134 | 0.567 ± 0.123 | 0.676 ± 0.137 | 0.274 ± 0.075 | 0.621 ± 0.117 | 0.621 ± 0.124 | 0.621 ± 0.12 | 0.311 ± 0.078 | 0.439 ± 0.098 | 0.274 ± 0.077 | 0.292 ± 0.07 | 0.347 ± 0.086 | 0.822 ± 0.15 | 0.585 ± 0.136 | 0.621 ± 0.103 | 0.091 ± 0.048 | 0.256 ± 0.073 | 0.0 ± 0.0 |
| Asp | 5.099 ± 0.306 | 0.603 ± 0.125 | 4.861 ± 0.41 | 4.149 ± 0.292 | 3.015 ± 0.214 | 6.214 ± 0.364 | 0.987 ± 0.148 | 4.459 ± 0.352 | 3.271 ± 0.256 | 4.569 ± 0.337 | 1.608 ± 0.227 | 3.216 ± 0.227 | 3.089 ± 0.28 | 1.846 ± 0.161 | 2.321 ± 0.213 | 4.806 ± 0.307 | 4.368 ± 0.378 | 4.496 ± 0.284 | 0.932 ± 0.142 | 3.034 ± 0.251 | 0.0 ± 0.0 |
| Glu | 3.582 ± 0.296 | 0.64 ± 0.113 | 4.13 ± 0.365 | 4.642 ± 0.449 | 2.778 ± 0.233 | 3.71 ± 0.288 | 0.804 ± 0.125 | 4.258 ± 0.249 | 3.454 ± 0.469 | 5.556 ± 0.331 | 1.608 ± 0.22 | 3.82 ± 0.241 | 1.59 ± 0.174 | 2.485 ± 0.268 | 3.052 ± 0.354 | 3.948 ± 0.317 | 3.71 ± 0.259 | 5.19 ± 0.298 | 0.676 ± 0.108 | 2.924 ± 0.228 | 0.0 ± 0.0 |
| Phe | 2.814 ± 0.258 | 0.53 ± 0.128 | 3.801 ± 0.294 | 2.705 ± 0.199 | 1.974 ± 0.164 | 2.979 ± 0.239 | 0.457 ± 0.121 | 2.814 ± 0.216 | 2.449 ± 0.25 | 3.052 ± 0.294 | 0.95 ± 0.166 | 2.723 ± 0.245 | 1.828 ± 0.183 | 1.499 ± 0.144 | 1.462 ± 0.15 | 3.29 ± 0.278 | 3.143 ± 0.302 | 2.668 ± 0.311 | 0.311 ± 0.074 | 1.681 ± 0.167 | 0.0 ± 0.0 |
| Gly | 6.031 ± 0.455 | 0.658 ± 0.134 | 4.898 ± 0.32 | 4.313 ± 0.305 | 2.979 ± 0.228 | 7.438 ± 0.987 | 0.877 ± 0.166 | 4.35 ± 0.29 | 4.039 ± 0.361 | 4.624 ± 0.27 | 1.59 ± 0.28 | 4.861 ± 0.417 | 1.901 ± 0.224 | 2.869 ± 0.258 | 2.942 ± 0.318 | 6.378 ± 0.584 | 7.091 ± 0.766 | 4.934 ± 0.333 | 0.969 ± 0.138 | 3.472 ± 0.278 | 0.0 ± 0.0 |
| His | 0.749 ± 0.111 | 0.164 ± 0.057 | 0.822 ± 0.152 | 1.005 ± 0.178 | 0.877 ± 0.149 | 0.95 ± 0.154 | 0.311 ± 0.087 | 0.914 ± 0.129 | 0.859 ± 0.174 | 0.95 ± 0.141 | 0.238 ± 0.066 | 0.713 ± 0.119 | 0.877 ± 0.181 | 0.457 ± 0.101 | 0.548 ± 0.086 | 0.694 ± 0.089 | 1.042 ± 0.165 | 0.932 ± 0.15 | 0.292 ± 0.08 | 0.731 ± 0.121 | 0.0 ± 0.0 |
| Ile | 4.112 ± 0.289 | 0.841 ± 0.173 | 4.587 ± 0.275 | 4.423 ± 0.338 | 2.76 ± 0.179 | 4.423 ± 0.294 | 0.676 ± 0.121 | 4.24 ± 0.322 | 3.966 ± 0.289 | 4.861 ± 0.344 | 1.042 ± 0.167 | 4.295 ± 0.282 | 2.814 ± 0.23 | 2.504 ± 0.23 | 2.339 ± 0.214 | 5.154 ± 0.483 | 5.83 ± 0.528 | 4.167 ± 0.311 | 0.512 ± 0.11 | 2.065 ± 0.205 | 0.0 ± 0.0 |
| Lys | 3.052 ± 0.346 | 0.493 ± 0.108 | 3.162 ± 0.354 | 3.619 ± 0.422 | 2.467 ± 0.277 | 3.326 ± 0.337 | 0.877 ± 0.156 | 4.167 ± 0.254 | 4.313 ± 0.625 | 4.806 ± 0.392 | 1.572 ± 0.265 | 3.089 ± 0.296 | 1.882 ± 0.237 | 2.01 ± 0.247 | 2.321 ± 0.257 | 3.692 ± 0.344 | 3.765 ± 0.311 | 4.057 ± 0.298 | 0.731 ± 0.144 | 2.595 ± 0.366 | 0.0 ± 0.0 |
| Leu | 4.13 ± 0.356 | 0.896 ± 0.168 | 5.665 ± 0.313 | 4.258 ± 0.327 | 3.034 ± 0.228 | 4.861 ± 0.379 | 1.243 ± 0.165 | 4.532 ± 0.31 | 4.605 ± 0.365 | 5.428 ± 0.414 | 1.316 ± 0.198 | 4.715 ± 0.308 | 3.052 ± 0.234 | 3.198 ± 0.236 | 3.18 ± 0.226 | 4.843 ± 0.343 | 5.446 ± 0.498 | 4.094 ± 0.273 | 0.585 ± 0.123 | 3.216 ± 0.257 | 0.0 ± 0.0 |
| Met | 1.334 ± 0.193 | 0.201 ± 0.067 | 1.151 ± 0.192 | 1.298 ± 0.223 | 0.749 ± 0.133 | 1.243 ± 0.2 | 0.384 ± 0.091 | 0.987 ± 0.176 | 1.352 ± 0.256 | 1.645 ± 0.205 | 0.64 ± 0.15 | 1.535 ± 0.209 | 1.005 ± 0.149 | 1.023 ± 0.191 | 0.969 ± 0.193 | 1.517 ± 0.222 | 1.553 ± 0.226 | 0.914 ± 0.124 | 0.329 ± 0.089 | 0.749 ± 0.128 | 0.0 ± 0.0 |
| Asn | 4.276 ± 0.347 | 0.621 ± 0.112 | 3.436 ± 0.252 | 3.509 ± 0.224 | 2.522 ± 0.239 | 4.441 ± 0.372 | 0.786 ± 0.112 | 4.276 ± 0.266 | 3.034 ± 0.275 | 4.733 ± 0.373 | 1.023 ± 0.176 | 3.326 ± 0.359 | 3.034 ± 0.233 | 1.974 ± 0.189 | 2.157 ± 0.155 | 4.039 ± 0.323 | 4.441 ± 0.428 | 4.386 ± 0.363 | 0.567 ± 0.107 | 2.449 ± 0.179 | 0.0 ± 0.0 |
| Pro | 2.358 ± 0.248 | 0.292 ± 0.084 | 2.723 ± 0.276 | 2.613 ± 0.212 | 1.627 ± 0.195 | 3.308 ± 0.268 | 0.676 ± 0.122 | 2.54 ± 0.221 | 2.321 ± 0.281 | 2.083 ± 0.205 | 0.603 ± 0.137 | 2.284 ± 0.168 | 1.828 ± 0.293 | 1.261 ± 0.16 | 1.462 ± 0.148 | 3.271 ± 0.35 | 2.961 ± 0.213 | 2.431 ± 0.268 | 0.493 ± 0.11 | 1.754 ± 0.207 | 0.0 ± 0.0 |
| Gln | 1.901 ± 0.21 | 0.256 ± 0.088 | 2.138 ± 0.205 | 2.467 ± 0.281 | 1.809 ± 0.164 | 2.358 ± 0.205 | 0.64 ± 0.113 | 2.54 ± 0.203 | 2.449 ± 0.303 | 3.198 ± 0.239 | 0.932 ± 0.179 | 1.901 ± 0.186 | 1.097 ± 0.148 | 1.389 ± 0.187 | 1.663 ± 0.183 | 2.595 ± 0.199 | 2.157 ± 0.316 | 2.705 ± 0.201 | 0.475 ± 0.094 | 1.791 ± 0.211 | 0.0 ± 0.0 |
| Arg | 2.412 ± 0.231 | 0.384 ± 0.093 | 2.138 ± 0.195 | 2.321 ± 0.25 | 1.627 ± 0.157 | 2.814 ± 0.215 | 0.713 ± 0.131 | 3.052 ± 0.222 | 2.504 ± 0.319 | 3.29 ± 0.276 | 1.078 ± 0.181 | 2.449 ± 0.221 | 1.298 ± 0.137 | 1.444 ± 0.169 | 1.992 ± 0.287 | 2.431 ± 0.291 | 2.467 ± 0.29 | 2.942 ± 0.26 | 0.42 ± 0.085 | 2.303 ± 0.213 | 0.0 ± 0.0 |
| Ser | 5.665 ± 0.367 | 0.457 ± 0.107 | 4.222 ± 0.336 | 3.783 ± 0.273 | 3.582 ± 0.258 | 7.347 ± 0.646 | 0.841 ± 0.125 | 4.733 ± 0.434 | 3.18 ± 0.322 | 5.062 ± 0.281 | 1.389 ± 0.173 | 4.441 ± 0.315 | 2.924 ± 0.353 | 2.723 ± 0.221 | 2.668 ± 0.206 | 6.305 ± 0.508 | 5.556 ± 0.442 | 4.953 ± 0.474 | 0.822 ± 0.134 | 3.052 ± 0.21 | 0.0 ± 0.0 |
| Thr | 6.067 ± 0.524 | 0.585 ± 0.11 | 4.35 ± 0.435 | 4.551 ± 0.265 | 3.18 ± 0.356 | 6.963 ± 0.734 | 0.859 ± 0.119 | 5.556 ± 0.485 | 3.107 ± 0.28 | 5.684 ± 0.456 | 1.023 ± 0.125 | 3.966 ± 0.469 | 3.162 ± 0.241 | 2.559 ± 0.205 | 2.577 ± 0.26 | 5.702 ± 0.569 | 5.958 ± 0.651 | 5.611 ± 0.598 | 0.877 ± 0.136 | 3.015 ± 0.205 | 0.0 ± 0.0 |
| Val | 4.66 ± 0.416 | 0.475 ± 0.106 | 5.41 ± 0.395 | 4.569 ± 0.306 | 2.741 ± 0.258 | 4.715 ± 0.425 | 0.804 ± 0.144 | 4.167 ± 0.348 | 3.491 ± 0.254 | 3.948 ± 0.223 | 1.279 ± 0.198 | 3.984 ± 0.334 | 3.143 ± 0.274 | 2.358 ± 0.22 | 2.741 ± 0.238 | 5.135 ± 0.292 | 6.067 ± 0.489 | 4.715 ± 0.365 | 0.676 ± 0.115 | 2.65 ± 0.212 | 0.0 ± 0.0 |
| Trp | 0.822 ± 0.123 | 0.11 ± 0.049 | 0.713 ± 0.113 | 0.749 ± 0.156 | 0.384 ± 0.094 | 0.749 ± 0.112 | 0.329 ± 0.102 | 0.603 ± 0.122 | 0.749 ± 0.156 | 0.603 ± 0.131 | 0.347 ± 0.091 | 0.64 ± 0.108 | 0.146 ± 0.049 | 0.457 ± 0.086 | 0.457 ± 0.088 | 0.914 ± 0.125 | 0.749 ± 0.136 | 0.713 ± 0.127 | 0.146 ± 0.056 | 0.366 ± 0.068 | 0.0 ± 0.0 |
| Tyr | 2.485 ± 0.203 | 0.53 ± 0.103 | 3.545 ± 0.263 | 2.632 ± 0.246 | 1.809 ± 0.201 | 2.339 ± 0.193 | 0.676 ± 0.141 | 2.814 ± 0.195 | 2.632 ± 0.261 | 2.833 ± 0.226 | 0.713 ± 0.147 | 2.687 ± 0.227 | 1.553 ± 0.185 | 1.572 ± 0.176 | 2.23 ± 0.222 | 2.705 ± 0.192 | 3.034 ± 0.305 | 2.942 ± 0.281 | 0.366 ± 0.103 | 1.974 ± 0.213 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 215 proteins (54719 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
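Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the replicate values is reported. This is only an illustration. That the ± value is the standard deviation across replicates is an assumption, as are the counting convention (overlapping pairs within each protein, one-letter codes), the fixed seed, and the hypothetical function names.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of one dipeptide's frequency over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy proteome (not real data).
toy = ["AAAA", "GGGG", "AGAG"]
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps the within-protein composition intact, so the error reflects variation between proteins.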

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski