Amino acid dipepetide frequency for Synechococcus phage S-SCSM1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.284AlaAla: 4.284 ± 0.413
0.655AlaCys: 0.655 ± 0.096
3.657AlaAsp: 3.657 ± 0.233
3.316AlaGlu: 3.316 ± 0.261
2.047AlaPhe: 2.047 ± 0.161
5.376AlaGly: 5.376 ± 0.461
0.832AlaHis: 0.832 ± 0.133
3.616AlaIle: 3.616 ± 0.204
3.206AlaLys: 3.206 ± 0.327
4.421AlaLeu: 4.421 ± 0.342
1.092AlaMet: 1.092 ± 0.138
3.425AlaAsn: 3.425 ± 0.221
2.374AlaPro: 2.374 ± 0.187
1.706AlaGln: 1.706 ± 0.183
2.374AlaArg: 2.374 ± 0.194
3.957AlaSer: 3.957 ± 0.254
4.585AlaThr: 4.585 ± 0.368
3.861AlaVal: 3.861 ± 0.303
0.505AlaTrp: 0.505 ± 0.084
2.142AlaTyr: 2.142 ± 0.159
0.014AlaXaa: 0.014 ± 0.014
Cys
0.71CysAla: 0.71 ± 0.117
0.123CysCys: 0.123 ± 0.043
0.75CysAsp: 0.75 ± 0.113
0.887CysGlu: 0.887 ± 0.127
0.368CysPhe: 0.368 ± 0.069
0.778CysGly: 0.778 ± 0.123
0.218CysHis: 0.218 ± 0.063
0.75CysIle: 0.75 ± 0.114
0.559CysLys: 0.559 ± 0.08
0.682CysLeu: 0.682 ± 0.114
0.368CysMet: 0.368 ± 0.074
0.491CysAsn: 0.491 ± 0.081
0.559CysPro: 0.559 ± 0.095
0.314CysGln: 0.314 ± 0.067
0.478CysArg: 0.478 ± 0.091
0.901CysSer: 0.901 ± 0.116
0.587CysThr: 0.587 ± 0.086
0.86CysVal: 0.86 ± 0.118
0.136CysTrp: 0.136 ± 0.042
0.464CysTyr: 0.464 ± 0.075
0.0CysXaa: 0.0 ± 0.0
Asp
3.834AspAla: 3.834 ± 0.207
0.955AspCys: 0.955 ± 0.134
4.707AspAsp: 4.707 ± 0.29
4.407AspGlu: 4.407 ± 0.277
3.629AspPhe: 3.629 ± 0.252
5.376AspGly: 5.376 ± 0.308
1.242AspHis: 1.242 ± 0.118
4.707AspIle: 4.707 ± 0.294
3.425AspLys: 3.425 ± 0.259
5.253AspLeu: 5.253 ± 0.264
1.474AspMet: 1.474 ± 0.185
3.82AspAsn: 3.82 ± 0.252
3.425AspPro: 3.425 ± 0.214
2.129AspGln: 2.129 ± 0.192
2.429AspArg: 2.429 ± 0.172
4.23AspSer: 4.23 ± 0.279
3.22AspThr: 3.22 ± 0.195
4.694AspVal: 4.694 ± 0.256
1.037AspTrp: 1.037 ± 0.118
3.493AspTyr: 3.493 ± 0.234
0.0AspXaa: 0.0 ± 0.0
Glu
2.824GluAla: 2.824 ± 0.272
0.682GluCys: 0.682 ± 0.083
4.503GluAsp: 4.503 ± 0.27
5.69GluGlu: 5.69 ± 0.598
3.275GluPhe: 3.275 ± 0.269
3.807GluGly: 3.807 ± 0.248
1.105GluHis: 1.105 ± 0.164
4.544GluIle: 4.544 ± 0.337
4.585GluLys: 4.585 ± 0.441
5.69GluLeu: 5.69 ± 0.374
1.61GluMet: 1.61 ± 0.184
3.793GluAsn: 3.793 ± 0.256
2.115GluPro: 2.115 ± 0.184
2.401GluGln: 2.401 ± 0.288
2.456GluArg: 2.456 ± 0.234
3.698GluSer: 3.698 ± 0.259
3.698GluThr: 3.698 ± 0.25
4.721GluVal: 4.721 ± 0.265
0.914GluTrp: 0.914 ± 0.13
3.07GluTyr: 3.07 ± 0.227
0.0GluXaa: 0.0 ± 0.0
Phe
2.306PheAla: 2.306 ± 0.147
0.6PheCys: 0.6 ± 0.092
3.943PheAsp: 3.943 ± 0.274
2.947PheGlu: 2.947 ± 0.187
1.678PhePhe: 1.678 ± 0.168
2.947PheGly: 2.947 ± 0.224
0.914PheHis: 0.914 ± 0.138
2.265PheIle: 2.265 ± 0.188
2.838PheLys: 2.838 ± 0.221
3.288PheLeu: 3.288 ± 0.212
0.846PheMet: 0.846 ± 0.105
3.152PheAsn: 3.152 ± 0.28
1.938PhePro: 1.938 ± 0.199
1.133PheGln: 1.133 ± 0.124
1.583PheArg: 1.583 ± 0.197
3.302PheSer: 3.302 ± 0.216
3.438PheThr: 3.438 ± 0.266
3.466PheVal: 3.466 ± 0.235
0.464PheTrp: 0.464 ± 0.088
2.019PheTyr: 2.019 ± 0.184
0.0PheXaa: 0.0 ± 0.0
Gly
4.666GlyAla: 4.666 ± 0.392
0.723GlyCys: 0.723 ± 0.109
4.694GlyAsp: 4.694 ± 0.35
4.025GlyGlu: 4.025 ± 0.229
3.111GlyPhe: 3.111 ± 0.263
7.382GlyGly: 7.382 ± 0.769
1.173GlyHis: 1.173 ± 0.151
5.471GlyIle: 5.471 ± 0.504
3.67GlyLys: 3.67 ± 0.278
5.294GlyLeu: 5.294 ± 0.338
1.419GlyMet: 1.419 ± 0.144
4.434GlyAsn: 4.434 ± 0.321
1.924GlyPro: 1.924 ± 0.173
2.306GlyGln: 2.306 ± 0.192
3.125GlyArg: 3.125 ± 0.18
6.809GlySer: 6.809 ± 0.532
6.113GlyThr: 6.113 ± 0.508
5.403GlyVal: 5.403 ± 0.409
1.228GlyTrp: 1.228 ± 0.139
3.316GlyTyr: 3.316 ± 0.222
0.014GlyXaa: 0.014 ± 0.014
His
0.873HisAla: 0.873 ± 0.115
0.3HisCys: 0.3 ± 0.062
1.16HisAsp: 1.16 ± 0.127
1.378HisGlu: 1.378 ± 0.17
0.941HisPhe: 0.941 ± 0.146
1.528HisGly: 1.528 ± 0.144
0.587HisHis: 0.587 ± 0.104
0.982HisIle: 0.982 ± 0.11
1.419HisLys: 1.419 ± 0.131
1.378HisLeu: 1.378 ± 0.187
0.287HisMet: 0.287 ± 0.063
1.201HisAsn: 1.201 ± 0.128
0.901HisPro: 0.901 ± 0.137
0.641HisGln: 0.641 ± 0.095
0.928HisArg: 0.928 ± 0.132
1.064HisSer: 1.064 ± 0.115
1.269HisThr: 1.269 ± 0.163
1.037HisVal: 1.037 ± 0.147
0.3HisTrp: 0.3 ± 0.073
0.901HisTyr: 0.901 ± 0.126
0.0HisXaa: 0.0 ± 0.0
Ile
3.875IleAla: 3.875 ± 0.262
0.805IleCys: 0.805 ± 0.12
5.103IleAsp: 5.103 ± 0.323
4.421IleGlu: 4.421 ± 0.284
2.715IlePhe: 2.715 ± 0.255
4.585IleGly: 4.585 ± 0.354
1.16IleHis: 1.16 ± 0.126
3.602IleIle: 3.602 ± 0.231
4.093IleLys: 4.093 ± 0.301
4.721IleLeu: 4.721 ± 0.304
1.201IleMet: 1.201 ± 0.15
3.861IleAsn: 3.861 ± 0.295
2.756IlePro: 2.756 ± 0.211
2.238IleGln: 2.238 ± 0.173
2.784IleArg: 2.784 ± 0.18
4.735IleSer: 4.735 ± 0.322
4.776IleThr: 4.776 ± 0.421
4.134IleVal: 4.134 ± 0.247
0.546IleTrp: 0.546 ± 0.097
2.743IleTyr: 2.743 ± 0.168
0.0IleXaa: 0.0 ± 0.0
Lys
3.193LysAla: 3.193 ± 0.334
0.546LysCys: 0.546 ± 0.106
3.616LysAsp: 3.616 ± 0.231
4.544LysGlu: 4.544 ± 0.414
2.797LysPhe: 2.797 ± 0.266
2.906LysGly: 2.906 ± 0.2
1.187LysHis: 1.187 ± 0.143
4.107LysIle: 4.107 ± 0.338
5.54LysLys: 5.54 ± 0.6
5.103LysLeu: 5.103 ± 0.36
1.706LysMet: 1.706 ± 0.207
3.711LysAsn: 3.711 ± 0.28
1.992LysPro: 1.992 ± 0.196
2.142LysGln: 2.142 ± 0.215
2.374LysArg: 2.374 ± 0.239
3.93LysSer: 3.93 ± 0.252
3.548LysThr: 3.548 ± 0.25
4.175LysVal: 4.175 ± 0.227
0.587LysTrp: 0.587 ± 0.103
2.934LysTyr: 2.934 ± 0.222
0.0LysXaa: 0.0 ± 0.0
Leu
4.107LeuAla: 4.107 ± 0.326
0.587LeuCys: 0.587 ± 0.108
5.403LeuAsp: 5.403 ± 0.318
5.485LeuGlu: 5.485 ± 0.397
3.07LeuPhe: 3.07 ± 0.182
4.639LeuGly: 4.639 ± 0.28
1.528LeuHis: 1.528 ± 0.187
4.243LeuIle: 4.243 ± 0.281
5.035LeuLys: 5.035 ± 0.383
4.666LeuLeu: 4.666 ± 0.286
1.474LeuMet: 1.474 ± 0.187
5.021LeuAsn: 5.021 ± 0.276
2.893LeuPro: 2.893 ± 0.24
2.729LeuGln: 2.729 ± 0.218
3.438LeuArg: 3.438 ± 0.199
5.758LeuSer: 5.758 ± 0.279
4.844LeuThr: 4.844 ± 0.286
4.012LeuVal: 4.012 ± 0.227
0.805LeuTrp: 0.805 ± 0.126
3.316LeuTyr: 3.316 ± 0.259
0.0LeuXaa: 0.0 ± 0.0
Met
1.187MetAla: 1.187 ± 0.153
0.368MetCys: 0.368 ± 0.081
1.283MetAsp: 1.283 ± 0.173
1.378MetGlu: 1.378 ± 0.196
0.86MetPhe: 0.86 ± 0.117
1.146MetGly: 1.146 ± 0.131
0.478MetHis: 0.478 ± 0.104
1.105MetIle: 1.105 ± 0.135
1.528MetLys: 1.528 ± 0.186
1.555MetLeu: 1.555 ± 0.187
0.491MetMet: 0.491 ± 0.077
1.228MetAsn: 1.228 ± 0.146
0.914MetPro: 0.914 ± 0.122
0.723MetGln: 0.723 ± 0.104
0.969MetArg: 0.969 ± 0.109
1.61MetSer: 1.61 ± 0.189
1.446MetThr: 1.446 ± 0.164
1.092MetVal: 1.092 ± 0.128
0.355MetTrp: 0.355 ± 0.086
0.832MetTyr: 0.832 ± 0.106
0.0MetXaa: 0.0 ± 0.0
Asn
3.275AsnAla: 3.275 ± 0.256
0.723AsnCys: 0.723 ± 0.11
3.834AsnAsp: 3.834 ± 0.272
3.002AsnGlu: 3.002 ± 0.215
3.166AsnPhe: 3.166 ± 0.207
4.653AsnGly: 4.653 ± 0.355
1.324AsnHis: 1.324 ± 0.145
4.366AsnIle: 4.366 ± 0.314
3.084AsnLys: 3.084 ± 0.21
4.434AsnLeu: 4.434 ± 0.298
0.996AsnMet: 0.996 ± 0.109
3.766AsnAsn: 3.766 ± 0.266
3.043AsnPro: 3.043 ± 0.274
2.156AsnGln: 2.156 ± 0.186
2.279AsnArg: 2.279 ± 0.186
4.121AsnSer: 4.121 ± 0.304
3.711AsnThr: 3.711 ± 0.242
4.83AsnVal: 4.83 ± 0.284
0.71AsnTrp: 0.71 ± 0.103
2.647AsnTyr: 2.647 ± 0.226
0.0AsnXaa: 0.0 ± 0.0
Pro
2.224ProAla: 2.224 ± 0.212
0.409ProCys: 0.409 ± 0.085
2.661ProAsp: 2.661 ± 0.228
2.975ProGlu: 2.975 ± 0.258
1.883ProPhe: 1.883 ± 0.178
3.316ProGly: 3.316 ± 0.276
0.832ProHis: 0.832 ± 0.102
2.265ProIle: 2.265 ± 0.233
2.306ProLys: 2.306 ± 0.214
2.511ProLeu: 2.511 ± 0.204
0.682ProMet: 0.682 ± 0.104
2.442ProAsn: 2.442 ± 0.222
1.446ProPro: 1.446 ± 0.156
1.46ProGln: 1.46 ± 0.191
2.32ProArg: 2.32 ± 0.529
2.947ProSer: 2.947 ± 0.236
3.029ProThr: 3.029 ± 0.249
2.401ProVal: 2.401 ± 0.159
0.628ProTrp: 0.628 ± 0.087
1.665ProTyr: 1.665 ± 0.134
0.0ProXaa: 0.0 ± 0.0
Gln
2.006GlnAla: 2.006 ± 0.189
0.45GlnCys: 0.45 ± 0.077
1.978GlnAsp: 1.978 ± 0.175
2.401GlnGlu: 2.401 ± 0.23
1.419GlnPhe: 1.419 ± 0.15
2.333GlnGly: 2.333 ± 0.211
0.669GlnHis: 0.669 ± 0.131
2.565GlnIle: 2.565 ± 0.219
2.142GlnLys: 2.142 ± 0.233
2.456GlnLeu: 2.456 ± 0.192
0.86GlnMet: 0.86 ± 0.116
1.869GlnAsn: 1.869 ± 0.158
1.528GlnPro: 1.528 ± 0.211
1.269GlnGln: 1.269 ± 0.146
1.269GlnArg: 1.269 ± 0.164
2.292GlnSer: 2.292 ± 0.199
1.856GlnThr: 1.856 ± 0.166
2.088GlnVal: 2.088 ± 0.175
0.491GlnTrp: 0.491 ± 0.085
1.733GlnTyr: 1.733 ± 0.187
0.014GlnXaa: 0.014 ± 0.014
Arg
2.279ArgAla: 2.279 ± 0.23
0.314ArgCys: 0.314 ± 0.068
2.442ArgAsp: 2.442 ± 0.188
2.824ArgGlu: 2.824 ± 0.28
1.787ArgPhe: 1.787 ± 0.152
2.579ArgGly: 2.579 ± 0.217
0.669ArgHis: 0.669 ± 0.11
2.906ArgIle: 2.906 ± 0.207
2.797ArgLys: 2.797 ± 0.242
3.07ArgLeu: 3.07 ± 0.21
0.969ArgMet: 0.969 ± 0.128
2.115ArgAsn: 2.115 ± 0.205
1.815ArgPro: 1.815 ± 0.422
1.405ArgGln: 1.405 ± 0.246
1.965ArgArg: 1.965 ± 0.231
2.592ArgSer: 2.592 ± 0.226
2.442ArgThr: 2.442 ± 0.2
2.906ArgVal: 2.906 ± 0.242
0.396ArgTrp: 0.396 ± 0.072
2.456ArgTyr: 2.456 ± 0.216
0.0ArgXaa: 0.0 ± 0.0
Ser
4.216SerAla: 4.216 ± 0.249
0.682SerCys: 0.682 ± 0.114
4.68SerAsp: 4.68 ± 0.284
3.875SerGlu: 3.875 ± 0.253
3.548SerPhe: 3.548 ± 0.202
6.931SerGly: 6.931 ± 0.566
1.173SerHis: 1.173 ± 0.134
4.776SerIle: 4.776 ± 0.386
3.848SerLys: 3.848 ± 0.263
5.199SerLeu: 5.199 ± 0.236
1.378SerMet: 1.378 ± 0.156
4.148SerAsn: 4.148 ± 0.325
2.743SerPro: 2.743 ± 0.225
2.197SerGln: 2.197 ± 0.17
2.074SerArg: 2.074 ± 0.155
5.963SerSer: 5.963 ± 0.429
5.335SerThr: 5.335 ± 0.556
4.626SerVal: 4.626 ± 0.36
0.873SerTrp: 0.873 ± 0.105
3.206SerTyr: 3.206 ± 0.239
0.0SerXaa: 0.0 ± 0.0
Thr
4.707ThrAla: 4.707 ± 0.427
0.6ThrCys: 0.6 ± 0.092
3.971ThrAsp: 3.971 ± 0.275
3.657ThrGlu: 3.657 ± 0.213
3.357ThrPhe: 3.357 ± 0.261
6.345ThrGly: 6.345 ± 0.572
1.31ThrHis: 1.31 ± 0.168
4.83ThrIle: 4.83 ± 0.314
3.493ThrLys: 3.493 ± 0.256
5.212ThrLeu: 5.212 ± 0.329
0.941ThrMet: 0.941 ± 0.105
4.093ThrAsn: 4.093 ± 0.282
3.111ThrPro: 3.111 ± 0.242
2.019ThrGln: 2.019 ± 0.186
2.292ThrArg: 2.292 ± 0.182
5.212ThrSer: 5.212 ± 0.522
4.912ThrThr: 4.912 ± 0.513
4.585ThrVal: 4.585 ± 0.331
0.819ThrTrp: 0.819 ± 0.11
2.824ThrTyr: 2.824 ± 0.188
0.0ThrXaa: 0.0 ± 0.0
Val
4.134ValAla: 4.134 ± 0.234
0.723ValCys: 0.723 ± 0.105
4.53ValAsp: 4.53 ± 0.272
4.23ValGlu: 4.23 ± 0.214
2.743ValPhe: 2.743 ± 0.188
5.84ValGly: 5.84 ± 0.565
1.133ValHis: 1.133 ± 0.142
4.134ValIle: 4.134 ± 0.224
3.752ValLys: 3.752 ± 0.206
4.175ValLeu: 4.175 ± 0.256
1.501ValMet: 1.501 ± 0.152
4.039ValAsn: 4.039 ± 0.222
2.743ValPro: 2.743 ± 0.226
2.415ValGln: 2.415 ± 0.216
3.056ValArg: 3.056 ± 0.198
4.83ValSer: 4.83 ± 0.341
5.649ValThr: 5.649 ± 0.49
4.216ValVal: 4.216 ± 0.29
0.628ValTrp: 0.628 ± 0.111
2.633ValTyr: 2.633 ± 0.2
0.0ValXaa: 0.0 ± 0.0
Trp
0.682TrpAla: 0.682 ± 0.111
0.177TrpCys: 0.177 ± 0.049
0.928TrpAsp: 0.928 ± 0.137
0.846TrpGlu: 0.846 ± 0.127
0.546TrpPhe: 0.546 ± 0.086
0.846TrpGly: 0.846 ± 0.103
0.437TrpHis: 0.437 ± 0.095
0.75TrpIle: 0.75 ± 0.112
0.778TrpLys: 0.778 ± 0.112
0.737TrpLeu: 0.737 ± 0.106
0.423TrpMet: 0.423 ± 0.087
0.737TrpAsn: 0.737 ± 0.104
0.177TrpPro: 0.177 ± 0.048
0.423TrpGln: 0.423 ± 0.073
0.641TrpArg: 0.641 ± 0.109
0.641TrpSer: 0.641 ± 0.103
0.778TrpThr: 0.778 ± 0.128
0.914TrpVal: 0.914 ± 0.133
0.177TrpTrp: 0.177 ± 0.047
0.587TrpTyr: 0.587 ± 0.082
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.06TyrAla: 2.06 ± 0.187
0.559TyrCys: 0.559 ± 0.089
3.629TyrAsp: 3.629 ± 0.227
2.893TyrGlu: 2.893 ± 0.219
2.115TyrPhe: 2.115 ± 0.186
2.988TyrGly: 2.988 ± 0.226
1.105TyrHis: 1.105 ± 0.132
2.865TyrIle: 2.865 ± 0.211
2.538TyrLys: 2.538 ± 0.25
3.22TyrLeu: 3.22 ± 0.243
0.819TyrMet: 0.819 ± 0.123
2.77TyrAsn: 2.77 ± 0.287
2.006TyrPro: 2.006 ± 0.177
1.883TyrGln: 1.883 ± 0.19
1.897TyrArg: 1.897 ± 0.161
2.879TyrSer: 2.879 ± 0.211
3.056TyrThr: 3.056 ± 0.26
3.07TyrVal: 3.07 ± 0.235
0.628TyrTrp: 0.628 ± 0.103
2.32TyrTyr: 2.32 ± 0.192
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.014XaaAsp: 0.014 ± 0.014
0.0XaaGlu: 0.0 ± 0.0
0.014XaaPhe: 0.014 ± 0.014
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.014XaaVal: 0.014 ± 0.014
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 292 proteins (73290 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski