Amino acid dipeptide frequency for Cyanophage Syn10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
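A minimal Python sketch of how such a frequency could be computed. It assumes that dipeptides are counted as overlapping pairs within each protein and that counts are normalized by the total number of pairs; the function name `dipeptide_per_mille`, the toy sequences, and the normalization are illustrative assumptions, not the procedure documented for this database.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_per_mille(["MAAG", "AGX"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The real table uses three-letter codes (e.g. AlaGly); the one-letter pairs here (e.g. AG) are used only to keep the sketch short.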

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones in blue, in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
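The uniform expectation and the over/under marking described above can be sketched as follows. This is an illustration only: it assumes that a value is compared directly against 1/400 expressed in per mille, which may differ from the exact rule used to colour the table. The helper names are hypothetical; the three example values are taken from the table.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(value, expected):
    """Label a per-mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()  # 1/400 = 0.25% = 2.5 per mille

# Values copied from the table (per mille).
examples = {"GlyGly": 8.381, "CysCys": 0.111, "AlaTyr": 2.244}
labels = {name: classify(value, expected) for name, value in examples.items()}

for name, value in examples.items():
    # Multiplying by 1e-3 converts per mille to a plain fraction.
    print(name, value, value * 1e-3, labels[name])
```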

For more information, see the sequence space article on Wikipedia.

Residue order (rows: first residue; columns: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 6.063 ± 0.556
AlaCys: 0.538 ± 0.139
AlaAsp: 4.506 ± 0.262
AlaGlu: 3.875 ± 0.368
AlaPhe: 3.078 ± 0.207
AlaGly: 5.933 ± 0.533
AlaHis: 0.742 ± 0.119
AlaIle: 4.042 ± 0.402
AlaLys: 3.671 ± 0.374
AlaLeu: 4.654 ± 0.309
AlaMet: 1.224 ± 0.177
AlaAsn: 4.024 ± 0.417
AlaPro: 3.059 ± 0.229
AlaGln: 2.633 ± 0.222
AlaArg: 2.466 ± 0.25
AlaSer: 4.654 ± 0.295
AlaThr: 5.655 ± 0.481
AlaVal: 4.024 ± 0.277
AlaTrp: 0.63 ± 0.119
AlaTyr: 2.244 ± 0.21
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.668 ± 0.129
CysCys: 0.111 ± 0.052
CysAsp: 0.723 ± 0.133
CysGlu: 0.556 ± 0.102
CysPhe: 0.482 ± 0.105
CysGly: 0.779 ± 0.164
CysHis: 0.223 ± 0.065
CysIle: 0.593 ± 0.128
CysLys: 0.668 ± 0.106
CysLeu: 0.668 ± 0.133
CysMet: 0.334 ± 0.087
CysAsn: 0.593 ± 0.116
CysPro: 0.445 ± 0.109
CysGln: 0.352 ± 0.086
CysArg: 0.445 ± 0.108
CysSer: 0.593 ± 0.113
CysThr: 0.519 ± 0.112
CysVal: 0.464 ± 0.114
CysTrp: 0.13 ± 0.047
CysTyr: 0.445 ± 0.085
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.414 ± 0.297
AspCys: 0.853 ± 0.153
AspAsp: 4.784 ± 0.386
AspGlu: 3.801 ± 0.276
AspPhe: 2.985 ± 0.269
AspGly: 6.1 ± 0.476
AspHis: 1.057 ± 0.154
AspIle: 4.339 ± 0.327
AspLys: 3.041 ± 0.35
AspLeu: 4.839 ± 0.313
AspMet: 1.279 ± 0.169
AspAsn: 3.56 ± 0.303
AspPro: 3.56 ± 0.303
AspGln: 2.188 ± 0.188
AspArg: 2.614 ± 0.24
AspSer: 3.745 ± 0.337
AspThr: 4.469 ± 0.352
AspVal: 4.246 ± 0.268
AspTrp: 1.113 ± 0.145
AspTyr: 3.875 ± 0.246
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.041 ± 0.324
GluCys: 0.834 ± 0.137
GluAsp: 3.949 ± 0.283
GluGlu: 4.58 ± 0.448
GluPhe: 3.171 ± 0.23
GluGly: 3.912 ± 0.227
GluHis: 0.946 ± 0.176
GluIle: 4.673 ± 0.327
GluLys: 3.783 ± 0.504
GluLeu: 4.19 ± 0.283
GluMet: 1.632 ± 0.25
GluAsn: 3.504 ± 0.272
GluPro: 1.502 ± 0.159
GluGln: 2.114 ± 0.189
GluArg: 2.54 ± 0.319
GluSer: 3.949 ± 0.319
GluThr: 4.153 ± 0.321
GluVal: 4.394 ± 0.264
GluTrp: 0.983 ± 0.17
GluTyr: 2.818 ± 0.229
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.8 ± 0.191
PheCys: 0.482 ± 0.094
PheAsp: 3.226 ± 0.264
PheGlu: 2.726 ± 0.254
PhePhe: 1.724 ± 0.2
PheGly: 3.282 ± 0.26
PheHis: 0.705 ± 0.151
PheIle: 2.633 ± 0.275
PheLys: 2.299 ± 0.213
PheLeu: 2.818 ± 0.329
PheMet: 1.001 ± 0.175
PheAsn: 3.022 ± 0.237
PhePro: 1.817 ± 0.215
PheGln: 1.706 ± 0.193
PheArg: 1.891 ± 0.218
PheSer: 3.3 ± 0.226
PheThr: 3.226 ± 0.253
PheVal: 2.874 ± 0.335
PheTrp: 0.371 ± 0.087
PheTyr: 2.003 ± 0.18
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.618 ± 0.499
GlyCys: 0.63 ± 0.142
GlyAsp: 5.266 ± 0.513
GlyGlu: 4.432 ± 0.279
GlyPhe: 3.208 ± 0.262
GlyGly: 8.381 ± 1.136
GlyHis: 1.261 ± 0.182
GlyIle: 4.153 ± 0.35
GlyLys: 3.857 ± 0.408
GlyLeu: 4.394 ± 0.297
GlyMet: 1.316 ± 0.216
GlyAsn: 4.839 ± 0.51
GlyPro: 2.262 ± 0.192
GlyGln: 2.633 ± 0.209
GlyArg: 3.115 ± 0.2
GlySer: 6.045 ± 0.71
GlyThr: 6.657 ± 0.689
GlyVal: 4.914 ± 0.334
GlyTrp: 1.298 ± 0.162
GlyTyr: 3.467 ± 0.383
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.871 ± 0.137
HisCys: 0.111 ± 0.046
HisAsp: 1.131 ± 0.179
HisGlu: 0.834 ± 0.149
HisPhe: 0.983 ± 0.133
HisGly: 1.038 ± 0.158
HisHis: 0.445 ± 0.107
HisIle: 1.038 ± 0.148
HisLys: 0.797 ± 0.142
HisLeu: 1.279 ± 0.173
HisMet: 0.482 ± 0.124
HisAsn: 0.946 ± 0.169
HisPro: 1.001 ± 0.173
HisGln: 0.668 ± 0.124
HisArg: 0.649 ± 0.126
HisSer: 1.094 ± 0.13
HisThr: 1.02 ± 0.158
HisVal: 1.168 ± 0.195
HisTrp: 0.223 ± 0.072
HisTyr: 0.89 ± 0.147
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.82 ± 0.307
IleCys: 0.63 ± 0.138
IleAsp: 4.988 ± 0.331
IleGlu: 4.265 ± 0.271
IlePhe: 2.577 ± 0.215
IleGly: 3.968 ± 0.33
IleHis: 0.89 ± 0.159
IleIle: 3.541 ± 0.288
IleLys: 3.838 ± 0.29
IleLeu: 4.747 ± 0.394
IleMet: 1.187 ± 0.159
IleAsn: 4.172 ± 0.261
IlePro: 3.152 ± 0.287
IleGln: 2.818 ± 0.298
IleArg: 2.466 ± 0.207
IleSer: 4.116 ± 0.546
IleThr: 5.637 ± 0.654
IleVal: 4.153 ± 0.33
IleTrp: 0.742 ± 0.135
IleTyr: 2.299 ± 0.207
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.041 ± 0.294
LysCys: 0.556 ± 0.103
LysAsp: 3.708 ± 0.35
LysGlu: 3.857 ± 0.502
LysPhe: 2.448 ± 0.286
LysGly: 2.985 ± 0.276
LysHis: 1.057 ± 0.22
LysIle: 3.875 ± 0.324
LysLys: 4.394 ± 0.706
LysLeu: 5.043 ± 0.381
LysMet: 1.409 ± 0.263
LysAsn: 2.744 ± 0.254
LysPro: 1.928 ± 0.213
LysGln: 2.003 ± 0.308
LysArg: 2.429 ± 0.32
LysSer: 3.486 ± 0.316
LysThr: 3.523 ± 0.296
LysVal: 3.504 ± 0.224
LysTrp: 0.76 ± 0.129
LysTyr: 3.3 ± 0.445
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.598 ± 0.304
LeuCys: 0.871 ± 0.17
LeuAsp: 5.451 ± 0.312
LeuGlu: 4.543 ± 0.376
LeuPhe: 2.429 ± 0.21
LeuGly: 4.487 ± 0.277
LeuHis: 1.409 ± 0.184
LeuIle: 3.968 ± 0.258
LeuLys: 4.432 ± 0.472
LeuLeu: 5.136 ± 0.353
LeuMet: 1.52 ± 0.225
LeuAsn: 4.413 ± 0.325
LeuPro: 2.893 ± 0.26
LeuGln: 2.893 ± 0.265
LeuArg: 3.263 ± 0.266
LeuSer: 4.728 ± 0.236
LeuThr: 5.34 ± 0.614
LeuVal: 4.339 ± 0.258
LeuTrp: 0.649 ± 0.131
LeuTyr: 3.523 ± 0.264
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.502 ± 0.202
MetCys: 0.223 ± 0.069
MetAsp: 1.131 ± 0.171
MetGlu: 1.131 ± 0.227
MetPhe: 0.742 ± 0.158
MetGly: 1.038 ± 0.153
MetHis: 0.334 ± 0.1
MetIle: 1.15 ± 0.172
MetLys: 1.78 ± 0.294
MetLeu: 1.65 ± 0.227
MetMet: 0.501 ± 0.103
MetAsn: 1.224 ± 0.157
MetPro: 1.057 ± 0.189
MetGln: 1.15 ± 0.184
MetArg: 0.964 ± 0.129
MetSer: 1.669 ± 0.257
MetThr: 1.595 ± 0.208
MetVal: 1.15 ± 0.146
MetTrp: 0.223 ± 0.069
MetTyr: 0.723 ± 0.116
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.061 ± 0.451
AsnCys: 0.593 ± 0.105
AsnAsp: 3.449 ± 0.245
AsnGlu: 3.078 ± 0.235
AsnPhe: 2.651 ± 0.225
AsnGly: 4.728 ± 0.48
AsnHis: 1.001 ± 0.185
AsnIle: 4.172 ± 0.366
AsnLys: 3.319 ± 0.311
AsnLeu: 4.635 ± 0.384
AsnMet: 0.816 ± 0.144
AsnAsn: 3.764 ± 0.305
AsnPro: 3.208 ± 0.261
AsnGln: 2.281 ± 0.223
AsnArg: 2.429 ± 0.194
AsnSer: 3.801 ± 0.311
AsnThr: 4.024 ± 0.517
AsnVal: 4.357 ± 0.322
AsnTrp: 0.779 ± 0.119
AsnTyr: 2.485 ± 0.162
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.707 ± 0.227
ProCys: 0.389 ± 0.123
ProAsp: 2.707 ± 0.299
ProGlu: 3.059 ± 0.279
ProPhe: 1.724 ± 0.212
ProGly: 3.412 ± 0.349
ProHis: 0.927 ± 0.128
ProIle: 2.651 ± 0.253
ProLys: 1.91 ± 0.21
ProLeu: 2.299 ± 0.211
ProMet: 0.612 ± 0.127
ProAsn: 2.262 ± 0.225
ProPro: 1.761 ± 0.204
ProGln: 1.354 ± 0.15
ProArg: 1.428 ± 0.162
ProSer: 3.134 ± 0.221
ProThr: 3.467 ± 0.221
ProVal: 2.503 ± 0.248
ProTrp: 0.482 ± 0.089
ProTyr: 1.965 ± 0.205
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.928 ± 0.185
GlnCys: 0.241 ± 0.073
GlnAsp: 2.169 ± 0.188
GlnGlu: 2.225 ± 0.206
GlnPhe: 1.52 ± 0.183
GlnGly: 2.559 ± 0.236
GlnHis: 0.723 ± 0.132
GlnIle: 2.633 ± 0.185
GlnLys: 2.392 ± 0.273
GlnLeu: 3.059 ± 0.263
GlnMet: 1.168 ± 0.17
GlnAsn: 2.04 ± 0.207
GlnPro: 1.224 ± 0.138
GlnGln: 1.354 ± 0.167
GlnArg: 1.52 ± 0.153
GlnSer: 2.318 ± 0.203
GlnThr: 2.633 ± 0.25
GlnVal: 2.855 ± 0.221
GlnTrp: 0.556 ± 0.087
GlnTyr: 1.891 ± 0.201
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.614 ± 0.238
ArgCys: 0.352 ± 0.07
ArgAsp: 2.392 ± 0.201
ArgGlu: 2.466 ± 0.288
ArgPhe: 2.114 ± 0.189
ArgGly: 2.763 ± 0.208
ArgHis: 0.834 ± 0.164
ArgIle: 3.059 ± 0.246
ArgLys: 2.577 ± 0.286
ArgLeu: 3.152 ± 0.22
ArgMet: 1.094 ± 0.177
ArgAsn: 2.225 ± 0.219
ArgPro: 1.316 ± 0.15
ArgGln: 1.65 ± 0.187
ArgArg: 1.873 ± 0.254
ArgSer: 2.466 ± 0.206
ArgThr: 2.151 ± 0.218
ArgVal: 2.93 ± 0.274
ArgTrp: 0.538 ± 0.114
ArgTyr: 2.225 ± 0.222
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.025 ± 0.298
SerCys: 0.612 ± 0.103
SerAsp: 4.45 ± 0.293
SerGlu: 3.078 ± 0.267
SerPhe: 3.3 ± 0.236
SerGly: 7.064 ± 0.734
SerHis: 1.057 ± 0.149
SerIle: 4.228 ± 0.304
SerLys: 3.412 ± 0.341
SerLeu: 4.765 ± 0.221
SerMet: 1.632 ± 0.226
SerAsn: 4.024 ± 0.334
SerPro: 2.485 ± 0.253
SerGln: 2.225 ± 0.167
SerArg: 2.559 ± 0.212
SerSer: 5.451 ± 0.528
SerThr: 4.988 ± 0.373
SerVal: 4.357 ± 0.361
SerTrp: 0.668 ± 0.127
SerTyr: 2.855 ± 0.223
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.915 ± 0.55
ThrCys: 0.371 ± 0.09
ThrAsp: 4.487 ± 0.369
ThrGlu: 3.949 ± 0.232
ThrPhe: 3.541 ± 0.569
ThrGly: 6.935 ± 0.704
ThrHis: 1.001 ± 0.133
ThrIle: 5.581 ± 0.521
ThrLys: 3.189 ± 0.302
ThrLeu: 5.674 ± 0.479
ThrMet: 1.205 ± 0.172
ThrAsn: 4.413 ± 0.518
ThrPro: 3.356 ± 0.214
ThrGln: 2.614 ± 0.207
ThrArg: 2.596 ± 0.188
ThrSer: 5.08 ± 0.489
ThrThr: 5.674 ± 0.643
ThrVal: 5.229 ± 0.502
ThrTrp: 0.871 ± 0.106
ThrTyr: 2.614 ± 0.219
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.839 ± 0.301
ValCys: 0.63 ± 0.117
ValAsp: 4.895 ± 0.312
ValGlu: 4.617 ± 0.337
ValPhe: 2.707 ± 0.217
ValGly: 4.951 ± 0.422
ValHis: 0.89 ± 0.159
ValIle: 4.302 ± 0.43
ValLys: 3.523 ± 0.28
ValLeu: 3.894 ± 0.248
ValMet: 1.242 ± 0.188
ValAsn: 4.098 ± 0.351
ValPro: 2.485 ± 0.195
ValGln: 2.114 ± 0.193
ValArg: 2.596 ± 0.197
ValSer: 5.192 ± 0.35
ValThr: 5.841 ± 0.502
ValVal: 4.839 ± 0.418
ValTrp: 0.649 ± 0.097
ValTyr: 2.188 ± 0.19
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.853 ± 0.116
TrpCys: 0.204 ± 0.061
TrpAsp: 1.001 ± 0.156
TrpGlu: 0.742 ± 0.158
TrpPhe: 0.538 ± 0.095
TrpGly: 0.797 ± 0.118
TrpHis: 0.408 ± 0.09
TrpIle: 0.63 ± 0.108
TrpLys: 0.909 ± 0.16
TrpLeu: 0.742 ± 0.162
TrpMet: 0.389 ± 0.094
TrpAsn: 0.853 ± 0.117
TrpPro: 0.297 ± 0.08
TrpGln: 0.426 ± 0.079
TrpArg: 0.575 ± 0.119
TrpSer: 0.76 ± 0.116
TrpThr: 0.76 ± 0.118
TrpVal: 0.927 ± 0.123
TrpTrp: 0.167 ± 0.042
TrpTyr: 0.408 ± 0.078
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.336 ± 0.173
TyrCys: 0.556 ± 0.107
TyrAsp: 3.449 ± 0.296
TyrGlu: 2.855 ± 0.25
TyrPhe: 1.984 ± 0.216
TyrGly: 2.651 ± 0.198
TyrHis: 0.76 ± 0.109
TyrIle: 2.744 ± 0.275
TyrLys: 2.262 ± 0.269
TyrLeu: 3.245 ± 0.262
TyrMet: 0.871 ± 0.17
TyrAsn: 2.911 ± 0.252
TyrPro: 1.947 ± 0.194
TyrGln: 1.836 ± 0.22
TyrArg: 2.373 ± 0.189
TyrSer: 2.651 ± 0.238
TyrThr: 2.967 ± 0.263
TyrVal: 3.263 ± 0.256
TyrTrp: 0.556 ± 0.136
TyrTyr: 1.947 ± 0.158
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 205 proteins (53933 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
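One way a protein-level bootstrap of this kind might look is sketched below. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resulting estimates; the function names, the fixed seed, the toy sequences, and the choice of standard deviation are assumptions made for illustration, not details confirmed by the database.

```python
import random
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over all adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return stdev(estimates)


# Toy input (hypothetical one-letter sequences).
toy = ["AAGG", "CCCC", "AGAG"]
print(pair_per_mille(toy, "AG"), bootstrap_error(toy, "AG"))
```

Resampling at the protein level, rather than at the level of individual residues, keeps each protein's internal composition intact, so the spread reflects variation between proteins.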

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski