Amino acid dipepetide frequency for Synechococcus phage syn9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.18AlaAla: 6.18 ± 0.607
0.554AlaCys: 0.554 ± 0.126
4.345AlaAsp: 4.345 ± 0.273
3.826AlaGlu: 3.826 ± 0.316
3.133AlaPhe: 3.133 ± 0.209
6.146AlaGly: 6.146 ± 0.555
0.744AlaHis: 0.744 ± 0.105
4.103AlaIle: 4.103 ± 0.375
3.947AlaLys: 3.947 ± 0.409
4.674AlaLeu: 4.674 ± 0.308
1.298AlaMet: 1.298 ± 0.182
4.016AlaAsn: 4.016 ± 0.401
3.081AlaPro: 3.081 ± 0.236
2.683AlaGln: 2.683 ± 0.195
2.476AlaArg: 2.476 ± 0.197
4.865AlaSer: 4.865 ± 0.358
5.557AlaThr: 5.557 ± 0.506
4.086AlaVal: 4.086 ± 0.296
0.641AlaTrp: 0.641 ± 0.108
2.181AlaTyr: 2.181 ± 0.233
0.0AlaXaa: 0.0 ± 0.0
Cys
0.623CysAla: 0.623 ± 0.107
0.104CysCys: 0.104 ± 0.046
0.762CysAsp: 0.762 ± 0.139
0.537CysGlu: 0.537 ± 0.094
0.433CysPhe: 0.433 ± 0.113
0.744CysGly: 0.744 ± 0.158
0.208CysHis: 0.208 ± 0.069
0.641CysIle: 0.641 ± 0.131
0.744CysLys: 0.744 ± 0.118
0.658CysLeu: 0.658 ± 0.133
0.329CysMet: 0.329 ± 0.079
0.554CysAsn: 0.554 ± 0.108
0.433CysPro: 0.433 ± 0.096
0.364CysGln: 0.364 ± 0.086
0.467CysArg: 0.467 ± 0.102
0.571CysSer: 0.571 ± 0.116
0.502CysThr: 0.502 ± 0.103
0.467CysVal: 0.467 ± 0.083
0.138CysTrp: 0.138 ± 0.051
0.45CysTyr: 0.45 ± 0.1
0.0CysXaa: 0.0 ± 0.0
Asp
5.488AspAla: 5.488 ± 0.326
0.796AspCys: 0.796 ± 0.136
4.622AspAsp: 4.622 ± 0.356
3.895AspGlu: 3.895 ± 0.344
2.943AspPhe: 2.943 ± 0.253
5.921AspGly: 5.921 ± 0.489
1.039AspHis: 1.039 ± 0.168
4.276AspIle: 4.276 ± 0.28
3.081AspLys: 3.081 ± 0.323
4.865AspLeu: 4.865 ± 0.335
1.368AspMet: 1.368 ± 0.187
3.601AspAsn: 3.601 ± 0.251
3.514AspPro: 3.514 ± 0.305
2.112AspGln: 2.112 ± 0.164
2.562AspArg: 2.562 ± 0.225
3.826AspSer: 3.826 ± 0.305
4.432AspThr: 4.432 ± 0.333
4.172AspVal: 4.172 ± 0.313
1.073AspTrp: 1.073 ± 0.149
3.757AspTyr: 3.757 ± 0.249
0.0AspXaa: 0.0 ± 0.0
Glu
3.064GluAla: 3.064 ± 0.28
0.814GluCys: 0.814 ± 0.135
3.912GluAsp: 3.912 ± 0.309
4.657GluGlu: 4.657 ± 0.408
3.185GluPhe: 3.185 ± 0.171
3.982GluGly: 3.982 ± 0.251
0.935GluHis: 0.935 ± 0.162
4.657GluIle: 4.657 ± 0.257
3.947GluLys: 3.947 ± 0.409
4.103GluLeu: 4.103 ± 0.261
1.731GluMet: 1.731 ± 0.265
3.428GluAsn: 3.428 ± 0.298
1.454GluPro: 1.454 ± 0.181
2.112GluGln: 2.112 ± 0.188
2.493GluArg: 2.493 ± 0.27
3.999GluSer: 3.999 ± 0.313
4.12GluThr: 4.12 ± 0.313
4.328GluVal: 4.328 ± 0.28
0.935GluTrp: 0.935 ± 0.154
2.77GluTyr: 2.77 ± 0.245
0.0GluXaa: 0.0 ± 0.0
Phe
2.787PheAla: 2.787 ± 0.196
0.45PheCys: 0.45 ± 0.079
3.341PheAsp: 3.341 ± 0.248
2.701PheGlu: 2.701 ± 0.218
1.679PhePhe: 1.679 ± 0.19
3.255PheGly: 3.255 ± 0.297
0.675PheHis: 0.675 ± 0.121
2.701PheIle: 2.701 ± 0.226
2.354PheLys: 2.354 ± 0.221
2.908PheLeu: 2.908 ± 0.312
0.987PheMet: 0.987 ± 0.174
2.978PheAsn: 2.978 ± 0.211
1.748PhePro: 1.748 ± 0.218
1.662PheGln: 1.662 ± 0.202
1.818PheArg: 1.818 ± 0.194
3.237PheSer: 3.237 ± 0.203
3.099PheThr: 3.099 ± 0.221
2.908PheVal: 2.908 ± 0.282
0.381PheTrp: 0.381 ± 0.084
1.974PheTyr: 1.974 ± 0.185
0.0PheXaa: 0.0 ± 0.0
Gly
5.817GlyAla: 5.817 ± 0.563
0.658GlyCys: 0.658 ± 0.162
5.228GlyAsp: 5.228 ± 0.425
4.484GlyGlu: 4.484 ± 0.286
3.307GlyPhe: 3.307 ± 0.23
8.587GlyGly: 8.587 ± 1.432
1.281GlyHis: 1.281 ± 0.185
4.189GlyIle: 4.189 ± 0.357
4.068GlyLys: 4.068 ± 0.452
4.622GlyLeu: 4.622 ± 0.271
1.437GlyMet: 1.437 ± 0.222
4.951GlyAsn: 4.951 ± 0.52
2.32GlyPro: 2.32 ± 0.212
2.562GlyGln: 2.562 ± 0.213
2.995GlyArg: 2.995 ± 0.189
5.938GlySer: 5.938 ± 0.612
6.561GlyThr: 6.561 ± 0.588
4.986GlyVal: 4.986 ± 0.316
1.35GlyTrp: 1.35 ± 0.174
3.428GlyTyr: 3.428 ± 0.333
0.0GlyXaa: 0.0 ± 0.0
His
0.883HisAla: 0.883 ± 0.128
0.138HisCys: 0.138 ± 0.059
1.108HisAsp: 1.108 ± 0.186
0.796HisGlu: 0.796 ± 0.141
1.021HisPhe: 1.021 ± 0.143
1.039HisGly: 1.039 ± 0.163
0.467HisHis: 0.467 ± 0.129
0.969HisIle: 0.969 ± 0.122
0.814HisLys: 0.814 ± 0.165
1.264HisLeu: 1.264 ± 0.176
0.519HisMet: 0.519 ± 0.122
0.918HisAsn: 0.918 ± 0.153
1.021HisPro: 1.021 ± 0.147
0.623HisGln: 0.623 ± 0.107
0.623HisArg: 0.623 ± 0.113
1.108HisSer: 1.108 ± 0.146
0.969HisThr: 0.969 ± 0.163
1.143HisVal: 1.143 ± 0.176
0.208HisTrp: 0.208 ± 0.063
0.848HisTyr: 0.848 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
3.912IleAla: 3.912 ± 0.306
0.606IleCys: 0.606 ± 0.138
4.968IleAsp: 4.968 ± 0.304
4.12IleGlu: 4.12 ± 0.276
2.51IlePhe: 2.51 ± 0.246
4.207IleGly: 4.207 ± 0.337
0.866IleHis: 0.866 ± 0.142
3.583IleIle: 3.583 ± 0.275
3.86IleLys: 3.86 ± 0.301
4.674IleLeu: 4.674 ± 0.394
1.212IleMet: 1.212 ± 0.15
4.103IleAsn: 4.103 ± 0.256
3.081IlePro: 3.081 ± 0.321
2.856IleGln: 2.856 ± 0.272
2.476IleArg: 2.476 ± 0.21
4.155IleSer: 4.155 ± 0.473
5.574IleThr: 5.574 ± 0.582
3.982IleVal: 3.982 ± 0.324
0.727IleTrp: 0.727 ± 0.133
2.216IleTyr: 2.216 ± 0.227
0.0IleXaa: 0.0 ± 0.0
Lys
3.237LysAla: 3.237 ± 0.356
0.571LysCys: 0.571 ± 0.108
3.739LysAsp: 3.739 ± 0.263
3.964LysGlu: 3.964 ± 0.427
2.493LysPhe: 2.493 ± 0.249
3.255LysGly: 3.255 ± 0.325
1.004LysHis: 1.004 ± 0.184
3.999LysIle: 3.999 ± 0.355
4.726LysLys: 4.726 ± 0.713
5.003LysLeu: 5.003 ± 0.348
1.437LysMet: 1.437 ± 0.251
2.856LysAsn: 2.856 ± 0.263
2.025LysPro: 2.025 ± 0.222
2.06LysGln: 2.06 ± 0.279
2.458LysArg: 2.458 ± 0.266
3.532LysSer: 3.532 ± 0.291
3.514LysThr: 3.514 ± 0.254
3.618LysVal: 3.618 ± 0.271
0.779LysTrp: 0.779 ± 0.116
3.22LysTyr: 3.22 ± 0.369
0.0LysXaa: 0.0 ± 0.0
Leu
4.639LeuAla: 4.639 ± 0.263
0.831LeuCys: 0.831 ± 0.137
5.522LeuAsp: 5.522 ± 0.313
4.553LeuGlu: 4.553 ± 0.41
2.389LeuPhe: 2.389 ± 0.206
4.795LeuGly: 4.795 ± 0.424
1.385LeuHis: 1.385 ± 0.207
3.982LeuIle: 3.982 ± 0.276
4.449LeuLys: 4.449 ± 0.395
5.228LeuLeu: 5.228 ± 0.387
1.593LeuMet: 1.593 ± 0.215
4.466LeuAsn: 4.466 ± 0.281
2.839LeuPro: 2.839 ± 0.221
2.874LeuGln: 2.874 ± 0.244
3.203LeuArg: 3.203 ± 0.242
4.761LeuSer: 4.761 ± 0.233
5.297LeuThr: 5.297 ± 0.556
4.363LeuVal: 4.363 ± 0.31
0.675LeuTrp: 0.675 ± 0.142
3.445LeuTyr: 3.445 ± 0.283
0.0LeuXaa: 0.0 ± 0.0
Met
1.575MetAla: 1.575 ± 0.196
0.26MetCys: 0.26 ± 0.088
1.16MetAsp: 1.16 ± 0.183
1.143MetGlu: 1.143 ± 0.186
0.762MetPhe: 0.762 ± 0.149
1.091MetGly: 1.091 ± 0.156
0.415MetHis: 0.415 ± 0.106
1.229MetIle: 1.229 ± 0.181
1.8MetLys: 1.8 ± 0.251
1.766MetLeu: 1.766 ± 0.241
0.571MetMet: 0.571 ± 0.113
1.177MetAsn: 1.177 ± 0.177
1.108MetPro: 1.108 ± 0.207
1.073MetGln: 1.073 ± 0.178
0.969MetArg: 0.969 ± 0.129
1.748MetSer: 1.748 ± 0.242
1.558MetThr: 1.558 ± 0.239
1.194MetVal: 1.194 ± 0.169
0.242MetTrp: 0.242 ± 0.064
0.71MetTyr: 0.71 ± 0.11
0.0MetXaa: 0.0 ± 0.0
Asn
4.034AsnAla: 4.034 ± 0.448
0.589AsnCys: 0.589 ± 0.11
3.497AsnAsp: 3.497 ± 0.243
3.081AsnGlu: 3.081 ± 0.221
2.597AsnPhe: 2.597 ± 0.215
4.639AsnGly: 4.639 ± 0.365
0.935AsnHis: 0.935 ± 0.155
4.086AsnIle: 4.086 ± 0.331
3.393AsnLys: 3.393 ± 0.329
4.709AsnLeu: 4.709 ± 0.332
0.796AsnMet: 0.796 ± 0.136
3.826AsnAsn: 3.826 ± 0.247
3.307AsnPro: 3.307 ± 0.277
2.233AsnGln: 2.233 ± 0.225
2.424AsnArg: 2.424 ± 0.219
3.722AsnSer: 3.722 ± 0.315
3.964AsnThr: 3.964 ± 0.501
4.276AsnVal: 4.276 ± 0.339
0.762AsnTrp: 0.762 ± 0.135
2.337AsnTyr: 2.337 ± 0.179
0.0AsnXaa: 0.0 ± 0.0
Pro
2.718ProAla: 2.718 ± 0.22
0.398ProCys: 0.398 ± 0.099
2.683ProAsp: 2.683 ± 0.293
3.03ProGlu: 3.03 ± 0.274
1.731ProPhe: 1.731 ± 0.187
3.393ProGly: 3.393 ± 0.335
0.9ProHis: 0.9 ± 0.116
2.649ProIle: 2.649 ± 0.25
1.922ProLys: 1.922 ± 0.2
2.354ProLeu: 2.354 ± 0.208
0.589ProMet: 0.589 ± 0.108
2.268ProAsn: 2.268 ± 0.221
1.731ProPro: 1.731 ± 0.199
1.368ProGln: 1.368 ± 0.161
1.541ProArg: 1.541 ± 0.174
3.133ProSer: 3.133 ± 0.225
3.324ProThr: 3.324 ± 0.218
2.545ProVal: 2.545 ± 0.248
0.519ProTrp: 0.519 ± 0.098
1.922ProTyr: 1.922 ± 0.196
0.0ProXaa: 0.0 ± 0.0
Gln
2.043GlnAla: 2.043 ± 0.208
0.242GlnCys: 0.242 ± 0.069
2.112GlnAsp: 2.112 ± 0.177
2.285GlnGlu: 2.285 ± 0.171
1.575GlnPhe: 1.575 ± 0.175
2.458GlnGly: 2.458 ± 0.196
0.727GlnHis: 0.727 ± 0.136
2.51GlnIle: 2.51 ± 0.233
2.406GlnLys: 2.406 ± 0.243
3.099GlnLeu: 3.099 ± 0.215
1.194GlnMet: 1.194 ± 0.196
1.922GlnAsn: 1.922 ± 0.22
1.212GlnPro: 1.212 ± 0.13
1.471GlnGln: 1.471 ± 0.174
1.558GlnArg: 1.558 ± 0.15
2.406GlnSer: 2.406 ± 0.179
2.562GlnThr: 2.562 ± 0.253
2.787GlnVal: 2.787 ± 0.198
0.589GlnTrp: 0.589 ± 0.095
1.818GlnTyr: 1.818 ± 0.202
0.0GlnXaa: 0.0 ± 0.0
Arg
2.701ArgAla: 2.701 ± 0.254
0.329ArgCys: 0.329 ± 0.069
2.406ArgAsp: 2.406 ± 0.193
2.441ArgGlu: 2.441 ± 0.234
2.008ArgPhe: 2.008 ± 0.173
2.787ArgGly: 2.787 ± 0.2
0.814ArgHis: 0.814 ± 0.147
3.064ArgIle: 3.064 ± 0.231
2.562ArgLys: 2.562 ± 0.274
3.203ArgLeu: 3.203 ± 0.254
1.108ArgMet: 1.108 ± 0.143
2.216ArgAsn: 2.216 ± 0.213
1.246ArgPro: 1.246 ± 0.168
1.662ArgGln: 1.662 ± 0.203
1.956ArgArg: 1.956 ± 0.234
2.527ArgSer: 2.527 ± 0.241
2.181ArgThr: 2.181 ± 0.224
2.874ArgVal: 2.874 ± 0.234
0.519ArgTrp: 0.519 ± 0.12
2.147ArgTyr: 2.147 ± 0.216
0.0ArgXaa: 0.0 ± 0.0
Ser
5.055SerAla: 5.055 ± 0.319
0.571SerCys: 0.571 ± 0.115
4.432SerAsp: 4.432 ± 0.311
3.116SerGlu: 3.116 ± 0.269
3.255SerPhe: 3.255 ± 0.217
7.288SerGly: 7.288 ± 0.752
1.056SerHis: 1.056 ± 0.141
4.155SerIle: 4.155 ± 0.318
3.462SerLys: 3.462 ± 0.313
4.813SerLeu: 4.813 ± 0.276
1.662SerMet: 1.662 ± 0.248
3.982SerAsn: 3.982 ± 0.302
2.51SerPro: 2.51 ± 0.244
2.268SerGln: 2.268 ± 0.18
2.579SerArg: 2.579 ± 0.181
5.436SerSer: 5.436 ± 0.57
5.107SerThr: 5.107 ± 0.42
4.432SerVal: 4.432 ± 0.345
0.658SerTrp: 0.658 ± 0.113
2.787SerTyr: 2.787 ± 0.189
0.0SerXaa: 0.0 ± 0.0
Thr
5.851ThrAla: 5.851 ± 0.553
0.381ThrCys: 0.381 ± 0.074
4.293ThrAsp: 4.293 ± 0.371
3.912ThrGlu: 3.912 ± 0.248
3.514ThrPhe: 3.514 ± 0.481
6.907ThrGly: 6.907 ± 0.658
1.056ThrHis: 1.056 ± 0.132
5.349ThrIle: 5.349 ± 0.486
3.237ThrLys: 3.237 ± 0.269
5.54ThrLeu: 5.54 ± 0.523
1.212ThrMet: 1.212 ± 0.171
4.207ThrAsn: 4.207 ± 0.465
3.376ThrPro: 3.376 ± 0.308
2.579ThrGln: 2.579 ± 0.207
2.545ThrArg: 2.545 ± 0.193
5.003ThrSer: 5.003 ± 0.461
5.626ThrThr: 5.626 ± 0.618
5.09ThrVal: 5.09 ± 0.445
0.866ThrTrp: 0.866 ± 0.109
2.562ThrTyr: 2.562 ± 0.194
0.0ThrXaa: 0.0 ± 0.0
Val
4.778ValAla: 4.778 ± 0.302
0.692ValCys: 0.692 ± 0.132
4.813ValAsp: 4.813 ± 0.347
4.588ValGlu: 4.588 ± 0.306
2.683ValPhe: 2.683 ± 0.245
4.899ValGly: 4.899 ± 0.402
0.866ValHis: 0.866 ± 0.138
4.207ValIle: 4.207 ± 0.418
3.566ValLys: 3.566 ± 0.339
3.912ValLeu: 3.912 ± 0.235
1.264ValMet: 1.264 ± 0.181
4.034ValAsn: 4.034 ± 0.386
2.545ValPro: 2.545 ± 0.236
2.164ValGln: 2.164 ± 0.189
2.597ValArg: 2.597 ± 0.206
5.245ValSer: 5.245 ± 0.358
5.592ValThr: 5.592 ± 0.518
4.657ValVal: 4.657 ± 0.389
0.675ValTrp: 0.675 ± 0.096
2.199ValTyr: 2.199 ± 0.177
0.0ValXaa: 0.0 ± 0.0
Trp
0.9TrpAla: 0.9 ± 0.14
0.225TrpCys: 0.225 ± 0.072
0.987TrpAsp: 0.987 ± 0.145
0.762TrpGlu: 0.762 ± 0.138
0.519TrpPhe: 0.519 ± 0.112
0.762TrpGly: 0.762 ± 0.144
0.381TrpHis: 0.381 ± 0.093
0.658TrpIle: 0.658 ± 0.102
0.883TrpLys: 0.883 ± 0.171
0.744TrpLeu: 0.744 ± 0.127
0.381TrpMet: 0.381 ± 0.093
0.9TrpAsn: 0.9 ± 0.132
0.277TrpPro: 0.277 ± 0.065
0.45TrpGln: 0.45 ± 0.08
0.641TrpArg: 0.641 ± 0.112
0.796TrpSer: 0.796 ± 0.126
0.727TrpThr: 0.727 ± 0.102
0.883TrpVal: 0.883 ± 0.119
0.156TrpTrp: 0.156 ± 0.053
0.415TrpTyr: 0.415 ± 0.077
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.25TyrAla: 2.25 ± 0.166
0.519TyrCys: 0.519 ± 0.104
3.41TyrAsp: 3.41 ± 0.275
2.735TyrGlu: 2.735 ± 0.268
1.904TyrPhe: 1.904 ± 0.214
2.614TyrGly: 2.614 ± 0.194
0.727TyrHis: 0.727 ± 0.108
2.683TyrIle: 2.683 ± 0.257
2.268TyrLys: 2.268 ± 0.243
3.151TyrLeu: 3.151 ± 0.213
0.848TyrMet: 0.848 ± 0.165
2.804TyrAsn: 2.804 ± 0.268
1.887TyrPro: 1.887 ± 0.164
1.8TyrGln: 1.8 ± 0.19
2.372TyrArg: 2.372 ± 0.2
2.614TyrSer: 2.614 ± 0.208
2.839TyrThr: 2.839 ± 0.221
3.185TyrVal: 3.185 ± 0.231
0.554TyrTrp: 0.554 ± 0.138
1.887TyrTyr: 1.887 ± 0.179
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 226 proteins (57766 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski