Amino acid dipepetide frequency for Prochlorococcus phage Syn1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.781AlaAla: 5.781 ± 0.456
0.564AlaCys: 0.564 ± 0.115
4.074AlaAsp: 4.074 ± 0.252
3.833AlaGlu: 3.833 ± 0.287
2.818AlaPhe: 2.818 ± 0.187
6.087AlaGly: 6.087 ± 0.447
1.015AlaHis: 1.015 ± 0.106
3.639AlaIle: 3.639 ± 0.25
3.43AlaLys: 3.43 ± 0.36
4.815AlaLeu: 4.815 ± 0.317
1.401AlaMet: 1.401 ± 0.194
4.01AlaAsn: 4.01 ± 0.355
3.092AlaPro: 3.092 ± 0.31
2.126AlaGln: 2.126 ± 0.173
2.528AlaArg: 2.528 ± 0.19
5.652AlaSer: 5.652 ± 0.287
5.604AlaThr: 5.604 ± 0.384
4.203AlaVal: 4.203 ± 0.272
0.58AlaTrp: 0.58 ± 0.097
2.287AlaTyr: 2.287 ± 0.21
0.0AlaXaa: 0.0 ± 0.0
Cys
0.596CysAla: 0.596 ± 0.105
0.145CysCys: 0.145 ± 0.049
0.692CysAsp: 0.692 ± 0.141
0.612CysGlu: 0.612 ± 0.118
0.403CysPhe: 0.403 ± 0.101
0.789CysGly: 0.789 ± 0.152
0.225CysHis: 0.225 ± 0.059
0.451CysIle: 0.451 ± 0.074
0.531CysLys: 0.531 ± 0.097
0.66CysLeu: 0.66 ± 0.129
0.209CysMet: 0.209 ± 0.062
0.29CysAsn: 0.29 ± 0.079
0.338CysPro: 0.338 ± 0.086
0.403CysGln: 0.403 ± 0.079
0.322CysArg: 0.322 ± 0.084
0.548CysSer: 0.548 ± 0.094
0.419CysThr: 0.419 ± 0.104
0.612CysVal: 0.612 ± 0.089
0.113CysTrp: 0.113 ± 0.042
0.37CysTyr: 0.37 ± 0.066
0.0CysXaa: 0.0 ± 0.0
Asp
4.412AspAla: 4.412 ± 0.35
0.709AspCys: 0.709 ± 0.118
4.59AspAsp: 4.59 ± 0.356
3.929AspGlu: 3.929 ± 0.311
3.591AspPhe: 3.591 ± 0.257
5.846AspGly: 5.846 ± 0.442
0.918AspHis: 0.918 ± 0.141
4.445AspIle: 4.445 ± 0.257
3.35AspLys: 3.35 ± 0.299
4.863AspLeu: 4.863 ± 0.29
1.482AspMet: 1.482 ± 0.195
3.72AspAsn: 3.72 ± 0.274
3.092AspPro: 3.092 ± 0.274
1.965AspGln: 1.965 ± 0.204
2.399AspArg: 2.399 ± 0.188
4.573AspSer: 4.573 ± 0.391
4.267AspThr: 4.267 ± 0.264
4.38AspVal: 4.38 ± 0.261
1.031AspTrp: 1.031 ± 0.173
2.963AspTyr: 2.963 ± 0.25
0.0AspXaa: 0.0 ± 0.0
Glu
3.591GluAla: 3.591 ± 0.272
0.531GluCys: 0.531 ± 0.107
3.945GluAsp: 3.945 ± 0.285
4.267GluGlu: 4.267 ± 0.554
2.931GluPhe: 2.931 ± 0.228
4.123GluGly: 4.123 ± 0.231
1.031GluHis: 1.031 ± 0.155
4.396GluIle: 4.396 ± 0.3
3.35GluLys: 3.35 ± 0.431
4.944GluLeu: 4.944 ± 0.329
1.482GluMet: 1.482 ± 0.215
3.237GluAsn: 3.237 ± 0.242
1.659GluPro: 1.659 ± 0.173
2.77GluGln: 2.77 ± 0.213
2.287GluArg: 2.287 ± 0.227
3.833GluSer: 3.833 ± 0.287
3.929GluThr: 3.929 ± 0.321
4.267GluVal: 4.267 ± 0.247
0.789GluTrp: 0.789 ± 0.125
2.673GluTyr: 2.673 ± 0.221
0.0GluXaa: 0.0 ± 0.0
Phe
2.754PheAla: 2.754 ± 0.174
0.58PheCys: 0.58 ± 0.098
3.672PheAsp: 3.672 ± 0.277
2.641PheGlu: 2.641 ± 0.234
1.562PhePhe: 1.562 ± 0.157
3.333PheGly: 3.333 ± 0.241
0.612PheHis: 0.612 ± 0.113
2.866PheIle: 2.866 ± 0.197
2.464PheLys: 2.464 ± 0.25
3.06PheLeu: 3.06 ± 0.184
0.692PheMet: 0.692 ± 0.135
2.77PheAsn: 2.77 ± 0.226
1.53PhePro: 1.53 ± 0.148
1.707PheGln: 1.707 ± 0.186
1.691PheArg: 1.691 ± 0.223
3.752PheSer: 3.752 ± 0.259
3.543PheThr: 3.543 ± 0.268
3.172PheVal: 3.172 ± 0.227
0.386PheTrp: 0.386 ± 0.085
1.82PheTyr: 1.82 ± 0.167
0.0PheXaa: 0.0 ± 0.0
Gly
5.862GlyAla: 5.862 ± 0.506
0.531GlyCys: 0.531 ± 0.116
5.62GlyAsp: 5.62 ± 0.371
4.235GlyGlu: 4.235 ± 0.233
3.076GlyPhe: 3.076 ± 0.195
8.97GlyGly: 8.97 ± 0.952
0.934GlyHis: 0.934 ± 0.16
4.557GlyIle: 4.557 ± 0.303
3.704GlyLys: 3.704 ± 0.296
4.912GlyLeu: 4.912 ± 0.285
1.498GlyMet: 1.498 ± 0.181
5.234GlyAsn: 5.234 ± 0.572
1.965GlyPro: 1.965 ± 0.206
2.689GlyGln: 2.689 ± 0.268
3.221GlyArg: 3.221 ± 0.301
6.908GlySer: 6.908 ± 0.464
6.264GlyThr: 6.264 ± 0.505
5.427GlyVal: 5.427 ± 0.327
0.998GlyTrp: 0.998 ± 0.18
2.899GlyTyr: 2.899 ± 0.245
0.0GlyXaa: 0.0 ± 0.0
His
0.725HisAla: 0.725 ± 0.122
0.225HisCys: 0.225 ± 0.068
0.982HisAsp: 0.982 ± 0.144
0.853HisGlu: 0.853 ± 0.133
0.692HisPhe: 0.692 ± 0.106
1.063HisGly: 1.063 ± 0.175
0.354HisHis: 0.354 ± 0.08
0.95HisIle: 0.95 ± 0.128
1.095HisLys: 1.095 ± 0.158
0.918HisLeu: 0.918 ± 0.167
0.354HisMet: 0.354 ± 0.082
0.725HisAsn: 0.725 ± 0.132
0.709HisPro: 0.709 ± 0.122
0.483HisGln: 0.483 ± 0.105
0.596HisArg: 0.596 ± 0.097
0.95HisSer: 0.95 ± 0.141
1.047HisThr: 1.047 ± 0.182
0.773HisVal: 0.773 ± 0.133
0.242HisTrp: 0.242 ± 0.067
0.886HisTyr: 0.886 ± 0.152
0.0HisXaa: 0.0 ± 0.0
Ile
4.348IleAla: 4.348 ± 0.326
0.676IleCys: 0.676 ± 0.103
5.153IleAsp: 5.153 ± 0.277
4.461IleGlu: 4.461 ± 0.279
2.238IlePhe: 2.238 ± 0.188
4.267IleGly: 4.267 ± 0.29
0.692IleHis: 0.692 ± 0.127
3.961IleIle: 3.961 ± 0.295
4.155IleLys: 4.155 ± 0.34
4.187IleLeu: 4.187 ± 0.259
1.127IleMet: 1.127 ± 0.153
4.477IleAsn: 4.477 ± 0.278
2.85IlePro: 2.85 ± 0.243
2.496IleGln: 2.496 ± 0.192
2.689IleArg: 2.689 ± 0.205
4.751IleSer: 4.751 ± 0.322
5.604IleThr: 5.604 ± 0.426
3.929IleVal: 3.929 ± 0.296
0.435IleTrp: 0.435 ± 0.086
1.997IleTyr: 1.997 ± 0.198
0.0IleXaa: 0.0 ± 0.0
Lys
2.979LysAla: 2.979 ± 0.284
0.612LysCys: 0.612 ± 0.108
3.14LysAsp: 3.14 ± 0.31
3.591LysGlu: 3.591 ± 0.414
2.818LysPhe: 2.818 ± 0.262
3.172LysGly: 3.172 ± 0.247
0.853LysHis: 0.853 ± 0.155
3.768LysIle: 3.768 ± 0.268
4.139LysLys: 4.139 ± 0.557
4.155LysLeu: 4.155 ± 0.388
1.594LysMet: 1.594 ± 0.268
3.108LysAsn: 3.108 ± 0.295
1.884LysPro: 1.884 ± 0.231
2.335LysGln: 2.335 ± 0.279
2.448LysArg: 2.448 ± 0.233
3.752LysSer: 3.752 ± 0.269
3.752LysThr: 3.752 ± 0.295
3.672LysVal: 3.672 ± 0.24
0.644LysTrp: 0.644 ± 0.112
2.625LysTyr: 2.625 ± 0.269
0.0LysXaa: 0.0 ± 0.0
Leu
4.734LeuAla: 4.734 ± 0.263
0.596LeuCys: 0.596 ± 0.117
5.25LeuAsp: 5.25 ± 0.28
4.606LeuGlu: 4.606 ± 0.293
2.818LeuPhe: 2.818 ± 0.24
4.734LeuGly: 4.734 ± 0.32
1.433LeuHis: 1.433 ± 0.158
3.978LeuIle: 3.978 ± 0.242
4.3LeuLys: 4.3 ± 0.418
5.04LeuLeu: 5.04 ± 0.43
1.852LeuMet: 1.852 ± 0.219
4.622LeuAsn: 4.622 ± 0.256
3.189LeuPro: 3.189 ± 0.209
2.722LeuGln: 2.722 ± 0.237
3.462LeuArg: 3.462 ± 0.272
5.137LeuSer: 5.137 ± 0.274
5.556LeuThr: 5.556 ± 0.421
4.67LeuVal: 4.67 ± 0.278
0.612LeuTrp: 0.612 ± 0.102
2.883LeuTyr: 2.883 ± 0.236
0.0LeuXaa: 0.0 ± 0.0
Met
1.369MetAla: 1.369 ± 0.22
0.193MetCys: 0.193 ± 0.063
1.127MetAsp: 1.127 ± 0.186
0.95MetGlu: 0.95 ± 0.148
0.821MetPhe: 0.821 ± 0.143
1.32MetGly: 1.32 ± 0.177
0.29MetHis: 0.29 ± 0.083
1.304MetIle: 1.304 ± 0.177
1.514MetLys: 1.514 ± 0.212
1.594MetLeu: 1.594 ± 0.238
0.725MetMet: 0.725 ± 0.133
1.24MetAsn: 1.24 ± 0.163
1.015MetPro: 1.015 ± 0.126
0.821MetGln: 0.821 ± 0.156
0.982MetArg: 0.982 ± 0.125
1.514MetSer: 1.514 ± 0.217
1.675MetThr: 1.675 ± 0.251
1.143MetVal: 1.143 ± 0.155
0.354MetTrp: 0.354 ± 0.085
0.692MetTyr: 0.692 ± 0.137
0.0MetXaa: 0.0 ± 0.0
Asn
4.267AsnAla: 4.267 ± 0.392
0.451AsnCys: 0.451 ± 0.082
3.333AsnAsp: 3.333 ± 0.275
3.172AsnGlu: 3.172 ± 0.209
3.06AsnPhe: 3.06 ± 0.259
4.976AsnGly: 4.976 ± 0.43
0.789AsnHis: 0.789 ± 0.123
4.348AsnIle: 4.348 ± 0.277
3.011AsnLys: 3.011 ± 0.245
4.863AsnLeu: 4.863 ± 0.266
0.934AsnMet: 0.934 ± 0.149
3.704AsnAsn: 3.704 ± 0.325
3.398AsnPro: 3.398 ± 0.241
2.142AsnGln: 2.142 ± 0.214
2.19AsnArg: 2.19 ± 0.17
4.187AsnSer: 4.187 ± 0.318
4.734AsnThr: 4.734 ± 0.445
3.849AsnVal: 3.849 ± 0.306
0.66AsnTrp: 0.66 ± 0.103
2.528AsnTyr: 2.528 ± 0.269
0.0AsnXaa: 0.0 ± 0.0
Pro
2.85ProAla: 2.85 ± 0.303
0.209ProCys: 0.209 ± 0.059
3.189ProAsp: 3.189 ± 0.276
2.673ProGlu: 2.673 ± 0.186
1.707ProPhe: 1.707 ± 0.183
3.027ProGly: 3.027 ± 0.268
0.564ProHis: 0.564 ± 0.118
2.319ProIle: 2.319 ± 0.254
1.949ProLys: 1.949 ± 0.196
2.915ProLeu: 2.915 ± 0.231
0.564ProMet: 0.564 ± 0.129
2.544ProAsn: 2.544 ± 0.227
1.82ProPro: 1.82 ± 0.228
1.578ProGln: 1.578 ± 0.16
1.369ProArg: 1.369 ± 0.193
2.883ProSer: 2.883 ± 0.211
3.301ProThr: 3.301 ± 0.277
2.722ProVal: 2.722 ± 0.262
0.548ProTrp: 0.548 ± 0.095
1.514ProTyr: 1.514 ± 0.214
0.0ProXaa: 0.0 ± 0.0
Gln
2.142GlnAla: 2.142 ± 0.172
0.225GlnCys: 0.225 ± 0.062
1.949GlnAsp: 1.949 ± 0.177
2.625GlnGlu: 2.625 ± 0.266
1.755GlnPhe: 1.755 ± 0.16
2.689GlnGly: 2.689 ± 0.187
0.548GlnHis: 0.548 ± 0.093
2.818GlnIle: 2.818 ± 0.254
2.255GlnLys: 2.255 ± 0.282
2.85GlnLeu: 2.85 ± 0.205
0.805GlnMet: 0.805 ± 0.127
2.19GlnAsn: 2.19 ± 0.182
1.208GlnPro: 1.208 ± 0.137
1.659GlnGln: 1.659 ± 0.186
1.61GlnArg: 1.61 ± 0.155
2.448GlnSer: 2.448 ± 0.216
2.496GlnThr: 2.496 ± 0.279
2.464GlnVal: 2.464 ± 0.199
0.451GlnTrp: 0.451 ± 0.098
1.723GlnTyr: 1.723 ± 0.149
0.0GlnXaa: 0.0 ± 0.0
Arg
2.544ArgAla: 2.544 ± 0.19
0.386ArgCys: 0.386 ± 0.082
2.416ArgAsp: 2.416 ± 0.216
2.238ArgGlu: 2.238 ± 0.233
2.045ArgPhe: 2.045 ± 0.173
2.802ArgGly: 2.802 ± 0.224
0.692ArgHis: 0.692 ± 0.139
2.85ArgIle: 2.85 ± 0.259
2.657ArgLys: 2.657 ± 0.279
2.963ArgLeu: 2.963 ± 0.208
1.095ArgMet: 1.095 ± 0.173
1.981ArgAsn: 1.981 ± 0.211
1.465ArgPro: 1.465 ± 0.147
1.337ArgGln: 1.337 ± 0.165
2.287ArgArg: 2.287 ± 0.326
2.738ArgSer: 2.738 ± 0.205
2.351ArgThr: 2.351 ± 0.23
2.657ArgVal: 2.657 ± 0.196
0.58ArgTrp: 0.58 ± 0.123
2.077ArgTyr: 2.077 ± 0.201
0.0ArgXaa: 0.0 ± 0.0
Ser
5.395SerAla: 5.395 ± 0.234
0.515SerCys: 0.515 ± 0.117
4.879SerAsp: 4.879 ± 0.264
3.414SerGlu: 3.414 ± 0.239
3.623SerPhe: 3.623 ± 0.258
7.81SerGly: 7.81 ± 0.552
1.031SerHis: 1.031 ± 0.161
4.928SerIle: 4.928 ± 0.311
3.43SerLys: 3.43 ± 0.312
5.201SerLeu: 5.201 ± 0.3
1.369SerMet: 1.369 ± 0.162
4.606SerAsn: 4.606 ± 0.361
3.108SerPro: 3.108 ± 0.206
2.738SerGln: 2.738 ± 0.205
2.399SerArg: 2.399 ± 0.194
5.54SerSer: 5.54 ± 0.488
5.443SerThr: 5.443 ± 0.369
4.815SerVal: 4.815 ± 0.315
0.676SerTrp: 0.676 ± 0.097
2.673SerTyr: 2.673 ± 0.181
0.0SerXaa: 0.0 ± 0.0
Thr
5.234ThrAla: 5.234 ± 0.334
0.66ThrCys: 0.66 ± 0.117
4.493ThrAsp: 4.493 ± 0.276
4.235ThrGlu: 4.235 ± 0.318
3.478ThrPhe: 3.478 ± 0.378
6.586ThrGly: 6.586 ± 0.517
0.886ThrHis: 0.886 ± 0.145
5.572ThrIle: 5.572 ± 0.567
3.366ThrLys: 3.366 ± 0.295
6.152ThrLeu: 6.152 ± 0.404
1.015ThrMet: 1.015 ± 0.132
4.783ThrAsn: 4.783 ± 0.444
3.72ThrPro: 3.72 ± 0.246
2.963ThrGln: 2.963 ± 0.198
2.464ThrArg: 2.464 ± 0.192
5.234ThrSer: 5.234 ± 0.369
5.733ThrThr: 5.733 ± 0.478
5.91ThrVal: 5.91 ± 0.41
0.628ThrTrp: 0.628 ± 0.11
3.044ThrTyr: 3.044 ± 0.215
0.0ThrXaa: 0.0 ± 0.0
Val
4.976ValAla: 4.976 ± 0.273
0.419ValCys: 0.419 ± 0.078
4.38ValAsp: 4.38 ± 0.288
4.38ValGlu: 4.38 ± 0.25
2.802ValPhe: 2.802 ± 0.225
5.008ValGly: 5.008 ± 0.333
0.934ValHis: 0.934 ± 0.165
4.042ValIle: 4.042 ± 0.263
3.559ValLys: 3.559 ± 0.335
4.412ValLeu: 4.412 ± 0.244
1.385ValMet: 1.385 ± 0.186
3.849ValAsn: 3.849 ± 0.289
2.464ValPro: 2.464 ± 0.259
1.997ValGln: 1.997 ± 0.189
2.738ValArg: 2.738 ± 0.195
5.588ValSer: 5.588 ± 0.309
6.361ValThr: 6.361 ± 0.485
5.057ValVal: 5.057 ± 0.338
0.548ValTrp: 0.548 ± 0.112
2.383ValTyr: 2.383 ± 0.239
0.0ValXaa: 0.0 ± 0.0
Trp
0.66TrpAla: 0.66 ± 0.105
0.081TrpCys: 0.081 ± 0.037
0.709TrpAsp: 0.709 ± 0.116
0.789TrpGlu: 0.789 ± 0.127
0.37TrpPhe: 0.37 ± 0.083
0.676TrpGly: 0.676 ± 0.127
0.306TrpHis: 0.306 ± 0.078
0.805TrpIle: 0.805 ± 0.114
0.757TrpLys: 0.757 ± 0.116
0.612TrpLeu: 0.612 ± 0.114
0.258TrpMet: 0.258 ± 0.077
0.773TrpAsn: 0.773 ± 0.123
0.097TrpPro: 0.097 ± 0.04
0.403TrpGln: 0.403 ± 0.076
0.644TrpArg: 0.644 ± 0.095
0.821TrpSer: 0.821 ± 0.129
0.934TrpThr: 0.934 ± 0.134
0.548TrpVal: 0.548 ± 0.111
0.081TrpTrp: 0.081 ± 0.035
0.467TrpTyr: 0.467 ± 0.079
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.319TyrAla: 2.319 ± 0.214
0.419TyrCys: 0.419 ± 0.104
2.931TyrAsp: 2.931 ± 0.209
2.528TyrGlu: 2.528 ± 0.226
1.997TyrPhe: 1.997 ± 0.232
2.222TyrGly: 2.222 ± 0.203
0.515TyrHis: 0.515 ± 0.102
2.496TyrIle: 2.496 ± 0.26
2.029TyrLys: 2.029 ± 0.201
3.076TyrLeu: 3.076 ± 0.266
0.789TyrMet: 0.789 ± 0.136
2.738TyrAsn: 2.738 ± 0.194
1.643TyrPro: 1.643 ± 0.171
1.578TyrGln: 1.578 ± 0.154
1.852TyrArg: 1.852 ± 0.187
2.786TyrSer: 2.786 ± 0.248
3.14TyrThr: 3.14 ± 0.274
3.011TyrVal: 3.011 ± 0.222
0.435TyrTrp: 0.435 ± 0.085
1.836TyrTyr: 1.836 ± 0.232
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 234 proteins (62099 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski