Amino acid dipeptide frequency for Ostreococcus mediterraneus virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
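The calculation described above can be sketched in Python. This is a minimal illustration, not the code used by Proteome-pI: the function names `dipeptide_per_mille` and `classify` are hypothetical, and it assumes that overlapping residue pairs are counted within each protein and normalized by the total number of pairs, which the page does not state.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)
UNIFORM_PER_MILLE = 1000.0 / 400   # 2.5 per mille, i.e. 0.25% per dipeptide


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())  # assumed normalization: total number of pairs
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "as expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["MKLLA", "MKA"])  # toy sequences, not real data
    print(freqs["MK"], classify(freqs["MK"]))
```

With this convention, a value such as 4.961‰ for AlaAla would be classified as overrepresented and 0.163‰ for TrpTrp as underrepresented, relative to the uniform 2.5‰ baseline.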

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.961 ± 0.358
AlaCys: 1.187 ± 0.21
AlaAsp: 3.009 ± 0.322
AlaGlu: 3.139 ± 0.301
AlaPhe: 2.001 ± 0.162
AlaGly: 4.066 ± 0.605
AlaHis: 1.122 ± 0.13
AlaIle: 3.79 ± 0.219
AlaLys: 4.684 ± 0.693
AlaLeu: 5.156 ± 0.352
AlaMet: 1.854 ± 0.163
AlaAsn: 2.928 ± 0.572
AlaPro: 2.001 ± 0.254
AlaGln: 2.7 ± 0.347
AlaArg: 3.188 ± 0.261
AlaSer: 3.578 ± 0.226
AlaThr: 4.213 ± 0.473
AlaVal: 4.278 ± 0.667
AlaTrp: 0.504 ± 0.091
AlaTyr: 2.115 ± 0.201
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.976 ± 0.13
CysCys: 0.504 ± 0.103
CysAsp: 1.204 ± 0.169
CysGlu: 1.594 ± 0.21
CysPhe: 0.716 ± 0.123
CysGly: 1.22 ± 0.186
CysHis: 0.472 ± 0.09
CysIle: 0.748 ± 0.131
CysLys: 1.236 ± 0.213
CysLeu: 1.252 ± 0.156
CysMet: 0.586 ± 0.097
CysAsn: 0.586 ± 0.102
CysPro: 1.252 ± 0.261
CysGln: 0.39 ± 0.081
CysArg: 1.025 ± 0.169
CysSer: 1.057 ± 0.164
CysThr: 0.797 ± 0.133
CysVal: 0.976 ± 0.151
CysTrp: 0.065 ± 0.04
CysTyr: 0.716 ± 0.121
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.448 ± 0.225
AspCys: 1.106 ± 0.155
AspAsp: 3.969 ± 0.316
AspGlu: 4.652 ± 0.303
AspPhe: 2.586 ± 0.209
AspGly: 3.79 ± 0.293
AspHis: 1.155 ± 0.158
AspIle: 4.18 ± 0.297
AspLys: 3.53 ± 0.241
AspLeu: 5.026 ± 0.278
AspMet: 1.594 ± 0.211
AspAsn: 2.456 ± 0.228
AspPro: 2.358 ± 0.241
AspGln: 1.334 ± 0.154
AspArg: 2.977 ± 0.193
AspSer: 2.44 ± 0.258
AspThr: 3.611 ± 0.345
AspVal: 4.359 ± 0.369
AspTrp: 0.667 ± 0.111
AspTyr: 2.83 ± 0.236
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.164 ± 0.489
GluCys: 1.155 ± 0.173
GluAsp: 3.839 ± 0.319
GluGlu: 5.807 ± 0.482
GluPhe: 2.944 ± 0.225
GluGly: 3.286 ± 0.267
GluHis: 1.61 ± 0.197
GluIle: 4.392 ± 0.363
GluLys: 5.01 ± 0.345
GluLeu: 5.579 ± 0.376
GluMet: 2.049 ± 0.182
GluAsn: 3.318 ± 0.21
GluPro: 2.407 ± 0.25
GluGln: 1.984 ± 0.198
GluArg: 3.806 ± 0.325
GluSer: 3.139 ± 0.273
GluThr: 3.757 ± 0.294
GluVal: 4.083 ± 0.289
GluTrp: 0.797 ± 0.111
GluTyr: 2.814 ± 0.207
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.521 ± 0.197
PheCys: 0.781 ± 0.125
PheAsp: 2.977 ± 0.26
PheGlu: 2.456 ± 0.27
PhePhe: 2.033 ± 0.265
PheGly: 2.44 ± 0.22
PheHis: 0.927 ± 0.137
PheIle: 2.765 ± 0.296
PheLys: 2.7 ± 0.222
PheLeu: 3.399 ± 0.271
PheMet: 1.561 ± 0.172
PheAsn: 2.31 ± 0.211
PhePro: 1.415 ± 0.188
PheGln: 1.106 ± 0.147
PheArg: 1.805 ± 0.183
PheSer: 2.781 ± 0.248
PheThr: 2.098 ± 0.229
PheVal: 3.286 ± 0.284
PheTrp: 0.374 ± 0.08
PheTyr: 1.887 ± 0.183
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.887 ± 0.377
GlyCys: 1.139 ± 0.162
GlyAsp: 3.741 ± 0.295
GlyGlu: 3.953 ± 0.25
GlyPhe: 2.716 ± 0.269
GlyGly: 5.351 ± 0.679
GlyHis: 1.448 ± 0.174
GlyIle: 3.709 ± 0.266
GlyLys: 4.359 ± 0.261
GlyLeu: 5.303 ± 0.378
GlyMet: 1.464 ± 0.184
GlyAsn: 3.839 ± 0.673
GlyPro: 2.147 ± 0.23
GlyGln: 2.147 ± 0.269
GlyArg: 3.107 ± 0.34
GlySer: 3.757 ± 0.361
GlyThr: 4.05 ± 0.434
GlyVal: 4.131 ± 0.32
GlyTrp: 0.764 ± 0.111
GlyTyr: 2.814 ± 0.217
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.155 ± 0.145
HisCys: 0.504 ± 0.094
HisAsp: 0.943 ± 0.125
HisGlu: 1.301 ± 0.175
HisPhe: 0.911 ± 0.121
HisGly: 1.496 ± 0.184
HisHis: 0.618 ± 0.099
HisIle: 1.61 ± 0.196
HisLys: 1.496 ± 0.184
HisLeu: 1.692 ± 0.166
HisMet: 0.602 ± 0.108
HisAsn: 1.041 ± 0.157
HisPro: 1.008 ± 0.148
HisGln: 0.667 ± 0.117
HisArg: 0.992 ± 0.126
HisSer: 1.106 ± 0.149
HisThr: 1.578 ± 0.184
HisVal: 1.301 ± 0.182
HisTrp: 0.455 ± 0.09
HisTyr: 0.83 ± 0.129
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.497 ± 0.241
IleCys: 0.83 ± 0.152
IleAsp: 3.904 ± 0.299
IleGlu: 4.294 ± 0.277
IlePhe: 2.375 ± 0.278
IleGly: 3.855 ± 0.28
IleHis: 1.724 ± 0.197
IleIle: 3.318 ± 0.33
IleLys: 5.335 ± 0.355
IleLeu: 5.189 ± 0.341
IleMet: 1.561 ± 0.177
IleAsn: 3.139 ± 0.285
IlePro: 2.749 ± 0.254
IleGln: 2.407 ± 0.177
IleArg: 2.814 ± 0.213
IleSer: 3.107 ± 0.276
IleThr: 3.741 ± 0.257
IleVal: 4.457 ± 0.345
IleTrp: 0.52 ± 0.073
IleTyr: 2.131 ± 0.18
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.839 ± 0.519
LysCys: 1.545 ± 0.216
LysAsp: 4.083 ± 0.305
LysGlu: 4.88 ± 0.391
LysPhe: 3.172 ± 0.298
LysGly: 3.399 ± 0.217
LysHis: 1.431 ± 0.172
LysIle: 4.961 ± 0.271
LysLys: 6.766 ± 0.676
LysLeu: 5.839 ± 0.421
LysMet: 2.44 ± 0.239
LysAsn: 5.303 ± 0.718
LysPro: 2.684 ± 0.238
LysGln: 2.472 ± 0.253
LysArg: 4.457 ± 0.506
LysSer: 4.001 ± 0.271
LysThr: 5.449 ± 0.323
LysVal: 4.587 ± 0.277
LysTrp: 0.602 ± 0.093
LysTyr: 3.253 ± 0.282
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.554 ± 0.257
LeuCys: 1.318 ± 0.153
LeuAsp: 5.091 ± 0.319
LeuGlu: 5.4 ± 0.335
LeuPhe: 2.749 ± 0.235
LeuGly: 4.587 ± 0.286
LeuHis: 1.318 ± 0.141
LeuIle: 4.928 ± 0.298
LeuLys: 6.62 ± 0.552
LeuLeu: 6.181 ± 0.355
LeuMet: 1.887 ± 0.18
LeuAsn: 5.091 ± 0.953
LeuPro: 3.334 ± 0.226
LeuGln: 2.619 ± 0.261
LeuArg: 4.798 ± 0.299
LeuSer: 4.977 ± 0.271
LeuThr: 5.059 ± 0.38
LeuVal: 5.725 ± 0.37
LeuTrp: 0.797 ± 0.14
LeuTyr: 2.7 ± 0.261
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.627 ± 0.183
MetCys: 0.586 ± 0.113
MetAsp: 1.61 ± 0.177
MetGlu: 1.659 ± 0.159
MetPhe: 1.318 ± 0.139
MetGly: 1.545 ± 0.173
MetHis: 0.618 ± 0.118
MetIle: 1.578 ± 0.175
MetLys: 2.602 ± 0.227
MetLeu: 1.854 ± 0.234
MetMet: 1.057 ± 0.17
MetAsn: 1.903 ± 0.292
MetPro: 0.927 ± 0.117
MetGln: 0.634 ± 0.088
MetArg: 1.578 ± 0.18
MetSer: 2.424 ± 0.226
MetThr: 1.578 ± 0.17
MetVal: 1.659 ± 0.159
MetTrp: 0.439 ± 0.084
MetTyr: 1.529 ± 0.182
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.213 ± 1.017
AsnCys: 0.537 ± 0.104
AsnAsp: 2.651 ± 0.218
AsnGlu: 2.668 ± 0.237
AsnPhe: 2.83 ± 0.236
AsnGly: 3.448 ± 0.337
AsnHis: 1.187 ± 0.14
AsnIle: 3.92 ± 0.278
AsnLys: 4.001 ± 0.695
AsnLeu: 4.603 ± 0.439
AsnMet: 1.659 ± 0.168
AsnAsn: 3.497 ± 0.412
AsnPro: 2.326 ± 0.244
AsnGln: 2.147 ± 0.328
AsnArg: 2.977 ± 0.656
AsnSer: 3.09 ± 0.327
AsnThr: 4.408 ± 0.651
AsnVal: 4.961 ± 0.944
AsnTrp: 0.569 ± 0.096
AsnTyr: 2.326 ± 0.241
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.163 ± 0.257
ProCys: 0.699 ± 0.137
ProAsp: 2.31 ± 0.207
ProGlu: 3.188 ± 0.277
ProPhe: 1.627 ± 0.17
ProGly: 2.537 ± 0.216
ProHis: 0.764 ± 0.099
ProIle: 2.245 ± 0.218
ProLys: 3.334 ± 0.283
ProLeu: 2.342 ± 0.262
ProMet: 1.155 ± 0.135
ProAsn: 2.342 ± 0.234
ProPro: 2.489 ± 0.317
ProGln: 1.952 ± 0.176
ProArg: 1.789 ± 0.245
ProSer: 2.733 ± 0.316
ProThr: 2.993 ± 0.272
ProVal: 2.684 ± 0.242
ProTrp: 0.374 ± 0.069
ProTyr: 1.496 ± 0.175
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.643 ± 0.21
GlnCys: 0.569 ± 0.089
GlnAsp: 1.838 ± 0.156
GlnGlu: 2.472 ± 0.25
GlnPhe: 1.399 ± 0.198
GlnGly: 2.293 ± 0.567
GlnHis: 0.569 ± 0.104
GlnIle: 1.903 ± 0.178
GlnLys: 2.651 ± 0.262
GlnLeu: 3.123 ± 0.226
GlnMet: 1.334 ± 0.159
GlnAsn: 1.919 ± 0.295
GlnPro: 1.627 ± 0.177
GlnGln: 1.415 ± 0.134
GlnArg: 2.098 ± 0.261
GlnSer: 1.822 ± 0.161
GlnThr: 2.033 ± 0.223
GlnVal: 2.098 ± 0.221
GlnTrp: 0.39 ± 0.092
GlnTyr: 1.334 ± 0.154
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.09 ± 0.355
ArgCys: 0.846 ± 0.163
ArgAsp: 3.383 ± 0.241
ArgGlu: 3.725 ± 0.389
ArgPhe: 2.131 ± 0.179
ArgGly: 3.139 ± 0.245
ArgHis: 1.171 ± 0.143
ArgIle: 3.074 ± 0.243
ArgLys: 3.904 ± 0.41
ArgLeu: 3.985 ± 0.303
ArgMet: 1.578 ± 0.194
ArgAsn: 2.993 ± 0.454
ArgPro: 2.147 ± 0.233
ArgGln: 1.984 ± 0.198
ArgArg: 3.204 ± 0.31
ArgSer: 2.96 ± 0.218
ArgThr: 2.391 ± 0.199
ArgVal: 3.676 ± 0.26
ArgTrp: 0.586 ± 0.103
ArgTyr: 1.968 ± 0.208
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.692 ± 0.346
SerCys: 0.732 ± 0.123
SerAsp: 3.058 ± 0.22
SerGlu: 3.985 ± 0.334
SerPhe: 2.586 ± 0.214
SerGly: 4.766 ± 0.433
SerHis: 1.155 ± 0.159
SerIle: 3.416 ± 0.285
SerLys: 3.92 ± 0.26
SerLeu: 4.636 ± 0.343
SerMet: 1.48 ± 0.17
SerAsn: 4.262 ± 0.571
SerPro: 2.342 ± 0.26
SerGln: 2.326 ± 0.24
SerArg: 2.424 ± 0.22
SerSer: 3.855 ± 0.423
SerThr: 3.887 ± 0.324
SerVal: 3.757 ± 0.362
SerTrp: 0.716 ± 0.123
SerTyr: 2.082 ± 0.208
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.855 ± 0.556
ThrCys: 1.008 ± 0.151
ThrAsp: 3.465 ± 0.342
ThrGlu: 3.334 ± 0.27
ThrPhe: 2.472 ± 0.217
ThrGly: 4.457 ± 0.481
ThrHis: 1.513 ± 0.196
ThrIle: 3.627 ± 0.272
ThrLys: 4.571 ± 0.275
ThrLeu: 5.172 ± 0.295
ThrMet: 1.496 ± 0.171
ThrAsn: 4.359 ± 0.645
ThrPro: 3.188 ± 0.239
ThrGln: 2.456 ± 0.187
ThrArg: 3.318 ± 0.238
ThrSer: 4.359 ± 0.413
ThrThr: 3.969 ± 0.469
ThrVal: 3.79 ± 0.292
ThrTrp: 0.716 ± 0.113
ThrTyr: 2.245 ± 0.204
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.408 ± 0.445
ValCys: 1.561 ± 0.195
ValAsp: 4.034 ± 0.269
ValGlu: 4.05 ± 0.274
ValPhe: 2.733 ± 0.246
ValGly: 4.815 ± 0.635
ValHis: 1.35 ± 0.16
ValIle: 3.481 ± 0.307
ValLys: 4.668 ± 0.282
ValLeu: 4.75 ± 0.35
ValMet: 1.692 ± 0.182
ValAsn: 3.546 ± 0.459
ValPro: 3.286 ± 0.279
ValGln: 2.342 ± 0.236
ValArg: 3.497 ± 0.246
ValSer: 4.44 ± 0.483
ValThr: 4.571 ± 0.509
ValVal: 4.457 ± 0.385
ValTrp: 0.943 ± 0.146
ValTyr: 3.123 ± 0.407
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.423 ± 0.077
TrpCys: 0.277 ± 0.066
TrpAsp: 0.586 ± 0.1
TrpGlu: 0.716 ± 0.113
TrpPhe: 0.52 ± 0.11
TrpGly: 0.732 ± 0.107
TrpHis: 0.309 ± 0.099
TrpIle: 0.797 ± 0.129
TrpLys: 0.862 ± 0.126
TrpLeu: 0.813 ± 0.116
TrpMet: 0.244 ± 0.068
TrpAsn: 0.732 ± 0.106
TrpPro: 0.293 ± 0.066
TrpGln: 0.309 ± 0.07
TrpArg: 0.423 ± 0.088
TrpSer: 0.846 ± 0.124
TrpThr: 0.504 ± 0.101
TrpVal: 0.781 ± 0.135
TrpTrp: 0.163 ± 0.05
TrpTyr: 0.407 ± 0.086
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.358 ± 0.254
TyrCys: 0.488 ± 0.097
TyrAsp: 2.375 ± 0.187
TyrGlu: 2.668 ± 0.27
TyrPhe: 1.805 ± 0.176
TyrGly: 2.602 ± 0.24
TyrHis: 0.846 ± 0.141
TyrIle: 2.505 ± 0.224
TyrLys: 2.912 ± 0.207
TyrLeu: 3.839 ± 0.229
TyrMet: 1.431 ± 0.151
TyrAsn: 2.456 ± 0.202
TyrPro: 1.236 ± 0.157
TyrGln: 1.139 ± 0.137
TyrArg: 1.659 ± 0.168
TyrSer: 2.602 ± 0.273
TyrThr: 2.684 ± 0.225
TyrVal: 2.7 ± 0.325
TyrTrp: 0.293 ± 0.059
TyrTyr: 1.594 ± 0.193
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 249 proteins (61481 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
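As a rough illustration of what bootstrapping at the protein level can look like, the sketch below resamples whole proteins with replacement 100 times and reports the spread of one dipeptide's frequency across the replicates. It is not the Proteome-pI implementation: the names `dipeptide_per_mille_single` and `bootstrap_error` are hypothetical, and using the sample standard deviation of the replicates as the reported error is an assumption.

```python
import random
import statistics


def dipeptide_per_mille_single(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_per_mille_single(sample, dipeptide))
    # Assumption: the error is the standard deviation over the replicates.
    return statistics.stdev(replicates)


if __name__ == "__main__":
    toy = ["MKKLA", "MKAAL", "MAAAK", "MLLKK"]  # toy sequences, not real data
    print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the estimate reflects variation between proteins.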

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski