Amino acid dipepetide frequency for Bathycoccus sp. RCC1105 virus BpV2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.663AlaAla: 2.663 ± 0.404
0.795AlaCys: 0.795 ± 0.113
2.524AlaAsp: 2.524 ± 0.217
2.334AlaGlu: 2.334 ± 0.269
2.196AlaPhe: 2.196 ± 0.215
3.233AlaGly: 3.233 ± 0.435
0.934AlaHis: 0.934 ± 0.112
3.665AlaIle: 3.665 ± 0.492
3.544AlaLys: 3.544 ± 0.428
4.063AlaLeu: 4.063 ± 0.305
1.089AlaMet: 1.089 ± 0.139
3.112AlaAsn: 3.112 ± 0.351
1.746AlaPro: 1.746 ± 0.25
1.625AlaGln: 1.625 ± 0.187
2.161AlaArg: 2.161 ± 0.166
3.769AlaSer: 3.769 ± 0.572
3.406AlaThr: 3.406 ± 0.367
2.559AlaVal: 2.559 ± 0.241
0.605AlaTrp: 0.605 ± 0.106
2.386AlaTyr: 2.386 ± 0.468
0.0AlaXaa: 0.0 ± 0.0
Cys
0.847CysAla: 0.847 ± 0.132
0.311CysCys: 0.311 ± 0.103
0.83CysAsp: 0.83 ± 0.157
0.986CysGlu: 0.986 ± 0.15
0.588CysPhe: 0.588 ± 0.112
0.986CysGly: 0.986 ± 0.139
0.242CysHis: 0.242 ± 0.071
1.072CysIle: 1.072 ± 0.153
1.383CysLys: 1.383 ± 0.234
0.864CysLeu: 0.864 ± 0.143
0.571CysMet: 0.571 ± 0.107
0.916CysAsn: 0.916 ± 0.144
0.709CysPro: 0.709 ± 0.141
0.277CysGln: 0.277 ± 0.073
0.64CysArg: 0.64 ± 0.11
0.951CysSer: 0.951 ± 0.147
0.882CysThr: 0.882 ± 0.151
0.864CysVal: 0.864 ± 0.135
0.121CysTrp: 0.121 ± 0.048
0.484CysTyr: 0.484 ± 0.107
0.0CysXaa: 0.0 ± 0.0
Asp
3.043AspAla: 3.043 ± 0.333
0.571AspCys: 0.571 ± 0.112
4.322AspAsp: 4.322 ± 0.325
4.651AspGlu: 4.651 ± 0.43
2.749AspPhe: 2.749 ± 0.264
4.737AspGly: 4.737 ± 0.581
0.795AspHis: 0.795 ± 0.105
4.893AspIle: 4.893 ± 0.274
3.942AspLys: 3.942 ± 0.396
4.686AspLeu: 4.686 ± 0.293
1.435AspMet: 1.435 ± 0.16
2.922AspAsn: 2.922 ± 0.29
2.472AspPro: 2.472 ± 0.229
1.297AspGln: 1.297 ± 0.133
2.213AspArg: 2.213 ± 0.186
3.25AspSer: 3.25 ± 0.326
5.17AspThr: 5.17 ± 0.373
3.838AspVal: 3.838 ± 0.294
0.778AspTrp: 0.778 ± 0.115
2.974AspTyr: 2.974 ± 0.388
0.0AspXaa: 0.0 ± 0.0
Glu
2.697GluAla: 2.697 ± 0.339
1.158GluCys: 1.158 ± 0.183
3.873GluAsp: 3.873 ± 0.362
4.461GluGlu: 4.461 ± 0.559
2.957GluPhe: 2.957 ± 0.287
3.147GluGly: 3.147 ± 0.326
1.279GluHis: 1.279 ± 0.153
4.271GluIle: 4.271 ± 0.343
5.602GluLys: 5.602 ± 0.59
5.118GluLeu: 5.118 ± 0.415
1.936GluMet: 1.936 ± 0.194
4.201GluAsn: 4.201 ± 0.394
2.3GluPro: 2.3 ± 0.238
1.919GluGln: 1.919 ± 0.232
2.334GluArg: 2.334 ± 0.31
3.423GluSer: 3.423 ± 0.231
4.443GluThr: 4.443 ± 0.336
3.043GluVal: 3.043 ± 0.254
0.657GluTrp: 0.657 ± 0.091
3.199GluTyr: 3.199 ± 0.231
0.0GluXaa: 0.0 ± 0.0
Phe
1.833PheAla: 1.833 ± 0.189
0.761PheCys: 0.761 ± 0.108
3.06PheAsp: 3.06 ± 0.273
2.369PheGlu: 2.369 ± 0.263
2.092PhePhe: 2.092 ± 0.277
3.078PheGly: 3.078 ± 0.411
0.83PheHis: 0.83 ± 0.115
3.164PheIle: 3.164 ± 0.282
3.493PheLys: 3.493 ± 0.24
2.87PheLeu: 2.87 ± 0.22
1.47PheMet: 1.47 ± 0.188
2.593PheAsn: 2.593 ± 0.264
1.124PhePro: 1.124 ± 0.133
0.778PheGln: 0.778 ± 0.113
1.85PheArg: 1.85 ± 0.232
3.25PheSer: 3.25 ± 0.24
2.939PheThr: 2.939 ± 0.366
2.438PheVal: 2.438 ± 0.198
0.363PheTrp: 0.363 ± 0.072
1.902PheTyr: 1.902 ± 0.166
0.0PheXaa: 0.0 ± 0.0
Gly
3.527GlyAla: 3.527 ± 0.781
0.882GlyCys: 0.882 ± 0.117
5.221GlyAsp: 5.221 ± 0.833
3.631GlyGlu: 3.631 ± 0.254
2.732GlyPhe: 2.732 ± 0.29
7.037GlyGly: 7.037 ± 1.639
1.141GlyHis: 1.141 ± 0.146
4.219GlyIle: 4.219 ± 0.293
4.72GlyLys: 4.72 ± 0.379
4.443GlyLeu: 4.443 ± 0.304
1.66GlyMet: 1.66 ± 0.148
4.582GlyAsn: 4.582 ± 0.59
1.764GlyPro: 1.764 ± 0.171
1.798GlyGln: 1.798 ± 0.263
2.593GlyArg: 2.593 ± 0.325
5.239GlySer: 5.239 ± 0.969
6.501GlyThr: 6.501 ± 1.085
3.233GlyVal: 3.233 ± 0.317
0.761GlyTrp: 0.761 ± 0.147
2.887GlyTyr: 2.887 ± 0.226
0.0GlyXaa: 0.0 ± 0.0
His
1.072HisAla: 1.072 ± 0.165
0.398HisCys: 0.398 ± 0.097
1.037HisAsp: 1.037 ± 0.178
1.435HisGlu: 1.435 ± 0.163
0.674HisPhe: 0.674 ± 0.099
1.21HisGly: 1.21 ± 0.165
0.398HisHis: 0.398 ± 0.085
1.228HisIle: 1.228 ± 0.178
1.573HisLys: 1.573 ± 0.19
1.349HisLeu: 1.349 ± 0.181
0.363HisMet: 0.363 ± 0.075
1.003HisAsn: 1.003 ± 0.138
0.83HisPro: 0.83 ± 0.11
0.605HisGln: 0.605 ± 0.106
0.726HisArg: 0.726 ± 0.132
0.899HisSer: 0.899 ± 0.102
1.21HisThr: 1.21 ± 0.189
1.47HisVal: 1.47 ± 0.159
0.19HisTrp: 0.19 ± 0.064
0.968HisTyr: 0.968 ± 0.13
0.0HisXaa: 0.0 ± 0.0
Ile
3.579IleAla: 3.579 ± 0.213
0.934IleCys: 0.934 ± 0.161
5.014IleAsp: 5.014 ± 0.384
5.049IleGlu: 5.049 ± 0.316
2.663IlePhe: 2.663 ± 0.282
4.478IleGly: 4.478 ± 0.395
1.798IleHis: 1.798 ± 0.225
5.135IleIle: 5.135 ± 0.419
5.498IleLys: 5.498 ± 0.399
5.36IleLeu: 5.36 ± 0.366
1.694IleMet: 1.694 ± 0.224
4.063IleAsn: 4.063 ± 0.336
2.922IlePro: 2.922 ± 0.3
2.455IleGln: 2.455 ± 0.196
2.697IleArg: 2.697 ± 0.232
4.979IleSer: 4.979 ± 0.294
4.53IleThr: 4.53 ± 0.477
3.804IleVal: 3.804 ± 0.362
0.415IleTrp: 0.415 ± 0.078
3.095IleTyr: 3.095 ± 0.235
0.0IleXaa: 0.0 ± 0.0
Lys
3.302LysAla: 3.302 ± 0.421
1.383LysCys: 1.383 ± 0.219
4.236LysAsp: 4.236 ± 0.329
4.686LysGlu: 4.686 ± 0.681
3.095LysPhe: 3.095 ± 0.258
3.925LysGly: 3.925 ± 0.332
1.539LysHis: 1.539 ± 0.228
5.809LysIle: 5.809 ± 0.429
7.867LysLys: 7.867 ± 0.934
6.674LysLeu: 6.674 ± 0.356
2.593LysMet: 2.593 ± 0.285
5.515LysAsn: 5.515 ± 0.642
3.026LysPro: 3.026 ± 0.307
2.784LysGln: 2.784 ± 0.274
4.443LysArg: 4.443 ± 0.429
4.564LysSer: 4.564 ± 0.35
5.394LysThr: 5.394 ± 0.515
4.219LysVal: 4.219 ± 0.326
0.778LysTrp: 0.778 ± 0.125
3.942LysTyr: 3.942 ± 0.302
0.0LysXaa: 0.0 ± 0.0
Leu
3.786LeuAla: 3.786 ± 0.28
1.141LeuCys: 1.141 ± 0.147
4.582LeuAsp: 4.582 ± 0.327
4.634LeuGlu: 4.634 ± 0.39
2.939LeuPhe: 2.939 ± 0.325
4.08LeuGly: 4.08 ± 0.255
1.418LeuHis: 1.418 ± 0.171
4.841LeuIle: 4.841 ± 0.337
6.778LeuLys: 6.778 ± 0.545
5.878LeuLeu: 5.878 ± 0.521
1.608LeuMet: 1.608 ± 0.185
4.789LeuAsn: 4.789 ± 0.437
2.732LeuPro: 2.732 ± 0.243
2.386LeuGln: 2.386 ± 0.204
3.769LeuArg: 3.769 ± 0.323
5.965LeuSer: 5.965 ± 0.33
5.083LeuThr: 5.083 ± 0.382
4.461LeuVal: 4.461 ± 0.328
0.778LeuTrp: 0.778 ± 0.11
3.129LeuTyr: 3.129 ± 0.224
0.0LeuXaa: 0.0 ± 0.0
Met
1.21MetAla: 1.21 ± 0.155
0.432MetCys: 0.432 ± 0.089
1.504MetAsp: 1.504 ± 0.197
1.677MetGlu: 1.677 ± 0.198
1.366MetPhe: 1.366 ± 0.177
1.452MetGly: 1.452 ± 0.151
0.484MetHis: 0.484 ± 0.092
1.66MetIle: 1.66 ± 0.185
2.334MetLys: 2.334 ± 0.246
1.694MetLeu: 1.694 ± 0.204
0.726MetMet: 0.726 ± 0.158
1.971MetAsn: 1.971 ± 0.228
0.709MetPro: 0.709 ± 0.13
0.726MetGln: 0.726 ± 0.122
1.072MetArg: 1.072 ± 0.149
2.196MetSer: 2.196 ± 0.221
1.4MetThr: 1.4 ± 0.184
1.418MetVal: 1.418 ± 0.2
0.363MetTrp: 0.363 ± 0.082
1.539MetTyr: 1.539 ± 0.178
0.0MetXaa: 0.0 ± 0.0
Asn
3.199AsnAla: 3.199 ± 0.343
0.657AsnCys: 0.657 ± 0.131
3.112AsnAsp: 3.112 ± 0.235
3.337AsnGlu: 3.337 ± 0.35
2.853AsnPhe: 2.853 ± 0.254
4.34AsnGly: 4.34 ± 0.361
1.107AsnHis: 1.107 ± 0.129
4.634AsnIle: 4.634 ± 0.43
5.17AsnLys: 5.17 ± 0.461
4.374AsnLeu: 4.374 ± 0.38
1.85AsnMet: 1.85 ± 0.202
4.15AsnAsn: 4.15 ± 0.73
2.697AsnPro: 2.697 ± 0.227
1.885AsnGln: 1.885 ± 0.191
2.611AsnArg: 2.611 ± 0.4
3.925AsnSer: 3.925 ± 0.43
4.668AsnThr: 4.668 ± 0.43
5.74AsnVal: 5.74 ± 0.852
0.847AsnTrp: 0.847 ± 0.103
2.697AsnTyr: 2.697 ± 0.234
0.0AsnXaa: 0.0 ± 0.0
Pro
1.677ProAla: 1.677 ± 0.263
0.571ProCys: 0.571 ± 0.101
2.109ProAsp: 2.109 ± 0.218
2.991ProGlu: 2.991 ± 0.3
1.902ProPhe: 1.902 ± 0.217
2.939ProGly: 2.939 ± 0.252
0.622ProHis: 0.622 ± 0.115
2.524ProIle: 2.524 ± 0.248
2.68ProLys: 2.68 ± 0.276
2.196ProLeu: 2.196 ± 0.222
0.951ProMet: 0.951 ± 0.143
1.988ProAsn: 1.988 ± 0.198
2.334ProPro: 2.334 ± 0.336
1.556ProGln: 1.556 ± 0.212
1.643ProArg: 1.643 ± 0.203
2.628ProSer: 2.628 ± 0.228
2.887ProThr: 2.887 ± 0.355
2.472ProVal: 2.472 ± 0.263
0.225ProTrp: 0.225 ± 0.064
1.487ProTyr: 1.487 ± 0.161
0.0ProXaa: 0.0 ± 0.0
Gln
1.591GlnAla: 1.591 ± 0.168
0.38GlnCys: 0.38 ± 0.074
1.643GlnAsp: 1.643 ± 0.188
1.677GlnGlu: 1.677 ± 0.17
1.47GlnPhe: 1.47 ± 0.145
1.487GlnGly: 1.487 ± 0.191
0.519GlnHis: 0.519 ± 0.082
2.161GlnIle: 2.161 ± 0.196
2.49GlnLys: 2.49 ± 0.287
1.936GlnLeu: 1.936 ± 0.212
0.882GlnMet: 0.882 ± 0.112
1.764GlnAsn: 1.764 ± 0.21
1.573GlnPro: 1.573 ± 0.213
1.383GlnGln: 1.383 ± 0.186
1.297GlnArg: 1.297 ± 0.26
2.04GlnSer: 2.04 ± 0.18
2.265GlnThr: 2.265 ± 0.199
1.971GlnVal: 1.971 ± 0.247
0.398GlnTrp: 0.398 ± 0.089
1.487GlnTyr: 1.487 ± 0.161
0.0GlnXaa: 0.0 ± 0.0
Arg
1.781ArgAla: 1.781 ± 0.177
0.519ArgCys: 0.519 ± 0.116
2.369ArgAsp: 2.369 ± 0.2
3.302ArgGlu: 3.302 ± 0.377
1.625ArgPhe: 1.625 ± 0.159
2.006ArgGly: 2.006 ± 0.164
1.037ArgHis: 1.037 ± 0.163
3.078ArgIle: 3.078 ± 0.268
3.856ArgLys: 3.856 ± 0.446
3.32ArgLeu: 3.32 ± 0.464
1.228ArgMet: 1.228 ± 0.182
2.922ArgAsn: 2.922 ± 0.299
1.764ArgPro: 1.764 ± 0.234
1.279ArgGln: 1.279 ± 0.173
1.919ArgArg: 1.919 ± 0.243
2.196ArgSer: 2.196 ± 0.196
2.455ArgThr: 2.455 ± 0.241
2.836ArgVal: 2.836 ± 0.28
0.519ArgTrp: 0.519 ± 0.098
1.781ArgTyr: 1.781 ± 0.175
0.0ArgXaa: 0.0 ± 0.0
Ser
3.302SerAla: 3.302 ± 0.408
0.743SerCys: 0.743 ± 0.102
4.893SerAsp: 4.893 ± 0.643
4.115SerGlu: 4.115 ± 0.263
2.282SerPhe: 2.282 ± 0.2
7.123SerGly: 7.123 ± 1.201
0.986SerHis: 0.986 ± 0.129
4.841SerIle: 4.841 ± 0.267
4.997SerLys: 4.997 ± 0.446
4.789SerLeu: 4.789 ± 0.299
1.712SerMet: 1.712 ± 0.188
5.239SerAsn: 5.239 ± 0.754
2.109SerPro: 2.109 ± 0.228
1.988SerGln: 1.988 ± 0.155
2.663SerArg: 2.663 ± 0.221
5.93SerSer: 5.93 ± 0.671
4.979SerThr: 4.979 ± 0.568
4.409SerVal: 4.409 ± 0.551
0.778SerTrp: 0.778 ± 0.182
2.507SerTyr: 2.507 ± 0.227
0.0SerXaa: 0.0 ± 0.0
Thr
3.544ThrAla: 3.544 ± 0.552
1.055ThrCys: 1.055 ± 0.183
3.735ThrAsp: 3.735 ± 0.34
3.648ThrGlu: 3.648 ± 0.262
3.389ThrPhe: 3.389 ± 0.269
5.619ThrGly: 5.619 ± 0.667
1.262ThrHis: 1.262 ± 0.146
4.858ThrIle: 4.858 ± 0.372
4.72ThrLys: 4.72 ± 0.402
6.484ThrLeu: 6.484 ± 0.491
1.521ThrMet: 1.521 ± 0.176
4.807ThrAsn: 4.807 ± 0.48
3.268ThrPro: 3.268 ± 0.253
2.386ThrGln: 2.386 ± 0.229
2.766ThrArg: 2.766 ± 0.243
5.671ThrSer: 5.671 ± 0.613
5.878ThrThr: 5.878 ± 0.895
4.686ThrVal: 4.686 ± 0.514
0.813ThrTrp: 0.813 ± 0.15
3.199ThrTyr: 3.199 ± 0.474
0.0ThrXaa: 0.0 ± 0.0
Val
2.818ValAla: 2.818 ± 0.418
1.003ValCys: 1.003 ± 0.149
3.147ValAsp: 3.147 ± 0.238
3.268ValGlu: 3.268 ± 0.291
2.282ValPhe: 2.282 ± 0.193
3.562ValGly: 3.562 ± 0.431
1.245ValHis: 1.245 ± 0.149
3.907ValIle: 3.907 ± 0.389
4.599ValLys: 4.599 ± 0.381
4.893ValLeu: 4.893 ± 0.345
1.349ValMet: 1.349 ± 0.175
3.475ValAsn: 3.475 ± 0.302
2.697ValPro: 2.697 ± 0.264
1.936ValGln: 1.936 ± 0.207
2.438ValArg: 2.438 ± 0.266
5.619ValSer: 5.619 ± 0.509
4.893ValThr: 4.893 ± 0.434
3.337ValVal: 3.337 ± 0.26
0.674ValTrp: 0.674 ± 0.094
3.216ValTyr: 3.216 ± 0.289
0.0ValXaa: 0.0 ± 0.0
Trp
0.45TrpAla: 0.45 ± 0.081
0.242TrpCys: 0.242 ± 0.068
0.588TrpAsp: 0.588 ± 0.088
0.605TrpGlu: 0.605 ± 0.12
0.519TrpPhe: 0.519 ± 0.086
1.003TrpGly: 1.003 ± 0.191
0.138TrpHis: 0.138 ± 0.045
0.622TrpIle: 0.622 ± 0.106
0.899TrpLys: 0.899 ± 0.137
0.726TrpLeu: 0.726 ± 0.103
0.207TrpMet: 0.207 ± 0.068
0.795TrpAsn: 0.795 ± 0.13
0.277TrpPro: 0.277 ± 0.068
0.259TrpGln: 0.259 ± 0.067
0.329TrpArg: 0.329 ± 0.074
1.003TrpSer: 1.003 ± 0.216
0.899TrpThr: 0.899 ± 0.19
0.519TrpVal: 0.519 ± 0.094
0.121TrpTrp: 0.121 ± 0.042
0.398TrpTyr: 0.398 ± 0.092
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.49TyrAla: 2.49 ± 0.36
0.588TyrCys: 0.588 ± 0.124
2.939TyrAsp: 2.939 ± 0.262
3.181TyrGlu: 3.181 ± 0.316
1.885TyrPhe: 1.885 ± 0.141
3.129TyrGly: 3.129 ± 0.315
0.864TyrHis: 0.864 ± 0.121
3.579TyrIle: 3.579 ± 0.436
3.631TyrLys: 3.631 ± 0.243
3.06TyrLeu: 3.06 ± 0.266
1.072TyrMet: 1.072 ± 0.135
3.043TyrAsn: 3.043 ± 0.274
1.331TyrPro: 1.331 ± 0.136
1.089TyrGln: 1.089 ± 0.126
1.625TyrArg: 1.625 ± 0.219
2.887TyrSer: 2.887 ± 0.259
3.423TyrThr: 3.423 ± 0.269
3.043TyrVal: 3.043 ± 0.229
0.415TyrTrp: 0.415 ± 0.085
1.833TyrTyr: 1.833 ± 0.197
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 210 proteins (57839 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski