Amino acid dipepetide frequency for Artogeia rapae granulovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.779AlaAla: 1.779 ± 0.301
0.663AlaCys: 0.663 ± 0.12
2.713AlaAsp: 2.713 ± 0.323
1.869AlaGlu: 1.869 ± 0.299
1.96AlaPhe: 1.96 ± 0.264
1.718AlaGly: 1.718 ± 0.247
0.512AlaHis: 0.512 ± 0.126
3.165AlaIle: 3.165 ± 0.335
2.472AlaLys: 2.472 ± 0.294
2.985AlaLeu: 2.985 ± 0.25
1.025AlaMet: 1.025 ± 0.186
3.376AlaAsn: 3.376 ± 0.348
1.779AlaPro: 1.779 ± 0.244
1.628AlaGln: 1.628 ± 0.193
1.146AlaArg: 1.146 ± 0.175
2.02AlaSer: 2.02 ± 0.209
2.171AlaThr: 2.171 ± 0.284
2.653AlaVal: 2.653 ± 0.286
0.301AlaTrp: 0.301 ± 0.095
1.598AlaTyr: 1.598 ± 0.199
0.0AlaXaa: 0.0 ± 0.0
Cys
0.633CysAla: 0.633 ± 0.128
0.603CysCys: 0.603 ± 0.185
1.507CysAsp: 1.507 ± 0.233
1.477CysGlu: 1.477 ± 0.23
1.387CysPhe: 1.387 ± 0.268
1.266CysGly: 1.266 ± 0.178
0.512CysHis: 0.512 ± 0.119
1.839CysIle: 1.839 ± 0.272
2.05CysLys: 2.05 ± 0.344
2.201CysLeu: 2.201 ± 0.302
0.422CysMet: 0.422 ± 0.119
2.231CysAsn: 2.231 ± 0.28
0.904CysPro: 0.904 ± 0.17
0.512CysGln: 0.512 ± 0.13
0.995CysArg: 0.995 ± 0.173
1.658CysSer: 1.658 ± 0.23
1.085CysThr: 1.085 ± 0.182
2.05CysVal: 2.05 ± 0.222
0.09CysTrp: 0.09 ± 0.044
1.266CysTyr: 1.266 ± 0.197
0.0CysXaa: 0.0 ± 0.0
Asp
2.171AspAla: 2.171 ± 0.235
1.236AspCys: 1.236 ± 0.217
4.01AspAsp: 4.01 ± 0.382
4.01AspGlu: 4.01 ± 0.342
2.291AspPhe: 2.291 ± 0.252
2.442AspGly: 2.442 ± 0.32
1.085AspHis: 1.085 ± 0.21
4.04AspIle: 4.04 ± 0.369
4.612AspLys: 4.612 ± 0.446
4.914AspLeu: 4.914 ± 0.391
1.447AspMet: 1.447 ± 0.245
5.879AspAsn: 5.879 ± 0.429
1.628AspPro: 1.628 ± 0.212
1.658AspGln: 1.658 ± 0.205
1.749AspArg: 1.749 ± 0.21
3.497AspSer: 3.497 ± 0.347
3.165AspThr: 3.165 ± 0.314
4.19AspVal: 4.19 ± 0.372
0.482AspTrp: 0.482 ± 0.12
2.683AspTyr: 2.683 ± 0.277
0.0AspXaa: 0.0 ± 0.0
Glu
2.14GluAla: 2.14 ± 0.244
1.658GluCys: 1.658 ± 0.256
2.623GluAsp: 2.623 ± 0.331
4.673GluGlu: 4.673 ± 0.53
2.502GluPhe: 2.502 ± 0.229
1.929GluGly: 1.929 ± 0.25
0.995GluHis: 0.995 ± 0.208
5.306GluIle: 5.306 ± 0.462
6.632GluLys: 6.632 ± 0.744
5.306GluLeu: 5.306 ± 0.502
1.507GluMet: 1.507 ± 0.223
6.15GluAsn: 6.15 ± 0.622
1.96GluPro: 1.96 ± 0.406
2.683GluGln: 2.683 ± 0.314
2.382GluArg: 2.382 ± 0.236
3.346GluSer: 3.346 ± 0.336
4.07GluThr: 4.07 ± 0.426
2.14GluVal: 2.14 ± 0.29
0.754GluTrp: 0.754 ± 0.167
3.135GluTyr: 3.135 ± 0.345
0.0GluXaa: 0.0 ± 0.0
Phe
1.869PheAla: 1.869 ± 0.254
1.266PheCys: 1.266 ± 0.18
3.949PheAsp: 3.949 ± 0.411
3.165PheGlu: 3.165 ± 0.265
2.623PhePhe: 2.623 ± 0.321
2.05PheGly: 2.05 ± 0.269
0.754PheHis: 0.754 ± 0.152
3.889PheIle: 3.889 ± 0.344
3.618PheLys: 3.618 ± 0.346
4.401PheLeu: 4.401 ± 0.429
0.935PheMet: 0.935 ± 0.145
4.552PheAsn: 4.552 ± 0.322
1.176PhePro: 1.176 ± 0.228
1.357PheGln: 1.357 ± 0.203
1.357PheArg: 1.357 ± 0.174
2.562PheSer: 2.562 ± 0.288
2.834PheThr: 2.834 ± 0.238
4.251PheVal: 4.251 ± 0.381
0.241PheTrp: 0.241 ± 0.086
2.894PheTyr: 2.894 ± 0.284
0.0PheXaa: 0.0 ± 0.0
Gly
2.261GlyAla: 2.261 ± 0.304
0.754GlyCys: 0.754 ± 0.163
2.653GlyAsp: 2.653 ± 0.314
2.351GlyGlu: 2.351 ± 0.291
1.779GlyPhe: 1.779 ± 0.254
2.291GlyGly: 2.291 ± 0.277
0.603GlyHis: 0.603 ± 0.138
1.809GlyIle: 1.809 ± 0.192
2.321GlyLys: 2.321 ± 0.287
3.135GlyLeu: 3.135 ± 0.339
0.814GlyMet: 0.814 ± 0.165
1.899GlyAsn: 1.899 ± 0.295
0.724GlyPro: 0.724 ± 0.153
1.146GlyGln: 1.146 ± 0.158
1.266GlyArg: 1.266 ± 0.169
2.11GlySer: 2.11 ± 0.244
1.96GlyThr: 1.96 ± 0.266
3.648GlyVal: 3.648 ± 0.435
0.332GlyTrp: 0.332 ± 0.095
1.839GlyTyr: 1.839 ± 0.223
0.0GlyXaa: 0.0 ± 0.0
His
0.754HisAla: 0.754 ± 0.145
0.362HisCys: 0.362 ± 0.113
1.055HisAsp: 1.055 ± 0.166
1.085HisGlu: 1.085 ± 0.172
1.055HisPhe: 1.055 ± 0.174
0.543HisGly: 0.543 ± 0.131
0.301HisHis: 0.301 ± 0.097
1.598HisIle: 1.598 ± 0.219
1.779HisLys: 1.779 ± 0.225
1.628HisLeu: 1.628 ± 0.218
0.211HisMet: 0.211 ± 0.077
1.899HisAsn: 1.899 ± 0.253
0.844HisPro: 0.844 ± 0.224
0.603HisGln: 0.603 ± 0.153
0.995HisArg: 0.995 ± 0.174
0.935HisSer: 0.935 ± 0.186
1.176HisThr: 1.176 ± 0.161
1.236HisVal: 1.236 ± 0.219
0.09HisTrp: 0.09 ± 0.048
1.176HisTyr: 1.176 ± 0.199
0.0HisXaa: 0.0 ± 0.0
Ile
2.894IleAla: 2.894 ± 0.294
1.99IleCys: 1.99 ± 0.237
5.065IleAsp: 5.065 ± 0.376
5.185IleGlu: 5.185 ± 0.564
3.708IlePhe: 3.708 ± 0.29
2.02IleGly: 2.02 ± 0.269
1.387IleHis: 1.387 ± 0.24
6.482IleIle: 6.482 ± 0.504
7.175IleLys: 7.175 ± 0.532
7.386IleLeu: 7.386 ± 0.543
1.929IleMet: 1.929 ± 0.268
7.627IleAsn: 7.627 ± 0.491
2.442IlePro: 2.442 ± 0.248
2.321IleGln: 2.321 ± 0.297
2.321IleArg: 2.321 ± 0.248
3.798IleSer: 3.798 ± 0.354
4.251IleThr: 4.251 ± 0.341
5.577IleVal: 5.577 ± 0.475
0.543IleTrp: 0.543 ± 0.143
2.743IleTyr: 2.743 ± 0.29
0.0IleXaa: 0.0 ± 0.0
Lys
2.08LysAla: 2.08 ± 0.289
1.929LysCys: 1.929 ± 0.291
2.864LysAsp: 2.864 ± 0.275
5.185LysGlu: 5.185 ± 0.592
3.949LysPhe: 3.949 ± 0.335
2.08LysGly: 2.08 ± 0.284
2.11LysHis: 2.11 ± 0.243
7.687LysIle: 7.687 ± 0.506
8.2LysLys: 8.2 ± 0.747
7.446LysLeu: 7.446 ± 0.48
2.442LysMet: 2.442 ± 0.289
8.712LysAsn: 8.712 ± 0.709
2.412LysPro: 2.412 ± 0.298
3.618LysGln: 3.618 ± 0.351
4.673LysArg: 4.673 ± 0.539
4.371LysSer: 4.371 ± 0.345
5.306LysThr: 5.306 ± 0.472
3.135LysVal: 3.135 ± 0.283
0.904LysTrp: 0.904 ± 0.155
4.401LysTyr: 4.401 ± 0.376
0.0LysXaa: 0.0 ± 0.0
Leu
3.165LeuAla: 3.165 ± 0.319
2.382LeuCys: 2.382 ± 0.235
4.612LeuAsp: 4.612 ± 0.316
5.668LeuGlu: 5.668 ± 0.477
4.643LeuPhe: 4.643 ± 0.422
2.743LeuGly: 2.743 ± 0.307
2.02LeuHis: 2.02 ± 0.293
7.446LeuIle: 7.446 ± 0.459
8.471LeuLys: 8.471 ± 0.554
8.923LeuLeu: 8.923 ± 0.587
2.321LeuMet: 2.321 ± 0.246
8.14LeuAsn: 8.14 ± 0.522
2.804LeuPro: 2.804 ± 0.3
4.281LeuGln: 4.281 ± 0.496
3.467LeuArg: 3.467 ± 0.311
4.884LeuSer: 4.884 ± 0.398
4.914LeuThr: 4.914 ± 0.344
5.457LeuVal: 5.457 ± 0.407
1.176LeuTrp: 1.176 ± 0.184
4.974LeuTyr: 4.974 ± 0.414
0.0LeuXaa: 0.0 ± 0.0
Met
0.935MetAla: 0.935 ± 0.162
0.663MetCys: 0.663 ± 0.135
1.357MetAsp: 1.357 ± 0.19
1.477MetGlu: 1.477 ± 0.269
1.326MetPhe: 1.326 ± 0.207
0.874MetGly: 0.874 ± 0.161
0.633MetHis: 0.633 ± 0.157
1.598MetIle: 1.598 ± 0.182
1.417MetLys: 1.417 ± 0.186
2.894MetLeu: 2.894 ± 0.274
0.573MetMet: 0.573 ± 0.133
2.02MetAsn: 2.02 ± 0.233
0.422MetPro: 0.422 ± 0.126
0.724MetGln: 0.724 ± 0.145
0.724MetArg: 0.724 ± 0.13
1.96MetSer: 1.96 ± 0.254
0.844MetThr: 0.844 ± 0.152
1.357MetVal: 1.357 ± 0.195
0.332MetTrp: 0.332 ± 0.099
1.688MetTyr: 1.688 ± 0.224
0.0MetXaa: 0.0 ± 0.0
Asn
3.407AsnAla: 3.407 ± 0.363
2.231AsnCys: 2.231 ± 0.264
5.607AsnAsp: 5.607 ± 0.484
6.18AsnGlu: 6.18 ± 0.51
4.311AsnPhe: 4.311 ± 0.361
3.618AsnGly: 3.618 ± 0.261
1.085AsnHis: 1.085 ± 0.165
7.296AsnIle: 7.296 ± 0.468
7.868AsnLys: 7.868 ± 0.555
7.205AsnLeu: 7.205 ± 0.524
2.02AsnMet: 2.02 ± 0.23
10.461AsnAsn: 10.461 ± 0.826
1.899AsnPro: 1.899 ± 0.244
2.743AsnGln: 2.743 ± 0.266
3.196AsnArg: 3.196 ± 0.285
5.366AsnSer: 5.366 ± 0.51
6.301AsnThr: 6.301 ± 0.521
6.542AsnVal: 6.542 ± 0.526
0.724AsnTrp: 0.724 ± 0.138
4.311AsnTyr: 4.311 ± 0.307
0.0AsnXaa: 0.0 ± 0.0
Pro
1.326ProAla: 1.326 ± 0.189
0.543ProCys: 0.543 ± 0.145
1.568ProAsp: 1.568 ± 0.2
1.749ProGlu: 1.749 ± 0.375
1.477ProPhe: 1.477 ± 0.196
0.904ProGly: 0.904 ± 0.214
0.814ProHis: 0.814 ± 0.137
2.562ProIle: 2.562 ± 0.297
1.779ProLys: 1.779 ± 0.27
2.924ProLeu: 2.924 ± 0.365
0.482ProMet: 0.482 ± 0.112
2.472ProAsn: 2.472 ± 0.265
1.809ProPro: 1.809 ± 0.577
0.965ProGln: 0.965 ± 0.206
0.874ProArg: 0.874 ± 0.157
2.02ProSer: 2.02 ± 0.303
2.201ProThr: 2.201 ± 0.349
2.442ProVal: 2.442 ± 0.361
0.362ProTrp: 0.362 ± 0.119
1.688ProTyr: 1.688 ± 0.215
0.0ProXaa: 0.0 ± 0.0
Gln
0.754GlnAla: 0.754 ± 0.141
1.176GlnCys: 1.176 ± 0.185
1.326GlnAsp: 1.326 ± 0.201
2.261GlnGlu: 2.261 ± 0.234
1.96GlnPhe: 1.96 ± 0.203
0.874GlnGly: 0.874 ± 0.139
1.025GlnHis: 1.025 ± 0.194
3.075GlnIle: 3.075 ± 0.27
2.804GlnLys: 2.804 ± 0.313
4.13GlnLeu: 4.13 ± 0.336
0.935GlnMet: 0.935 ± 0.144
3.015GlnAsn: 3.015 ± 0.397
0.814GlnPro: 0.814 ± 0.159
1.899GlnGln: 1.899 ± 0.267
1.025GlnArg: 1.025 ± 0.173
2.11GlnSer: 2.11 ± 0.231
2.442GlnThr: 2.442 ± 0.297
1.296GlnVal: 1.296 ± 0.186
0.332GlnTrp: 0.332 ± 0.115
1.718GlnTyr: 1.718 ± 0.217
0.0GlnXaa: 0.0 ± 0.0
Arg
1.779ArgAla: 1.779 ± 0.266
0.935ArgCys: 0.935 ± 0.18
2.11ArgAsp: 2.11 ± 0.252
2.351ArgGlu: 2.351 ± 0.245
1.899ArgPhe: 1.899 ± 0.215
1.266ArgGly: 1.266 ± 0.195
0.904ArgHis: 0.904 ± 0.186
2.351ArgIle: 2.351 ± 0.215
2.351ArgLys: 2.351 ± 0.265
4.492ArgLeu: 4.492 ± 0.424
0.814ArgMet: 0.814 ± 0.182
2.834ArgAsn: 2.834 ± 0.376
0.935ArgPro: 0.935 ± 0.183
1.688ArgGln: 1.688 ± 0.226
1.749ArgArg: 1.749 ± 0.289
1.96ArgSer: 1.96 ± 0.474
1.99ArgThr: 1.99 ± 0.254
2.623ArgVal: 2.623 ± 0.254
0.362ArgTrp: 0.362 ± 0.103
1.869ArgTyr: 1.869 ± 0.22
0.0ArgXaa: 0.0 ± 0.0
Ser
2.472SerAla: 2.472 ± 0.26
1.326SerCys: 1.326 ± 0.217
3.346SerAsp: 3.346 ± 0.279
3.256SerGlu: 3.256 ± 0.328
3.165SerPhe: 3.165 ± 0.304
2.261SerGly: 2.261 ± 0.274
1.085SerHis: 1.085 ± 0.175
3.979SerIle: 3.979 ± 0.353
4.341SerLys: 4.341 ± 0.342
6.12SerLeu: 6.12 ± 0.393
1.477SerMet: 1.477 ± 0.209
4.251SerAsn: 4.251 ± 0.335
2.08SerPro: 2.08 ± 0.265
1.688SerGln: 1.688 ± 0.234
2.412SerArg: 2.412 ± 0.458
3.738SerSer: 3.738 ± 0.402
3.256SerThr: 3.256 ± 0.377
4.221SerVal: 4.221 ± 0.434
0.362SerTrp: 0.362 ± 0.086
2.743SerTyr: 2.743 ± 0.307
0.0SerXaa: 0.0 ± 0.0
Thr
2.14ThrAla: 2.14 ± 0.24
1.176ThrCys: 1.176 ± 0.191
3.075ThrAsp: 3.075 ± 0.26
2.532ThrGlu: 2.532 ± 0.318
2.623ThrPhe: 2.623 ± 0.363
1.869ThrGly: 1.869 ± 0.26
1.296ThrHis: 1.296 ± 0.223
4.793ThrIle: 4.793 ± 0.399
5.577ThrLys: 5.577 ± 0.41
5.668ThrLeu: 5.668 ± 0.434
1.537ThrMet: 1.537 ± 0.223
5.065ThrAsn: 5.065 ± 0.404
2.713ThrPro: 2.713 ± 0.299
2.02ThrGln: 2.02 ± 0.217
2.351ThrArg: 2.351 ± 0.252
3.527ThrSer: 3.527 ± 0.397
4.221ThrThr: 4.221 ± 0.46
4.13ThrVal: 4.13 ± 0.383
0.512ThrTrp: 0.512 ± 0.129
1.839ThrTyr: 1.839 ± 0.23
0.0ThrXaa: 0.0 ± 0.0
Val
2.834ValAla: 2.834 ± 0.315
2.05ValCys: 2.05 ± 0.223
4.371ValAsp: 4.371 ± 0.373
3.738ValGlu: 3.738 ± 0.409
3.738ValPhe: 3.738 ± 0.415
2.834ValGly: 2.834 ± 0.31
1.146ValHis: 1.146 ± 0.198
3.979ValIle: 3.979 ± 0.386
4.673ValLys: 4.673 ± 0.358
5.547ValLeu: 5.547 ± 0.459
1.568ValMet: 1.568 ± 0.255
5.396ValAsn: 5.396 ± 0.451
2.11ValPro: 2.11 ± 0.305
1.96ValGln: 1.96 ± 0.249
2.442ValArg: 2.442 ± 0.273
4.281ValSer: 4.281 ± 0.418
3.437ValThr: 3.437 ± 0.32
4.673ValVal: 4.673 ± 0.492
0.573ValTrp: 0.573 ± 0.155
4.251ValTyr: 4.251 ± 0.385
0.0ValXaa: 0.0 ± 0.0
Trp
0.512TrpAla: 0.512 ± 0.134
0.241TrpCys: 0.241 ± 0.085
0.482TrpAsp: 0.482 ± 0.138
0.422TrpGlu: 0.422 ± 0.127
0.482TrpPhe: 0.482 ± 0.111
0.482TrpGly: 0.482 ± 0.136
0.181TrpHis: 0.181 ± 0.073
0.422TrpIle: 0.422 ± 0.108
0.482TrpLys: 0.482 ± 0.116
0.844TrpLeu: 0.844 ± 0.139
0.121TrpMet: 0.121 ± 0.059
1.025TrpAsn: 1.025 ± 0.176
0.332TrpPro: 0.332 ± 0.107
0.301TrpGln: 0.301 ± 0.083
0.452TrpArg: 0.452 ± 0.113
0.663TrpSer: 0.663 ± 0.161
0.422TrpThr: 0.422 ± 0.1
0.422TrpVal: 0.422 ± 0.111
0.181TrpTrp: 0.181 ± 0.079
0.663TrpTyr: 0.663 ± 0.145
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.628TyrAla: 1.628 ± 0.232
1.507TyrCys: 1.507 ± 0.201
2.894TyrAsp: 2.894 ± 0.337
3.256TyrGlu: 3.256 ± 0.267
2.864TyrPhe: 2.864 ± 0.315
1.628TyrGly: 1.628 ± 0.209
0.874TyrHis: 0.874 ± 0.138
3.527TyrIle: 3.527 ± 0.384
4.884TyrLys: 4.884 ± 0.369
4.341TyrLeu: 4.341 ± 0.344
1.206TyrMet: 1.206 ± 0.21
5.366TyrAsn: 5.366 ± 0.391
1.176TyrPro: 1.176 ± 0.198
1.206TyrGln: 1.206 ± 0.165
1.688TyrArg: 1.688 ± 0.27
2.774TyrSer: 2.774 ± 0.254
2.623TyrThr: 2.623 ± 0.278
3.557TyrVal: 3.557 ± 0.308
0.482TyrTrp: 0.482 ± 0.125
3.256TyrTyr: 3.256 ± 0.383
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 120 proteins (33172 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski