Amino acid dipeptide frequency for Operophtera brumata reovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
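As an illustration of what such a calculation could look like, here is a minimal Python sketch. It is not the code used to build this page: the function name `dipeptide_permille`, the use of one-letter codes, the mapping of every non-standard letter to X (Xaa), and the choice to count overlapping pairs that never span two proteins are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"           # X stands in for Xaa (assumption of this sketch)


def dipeptide_permille(proteins):
    """Return the frequency of all 441 dipeptides in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping adjacent pairs; pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    keys = [a + b for a, b in product(ALPHABET, repeat=2)]
    if total == 0:
        return {k: 0.0 for k in keys}
    return {k: 1000.0 * counts[k] / total for k in keys}


if __name__ == "__main__":
    freqs = dipeptide_permille(["MALLK", "LLA"])
    print(freqs["LL"], freqs["AL"], freqs["WW"])
```

In the toy input, the six pairs are MA, AL, LL, LK, LL and LA, so LL accounts for two of the six dipeptides and all frequencies sum to 1000‰.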

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
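The comparison against the random expectation can be sketched in a few lines of Python. The table values are given in per mille, so the 0.25% baseline corresponds to 2.5‰. The names `EXPECTED_PERMILLE`, `classify` and `enrichment` are hypothetical, and the page does not say whether the colouring uses a tolerance or the error estimate, so this sketch assumes a plain greater-than or less-than comparison.

```python
# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Return the colour a value would get under a plain comparison (assumption)."""
    if observed_permille > expected:
        return "red"    # more frequent than expected
    if observed_permille < expected:
        return "blue"   # underrepresented
    return "none"       # exactly at the expectation


def enrichment(observed_permille, expected=EXPECTED_PERMILLE):
    """Ratio of the observed frequency to the uniform expectation."""
    return observed_permille / expected


if __name__ == "__main__":
    # Values taken from the table: LeuLeu, AlaAla and CysCys.
    for name, value in [("LeuLeu", 10.886), ("AlaAla", 2.532), ("CysCys", 0.127)]:
        print(name, classify(value), round(enrichment(value), 2))
```

With the table values, LeuLeu (10.886‰) is more than four times the uniform expectation, AlaAla (2.532‰) is only marginally above it, and CysCys (0.127‰) is far below it.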

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by the first residue and listed as dipeptide: frequency ± error (‰).
Ala
AlaAla: 2.532 ± 0.673
AlaCys: 0.633 ± 0.21
AlaAsp: 3.797 ± 0.57
AlaGlu: 2.405 ± 0.62
AlaPhe: 3.165 ± 0.442
AlaGly: 2.278 ± 0.435
AlaHis: 1.519 ± 0.401
AlaIle: 3.924 ± 0.434
AlaLys: 1.392 ± 0.312
AlaLeu: 6.329 ± 0.536
AlaMet: 2.025 ± 0.573
AlaAsn: 2.152 ± 0.461
AlaPro: 2.911 ± 0.441
AlaGln: 2.405 ± 0.469
AlaArg: 4.051 ± 0.48
AlaSer: 2.278 ± 0.433
AlaThr: 3.797 ± 0.729
AlaVal: 3.671 ± 0.482
AlaTrp: 0.38 ± 0.215
AlaTyr: 2.405 ± 0.814
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.759 ± 0.281
CysCys: 0.127 ± 0.13
CysAsp: 0.253 ± 0.15
CysGlu: 0.759 ± 0.274
CysPhe: 0.506 ± 0.188
CysGly: 0.506 ± 0.265
CysHis: 0.253 ± 0.139
CysIle: 0.38 ± 0.253
CysLys: 0.759 ± 0.241
CysLeu: 1.139 ± 0.248
CysMet: 0.127 ± 0.115
CysAsn: 0.253 ± 0.157
CysPro: 0.506 ± 0.209
CysGln: 0.253 ± 0.147
CysArg: 0.633 ± 0.272
CysSer: 0.253 ± 0.158
CysThr: 0.253 ± 0.178
CysVal: 0.886 ± 0.355
CysTrp: 0.0 ± 0.0
CysTyr: 0.38 ± 0.258
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.291 ± 0.573
AspCys: 0.506 ± 0.208
AspAsp: 3.797 ± 0.756
AspGlu: 5.316 ± 0.738
AspPhe: 2.405 ± 0.535
AspGly: 2.532 ± 0.454
AspHis: 1.519 ± 0.445
AspIle: 3.671 ± 0.661
AspLys: 1.139 ± 0.389
AspLeu: 6.456 ± 0.959
AspMet: 0.886 ± 0.349
AspAsn: 1.266 ± 0.182
AspPro: 1.646 ± 0.445
AspGln: 1.899 ± 0.348
AspArg: 3.418 ± 0.727
AspSer: 4.43 ± 0.7
AspThr: 3.797 ± 0.912
AspVal: 5.316 ± 0.475
AspTrp: 1.266 ± 0.393
AspTyr: 1.392 ± 0.454
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.924 ± 0.674
GluCys: 0.886 ± 0.411
GluAsp: 3.924 ± 0.443
GluGlu: 1.646 ± 0.347
GluPhe: 3.418 ± 0.76
GluGly: 2.278 ± 0.536
GluHis: 1.266 ± 0.552
GluIle: 4.304 ± 1.035
GluLys: 2.532 ± 0.446
GluLeu: 6.456 ± 0.698
GluMet: 1.392 ± 0.319
GluAsn: 2.405 ± 0.463
GluPro: 1.899 ± 0.365
GluGln: 2.025 ± 0.524
GluArg: 3.924 ± 0.947
GluSer: 3.165 ± 0.461
GluThr: 3.038 ± 0.338
GluVal: 3.038 ± 0.637
GluTrp: 0.759 ± 0.316
GluTyr: 2.025 ± 0.365
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.646 ± 0.334
PheCys: 0.633 ± 0.181
PheAsp: 3.924 ± 0.351
PheGlu: 3.038 ± 0.448
PhePhe: 1.139 ± 0.292
PheGly: 3.544 ± 0.772
PheHis: 1.392 ± 0.335
PheIle: 2.532 ± 0.622
PheLys: 2.025 ± 0.522
PheLeu: 4.304 ± 0.452
PheMet: 1.013 ± 0.384
PheAsn: 2.025 ± 0.367
PhePro: 2.278 ± 0.519
PheGln: 2.911 ± 0.504
PheArg: 3.165 ± 0.671
PheSer: 4.81 ± 0.625
PheThr: 4.557 ± 0.819
PheVal: 2.278 ± 0.364
PheTrp: 0.759 ± 0.24
PheTyr: 1.519 ± 0.387
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.658 ± 0.503
GlyCys: 0.506 ± 0.302
GlyAsp: 3.418 ± 0.496
GlyGlu: 2.658 ± 0.691
GlyPhe: 1.772 ± 0.311
GlyGly: 2.152 ± 0.493
GlyHis: 1.013 ± 0.313
GlyIle: 4.177 ± 0.839
GlyLys: 2.658 ± 0.51
GlyLeu: 5.316 ± 0.892
GlyMet: 2.152 ± 0.37
GlyAsn: 1.646 ± 0.304
GlyPro: 1.646 ± 0.463
GlyGln: 2.025 ± 0.596
GlyArg: 2.658 ± 0.571
GlySer: 4.177 ± 0.467
GlyThr: 3.418 ± 0.424
GlyVal: 2.911 ± 0.409
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.038 ± 0.591
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.139 ± 0.323
HisCys: 0.127 ± 0.1
HisAsp: 1.392 ± 0.395
HisGlu: 2.025 ± 0.326
HisPhe: 1.899 ± 0.431
HisGly: 1.013 ± 0.294
HisHis: 0.38 ± 0.215
HisIle: 0.759 ± 0.188
HisLys: 0.633 ± 0.267
HisLeu: 3.797 ± 0.622
HisMet: 0.886 ± 0.211
HisAsn: 0.506 ± 0.31
HisPro: 1.646 ± 0.473
HisGln: 0.633 ± 0.273
HisArg: 1.013 ± 0.272
HisSer: 1.646 ± 0.422
HisThr: 2.152 ± 0.499
HisVal: 2.152 ± 0.359
HisTrp: 0.127 ± 0.102
HisTyr: 1.013 ± 0.306
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.177 ± 0.882
IleCys: 0.38 ± 0.194
IleAsp: 3.544 ± 0.545
IleGlu: 4.81 ± 0.594
IlePhe: 3.165 ± 0.637
IleGly: 4.684 ± 0.664
IleHis: 1.139 ± 0.404
IleIle: 3.038 ± 0.527
IleLys: 1.899 ± 0.771
IleLeu: 4.81 ± 1.017
IleMet: 1.266 ± 0.466
IleAsn: 3.165 ± 0.782
IlePro: 3.671 ± 0.374
IleGln: 3.038 ± 0.693
IleArg: 4.051 ± 0.523
IleSer: 5.949 ± 0.812
IleThr: 4.43 ± 0.576
IleVal: 4.304 ± 0.577
IleTrp: 0.633 ± 0.273
IleTyr: 1.899 ± 0.451
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.025 ± 0.49
LysCys: 0.506 ± 0.196
LysAsp: 2.025 ± 0.4
LysGlu: 1.899 ± 0.505
LysPhe: 2.278 ± 0.566
LysGly: 1.139 ± 0.377
LysHis: 1.013 ± 0.403
LysIle: 2.532 ± 0.49
LysLys: 1.139 ± 0.262
LysLeu: 4.177 ± 1.116
LysMet: 0.886 ± 0.313
LysAsn: 0.886 ± 0.313
LysPro: 1.392 ± 0.298
LysGln: 1.899 ± 0.516
LysArg: 2.911 ± 0.284
LysSer: 1.899 ± 0.526
LysThr: 2.911 ± 0.598
LysVal: 3.671 ± 0.369
LysTrp: 0.38 ± 0.186
LysTyr: 1.646 ± 0.56
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.076 ± 0.495
LeuCys: 0.759 ± 0.303
LeuAsp: 4.177 ± 0.718
LeuGlu: 4.81 ± 0.55
LeuPhe: 5.949 ± 0.723
LeuGly: 5.696 ± 0.732
LeuHis: 2.911 ± 0.554
LeuIle: 6.456 ± 0.677
LeuLys: 3.544 ± 0.305
LeuLeu: 10.886 ± 0.807
LeuMet: 1.772 ± 0.289
LeuAsn: 5.443 ± 0.866
LeuPro: 4.937 ± 0.638
LeuGln: 5.316 ± 0.821
LeuArg: 6.962 ± 0.803
LeuSer: 8.987 ± 1.093
LeuThr: 10.886 ± 1.402
LeuVal: 6.203 ± 0.651
LeuTrp: 1.013 ± 0.248
LeuTyr: 2.658 ± 0.441
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.772 ± 0.485
MetCys: 0.127 ± 0.119
MetAsp: 0.886 ± 0.29
MetGlu: 0.886 ± 0.371
MetPhe: 1.646 ± 0.37
MetGly: 1.139 ± 0.3
MetHis: 0.38 ± 0.163
MetIle: 2.278 ± 0.338
MetLys: 0.759 ± 0.299
MetLeu: 2.911 ± 0.486
MetMet: 0.759 ± 0.309
MetAsn: 0.886 ± 0.289
MetPro: 0.886 ± 0.247
MetGln: 0.633 ± 0.334
MetArg: 1.646 ± 0.598
MetSer: 3.165 ± 0.507
MetThr: 1.772 ± 0.708
MetVal: 2.025 ± 0.53
MetTrp: 0.127 ± 0.1
MetTyr: 1.013 ± 0.283
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.532 ± 0.383
AsnCys: 0.38 ± 0.185
AsnAsp: 2.658 ± 0.366
AsnGlu: 2.278 ± 0.438
AsnPhe: 1.646 ± 0.404
AsnGly: 2.025 ± 0.37
AsnHis: 0.633 ± 0.137
AsnIle: 2.152 ± 0.411
AsnLys: 1.139 ± 0.297
AsnLeu: 3.418 ± 0.604
AsnMet: 0.886 ± 0.279
AsnAsn: 2.405 ± 0.489
AsnPro: 2.911 ± 0.635
AsnGln: 2.785 ± 0.472
AsnArg: 1.899 ± 0.375
AsnSer: 3.671 ± 0.697
AsnThr: 2.278 ± 0.565
AsnVal: 5.063 ± 0.625
AsnTrp: 0.633 ± 0.19
AsnTyr: 1.646 ± 0.446
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.785 ± 0.437
ProCys: 0.633 ± 0.366
ProAsp: 2.278 ± 0.466
ProGlu: 2.785 ± 0.579
ProPhe: 2.785 ± 0.565
ProGly: 1.899 ± 0.59
ProHis: 1.013 ± 0.44
ProIle: 4.177 ± 0.706
ProLys: 2.278 ± 0.465
ProLeu: 4.051 ± 0.714
ProMet: 1.139 ± 0.261
ProAsn: 2.911 ± 0.582
ProPro: 2.152 ± 0.646
ProGln: 1.139 ± 0.326
ProArg: 2.278 ± 0.202
ProSer: 4.304 ± 0.575
ProThr: 4.43 ± 0.754
ProVal: 2.785 ± 0.593
ProTrp: 0.633 ± 0.157
ProTyr: 1.266 ± 0.346
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.785 ± 0.579
GlnCys: 0.38 ± 0.195
GlnAsp: 1.266 ± 0.309
GlnGlu: 1.519 ± 0.445
GlnPhe: 2.532 ± 0.523
GlnGly: 1.772 ± 0.324
GlnHis: 1.013 ± 0.449
GlnIle: 2.785 ± 0.418
GlnLys: 1.266 ± 0.303
GlnLeu: 6.203 ± 0.811
GlnMet: 1.772 ± 0.365
GlnAsn: 0.886 ± 0.257
GlnPro: 3.291 ± 0.553
GlnGln: 2.152 ± 0.708
GlnArg: 3.671 ± 0.716
GlnSer: 3.165 ± 0.73
GlnThr: 3.418 ± 0.657
GlnVal: 2.278 ± 0.713
GlnTrp: 0.253 ± 0.151
GlnTyr: 1.139 ± 0.287
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.418 ± 0.357
ArgCys: 0.506 ± 0.283
ArgAsp: 3.291 ± 0.63
ArgGlu: 3.544 ± 0.556
ArgPhe: 2.785 ± 0.494
ArgGly: 2.785 ± 0.733
ArgHis: 1.772 ± 0.547
ArgIle: 5.063 ± 0.772
ArgLys: 1.646 ± 0.406
ArgLeu: 7.342 ± 0.519
ArgMet: 1.772 ± 0.258
ArgAsn: 3.038 ± 0.714
ArgPro: 4.051 ± 0.935
ArgGln: 3.291 ± 0.45
ArgArg: 5.823 ± 0.57
ArgSer: 4.684 ± 0.786
ArgThr: 3.671 ± 0.769
ArgVal: 5.063 ± 0.491
ArgTrp: 0.633 ± 0.204
ArgTyr: 2.658 ± 0.489
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.165 ± 0.537
SerCys: 0.633 ± 0.405
SerAsp: 4.937 ± 0.899
SerGlu: 4.051 ± 0.541
SerPhe: 4.051 ± 0.438
SerGly: 4.81 ± 0.919
SerHis: 1.519 ± 0.445
SerIle: 3.797 ± 0.543
SerLys: 3.797 ± 0.666
SerLeu: 7.975 ± 1.085
SerMet: 1.899 ± 0.278
SerAsn: 3.165 ± 0.603
SerPro: 2.785 ± 0.328
SerGln: 3.038 ± 0.478
SerArg: 3.797 ± 0.454
SerSer: 5.316 ± 0.872
SerThr: 6.203 ± 0.46
SerVal: 6.962 ± 1.061
SerTrp: 0.759 ± 0.249
SerTyr: 2.785 ± 0.699
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.544 ± 0.602
ThrCys: 0.253 ± 0.155
ThrAsp: 3.418 ± 0.485
ThrGlu: 3.165 ± 0.508
ThrPhe: 3.671 ± 0.685
ThrGly: 2.658 ± 0.538
ThrHis: 2.532 ± 0.515
ThrIle: 5.19 ± 0.829
ThrLys: 4.051 ± 0.475
ThrLeu: 8.608 ± 0.831
ThrMet: 1.392 ± 0.494
ThrAsn: 3.924 ± 0.544
ThrPro: 4.937 ± 0.87
ThrGln: 2.911 ± 0.392
ThrArg: 6.456 ± 0.558
ThrSer: 6.709 ± 0.767
ThrThr: 6.203 ± 0.928
ThrVal: 4.937 ± 0.998
ThrTrp: 0.759 ± 0.3
ThrTyr: 2.405 ± 0.607
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.165 ± 0.603
ValCys: 0.253 ± 0.232
ValAsp: 4.051 ± 0.567
ValGlu: 3.671 ± 0.579
ValPhe: 2.532 ± 0.724
ValGly: 4.304 ± 1.036
ValHis: 2.025 ± 0.319
ValIle: 3.924 ± 0.678
ValLys: 3.165 ± 0.639
ValLeu: 7.722 ± 0.974
ValMet: 1.899 ± 0.285
ValAsn: 4.43 ± 0.588
ValPro: 2.278 ± 0.402
ValGln: 3.038 ± 0.564
ValArg: 5.57 ± 0.741
ValSer: 4.684 ± 0.536
ValThr: 7.215 ± 1.244
ValVal: 4.43 ± 0.522
ValTrp: 0.506 ± 0.177
ValTyr: 2.152 ± 0.5
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.633 ± 0.246
TrpCys: 0.127 ± 0.119
TrpAsp: 0.506 ± 0.158
TrpGlu: 0.253 ± 0.151
TrpPhe: 0.633 ± 0.325
TrpGly: 0.253 ± 0.207
TrpHis: 0.38 ± 0.221
TrpIle: 0.886 ± 0.197
TrpLys: 0.886 ± 0.251
TrpLeu: 0.759 ± 0.258
TrpMet: 0.38 ± 0.179
TrpAsn: 0.38 ± 0.175
TrpPro: 0.633 ± 0.165
TrpGln: 0.253 ± 0.147
TrpArg: 0.506 ± 0.205
TrpSer: 0.633 ± 0.424
TrpThr: 0.633 ± 0.165
TrpVal: 0.253 ± 0.146
TrpTrp: 0.127 ± 0.13
TrpTyr: 0.759 ± 0.305
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.025 ± 0.422
TyrCys: 0.633 ± 0.233
TyrAsp: 2.405 ± 0.413
TyrGlu: 2.911 ± 0.544
TyrPhe: 1.519 ± 0.395
TyrGly: 2.532 ± 0.652
TyrHis: 1.266 ± 0.427
TyrIle: 2.025 ± 0.575
TyrLys: 0.506 ± 0.215
TyrLeu: 2.785 ± 0.596
TyrMet: 1.266 ± 0.334
TyrAsn: 1.266 ± 0.295
TyrPro: 1.266 ± 0.363
TyrGln: 1.899 ± 0.464
TyrArg: 2.405 ± 0.58
TyrSer: 1.519 ± 0.371
TyrThr: 2.658 ± 0.624
TyrVal: 2.785 ± 0.494
TyrTrp: 0.127 ± 0.1
TyrTyr: 0.886 ± 0.227
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (7901 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
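A protein-level bootstrap can be sketched as follows. This is an illustration under stated assumptions, not the procedure used for this page: the sketch assumes that whole proteins are resampled with replacement, that the dipeptide frequency is recomputed on each resample, and that the reported error is the standard deviation of those estimates. The function name `bootstrap_error` and its parameters are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation (per mille) of one dipeptide's frequency over bootstrap resamples."""
    rng = random.Random(seed)
    # Count overlapping adjacent pairs once per protein; whole proteins are the resampling unit.
    per_protein = [Counter(a + b for a, b in zip(seq, seq[1:])) for seq in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MALLK", "LLALL", "AKLLM", "MKTAY"]
    print(bootstrap_error(toy_proteome, "LL"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins; with only 10 proteins in this proteome, such an estimate is necessarily coarse.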

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski