Amino acid dipepetide frequency for Mobuck virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.157AlaAla: 4.157 ± 0.833
1.919AlaCys: 1.919 ± 0.636
2.558AlaAsp: 2.558 ± 0.48
4.157AlaGlu: 4.157 ± 1.015
2.718AlaPhe: 2.718 ± 0.682
5.117AlaGly: 5.117 ± 1.254
1.119AlaHis: 1.119 ± 0.351
4.317AlaIle: 4.317 ± 0.977
3.678AlaLys: 3.678 ± 0.837
6.876AlaLeu: 6.876 ± 1.419
1.119AlaMet: 1.119 ± 0.367
2.558AlaAsn: 2.558 ± 0.699
3.038AlaPro: 3.038 ± 0.81
2.558AlaGln: 2.558 ± 0.878
5.437AlaArg: 5.437 ± 0.657
3.838AlaSer: 3.838 ± 0.66
2.558AlaThr: 2.558 ± 1.053
2.718AlaVal: 2.718 ± 0.635
1.439AlaTrp: 1.439 ± 0.329
1.599AlaTyr: 1.599 ± 0.529
0.0AlaXaa: 0.0 ± 0.0
Cys
0.32CysAla: 0.32 ± 0.235
0.16CysCys: 0.16 ± 0.173
0.64CysAsp: 0.64 ± 0.449
0.64CysGlu: 0.64 ± 0.457
0.959CysPhe: 0.959 ± 0.442
0.64CysGly: 0.64 ± 0.315
0.16CysHis: 0.16 ± 0.2
1.759CysIle: 1.759 ± 0.567
1.279CysLys: 1.279 ± 0.498
1.599CysLeu: 1.599 ± 0.612
0.799CysMet: 0.799 ± 0.48
0.32CysAsn: 0.32 ± 0.228
0.16CysPro: 0.16 ± 0.16
0.32CysGln: 0.32 ± 0.179
0.48CysArg: 0.48 ± 0.218
0.64CysSer: 0.64 ± 0.217
1.439CysThr: 1.439 ± 0.565
1.279CysVal: 1.279 ± 0.373
0.16CysTrp: 0.16 ± 0.129
1.279CysTyr: 1.279 ± 0.52
0.0CysXaa: 0.0 ± 0.0
Asp
3.838AspAla: 3.838 ± 0.743
0.32AspCys: 0.32 ± 0.163
3.838AspAsp: 3.838 ± 0.833
6.076AspGlu: 6.076 ± 1.031
1.919AspPhe: 1.919 ± 0.679
3.518AspGly: 3.518 ± 0.515
0.48AspHis: 0.48 ± 0.238
5.277AspIle: 5.277 ± 0.784
3.358AspLys: 3.358 ± 1.353
5.756AspLeu: 5.756 ± 0.689
1.279AspMet: 1.279 ± 0.451
1.119AspAsn: 1.119 ± 0.334
1.599AspPro: 1.599 ± 0.459
1.119AspGln: 1.119 ± 0.417
2.558AspArg: 2.558 ± 0.755
3.038AspSer: 3.038 ± 0.816
2.079AspThr: 2.079 ± 0.742
6.396AspVal: 6.396 ± 0.838
0.799AspTrp: 0.799 ± 0.429
2.398AspTyr: 2.398 ± 0.557
0.0AspXaa: 0.0 ± 0.0
Glu
5.117GluAla: 5.117 ± 1.258
0.959GluCys: 0.959 ± 0.376
3.838GluAsp: 3.838 ± 0.903
5.437GluGlu: 5.437 ± 1.184
2.239GluPhe: 2.239 ± 0.721
3.038GluGly: 3.038 ± 0.671
1.439GluHis: 1.439 ± 0.496
4.477GluIle: 4.477 ± 0.961
3.518GluLys: 3.518 ± 0.595
5.277GluLeu: 5.277 ± 0.691
2.878GluMet: 2.878 ± 0.533
2.878GluAsn: 2.878 ± 0.726
1.919GluPro: 1.919 ± 0.43
1.439GluGln: 1.439 ± 0.72
5.277GluArg: 5.277 ± 0.859
5.277GluSer: 5.277 ± 0.773
5.916GluThr: 5.916 ± 1.304
4.957GluVal: 4.957 ± 0.584
1.119GluTrp: 1.119 ± 0.45
2.398GluTyr: 2.398 ± 0.799
0.0GluXaa: 0.0 ± 0.0
Phe
1.439PheAla: 1.439 ± 0.796
0.799PheCys: 0.799 ± 0.373
2.558PheAsp: 2.558 ± 0.463
3.038PheGlu: 3.038 ± 0.826
1.759PhePhe: 1.759 ± 0.461
2.718PheGly: 2.718 ± 0.443
0.799PheHis: 0.799 ± 0.385
2.558PheIle: 2.558 ± 0.448
1.759PheLys: 1.759 ± 0.481
3.678PheLeu: 3.678 ± 0.686
1.119PheMet: 1.119 ± 0.311
2.398PheAsn: 2.398 ± 0.371
0.959PhePro: 0.959 ± 0.378
0.799PheGln: 0.799 ± 0.329
2.079PheArg: 2.079 ± 0.416
3.997PheSer: 3.997 ± 0.993
2.558PheThr: 2.558 ± 0.586
3.198PheVal: 3.198 ± 0.822
0.48PheTrp: 0.48 ± 0.23
0.799PheTyr: 0.799 ± 0.312
0.0PheXaa: 0.0 ± 0.0
Gly
4.157GlyAla: 4.157 ± 0.832
0.48GlyCys: 0.48 ± 0.225
4.477GlyAsp: 4.477 ± 0.998
3.358GlyGlu: 3.358 ± 0.441
3.038GlyPhe: 3.038 ± 0.669
3.678GlyGly: 3.678 ± 0.891
0.959GlyHis: 0.959 ± 0.296
3.358GlyIle: 3.358 ± 0.599
2.718GlyLys: 2.718 ± 0.753
4.477GlyLeu: 4.477 ± 1.192
1.919GlyMet: 1.919 ± 0.469
2.079GlyAsn: 2.079 ± 0.624
2.558GlyPro: 2.558 ± 0.593
2.398GlyGln: 2.398 ± 0.812
3.678GlyArg: 3.678 ± 0.895
3.678GlySer: 3.678 ± 0.521
3.358GlyThr: 3.358 ± 0.892
4.797GlyVal: 4.797 ± 0.903
0.32GlyTrp: 0.32 ± 0.225
2.878GlyTyr: 2.878 ± 0.636
0.0GlyXaa: 0.0 ± 0.0
His
1.119HisAla: 1.119 ± 0.442
0.32HisCys: 0.32 ± 0.198
1.119HisAsp: 1.119 ± 0.422
1.279HisGlu: 1.279 ± 0.476
0.959HisPhe: 0.959 ± 0.431
1.439HisGly: 1.439 ± 0.577
0.64HisHis: 0.64 ± 0.354
1.599HisIle: 1.599 ± 0.236
1.439HisLys: 1.439 ± 0.513
1.759HisLeu: 1.759 ± 0.405
0.48HisMet: 0.48 ± 0.214
1.279HisAsn: 1.279 ± 0.425
0.64HisPro: 0.64 ± 0.357
0.959HisGln: 0.959 ± 0.378
1.439HisArg: 1.439 ± 0.52
1.599HisSer: 1.599 ± 0.423
0.799HisThr: 0.799 ± 0.404
1.119HisVal: 1.119 ± 0.451
0.32HisTrp: 0.32 ± 0.179
0.959HisTyr: 0.959 ± 0.428
0.0HisXaa: 0.0 ± 0.0
Ile
4.637IleAla: 4.637 ± 0.771
1.119IleCys: 1.119 ± 0.248
4.477IleAsp: 4.477 ± 0.722
3.518IleGlu: 3.518 ± 0.584
3.198IlePhe: 3.198 ± 0.538
3.678IleGly: 3.678 ± 0.699
0.959IleHis: 0.959 ± 0.352
3.358IleIle: 3.358 ± 0.609
5.437IleLys: 5.437 ± 1.66
7.355IleLeu: 7.355 ± 0.991
1.759IleMet: 1.759 ± 0.64
3.838IleAsn: 3.838 ± 0.617
2.878IlePro: 2.878 ± 0.626
3.038IleGln: 3.038 ± 0.431
3.038IleArg: 3.038 ± 1.006
5.756IleSer: 5.756 ± 0.928
3.997IleThr: 3.997 ± 0.699
3.358IleVal: 3.358 ± 0.57
0.799IleTrp: 0.799 ± 0.446
2.239IleTyr: 2.239 ± 0.546
0.0IleXaa: 0.0 ± 0.0
Lys
3.518LysAla: 3.518 ± 0.903
0.64LysCys: 0.64 ± 0.276
2.718LysAsp: 2.718 ± 0.579
4.957LysGlu: 4.957 ± 1.528
3.038LysPhe: 3.038 ± 0.682
2.878LysGly: 2.878 ± 0.538
1.439LysHis: 1.439 ± 0.477
4.317LysIle: 4.317 ± 0.451
3.198LysLys: 3.198 ± 1.022
6.236LysLeu: 6.236 ± 1.452
1.919LysMet: 1.919 ± 1.179
3.358LysAsn: 3.358 ± 0.953
1.439LysPro: 1.439 ± 0.465
1.599LysGln: 1.599 ± 0.514
5.596LysArg: 5.596 ± 1.161
2.558LysSer: 2.558 ± 0.534
5.277LysThr: 5.277 ± 1.622
3.518LysVal: 3.518 ± 0.627
0.48LysTrp: 0.48 ± 0.243
1.599LysTyr: 1.599 ± 0.826
0.0LysXaa: 0.0 ± 0.0
Leu
5.596LeuAla: 5.596 ± 1.115
1.759LeuCys: 1.759 ± 0.585
5.437LeuAsp: 5.437 ± 1.062
5.596LeuGlu: 5.596 ± 0.809
2.239LeuPhe: 2.239 ± 0.608
3.678LeuGly: 3.678 ± 0.606
2.718LeuHis: 2.718 ± 0.673
6.236LeuIle: 6.236 ± 1.25
6.556LeuLys: 6.556 ± 0.768
7.355LeuLeu: 7.355 ± 1.126
4.317LeuMet: 4.317 ± 0.727
4.157LeuAsn: 4.157 ± 0.905
5.117LeuPro: 5.117 ± 0.97
4.157LeuGln: 4.157 ± 0.985
7.035LeuArg: 7.035 ± 0.999
5.437LeuSer: 5.437 ± 0.804
4.477LeuThr: 4.477 ± 0.725
4.637LeuVal: 4.637 ± 0.529
0.799LeuTrp: 0.799 ± 0.285
3.518LeuTyr: 3.518 ± 0.611
0.0LeuXaa: 0.0 ± 0.0
Met
2.558MetAla: 2.558 ± 0.597
0.799MetCys: 0.799 ± 0.368
1.599MetAsp: 1.599 ± 0.625
1.919MetGlu: 1.919 ± 0.577
1.759MetPhe: 1.759 ± 0.648
1.919MetGly: 1.919 ± 0.598
0.799MetHis: 0.799 ± 0.322
3.358MetIle: 3.358 ± 0.524
2.398MetLys: 2.398 ± 0.799
3.198MetLeu: 3.198 ± 0.774
1.759MetMet: 1.759 ± 0.413
1.279MetAsn: 1.279 ± 0.502
1.919MetPro: 1.919 ± 0.947
0.959MetGln: 0.959 ± 0.445
3.198MetArg: 3.198 ± 0.765
2.079MetSer: 2.079 ± 0.761
1.439MetThr: 1.439 ± 0.531
0.959MetVal: 0.959 ± 0.232
0.16MetTrp: 0.16 ± 0.152
0.64MetTyr: 0.64 ± 0.305
0.0MetXaa: 0.0 ± 0.0
Asn
3.678AsnAla: 3.678 ± 0.771
0.32AsnCys: 0.32 ± 0.204
1.599AsnAsp: 1.599 ± 0.364
4.797AsnGlu: 4.797 ± 0.795
2.718AsnPhe: 2.718 ± 0.522
2.239AsnGly: 2.239 ± 0.443
0.64AsnHis: 0.64 ± 0.247
3.038AsnIle: 3.038 ± 0.438
1.119AsnLys: 1.119 ± 0.399
4.157AsnLeu: 4.157 ± 1.062
0.799AsnMet: 0.799 ± 0.275
1.119AsnAsn: 1.119 ± 0.379
1.279AsnPro: 1.279 ± 0.487
1.759AsnGln: 1.759 ± 0.946
1.919AsnArg: 1.919 ± 0.661
2.558AsnSer: 2.558 ± 0.619
2.878AsnThr: 2.878 ± 0.542
3.198AsnVal: 3.198 ± 0.849
0.48AsnTrp: 0.48 ± 0.237
2.878AsnTyr: 2.878 ± 0.76
0.0AsnXaa: 0.0 ± 0.0
Pro
2.558ProAla: 2.558 ± 0.766
0.16ProCys: 0.16 ± 0.2
2.398ProAsp: 2.398 ± 1.189
2.558ProGlu: 2.558 ± 0.702
1.599ProPhe: 1.599 ± 0.516
2.558ProGly: 2.558 ± 0.549
0.64ProHis: 0.64 ± 0.2
2.558ProIle: 2.558 ± 0.859
1.439ProLys: 1.439 ± 0.558
3.358ProLeu: 3.358 ± 0.833
1.599ProMet: 1.599 ± 0.548
2.239ProAsn: 2.239 ± 0.96
1.599ProPro: 1.599 ± 0.56
1.279ProGln: 1.279 ± 0.294
1.279ProArg: 1.279 ± 0.5
2.398ProSer: 2.398 ± 0.778
2.079ProThr: 2.079 ± 0.701
2.878ProVal: 2.878 ± 1.083
0.64ProTrp: 0.64 ± 0.339
2.558ProTyr: 2.558 ± 0.555
0.0ProXaa: 0.0 ± 0.0
Gln
2.079GlnAla: 2.079 ± 0.613
0.959GlnCys: 0.959 ± 0.424
2.398GlnAsp: 2.398 ± 0.467
2.239GlnGlu: 2.239 ± 0.649
0.64GlnPhe: 0.64 ± 0.2
1.599GlnGly: 1.599 ± 0.328
0.799GlnHis: 0.799 ± 0.319
2.558GlnIle: 2.558 ± 0.687
2.398GlnLys: 2.398 ± 0.693
2.398GlnLeu: 2.398 ± 0.519
0.959GlnMet: 0.959 ± 0.288
1.599GlnAsn: 1.599 ± 0.454
1.439GlnPro: 1.439 ± 0.502
0.959GlnGln: 0.959 ± 0.438
2.239GlnArg: 2.239 ± 0.487
2.718GlnSer: 2.718 ± 0.804
2.558GlnThr: 2.558 ± 0.407
1.759GlnVal: 1.759 ± 0.436
0.32GlnTrp: 0.32 ± 0.221
0.959GlnTyr: 0.959 ± 0.402
0.0GlnXaa: 0.0 ± 0.0
Arg
4.957ArgAla: 4.957 ± 0.757
1.119ArgCys: 1.119 ± 0.34
3.678ArgAsp: 3.678 ± 0.697
5.117ArgGlu: 5.117 ± 0.717
3.358ArgPhe: 3.358 ± 0.704
4.477ArgGly: 4.477 ± 0.715
1.599ArgHis: 1.599 ± 0.341
4.637ArgIle: 4.637 ± 0.885
3.997ArgLys: 3.997 ± 0.7
5.117ArgLeu: 5.117 ± 1.089
2.878ArgMet: 2.878 ± 0.82
3.198ArgAsn: 3.198 ± 0.898
1.439ArgPro: 1.439 ± 0.415
1.919ArgGln: 1.919 ± 0.609
5.437ArgArg: 5.437 ± 1.336
5.277ArgSer: 5.277 ± 0.853
3.358ArgThr: 3.358 ± 0.503
4.637ArgVal: 4.637 ± 0.852
0.32ArgTrp: 0.32 ± 0.256
1.119ArgTyr: 1.119 ± 0.435
0.0ArgXaa: 0.0 ± 0.0
Ser
4.957SerAla: 4.957 ± 0.874
0.64SerCys: 0.64 ± 0.316
2.878SerAsp: 2.878 ± 0.54
4.797SerGlu: 4.797 ± 1.111
2.398SerPhe: 2.398 ± 0.541
4.317SerGly: 4.317 ± 0.843
1.119SerHis: 1.119 ± 0.426
2.718SerIle: 2.718 ± 0.609
3.997SerLys: 3.997 ± 1.08
6.236SerLeu: 6.236 ± 0.879
2.398SerMet: 2.398 ± 0.696
2.718SerAsn: 2.718 ± 0.474
3.358SerPro: 3.358 ± 0.704
1.119SerGln: 1.119 ± 0.427
5.117SerArg: 5.117 ± 0.725
3.997SerSer: 3.997 ± 0.642
5.756SerThr: 5.756 ± 0.923
3.358SerVal: 3.358 ± 0.844
0.64SerTrp: 0.64 ± 0.234
2.398SerTyr: 2.398 ± 0.659
0.0SerXaa: 0.0 ± 0.0
Thr
3.838ThrAla: 3.838 ± 0.921
0.48ThrCys: 0.48 ± 0.243
3.838ThrAsp: 3.838 ± 0.593
3.997ThrGlu: 3.997 ± 0.776
1.439ThrPhe: 1.439 ± 0.348
4.157ThrGly: 4.157 ± 0.771
2.079ThrHis: 2.079 ± 0.386
4.637ThrIle: 4.637 ± 0.582
3.838ThrLys: 3.838 ± 1.314
5.756ThrLeu: 5.756 ± 0.6
3.038ThrMet: 3.038 ± 0.618
1.919ThrAsn: 1.919 ± 0.8
3.038ThrPro: 3.038 ± 0.761
2.878ThrGln: 2.878 ± 0.594
5.277ThrArg: 5.277 ± 0.85
2.878ThrSer: 2.878 ± 0.455
5.277ThrThr: 5.277 ± 0.558
3.518ThrVal: 3.518 ± 0.861
0.32ThrTrp: 0.32 ± 0.235
1.279ThrTyr: 1.279 ± 0.312
0.0ThrXaa: 0.0 ± 0.0
Val
3.038ValAla: 3.038 ± 0.861
1.439ValCys: 1.439 ± 0.57
3.838ValAsp: 3.838 ± 0.804
4.317ValGlu: 4.317 ± 0.782
2.079ValPhe: 2.079 ± 0.534
4.157ValGly: 4.157 ± 0.894
1.439ValHis: 1.439 ± 0.449
3.838ValIle: 3.838 ± 0.704
5.117ValLys: 5.117 ± 1.261
5.756ValLeu: 5.756 ± 1.09
2.239ValMet: 2.239 ± 0.531
3.038ValAsn: 3.038 ± 0.955
2.558ValPro: 2.558 ± 0.499
2.398ValGln: 2.398 ± 0.527
3.838ValArg: 3.838 ± 0.813
3.678ValSer: 3.678 ± 0.927
4.637ValThr: 4.637 ± 0.744
4.797ValVal: 4.797 ± 0.664
0.48ValTrp: 0.48 ± 0.317
2.398ValTyr: 2.398 ± 0.491
0.0ValXaa: 0.0 ± 0.0
Trp
0.64TrpAla: 0.64 ± 0.253
0.32TrpCys: 0.32 ± 0.228
0.48TrpAsp: 0.48 ± 0.212
0.32TrpGlu: 0.32 ± 0.2
0.64TrpPhe: 0.64 ± 0.258
0.48TrpGly: 0.48 ± 0.327
0.48TrpHis: 0.48 ± 0.269
1.279TrpIle: 1.279 ± 0.551
0.959TrpLys: 0.959 ± 0.382
0.64TrpLeu: 0.64 ± 0.247
0.32TrpMet: 0.32 ± 0.227
0.799TrpAsn: 0.799 ± 0.478
0.16TrpPro: 0.16 ± 0.192
0.32TrpGln: 0.32 ± 0.265
0.799TrpArg: 0.799 ± 0.362
0.32TrpSer: 0.32 ± 0.305
0.48TrpThr: 0.48 ± 0.305
0.48TrpVal: 0.48 ± 0.264
0.16TrpTrp: 0.16 ± 0.152
0.48TrpTyr: 0.48 ± 0.434
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.279TyrAla: 1.279 ± 0.394
0.32TyrCys: 0.32 ± 0.163
2.398TyrAsp: 2.398 ± 0.39
1.279TyrGlu: 1.279 ± 0.629
0.799TyrPhe: 0.799 ± 0.251
2.239TyrGly: 2.239 ± 0.494
0.959TyrHis: 0.959 ± 0.729
2.558TyrIle: 2.558 ± 0.599
2.079TyrLys: 2.079 ± 0.702
3.838TyrLeu: 3.838 ± 1.07
0.959TyrMet: 0.959 ± 0.26
1.279TyrAsn: 1.279 ± 0.468
1.279TyrPro: 1.279 ± 0.394
1.599TyrGln: 1.599 ± 0.658
1.919TyrArg: 1.919 ± 0.599
3.198TyrSer: 3.198 ± 0.766
2.558TyrThr: 2.558 ± 0.573
3.518TyrVal: 3.518 ± 0.808
0.32TyrTrp: 0.32 ± 0.198
0.799TyrTyr: 0.799 ± 0.295
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6255 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski