Amino acid dipeptide frequency for Wenling tonguesole paramyxovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
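The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names, the toy sequences, and the choice to count overlapping pairs within each protein only (never across protein boundaries) are assumptions made for the example. Frequencies are returned in per mille, so the uniform expectation of 0.25% appears as 2.5.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (one-letter codes)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping windows of length 2; assumed never to span two proteins.
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()} if total else {}


# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


toy_proteins = ["MALLK", "LLA"]  # made-up sequences, not from this proteome
freqs = dipeptide_per_mille(toy_proteins)
print(round(freqs["LL"], 3), classify(freqs["LL"]))
```

With this labelling, a table value such as 4.531 would count as "over" and 0.453 as "under", which matches the red and blue marking described above.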

All values are presented in per mille (‰); multiply them by 10^-3 to obtain fractions.
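As a small worked example of the unit conversion, the snippet below turns one table value (AlaAla, 4.531‰) into a fraction and a percentage and compares it with the uniform expectation of 1/400. The helper name is hypothetical and chosen only for this illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


ala_ala = 4.531                       # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = fraction * 100.0            # the same value expressed in percent
uniform = 1.0 / 400.0                 # uniform expectation, i.e. 0.25%
ratio = fraction / uniform            # how many times the uniform expectation

print(f"fraction={fraction:.6f} percent={percent:.4f} ratio={ratio:.2f}")
```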

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Entries are grouped by first residue; each gives the dipeptide, its frequency, and the error, in ‰.

Ala
AlaAla: 4.531 ± 1.264
AlaCys: 2.266 ± 0.651
AlaAsp: 4.078 ± 0.355
AlaGlu: 3.851 ± 0.748
AlaPhe: 2.039 ± 0.92
AlaGly: 3.398 ± 0.588
AlaHis: 0.453 ± 0.206
AlaIle: 5.437 ± 0.979
AlaLys: 2.945 ± 0.802
AlaLeu: 8.836 ± 1.754
AlaMet: 2.039 ± 0.674
AlaAsn: 2.039 ± 0.582
AlaPro: 3.172 ± 0.755
AlaGln: 0.68 ± 0.36
AlaArg: 4.758 ± 1.749
AlaSer: 3.851 ± 0.657
AlaThr: 4.758 ± 1.128
AlaVal: 4.078 ± 1.764
AlaTrp: 0.906 ± 0.333
AlaTyr: 2.039 ± 0.579
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.586 ± 0.385
CysCys: 0.227 ± 0.12
CysAsp: 1.133 ± 0.453
CysGlu: 0.453 ± 0.342
CysPhe: 1.133 ± 0.443
CysGly: 0.68 ± 0.473
CysHis: 0.227 ± 0.12
CysIle: 1.812 ± 0.586
CysLys: 0.68 ± 0.245
CysLeu: 1.359 ± 0.408
CysMet: 0.68 ± 0.308
CysAsn: 0.906 ± 0.552
CysPro: 0.906 ± 0.233
CysGln: 0.453 ± 0.24
CysArg: 0.227 ± 0.24
CysSer: 0.68 ± 0.237
CysThr: 2.039 ± 0.94
CysVal: 1.133 ± 0.257
CysTrp: 0.453 ± 0.325
CysTyr: 1.133 ± 0.61
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.531 ± 0.908
AspCys: 0.906 ± 0.461
AspAsp: 3.172 ± 0.645
AspGlu: 2.945 ± 0.565
AspPhe: 0.68 ± 0.365
AspGly: 2.492 ± 0.428
AspHis: 2.719 ± 1.013
AspIle: 4.078 ± 0.749
AspLys: 3.398 ± 0.489
AspLeu: 6.57 ± 1.068
AspMet: 1.133 ± 0.469
AspAsn: 1.586 ± 0.641
AspPro: 3.398 ± 0.876
AspGln: 2.945 ± 0.891
AspArg: 3.398 ± 0.979
AspSer: 4.078 ± 1.116
AspThr: 4.078 ± 0.914
AspVal: 2.945 ± 0.705
AspTrp: 1.133 ± 0.336
AspTyr: 1.359 ± 0.503
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.719 ± 0.414
GluCys: 0.68 ± 0.237
GluAsp: 4.304 ± 0.713
GluGlu: 1.359 ± 0.374
GluPhe: 1.133 ± 0.42
GluGly: 2.719 ± 0.669
GluHis: 0.906 ± 0.413
GluIle: 5.437 ± 1.221
GluLys: 2.266 ± 0.821
GluLeu: 4.531 ± 1.055
GluMet: 1.812 ± 0.556
GluAsn: 2.492 ± 0.645
GluPro: 2.719 ± 1.129
GluGln: 2.719 ± 0.673
GluArg: 3.172 ± 0.612
GluSer: 2.492 ± 0.516
GluThr: 3.172 ± 1.042
GluVal: 3.172 ± 0.89
GluTrp: 0.0 ± 0.0
GluTyr: 1.812 ± 0.416
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.133 ± 0.799
PheCys: 0.68 ± 0.273
PheAsp: 2.266 ± 0.397
PheGlu: 1.359 ± 0.55
PhePhe: 2.039 ± 0.646
PheGly: 1.359 ± 0.352
PheHis: 0.906 ± 0.48
PheIle: 1.812 ± 0.607
PheLys: 2.039 ± 0.341
PheLeu: 3.398 ± 0.775
PheMet: 1.586 ± 0.787
PheAsn: 2.719 ± 0.573
PhePro: 1.359 ± 0.706
PheGln: 1.586 ± 0.563
PheArg: 2.492 ± 0.703
PheSer: 2.945 ± 1.065
PheThr: 1.812 ± 0.611
PheVal: 2.039 ± 0.653
PheTrp: 0.0 ± 0.0
PheTyr: 0.68 ± 0.36
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.758 ± 1.005
GlyCys: 0.906 ± 0.46
GlyAsp: 2.266 ± 0.468
GlyGlu: 2.266 ± 0.768
GlyPhe: 1.812 ± 0.365
GlyGly: 3.398 ± 0.709
GlyHis: 1.359 ± 0.52
GlyIle: 3.172 ± 0.442
GlyLys: 3.398 ± 0.971
GlyLeu: 7.023 ± 1.072
GlyMet: 1.812 ± 0.798
GlyAsn: 1.586 ± 0.406
GlyPro: 1.359 ± 0.702
GlyGln: 2.492 ± 1.039
GlyArg: 2.945 ± 0.766
GlySer: 4.531 ± 0.834
GlyThr: 4.984 ± 1.15
GlyVal: 3.851 ± 0.892
GlyTrp: 0.68 ± 0.26
GlyTyr: 2.266 ± 0.594
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.586 ± 0.397
HisCys: 0.453 ± 0.325
HisAsp: 2.266 ± 0.569
HisGlu: 0.227 ± 0.297
HisPhe: 0.906 ± 0.343
HisGly: 1.133 ± 0.469
HisHis: 0.906 ± 0.365
HisIle: 2.492 ± 0.741
HisLys: 1.359 ± 0.46
HisLeu: 2.039 ± 0.801
HisMet: 1.133 ± 0.293
HisAsn: 1.812 ± 0.281
HisPro: 1.133 ± 0.448
HisGln: 0.0 ± 0.0
HisArg: 1.812 ± 0.57
HisSer: 0.453 ± 0.399
HisThr: 2.492 ± 0.706
HisVal: 1.586 ± 0.473
HisTrp: 0.68 ± 0.237
HisTyr: 1.133 ± 0.47
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.437 ± 1.05
IleCys: 1.586 ± 0.436
IleAsp: 3.851 ± 0.953
IleGlu: 3.625 ± 0.479
IlePhe: 2.492 ± 0.994
IleGly: 3.172 ± 0.855
IleHis: 1.812 ± 0.493
IleIle: 6.797 ± 1.894
IleLys: 4.078 ± 0.559
IleLeu: 9.062 ± 1.073
IleMet: 2.266 ± 0.696
IleAsn: 4.078 ± 1.287
IlePro: 3.625 ± 0.825
IleGln: 3.398 ± 1.202
IleArg: 4.758 ± 1.411
IleSer: 7.476 ± 1.83
IleThr: 3.625 ± 0.938
IleVal: 2.492 ± 0.526
IleTrp: 1.586 ± 0.484
IleTyr: 2.266 ± 0.442
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.851 ± 0.853
LysCys: 0.906 ± 0.379
LysAsp: 4.531 ± 0.371
LysGlu: 1.133 ± 0.6
LysPhe: 1.586 ± 0.684
LysGly: 3.625 ± 0.885
LysHis: 1.133 ± 0.434
LysIle: 4.984 ± 1.136
LysLys: 3.398 ± 0.669
LysLeu: 4.984 ± 1.092
LysMet: 1.359 ± 0.554
LysAsn: 2.492 ± 0.349
LysPro: 1.586 ± 0.743
LysGln: 0.906 ± 0.328
LysArg: 2.266 ± 0.62
LysSer: 4.758 ± 1.146
LysThr: 3.172 ± 0.761
LysVal: 3.398 ± 1.006
LysTrp: 0.68 ± 0.263
LysTyr: 1.586 ± 0.43
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.343 ± 0.92
LeuCys: 1.133 ± 0.326
LeuAsp: 4.984 ± 1.13
LeuGlu: 7.023 ± 1.234
LeuPhe: 4.758 ± 0.671
LeuGly: 6.797 ± 1.14
LeuHis: 2.719 ± 0.683
LeuIle: 8.382 ± 1.085
LeuLys: 6.797 ± 1.231
LeuLeu: 11.781 ± 1.885
LeuMet: 3.625 ± 1.311
LeuAsn: 4.078 ± 1.223
LeuPro: 4.304 ± 0.93
LeuGln: 1.812 ± 0.641
LeuArg: 5.437 ± 1.073
LeuSer: 8.609 ± 0.897
LeuThr: 7.703 ± 1.288
LeuVal: 5.211 ± 1.022
LeuTrp: 1.586 ± 0.664
LeuTyr: 2.039 ± 0.344
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.039 ± 0.455
MetCys: 0.68 ± 0.366
MetAsp: 0.68 ± 0.267
MetGlu: 1.812 ± 0.604
MetPhe: 0.906 ± 0.53
MetGly: 1.359 ± 0.535
MetHis: 0.68 ± 0.267
MetIle: 2.266 ± 0.911
MetLys: 1.359 ± 0.494
MetLeu: 2.492 ± 0.591
MetMet: 0.68 ± 0.308
MetAsn: 1.586 ± 0.286
MetPro: 0.453 ± 0.24
MetGln: 1.812 ± 0.64
MetArg: 1.812 ± 0.469
MetSer: 2.266 ± 0.66
MetThr: 3.625 ± 0.808
MetVal: 1.133 ± 0.508
MetTrp: 0.227 ± 0.256
MetTyr: 1.133 ± 0.361
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.945 ± 0.991
AsnCys: 0.68 ± 0.274
AsnAsp: 3.172 ± 0.694
AsnGlu: 2.039 ± 0.522
AsnPhe: 2.039 ± 0.47
AsnGly: 4.531 ± 0.978
AsnHis: 1.586 ± 0.464
AsnIle: 2.945 ± 0.74
AsnLys: 0.906 ± 0.348
AsnLeu: 3.851 ± 0.865
AsnMet: 0.906 ± 0.343
AsnAsn: 1.586 ± 0.425
AsnPro: 2.492 ± 0.562
AsnGln: 2.945 ± 0.907
AsnArg: 3.625 ± 1.198
AsnSer: 3.625 ± 0.651
AsnThr: 2.719 ± 1.092
AsnVal: 1.812 ± 0.561
AsnTrp: 1.133 ± 0.411
AsnTyr: 0.906 ± 0.413
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.398 ± 0.345
ProCys: 0.0 ± 0.0
ProAsp: 2.039 ± 0.639
ProGlu: 2.492 ± 1.145
ProPhe: 1.133 ± 0.291
ProGly: 2.266 ± 0.769
ProHis: 0.453 ± 0.267
ProIle: 1.812 ± 0.749
ProLys: 1.812 ± 0.316
ProLeu: 6.117 ± 1.158
ProMet: 0.906 ± 0.504
ProAsn: 3.398 ± 1.226
ProPro: 1.812 ± 0.35
ProGln: 1.359 ± 0.347
ProArg: 1.812 ± 0.59
ProSer: 3.172 ± 0.532
ProThr: 3.625 ± 0.589
ProVal: 2.266 ± 0.483
ProTrp: 0.453 ± 0.24
ProTyr: 1.359 ± 0.472
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.039 ± 0.39
GlnCys: 0.906 ± 0.413
GlnAsp: 2.039 ± 0.44
GlnGlu: 2.492 ± 0.717
GlnPhe: 1.359 ± 0.385
GlnGly: 2.266 ± 0.627
GlnHis: 0.906 ± 0.396
GlnIle: 2.266 ± 0.605
GlnLys: 2.945 ± 0.732
GlnLeu: 4.758 ± 1.301
GlnMet: 0.906 ± 0.504
GlnAsn: 1.359 ± 0.461
GlnPro: 0.453 ± 0.206
GlnGln: 1.812 ± 0.592
GlnArg: 1.812 ± 0.405
GlnSer: 3.172 ± 0.636
GlnThr: 1.586 ± 0.601
GlnVal: 2.039 ± 0.537
GlnTrp: 0.453 ± 0.24
GlnTyr: 1.359 ± 0.716
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.812 ± 0.369
ArgCys: 0.906 ± 0.248
ArgAsp: 4.984 ± 0.806
ArgGlu: 3.172 ± 1.68
ArgPhe: 1.133 ± 0.257
ArgGly: 3.625 ± 0.933
ArgHis: 2.266 ± 0.797
ArgIle: 2.945 ± 0.655
ArgLys: 2.945 ± 0.593
ArgLeu: 5.664 ± 1.203
ArgMet: 1.359 ± 0.558
ArgAsn: 2.266 ± 0.521
ArgPro: 1.812 ± 0.337
ArgGln: 1.586 ± 0.522
ArgArg: 1.133 ± 0.377
ArgSer: 3.851 ± 0.507
ArgThr: 4.531 ± 1.041
ArgVal: 4.078 ± 1.27
ArgTrp: 0.906 ± 0.48
ArgTyr: 2.039 ± 0.768
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.89 ± 1.194
SerCys: 1.359 ± 0.461
SerAsp: 2.719 ± 0.822
SerGlu: 2.719 ± 0.783
SerPhe: 2.945 ± 0.834
SerGly: 4.078 ± 1.236
SerHis: 1.586 ± 0.38
SerIle: 6.797 ± 1.622
SerLys: 3.172 ± 0.673
SerLeu: 7.703 ± 0.999
SerMet: 1.586 ± 0.376
SerAsn: 3.398 ± 0.716
SerPro: 3.851 ± 0.786
SerGln: 3.625 ± 0.663
SerArg: 4.078 ± 0.891
SerSer: 4.531 ± 1.026
SerThr: 6.57 ± 1.609
SerVal: 5.437 ± 1.351
SerTrp: 1.359 ± 0.417
SerTyr: 0.68 ± 0.245
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.758 ± 1.496
ThrCys: 1.586 ± 0.44
ThrAsp: 4.531 ± 1.257
ThrGlu: 3.172 ± 0.739
ThrPhe: 2.266 ± 0.805
ThrGly: 6.117 ± 1.582
ThrHis: 2.266 ± 0.442
ThrIle: 7.25 ± 1.423
ThrLys: 2.492 ± 0.438
ThrLeu: 6.343 ± 0.734
ThrMet: 1.586 ± 0.362
ThrAsn: 3.172 ± 0.793
ThrPro: 3.625 ± 0.611
ThrGln: 3.625 ± 0.73
ThrArg: 3.625 ± 0.468
ThrSer: 6.117 ± 1.801
ThrThr: 6.343 ± 1.414
ThrVal: 4.304 ± 0.994
ThrTrp: 0.906 ± 0.314
ThrTyr: 2.039 ± 0.878
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.078 ± 0.741
ValCys: 1.586 ± 0.367
ValAsp: 2.039 ± 0.919
ValGlu: 6.343 ± 1.004
ValPhe: 1.812 ± 0.567
ValGly: 1.586 ± 0.653
ValHis: 1.586 ± 0.55
ValIle: 4.304 ± 0.896
ValLys: 3.172 ± 0.81
ValLeu: 5.211 ± 0.684
ValMet: 2.492 ± 0.714
ValAsn: 3.398 ± 0.659
ValPro: 1.812 ± 0.676
ValGln: 1.812 ± 0.396
ValArg: 1.812 ± 0.761
ValSer: 4.304 ± 1.307
ValThr: 5.664 ± 1.189
ValVal: 3.398 ± 0.766
ValTrp: 0.68 ± 0.342
ValTyr: 0.906 ± 0.31
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.133 ± 0.42
TrpCys: 0.0 ± 0.0
TrpAsp: 0.68 ± 0.291
TrpGlu: 0.906 ± 0.531
TrpPhe: 0.453 ± 0.206
TrpGly: 0.68 ± 0.296
TrpHis: 0.0 ± 0.0
TrpIle: 0.68 ± 0.273
TrpLys: 1.359 ± 0.472
TrpLeu: 1.586 ± 0.65
TrpMet: 0.227 ± 0.12
TrpAsn: 0.453 ± 0.24
TrpPro: 0.453 ± 0.24
TrpGln: 0.227 ± 0.12
TrpArg: 1.359 ± 0.52
TrpSer: 1.133 ± 0.524
TrpThr: 1.359 ± 0.394
TrpVal: 0.68 ± 0.316
TrpTrp: 0.453 ± 0.206
TrpTyr: 0.453 ± 0.231
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.586 ± 0.684
TyrCys: 0.453 ± 0.26
TyrAsp: 2.039 ± 0.483
TyrGlu: 0.227 ± 0.12
TyrPhe: 1.586 ± 0.65
TyrGly: 1.586 ± 0.693
TyrHis: 1.133 ± 0.413
TyrIle: 1.812 ± 0.337
TyrLys: 1.812 ± 0.718
TyrLeu: 1.812 ± 0.559
TyrMet: 0.68 ± 0.305
TyrAsn: 2.039 ± 0.72
TyrPro: 1.359 ± 0.539
TyrGln: 1.133 ± 0.428
TyrArg: 0.68 ± 0.263
TyrSer: 2.039 ± 0.695
TyrThr: 2.266 ± 0.874
TyrVal: 2.945 ± 0.437
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.453 ± 0.397
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (4415 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
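One way to picture a protein-level bootstrap is the sketch below. It is an illustration under stated assumptions, not the database's documented procedure: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the sample standard deviation of those estimates is taken as the error. The function names, the fixed seed, the use of the standard deviation, and the toy sequences are all choices made for this example.

```python
import random
from collections import Counter
from statistics import stdev


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide, pooled over the given sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency estimate when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return stdev(estimates)


toy_proteins = ["MALLK", "LLA", "GLLT"]  # made-up sequences, not from this proteome
print(pair_frequency(toy_proteins, "LL"), bootstrap_error(toy_proteins, "LL"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so with only a few proteins the estimated error can be large relative to the frequency itself.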

You can download the above dipeptide statistics (among other stats for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski