Amino acid dipepetide frequency for Bat Paramyxovirus Epo_spe/AR1/DRC/2009

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.509AlaAla: 4.509 ± 1.686
0.712AlaCys: 0.712 ± 0.349
3.56AlaAsp: 3.56 ± 1.79
2.136AlaGlu: 2.136 ± 0.652
2.373AlaPhe: 2.373 ± 0.674
3.797AlaGly: 3.797 ± 0.974
1.187AlaHis: 1.187 ± 0.53
3.797AlaIle: 3.797 ± 0.477
2.848AlaLys: 2.848 ± 2.023
6.407AlaLeu: 6.407 ± 0.777
1.187AlaMet: 1.187 ± 1.138
3.56AlaAsn: 3.56 ± 1.001
2.373AlaPro: 2.373 ± 0.687
4.271AlaGln: 4.271 ± 2.05
4.746AlaArg: 4.746 ± 2.706
6.645AlaSer: 6.645 ± 2.254
3.322AlaThr: 3.322 ± 0.858
1.661AlaVal: 1.661 ± 0.544
0.475AlaTrp: 0.475 ± 0.466
2.61AlaTyr: 2.61 ± 0.66
0.0AlaXaa: 0.0 ± 0.0
Cys
1.187CysAla: 1.187 ± 0.328
0.475CysCys: 0.475 ± 0.29
0.237CysAsp: 0.237 ± 0.314
0.949CysGlu: 0.949 ± 0.442
1.187CysPhe: 1.187 ± 0.503
0.475CysGly: 0.475 ± 0.29
0.237CysHis: 0.237 ± 0.346
1.898CysIle: 1.898 ± 0.929
1.187CysLys: 1.187 ± 1.115
1.424CysLeu: 1.424 ± 1.119
0.475CysMet: 0.475 ± 0.456
0.712CysAsn: 0.712 ± 0.349
0.712CysPro: 0.712 ± 0.266
0.949CysGln: 0.949 ± 0.58
0.0CysArg: 0.0 ± 0.0
2.848CysSer: 2.848 ± 1.277
0.949CysThr: 0.949 ± 0.876
1.661CysVal: 1.661 ± 0.489
0.0CysTrp: 0.0 ± 0.0
1.424CysTyr: 1.424 ± 0.493
0.0CysXaa: 0.0 ± 0.0
Asp
1.898AspAla: 1.898 ± 0.507
0.949AspCys: 0.949 ± 0.332
6.407AspAsp: 6.407 ± 1.962
2.373AspGlu: 2.373 ± 0.609
0.949AspPhe: 0.949 ± 0.386
2.136AspGly: 2.136 ± 0.871
1.424AspHis: 1.424 ± 0.672
3.322AspIle: 3.322 ± 0.917
2.848AspLys: 2.848 ± 1.092
6.645AspLeu: 6.645 ± 1.637
1.187AspMet: 1.187 ± 0.368
2.136AspAsn: 2.136 ± 0.617
4.509AspPro: 4.509 ± 1.321
2.848AspGln: 2.848 ± 0.933
2.136AspArg: 2.136 ± 1.37
3.322AspSer: 3.322 ± 1.137
2.136AspThr: 2.136 ± 0.58
2.136AspVal: 2.136 ± 0.584
0.475AspTrp: 0.475 ± 0.271
2.373AspTyr: 2.373 ± 0.876
0.0AspXaa: 0.0 ± 0.0
Glu
1.661GluAla: 1.661 ± 1.121
0.949GluCys: 0.949 ± 0.58
2.61GluAsp: 2.61 ± 0.989
2.373GluGlu: 2.373 ± 0.697
1.424GluPhe: 1.424 ± 0.594
3.797GluGly: 3.797 ± 0.92
0.475GluHis: 0.475 ± 0.29
3.797GluIle: 3.797 ± 0.93
2.61GluLys: 2.61 ± 1.179
5.933GluLeu: 5.933 ± 0.94
1.424GluMet: 1.424 ± 0.373
2.136GluAsn: 2.136 ± 0.451
1.898GluPro: 1.898 ± 0.329
3.085GluGln: 3.085 ± 1.162
2.373GluArg: 2.373 ± 0.454
3.085GluSer: 3.085 ± 0.983
3.322GluThr: 3.322 ± 0.439
1.898GluVal: 1.898 ± 0.664
0.475GluTrp: 0.475 ± 0.29
1.898GluTyr: 1.898 ± 0.897
0.0GluXaa: 0.0 ± 0.0
Phe
1.424PheAla: 1.424 ± 0.532
0.712PheCys: 0.712 ± 0.3
0.949PheAsp: 0.949 ± 0.638
2.373PheGlu: 2.373 ± 0.649
1.898PhePhe: 1.898 ± 0.343
1.187PheGly: 1.187 ± 0.54
0.475PheHis: 0.475 ± 0.316
2.848PheIle: 2.848 ± 0.663
0.949PheLys: 0.949 ± 0.492
3.797PheLeu: 3.797 ± 0.7
0.949PheMet: 0.949 ± 1.035
2.373PheAsn: 2.373 ± 1.205
1.187PhePro: 1.187 ± 0.537
1.187PheGln: 1.187 ± 0.825
1.661PheArg: 1.661 ± 0.968
3.322PheSer: 3.322 ± 1.037
3.085PheThr: 3.085 ± 1.259
1.898PheVal: 1.898 ± 0.476
0.0PheTrp: 0.0 ± 0.0
1.424PheTyr: 1.424 ± 0.533
0.0PheXaa: 0.0 ± 0.0
Gly
4.271GlyAla: 4.271 ± 1.991
1.424GlyCys: 1.424 ± 0.742
3.797GlyAsp: 3.797 ± 0.327
3.56GlyGlu: 3.56 ± 0.885
1.424GlyPhe: 1.424 ± 0.269
3.322GlyGly: 3.322 ± 1.387
0.712GlyHis: 0.712 ± 0.349
4.746GlyIle: 4.746 ± 1.001
2.373GlyLys: 2.373 ± 0.426
4.509GlyLeu: 4.509 ± 0.882
1.424GlyMet: 1.424 ± 0.941
2.373GlyAsn: 2.373 ± 0.813
1.898GlyPro: 1.898 ± 1.141
2.136GlyGln: 2.136 ± 0.413
2.848GlyArg: 2.848 ± 0.423
4.983GlySer: 4.983 ± 1.34
3.085GlyThr: 3.085 ± 1.661
4.746GlyVal: 4.746 ± 1.517
0.475GlyTrp: 0.475 ± 0.316
1.424GlyTyr: 1.424 ± 0.515
0.237GlyXaa: 0.237 ± 0.322
His
1.898HisAla: 1.898 ± 0.69
0.0HisCys: 0.0 ± 0.0
0.712HisAsp: 0.712 ± 0.344
0.237HisGlu: 0.237 ± 0.145
0.237HisPhe: 0.237 ± 0.352
1.187HisGly: 1.187 ± 0.454
0.949HisHis: 0.949 ± 0.689
1.424HisIle: 1.424 ± 0.442
0.949HisLys: 0.949 ± 0.58
3.797HisLeu: 3.797 ± 1.345
0.237HisMet: 0.237 ± 0.145
0.475HisAsn: 0.475 ± 0.257
1.187HisPro: 1.187 ± 0.442
0.712HisGln: 0.712 ± 1.038
0.712HisArg: 0.712 ± 0.3
0.712HisSer: 0.712 ± 0.435
0.475HisThr: 0.475 ± 0.257
0.949HisVal: 0.949 ± 0.423
0.237HisTrp: 0.237 ± 0.322
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.034IleAla: 4.034 ± 0.436
0.712IleCys: 0.712 ± 0.366
2.848IleAsp: 2.848 ± 0.762
5.695IleGlu: 5.695 ± 0.921
2.136IlePhe: 2.136 ± 0.617
2.848IleGly: 2.848 ± 0.672
1.424IleHis: 1.424 ± 0.663
5.458IleIle: 5.458 ± 1.128
3.56IleLys: 3.56 ± 1.764
7.119IleLeu: 7.119 ± 2.189
1.898IleMet: 1.898 ± 0.846
5.695IleAsn: 5.695 ± 1.135
4.746IlePro: 4.746 ± 1.484
4.034IleGln: 4.034 ± 1.034
4.034IleArg: 4.034 ± 0.771
4.271IleSer: 4.271 ± 1.787
4.509IleThr: 4.509 ± 0.711
3.322IleVal: 3.322 ± 0.911
1.898IleTrp: 1.898 ± 1.16
1.898IleTyr: 1.898 ± 0.886
0.0IleXaa: 0.0 ± 0.0
Lys
2.373LysAla: 2.373 ± 0.925
0.949LysCys: 0.949 ± 0.343
1.898LysAsp: 1.898 ± 0.706
2.136LysGlu: 2.136 ± 0.832
1.661LysPhe: 1.661 ± 0.495
2.848LysGly: 2.848 ± 1.396
0.475LysHis: 0.475 ± 0.316
3.56LysIle: 3.56 ± 1.279
2.373LysLys: 2.373 ± 0.785
5.458LysLeu: 5.458 ± 1.066
1.187LysMet: 1.187 ± 0.442
1.424LysAsn: 1.424 ± 0.95
2.136LysPro: 2.136 ± 1.425
2.848LysGln: 2.848 ± 1.215
1.898LysArg: 1.898 ± 0.984
3.322LysSer: 3.322 ± 0.692
3.797LysThr: 3.797 ± 1.221
2.136LysVal: 2.136 ± 0.886
0.712LysTrp: 0.712 ± 0.3
2.373LysTyr: 2.373 ± 0.81
0.0LysXaa: 0.0 ± 0.0
Leu
8.78LeuAla: 8.78 ± 1.198
2.373LeuCys: 2.373 ± 0.785
5.695LeuAsp: 5.695 ± 1.436
5.933LeuGlu: 5.933 ± 1.833
3.085LeuPhe: 3.085 ± 0.6
3.56LeuGly: 3.56 ± 0.766
1.424LeuHis: 1.424 ± 0.87
7.831LeuIle: 7.831 ± 1.587
5.933LeuLys: 5.933 ± 1.218
12.577LeuLeu: 12.577 ± 2.837
3.322LeuMet: 3.322 ± 1.06
8.306LeuAsn: 8.306 ± 1.976
4.983LeuPro: 4.983 ± 1.081
3.797LeuGln: 3.797 ± 0.531
6.407LeuArg: 6.407 ± 1.571
9.492LeuSer: 9.492 ± 1.227
10.441LeuThr: 10.441 ± 2.673
5.458LeuVal: 5.458 ± 0.754
1.187LeuTrp: 1.187 ± 0.54
3.085LeuTyr: 3.085 ± 1.034
0.0LeuXaa: 0.0 ± 0.0
Met
1.661MetAla: 1.661 ± 0.973
0.475MetCys: 0.475 ± 0.29
1.424MetAsp: 1.424 ± 1.345
1.424MetGlu: 1.424 ± 0.486
0.475MetPhe: 0.475 ± 0.466
1.898MetGly: 1.898 ± 0.897
0.237MetHis: 0.237 ± 0.608
2.61MetIle: 2.61 ± 0.901
0.949MetLys: 0.949 ± 0.58
3.085MetLeu: 3.085 ± 1.127
1.187MetMet: 1.187 ± 0.804
1.898MetAsn: 1.898 ± 0.577
1.424MetPro: 1.424 ± 0.756
0.712MetGln: 0.712 ± 0.785
2.136MetArg: 2.136 ± 0.718
1.424MetSer: 1.424 ± 0.265
1.424MetThr: 1.424 ± 0.627
1.424MetVal: 1.424 ± 0.74
0.712MetTrp: 0.712 ± 0.349
0.949MetTyr: 0.949 ± 0.343
0.0MetXaa: 0.0 ± 0.0
Asn
2.848AsnAla: 2.848 ± 0.409
0.949AsnCys: 0.949 ± 0.882
2.61AsnAsp: 2.61 ± 0.885
2.136AsnGlu: 2.136 ± 0.436
1.424AsnPhe: 1.424 ± 0.368
3.322AsnGly: 3.322 ± 0.8
1.898AsnHis: 1.898 ± 0.645
2.61AsnIle: 2.61 ± 0.957
2.373AsnLys: 2.373 ± 0.641
5.695AsnLeu: 5.695 ± 1.443
0.949AsnMet: 0.949 ± 0.304
2.848AsnAsn: 2.848 ± 0.484
3.797AsnPro: 3.797 ± 0.639
3.085AsnGln: 3.085 ± 1.878
3.085AsnArg: 3.085 ± 0.508
4.034AsnSer: 4.034 ± 1.022
2.136AsnThr: 2.136 ± 0.758
2.373AsnVal: 2.373 ± 0.912
1.424AsnTrp: 1.424 ± 0.581
1.898AsnTyr: 1.898 ± 0.866
0.0AsnXaa: 0.0 ± 0.0
Pro
3.085ProAla: 3.085 ± 1.52
0.475ProCys: 0.475 ± 0.466
2.61ProAsp: 2.61 ± 0.889
2.61ProGlu: 2.61 ± 0.951
2.61ProPhe: 2.61 ± 0.84
3.797ProGly: 3.797 ± 1.647
0.475ProHis: 0.475 ± 0.29
3.56ProIle: 3.56 ± 0.749
2.373ProLys: 2.373 ± 0.878
5.933ProLeu: 5.933 ± 0.685
1.187ProMet: 1.187 ± 0.328
2.848ProAsn: 2.848 ± 0.519
3.322ProPro: 3.322 ± 0.547
3.322ProGln: 3.322 ± 1.235
1.187ProArg: 1.187 ± 0.645
4.746ProSer: 4.746 ± 0.851
4.983ProThr: 4.983 ± 1.348
3.085ProVal: 3.085 ± 1.739
0.0ProTrp: 0.0 ± 0.0
1.424ProTyr: 1.424 ± 0.493
0.0ProXaa: 0.0 ± 0.0
Gln
4.271GlnAla: 4.271 ± 1.795
0.0GlnCys: 0.0 ± 0.0
2.61GlnAsp: 2.61 ± 0.873
1.424GlnGlu: 1.424 ± 0.358
1.898GlnPhe: 1.898 ± 0.329
4.271GlnGly: 4.271 ± 1.178
0.712GlnHis: 0.712 ± 0.749
4.983GlnIle: 4.983 ± 1.476
1.898GlnLys: 1.898 ± 0.947
4.983GlnLeu: 4.983 ± 0.988
1.898GlnMet: 1.898 ± 0.912
1.898GlnAsn: 1.898 ± 1.306
2.373GlnPro: 2.373 ± 1.636
2.848GlnGln: 2.848 ± 2.351
2.136GlnArg: 2.136 ± 0.615
3.56GlnSer: 3.56 ± 0.374
2.61GlnThr: 2.61 ± 0.901
3.322GlnVal: 3.322 ± 0.638
0.0GlnTrp: 0.0 ± 0.0
2.136GlnTyr: 2.136 ± 0.924
0.0GlnXaa: 0.0 ± 0.0
Arg
2.373ArgAla: 2.373 ± 0.813
0.712ArgCys: 0.712 ± 0.568
1.661ArgAsp: 1.661 ± 0.525
1.187ArgGlu: 1.187 ± 0.53
1.661ArgPhe: 1.661 ± 1.107
2.136ArgGly: 2.136 ± 1.228
0.237ArgHis: 0.237 ± 0.145
4.271ArgIle: 4.271 ± 1.038
3.322ArgLys: 3.322 ± 0.763
7.356ArgLeu: 7.356 ± 1.085
1.187ArgMet: 1.187 ± 0.339
1.424ArgAsn: 1.424 ± 0.269
3.322ArgPro: 3.322 ± 1.065
1.898ArgGln: 1.898 ± 0.771
2.848ArgArg: 2.848 ± 0.672
4.746ArgSer: 4.746 ± 0.849
1.424ArgThr: 1.424 ± 0.265
5.221ArgVal: 5.221 ± 1.262
0.475ArgTrp: 0.475 ± 0.583
2.136ArgTyr: 2.136 ± 1.37
0.0ArgXaa: 0.0 ± 0.0
Ser
4.983SerAla: 4.983 ± 1.562
2.61SerCys: 2.61 ± 0.781
5.221SerAsp: 5.221 ± 1.728
3.56SerGlu: 3.56 ± 1.159
2.848SerPhe: 2.848 ± 0.868
5.221SerGly: 5.221 ± 2.431
1.661SerHis: 1.661 ± 0.525
3.085SerIle: 3.085 ± 0.703
2.136SerLys: 2.136 ± 0.742
8.78SerLeu: 8.78 ± 1.708
3.085SerMet: 3.085 ± 0.661
4.271SerAsn: 4.271 ± 1.319
4.271SerPro: 4.271 ± 0.942
4.509SerGln: 4.509 ± 0.645
2.136SerArg: 2.136 ± 0.581
6.17SerSer: 6.17 ± 1.452
6.645SerThr: 6.645 ± 1.558
3.797SerVal: 3.797 ± 0.7
1.661SerTrp: 1.661 ± 0.475
3.56SerTyr: 3.56 ± 0.626
0.0SerXaa: 0.0 ± 0.0
Thr
4.746ThrAla: 4.746 ± 1.121
0.712ThrCys: 0.712 ± 0.266
2.61ThrAsp: 2.61 ± 0.889
1.898ThrGlu: 1.898 ± 0.645
3.085ThrPhe: 3.085 ± 0.732
3.322ThrGly: 3.322 ± 1.186
0.949ThrHis: 0.949 ± 0.882
6.17ThrIle: 6.17 ± 1.098
2.61ThrLys: 2.61 ± 0.76
9.255ThrLeu: 9.255 ± 1.408
1.187ThrMet: 1.187 ± 1.018
1.424ThrAsn: 1.424 ± 0.772
3.322ThrPro: 3.322 ± 0.461
3.085ThrGln: 3.085 ± 1.325
4.034ThrArg: 4.034 ± 0.795
4.034ThrSer: 4.034 ± 1.078
4.034ThrThr: 4.034 ± 0.888
4.746ThrVal: 4.746 ± 1.119
0.949ThrTrp: 0.949 ± 0.343
1.187ThrTyr: 1.187 ± 0.558
0.0ThrXaa: 0.0 ± 0.0
Val
2.61ValAla: 2.61 ± 0.872
1.898ValCys: 1.898 ± 1.904
2.848ValAsp: 2.848 ± 0.843
3.085ValGlu: 3.085 ± 0.941
1.187ValPhe: 1.187 ± 0.5
3.56ValGly: 3.56 ± 1.4
1.898ValHis: 1.898 ± 0.664
3.322ValIle: 3.322 ± 0.828
2.136ValLys: 2.136 ± 0.885
5.695ValLeu: 5.695 ± 0.793
2.61ValMet: 2.61 ± 0.201
2.848ValAsn: 2.848 ± 0.655
4.034ValPro: 4.034 ± 0.75
1.661ValGln: 1.661 ± 0.272
2.61ValArg: 2.61 ± 0.621
5.458ValSer: 5.458 ± 2.258
2.373ValThr: 2.373 ± 0.849
2.848ValVal: 2.848 ± 0.464
0.237ValTrp: 0.237 ± 0.346
3.322ValTyr: 3.322 ± 0.833
0.0ValXaa: 0.0 ± 0.0
Trp
1.187TrpAla: 1.187 ± 0.454
0.475TrpCys: 0.475 ± 0.257
0.237TrpAsp: 0.237 ± 0.314
0.475TrpGlu: 0.475 ± 0.29
0.712TrpPhe: 0.712 ± 0.349
0.712TrpGly: 0.712 ± 0.349
0.0TrpHis: 0.0 ± 0.0
0.712TrpIle: 0.712 ± 0.266
0.949TrpLys: 0.949 ± 0.58
0.475TrpLeu: 0.475 ± 0.29
0.237TrpMet: 0.237 ± 0.145
0.712TrpAsn: 0.712 ± 0.349
0.949TrpPro: 0.949 ± 0.514
0.475TrpGln: 0.475 ± 0.29
0.475TrpArg: 0.475 ± 0.583
1.187TrpSer: 1.187 ± 0.336
0.475TrpThr: 0.475 ± 0.29
0.712TrpVal: 0.712 ± 0.377
0.237TrpTrp: 0.237 ± 0.322
0.475TrpTyr: 0.475 ± 0.29
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.373TyrAla: 2.373 ± 1.45
1.424TyrCys: 1.424 ± 0.486
1.661TyrAsp: 1.661 ± 0.544
1.661TyrGlu: 1.661 ± 0.597
1.424TyrPhe: 1.424 ± 1.377
2.373TyrGly: 2.373 ± 0.615
0.237TyrHis: 0.237 ± 0.608
2.136TyrIle: 2.136 ± 0.451
1.187TyrLys: 1.187 ± 0.54
4.746TyrLeu: 4.746 ± 1.375
0.475TyrMet: 0.475 ± 0.271
2.136TyrAsn: 2.136 ± 0.568
1.187TyrPro: 1.187 ± 0.558
2.373TyrGln: 2.373 ± 0.538
2.136TyrArg: 2.136 ± 0.652
2.61TyrSer: 2.61 ± 0.708
2.373TyrThr: 2.373 ± 0.711
2.848TyrVal: 2.848 ± 0.811
0.237TyrTrp: 0.237 ± 0.145
1.898TyrTyr: 1.898 ± 0.548
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.237XaaSer: 0.237 ± 0.322
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4215 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski