Amino acid dipepetide frequency for Ngaingan hapavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.713AlaAla: 1.713 ± 0.764
0.952AlaCys: 0.952 ± 0.372
2.094AlaAsp: 2.094 ± 0.451
1.713AlaGlu: 1.713 ± 0.689
1.904AlaPhe: 1.904 ± 0.719
2.094AlaGly: 2.094 ± 0.441
1.142AlaHis: 1.142 ± 0.582
2.475AlaIle: 2.475 ± 0.441
3.617AlaLys: 3.617 ± 1.438
3.998AlaLeu: 3.998 ± 0.7
1.333AlaMet: 1.333 ± 0.566
2.094AlaAsn: 2.094 ± 0.755
0.952AlaPro: 0.952 ± 0.623
1.333AlaGln: 1.333 ± 0.347
1.333AlaArg: 1.333 ± 0.454
3.046AlaSer: 3.046 ± 0.946
2.665AlaThr: 2.665 ± 0.585
1.523AlaVal: 1.523 ± 0.644
1.142AlaTrp: 1.142 ± 0.311
1.523AlaTyr: 1.523 ± 0.638
0.0AlaXaa: 0.0 ± 0.0
Cys
0.19CysAla: 0.19 ± 0.254
0.0CysCys: 0.0 ± 0.0
0.571CysAsp: 0.571 ± 0.385
0.952CysGlu: 0.952 ± 0.378
1.142CysPhe: 1.142 ± 0.513
0.761CysGly: 0.761 ± 0.482
0.761CysHis: 0.761 ± 0.385
1.142CysIle: 1.142 ± 0.374
1.523CysLys: 1.523 ± 0.814
2.094CysLeu: 2.094 ± 0.432
0.19CysMet: 0.19 ± 0.12
0.952CysAsn: 0.952 ± 0.277
0.952CysPro: 0.952 ± 0.307
0.571CysGln: 0.571 ± 0.248
1.142CysArg: 1.142 ± 0.438
2.284CysSer: 2.284 ± 0.824
2.475CysThr: 2.475 ± 0.838
0.761CysVal: 0.761 ± 0.348
0.571CysTrp: 0.571 ± 0.348
1.333CysTyr: 1.333 ± 0.36
0.0CysXaa: 0.0 ± 0.0
Asp
1.904AspAla: 1.904 ± 1.229
1.142AspCys: 1.142 ± 0.244
4.188AspAsp: 4.188 ± 0.73
4.188AspGlu: 4.188 ± 0.691
2.856AspPhe: 2.856 ± 0.48
2.284AspGly: 2.284 ± 0.589
1.713AspHis: 1.713 ± 0.659
3.998AspIle: 3.998 ± 0.943
3.807AspLys: 3.807 ± 0.627
5.14AspLeu: 5.14 ± 0.921
1.904AspMet: 1.904 ± 0.419
2.475AspAsn: 2.475 ± 0.42
3.046AspPro: 3.046 ± 0.542
3.236AspGln: 3.236 ± 0.625
1.713AspArg: 1.713 ± 0.405
3.046AspSer: 3.046 ± 0.661
2.475AspThr: 2.475 ± 0.491
3.046AspVal: 3.046 ± 0.865
1.142AspTrp: 1.142 ± 0.283
2.475AspTyr: 2.475 ± 0.431
0.0AspXaa: 0.0 ± 0.0
Glu
1.523GluAla: 1.523 ± 0.374
1.142GluCys: 1.142 ± 0.399
3.617GluAsp: 3.617 ± 0.574
3.617GluGlu: 3.617 ± 0.705
1.713GluPhe: 1.713 ± 0.741
5.521GluGly: 5.521 ± 0.771
1.904GluHis: 1.904 ± 0.582
5.14GluIle: 5.14 ± 1.054
3.998GluLys: 3.998 ± 1.087
4.95GluLeu: 4.95 ± 1.024
0.761GluMet: 0.761 ± 0.563
2.284GluAsn: 2.284 ± 0.294
2.094GluPro: 2.094 ± 0.546
2.856GluGln: 2.856 ± 0.887
1.142GluArg: 1.142 ± 0.359
4.569GluSer: 4.569 ± 0.8
3.427GluThr: 3.427 ± 0.688
5.521GluVal: 5.521 ± 1.105
1.333GluTrp: 1.333 ± 0.512
2.284GluTyr: 2.284 ± 0.736
0.0GluXaa: 0.0 ± 0.0
Phe
1.713PheAla: 1.713 ± 0.588
1.523PheCys: 1.523 ± 0.836
1.713PheAsp: 1.713 ± 0.479
2.284PheGlu: 2.284 ± 0.583
2.665PhePhe: 2.665 ± 0.809
2.856PheGly: 2.856 ± 0.927
0.952PheHis: 0.952 ± 0.357
2.094PheIle: 2.094 ± 0.581
4.759PheLys: 4.759 ± 1.096
4.759PheLeu: 4.759 ± 1.011
0.381PheMet: 0.381 ± 0.193
1.904PheAsn: 1.904 ± 0.429
2.094PhePro: 2.094 ± 0.648
1.333PheGln: 1.333 ± 0.575
2.475PheArg: 2.475 ± 0.465
3.998PheSer: 3.998 ± 0.739
1.713PheThr: 1.713 ± 0.566
1.523PheVal: 1.523 ± 0.32
0.952PheTrp: 0.952 ± 0.317
1.523PheTyr: 1.523 ± 0.505
0.19PheXaa: 0.19 ± 0.21
Gly
1.142GlyAla: 1.142 ± 0.325
0.19GlyCys: 0.19 ± 0.21
3.617GlyAsp: 3.617 ± 0.943
3.998GlyGlu: 3.998 ± 1.016
1.523GlyPhe: 1.523 ± 0.591
2.856GlyGly: 2.856 ± 0.812
1.713GlyHis: 1.713 ± 0.302
4.569GlyIle: 4.569 ± 0.737
2.094GlyLys: 2.094 ± 0.463
7.995GlyLeu: 7.995 ± 1.377
1.904GlyMet: 1.904 ± 0.591
3.046GlyAsn: 3.046 ± 0.575
2.284GlyPro: 2.284 ± 0.486
2.665GlyGln: 2.665 ± 0.622
2.094GlyArg: 2.094 ± 0.38
4.569GlySer: 4.569 ± 1.088
2.856GlyThr: 2.856 ± 0.693
3.617GlyVal: 3.617 ± 0.943
1.333GlyTrp: 1.333 ± 0.495
1.142GlyTyr: 1.142 ± 0.376
0.0GlyXaa: 0.0 ± 0.0
His
1.333HisAla: 1.333 ± 0.566
0.571HisCys: 0.571 ± 0.309
1.713HisAsp: 1.713 ± 0.589
0.761HisGlu: 0.761 ± 0.388
1.904HisPhe: 1.904 ± 0.622
1.142HisGly: 1.142 ± 0.614
0.952HisHis: 0.952 ± 0.455
1.523HisIle: 1.523 ± 0.496
1.713HisLys: 1.713 ± 0.555
3.998HisLeu: 3.998 ± 0.959
0.571HisMet: 0.571 ± 0.331
1.523HisAsn: 1.523 ± 0.591
2.665HisPro: 2.665 ± 0.697
1.333HisGln: 1.333 ± 0.374
0.571HisArg: 0.571 ± 0.361
2.284HisSer: 2.284 ± 0.763
0.761HisThr: 0.761 ± 0.69
1.904HisVal: 1.904 ± 0.519
0.761HisTrp: 0.761 ± 0.242
1.713HisTyr: 1.713 ± 0.828
0.0HisXaa: 0.0 ± 0.0
Ile
3.617IleAla: 3.617 ± 1.072
1.904IleCys: 1.904 ± 0.818
4.378IleAsp: 4.378 ± 0.599
5.14IleGlu: 5.14 ± 1.063
2.665IlePhe: 2.665 ± 0.694
4.378IleGly: 4.378 ± 0.717
1.713IleHis: 1.713 ± 0.709
9.138IleIle: 9.138 ± 2.015
6.472IleLys: 6.472 ± 1.696
7.615IleLeu: 7.615 ± 0.804
2.094IleMet: 2.094 ± 0.842
3.998IleAsn: 3.998 ± 1.059
3.427IlePro: 3.427 ± 0.677
3.427IleGln: 3.427 ± 0.912
4.188IleArg: 4.188 ± 0.856
6.282IleSer: 6.282 ± 1.139
4.569IleThr: 4.569 ± 0.944
2.665IleVal: 2.665 ± 0.592
2.094IleTrp: 2.094 ± 0.487
3.998IleTyr: 3.998 ± 0.734
0.0IleXaa: 0.0 ± 0.0
Lys
2.665LysAla: 2.665 ± 0.995
1.523LysCys: 1.523 ± 0.403
4.759LysAsp: 4.759 ± 1.128
4.95LysGlu: 4.95 ± 0.937
1.904LysPhe: 1.904 ± 0.65
3.617LysGly: 3.617 ± 0.436
3.427LysHis: 3.427 ± 0.609
6.663LysIle: 6.663 ± 1.57
5.901LysLys: 5.901 ± 0.701
7.044LysLeu: 7.044 ± 1.202
1.142LysMet: 1.142 ± 0.676
3.807LysAsn: 3.807 ± 0.919
3.236LysPro: 3.236 ± 0.762
1.713LysGln: 1.713 ± 0.496
2.284LysArg: 2.284 ± 0.549
5.901LysSer: 5.901 ± 0.803
4.188LysThr: 4.188 ± 0.63
3.998LysVal: 3.998 ± 0.736
0.571LysTrp: 0.571 ± 0.248
2.284LysTyr: 2.284 ± 0.599
0.0LysXaa: 0.0 ± 0.0
Leu
4.569LeuAla: 4.569 ± 1.319
2.284LeuCys: 2.284 ± 0.808
5.521LeuAsp: 5.521 ± 0.598
5.521LeuGlu: 5.521 ± 0.716
3.236LeuPhe: 3.236 ± 1.01
3.998LeuGly: 3.998 ± 1.164
3.427LeuHis: 3.427 ± 0.696
10.089LeuIle: 10.089 ± 1.293
5.14LeuLys: 5.14 ± 0.816
8.376LeuLeu: 8.376 ± 1.143
2.475LeuMet: 2.475 ± 0.581
6.282LeuAsn: 6.282 ± 1.038
2.475LeuPro: 2.475 ± 0.768
3.236LeuGln: 3.236 ± 0.661
3.998LeuArg: 3.998 ± 1.009
7.615LeuSer: 7.615 ± 1.078
5.14LeuThr: 5.14 ± 0.712
4.569LeuVal: 4.569 ± 0.724
1.713LeuTrp: 1.713 ± 0.354
3.998LeuTyr: 3.998 ± 1.077
0.0LeuXaa: 0.0 ± 0.0
Met
1.523MetAla: 1.523 ± 0.704
0.381MetCys: 0.381 ± 0.241
1.142MetAsp: 1.142 ± 0.397
1.904MetGlu: 1.904 ± 0.532
1.904MetPhe: 1.904 ± 0.552
1.142MetGly: 1.142 ± 0.353
0.381MetHis: 0.381 ± 0.286
2.475MetIle: 2.475 ± 0.779
1.142MetLys: 1.142 ± 0.328
1.713MetLeu: 1.713 ± 0.632
0.761MetMet: 0.761 ± 0.333
1.333MetAsn: 1.333 ± 0.432
0.381MetPro: 0.381 ± 0.373
0.19MetGln: 0.19 ± 0.12
2.094MetArg: 2.094 ± 0.509
2.094MetSer: 2.094 ± 0.55
1.142MetThr: 1.142 ± 0.373
1.904MetVal: 1.904 ± 0.616
0.0MetTrp: 0.0 ± 0.0
0.761MetTyr: 0.761 ± 0.346
0.0MetXaa: 0.0 ± 0.0
Asn
1.713AsnAla: 1.713 ± 0.482
1.333AsnCys: 1.333 ± 0.413
3.046AsnAsp: 3.046 ± 0.543
2.475AsnGlu: 2.475 ± 0.492
2.475AsnPhe: 2.475 ± 0.703
2.094AsnGly: 2.094 ± 0.546
1.333AsnHis: 1.333 ± 0.5
4.569AsnIle: 4.569 ± 0.749
3.427AsnLys: 3.427 ± 0.625
4.378AsnLeu: 4.378 ± 0.739
2.094AsnMet: 2.094 ± 0.56
2.094AsnAsn: 2.094 ± 0.591
3.427AsnPro: 3.427 ± 0.631
2.094AsnGln: 2.094 ± 0.462
2.094AsnArg: 2.094 ± 0.514
4.569AsnSer: 4.569 ± 0.668
2.856AsnThr: 2.856 ± 0.626
1.904AsnVal: 1.904 ± 0.677
1.523AsnTrp: 1.523 ± 0.51
1.523AsnTyr: 1.523 ± 0.553
0.0AsnXaa: 0.0 ± 0.0
Pro
1.713ProAla: 1.713 ± 0.663
0.19ProCys: 0.19 ± 0.254
2.856ProAsp: 2.856 ± 0.51
2.475ProGlu: 2.475 ± 0.966
1.904ProPhe: 1.904 ± 0.784
2.284ProGly: 2.284 ± 0.527
0.952ProHis: 0.952 ± 0.697
3.807ProIle: 3.807 ± 0.697
2.284ProLys: 2.284 ± 0.675
3.617ProLeu: 3.617 ± 0.946
1.142ProMet: 1.142 ± 0.551
2.856ProAsn: 2.856 ± 0.662
2.475ProPro: 2.475 ± 0.706
1.713ProGln: 1.713 ± 0.468
1.713ProArg: 1.713 ± 0.52
3.807ProSer: 3.807 ± 0.628
3.236ProThr: 3.236 ± 0.646
2.284ProVal: 2.284 ± 1.406
0.19ProTrp: 0.19 ± 0.12
2.475ProTyr: 2.475 ± 0.845
0.0ProXaa: 0.0 ± 0.0
Gln
0.761GlnAla: 0.761 ± 0.496
1.142GlnCys: 1.142 ± 0.338
2.094GlnAsp: 2.094 ± 0.513
1.713GlnGlu: 1.713 ± 0.567
2.856GlnPhe: 2.856 ± 0.577
1.523GlnGly: 1.523 ± 0.336
0.952GlnHis: 0.952 ± 0.406
2.284GlnIle: 2.284 ± 0.628
1.904GlnLys: 1.904 ± 0.579
2.856GlnLeu: 2.856 ± 0.599
0.381GlnMet: 0.381 ± 0.263
1.713GlnAsn: 1.713 ± 0.381
0.952GlnPro: 0.952 ± 0.616
0.571GlnGln: 0.571 ± 0.281
1.713GlnArg: 1.713 ± 0.617
3.427GlnSer: 3.427 ± 0.702
2.475GlnThr: 2.475 ± 0.797
1.904GlnVal: 1.904 ± 0.536
0.0GlnTrp: 0.0 ± 0.0
1.333GlnTyr: 1.333 ± 0.511
0.0GlnXaa: 0.0 ± 0.0
Arg
1.142ArgAla: 1.142 ± 0.403
0.381ArgCys: 0.381 ± 0.331
1.142ArgAsp: 1.142 ± 0.419
3.236ArgGlu: 3.236 ± 0.908
1.904ArgPhe: 1.904 ± 0.585
2.475ArgGly: 2.475 ± 0.567
0.952ArgHis: 0.952 ± 0.314
2.284ArgIle: 2.284 ± 0.781
3.998ArgLys: 3.998 ± 1.064
3.236ArgLeu: 3.236 ± 0.914
1.142ArgMet: 1.142 ± 0.465
1.333ArgAsn: 1.333 ± 0.401
1.713ArgPro: 1.713 ± 0.459
0.952ArgGln: 0.952 ± 0.43
1.904ArgArg: 1.904 ± 0.552
3.998ArgSer: 3.998 ± 0.769
1.904ArgThr: 1.904 ± 0.59
2.665ArgVal: 2.665 ± 0.575
1.142ArgTrp: 1.142 ± 0.385
2.094ArgTyr: 2.094 ± 1.097
0.0ArgXaa: 0.0 ± 0.0
Ser
3.807SerAla: 3.807 ± 0.969
1.713SerCys: 1.713 ± 0.589
4.569SerAsp: 4.569 ± 0.855
5.521SerGlu: 5.521 ± 0.699
4.188SerPhe: 4.188 ± 0.803
3.998SerGly: 3.998 ± 0.951
3.236SerHis: 3.236 ± 0.526
7.044SerIle: 7.044 ± 1.391
5.14SerLys: 5.14 ± 0.698
8.947SerLeu: 8.947 ± 1.062
1.142SerMet: 1.142 ± 0.586
4.378SerAsn: 4.378 ± 0.751
3.427SerPro: 3.427 ± 1.078
1.523SerGln: 1.523 ± 0.808
3.046SerArg: 3.046 ± 0.539
6.853SerSer: 6.853 ± 1.547
5.33SerThr: 5.33 ± 0.914
3.427SerVal: 3.427 ± 0.927
2.094SerTrp: 2.094 ± 0.951
4.569SerTyr: 4.569 ± 0.862
0.0SerXaa: 0.0 ± 0.0
Thr
1.523ThrAla: 1.523 ± 0.518
0.761ThrCys: 0.761 ± 0.331
2.856ThrAsp: 2.856 ± 0.689
3.236ThrGlu: 3.236 ± 0.728
2.094ThrPhe: 2.094 ± 0.672
3.807ThrGly: 3.807 ± 0.881
1.333ThrHis: 1.333 ± 0.582
4.188ThrIle: 4.188 ± 0.758
4.569ThrLys: 4.569 ± 1.32
5.521ThrLeu: 5.521 ± 0.997
1.333ThrMet: 1.333 ± 0.435
2.475ThrAsn: 2.475 ± 0.717
3.236ThrPro: 3.236 ± 1.013
1.904ThrGln: 1.904 ± 0.546
2.094ThrArg: 2.094 ± 0.827
5.711ThrSer: 5.711 ± 1.006
4.378ThrThr: 4.378 ± 1.019
2.665ThrVal: 2.665 ± 0.794
1.333ThrTrp: 1.333 ± 0.409
3.236ThrTyr: 3.236 ± 1.04
0.0ThrXaa: 0.0 ± 0.0
Val
3.046ValAla: 3.046 ± 1.202
1.523ValCys: 1.523 ± 0.356
2.665ValAsp: 2.665 ± 0.689
2.284ValGlu: 2.284 ± 0.422
2.284ValPhe: 2.284 ± 0.434
4.378ValGly: 4.378 ± 1.179
1.142ValHis: 1.142 ± 0.447
4.95ValIle: 4.95 ± 1.007
3.236ValLys: 3.236 ± 0.903
2.665ValLeu: 2.665 ± 0.646
1.333ValMet: 1.333 ± 0.384
2.665ValAsn: 2.665 ± 0.847
2.475ValPro: 2.475 ± 0.556
0.761ValGln: 0.761 ± 0.349
1.904ValArg: 1.904 ± 0.47
4.569ValSer: 4.569 ± 0.866
3.046ValThr: 3.046 ± 0.965
3.046ValVal: 3.046 ± 0.582
0.19ValTrp: 0.19 ± 0.12
3.236ValTyr: 3.236 ± 0.664
0.0ValXaa: 0.0 ± 0.0
Trp
1.333TrpAla: 1.333 ± 0.503
0.19TrpCys: 0.19 ± 0.235
0.761TrpAsp: 0.761 ± 0.342
1.713TrpGlu: 1.713 ± 0.386
0.571TrpPhe: 0.571 ± 0.311
1.713TrpGly: 1.713 ± 0.568
0.19TrpHis: 0.19 ± 0.12
2.475TrpIle: 2.475 ± 0.383
1.523TrpLys: 1.523 ± 0.484
0.952TrpLeu: 0.952 ± 0.395
0.761TrpMet: 0.761 ± 0.281
1.713TrpAsn: 1.713 ± 0.541
0.571TrpPro: 0.571 ± 0.271
0.19TrpGln: 0.19 ± 0.12
0.381TrpArg: 0.381 ± 0.172
1.523TrpSer: 1.523 ± 0.857
1.142TrpThr: 1.142 ± 0.62
0.952TrpVal: 0.952 ± 0.445
0.571TrpTrp: 0.571 ± 0.424
0.381TrpTyr: 0.381 ± 0.254
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.713TyrAla: 1.713 ± 0.633
1.523TyrCys: 1.523 ± 0.544
2.475TyrAsp: 2.475 ± 0.69
1.713TyrGlu: 1.713 ± 0.42
1.713TyrPhe: 1.713 ± 0.341
2.284TyrGly: 2.284 ± 0.581
1.523TyrHis: 1.523 ± 0.369
2.856TyrIle: 2.856 ± 0.571
5.33TyrLys: 5.33 ± 1.356
3.807TyrLeu: 3.807 ± 0.613
1.142TyrMet: 1.142 ± 0.446
2.094TyrAsn: 2.094 ± 0.436
2.094TyrPro: 2.094 ± 0.981
0.952TyrGln: 0.952 ± 0.441
1.713TyrArg: 1.713 ± 0.396
3.998TyrSer: 3.998 ± 0.887
2.475TyrThr: 2.475 ± 0.681
1.523TyrVal: 1.523 ± 0.436
0.952TyrTrp: 0.952 ± 0.618
1.713TyrTyr: 1.713 ± 0.618
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.19XaaMet: 0.19 ± 0.21
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (5254 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski