Amino acid dipepetide frequency for Wenling frogfish filovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.332AlaAla: 8.332 ± 1.126
1.704AlaCys: 1.704 ± 0.819
5.681AlaAsp: 5.681 ± 0.794
5.302AlaGlu: 5.302 ± 1.423
2.272AlaPhe: 2.272 ± 0.53
4.734AlaGly: 4.734 ± 0.958
3.408AlaHis: 3.408 ± 0.739
4.166AlaIle: 4.166 ± 0.52
3.787AlaLys: 3.787 ± 0.889
7.574AlaLeu: 7.574 ± 1.808
1.704AlaMet: 1.704 ± 0.492
1.704AlaAsn: 1.704 ± 0.511
2.84AlaPro: 2.84 ± 0.923
2.462AlaGln: 2.462 ± 0.538
4.734AlaArg: 4.734 ± 0.806
5.113AlaSer: 5.113 ± 0.94
6.249AlaThr: 6.249 ± 0.59
3.787AlaVal: 3.787 ± 0.934
0.947AlaTrp: 0.947 ± 0.378
1.704AlaTyr: 1.704 ± 0.492
0.0AlaXaa: 0.0 ± 0.0
Cys
1.136CysAla: 1.136 ± 0.405
0.189CysCys: 0.189 ± 0.113
0.189CysAsp: 0.189 ± 0.113
0.947CysGlu: 0.947 ± 0.437
1.136CysPhe: 1.136 ± 0.446
0.757CysGly: 0.757 ± 0.259
0.757CysHis: 0.757 ± 0.315
0.757CysIle: 0.757 ± 0.265
0.568CysLys: 0.568 ± 0.368
1.704CysLeu: 1.704 ± 0.424
0.947CysMet: 0.947 ± 0.329
0.757CysAsn: 0.757 ± 0.301
0.757CysPro: 0.757 ± 0.694
0.947CysGln: 0.947 ± 0.453
2.083CysArg: 2.083 ± 0.446
2.083CysSer: 2.083 ± 0.579
0.947CysThr: 0.947 ± 0.34
1.894CysVal: 1.894 ± 0.493
0.568CysTrp: 0.568 ± 0.338
0.947CysTyr: 0.947 ± 0.561
0.0CysXaa: 0.0 ± 0.0
Asp
2.272AspAla: 2.272 ± 0.689
0.568AspCys: 0.568 ± 0.368
3.408AspAsp: 3.408 ± 0.964
3.219AspGlu: 3.219 ± 1.022
1.136AspPhe: 1.136 ± 0.469
2.462AspGly: 2.462 ± 0.643
1.704AspHis: 1.704 ± 0.474
3.03AspIle: 3.03 ± 0.571
2.462AspLys: 2.462 ± 0.603
7.574AspLeu: 7.574 ± 0.965
1.894AspMet: 1.894 ± 0.677
1.515AspAsn: 1.515 ± 0.518
2.651AspPro: 2.651 ± 0.514
1.894AspGln: 1.894 ± 0.316
3.977AspArg: 3.977 ± 0.648
2.651AspSer: 2.651 ± 0.586
3.408AspThr: 3.408 ± 0.793
3.787AspVal: 3.787 ± 0.55
0.568AspTrp: 0.568 ± 0.315
1.515AspTyr: 1.515 ± 0.65
0.0AspXaa: 0.0 ± 0.0
Glu
3.977GluAla: 3.977 ± 0.609
1.136GluCys: 1.136 ± 0.796
3.787GluAsp: 3.787 ± 0.852
3.977GluGlu: 3.977 ± 1.047
1.326GluPhe: 1.326 ± 0.546
5.113GluGly: 5.113 ± 1.376
1.136GluHis: 1.136 ± 0.423
1.704GluIle: 1.704 ± 0.348
2.651GluLys: 2.651 ± 0.581
6.628GluLeu: 6.628 ± 1.223
0.757GluMet: 0.757 ± 0.352
1.894GluAsn: 1.894 ± 0.477
1.894GluPro: 1.894 ± 0.59
1.326GluGln: 1.326 ± 0.885
3.03GluArg: 3.03 ± 0.626
3.408GluSer: 3.408 ± 1.108
4.923GluThr: 4.923 ± 1.117
3.598GluVal: 3.598 ± 0.988
0.568GluTrp: 0.568 ± 0.273
2.84GluTyr: 2.84 ± 0.652
0.0GluXaa: 0.0 ± 0.0
Phe
2.272PheAla: 2.272 ± 0.525
0.379PheCys: 0.379 ± 0.226
2.462PheAsp: 2.462 ± 0.784
1.326PheGlu: 1.326 ± 0.382
1.515PhePhe: 1.515 ± 0.492
2.083PheGly: 2.083 ± 0.485
1.136PheHis: 1.136 ± 0.389
2.083PheIle: 2.083 ± 0.559
0.757PheLys: 0.757 ± 0.274
3.408PheLeu: 3.408 ± 1.026
0.947PheMet: 0.947 ± 0.348
1.704PheAsn: 1.704 ± 0.425
3.03PhePro: 3.03 ± 0.829
1.515PheGln: 1.515 ± 0.544
1.136PheArg: 1.136 ± 0.854
3.787PheSer: 3.787 ± 0.709
2.272PheThr: 2.272 ± 0.479
2.84PheVal: 2.84 ± 0.868
0.379PheTrp: 0.379 ± 0.411
0.947PheTyr: 0.947 ± 0.293
0.0PheXaa: 0.0 ± 0.0
Gly
3.408GlyAla: 3.408 ± 0.456
1.515GlyCys: 1.515 ± 0.347
3.408GlyAsp: 3.408 ± 0.86
2.462GlyGlu: 2.462 ± 0.771
3.598GlyPhe: 3.598 ± 0.668
3.977GlyGly: 3.977 ± 0.767
1.136GlyHis: 1.136 ± 0.549
2.651GlyIle: 2.651 ± 0.681
3.219GlyLys: 3.219 ± 0.914
6.628GlyLeu: 6.628 ± 1.104
0.757GlyMet: 0.757 ± 0.287
2.272GlyAsn: 2.272 ± 0.896
5.302GlyPro: 5.302 ± 1.801
2.651GlyGln: 2.651 ± 0.78
3.787GlyArg: 3.787 ± 0.883
5.491GlySer: 5.491 ± 0.901
4.545GlyThr: 4.545 ± 1.381
3.598GlyVal: 3.598 ± 0.678
0.757GlyTrp: 0.757 ± 0.329
1.515GlyTyr: 1.515 ± 0.543
0.0GlyXaa: 0.0 ± 0.0
His
2.651HisAla: 2.651 ± 1.012
0.189HisCys: 0.189 ± 0.113
1.704HisAsp: 1.704 ± 0.644
1.136HisGlu: 1.136 ± 0.499
1.894HisPhe: 1.894 ± 0.535
2.272HisGly: 2.272 ± 0.4
2.083HisHis: 2.083 ± 0.656
1.326HisIle: 1.326 ± 0.494
1.326HisLys: 1.326 ± 0.558
3.598HisLeu: 3.598 ± 0.58
0.189HisMet: 0.189 ± 0.113
0.757HisAsn: 0.757 ± 0.509
2.083HisPro: 2.083 ± 0.734
0.947HisGln: 0.947 ± 0.395
1.704HisArg: 1.704 ± 0.413
1.704HisSer: 1.704 ± 0.566
1.326HisThr: 1.326 ± 0.478
0.947HisVal: 0.947 ± 0.437
0.568HisTrp: 0.568 ± 0.261
0.568HisTyr: 0.568 ± 0.227
0.0HisXaa: 0.0 ± 0.0
Ile
4.734IleAla: 4.734 ± 0.792
1.515IleCys: 1.515 ± 0.531
2.462IleAsp: 2.462 ± 0.599
2.651IleGlu: 2.651 ± 0.55
1.136IlePhe: 1.136 ± 0.393
2.083IleGly: 2.083 ± 0.803
0.947IleHis: 0.947 ± 0.305
2.84IleIle: 2.84 ± 0.643
2.84IleLys: 2.84 ± 0.537
4.545IleLeu: 4.545 ± 0.747
1.894IleMet: 1.894 ± 1.082
2.462IleAsn: 2.462 ± 0.761
1.704IlePro: 1.704 ± 0.425
1.136IleGln: 1.136 ± 0.497
2.272IleArg: 2.272 ± 0.439
4.923IleSer: 4.923 ± 0.807
5.113IleThr: 5.113 ± 0.867
2.272IleVal: 2.272 ± 0.967
0.568IleTrp: 0.568 ± 0.273
1.704IleTyr: 1.704 ± 0.483
0.0IleXaa: 0.0 ± 0.0
Lys
3.598LysAla: 3.598 ± 1.145
0.757LysCys: 0.757 ± 0.308
3.03LysAsp: 3.03 ± 0.92
3.787LysGlu: 3.787 ± 1.067
0.947LysPhe: 0.947 ± 0.411
2.462LysGly: 2.462 ± 0.577
1.515LysHis: 1.515 ± 0.597
2.651LysIle: 2.651 ± 0.774
2.462LysLys: 2.462 ± 0.722
4.734LysLeu: 4.734 ± 0.897
1.704LysMet: 1.704 ± 0.507
1.515LysAsn: 1.515 ± 0.677
1.894LysPro: 1.894 ± 0.33
0.947LysGln: 0.947 ± 0.367
2.84LysArg: 2.84 ± 0.592
2.84LysSer: 2.84 ± 0.545
3.598LysThr: 3.598 ± 0.748
2.651LysVal: 2.651 ± 0.457
0.757LysTrp: 0.757 ± 0.451
1.515LysTyr: 1.515 ± 0.586
0.0LysXaa: 0.0 ± 0.0
Leu
7.385LeuAla: 7.385 ± 1.224
2.272LeuCys: 2.272 ± 0.607
4.166LeuAsp: 4.166 ± 0.681
3.219LeuGlu: 3.219 ± 1.007
3.219LeuPhe: 3.219 ± 0.75
9.089LeuGly: 9.089 ± 1.536
1.894LeuHis: 1.894 ± 0.686
6.059LeuIle: 6.059 ± 1.241
5.681LeuLys: 5.681 ± 0.778
10.036LeuLeu: 10.036 ± 1.614
2.84LeuMet: 2.84 ± 0.619
3.787LeuAsn: 3.787 ± 0.478
5.491LeuPro: 5.491 ± 0.783
4.166LeuGln: 4.166 ± 0.611
6.059LeuArg: 6.059 ± 1.373
8.142LeuSer: 8.142 ± 0.865
7.764LeuThr: 7.764 ± 0.979
6.249LeuVal: 6.249 ± 1.192
0.379LeuTrp: 0.379 ± 0.229
3.787LeuTyr: 3.787 ± 0.718
0.0LeuXaa: 0.0 ± 0.0
Met
2.083MetAla: 2.083 ± 1.097
0.379MetCys: 0.379 ± 0.226
2.462MetAsp: 2.462 ± 0.788
1.704MetGlu: 1.704 ± 0.787
1.515MetPhe: 1.515 ± 0.42
1.136MetGly: 1.136 ± 0.627
0.757MetHis: 0.757 ± 0.451
2.083MetIle: 2.083 ± 0.366
1.136MetLys: 1.136 ± 0.52
1.515MetLeu: 1.515 ± 0.469
0.947MetMet: 0.947 ± 0.358
0.568MetAsn: 0.568 ± 0.34
0.947MetPro: 0.947 ± 0.625
0.757MetGln: 0.757 ± 0.378
1.515MetArg: 1.515 ± 0.692
2.462MetSer: 2.462 ± 0.738
2.272MetThr: 2.272 ± 0.681
1.704MetVal: 1.704 ± 0.484
0.189MetTrp: 0.189 ± 0.205
0.568MetTyr: 0.568 ± 0.282
0.0MetXaa: 0.0 ± 0.0
Asn
1.894AsnAla: 1.894 ± 1.235
0.189AsnCys: 0.189 ± 0.178
0.757AsnAsp: 0.757 ± 0.311
1.894AsnGlu: 1.894 ± 0.926
1.136AsnPhe: 1.136 ± 0.376
1.894AsnGly: 1.894 ± 0.483
1.326AsnHis: 1.326 ± 0.512
2.272AsnIle: 2.272 ± 0.806
2.272AsnLys: 2.272 ± 0.778
5.113AsnLeu: 5.113 ± 1.069
0.947AsnMet: 0.947 ± 0.45
1.136AsnAsn: 1.136 ± 0.253
2.272AsnPro: 2.272 ± 0.574
1.704AsnGln: 1.704 ± 0.401
1.515AsnArg: 1.515 ± 0.453
2.083AsnSer: 2.083 ± 0.778
2.083AsnThr: 2.083 ± 0.904
2.083AsnVal: 2.083 ± 0.719
0.379AsnTrp: 0.379 ± 0.229
0.568AsnTyr: 0.568 ± 0.319
0.0AsnXaa: 0.0 ± 0.0
Pro
3.408ProAla: 3.408 ± 1.241
1.136ProCys: 1.136 ± 0.372
3.977ProAsp: 3.977 ± 0.721
4.355ProGlu: 4.355 ± 1.243
2.083ProPhe: 2.083 ± 0.48
2.651ProGly: 2.651 ± 0.751
1.704ProHis: 1.704 ± 0.607
2.651ProIle: 2.651 ± 0.713
2.272ProLys: 2.272 ± 1.001
3.787ProLeu: 3.787 ± 0.978
0.947ProMet: 0.947 ± 0.472
1.894ProAsn: 1.894 ± 0.394
6.628ProPro: 6.628 ± 1.854
2.651ProGln: 2.651 ± 0.741
2.651ProArg: 2.651 ± 0.822
4.355ProSer: 4.355 ± 1.788
3.408ProThr: 3.408 ± 0.818
4.545ProVal: 4.545 ± 1.33
0.379ProTrp: 0.379 ± 0.311
1.704ProTyr: 1.704 ± 0.438
0.0ProXaa: 0.0 ± 0.0
Gln
3.598GlnAla: 3.598 ± 1.035
0.568GlnCys: 0.568 ± 0.273
1.326GlnAsp: 1.326 ± 0.362
1.894GlnGlu: 1.894 ± 0.701
0.947GlnPhe: 0.947 ± 0.352
3.408GlnGly: 3.408 ± 0.766
1.515GlnHis: 1.515 ± 0.51
1.894GlnIle: 1.894 ± 0.711
1.894GlnLys: 1.894 ± 0.526
3.408GlnLeu: 3.408 ± 0.984
1.704GlnMet: 1.704 ± 0.732
0.947GlnAsn: 0.947 ± 0.284
1.515GlnPro: 1.515 ± 0.489
2.651GlnGln: 2.651 ± 0.425
1.515GlnArg: 1.515 ± 0.842
2.651GlnSer: 2.651 ± 0.651
3.408GlnThr: 3.408 ± 0.872
3.787GlnVal: 3.787 ± 0.96
0.379GlnTrp: 0.379 ± 0.232
1.326GlnTyr: 1.326 ± 0.476
0.0GlnXaa: 0.0 ± 0.0
Arg
4.923ArgAla: 4.923 ± 1.032
1.326ArgCys: 1.326 ± 0.438
3.03ArgAsp: 3.03 ± 0.782
3.408ArgGlu: 3.408 ± 0.84
2.651ArgPhe: 2.651 ± 0.711
3.219ArgGly: 3.219 ± 1.254
1.894ArgHis: 1.894 ± 0.411
1.704ArgIle: 1.704 ± 0.36
1.894ArgLys: 1.894 ± 0.335
6.438ArgLeu: 6.438 ± 0.919
1.326ArgMet: 1.326 ± 0.305
2.462ArgAsn: 2.462 ± 0.673
4.355ArgPro: 4.355 ± 1.438
2.651ArgGln: 2.651 ± 0.692
3.977ArgArg: 3.977 ± 0.944
4.923ArgSer: 4.923 ± 1.018
3.787ArgThr: 3.787 ± 0.794
3.408ArgVal: 3.408 ± 0.638
0.947ArgTrp: 0.947 ± 0.355
1.894ArgTyr: 1.894 ± 0.377
0.0ArgXaa: 0.0 ± 0.0
Ser
5.87SerAla: 5.87 ± 0.796
1.894SerCys: 1.894 ± 0.539
2.84SerAsp: 2.84 ± 0.544
3.219SerGlu: 3.219 ± 1.371
2.651SerPhe: 2.651 ± 0.685
5.302SerGly: 5.302 ± 0.737
1.894SerHis: 1.894 ± 0.624
3.598SerIle: 3.598 ± 0.758
3.03SerLys: 3.03 ± 0.708
6.249SerLeu: 6.249 ± 0.774
1.704SerMet: 1.704 ± 0.473
2.651SerAsn: 2.651 ± 0.801
3.977SerPro: 3.977 ± 1.565
5.302SerGln: 5.302 ± 0.939
6.817SerArg: 6.817 ± 1.138
9.279SerSer: 9.279 ± 1.1
4.166SerThr: 4.166 ± 0.705
3.787SerVal: 3.787 ± 0.644
1.136SerTrp: 1.136 ± 0.277
2.272SerTyr: 2.272 ± 0.536
0.0SerXaa: 0.0 ± 0.0
Thr
6.628ThrAla: 6.628 ± 0.944
2.083ThrCys: 2.083 ± 0.538
2.462ThrAsp: 2.462 ± 0.646
5.302ThrGlu: 5.302 ± 1.034
1.894ThrPhe: 1.894 ± 0.541
2.651ThrGly: 2.651 ± 0.72
1.515ThrHis: 1.515 ± 0.545
4.166ThrIle: 4.166 ± 1.166
3.598ThrLys: 3.598 ± 0.848
7.953ThrLeu: 7.953 ± 1.172
2.272ThrMet: 2.272 ± 0.574
2.083ThrAsn: 2.083 ± 0.587
5.681ThrPro: 5.681 ± 0.81
1.704ThrGln: 1.704 ± 0.69
4.923ThrArg: 4.923 ± 1.29
4.734ThrSer: 4.734 ± 1.767
4.545ThrThr: 4.545 ± 1.601
3.598ThrVal: 3.598 ± 0.58
0.947ThrTrp: 0.947 ± 0.35
1.894ThrTyr: 1.894 ± 0.471
0.0ThrXaa: 0.0 ± 0.0
Val
6.249ValAla: 6.249 ± 1.244
0.947ValCys: 0.947 ± 0.398
1.704ValAsp: 1.704 ± 0.649
3.787ValGlu: 3.787 ± 0.956
3.219ValPhe: 3.219 ± 0.865
5.113ValGly: 5.113 ± 1.187
1.894ValHis: 1.894 ± 0.579
2.462ValIle: 2.462 ± 1.009
2.651ValLys: 2.651 ± 0.383
6.628ValLeu: 6.628 ± 0.971
1.515ValMet: 1.515 ± 0.643
1.704ValAsn: 1.704 ± 0.85
2.462ValPro: 2.462 ± 0.693
2.84ValGln: 2.84 ± 0.641
3.408ValArg: 3.408 ± 0.661
4.355ValSer: 4.355 ± 0.517
3.598ValThr: 3.598 ± 0.697
4.545ValVal: 4.545 ± 0.861
0.568ValTrp: 0.568 ± 0.305
1.704ValTyr: 1.704 ± 0.423
0.0ValXaa: 0.0 ± 0.0
Trp
2.083TrpAla: 2.083 ± 0.56
0.189TrpCys: 0.189 ± 0.113
0.757TrpAsp: 0.757 ± 0.337
0.189TrpGlu: 0.189 ± 0.277
0.379TrpPhe: 0.379 ± 0.226
0.568TrpGly: 0.568 ± 0.297
0.189TrpHis: 0.189 ± 0.178
0.379TrpIle: 0.379 ± 0.229
0.568TrpLys: 0.568 ± 0.261
1.326TrpLeu: 1.326 ± 0.382
0.189TrpMet: 0.189 ± 0.113
0.379TrpAsn: 0.379 ± 0.311
0.189TrpPro: 0.189 ± 0.113
0.379TrpGln: 0.379 ± 0.371
0.0TrpArg: 0.0 ± 0.0
1.136TrpSer: 1.136 ± 0.381
1.326TrpThr: 1.326 ± 0.339
0.757TrpVal: 0.757 ± 0.451
0.0TrpTrp: 0.0 ± 0.0
0.568TrpTyr: 0.568 ± 0.319
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.704TyrAla: 1.704 ± 0.463
1.136TyrCys: 1.136 ± 0.253
2.083TyrAsp: 2.083 ± 0.542
1.894TyrGlu: 1.894 ± 0.522
1.326TyrPhe: 1.326 ± 0.525
1.894TyrGly: 1.894 ± 0.921
0.568TyrHis: 0.568 ± 0.273
1.326TyrIle: 1.326 ± 0.229
1.136TyrLys: 1.136 ± 0.319
2.84TyrLeu: 2.84 ± 0.63
1.326TyrMet: 1.326 ± 0.453
1.326TyrAsn: 1.326 ± 0.449
1.515TyrPro: 1.515 ± 0.713
1.704TyrGln: 1.704 ± 0.349
2.272TyrArg: 2.272 ± 0.452
1.515TyrSer: 1.515 ± 0.669
1.894TyrThr: 1.894 ± 0.607
1.515TyrVal: 1.515 ± 0.597
0.568TyrTrp: 0.568 ± 0.315
0.379TyrTyr: 0.379 ± 0.266
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (5282 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski