Amino acid dipepetide frequency for Wenling thamnaconus septentrionalis filovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.286AlaAla: 5.286 ± 2.349
1.609AlaCys: 1.609 ± 0.398
2.298AlaAsp: 2.298 ± 0.625
2.298AlaGlu: 2.298 ± 0.71
2.068AlaPhe: 2.068 ± 0.532
5.286AlaGly: 5.286 ± 0.886
1.379AlaHis: 1.379 ± 0.364
3.677AlaIle: 3.677 ± 0.702
2.298AlaLys: 2.298 ± 0.807
7.125AlaLeu: 7.125 ± 0.894
1.149AlaMet: 1.149 ± 0.509
2.068AlaAsn: 2.068 ± 0.62
1.839AlaPro: 1.839 ± 0.679
2.758AlaGln: 2.758 ± 0.809
2.298AlaArg: 2.298 ± 1.159
5.286AlaSer: 5.286 ± 1.281
3.677AlaThr: 3.677 ± 1.021
4.597AlaVal: 4.597 ± 0.903
1.839AlaTrp: 1.839 ± 0.773
1.609AlaTyr: 1.609 ± 0.525
0.0AlaXaa: 0.0 ± 0.0
Cys
1.609CysAla: 1.609 ± 0.617
0.689CysCys: 0.689 ± 0.432
1.839CysAsp: 1.839 ± 0.636
1.379CysGlu: 1.379 ± 0.416
1.149CysPhe: 1.149 ± 0.329
1.839CysGly: 1.839 ± 0.386
0.46CysHis: 0.46 ± 0.288
0.919CysIle: 0.919 ± 0.357
0.689CysLys: 0.689 ± 0.31
1.609CysLeu: 1.609 ± 0.332
0.46CysMet: 0.46 ± 0.454
0.23CysAsn: 0.23 ± 0.338
0.46CysPro: 0.46 ± 0.416
0.689CysGln: 0.689 ± 0.265
1.379CysArg: 1.379 ± 0.57
0.689CysSer: 0.689 ± 0.295
1.609CysThr: 1.609 ± 0.371
2.298CysVal: 2.298 ± 1.005
0.0CysTrp: 0.0 ± 0.0
0.23CysTyr: 0.23 ± 0.331
0.0CysXaa: 0.0 ± 0.0
Asp
2.528AspAla: 2.528 ± 0.763
0.919AspCys: 0.919 ± 0.433
3.677AspAsp: 3.677 ± 1.185
2.528AspGlu: 2.528 ± 1.33
1.149AspPhe: 1.149 ± 0.293
2.758AspGly: 2.758 ± 0.757
0.919AspHis: 0.919 ± 0.415
2.298AspIle: 2.298 ± 0.63
2.068AspLys: 2.068 ± 0.798
9.193AspLeu: 9.193 ± 1.222
2.528AspMet: 2.528 ± 0.841
2.068AspAsn: 2.068 ± 0.42
5.286AspPro: 5.286 ± 1.166
1.839AspGln: 1.839 ± 1.098
2.068AspArg: 2.068 ± 0.808
4.137AspSer: 4.137 ± 0.75
1.839AspThr: 1.839 ± 0.481
2.298AspVal: 2.298 ± 0.599
1.149AspTrp: 1.149 ± 0.492
2.068AspTyr: 2.068 ± 0.614
0.0AspXaa: 0.0 ± 0.0
Glu
2.758GluAla: 2.758 ± 0.951
1.609GluCys: 1.609 ± 0.668
3.218GluAsp: 3.218 ± 0.619
4.367GluGlu: 4.367 ± 1.446
1.379GluPhe: 1.379 ± 0.624
5.286GluGly: 5.286 ± 0.866
1.609GluHis: 1.609 ± 0.909
2.298GluIle: 2.298 ± 0.738
4.137GluLys: 4.137 ± 0.975
7.814GluLeu: 7.814 ± 1.643
2.528GluMet: 2.528 ± 0.966
1.149GluAsn: 1.149 ± 0.614
2.988GluPro: 2.988 ± 1.148
1.839GluGln: 1.839 ± 0.537
3.447GluArg: 3.447 ± 0.951
3.907GluSer: 3.907 ± 0.999
3.677GluThr: 3.677 ± 1.46
2.298GluVal: 2.298 ± 0.311
0.919GluTrp: 0.919 ± 0.33
0.46GluTyr: 0.46 ± 0.215
0.0GluXaa: 0.0 ± 0.0
Phe
1.149PheAla: 1.149 ± 0.657
1.379PheCys: 1.379 ± 0.509
1.609PheAsp: 1.609 ± 0.528
1.609PheGlu: 1.609 ± 0.709
1.609PhePhe: 1.609 ± 0.668
2.528PheGly: 2.528 ± 0.53
1.839PheHis: 1.839 ± 0.663
0.919PheIle: 0.919 ± 0.693
1.379PheLys: 1.379 ± 0.463
4.367PheLeu: 4.367 ± 1.133
1.149PheMet: 1.149 ± 0.785
1.609PheAsn: 1.609 ± 0.653
1.839PhePro: 1.839 ± 0.34
1.149PheGln: 1.149 ± 0.719
1.839PheArg: 1.839 ± 0.9
3.677PheSer: 3.677 ± 0.913
3.677PheThr: 3.677 ± 1.782
2.988PheVal: 2.988 ± 0.829
0.0PheTrp: 0.0 ± 0.0
0.46PheTyr: 0.46 ± 0.288
0.0PheXaa: 0.0 ± 0.0
Gly
4.597GlyAla: 4.597 ± 1.58
1.609GlyCys: 1.609 ± 0.463
3.447GlyAsp: 3.447 ± 0.792
4.826GlyGlu: 4.826 ± 1.538
3.447GlyPhe: 3.447 ± 0.676
7.814GlyGly: 7.814 ± 1.272
2.988GlyHis: 2.988 ± 0.81
4.597GlyIle: 4.597 ± 1.029
2.298GlyLys: 2.298 ± 0.625
8.504GlyLeu: 8.504 ± 1.452
1.609GlyMet: 1.609 ± 0.807
3.907GlyAsn: 3.907 ± 1.31
4.367GlyPro: 4.367 ± 0.873
1.609GlyGln: 1.609 ± 0.298
4.826GlyArg: 4.826 ± 0.991
6.205GlySer: 6.205 ± 1.028
4.137GlyThr: 4.137 ± 0.342
5.516GlyVal: 5.516 ± 0.371
0.689GlyTrp: 0.689 ± 0.358
1.379GlyTyr: 1.379 ± 0.477
0.0GlyXaa: 0.0 ± 0.0
His
1.149HisAla: 1.149 ± 0.719
0.689HisCys: 0.689 ± 0.36
1.149HisAsp: 1.149 ± 0.498
2.528HisGlu: 2.528 ± 0.602
0.46HisPhe: 0.46 ± 0.288
1.839HisGly: 1.839 ± 0.493
1.149HisHis: 1.149 ± 0.58
0.46HisIle: 0.46 ± 0.416
0.0HisLys: 0.0 ± 0.0
4.367HisLeu: 4.367 ± 1.276
1.149HisMet: 1.149 ± 0.568
0.689HisAsn: 0.689 ± 0.31
1.609HisPro: 1.609 ± 0.532
0.689HisGln: 0.689 ± 0.432
2.068HisArg: 2.068 ± 0.887
1.379HisSer: 1.379 ± 0.416
0.919HisThr: 0.919 ± 0.368
1.839HisVal: 1.839 ± 0.607
0.689HisTrp: 0.689 ± 0.432
1.609HisTyr: 1.609 ± 0.607
0.0HisXaa: 0.0 ± 0.0
Ile
2.298IleAla: 2.298 ± 0.666
1.149IleCys: 1.149 ± 0.347
2.298IleAsp: 2.298 ± 0.868
2.068IleGlu: 2.068 ± 0.628
2.758IlePhe: 2.758 ± 0.746
2.988IleGly: 2.988 ± 0.566
1.149IleHis: 1.149 ± 0.476
2.988IleIle: 2.988 ± 1.017
2.068IleLys: 2.068 ± 0.523
8.044IleLeu: 8.044 ± 1.748
1.609IleMet: 1.609 ± 0.357
1.379IleAsn: 1.379 ± 0.751
1.839IlePro: 1.839 ± 0.47
1.609IleGln: 1.609 ± 0.885
3.677IleArg: 3.677 ± 0.579
2.988IleSer: 2.988 ± 1.068
2.298IleThr: 2.298 ± 1.135
2.988IleVal: 2.988 ± 0.797
0.919IleTrp: 0.919 ± 0.292
1.379IleTyr: 1.379 ± 0.391
0.0IleXaa: 0.0 ± 0.0
Lys
3.907LysAla: 3.907 ± 1.725
0.919LysCys: 0.919 ± 0.389
3.677LysAsp: 3.677 ± 0.58
2.988LysGlu: 2.988 ± 0.74
1.149LysPhe: 1.149 ± 0.58
3.218LysGly: 3.218 ± 0.742
1.149LysHis: 1.149 ± 0.719
2.528LysIle: 2.528 ± 1.115
2.988LysLys: 2.988 ± 1.398
4.597LysLeu: 4.597 ± 1.148
0.46LysMet: 0.46 ± 0.662
1.609LysAsn: 1.609 ± 0.782
2.298LysPro: 2.298 ± 0.584
2.298LysGln: 2.298 ± 0.744
3.907LysArg: 3.907 ± 1.156
2.758LysSer: 2.758 ± 0.604
4.137LysThr: 4.137 ± 1.112
3.677LysVal: 3.677 ± 0.734
0.23LysTrp: 0.23 ± 0.253
0.46LysTyr: 0.46 ± 0.288
0.0LysXaa: 0.0 ± 0.0
Leu
8.044LeuAla: 8.044 ± 1.256
1.609LeuCys: 1.609 ± 0.447
6.435LeuAsp: 6.435 ± 0.674
4.137LeuGlu: 4.137 ± 1.169
2.758LeuPhe: 2.758 ± 0.566
8.274LeuGly: 8.274 ± 1.513
2.068LeuHis: 2.068 ± 1.107
5.056LeuIle: 5.056 ± 1.165
6.895LeuLys: 6.895 ± 1.455
10.113LeuLeu: 10.113 ± 2.007
2.528LeuMet: 2.528 ± 0.66
4.826LeuAsn: 4.826 ± 1.874
5.286LeuPro: 5.286 ± 0.635
3.677LeuGln: 3.677 ± 0.629
6.665LeuArg: 6.665 ± 1.505
9.883LeuSer: 9.883 ± 0.855
8.504LeuThr: 8.504 ± 1.608
10.802LeuVal: 10.802 ± 1.727
0.919LeuTrp: 0.919 ± 0.526
3.677LeuTyr: 3.677 ± 0.87
0.0LeuXaa: 0.0 ± 0.0
Met
2.988MetAla: 2.988 ± 1.724
0.689MetCys: 0.689 ± 0.322
0.919MetAsp: 0.919 ± 0.368
2.988MetGlu: 2.988 ± 1.093
2.298MetPhe: 2.298 ± 0.728
2.298MetGly: 2.298 ± 0.774
0.46MetHis: 0.46 ± 0.416
2.298MetIle: 2.298 ± 0.228
2.988MetLys: 2.988 ± 0.699
2.068MetLeu: 2.068 ± 0.564
1.839MetMet: 1.839 ± 0.336
0.919MetAsn: 0.919 ± 0.46
0.46MetPro: 0.46 ± 0.286
0.46MetGln: 0.46 ± 0.216
2.068MetArg: 2.068 ± 0.661
2.068MetSer: 2.068 ± 0.948
1.839MetThr: 1.839 ± 0.638
2.298MetVal: 2.298 ± 0.937
0.0MetTrp: 0.0 ± 0.0
0.46MetTyr: 0.46 ± 0.288
0.0MetXaa: 0.0 ± 0.0
Asn
1.379AsnAla: 1.379 ± 0.521
0.23AsnCys: 0.23 ± 0.253
2.298AsnAsp: 2.298 ± 0.602
1.379AsnGlu: 1.379 ± 0.43
2.068AsnPhe: 2.068 ± 0.849
2.528AsnGly: 2.528 ± 1.808
0.0AsnHis: 0.0 ± 0.0
2.298AsnIle: 2.298 ± 0.94
1.609AsnLys: 1.609 ± 0.452
3.907AsnLeu: 3.907 ± 1.462
0.919AsnMet: 0.919 ± 0.322
1.839AsnAsn: 1.839 ± 0.816
3.447AsnPro: 3.447 ± 1.261
1.839AsnGln: 1.839 ± 0.652
3.218AsnArg: 3.218 ± 0.622
2.298AsnSer: 2.298 ± 0.738
2.988AsnThr: 2.988 ± 0.654
1.379AsnVal: 1.379 ± 0.364
0.0AsnTrp: 0.0 ± 0.0
0.46AsnTyr: 0.46 ± 0.216
0.0AsnXaa: 0.0 ± 0.0
Pro
1.839ProAla: 1.839 ± 0.922
0.919ProCys: 0.919 ± 0.385
2.758ProAsp: 2.758 ± 1.019
3.677ProGlu: 3.677 ± 0.957
2.068ProPhe: 2.068 ± 0.448
2.298ProGly: 2.298 ± 1.136
0.919ProHis: 0.919 ± 0.368
2.528ProIle: 2.528 ± 0.448
1.839ProLys: 1.839 ± 0.944
4.826ProLeu: 4.826 ± 0.694
1.379ProMet: 1.379 ± 1.007
2.528ProAsn: 2.528 ± 0.688
2.758ProPro: 2.758 ± 0.536
2.758ProGln: 2.758 ± 1.549
2.298ProArg: 2.298 ± 0.541
4.826ProSer: 4.826 ± 1.097
3.677ProThr: 3.677 ± 0.761
2.988ProVal: 2.988 ± 0.871
0.0ProTrp: 0.0 ± 0.0
1.379ProTyr: 1.379 ± 0.266
0.0ProXaa: 0.0 ± 0.0
Gln
1.379GlnAla: 1.379 ± 0.391
0.23GlnCys: 0.23 ± 0.144
1.379GlnAsp: 1.379 ± 0.416
1.379GlnGlu: 1.379 ± 0.643
1.379GlnPhe: 1.379 ± 0.398
2.988GlnGly: 2.988 ± 0.758
1.379GlnHis: 1.379 ± 0.621
1.609GlnIle: 1.609 ± 0.728
2.298GlnLys: 2.298 ± 0.625
5.056GlnLeu: 5.056 ± 1.258
1.379GlnMet: 1.379 ± 1.016
0.689GlnAsn: 0.689 ± 0.265
1.149GlnPro: 1.149 ± 0.879
1.609GlnGln: 1.609 ± 0.693
2.068GlnArg: 2.068 ± 0.564
2.298GlnSer: 2.298 ± 0.651
1.839GlnThr: 1.839 ± 0.402
2.528GlnVal: 2.528 ± 0.789
0.46GlnTrp: 0.46 ± 0.216
0.689GlnTyr: 0.689 ± 0.358
0.0GlnXaa: 0.0 ± 0.0
Arg
3.907ArgAla: 3.907 ± 0.572
1.379ArgCys: 1.379 ± 0.607
3.447ArgAsp: 3.447 ± 1.019
4.137ArgGlu: 4.137 ± 0.69
2.298ArgPhe: 2.298 ± 0.59
4.597ArgGly: 4.597 ± 1.151
2.068ArgHis: 2.068 ± 1.024
4.367ArgIle: 4.367 ± 1.168
2.068ArgLys: 2.068 ± 1.549
6.665ArgLeu: 6.665 ± 1.874
2.068ArgMet: 2.068 ± 0.457
0.919ArgAsn: 0.919 ± 0.46
2.298ArgPro: 2.298 ± 0.575
1.379ArgGln: 1.379 ± 0.529
4.597ArgArg: 4.597 ± 0.829
5.976ArgSer: 5.976 ± 1.26
3.677ArgThr: 3.677 ± 1.566
3.907ArgVal: 3.907 ± 1.235
0.919ArgTrp: 0.919 ± 0.46
2.298ArgTyr: 2.298 ± 0.993
0.0ArgXaa: 0.0 ± 0.0
Ser
4.597SerAla: 4.597 ± 0.842
1.379SerCys: 1.379 ± 0.621
3.907SerAsp: 3.907 ± 0.566
6.665SerGlu: 6.665 ± 0.987
2.298SerPhe: 2.298 ± 0.534
8.504SerGly: 8.504 ± 3.29
1.839SerHis: 1.839 ± 0.406
4.597SerIle: 4.597 ± 1.197
3.218SerLys: 3.218 ± 0.703
7.814SerLeu: 7.814 ± 1.13
2.068SerMet: 2.068 ± 0.457
2.528SerAsn: 2.528 ± 0.763
2.068SerPro: 2.068 ± 0.643
2.528SerGln: 2.528 ± 0.519
5.976SerArg: 5.976 ± 1.44
6.435SerSer: 6.435 ± 1.643
5.286SerThr: 5.286 ± 1.538
3.447SerVal: 3.447 ± 0.864
1.149SerTrp: 1.149 ± 0.329
1.839SerTyr: 1.839 ± 0.596
0.0SerXaa: 0.0 ± 0.0
Thr
3.907ThrAla: 3.907 ± 1.25
2.298ThrCys: 2.298 ± 0.508
2.988ThrAsp: 2.988 ± 0.751
3.218ThrGlu: 3.218 ± 0.967
1.149ThrPhe: 1.149 ± 0.363
7.814ThrGly: 7.814 ± 1.9
0.689ThrHis: 0.689 ± 0.555
2.298ThrIle: 2.298 ± 1.044
3.218ThrLys: 3.218 ± 0.979
7.814ThrLeu: 7.814 ± 1.362
2.988ThrMet: 2.988 ± 0.837
2.758ThrAsn: 2.758 ± 0.471
4.367ThrPro: 4.367 ± 0.945
1.609ThrGln: 1.609 ± 0.398
3.677ThrArg: 3.677 ± 0.91
4.367ThrSer: 4.367 ± 0.99
2.758ThrThr: 2.758 ± 0.979
3.677ThrVal: 3.677 ± 0.498
0.689ThrTrp: 0.689 ± 0.374
0.919ThrTyr: 0.919 ± 0.385
0.0ThrXaa: 0.0 ± 0.0
Val
3.907ValAla: 3.907 ± 1.828
0.46ValCys: 0.46 ± 0.288
3.907ValAsp: 3.907 ± 0.712
3.677ValGlu: 3.677 ± 0.774
3.677ValPhe: 3.677 ± 1.471
2.988ValGly: 2.988 ± 0.908
2.528ValHis: 2.528 ± 1.05
1.839ValIle: 1.839 ± 1.327
4.367ValLys: 4.367 ± 1.223
4.826ValLeu: 4.826 ± 1.438
2.988ValMet: 2.988 ± 1.217
2.528ValAsn: 2.528 ± 0.545
3.218ValPro: 3.218 ± 1.365
2.298ValGln: 2.298 ± 0.642
5.516ValArg: 5.516 ± 1.211
5.516ValSer: 5.516 ± 1.225
4.367ValThr: 4.367 ± 1.056
5.516ValVal: 5.516 ± 2.02
1.379ValTrp: 1.379 ± 0.497
1.149ValTyr: 1.149 ± 0.393
0.0ValXaa: 0.0 ± 0.0
Trp
2.068TrpAla: 2.068 ± 0.939
0.23TrpCys: 0.23 ± 0.144
0.919TrpAsp: 0.919 ± 0.655
1.149TrpGlu: 1.149 ± 0.347
1.149TrpPhe: 1.149 ± 0.393
1.149TrpGly: 1.149 ± 0.347
0.689TrpHis: 0.689 ± 0.432
0.0TrpIle: 0.0 ± 0.0
0.46TrpLys: 0.46 ± 0.339
0.46TrpLeu: 0.46 ± 0.215
0.23TrpMet: 0.23 ± 0.331
0.0TrpAsn: 0.0 ± 0.0
0.23TrpPro: 0.23 ± 0.144
0.689TrpGln: 0.689 ± 0.374
0.23TrpArg: 0.23 ± 0.331
0.689TrpSer: 0.689 ± 0.432
1.149TrpThr: 1.149 ± 0.363
0.919TrpVal: 0.919 ± 0.368
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.149TyrAla: 1.149 ± 0.476
0.23TyrCys: 0.23 ± 0.241
1.149TyrAsp: 1.149 ± 0.406
0.689TyrGlu: 0.689 ± 0.277
0.689TyrPhe: 0.689 ± 0.447
1.379TyrGly: 1.379 ± 0.811
1.149TyrHis: 1.149 ± 0.403
0.919TyrIle: 0.919 ± 0.369
2.068TyrLys: 2.068 ± 0.845
2.988TyrLeu: 2.988 ± 1.358
1.149TyrMet: 1.149 ± 0.527
1.839TyrAsn: 1.839 ± 0.481
0.46TyrPro: 0.46 ± 0.339
0.46TyrGln: 0.46 ± 0.286
1.149TyrArg: 1.149 ± 0.363
2.758TyrSer: 2.758 ± 0.39
1.149TyrThr: 1.149 ± 0.403
0.689TyrVal: 0.689 ± 0.31
0.46TyrTrp: 0.46 ± 0.215
0.689TyrTyr: 0.689 ± 0.565
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4352 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski