Amino acid dipeptide frequency for Sudan ebolavirus (strain Human/Uganda/Gulu/2000) (SEBOV) (Sudan Ebola virus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
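The calculation described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function names `dipeptide_permille` and `classify` and the toy sequences are hypothetical, sequences use one-letter codes rather than the three-letter codes of the table, and the normalization (dividing by the number of dipeptides counted) is an assumption that may differ from the one used by Proteome-pI.

```python
from collections import Counter

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: overlapping pairs are counted within each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the SEBOV proteome.
toy = ["MKLLA", "LLAK"]
freqs = dipeptide_permille(toy)

# A per mille value becomes a fraction when multiplied by 1e-3.
ll_fraction = freqs["LL"] * 1e-3
```

With the values from the table, `classify(9.259)` (LeuLeu) gives "over" and `classify(0.363)` (CysCys) gives "under", which corresponds to the red and blue marking described above.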

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell is the frequency ± error in ‰.

| First residue | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Ala | 4.539 ± 0.674 | 0.726 ± 0.3 | 2.905 ± 0.728 | 4.176 ± 0.744 | 5.265 ± 1.411 | 2.542 ± 1.089 | 0.545 ± 0.226 | 3.813 ± 0.766 | 3.994 ± 0.898 | 6.354 ± 0.821 | 0.726 ± 0.693 | 2.542 ± 0.44 | 3.45 ± 1.005 | 2.542 ± 0.796 | 2.36 ± 0.847 | 4.72 ± 0.855 | 5.628 ± 0.923 | 3.45 ± 0.347 | 0.545 ± 0.3 | 1.271 ± 0.585 | 0.0 ± 0.0 |
| Cys | 0.908 ± 0.555 | 0.363 ± 0.176 | 0.545 ± 0.3 | 0.726 ± 0.488 | 0.0 ± 0.0 | 0.726 ± 0.3 | 0.363 ± 0.196 | 0.726 ± 0.293 | 1.634 ± 0.37 | 2.179 ± 0.5 | 0.182 ± 0.168 | 0.908 ± 0.317 | 0.908 ± 0.353 | 0.363 ± 0.345 | 1.452 ± 0.463 | 1.089 ± 0.579 | 1.089 ± 0.577 | 0.545 ± 0.241 | 0.363 ± 0.246 | 0.908 ± 0.277 | 0.0 ± 0.0 |
| Asp | 3.45 ± 0.707 | 0.363 ± 0.257 | 4.72 ± 2.174 | 2.36 ± 1.087 | 1.816 ± 0.384 | 3.45 ± 0.743 | 2.36 ± 0.562 | 3.45 ± 0.514 | 2.36 ± 0.713 | 4.902 ± 0.95 | 0.363 ± 0.328 | 2.723 ± 0.791 | 2.179 ± 0.638 | 2.905 ± 0.859 | 3.813 ± 0.584 | 3.631 ± 1.017 | 2.179 ± 0.559 | 0.908 ± 0.434 | 0.908 ± 0.317 | 2.542 ± 0.46 | 0.0 ± 0.0 |
| Glu | 5.084 ± 1.425 | 0.726 ± 0.291 | 2.36 ± 0.677 | 2.36 ± 1.063 | 2.179 ± 0.673 | 4.902 ± 0.731 | 0.726 ± 0.3 | 3.086 ± 0.798 | 2.905 ± 1.222 | 4.176 ± 1.031 | 0.182 ± 0.359 | 3.813 ± 1.471 | 1.997 ± 0.514 | 2.542 ± 0.503 | 1.816 ± 0.665 | 3.813 ± 1.097 | 3.813 ± 0.878 | 3.813 ± 0.882 | 1.452 ± 0.582 | 1.997 ± 0.334 | 0.0 ± 0.0 |
| Phe | 1.452 ± 0.434 | 0.182 ± 0.244 | 1.089 ± 0.47 | 1.816 ± 0.497 | 1.816 ± 0.553 | 2.542 ± 0.577 | 2.542 ± 0.378 | 1.997 ± 0.542 | 2.179 ± 0.339 | 7.807 ± 0.974 | 0.182 ± 0.123 | 1.089 ± 0.476 | 2.36 ± 0.453 | 2.905 ± 1.233 | 2.723 ± 0.606 | 3.631 ± 0.637 | 0.726 ± 0.355 | 2.36 ± 0.515 | 1.089 ± 0.319 | 0.363 ± 0.246 | 0.0 ± 0.0 |
| Gly | 2.723 ± 0.647 | 0.0 ± 0.0 | 3.086 ± 0.606 | 3.813 ± 0.621 | 2.542 ± 0.418 | 2.723 ± 0.464 | 1.271 ± 0.475 | 3.631 ± 1.305 | 2.905 ± 1.132 | 6.718 ± 0.515 | 0.908 ± 0.207 | 1.634 ± 0.426 | 3.631 ± 1.119 | 2.36 ± 0.795 | 3.086 ± 0.788 | 5.265 ± 0.907 | 3.086 ± 0.449 | 5.81 ± 2.232 | 1.271 ± 0.594 | 1.452 ± 0.61 | 0.0 ± 0.0 |
| His | 1.634 ± 0.612 | 0.545 ± 0.402 | 0.908 ± 0.434 | 0.363 ± 0.25 | 1.089 ± 0.421 | 1.271 ± 1.098 | 1.089 ± 0.427 | 2.179 ± 0.651 | 1.997 ± 0.628 | 3.813 ± 0.455 | 0.545 ± 0.396 | 1.089 ± 0.483 | 1.271 ± 0.511 | 2.179 ± 0.381 | 2.36 ± 1.045 | 1.997 ± 0.444 | 1.997 ± 0.492 | 0.908 ± 0.408 | 0.182 ± 0.123 | 0.726 ± 0.34 | 0.0 ± 0.0 |
| Ile | 3.813 ± 0.579 | 0.545 ± 0.299 | 2.905 ± 0.691 | 2.36 ± 0.401 | 2.542 ± 1.155 | 3.813 ± 0.748 | 1.634 ± 0.329 | 5.084 ± 1.004 | 3.813 ± 1.05 | 5.628 ± 0.737 | 1.089 ± 0.578 | 3.086 ± 0.529 | 4.539 ± 0.77 | 3.086 ± 0.928 | 2.905 ± 0.76 | 4.902 ± 1.109 | 4.539 ± 0.972 | 2.36 ± 0.731 | 0.908 ± 0.585 | 2.179 ± 0.409 | 0.0 ± 0.0 |
| Lys | 3.631 ± 1.205 | 0.726 ± 0.34 | 3.45 ± 1.001 | 2.723 ± 0.877 | 1.452 ± 0.556 | 2.723 ± 0.79 | 0.908 ± 0.437 | 4.176 ± 0.882 | 3.45 ± 0.8 | 4.176 ± 0.87 | 0.726 ± 0.376 | 2.723 ± 0.452 | 3.086 ± 0.7 | 1.816 ± 0.596 | 1.816 ± 0.402 | 3.994 ± 0.959 | 3.631 ± 0.638 | 3.086 ± 0.721 | 0.182 ± 0.123 | 1.997 ± 0.795 | 0.0 ± 0.0 |
| Leu | 7.444 ± 0.701 | 2.179 ± 0.569 | 5.628 ± 1.121 | 7.444 ± 0.876 | 3.994 ± 0.899 | 5.265 ± 0.624 | 3.086 ± 0.686 | 5.81 ± 0.812 | 4.902 ± 1.014 | 9.259 ± 1.281 | 2.36 ± 0.45 | 5.447 ± 0.968 | 6.173 ± 1.261 | 5.084 ± 0.805 | 7.262 ± 1.318 | 9.078 ± 1.337 | 5.81 ± 1.349 | 5.628 ± 0.747 | 1.271 ± 0.53 | 2.905 ± 0.854 | 0.0 ± 0.0 |
| Met | 1.089 ± 0.343 | 0.363 ± 0.246 | 1.271 ± 0.517 | 0.545 ± 0.346 | 0.545 ± 0.3 | 1.089 ± 0.5 | 1.089 ± 0.386 | 0.363 ± 0.246 | 0.363 ± 0.196 | 0.908 ± 0.464 | 0.726 ± 0.248 | 0.545 ± 0.369 | 0.908 ± 0.308 | 0.545 ± 0.363 | 1.089 ± 0.84 | 1.997 ± 0.678 | 0.908 ± 0.621 | 1.816 ± 0.436 | 0.0 ± 0.0 | 0.182 ± 0.123 | 0.0 ± 0.0 |
| Asn | 2.723 ± 0.593 | 1.634 ± 0.589 | 2.179 ± 0.337 | 1.997 ± 0.646 | 2.723 ± 0.718 | 1.452 ± 0.424 | 0.908 ± 0.317 | 3.268 ± 0.588 | 1.634 ± 0.363 | 5.81 ± 0.738 | 0.908 ± 0.497 | 2.905 ± 0.465 | 2.723 ± 0.518 | 2.905 ± 0.565 | 2.542 ± 0.693 | 3.45 ± 0.438 | 3.813 ± 0.934 | 2.905 ± 1.247 | 0.182 ± 0.184 | 1.816 ± 0.47 | 0.0 ± 0.0 |
| Pro | 2.542 ± 0.995 | 1.089 ± 0.314 | 3.631 ± 1.263 | 2.905 ± 0.611 | 1.089 ± 0.49 | 5.084 ± 1.154 | 1.997 ± 0.505 | 4.357 ± 0.858 | 4.72 ± 1.098 | 4.902 ± 0.939 | 0.908 ± 0.464 | 2.179 ± 0.322 | 6.899 ± 1.379 | 2.542 ± 0.196 | 1.634 ± 0.554 | 4.72 ± 0.927 | 3.268 ± 1.035 | 2.905 ± 0.967 | 0.182 ± 0.18 | 1.452 ± 0.459 | 0.0 ± 0.0 |
| Gln | 1.997 ± 0.762 | 1.271 ± 0.693 | 2.36 ± 1.182 | 2.36 ± 0.494 | 1.816 ± 0.314 | 3.45 ± 1.04 | 1.997 ± 0.431 | 2.542 ± 0.673 | 3.631 ± 0.848 | 7.444 ± 1.482 | 0.363 ± 0.246 | 1.634 ± 0.532 | 1.452 ± 0.539 | 3.268 ± 0.941 | 1.816 ± 0.501 | 2.36 ± 0.545 | 2.723 ± 0.587 | 1.089 ± 0.533 | 0.363 ± 0.309 | 2.179 ± 0.769 | 0.0 ± 0.0 |
| Arg | 2.179 ± 0.44 | 1.089 ± 0.314 | 1.452 ± 0.439 | 3.631 ± 0.721 | 2.36 ± 1.225 | 4.176 ± 0.539 | 1.271 ± 0.404 | 2.179 ± 0.653 | 2.542 ± 0.846 | 7.262 ± 0.983 | 1.452 ± 0.53 | 2.723 ± 1.428 | 2.36 ± 0.636 | 2.179 ± 0.603 | 2.723 ± 0.626 | 4.539 ± 1.132 | 3.45 ± 0.477 | 2.723 ± 0.525 | 0.908 ± 0.397 | 1.997 ± 0.328 | 0.0 ± 0.0 |
| Ser | 5.084 ± 0.955 | 0.908 ± 0.425 | 4.72 ± 1.192 | 3.813 ± 0.663 | 3.994 ± 0.773 | 5.084 ± 0.383 | 1.089 ± 0.347 | 3.631 ± 1.67 | 1.816 ± 0.444 | 7.262 ± 1.88 | 1.634 ± 0.552 | 4.176 ± 0.586 | 4.357 ± 1.078 | 2.179 ± 0.744 | 4.357 ± 1.509 | 7.262 ± 1.039 | 8.533 ± 1.554 | 4.176 ± 0.458 | 0.726 ± 0.326 | 2.905 ± 0.986 | 0.0 ± 0.0 |
| Thr | 4.902 ± 0.839 | 1.089 ± 0.454 | 3.631 ± 0.657 | 5.265 ± 1.341 | 2.179 ± 0.568 | 4.176 ± 1.498 | 1.816 ± 0.521 | 3.631 ± 0.664 | 2.723 ± 0.605 | 7.444 ± 0.762 | 1.089 ± 0.54 | 3.086 ± 0.72 | 4.539 ± 1.47 | 2.36 ± 0.634 | 3.45 ± 0.901 | 5.265 ± 0.604 | 7.807 ± 1.315 | 3.631 ± 0.75 | 1.634 ± 0.528 | 1.816 ± 0.338 | 0.0 ± 0.0 |
| Val | 2.723 ± 0.607 | 1.634 ± 0.541 | 2.179 ± 0.767 | 1.997 ± 0.594 | 1.634 ± 0.468 | 1.997 ± 0.62 | 1.634 ± 0.416 | 5.447 ± 0.466 | 1.816 ± 0.716 | 4.357 ± 1.436 | 1.271 ± 0.368 | 3.631 ± 1.244 | 3.813 ± 0.608 | 3.086 ± 0.981 | 3.45 ± 0.626 | 3.631 ± 0.465 | 4.539 ± 0.531 | 3.268 ± 0.614 | 0.545 ± 0.351 | 1.089 ± 0.38 | 0.0 ± 0.0 |
| Trp | 2.179 ± 1.205 | 0.0 ± 0.0 | 0.363 ± 0.244 | 1.089 ± 0.314 | 1.089 ± 0.484 | 0.908 ± 0.397 | 0.545 ± 0.369 | 0.545 ± 0.337 | 0.363 ± 0.25 | 1.089 ± 0.31 | 0.182 ± 0.123 | 0.363 ± 0.265 | 0.363 ± 0.163 | 0.363 ± 0.286 | 0.545 ± 0.245 | 0.182 ± 0.244 | 1.634 ± 0.665 | 1.271 ± 0.319 | 0.182 ± 0.184 | 0.545 ± 0.369 | 0.0 ± 0.0 |
| Tyr | 1.997 ± 0.849 | 0.545 ± 0.369 | 1.997 ± 0.529 | 2.179 ± 0.486 | 0.726 ± 0.239 | 0.545 ± 0.369 | 1.271 ± 0.532 | 1.816 ± 0.645 | 0.726 ± 0.253 | 4.539 ± 1.639 | 0.363 ± 0.246 | 2.179 ± 0.709 | 1.997 ± 0.784 | 0.908 ± 0.401 | 1.997 ± 0.463 | 2.179 ± 1.111 | 2.36 ± 0.557 | 1.089 ± 0.461 | 0.908 ± 0.348 | 1.452 ± 0.316 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 9 proteins (5509 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
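The note above only says that the error comes from bootstrapping at the protein level. A minimal sketch of one way to do this follows, under several assumptions not confirmed by the source: "x100" is read as 100 resampling rounds, whole proteins are resampled with replacement, and the error is taken as the sample standard deviation of the resampled frequencies. The names `pair_permille` and `bootstrap_error` and the toy proteome are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide over a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins.

    Assumption: each round draws len(sequences) proteins with replacement,
    and the error is the standard deviation across rounds.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        resampled = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(resampled, pair))
    return stdev(estimates)


# Hypothetical toy proteome in one-letter codes, not the 9 SEBOV proteins.
toy_proteome = ["MKLLA", "LLAK", "AAAAGG"]
ll_error = bootstrap_error(toy_proteome, "LL")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, which is one plausible reason for the "protein level" wording. With only 9 proteins, such an estimate can be large relative to the frequency itself, as several table entries show.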

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski