Amino acid dipeptide frequency for Lake Victoria marburgvirus (strain Musoke-80) (MARV) (Marburg virus (strain Kenya/Musoke/1980))

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
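The counting described above can be sketched in a few lines of Python. This is only an illustration and not the code used to build this page: the function name `dipeptide_permille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is folded into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Assumption: pairs are counted within one protein and never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

# Hypothetical toy sequences, not taken from the Marburg virus proteome.
toy = ["MKLLSA", "LLSAX"]
freqs = dipeptide_permille(toy)
overrepresented = sorted(dp for dp, f in freqs.items() if f > EXPECTED_PERMILLE)
```

With real proteome sequences, `overrepresented` would correspond to the cells marked in red, and the remaining dipeptides with a frequency below `EXPECTED_PERMILLE` to those marked in blue.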

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
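As a small worked example of this unit conversion, the sketch below takes the LeuSer value from the table (11.289‰) and compares it with the uniform expectation of 0.25% (2.5‰). The helper names are made up for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


leu_ser = 11.289            # LeuSer value from the table, in per mille
uniform_expectation = 2.5   # 1/400 expressed in per mille

fraction = permille_to_fraction(leu_ser)        # about 0.011289
percent = permille_to_percent(leu_ser)          # about 1.1289 %
ratio_to_uniform = leu_ser / uniform_expectation  # about 4.5 times the uniform value
```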

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.695AlaAla: 3.695 ± 1.385
0.411AlaCys: 0.411 ± 0.29
3.489AlaAsp: 3.489 ± 0.753
3.489AlaGlu: 3.489 ± 0.584
2.258AlaPhe: 2.258 ± 1.2
1.437AlaGly: 1.437 ± 0.483
1.437AlaHis: 1.437 ± 0.574
4.105AlaIle: 4.105 ± 0.603
2.463AlaLys: 2.463 ± 0.593
6.568AlaLeu: 6.568 ± 1.025
1.026AlaMet: 1.026 ± 0.239
2.463AlaAsn: 2.463 ± 0.425
2.668AlaPro: 2.668 ± 0.99
2.053AlaGln: 2.053 ± 0.924
3.695AlaArg: 3.695 ± 0.789
5.131AlaSer: 5.131 ± 1.295
1.847AlaThr: 1.847 ± 0.802
3.079AlaVal: 3.079 ± 0.597
1.026AlaTrp: 1.026 ± 0.552
2.258AlaTyr: 2.258 ± 0.915
0.0AlaXaa: 0.0 ± 0.0
Cys
1.642CysAla: 1.642 ± 0.525
1.026CysCys: 1.026 ± 0.356
0.616CysAsp: 0.616 ± 0.253
1.232CysGlu: 1.232 ± 0.544
1.026CysPhe: 1.026 ± 0.44
0.821CysGly: 0.821 ± 0.351
0.411CysHis: 0.411 ± 0.258
1.232CysIle: 1.232 ± 0.505
0.821CysLys: 0.821 ± 0.402
0.821CysLeu: 0.821 ± 0.368
0.0CysMet: 0.0 ± 0.0
0.616CysAsn: 0.616 ± 0.301
0.411CysPro: 0.411 ± 0.245
0.0CysGln: 0.0 ± 0.0
1.232CysArg: 1.232 ± 0.396
2.053CysSer: 2.053 ± 0.552
0.616CysThr: 0.616 ± 0.368
1.026CysVal: 1.026 ± 0.31
0.205CysTrp: 0.205 ± 0.123
0.616CysTyr: 0.616 ± 0.253
0.0CysXaa: 0.0 ± 0.0
Asp
4.721AspAla: 4.721 ± 1.082
1.232AspCys: 1.232 ± 0.396
2.874AspAsp: 2.874 ± 0.676
2.874AspGlu: 2.874 ± 1.087
2.668AspPhe: 2.668 ± 0.976
0.821AspGly: 0.821 ± 0.342
1.642AspHis: 1.642 ± 0.61
2.463AspIle: 2.463 ± 0.628
3.284AspLys: 3.284 ± 1.167
6.568AspLeu: 6.568 ± 0.719
1.026AspMet: 1.026 ± 0.606
2.463AspAsn: 2.463 ± 0.577
2.668AspPro: 2.668 ± 0.873
3.695AspGln: 3.695 ± 0.392
1.437AspArg: 1.437 ± 0.428
3.489AspSer: 3.489 ± 0.634
1.642AspThr: 1.642 ± 0.418
2.258AspVal: 2.258 ± 0.435
0.616AspTrp: 0.616 ± 0.215
1.642AspTyr: 1.642 ± 0.795
0.0AspXaa: 0.0 ± 0.0
Glu
2.668GluAla: 2.668 ± 0.292
0.821GluCys: 0.821 ± 0.491
4.926GluAsp: 4.926 ± 1.38
1.847GluGlu: 1.847 ± 0.556
2.053GluPhe: 2.053 ± 0.645
4.105GluGly: 4.105 ± 0.828
1.642GluHis: 1.642 ± 0.921
3.489GluIle: 3.489 ± 0.427
3.9GluLys: 3.9 ± 0.897
5.542GluLeu: 5.542 ± 0.675
0.411GluMet: 0.411 ± 0.29
3.695GluAsn: 3.695 ± 0.587
1.847GluPro: 1.847 ± 0.971
3.489GluGln: 3.489 ± 0.471
2.258GluArg: 2.258 ± 0.285
3.695GluSer: 3.695 ± 1.163
2.463GluThr: 2.463 ± 0.754
1.437GluVal: 1.437 ± 0.898
1.026GluTrp: 1.026 ± 0.552
1.847GluTyr: 1.847 ± 0.528
0.0GluXaa: 0.0 ± 0.0
Phe
0.821PheAla: 0.821 ± 0.316
0.821PheCys: 0.821 ± 0.277
2.668PheAsp: 2.668 ± 0.538
1.026PheGlu: 1.026 ± 0.388
1.232PhePhe: 1.232 ± 0.455
2.258PheGly: 2.258 ± 0.331
1.642PheHis: 1.642 ± 0.583
3.284PheIle: 3.284 ± 1.088
2.668PheLys: 2.668 ± 0.566
6.363PheLeu: 6.363 ± 0.836
0.205PheMet: 0.205 ± 0.123
1.437PheAsn: 1.437 ± 0.399
2.874PhePro: 2.874 ± 1.169
1.642PheGln: 1.642 ± 0.439
1.847PheArg: 1.847 ± 0.412
4.926PheSer: 4.926 ± 1.249
2.874PheThr: 2.874 ± 0.835
2.463PheVal: 2.463 ± 0.591
0.205PheTrp: 0.205 ± 0.123
0.821PheTyr: 0.821 ± 0.491
0.0PheXaa: 0.0 ± 0.0
Gly
2.258GlyAla: 2.258 ± 0.628
0.616GlyCys: 0.616 ± 0.253
1.642GlyAsp: 1.642 ± 0.741
2.874GlyGlu: 2.874 ± 0.556
2.258GlyPhe: 2.258 ± 0.525
2.258GlyGly: 2.258 ± 0.511
0.821GlyHis: 0.821 ± 0.464
4.721GlyIle: 4.721 ± 1.229
4.105GlyLys: 4.105 ± 0.624
4.926GlyLeu: 4.926 ± 0.671
1.232GlyMet: 1.232 ± 0.367
2.053GlyAsn: 2.053 ± 0.854
1.026GlyPro: 1.026 ± 0.612
3.284GlyGln: 3.284 ± 0.627
1.847GlyArg: 1.847 ± 0.629
4.105GlySer: 4.105 ± 0.899
3.489GlyThr: 3.489 ± 0.758
3.695GlyVal: 3.695 ± 1.373
0.821GlyTrp: 0.821 ± 0.578
1.437GlyTyr: 1.437 ± 0.428
0.0GlyXaa: 0.0 ± 0.0
His
1.232HisAla: 1.232 ± 0.58
0.205HisCys: 0.205 ± 0.123
1.232HisAsp: 1.232 ± 0.373
2.258HisGlu: 2.258 ± 0.713
1.232HisPhe: 1.232 ± 0.377
1.437HisGly: 1.437 ± 0.787
1.026HisHis: 1.026 ± 0.356
2.874HisIle: 2.874 ± 0.715
0.616HisLys: 0.616 ± 0.274
3.489HisLeu: 3.489 ± 0.612
0.821HisMet: 0.821 ± 0.544
1.026HisAsn: 1.026 ± 0.402
1.847HisPro: 1.847 ± 0.863
1.642HisGln: 1.642 ± 1.01
0.821HisArg: 0.821 ± 0.339
2.463HisSer: 2.463 ± 0.606
0.616HisThr: 0.616 ± 0.274
1.642HisVal: 1.642 ± 0.438
0.411HisTrp: 0.411 ± 0.158
1.847HisTyr: 1.847 ± 0.501
0.0HisXaa: 0.0 ± 0.0
Ile
3.489IleAla: 3.489 ± 0.979
1.232IleCys: 1.232 ± 0.323
3.489IleAsp: 3.489 ± 0.461
2.668IleGlu: 2.668 ± 0.9
3.284IlePhe: 3.284 ± 0.942
3.9IleGly: 3.9 ± 0.831
1.232IleHis: 1.232 ± 0.504
3.489IleIle: 3.489 ± 0.864
4.105IleLys: 4.105 ± 0.851
7.8IleLeu: 7.8 ± 0.609
0.821IleMet: 0.821 ± 0.436
4.105IleAsn: 4.105 ± 0.662
4.31IlePro: 4.31 ± 0.65
2.668IleGln: 2.668 ± 0.336
1.642IleArg: 1.642 ± 0.247
5.747IleSer: 5.747 ± 0.755
4.105IleThr: 4.105 ± 0.707
3.695IleVal: 3.695 ± 0.626
1.026IleTrp: 1.026 ± 0.323
1.026IleTyr: 1.026 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
2.053LysAla: 2.053 ± 0.762
0.411LysCys: 0.411 ± 0.201
2.668LysAsp: 2.668 ± 0.899
2.668LysGlu: 2.668 ± 0.75
1.232LysPhe: 1.232 ± 0.385
3.489LysGly: 3.489 ± 1.373
1.437LysHis: 1.437 ± 0.677
4.105LysIle: 4.105 ± 0.774
2.874LysLys: 2.874 ± 0.558
6.158LysLeu: 6.158 ± 1.376
1.232LysMet: 1.232 ± 0.727
4.516LysAsn: 4.516 ± 0.887
2.874LysPro: 2.874 ± 0.718
2.053LysGln: 2.053 ± 0.972
3.079LysArg: 3.079 ± 0.383
3.489LysSer: 3.489 ± 0.947
4.31LysThr: 4.31 ± 0.979
3.284LysVal: 3.284 ± 0.498
0.821LysTrp: 0.821 ± 0.313
2.463LysTyr: 2.463 ± 0.622
0.0LysXaa: 0.0 ± 0.0
Leu
7.184LeuAla: 7.184 ± 0.943
1.232LeuCys: 1.232 ± 0.499
5.747LeuAsp: 5.747 ± 0.983
6.568LeuGlu: 6.568 ± 0.719
4.721LeuPhe: 4.721 ± 1.073
6.158LeuGly: 6.158 ± 0.706
3.079LeuHis: 3.079 ± 0.601
6.773LeuIle: 6.773 ± 0.688
5.542LeuLys: 5.542 ± 1.945
10.878LeuLeu: 10.878 ± 0.872
2.463LeuMet: 2.463 ± 0.505
6.979LeuAsn: 6.979 ± 1.572
5.952LeuPro: 5.952 ± 1.553
4.516LeuGln: 4.516 ± 0.643
6.158LeuArg: 6.158 ± 0.571
11.289LeuSer: 11.289 ± 1.04
6.979LeuThr: 6.979 ± 0.926
3.9LeuVal: 3.9 ± 0.461
1.437LeuTrp: 1.437 ± 0.434
3.284LeuTyr: 3.284 ± 0.413
0.0LeuXaa: 0.0 ± 0.0
Met
1.232MetAla: 1.232 ± 0.601
0.205MetCys: 0.205 ± 0.263
0.821MetAsp: 0.821 ± 0.277
0.205MetGlu: 0.205 ± 0.283
1.232MetPhe: 1.232 ± 0.528
0.821MetGly: 0.821 ± 0.408
1.026MetHis: 1.026 ± 0.487
1.026MetIle: 1.026 ± 0.359
1.437MetLys: 1.437 ± 0.291
2.463MetLeu: 2.463 ± 0.695
0.411MetMet: 0.411 ± 0.258
1.437MetAsn: 1.437 ± 0.761
0.616MetPro: 0.616 ± 0.4
1.026MetGln: 1.026 ± 0.419
0.205MetArg: 0.205 ± 0.21
2.053MetSer: 2.053 ± 0.885
0.616MetThr: 0.616 ± 0.551
0.821MetVal: 0.821 ± 0.359
0.205MetTrp: 0.205 ± 0.321
0.411MetTyr: 0.411 ± 0.201
0.0MetXaa: 0.0 ± 0.0
Asn
2.463AsnAla: 2.463 ± 0.69
1.026AsnCys: 1.026 ± 0.613
2.874AsnAsp: 2.874 ± 0.636
2.668AsnGlu: 2.668 ± 0.941
2.874AsnPhe: 2.874 ± 0.882
1.437AsnGly: 1.437 ± 0.854
2.258AsnHis: 2.258 ± 0.671
3.9AsnIle: 3.9 ± 1.202
1.847AsnLys: 1.847 ± 0.434
8.415AsnLeu: 8.415 ± 1.221
1.232AsnMet: 1.232 ± 0.406
3.489AsnAsn: 3.489 ± 0.886
3.9AsnPro: 3.9 ± 0.634
3.9AsnGln: 3.9 ± 1.008
3.695AsnArg: 3.695 ± 0.777
3.695AsnSer: 3.695 ± 0.801
5.337AsnThr: 5.337 ± 1.637
1.847AsnVal: 1.847 ± 0.343
0.821AsnTrp: 0.821 ± 0.277
1.437AsnTyr: 1.437 ± 0.604
0.0AsnXaa: 0.0 ± 0.0
Pro
2.463ProAla: 2.463 ± 1.369
1.026ProCys: 1.026 ± 0.669
1.437ProAsp: 1.437 ± 0.324
2.668ProGlu: 2.668 ± 0.521
2.463ProPhe: 2.463 ± 0.428
1.847ProGly: 1.847 ± 0.329
1.642ProHis: 1.642 ± 0.638
2.463ProIle: 2.463 ± 0.482
2.053ProLys: 2.053 ± 0.355
6.363ProLeu: 6.363 ± 1.347
0.205ProMet: 0.205 ± 0.21
3.695ProAsn: 3.695 ± 1.297
7.184ProPro: 7.184 ± 2.721
3.079ProGln: 3.079 ± 1.038
1.847ProArg: 1.847 ± 0.819
6.363ProSer: 6.363 ± 1.124
4.105ProThr: 4.105 ± 1.503
3.079ProVal: 3.079 ± 0.833
0.0ProTrp: 0.0 ± 0.0
1.847ProTyr: 1.847 ± 0.942
0.0ProXaa: 0.0 ± 0.0
Gln
3.9GlnAla: 3.9 ± 1.073
0.821GlnCys: 0.821 ± 0.368
1.232GlnAsp: 1.232 ± 0.923
3.079GlnGlu: 3.079 ± 1.564
2.053GlnPhe: 2.053 ± 0.835
4.721GlnGly: 4.721 ± 1.471
1.847GlnHis: 1.847 ± 0.413
2.668GlnIle: 2.668 ± 0.532
3.284GlnLys: 3.284 ± 0.482
3.079GlnLeu: 3.079 ± 0.885
1.026GlnMet: 1.026 ± 0.37
3.695GlnAsn: 3.695 ± 0.601
1.437GlnPro: 1.437 ± 0.876
3.284GlnGln: 3.284 ± 0.826
1.847GlnArg: 1.847 ± 0.568
3.284GlnSer: 3.284 ± 0.438
3.284GlnThr: 3.284 ± 0.801
2.668GlnVal: 2.668 ± 0.859
0.0GlnTrp: 0.0 ± 0.0
2.053GlnTyr: 2.053 ± 0.633
0.0GlnXaa: 0.0 ± 0.0
Arg
2.258ArgAla: 2.258 ± 0.844
0.616ArgCys: 0.616 ± 0.368
1.232ArgAsp: 1.232 ± 0.23
3.695ArgGlu: 3.695 ± 0.468
1.232ArgPhe: 1.232 ± 0.379
1.642ArgGly: 1.642 ± 0.294
1.847ArgHis: 1.847 ± 0.466
3.079ArgIle: 3.079 ± 0.56
1.642ArgLys: 1.642 ± 0.479
4.926ArgLeu: 4.926 ± 1.823
0.821ArgMet: 0.821 ± 0.287
3.284ArgAsn: 3.284 ± 0.967
1.437ArgPro: 1.437 ± 0.647
2.668ArgGln: 2.668 ± 0.702
1.847ArgArg: 1.847 ± 0.46
4.105ArgSer: 4.105 ± 1.579
3.9ArgThr: 3.9 ± 0.922
3.695ArgVal: 3.695 ± 0.554
1.026ArgTrp: 1.026 ± 0.447
1.437ArgTyr: 1.437 ± 0.466
0.0ArgXaa: 0.0 ± 0.0
Ser
3.695SerAla: 3.695 ± 0.41
1.232SerCys: 1.232 ± 0.561
5.131SerAsp: 5.131 ± 1.486
4.721SerGlu: 4.721 ± 1.619
3.284SerPhe: 3.284 ± 1.02
5.337SerGly: 5.337 ± 1.176
1.232SerHis: 1.232 ± 0.575
5.337SerIle: 5.337 ± 0.797
5.337SerLys: 5.337 ± 1.82
9.442SerLeu: 9.442 ± 1.916
1.026SerMet: 1.026 ± 0.298
4.105SerAsn: 4.105 ± 0.945
5.131SerPro: 5.131 ± 0.968
4.31SerGln: 4.31 ± 1.164
3.695SerArg: 3.695 ± 1.251
9.442SerSer: 9.442 ± 0.706
7.594SerThr: 7.594 ± 1.699
3.695SerVal: 3.695 ± 0.544
1.026SerTrp: 1.026 ± 0.448
2.668SerTyr: 2.668 ± 0.645
0.0SerXaa: 0.0 ± 0.0
Thr
3.9ThrAla: 3.9 ± 1.539
1.847ThrCys: 1.847 ± 0.813
2.874ThrAsp: 2.874 ± 0.517
3.9ThrGlu: 3.9 ± 0.973
3.284ThrPhe: 3.284 ± 0.743
3.079ThrGly: 3.079 ± 1.212
1.437ThrHis: 1.437 ± 0.379
4.31ThrIle: 4.31 ± 1.006
3.9ThrLys: 3.9 ± 1.056
6.363ThrLeu: 6.363 ± 0.787
1.642ThrMet: 1.642 ± 0.388
4.516ThrAsn: 4.516 ± 1.875
3.695ThrPro: 3.695 ± 0.582
1.437ThrGln: 1.437 ± 0.472
4.926ThrArg: 4.926 ± 0.963
5.747ThrSer: 5.747 ± 1.601
5.131ThrThr: 5.131 ± 2.53
3.079ThrVal: 3.079 ± 1.025
0.205ThrTrp: 0.205 ± 0.123
1.026ThrTyr: 1.026 ± 0.634
0.0ThrXaa: 0.0 ± 0.0
Val
3.079ValAla: 3.079 ± 0.641
1.232ValCys: 1.232 ± 0.413
1.847ValAsp: 1.847 ± 0.431
3.079ValGlu: 3.079 ± 0.56
1.847ValPhe: 1.847 ± 0.577
1.642ValGly: 1.642 ± 0.696
1.847ValHis: 1.847 ± 0.354
2.463ValIle: 2.463 ± 0.721
3.489ValLys: 3.489 ± 1.146
5.337ValLeu: 5.337 ± 0.893
1.232ValMet: 1.232 ± 0.397
2.463ValAsn: 2.463 ± 0.562
3.079ValPro: 3.079 ± 1.402
2.668ValGln: 2.668 ± 0.513
2.053ValArg: 2.053 ± 0.486
3.489ValSer: 3.489 ± 0.864
4.31ValThr: 4.31 ± 1.046
2.258ValVal: 2.258 ± 0.943
0.205ValTrp: 0.205 ± 0.123
1.437ValTyr: 1.437 ± 0.366
0.0ValXaa: 0.0 ± 0.0
Trp
0.205TrpAla: 0.205 ± 0.217
0.0TrpCys: 0.0 ± 0.0
1.232TrpAsp: 1.232 ± 0.596
0.411TrpGlu: 0.411 ± 0.245
0.411TrpPhe: 0.411 ± 0.245
1.437TrpGly: 1.437 ± 0.829
0.205TrpHis: 0.205 ± 0.123
0.821TrpIle: 0.821 ± 0.313
0.616TrpLys: 0.616 ± 0.368
1.642TrpLeu: 1.642 ± 0.441
0.205TrpMet: 0.205 ± 0.123
0.0TrpAsn: 0.0 ± 0.0
0.411TrpPro: 0.411 ± 0.42
0.411TrpGln: 0.411 ± 0.415
0.616TrpArg: 0.616 ± 0.274
0.616TrpSer: 0.616 ± 0.253
1.437TrpThr: 1.437 ± 0.494
0.616TrpVal: 0.616 ± 0.307
0.205TrpTrp: 0.205 ± 0.217
0.821TrpTyr: 0.821 ± 0.491
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.642TyrAla: 1.642 ± 0.575
0.411TyrCys: 0.411 ± 0.245
2.053TyrAsp: 2.053 ± 0.348
1.232TyrGlu: 1.232 ± 0.379
1.437TyrPhe: 1.437 ± 0.467
0.821TyrGly: 0.821 ± 0.491
1.026TyrHis: 1.026 ± 0.482
1.232TyrIle: 1.232 ± 0.561
1.232TyrLys: 1.232 ± 0.615
3.695TyrLeu: 3.695 ± 1.094
1.026TyrMet: 1.026 ± 0.394
3.079TyrAsn: 3.079 ± 0.744
2.463TyrPro: 2.463 ± 0.514
1.437TyrGln: 1.437 ± 0.477
1.642TyrArg: 1.642 ± 0.541
2.258TyrSer: 2.258 ± 0.66
1.642TyrThr: 1.642 ± 0.601
1.026TyrVal: 1.026 ± 0.388
1.026TyrTrp: 1.026 ± 0.448
1.232TyrTyr: 1.232 ± 0.499
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4873 amino acids)
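If the table is only available as plain text in the form shown above (cell value, then the dipeptide name, then mean ± error), each row can be read back with a regular expression. This is a sketch that assumes every data row looks exactly like `3.695AlaAla: 3.695 ± 1.385`; the name `parse_row` is hypothetical, and the CSV file mentioned below is likely the more convenient source.

```python
import re

# One data row: cell value, two three-letter residue codes, then "mean ± error".
ROW = re.compile(
    r"^(?P<cell>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}):\s*"
    r"(?P<mean>\d+(?:\.\d+)?)\s*±\s*(?P<error>\d+(?:\.\d+)?)$"
)


def parse_row(line):
    """Return (first, second, mean_permille, error_permille), or None for other lines."""
    match = ROW.match(line.strip())
    if match is None:
        # Row labels such as "Ala" and the header line do not match.
        return None
    return (
        match["first"],
        match["second"],
        float(match["mean"]),
        float(match["error"]),
    )


example = parse_row("11.289LeuSer: 11.289 ± 1.04")
```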

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
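The exact bootstrap procedure is not described on this page, so the following is only a minimal sketch of one common reading: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation of the replicates is reported as the error. The function names, the fixed seed, and the toy sequences are assumptions.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter codes) over overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the number of proteins.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the Marburg virus proteome.
toy = ["MKLLSA", "LLSAGG", "MSTPLK", "AAGGLS"]
mean_ll = dipeptide_frequency(toy, "LL")
error_ll = bootstrap_error(toy, "LL")
```

With only a handful of proteins, as for the 7 proteins here, the replicates differ a lot from one another, which is one reason the errors in the table are large relative to the values.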

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski