Amino acid dipeptide frequency for Walleye dermal sarcoma virus (WDSV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
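As an illustration of the calculation described above, here is a minimal Python sketch. It is not the code used by the database: the function name `dipeptide_frequencies`, the one-letter alphabet, the mapping of any non-standard letter to `X` (Xaa), and the choice to count overlapping pairs within each protein only are all assumptions made for this example. The input is toy data, not the WDSV proteome.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes).
# Any other letter is mapped to "X" (Xaa) in this sketch.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Overlapping pairs are counted inside each protein, so no pair spans
    the boundary between two proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 21 x 21 = 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input for illustration only
toy_proteins = ["MKKLLA", "MAAB"]
freqs = dipeptide_frequencies(toy_proteins)
print(len(freqs))
print(freqs["KK"])
```

The result always has 441 entries, and the values sum to 1000‰ whenever at least one dipeptide was counted.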

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
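The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and comparing each value directly with 1/400 (2.5‰) is an assumption: the page does not state the exact rule used for the colouring, for example whether the error estimate is taken into account.

```python
# Uniform expectation: 1 of 400 dipeptides (standard amino acids only).
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value_per_mille):
    """Convert a table value given in per mille to a plain fraction."""
    return value_per_mille * 1e-3


def representation(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a value relative to the uniform expectation (hypothetical helper)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Two values taken from the table
for name, value in [("LeuLeu", 10.199), ("CysAla", 0.949)]:
    print(name, per_mille_to_fraction(value), representation(value))
```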

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue. Second-residue order: Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa
Ala
AlaAla: 6.167 ± 3.0
AlaCys: 2.846 ± 0.49
AlaAsp: 3.558 ± 0.545
AlaGlu: 4.507 ± 1.846
AlaPhe: 2.609 ± 0.816
AlaGly: 2.372 ± 0.427
AlaHis: 2.135 ± 0.501
AlaIle: 4.744 ± 0.989
AlaLys: 4.032 ± 0.526
AlaLeu: 4.981 ± 1.141
AlaMet: 1.186 ± 0.519
AlaAsn: 1.423 ± 0.683
AlaPro: 2.372 ± 0.563
AlaGln: 4.507 ± 1.265
AlaArg: 3.083 ± 1.201
AlaSer: 5.218 ± 1.612
AlaThr: 4.032 ± 0.709
AlaVal: 4.269 ± 1.056
AlaTrp: 1.423 ± 0.712
AlaTyr: 1.898 ± 0.622
AlaXaa: 0.237 ± 0.164
Cys
CysAla: 0.949 ± 0.191
CysCys: 0.0 ± 0.0
CysAsp: 1.66 ± 0.438
CysGlu: 0.712 ± 0.337
CysPhe: 0.949 ± 0.318
CysGly: 0.237 ± 0.301
CysHis: 0.712 ± 0.286
CysIle: 0.474 ± 0.181
CysLys: 0.949 ± 0.191
CysLeu: 1.66 ± 0.459
CysMet: 0.237 ± 0.164
CysAsn: 0.712 ± 0.381
CysPro: 2.609 ± 0.446
CysGln: 0.949 ± 0.43
CysArg: 1.186 ± 0.431
CysSer: 0.712 ± 0.337
CysThr: 0.949 ± 0.369
CysVal: 0.949 ± 0.314
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.898 ± 0.665
AspCys: 1.66 ± 0.439
AspAsp: 0.949 ± 0.525
AspGlu: 4.032 ± 0.728
AspPhe: 1.186 ± 0.423
AspGly: 2.372 ± 0.31
AspHis: 0.474 ± 0.181
AspIle: 4.032 ± 0.692
AspLys: 1.423 ± 0.284
AspLeu: 5.218 ± 1.22
AspMet: 0.712 ± 0.381
AspAsn: 1.186 ± 0.263
AspPro: 2.609 ± 0.939
AspGln: 2.372 ± 0.826
AspArg: 2.135 ± 0.305
AspSer: 2.372 ± 0.545
AspThr: 3.558 ± 0.576
AspVal: 1.898 ± 0.638
AspTrp: 1.423 ± 0.493
AspTyr: 1.898 ± 1.274
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.744 ± 1.774
GluCys: 0.474 ± 0.387
GluAsp: 0.712 ± 0.308
GluGlu: 3.083 ± 1.461
GluPhe: 1.423 ± 0.529
GluGly: 3.795 ± 0.496
GluHis: 0.949 ± 0.518
GluIle: 3.558 ± 0.63
GluLys: 2.846 ± 0.568
GluLeu: 3.795 ± 0.723
GluMet: 2.846 ± 0.684
GluAsn: 1.898 ± 0.469
GluPro: 2.609 ± 0.434
GluGln: 4.744 ± 0.974
GluArg: 2.135 ± 1.314
GluSer: 1.66 ± 0.512
GluThr: 3.321 ± 0.95
GluVal: 2.372 ± 0.681
GluTrp: 2.609 ± 0.753
GluTyr: 2.135 ± 0.519
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.186 ± 0.389
PheCys: 0.474 ± 0.24
PheAsp: 0.712 ± 0.337
PheGlu: 0.949 ± 0.362
PhePhe: 1.186 ± 0.263
PheGly: 0.949 ± 0.369
PheHis: 0.237 ± 0.164
PheIle: 4.032 ± 1.013
PheLys: 2.135 ± 0.565
PheLeu: 3.083 ± 0.62
PheMet: 0.474 ± 0.181
PheAsn: 0.949 ± 0.518
PhePro: 1.898 ± 0.469
PheGln: 1.66 ± 0.382
PheArg: 0.949 ± 0.622
PheSer: 3.795 ± 0.971
PheThr: 3.083 ± 0.442
PheVal: 1.66 ± 0.514
PheTrp: 0.237 ± 0.164
PheTyr: 0.237 ± 0.301
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.372 ± 0.847
GlyCys: 0.474 ± 0.387
GlyAsp: 1.186 ± 0.475
GlyGlu: 2.135 ± 0.904
GlyPhe: 1.423 ± 0.479
GlyGly: 3.558 ± 0.273
GlyHis: 3.321 ± 0.898
GlyIle: 3.083 ± 0.779
GlyLys: 1.423 ± 1.161
GlyLeu: 4.507 ± 0.381
GlyMet: 0.712 ± 0.431
GlyAsn: 4.032 ± 1.183
GlyPro: 2.135 ± 0.444
GlyGln: 1.186 ± 0.706
GlyArg: 3.795 ± 1.204
GlySer: 2.372 ± 0.313
GlyThr: 4.744 ± 0.976
GlyVal: 3.321 ± 0.583
GlyTrp: 0.712 ± 0.446
GlyTyr: 0.949 ± 0.191
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.558 ± 0.613
HisCys: 0.949 ± 0.503
HisAsp: 0.949 ± 0.657
HisGlu: 2.135 ± 0.749
HisPhe: 0.712 ± 0.308
HisGly: 1.66 ± 0.603
HisHis: 0.237 ± 0.193
HisIle: 0.949 ± 0.653
HisLys: 1.66 ± 0.905
HisLeu: 2.846 ± 1.039
HisMet: 0.474 ± 0.365
HisAsn: 0.949 ± 0.369
HisPro: 1.423 ± 0.307
HisGln: 1.898 ± 0.433
HisArg: 1.66 ± 0.587
HisSer: 2.372 ± 0.709
HisThr: 2.846 ± 0.672
HisVal: 1.186 ± 0.429
HisTrp: 1.423 ± 0.56
HisTyr: 1.186 ± 0.583
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.218 ± 0.923
IleCys: 1.186 ± 0.45
IleAsp: 4.269 ± 0.687
IleGlu: 2.846 ± 0.958
IlePhe: 1.423 ± 0.395
IleGly: 2.609 ± 0.573
IleHis: 2.372 ± 1.17
IleIle: 4.981 ± 0.723
IleLys: 5.693 ± 1.081
IleLeu: 4.981 ± 0.937
IleMet: 2.372 ± 0.821
IleAsn: 3.558 ± 0.842
IlePro: 5.93 ± 1.689
IleGln: 4.032 ± 1.065
IleArg: 2.135 ± 0.556
IleSer: 4.981 ± 1.153
IleThr: 6.879 ± 2.625
IleVal: 4.269 ± 0.84
IleTrp: 0.474 ± 0.856
IleTyr: 1.898 ± 0.436
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.083 ± 1.038
LysCys: 0.474 ± 0.181
LysAsp: 4.032 ± 0.921
LysGlu: 1.898 ± 0.774
LysPhe: 2.135 ± 0.658
LysGly: 2.609 ± 0.961
LysHis: 2.609 ± 0.461
LysIle: 4.744 ± 1.294
LysLys: 6.167 ± 1.566
LysLeu: 3.795 ± 0.592
LysMet: 0.712 ± 0.275
LysAsn: 3.558 ± 0.735
LysPro: 3.558 ± 0.605
LysGln: 4.507 ± 0.999
LysArg: 2.372 ± 1.003
LysSer: 0.949 ± 0.657
LysThr: 5.693 ± 1.129
LysVal: 3.321 ± 0.756
LysTrp: 1.186 ± 0.45
LysTyr: 0.949 ± 0.518
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.167 ± 0.455
LeuCys: 0.949 ± 0.453
LeuAsp: 1.898 ± 0.756
LeuGlu: 4.032 ± 0.709
LeuPhe: 4.032 ± 0.658
LeuGly: 3.321 ± 0.987
LeuHis: 2.609 ± 0.815
LeuIle: 5.455 ± 1.171
LeuLys: 4.744 ± 0.859
LeuLeu: 10.199 ± 2.353
LeuMet: 1.423 ± 0.525
LeuAsn: 3.558 ± 1.13
LeuPro: 8.065 ± 0.807
LeuGln: 7.116 ± 1.259
LeuArg: 5.218 ± 0.921
LeuSer: 5.93 ± 1.699
LeuThr: 8.776 ± 2.055
LeuVal: 5.218 ± 1.166
LeuTrp: 1.423 ± 0.42
LeuTyr: 1.423 ± 0.588
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.66 ± 0.837
MetCys: 0.237 ± 0.164
MetAsp: 2.372 ± 0.59
MetGlu: 1.186 ± 0.356
MetPhe: 0.712 ± 0.903
MetGly: 2.846 ± 0.773
MetHis: 0.949 ± 0.314
MetIle: 0.949 ± 0.548
MetLys: 0.712 ± 0.493
MetLeu: 3.083 ± 0.805
MetMet: 0.474 ± 0.501
MetAsn: 1.66 ± 0.608
MetPro: 1.186 ± 0.502
MetGln: 0.712 ± 0.28
MetArg: 1.423 ± 0.543
MetSer: 1.186 ± 0.282
MetThr: 1.66 ± 0.244
MetVal: 1.66 ± 0.385
MetTrp: 0.0 ± 0.0
MetTyr: 0.712 ± 0.706
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.66 ± 0.72
AsnCys: 0.712 ± 0.218
AsnAsp: 0.712 ± 0.454
AsnGlu: 1.423 ± 0.513
AsnPhe: 1.423 ± 0.513
AsnGly: 2.135 ± 0.784
AsnHis: 1.423 ± 0.284
AsnIle: 3.795 ± 0.7
AsnLys: 4.032 ± 0.851
AsnLeu: 3.558 ± 0.645
AsnMet: 2.609 ± 0.75
AsnAsn: 2.609 ± 0.675
AsnPro: 3.321 ± 0.358
AsnGln: 3.558 ± 0.888
AsnArg: 1.186 ± 0.52
AsnSer: 3.083 ± 1.048
AsnThr: 3.795 ± 0.477
AsnVal: 1.186 ± 0.395
AsnTrp: 0.474 ± 0.358
AsnTyr: 1.186 ± 0.417
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.744 ± 1.118
ProCys: 0.949 ± 0.43
ProAsp: 4.744 ± 1.479
ProGlu: 3.558 ± 0.812
ProPhe: 3.321 ± 0.179
ProGly: 3.795 ± 0.527
ProHis: 1.186 ± 0.418
ProIle: 5.693 ± 0.793
ProLys: 2.372 ± 0.87
ProLeu: 7.353 ± 0.512
ProMet: 1.66 ± 0.493
ProAsn: 1.898 ± 0.745
ProPro: 6.167 ± 1.729
ProGln: 4.507 ± 1.252
ProArg: 2.135 ± 0.838
ProSer: 4.032 ± 0.364
ProThr: 5.455 ± 0.782
ProVal: 3.558 ± 0.657
ProTrp: 0.474 ± 0.181
ProTyr: 2.372 ± 0.64
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.507 ± 1.04
GlnCys: 0.474 ± 0.387
GlnAsp: 1.898 ± 0.361
GlnGlu: 3.321 ± 0.632
GlnPhe: 1.186 ± 0.437
GlnGly: 3.083 ± 0.443
GlnHis: 3.321 ± 1.147
GlnIle: 4.744 ± 0.749
GlnLys: 3.558 ± 1.076
GlnLeu: 7.59 ± 1.126
GlnMet: 0.949 ± 0.354
GlnAsn: 3.321 ± 1.487
GlnPro: 2.846 ± 0.482
GlnGln: 9.25 ± 1.805
GlnArg: 3.795 ± 0.762
GlnSer: 1.66 ± 0.244
GlnThr: 5.693 ± 1.022
GlnVal: 3.083 ± 0.8
GlnTrp: 0.712 ± 0.275
GlnTyr: 2.846 ± 0.918
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.321 ± 0.588
ArgCys: 0.474 ± 0.602
ArgAsp: 2.846 ± 0.561
ArgGlu: 3.795 ± 1.494
ArgPhe: 0.237 ± 0.164
ArgGly: 2.135 ± 0.527
ArgHis: 0.949 ± 0.369
ArgIle: 2.609 ± 1.174
ArgLys: 2.609 ± 0.675
ArgLeu: 1.66 ± 0.793
ArgMet: 1.66 ± 0.563
ArgAsn: 3.321 ± 0.984
ArgPro: 3.558 ± 1.558
ArgGln: 3.795 ± 0.771
ArgArg: 1.898 ± 0.67
ArgSer: 3.558 ± 0.799
ArgThr: 2.372 ± 0.416
ArgVal: 2.135 ± 0.548
ArgTrp: 0.474 ± 0.416
ArgTyr: 0.474 ± 0.301
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.744 ± 0.728
SerCys: 0.949 ± 0.314
SerAsp: 3.558 ± 1.175
SerGlu: 2.135 ± 0.416
SerPhe: 1.898 ± 0.246
SerGly: 2.372 ± 0.711
SerHis: 1.186 ± 0.356
SerIle: 2.609 ± 0.662
SerLys: 3.321 ± 1.425
SerLeu: 6.404 ± 1.249
SerMet: 0.712 ± 0.269
SerAsn: 1.898 ± 0.665
SerPro: 4.744 ± 1.116
SerGln: 4.269 ± 0.586
SerArg: 2.372 ± 0.844
SerSer: 4.507 ± 1.275
SerThr: 4.507 ± 0.661
SerVal: 2.609 ± 0.522
SerTrp: 0.712 ± 0.337
SerTyr: 2.135 ± 0.576
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.827 ± 1.616
ThrCys: 2.135 ± 0.831
ThrAsp: 4.507 ± 0.711
ThrGlu: 4.744 ± 1.391
ThrPhe: 1.423 ± 0.455
ThrGly: 2.846 ± 0.899
ThrHis: 3.558 ± 1.316
ThrIle: 7.116 ± 1.159
ThrLys: 4.744 ± 0.796
ThrLeu: 4.507 ± 1.068
ThrMet: 3.321 ± 0.935
ThrAsn: 3.083 ± 0.436
ThrPro: 6.404 ± 1.144
ThrGln: 3.558 ± 0.65
ThrArg: 3.795 ± 0.897
ThrSer: 3.083 ± 0.725
ThrThr: 5.93 ± 0.547
ThrVal: 5.693 ± 1.36
ThrTrp: 0.949 ± 0.57
ThrTyr: 3.083 ± 0.441
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.898 ± 0.663
ValCys: 0.712 ± 0.28
ValAsp: 1.423 ± 0.728
ValGlu: 3.083 ± 0.481
ValPhe: 1.66 ± 0.567
ValGly: 2.846 ± 0.563
ValHis: 0.712 ± 0.337
ValIle: 5.693 ± 1.277
ValLys: 3.795 ± 0.74
ValLeu: 7.59 ± 0.916
ValMet: 1.186 ± 0.656
ValAsn: 1.423 ± 0.543
ValPro: 4.269 ± 0.651
ValGln: 2.846 ± 0.684
ValArg: 1.66 ± 0.376
ValSer: 3.321 ± 0.833
ValThr: 4.981 ± 1.305
ValVal: 3.558 ± 0.737
ValTrp: 0.949 ± 0.605
ValTyr: 1.186 ± 0.721
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.474 ± 0.329
TrpCys: 0.237 ± 0.164
TrpAsp: 0.474 ± 0.602
TrpGlu: 1.423 ± 0.307
TrpPhe: 0.474 ± 0.181
TrpGly: 0.949 ± 0.191
TrpHis: 0.237 ± 0.428
TrpIle: 1.423 ± 0.375
TrpLys: 0.949 ± 0.408
TrpLeu: 1.898 ± 1.191
TrpMet: 0.237 ± 0.164
TrpAsn: 0.712 ± 0.58
TrpPro: 0.949 ± 0.391
TrpGln: 0.949 ± 0.335
TrpArg: 0.0 ± 0.0
TrpSer: 1.186 ± 0.502
TrpThr: 1.186 ± 0.418
TrpVal: 0.949 ± 0.645
TrpTrp: 0.237 ± 0.193
TrpTyr: 1.186 ± 0.558
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.423 ± 0.551
TyrCys: 0.237 ± 0.164
TyrAsp: 0.712 ± 0.446
TyrGlu: 0.949 ± 0.518
TyrPhe: 0.474 ± 0.301
TyrGly: 0.949 ± 0.369
TyrHis: 1.898 ± 1.131
TyrIle: 1.423 ± 0.909
TyrLys: 1.186 ± 0.431
TyrLeu: 2.846 ± 0.648
TyrMet: 1.186 ± 0.423
TyrAsn: 2.135 ± 0.58
TyrPro: 3.321 ± 1.338
TyrGln: 1.423 ± 0.421
TyrArg: 0.949 ± 0.408
TyrSer: 1.66 ± 0.585
TyrThr: 2.846 ± 1.307
TyrVal: 1.898 ± 0.736
TyrTrp: 0.237 ± 0.367
TyrTyr: 0.474 ± 0.24
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.237 ± 0.164
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4217 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski