Amino acid dipeptide frequency for Wenling hoplichthys paramyxovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
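The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names, the choice to count overlapping pairs within each protein separately, and the simple "above or below the uniform expectation" rule for marking a dipeptide are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def expected_permille(n_dipeptides):
    """Uniform expectation in per mille: 2.5 for 400 dipeptides."""
    return 1000.0 / n_dipeptides


def dipeptide_permille(proteins, include_xaa=False):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping residue pairs are counted inside each protein,
    and pairs never span two proteins.
    """
    alphabet = STANDARD + ("X" if include_xaa else "")
    counts = Counter()
    for seq in proteins:
        # Any letter outside the 20 standard ones is treated as X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    pairs = ["".join(p) for p in product(alphabet, repeat=2)]
    # Pairs outside the chosen alphabet (e.g. with X) are left out of the total.
    total = sum(counts[p] for p in pairs)
    return {p: (1000.0 * counts[p] / total if total else 0.0) for p in pairs}


def classify(freqs):
    """Label each dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(len(freqs))
    return {
        p: "over" if f > expected else "under" if f < expected else "expected"
        for p, f in freqs.items()
    }


if __name__ == "__main__":
    toy = ["ACAC", "AAX"]  # made-up sequences, for illustration only
    freqs = dipeptide_permille(toy)
    print(freqs["AC"], classify(freqs)["AC"])
```

With `include_xaa=True` the dictionary has 441 entries instead of 400, and the uniform expectation drops accordingly.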

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.813 ± 1.434
AlaCys: 1.059 ± 0.474
AlaAsp: 2.33 ± 0.455
AlaGlu: 4.448 ± 1.093
AlaPhe: 1.059 ± 0.814
AlaGly: 3.389 ± 0.534
AlaHis: 1.483 ± 0.703
AlaIle: 3.601 ± 0.827
AlaLys: 4.448 ± 1.168
AlaLeu: 6.355 ± 1.724
AlaMet: 2.118 ± 0.803
AlaAsn: 2.118 ± 0.647
AlaPro: 2.118 ± 0.7
AlaGln: 1.695 ± 0.411
AlaArg: 3.389 ± 1.082
AlaSer: 4.448 ± 1.447
AlaThr: 3.389 ± 1.371
AlaVal: 2.965 ± 0.921
AlaTrp: 0.212 ± 0.135
AlaTyr: 1.695 ± 0.816
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.118 ± 0.797
CysCys: 0.424 ± 0.22
CysAsp: 1.271 ± 0.495
CysGlu: 1.059 ± 0.549
CysPhe: 0.212 ± 0.232
CysGly: 0.847 ± 0.455
CysHis: 0.212 ± 0.135
CysIle: 1.059 ± 0.531
CysLys: 2.118 ± 0.941
CysLeu: 2.754 ± 0.422
CysMet: 0.0 ± 0.0
CysAsn: 0.847 ± 0.476
CysPro: 1.059 ± 0.559
CysGln: 0.635 ± 0.304
CysArg: 0.847 ± 0.249
CysSer: 1.906 ± 0.55
CysThr: 2.965 ± 0.711
CysVal: 1.271 ± 0.475
CysTrp: 0.424 ± 0.306
CysTyr: 1.059 ± 0.248
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.33 ± 0.512
AspCys: 1.271 ± 0.522
AspAsp: 1.906 ± 0.637
AspGlu: 4.025 ± 0.51
AspPhe: 1.059 ± 0.443
AspGly: 1.695 ± 0.557
AspHis: 1.059 ± 0.603
AspIle: 3.601 ± 0.504
AspLys: 3.389 ± 0.669
AspLeu: 6.778 ± 1.138
AspMet: 1.695 ± 0.528
AspAsn: 1.906 ± 0.561
AspPro: 3.177 ± 0.884
AspGln: 1.483 ± 0.584
AspArg: 2.542 ± 0.692
AspSer: 3.813 ± 0.585
AspThr: 3.177 ± 0.38
AspVal: 2.754 ± 0.5
AspTrp: 0.847 ± 0.249
AspTyr: 1.906 ± 0.849
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.601 ± 1.073
GluCys: 1.271 ± 0.513
GluAsp: 3.389 ± 1.463
GluGlu: 7.837 ± 2.659
GluPhe: 1.695 ± 0.351
GluGly: 5.295 ± 0.773
GluHis: 0.635 ± 0.348
GluIle: 7.414 ± 1.411
GluLys: 4.448 ± 1.291
GluLeu: 6.143 ± 1.543
GluMet: 3.177 ± 0.749
GluAsn: 2.965 ± 1.26
GluPro: 2.118 ± 0.85
GluGln: 2.118 ± 0.398
GluArg: 2.754 ± 0.756
GluSer: 6.99 ± 0.954
GluThr: 4.66 ± 0.75
GluVal: 4.025 ± 0.881
GluTrp: 1.271 ± 0.623
GluTyr: 0.847 ± 0.431
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.483 ± 0.53
PheCys: 1.483 ± 0.676
PheAsp: 2.754 ± 0.632
PheGlu: 2.118 ± 0.698
PhePhe: 0.635 ± 0.348
PheGly: 1.906 ± 0.596
PheHis: 0.212 ± 0.266
PheIle: 2.965 ± 0.725
PheLys: 1.695 ± 0.411
PheLeu: 4.025 ± 0.837
PheMet: 1.271 ± 0.404
PheAsn: 0.847 ± 0.398
PhePro: 1.271 ± 0.431
PheGln: 1.483 ± 0.584
PheArg: 1.906 ± 0.624
PheSer: 2.33 ± 0.87
PheThr: 1.059 ± 0.579
PheVal: 2.33 ± 0.485
PheTrp: 0.212 ± 0.32
PheTyr: 1.483 ± 1.34
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.965 ± 0.907
GlyCys: 1.906 ± 0.7
GlyAsp: 3.177 ± 0.818
GlyGlu: 5.507 ± 1.458
GlyPhe: 1.271 ± 0.342
GlyGly: 7.626 ± 1.913
GlyHis: 1.695 ± 0.632
GlyIle: 5.295 ± 0.959
GlyLys: 4.448 ± 1.142
GlyLeu: 6.355 ± 1.162
GlyMet: 2.542 ± 0.612
GlyAsn: 3.601 ± 1.649
GlyPro: 2.118 ± 0.423
GlyGln: 1.906 ± 0.531
GlyArg: 2.33 ± 0.562
GlySer: 5.719 ± 0.678
GlyThr: 4.872 ± 1.421
GlyVal: 4.448 ± 1.816
GlyTrp: 0.635 ± 0.404
GlyTyr: 2.754 ± 0.935
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.847 ± 0.617
HisCys: 0.212 ± 0.279
HisAsp: 0.424 ± 0.269
HisGlu: 1.483 ± 0.688
HisPhe: 0.635 ± 0.376
HisGly: 0.635 ± 0.404
HisHis: 0.424 ± 0.259
HisIle: 2.118 ± 0.551
HisLys: 1.695 ± 0.745
HisLeu: 1.906 ± 0.804
HisMet: 0.635 ± 0.274
HisAsn: 0.0 ± 0.0
HisPro: 0.847 ± 0.278
HisGln: 0.212 ± 0.232
HisArg: 1.059 ± 0.591
HisSer: 1.059 ± 0.87
HisThr: 0.424 ± 0.385
HisVal: 1.059 ± 0.507
HisTrp: 0.0 ± 0.0
HisTyr: 0.212 ± 0.32
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.813 ± 1.016
IleCys: 1.483 ± 0.569
IleAsp: 3.177 ± 0.696
IleGlu: 3.813 ± 0.705
IlePhe: 2.542 ± 0.588
IleGly: 4.236 ± 1.071
IleHis: 1.483 ± 0.553
IleIle: 6.355 ± 1.053
IleLys: 6.566 ± 1.279
IleLeu: 4.66 ± 0.54
IleMet: 1.906 ± 0.607
IleAsn: 4.448 ± 1.183
IlePro: 4.025 ± 0.915
IleGln: 4.236 ± 0.828
IleArg: 4.872 ± 0.991
IleSer: 6.99 ± 1.024
IleThr: 5.084 ± 1.291
IleVal: 3.601 ± 0.557
IleTrp: 2.118 ± 0.64
IleTyr: 2.542 ± 0.582
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.025 ± 0.927
LysCys: 1.483 ± 0.466
LysAsp: 3.389 ± 1.06
LysGlu: 6.355 ± 1.651
LysPhe: 2.542 ± 0.604
LysGly: 4.236 ± 0.73
LysHis: 1.059 ± 0.563
LysIle: 5.931 ± 1.572
LysLys: 5.084 ± 1.116
LysLeu: 4.025 ± 1.423
LysMet: 4.025 ± 0.95
LysAsn: 3.601 ± 0.857
LysPro: 1.271 ± 0.474
LysGln: 2.118 ± 0.703
LysArg: 3.601 ± 0.731
LysSer: 5.084 ± 1.778
LysThr: 3.177 ± 1.259
LysVal: 3.813 ± 0.764
LysTrp: 0.635 ± 0.426
LysTyr: 2.33 ± 0.817
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.084 ± 1.668
LeuCys: 2.118 ± 0.613
LeuAsp: 3.813 ± 0.907
LeuGlu: 4.872 ± 1.644
LeuPhe: 3.177 ± 0.732
LeuGly: 7.202 ± 1.279
LeuHis: 1.483 ± 0.534
LeuIle: 6.143 ± 0.885
LeuLys: 5.931 ± 1.145
LeuLeu: 8.473 ± 1.263
LeuMet: 2.965 ± 0.663
LeuAsn: 2.965 ± 0.833
LeuPro: 2.33 ± 0.68
LeuGln: 2.33 ± 0.557
LeuArg: 5.507 ± 1.465
LeuSer: 9.32 ± 1.006
LeuThr: 7.626 ± 1.141
LeuVal: 6.778 ± 2.098
LeuTrp: 0.635 ± 0.304
LeuTyr: 1.483 ± 0.523
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.754 ± 0.667
MetCys: 0.424 ± 0.269
MetAsp: 2.965 ± 1.014
MetGlu: 1.483 ± 0.929
MetPhe: 0.424 ± 0.219
MetGly: 3.601 ± 0.952
MetHis: 0.424 ± 0.337
MetIle: 2.754 ± 0.548
MetLys: 2.542 ± 0.839
MetLeu: 1.906 ± 0.351
MetMet: 1.059 ± 0.621
MetAsn: 1.271 ± 0.57
MetPro: 0.635 ± 0.279
MetGln: 0.635 ± 0.248
MetArg: 2.118 ± 0.589
MetSer: 2.965 ± 0.833
MetThr: 1.483 ± 0.347
MetVal: 1.695 ± 0.529
MetTrp: 0.635 ± 0.285
MetTyr: 1.059 ± 0.466
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.483 ± 0.458
AsnCys: 1.059 ± 0.248
AsnAsp: 1.271 ± 0.466
AsnGlu: 3.813 ± 0.582
AsnPhe: 2.965 ± 0.579
AsnGly: 3.177 ± 0.716
AsnHis: 0.635 ± 0.521
AsnIle: 3.389 ± 0.629
AsnLys: 2.754 ± 1.118
AsnLeu: 5.295 ± 1.409
AsnMet: 1.695 ± 0.545
AsnAsn: 1.695 ± 0.797
AsnPro: 2.33 ± 0.379
AsnGln: 1.483 ± 0.478
AsnArg: 1.059 ± 0.563
AsnSer: 2.965 ± 0.898
AsnThr: 2.965 ± 0.834
AsnVal: 1.059 ± 0.453
AsnTrp: 0.635 ± 0.376
AsnTyr: 1.059 ± 0.443
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.33 ± 0.601
ProCys: 0.212 ± 0.135
ProAsp: 1.483 ± 0.55
ProGlu: 2.542 ± 0.692
ProPhe: 2.754 ± 0.827
ProGly: 1.906 ± 0.95
ProHis: 0.424 ± 0.269
ProIle: 3.177 ± 0.585
ProLys: 1.483 ± 0.404
ProLeu: 3.177 ± 0.856
ProMet: 0.424 ± 0.255
ProAsn: 1.483 ± 0.388
ProPro: 1.483 ± 0.356
ProGln: 1.059 ± 0.505
ProArg: 3.177 ± 0.841
ProSer: 4.025 ± 1.064
ProThr: 1.695 ± 0.441
ProVal: 1.483 ± 0.893
ProTrp: 0.635 ± 0.514
ProTyr: 1.906 ± 0.634
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.906 ± 0.468
GlnCys: 0.847 ± 0.375
GlnAsp: 2.33 ± 0.9
GlnGlu: 2.542 ± 0.883
GlnPhe: 1.695 ± 0.558
GlnGly: 2.542 ± 0.537
GlnHis: 0.212 ± 0.266
GlnIle: 2.542 ± 0.711
GlnLys: 1.271 ± 0.359
GlnLeu: 1.695 ± 0.497
GlnMet: 0.424 ± 0.248
GlnAsn: 1.059 ± 0.543
GlnPro: 1.483 ± 0.475
GlnGln: 1.271 ± 0.403
GlnArg: 1.695 ± 0.49
GlnSer: 2.33 ± 0.825
GlnThr: 1.906 ± 0.654
GlnVal: 1.271 ± 0.425
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.424 ± 0.269
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.389 ± 0.844
ArgCys: 1.271 ± 0.43
ArgAsp: 3.177 ± 0.769
ArgGlu: 5.084 ± 1.335
ArgPhe: 2.33 ± 0.734
ArgGly: 2.542 ± 0.717
ArgHis: 0.635 ± 0.404
ArgIle: 4.448 ± 0.906
ArgLys: 2.754 ± 0.675
ArgLeu: 5.719 ± 0.855
ArgMet: 1.271 ± 0.549
ArgAsn: 2.542 ± 1.068
ArgPro: 1.695 ± 0.635
ArgGln: 0.847 ± 0.375
ArgArg: 2.542 ± 0.583
ArgSer: 5.719 ± 1.29
ArgThr: 4.236 ± 0.981
ArgVal: 2.542 ± 0.498
ArgTrp: 1.059 ± 0.492
ArgTyr: 1.695 ± 0.604
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.295 ± 1.455
SerCys: 2.118 ± 1.078
SerAsp: 4.872 ± 1.484
SerGlu: 7.414 ± 1.344
SerPhe: 3.177 ± 0.534
SerGly: 6.566 ± 1.256
SerHis: 0.635 ± 0.618
SerIle: 5.295 ± 0.828
SerLys: 5.719 ± 0.873
SerLeu: 8.473 ± 1.279
SerMet: 2.542 ± 0.798
SerAsn: 4.236 ± 0.843
SerPro: 3.389 ± 0.842
SerGln: 2.754 ± 0.741
SerArg: 5.084 ± 1.046
SerSer: 5.719 ± 1.493
SerThr: 4.66 ± 1.031
SerVal: 4.448 ± 0.793
SerTrp: 0.635 ± 0.274
SerTyr: 1.906 ± 0.594
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.025 ± 1.137
ThrCys: 1.906 ± 0.92
ThrAsp: 4.025 ± 1.148
ThrGlu: 2.542 ± 0.545
ThrPhe: 1.271 ± 0.597
ThrGly: 6.99 ± 1.238
ThrHis: 0.847 ± 0.442
ThrIle: 6.143 ± 1.755
ThrLys: 5.084 ± 1.237
ThrLeu: 4.872 ± 1.362
ThrMet: 1.271 ± 0.349
ThrAsn: 2.754 ± 0.721
ThrPro: 2.118 ± 0.313
ThrGln: 1.271 ± 0.642
ThrArg: 5.719 ± 1.14
ThrSer: 6.143 ± 1.616
ThrThr: 2.754 ± 0.535
ThrVal: 2.754 ± 0.691
ThrTrp: 0.424 ± 0.269
ThrTyr: 2.118 ± 0.635
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.601 ± 0.817
ValCys: 1.271 ± 0.884
ValAsp: 1.906 ± 0.582
ValGlu: 2.542 ± 0.673
ValPhe: 2.33 ± 1.398
ValGly: 4.025 ± 1.071
ValHis: 0.847 ± 0.613
ValIle: 3.177 ± 0.957
ValLys: 3.389 ± 1.134
ValLeu: 4.448 ± 0.894
ValMet: 2.33 ± 0.672
ValAsn: 2.542 ± 0.712
ValPro: 1.059 ± 0.466
ValGln: 0.635 ± 0.565
ValArg: 3.601 ± 0.435
ValSer: 4.872 ± 0.96
ValThr: 5.295 ± 1.355
ValVal: 2.754 ± 0.773
ValTrp: 0.635 ± 0.304
ValTyr: 1.695 ± 0.451
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.847 ± 0.278
TrpCys: 0.847 ± 0.375
TrpAsp: 0.847 ± 0.538
TrpGlu: 0.635 ± 0.348
TrpPhe: 0.0 ± 0.0
TrpGly: 1.059 ± 0.389
TrpHis: 0.212 ± 0.135
TrpIle: 0.424 ± 0.269
TrpLys: 1.271 ± 0.643
TrpLeu: 1.059 ± 0.612
TrpMet: 0.212 ± 0.305
TrpAsn: 1.271 ± 0.647
TrpPro: 0.424 ± 0.219
TrpGln: 0.0 ± 0.0
TrpArg: 0.424 ± 0.274
TrpSer: 0.847 ± 0.455
TrpThr: 1.271 ± 0.431
TrpVal: 0.424 ± 0.259
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.424 ± 0.269
TyrCys: 0.424 ± 0.215
TyrAsp: 1.906 ± 0.753
TyrGlu: 2.542 ± 1.329
TyrPhe: 1.695 ± 0.664
TyrGly: 1.695 ± 0.384
TyrHis: 1.271 ± 0.628
TyrIle: 2.118 ± 0.512
TyrLys: 1.695 ± 0.444
TyrLeu: 2.118 ± 0.765
TyrMet: 1.059 ± 0.373
TyrAsn: 0.847 ± 0.557
TyrPro: 1.906 ± 0.582
TyrGln: 1.271 ± 0.503
TyrArg: 1.271 ± 0.41
TyrSer: 1.695 ± 0.411
TyrThr: 2.33 ± 0.385
TyrVal: 1.483 ± 0.524
TyrTrp: 0.424 ± 0.35
TyrTyr: 0.635 ± 0.333
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (4722 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
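As a rough illustration of what a protein-level bootstrap can look like, the sketch below resamples whole proteins with replacement 100 times and reports the spread of each dipeptide frequency across the replicates. The page does not describe the procedure in more detail, so the function names, the use of the sample standard deviation as the error, and the fixed random seed are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins):
    """Dipeptide frequencies in per mille from overlapping pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, n_boot=100, seed=0):
    """Spread of each dipeptide frequency over n_boot protein-level resamples.

    Assumption: the error is the sample standard deviation across replicates.
    Requires n_boot >= 2.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample))
    pairs = set().union(*replicates)
    # A dipeptide absent from a replicate contributes a frequency of 0.
    return {
        pair: statistics.stdev([rep.get(pair, 0.0) for rep in replicates])
        for pair in pairs
    }


if __name__ == "__main__":
    toy = ["ACDAC", "AAGAA", "CCDCC"]  # made-up sequences, for illustration only
    errors = bootstrap_error(toy)
    print(sorted(errors.items())[:3])
```

Resampling at the protein level keeps each sequence intact, so the estimate reflects how much the frequencies depend on which proteins happen to be in the set; with only a few proteins the resulting errors can be large relative to the means.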

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski