Amino acid dipepetide frequency for Sparus aurata papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.394AlaCys: 0.394 ± 0.389
3.547AlaAsp: 3.547 ± 1.224
7.489AlaGlu: 7.489 ± 1.983
0.788AlaPhe: 0.788 ± 0.703
7.883AlaGly: 7.883 ± 2.32
1.182AlaHis: 1.182 ± 0.467
3.547AlaIle: 3.547 ± 1.028
5.912AlaLys: 5.912 ± 1.662
3.153AlaLeu: 3.153 ± 0.807
0.394AlaMet: 0.394 ± 0.351
1.182AlaAsn: 1.182 ± 0.48
0.394AlaPro: 0.394 ± 0.624
1.182AlaGln: 1.182 ± 0.662
1.577AlaArg: 1.577 ± 0.424
3.942AlaSer: 3.942 ± 0.544
0.788AlaThr: 0.788 ± 0.46
3.153AlaVal: 3.153 ± 0.929
0.394AlaTrp: 0.394 ± 0.351
1.577AlaTyr: 1.577 ± 0.777
0.0AlaXaa: 0.0 ± 0.0
Cys
1.182CysAla: 1.182 ± 0.554
0.788CysCys: 0.788 ± 0.507
1.182CysAsp: 1.182 ± 0.813
0.788CysGlu: 0.788 ± 0.46
1.182CysPhe: 1.182 ± 0.597
0.0CysGly: 0.0 ± 0.0
0.394CysHis: 0.394 ± 0.477
0.394CysIle: 0.394 ± 0.386
1.182CysLys: 1.182 ± 0.554
2.365CysLeu: 2.365 ± 0.905
0.0CysMet: 0.0 ± 0.0
1.577CysAsn: 1.577 ± 0.424
0.788CysPro: 0.788 ± 0.487
0.0CysGln: 0.0 ± 0.0
1.577CysArg: 1.577 ± 0.659
0.788CysSer: 0.788 ± 0.703
1.577CysThr: 1.577 ± 0.476
1.971CysVal: 1.971 ± 0.779
0.0CysTrp: 0.0 ± 0.0
0.394CysTyr: 0.394 ± 0.351
0.0CysXaa: 0.0 ± 0.0
Asp
3.547AspAla: 3.547 ± 0.994
1.577AspCys: 1.577 ± 0.657
5.518AspAsp: 5.518 ± 3.449
3.942AspGlu: 3.942 ± 1.042
1.971AspPhe: 1.971 ± 0.936
2.365AspGly: 2.365 ± 0.516
0.0AspHis: 0.0 ± 0.0
3.942AspIle: 3.942 ± 1.161
1.182AspLys: 1.182 ± 0.467
5.912AspLeu: 5.912 ± 2.348
0.788AspMet: 0.788 ± 0.79
3.547AspAsn: 3.547 ± 1.244
5.518AspPro: 5.518 ± 0.749
0.394AspGln: 0.394 ± 0.389
0.394AspArg: 0.394 ± 0.351
4.73AspSer: 4.73 ± 1.008
1.971AspThr: 1.971 ± 0.555
3.547AspVal: 3.547 ± 0.749
1.182AspTrp: 1.182 ± 0.813
1.182AspTyr: 1.182 ± 0.48
0.0AspXaa: 0.0 ± 0.0
Glu
3.942GluAla: 3.942 ± 0.818
0.0GluCys: 0.0 ± 0.0
2.365GluAsp: 2.365 ± 2.109
9.066GluGlu: 9.066 ± 2.222
5.124GluPhe: 5.124 ± 0.94
5.124GluGly: 5.124 ± 1.12
1.971GluHis: 1.971 ± 0.718
3.153GluIle: 3.153 ± 1.28
5.518GluLys: 5.518 ± 1.002
4.73GluLeu: 4.73 ± 2.18
2.365GluMet: 2.365 ± 0.68
3.153GluAsn: 3.153 ± 1.174
3.547GluPro: 3.547 ± 1.331
2.759GluGln: 2.759 ± 0.53
5.124GluArg: 5.124 ± 1.099
3.547GluSer: 3.547 ± 0.848
6.307GluThr: 6.307 ± 1.663
3.942GluVal: 3.942 ± 2.163
0.788GluTrp: 0.788 ± 0.4
1.971GluTyr: 1.971 ± 0.577
0.0GluXaa: 0.0 ± 0.0
Phe
0.788PheAla: 0.788 ± 0.426
1.577PheCys: 1.577 ± 1.406
1.577PheAsp: 1.577 ± 1.188
0.788PheGlu: 0.788 ± 0.714
1.577PhePhe: 1.577 ± 0.657
1.182PheGly: 1.182 ± 0.467
1.577PheHis: 1.577 ± 0.717
2.759PheIle: 2.759 ± 0.784
3.942PheLys: 3.942 ± 1.262
3.942PheLeu: 3.942 ± 1.408
1.971PheMet: 1.971 ± 0.853
3.153PheAsn: 3.153 ± 0.955
0.394PhePro: 0.394 ± 0.351
2.365PheGln: 2.365 ± 1.26
2.759PheArg: 2.759 ± 0.879
1.971PheSer: 1.971 ± 1.03
3.547PheThr: 3.547 ± 0.809
3.153PheVal: 3.153 ± 0.965
0.0PheTrp: 0.0 ± 0.0
0.394PheTyr: 0.394 ± 0.389
0.0PheXaa: 0.0 ± 0.0
Gly
2.365GlyAla: 2.365 ± 0.789
1.577GlyCys: 1.577 ± 0.657
3.942GlyAsp: 3.942 ± 1.126
3.153GlyGlu: 3.153 ± 1.299
3.942GlyPhe: 3.942 ± 1.094
2.759GlyGly: 2.759 ± 1.036
1.971GlyHis: 1.971 ± 0.853
4.336GlyIle: 4.336 ± 0.919
3.153GlyLys: 3.153 ± 1.44
4.336GlyLeu: 4.336 ± 1.061
1.971GlyMet: 1.971 ± 0.735
5.912GlyAsn: 5.912 ± 1.145
1.577GlyPro: 1.577 ± 0.733
0.394GlyGln: 0.394 ± 0.351
4.336GlyArg: 4.336 ± 2.126
4.336GlySer: 4.336 ± 1.328
4.336GlyThr: 4.336 ± 1.359
6.307GlyVal: 6.307 ± 1.518
0.788GlyTrp: 0.788 ± 0.426
2.365GlyTyr: 2.365 ± 0.96
0.0GlyXaa: 0.0 ± 0.0
His
1.577HisAla: 1.577 ± 0.373
0.788HisCys: 0.788 ± 0.507
1.577HisAsp: 1.577 ± 0.424
2.365HisGlu: 2.365 ± 0.88
0.394HisPhe: 0.394 ± 0.386
0.394HisGly: 0.394 ± 0.351
1.182HisHis: 1.182 ± 0.48
1.971HisIle: 1.971 ± 0.841
1.182HisLys: 1.182 ± 0.444
1.182HisLeu: 1.182 ± 0.78
0.0HisMet: 0.0 ± 0.0
1.577HisAsn: 1.577 ± 1.315
2.759HisPro: 2.759 ± 0.673
1.577HisGln: 1.577 ± 0.373
0.788HisArg: 0.788 ± 0.477
1.182HisSer: 1.182 ± 0.467
0.788HisThr: 0.788 ± 0.4
1.971HisVal: 1.971 ± 1.078
1.182HisTrp: 1.182 ± 0.48
1.577HisTyr: 1.577 ± 0.671
0.0HisXaa: 0.0 ± 0.0
Ile
1.577IleAla: 1.577 ± 0.643
0.788IleCys: 0.788 ± 0.4
2.759IleAsp: 2.759 ± 0.62
5.518IleGlu: 5.518 ± 1.146
2.365IlePhe: 2.365 ± 0.934
1.577IleGly: 1.577 ± 0.373
1.182IleHis: 1.182 ± 0.467
2.759IleIle: 2.759 ± 0.879
5.124IleLys: 5.124 ± 0.966
3.942IleLeu: 3.942 ± 1.443
1.182IleMet: 1.182 ± 0.909
2.759IleAsn: 2.759 ± 0.515
4.336IlePro: 4.336 ± 1.597
3.153IleGln: 3.153 ± 1.253
3.153IleArg: 3.153 ± 0.937
1.971IleSer: 1.971 ± 0.556
6.701IleThr: 6.701 ± 1.078
4.73IleVal: 4.73 ± 1.047
0.394IleTrp: 0.394 ± 0.351
1.182IleTyr: 1.182 ± 0.467
0.0IleXaa: 0.0 ± 0.0
Lys
5.518LysAla: 5.518 ± 2.344
1.971LysCys: 1.971 ± 0.841
1.577LysAsp: 1.577 ± 0.717
5.912LysGlu: 5.912 ± 1.292
3.153LysPhe: 3.153 ± 1.064
5.518LysGly: 5.518 ± 2.878
0.394LysHis: 0.394 ± 0.351
3.547LysIle: 3.547 ± 0.979
6.307LysLys: 6.307 ± 3.43
7.095LysLeu: 7.095 ± 1.567
0.788LysMet: 0.788 ± 0.462
4.73LysAsn: 4.73 ± 1.291
1.577LysPro: 1.577 ± 0.757
2.365LysGln: 2.365 ± 1.377
4.73LysArg: 4.73 ± 2.512
5.912LysSer: 5.912 ± 1.745
5.124LysThr: 5.124 ± 0.868
2.365LysVal: 2.365 ± 0.654
1.577LysTrp: 1.577 ± 0.693
3.153LysTyr: 3.153 ± 0.92
0.0LysXaa: 0.0 ± 0.0
Leu
4.336LeuAla: 4.336 ± 1.233
2.365LeuCys: 2.365 ± 0.737
5.912LeuAsp: 5.912 ± 2.767
9.066LeuGlu: 9.066 ± 1.159
1.971LeuPhe: 1.971 ± 0.654
3.153LeuGly: 3.153 ± 0.514
3.153LeuHis: 3.153 ± 1.088
5.912LeuIle: 5.912 ± 0.962
3.547LeuLys: 3.547 ± 1.74
6.307LeuLeu: 6.307 ± 1.535
1.971LeuMet: 1.971 ± 1.249
3.942LeuAsn: 3.942 ± 0.888
4.336LeuPro: 4.336 ± 1.835
5.124LeuGln: 5.124 ± 1.24
5.124LeuArg: 5.124 ± 1.105
6.307LeuSer: 6.307 ± 1.708
3.547LeuThr: 3.547 ± 2.063
3.547LeuVal: 3.547 ± 1.405
1.577LeuTrp: 1.577 ± 0.557
1.577LeuTyr: 1.577 ± 0.424
0.0LeuXaa: 0.0 ± 0.0
Met
0.788MetAla: 0.788 ± 0.46
0.0MetCys: 0.0 ± 0.0
1.577MetAsp: 1.577 ± 0.768
1.182MetGlu: 1.182 ± 0.714
0.0MetPhe: 0.0 ± 0.0
0.394MetGly: 0.394 ± 0.354
0.788MetHis: 0.788 ± 0.477
1.182MetIle: 1.182 ± 0.776
2.365MetLys: 2.365 ± 0.85
2.365MetLeu: 2.365 ± 0.734
1.182MetMet: 1.182 ± 0.727
0.394MetAsn: 0.394 ± 0.351
0.0MetPro: 0.0 ± 0.0
0.788MetGln: 0.788 ± 0.703
1.182MetArg: 1.182 ± 0.633
1.971MetSer: 1.971 ± 0.626
0.788MetThr: 0.788 ± 0.772
0.788MetVal: 0.788 ± 0.4
0.394MetTrp: 0.394 ± 0.541
1.577MetTyr: 1.577 ± 0.974
0.0MetXaa: 0.0 ± 0.0
Asn
2.365AsnAla: 2.365 ± 1.71
1.577AsnCys: 1.577 ± 0.518
2.759AsnAsp: 2.759 ± 0.699
2.365AsnGlu: 2.365 ± 0.83
2.365AsnPhe: 2.365 ± 0.723
4.73AsnGly: 4.73 ± 1.117
0.788AsnHis: 0.788 ± 0.4
0.788AsnIle: 0.788 ± 0.426
5.518AsnLys: 5.518 ± 1.638
4.336AsnLeu: 4.336 ± 0.802
1.182AsnMet: 1.182 ± 0.415
2.365AsnAsn: 2.365 ± 0.83
4.73AsnPro: 4.73 ± 0.89
0.788AsnGln: 0.788 ± 0.46
0.788AsnArg: 0.788 ± 0.426
3.942AsnSer: 3.942 ± 0.73
1.971AsnThr: 1.971 ± 0.613
7.095AsnVal: 7.095 ± 1.829
0.788AsnTrp: 0.788 ± 0.772
0.788AsnTyr: 0.788 ± 0.4
0.0AsnXaa: 0.0 ± 0.0
Pro
5.518ProAla: 5.518 ± 1.482
0.788ProCys: 0.788 ± 0.652
2.365ProAsp: 2.365 ± 0.496
3.942ProGlu: 3.942 ± 0.884
1.577ProPhe: 1.577 ± 0.515
1.577ProGly: 1.577 ± 0.465
0.394ProHis: 0.394 ± 0.386
3.547ProIle: 3.547 ± 1.536
2.365ProLys: 2.365 ± 1.589
6.307ProLeu: 6.307 ± 1.648
1.182ProMet: 1.182 ± 0.517
1.577ProAsn: 1.577 ± 0.994
2.759ProPro: 2.759 ± 1.15
1.971ProGln: 1.971 ± 0.826
1.577ProArg: 1.577 ± 0.978
3.942ProSer: 3.942 ± 0.789
4.336ProThr: 4.336 ± 0.984
2.759ProVal: 2.759 ± 0.977
0.0ProTrp: 0.0 ± 0.0
3.153ProTyr: 3.153 ± 1.307
0.0ProXaa: 0.0 ± 0.0
Gln
1.182GlnAla: 1.182 ± 0.415
0.788GlnCys: 0.788 ± 0.487
2.365GlnAsp: 2.365 ± 0.817
2.365GlnGlu: 2.365 ± 1.046
1.182GlnPhe: 1.182 ± 0.597
4.73GlnGly: 4.73 ± 1.657
0.0GlnHis: 0.0 ± 0.0
0.788GlnIle: 0.788 ± 0.4
1.971GlnLys: 1.971 ± 1.058
3.942GlnLeu: 3.942 ± 1.667
0.394GlnMet: 0.394 ± 0.378
0.788GlnAsn: 0.788 ± 0.4
2.759GlnPro: 2.759 ± 0.636
1.971GlnGln: 1.971 ± 0.7
2.365GlnArg: 2.365 ± 0.423
3.547GlnSer: 3.547 ± 1.151
1.577GlnThr: 1.577 ± 0.465
1.577GlnVal: 1.577 ± 0.657
1.577GlnTrp: 1.577 ± 0.757
1.971GlnTyr: 1.971 ± 0.613
0.0GlnXaa: 0.0 ± 0.0
Arg
1.577ArgAla: 1.577 ± 1.223
1.182ArgCys: 1.182 ± 0.733
1.971ArgAsp: 1.971 ± 0.936
1.971ArgGlu: 1.971 ± 0.555
1.577ArgPhe: 1.577 ± 0.84
6.307ArgGly: 6.307 ± 1.264
2.365ArgHis: 2.365 ± 1.199
3.547ArgIle: 3.547 ± 1.276
5.518ArgLys: 5.518 ± 1.55
3.942ArgLeu: 3.942 ± 1.893
0.394ArgMet: 0.394 ± 0.351
2.365ArgAsn: 2.365 ± 1.3
2.365ArgPro: 2.365 ± 0.983
2.759ArgGln: 2.759 ± 0.846
5.912ArgArg: 5.912 ± 3.844
1.182ArgSer: 1.182 ± 0.679
2.759ArgThr: 2.759 ± 0.594
2.365ArgVal: 2.365 ± 1.16
0.788ArgTrp: 0.788 ± 0.772
1.182ArgTyr: 1.182 ± 0.444
0.0ArgXaa: 0.0 ± 0.0
Ser
4.336SerAla: 4.336 ± 0.913
0.0SerCys: 0.0 ± 0.0
3.153SerAsp: 3.153 ± 1.017
3.153SerGlu: 3.153 ± 1.515
1.182SerPhe: 1.182 ± 0.813
5.518SerGly: 5.518 ± 1.467
1.577SerHis: 1.577 ± 0.657
2.759SerIle: 2.759 ± 0.902
1.971SerLys: 1.971 ± 0.792
6.701SerLeu: 6.701 ± 1.473
1.182SerMet: 1.182 ± 0.679
3.153SerAsn: 3.153 ± 0.933
3.153SerPro: 3.153 ± 0.607
3.547SerGln: 3.547 ± 0.778
3.153SerArg: 3.153 ± 0.621
6.307SerSer: 6.307 ± 2.114
9.46SerThr: 9.46 ± 1.431
3.547SerVal: 3.547 ± 1.076
1.577SerTrp: 1.577 ± 0.657
2.365SerTyr: 2.365 ± 0.723
0.0SerXaa: 0.0 ± 0.0
Thr
4.336ThrAla: 4.336 ± 1.887
0.394ThrCys: 0.394 ± 0.351
5.124ThrAsp: 5.124 ± 1.401
5.518ThrGlu: 5.518 ± 1.191
3.153ThrPhe: 3.153 ± 1.154
4.73ThrGly: 4.73 ± 0.98
1.182ThrHis: 1.182 ± 1.075
3.153ThrIle: 3.153 ± 1.585
4.336ThrLys: 4.336 ± 1.676
3.547ThrLeu: 3.547 ± 0.39
1.971ThrMet: 1.971 ± 0.655
3.547ThrAsn: 3.547 ± 0.556
2.759ThrPro: 2.759 ± 1.493
2.365ThrGln: 2.365 ± 1.146
1.971ThrArg: 1.971 ± 0.997
3.942ThrSer: 3.942 ± 1.164
3.942ThrThr: 3.942 ± 1.053
5.912ThrVal: 5.912 ± 2.138
0.394ThrTrp: 0.394 ± 0.351
2.759ThrTyr: 2.759 ± 0.674
0.0ThrXaa: 0.0 ± 0.0
Val
2.365ValAla: 2.365 ± 0.747
0.788ValCys: 0.788 ± 0.487
2.759ValAsp: 2.759 ± 0.966
3.547ValGlu: 3.547 ± 1.42
3.547ValPhe: 3.547 ± 0.762
1.971ValGly: 1.971 ± 0.577
3.547ValHis: 3.547 ± 1.646
3.942ValIle: 3.942 ± 0.954
6.701ValLys: 6.701 ± 1.401
5.124ValLeu: 5.124 ± 1.652
0.0ValMet: 0.0 ± 0.0
3.547ValAsn: 3.547 ± 1.551
6.701ValPro: 6.701 ± 2.241
2.365ValGln: 2.365 ± 0.922
1.577ValArg: 1.577 ± 0.799
5.518ValSer: 5.518 ± 1.086
3.153ValThr: 3.153 ± 0.771
3.153ValVal: 3.153 ± 1.07
1.971ValTrp: 1.971 ± 0.688
3.153ValTyr: 3.153 ± 1.136
0.0ValXaa: 0.0 ± 0.0
Trp
1.182TrpAla: 1.182 ± 0.415
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.788TrpGlu: 0.788 ± 0.426
0.394TrpPhe: 0.394 ± 0.351
2.365TrpGly: 2.365 ± 0.934
0.788TrpHis: 0.788 ± 0.46
0.788TrpIle: 0.788 ± 0.594
1.971TrpLys: 1.971 ± 0.589
2.759TrpLeu: 2.759 ± 0.951
0.0TrpMet: 0.0 ± 0.0
1.182TrpAsn: 1.182 ± 0.467
0.0TrpPro: 0.0 ± 0.0
0.788TrpGln: 0.788 ± 0.703
0.788TrpArg: 0.788 ± 0.426
1.971TrpSer: 1.971 ± 0.706
0.394TrpThr: 0.394 ± 0.386
0.394TrpVal: 0.394 ± 0.386
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.788TyrCys: 0.788 ± 0.4
1.182TyrAsp: 1.182 ± 0.554
1.577TyrGlu: 1.577 ± 0.373
1.971TyrPhe: 1.971 ± 0.817
1.577TyrGly: 1.577 ± 0.464
1.577TyrHis: 1.577 ± 0.424
4.336TyrIle: 4.336 ± 1.736
3.547TyrLys: 3.547 ± 0.928
0.788TyrLeu: 0.788 ± 0.46
0.394TyrMet: 0.394 ± 0.389
1.577TyrAsn: 1.577 ± 0.518
1.182TyrPro: 1.182 ± 0.627
1.577TyrGln: 1.577 ± 0.685
3.153TyrArg: 3.153 ± 1.073
0.788TyrSer: 0.788 ± 0.426
1.971TyrThr: 1.971 ± 0.881
3.153TyrVal: 3.153 ± 0.973
1.182TyrTrp: 1.182 ± 0.415
2.759TyrTyr: 2.759 ± 1.139
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (2538 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski