Amino acid dipepetide frequency for Erethizon dorsatum papillomavirus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.445AlaAla: 3.445 ± 1.477
0.0AlaCys: 0.0 ± 0.0
3.445AlaAsp: 3.445 ± 0.646
3.015AlaGlu: 3.015 ± 1.176
2.153AlaPhe: 2.153 ± 0.627
4.737AlaGly: 4.737 ± 1.82
1.292AlaHis: 1.292 ± 1.046
1.723AlaIle: 1.723 ± 0.638
4.307AlaLys: 4.307 ± 1.115
3.876AlaLeu: 3.876 ± 0.86
0.431AlaMet: 0.431 ± 0.419
1.723AlaAsn: 1.723 ± 0.959
3.445AlaPro: 3.445 ± 0.818
2.153AlaGln: 2.153 ± 0.85
3.445AlaArg: 3.445 ± 0.887
1.723AlaSer: 1.723 ± 0.553
3.876AlaThr: 3.876 ± 0.809
3.876AlaVal: 3.876 ± 0.977
1.292AlaTrp: 1.292 ± 0.434
2.153AlaTyr: 2.153 ± 0.833
0.0AlaXaa: 0.0 ± 0.0
Cys
1.723CysAla: 1.723 ± 1.0
0.431CysCys: 0.431 ± 0.338
0.0CysAsp: 0.0 ± 0.0
0.431CysGlu: 0.431 ± 0.338
0.861CysPhe: 0.861 ± 0.677
0.861CysGly: 0.861 ± 0.415
0.431CysHis: 0.431 ± 0.788
0.431CysIle: 0.431 ± 0.479
1.723CysLys: 1.723 ± 0.481
3.445CysLeu: 3.445 ± 2.751
0.0CysMet: 0.0 ± 0.0
0.861CysAsn: 0.861 ± 0.519
1.723CysPro: 1.723 ± 0.712
0.431CysGln: 0.431 ± 0.348
1.292CysArg: 1.292 ± 1.092
0.861CysSer: 0.861 ± 0.677
3.876CysThr: 3.876 ± 1.409
0.0CysVal: 0.0 ± 0.0
1.292CysTrp: 1.292 ± 0.926
0.431CysTyr: 0.431 ± 0.479
0.0CysXaa: 0.0 ± 0.0
Asp
3.876AspAla: 3.876 ± 1.734
1.292AspCys: 1.292 ± 1.015
3.876AspAsp: 3.876 ± 1.444
4.307AspGlu: 4.307 ± 1.535
3.445AspPhe: 3.445 ± 1.831
2.584AspGly: 2.584 ± 1.085
0.0AspHis: 0.0 ± 0.0
5.168AspIle: 5.168 ± 1.007
1.723AspLys: 1.723 ± 0.584
4.737AspLeu: 4.737 ± 0.885
0.861AspMet: 0.861 ± 0.361
1.723AspAsn: 1.723 ± 0.53
5.599AspPro: 5.599 ± 2.023
2.153AspGln: 2.153 ± 0.687
1.723AspArg: 1.723 ± 1.011
4.307AspSer: 4.307 ± 0.661
4.737AspThr: 4.737 ± 0.958
6.46AspVal: 6.46 ± 2.156
0.861AspTrp: 0.861 ± 0.361
2.153AspTyr: 2.153 ± 0.914
0.0AspXaa: 0.0 ± 0.0
Glu
1.292GluAla: 1.292 ± 0.597
1.292GluCys: 1.292 ± 0.734
6.46GluAsp: 6.46 ± 1.115
5.599GluGlu: 5.599 ± 1.969
0.861GluPhe: 0.861 ± 0.519
6.029GluGly: 6.029 ± 1.437
0.861GluHis: 0.861 ± 0.63
4.307GluIle: 4.307 ± 1.207
3.015GluLys: 3.015 ± 1.224
6.029GluLeu: 6.029 ± 1.311
0.0GluMet: 0.0 ± 0.0
2.153GluAsn: 2.153 ± 0.755
2.153GluPro: 2.153 ± 0.648
3.876GluGln: 3.876 ± 1.133
3.445GluArg: 3.445 ± 1.354
4.307GluSer: 4.307 ± 2.293
5.168GluThr: 5.168 ± 1.908
5.168GluVal: 5.168 ± 1.818
0.431GluTrp: 0.431 ± 0.338
2.153GluTyr: 2.153 ± 0.597
0.0GluXaa: 0.0 ± 0.0
Phe
1.723PheAla: 1.723 ± 0.756
2.153PheCys: 2.153 ± 0.959
4.737PheAsp: 4.737 ± 0.798
3.015PheGlu: 3.015 ± 1.108
2.584PhePhe: 2.584 ± 0.624
2.153PheGly: 2.153 ± 0.541
0.431PheHis: 0.431 ± 0.349
1.723PheIle: 1.723 ± 0.505
2.153PheLys: 2.153 ± 0.687
3.445PheLeu: 3.445 ± 0.805
1.723PheMet: 1.723 ± 1.003
1.723PheAsn: 1.723 ± 0.505
2.153PhePro: 2.153 ± 1.145
0.431PheGln: 0.431 ± 0.378
1.292PheArg: 1.292 ± 0.475
1.292PheSer: 1.292 ± 0.834
2.153PheThr: 2.153 ± 0.661
3.015PheVal: 3.015 ± 1.03
1.723PheTrp: 1.723 ± 0.722
1.292PheTyr: 1.292 ± 0.734
0.0PheXaa: 0.0 ± 0.0
Gly
5.168GlyAla: 5.168 ± 1.466
3.015GlyCys: 3.015 ± 1.379
4.307GlyAsp: 4.307 ± 1.369
6.46GlyGlu: 6.46 ± 1.894
2.153GlyPhe: 2.153 ± 0.91
6.029GlyGly: 6.029 ± 1.714
1.723GlyHis: 1.723 ± 0.782
4.737GlyIle: 4.737 ± 0.891
2.584GlyLys: 2.584 ± 1.083
4.307GlyLeu: 4.307 ± 1.066
1.292GlyMet: 1.292 ± 0.781
3.015GlyAsn: 3.015 ± 0.729
2.584GlyPro: 2.584 ± 1.206
3.876GlyGln: 3.876 ± 0.748
5.599GlyArg: 5.599 ± 1.586
3.445GlySer: 3.445 ± 1.153
4.737GlyThr: 4.737 ± 1.918
3.015GlyVal: 3.015 ± 1.66
0.431GlyTrp: 0.431 ± 0.378
1.292GlyTyr: 1.292 ± 1.009
0.0GlyXaa: 0.0 ± 0.0
His
1.292HisAla: 1.292 ± 0.599
1.292HisCys: 1.292 ± 1.53
1.292HisAsp: 1.292 ± 0.475
0.861HisGlu: 0.861 ± 0.779
1.292HisPhe: 1.292 ± 0.657
1.723HisGly: 1.723 ± 0.882
0.0HisHis: 0.0 ± 0.0
1.723HisIle: 1.723 ± 0.691
0.431HisLys: 0.431 ± 0.338
1.292HisLeu: 1.292 ± 0.811
0.0HisMet: 0.0 ± 0.0
0.861HisAsn: 0.861 ± 0.415
1.723HisPro: 1.723 ± 0.754
1.292HisGln: 1.292 ± 0.607
2.584HisArg: 2.584 ± 0.865
0.861HisSer: 0.861 ± 0.361
0.431HisThr: 0.431 ± 0.349
2.153HisVal: 2.153 ± 0.817
0.431HisTrp: 0.431 ± 0.349
0.861HisTyr: 0.861 ± 0.44
0.0HisXaa: 0.0 ± 0.0
Ile
3.445IleAla: 3.445 ± 1.314
0.861IleCys: 0.861 ± 0.875
1.292IleAsp: 1.292 ± 0.63
2.153IleGlu: 2.153 ± 0.918
1.723IlePhe: 1.723 ± 0.8
5.599IleGly: 5.599 ± 0.94
0.0IleHis: 0.0 ± 0.0
2.584IleIle: 2.584 ± 1.26
0.861IleLys: 0.861 ± 0.514
3.445IleLeu: 3.445 ± 1.032
0.0IleMet: 0.0 ± 0.0
1.723IleAsn: 1.723 ± 0.691
4.737IlePro: 4.737 ± 1.592
4.737IleGln: 4.737 ± 0.696
1.723IleArg: 1.723 ± 0.802
4.307IleSer: 4.307 ± 0.856
1.292IleThr: 1.292 ± 1.015
2.584IleVal: 2.584 ± 0.656
0.431IleTrp: 0.431 ± 0.348
3.015IleTyr: 3.015 ± 0.918
0.0IleXaa: 0.0 ± 0.0
Lys
1.292LysAla: 1.292 ± 0.607
2.153LysCys: 2.153 ± 0.817
2.584LysAsp: 2.584 ± 1.07
1.723LysGlu: 1.723 ± 0.829
1.723LysPhe: 1.723 ± 0.722
4.307LysGly: 4.307 ± 1.857
1.723LysHis: 1.723 ± 0.959
2.584LysIle: 2.584 ± 0.717
3.445LysLys: 3.445 ± 1.101
4.737LysLeu: 4.737 ± 1.335
2.153LysMet: 2.153 ± 0.986
2.584LysAsn: 2.584 ± 0.465
0.861LysPro: 0.861 ± 0.61
2.584LysGln: 2.584 ± 0.878
4.307LysArg: 4.307 ± 0.723
3.876LysSer: 3.876 ± 1.571
1.723LysThr: 1.723 ± 0.722
2.584LysVal: 2.584 ± 1.137
0.861LysTrp: 0.861 ± 0.377
1.723LysTyr: 1.723 ± 0.754
0.0LysXaa: 0.0 ± 0.0
Leu
3.876LeuAla: 3.876 ± 0.894
1.723LeuCys: 1.723 ± 1.0
6.029LeuAsp: 6.029 ± 1.292
7.321LeuGlu: 7.321 ± 0.606
5.599LeuPhe: 5.599 ± 1.46
6.46LeuGly: 6.46 ± 1.514
3.876LeuHis: 3.876 ± 2.278
1.723LeuIle: 1.723 ± 0.611
7.321LeuLys: 7.321 ± 1.828
8.613LeuLeu: 8.613 ± 2.3
2.153LeuMet: 2.153 ± 0.433
3.876LeuAsn: 3.876 ± 2.385
2.584LeuPro: 2.584 ± 0.624
9.044LeuGln: 9.044 ± 2.047
3.015LeuArg: 3.015 ± 1.216
8.183LeuSer: 8.183 ± 1.754
3.445LeuThr: 3.445 ± 1.814
7.321LeuVal: 7.321 ± 0.735
0.861LeuTrp: 0.861 ± 0.566
3.445LeuTyr: 3.445 ± 0.765
0.0LeuXaa: 0.0 ± 0.0
Met
1.723MetAla: 1.723 ± 0.53
0.431MetCys: 0.431 ± 0.349
1.292MetAsp: 1.292 ± 0.572
0.0MetGlu: 0.0 ± 0.0
1.292MetPhe: 1.292 ± 0.624
0.0MetGly: 0.0 ± 0.0
0.431MetHis: 0.431 ± 0.338
0.861MetIle: 0.861 ± 0.619
0.861MetLys: 0.861 ± 0.94
1.292MetLeu: 1.292 ± 1.015
0.0MetMet: 0.0 ± 0.0
0.861MetAsn: 0.861 ± 0.361
0.431MetPro: 0.431 ± 0.338
1.292MetGln: 1.292 ± 0.718
0.861MetArg: 0.861 ± 0.592
0.431MetSer: 0.431 ± 0.338
0.431MetThr: 0.431 ± 0.349
2.584MetVal: 2.584 ± 0.937
0.0MetTrp: 0.0 ± 0.0
0.431MetTyr: 0.431 ± 0.349
0.0MetXaa: 0.0 ± 0.0
Asn
3.445AsnAla: 3.445 ± 1.917
0.861AsnCys: 0.861 ± 0.584
0.861AsnAsp: 0.861 ± 0.415
0.431AsnGlu: 0.431 ± 0.479
1.292AsnPhe: 1.292 ± 0.624
2.584AsnGly: 2.584 ± 1.054
0.861AsnHis: 0.861 ± 0.875
0.861AsnIle: 0.861 ± 0.377
1.723AsnLys: 1.723 ± 0.505
4.737AsnLeu: 4.737 ± 2.26
0.431AsnMet: 0.431 ± 0.338
3.015AsnAsn: 3.015 ± 1.568
3.015AsnPro: 3.015 ± 1.217
2.153AsnGln: 2.153 ± 0.966
2.584AsnArg: 2.584 ± 0.518
6.029AsnSer: 6.029 ± 0.834
2.153AsnThr: 2.153 ± 0.99
2.584AsnVal: 2.584 ± 0.801
0.431AsnTrp: 0.431 ± 0.338
1.292AsnTyr: 1.292 ± 0.687
0.0AsnXaa: 0.0 ± 0.0
Pro
3.015ProAla: 3.015 ± 1.562
1.292ProCys: 1.292 ± 1.074
4.307ProAsp: 4.307 ± 1.239
3.445ProGlu: 3.445 ± 1.013
2.153ProPhe: 2.153 ± 1.079
0.861ProGly: 0.861 ± 0.697
0.861ProHis: 0.861 ± 0.94
2.153ProIle: 2.153 ± 1.342
3.876ProLys: 3.876 ± 1.173
6.46ProLeu: 6.46 ± 1.47
0.861ProMet: 0.861 ± 0.514
4.307ProAsn: 4.307 ± 1.109
8.183ProPro: 8.183 ± 1.782
1.723ProGln: 1.723 ± 0.553
2.153ProArg: 2.153 ± 0.492
2.584ProSer: 2.584 ± 1.629
3.445ProThr: 3.445 ± 1.326
3.015ProVal: 3.015 ± 1.029
0.431ProTrp: 0.431 ± 0.378
1.292ProTyr: 1.292 ± 0.637
0.0ProXaa: 0.0 ± 0.0
Gln
2.153GlnAla: 2.153 ± 0.723
1.292GlnCys: 1.292 ± 0.811
1.292GlnAsp: 1.292 ± 1.074
6.029GlnGlu: 6.029 ± 1.068
1.723GlnPhe: 1.723 ± 0.914
3.445GlnGly: 3.445 ± 1.022
0.431GlnHis: 0.431 ± 0.338
2.153GlnIle: 2.153 ± 1.16
1.723GlnLys: 1.723 ± 0.669
6.46GlnLeu: 6.46 ± 2.326
1.723GlnMet: 1.723 ± 0.782
1.723GlnAsn: 1.723 ± 0.691
1.723GlnPro: 1.723 ± 0.853
3.445GlnGln: 3.445 ± 1.106
3.445GlnArg: 3.445 ± 0.893
2.153GlnSer: 2.153 ± 1.018
2.584GlnThr: 2.584 ± 0.929
3.445GlnVal: 3.445 ± 1.107
1.292GlnTrp: 1.292 ± 0.734
2.584GlnTyr: 2.584 ± 1.137
0.0GlnXaa: 0.0 ± 0.0
Arg
2.584ArgAla: 2.584 ± 1.003
0.861ArgCys: 0.861 ± 0.54
3.015ArgAsp: 3.015 ± 0.955
4.307ArgGlu: 4.307 ± 1.3
1.292ArgPhe: 1.292 ± 0.607
3.015ArgGly: 3.015 ± 1.215
1.723ArgHis: 1.723 ± 0.669
1.292ArgIle: 1.292 ± 0.834
3.876ArgLys: 3.876 ± 0.979
8.183ArgLeu: 8.183 ± 1.053
0.431ArgMet: 0.431 ± 0.531
3.876ArgAsn: 3.876 ± 0.899
3.876ArgPro: 3.876 ± 1.802
4.307ArgGln: 4.307 ± 1.274
7.321ArgArg: 7.321 ± 2.656
3.015ArgSer: 3.015 ± 1.724
2.584ArgThr: 2.584 ± 0.626
2.584ArgVal: 2.584 ± 1.285
0.0ArgTrp: 0.0 ± 0.0
2.584ArgTyr: 2.584 ± 1.191
0.0ArgXaa: 0.0 ± 0.0
Ser
2.153SerAla: 2.153 ± 0.937
0.431SerCys: 0.431 ± 0.479
3.876SerAsp: 3.876 ± 1.326
4.307SerGlu: 4.307 ± 1.685
2.584SerPhe: 2.584 ± 0.878
6.46SerGly: 6.46 ± 1.707
2.153SerHis: 2.153 ± 1.045
6.029SerIle: 6.029 ± 1.059
1.723SerLys: 1.723 ± 1.353
6.891SerLeu: 6.891 ± 1.301
0.861SerMet: 0.861 ± 0.736
1.292SerAsn: 1.292 ± 0.687
3.445SerPro: 3.445 ± 1.682
0.861SerGln: 0.861 ± 0.677
6.46SerArg: 6.46 ± 1.84
10.336SerSer: 10.336 ± 2.241
5.168SerThr: 5.168 ± 1.85
4.307SerVal: 4.307 ± 0.606
0.861SerTrp: 0.861 ± 0.677
1.723SerTyr: 1.723 ± 0.914
0.0SerXaa: 0.0 ± 0.0
Thr
3.015ThrAla: 3.015 ± 0.576
0.431ThrCys: 0.431 ± 0.338
4.307ThrAsp: 4.307 ± 0.877
4.737ThrGlu: 4.737 ± 1.685
2.153ThrPhe: 2.153 ± 0.944
5.168ThrGly: 5.168 ± 1.628
1.292ThrHis: 1.292 ± 0.718
1.723ThrIle: 1.723 ± 0.824
2.153ThrLys: 2.153 ± 0.703
6.029ThrLeu: 6.029 ± 1.586
0.861ThrMet: 0.861 ± 0.427
1.292ThrAsn: 1.292 ± 1.015
3.445ThrPro: 3.445 ± 1.396
1.292ThrGln: 1.292 ± 0.607
2.584ThrArg: 2.584 ± 0.739
6.891ThrSer: 6.891 ± 2.143
5.168ThrThr: 5.168 ± 1.193
5.599ThrVal: 5.599 ± 1.982
0.861ThrTrp: 0.861 ± 0.415
1.723ThrTyr: 1.723 ± 0.584
0.0ThrXaa: 0.0 ± 0.0
Val
3.876ValAla: 3.876 ± 0.702
0.861ValCys: 0.861 ± 0.519
6.029ValAsp: 6.029 ± 2.247
3.876ValGlu: 3.876 ± 0.959
3.015ValPhe: 3.015 ± 0.789
4.307ValGly: 4.307 ± 1.512
2.584ValHis: 2.584 ± 0.518
1.292ValIle: 1.292 ± 0.624
2.584ValLys: 2.584 ± 0.777
4.737ValLeu: 4.737 ± 1.029
0.861ValMet: 0.861 ± 0.631
1.723ValAsn: 1.723 ± 0.505
4.307ValPro: 4.307 ± 0.95
3.876ValGln: 3.876 ± 1.192
3.445ValArg: 3.445 ± 1.349
6.029ValSer: 6.029 ± 1.348
6.891ValThr: 6.891 ± 2.464
5.168ValVal: 5.168 ± 1.58
1.292ValTrp: 1.292 ± 0.434
1.292ValTyr: 1.292 ± 0.868
0.0ValXaa: 0.0 ± 0.0
Trp
0.431TrpAla: 0.431 ± 0.338
0.0TrpCys: 0.0 ± 0.0
0.861TrpAsp: 0.861 ± 0.514
0.431TrpGlu: 0.431 ± 0.349
0.431TrpPhe: 0.431 ± 0.338
0.431TrpGly: 0.431 ± 0.349
0.861TrpHis: 0.861 ± 0.427
1.292TrpIle: 1.292 ± 0.657
1.723TrpLys: 1.723 ± 0.584
2.584TrpLeu: 2.584 ± 0.785
0.0TrpMet: 0.0 ± 0.0
0.861TrpAsn: 0.861 ± 0.361
0.431TrpPro: 0.431 ± 0.349
0.0TrpGln: 0.0 ± 0.0
1.723TrpArg: 1.723 ± 0.915
0.861TrpSer: 0.861 ± 0.756
0.0TrpThr: 0.0 ± 0.0
0.861TrpVal: 0.861 ± 0.415
0.431TrpTrp: 0.431 ± 0.349
0.431TrpTyr: 0.431 ± 0.378
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.584TyrAla: 2.584 ± 0.715
0.431TyrCys: 0.431 ± 0.479
1.723TyrAsp: 1.723 ± 0.664
2.153TyrGlu: 2.153 ± 0.977
2.584TyrPhe: 2.584 ± 1.083
3.015TyrGly: 3.015 ± 0.919
0.861TyrHis: 0.861 ± 0.514
2.153TyrIle: 2.153 ± 1.649
1.723TyrLys: 1.723 ± 0.729
5.168TyrLeu: 5.168 ± 1.066
0.431TyrMet: 0.431 ± 0.338
2.153TyrAsn: 2.153 ± 0.832
0.0TyrPro: 0.0 ± 0.0
1.292TyrGln: 1.292 ± 0.475
1.723TyrArg: 1.723 ± 0.712
0.861TyrSer: 0.861 ± 0.514
0.861TyrThr: 0.861 ± 0.415
1.723TyrVal: 1.723 ± 0.691
0.431TyrTrp: 0.431 ± 0.378
3.015TyrTyr: 3.015 ± 1.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2323 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski