Amino acid dipepetide frequency for Eidolon helvum papillomavirus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.457AlaAla: 7.457 ± 2.145
1.657AlaCys: 1.657 ± 1.268
4.971AlaAsp: 4.971 ± 1.325
7.457AlaGlu: 7.457 ± 1.089
0.414AlaPhe: 0.414 ± 0.354
2.486AlaGly: 2.486 ± 1.743
0.829AlaHis: 0.829 ± 0.42
2.9AlaIle: 2.9 ± 1.199
4.557AlaLys: 4.557 ± 1.832
6.628AlaLeu: 6.628 ± 1.653
2.071AlaMet: 2.071 ± 0.406
0.829AlaAsn: 0.829 ± 0.42
6.214AlaPro: 6.214 ± 1.971
2.486AlaGln: 2.486 ± 1.588
5.385AlaArg: 5.385 ± 1.888
4.143AlaSer: 4.143 ± 0.668
5.385AlaThr: 5.385 ± 1.541
6.628AlaVal: 6.628 ± 1.865
1.243AlaTrp: 1.243 ± 0.689
2.071AlaTyr: 2.071 ± 0.406
0.0AlaXaa: 0.0 ± 0.0
Cys
2.071CysAla: 2.071 ± 0.99
0.414CysCys: 0.414 ± 0.59
0.414CysAsp: 0.414 ± 0.769
0.829CysGlu: 0.829 ± 0.457
1.243CysPhe: 1.243 ± 0.782
0.829CysGly: 0.829 ± 0.85
0.414CysHis: 0.414 ± 0.59
2.071CysIle: 2.071 ± 1.118
1.243CysLys: 1.243 ± 0.736
2.071CysLeu: 2.071 ± 0.741
1.657CysMet: 1.657 ± 0.618
0.829CysAsn: 0.829 ± 0.634
2.071CysPro: 2.071 ± 0.652
0.0CysGln: 0.0 ± 0.0
0.829CysArg: 0.829 ± 0.769
2.071CysSer: 2.071 ± 1.787
1.243CysThr: 1.243 ± 0.84
0.414CysVal: 0.414 ± 0.35
0.829CysTrp: 0.829 ± 0.634
0.414CysTyr: 0.414 ± 0.769
0.0CysXaa: 0.0 ± 0.0
Asp
3.314AspAla: 3.314 ± 1.094
1.657AspCys: 1.657 ± 0.842
3.314AspAsp: 3.314 ± 1.554
4.143AspGlu: 4.143 ± 1.731
2.071AspPhe: 2.071 ± 0.652
2.071AspGly: 2.071 ± 0.57
2.9AspHis: 2.9 ± 0.686
4.557AspIle: 4.557 ± 1.654
4.143AspLys: 4.143 ± 1.41
4.143AspLeu: 4.143 ± 1.649
0.829AspMet: 0.829 ± 0.708
2.9AspAsn: 2.9 ± 0.477
5.385AspPro: 5.385 ± 2.278
1.243AspGln: 1.243 ± 0.781
3.728AspArg: 3.728 ± 1.573
4.557AspSer: 4.557 ± 1.715
5.8AspThr: 5.8 ± 1.301
4.971AspVal: 4.971 ± 1.669
2.071AspTrp: 2.071 ± 0.99
0.414AspTyr: 0.414 ± 0.357
0.0AspXaa: 0.0 ± 0.0
Glu
6.214GluAla: 6.214 ± 1.777
0.414GluCys: 0.414 ± 0.354
6.628GluAsp: 6.628 ± 1.153
5.8GluGlu: 5.8 ± 1.523
1.657GluPhe: 1.657 ± 0.62
9.528GluGly: 9.528 ± 2.441
0.414GluHis: 0.414 ± 0.354
2.071GluIle: 2.071 ± 0.84
1.657GluLys: 1.657 ± 0.712
4.557GluLeu: 4.557 ± 1.248
1.243GluMet: 1.243 ± 0.579
3.314GluAsn: 3.314 ± 0.737
3.314GluPro: 3.314 ± 0.285
0.414GluGln: 0.414 ± 0.35
3.728GluArg: 3.728 ± 1.947
4.557GluSer: 4.557 ± 0.783
2.486GluThr: 2.486 ± 0.962
5.385GluVal: 5.385 ± 1.479
0.414GluTrp: 0.414 ± 0.354
2.486GluTyr: 2.486 ± 1.26
0.0GluXaa: 0.0 ± 0.0
Phe
1.657PheAla: 1.657 ± 0.653
0.829PheCys: 0.829 ± 0.614
1.657PheAsp: 1.657 ± 0.653
1.657PheGlu: 1.657 ± 1.022
2.071PhePhe: 2.071 ± 0.951
2.9PheGly: 2.9 ± 0.724
0.829PheHis: 0.829 ± 0.523
0.829PheIle: 0.829 ± 0.701
2.486PheLys: 2.486 ± 1.681
2.071PheLeu: 2.071 ± 1.085
0.414PheMet: 0.414 ± 0.354
0.829PheAsn: 0.829 ± 0.457
1.657PhePro: 1.657 ± 0.62
0.829PheGln: 0.829 ± 0.42
3.314PheArg: 3.314 ± 0.696
1.657PheSer: 1.657 ± 0.71
2.071PheThr: 2.071 ± 0.951
1.657PheVal: 1.657 ± 1.049
1.243PheTrp: 1.243 ± 0.556
0.414PheTyr: 0.414 ± 0.35
0.0PheXaa: 0.0 ± 0.0
Gly
4.971GlyAla: 4.971 ± 1.23
0.829GlyCys: 0.829 ± 0.44
2.9GlyAsp: 2.9 ± 0.819
6.628GlyGlu: 6.628 ± 1.51
2.9GlyPhe: 2.9 ± 0.424
6.628GlyGly: 6.628 ± 2.104
1.657GlyHis: 1.657 ± 0.71
2.071GlyIle: 2.071 ± 1.0
2.486GlyLys: 2.486 ± 1.01
5.385GlyLeu: 5.385 ± 2.839
1.243GlyMet: 1.243 ± 0.635
4.971GlyAsn: 4.971 ± 1.143
3.728GlyPro: 3.728 ± 1.097
3.314GlyGln: 3.314 ± 1.222
4.971GlyArg: 4.971 ± 1.178
6.214GlySer: 6.214 ± 1.011
4.557GlyThr: 4.557 ± 1.066
5.385GlyVal: 5.385 ± 1.316
0.829GlyTrp: 0.829 ± 0.523
0.829GlyTyr: 0.829 ± 0.434
0.0GlyXaa: 0.0 ± 0.0
His
1.243HisAla: 1.243 ± 0.72
1.657HisCys: 1.657 ± 0.842
0.829HisAsp: 0.829 ± 0.614
0.0HisGlu: 0.0 ± 0.0
1.243HisPhe: 1.243 ± 0.577
0.829HisGly: 0.829 ± 0.708
1.243HisHis: 1.243 ± 0.694
0.414HisIle: 0.414 ± 0.354
1.243HisLys: 1.243 ± 1.062
1.243HisLeu: 1.243 ± 0.754
0.829HisMet: 0.829 ± 0.523
0.414HisAsn: 0.414 ± 0.35
2.9HisPro: 2.9 ± 1.606
0.414HisGln: 0.414 ± 0.354
1.243HisArg: 1.243 ± 0.708
1.657HisSer: 1.657 ± 0.694
1.243HisThr: 1.243 ± 0.72
1.243HisVal: 1.243 ± 0.368
0.414HisTrp: 0.414 ± 0.354
1.243HisTyr: 1.243 ± 0.886
0.0HisXaa: 0.0 ± 0.0
Ile
2.071IleAla: 2.071 ± 0.716
0.829IleCys: 0.829 ± 0.708
3.728IleAsp: 3.728 ± 1.962
3.314IleGlu: 3.314 ± 2.005
0.414IlePhe: 0.414 ± 0.357
2.071IleGly: 2.071 ± 0.818
0.829IleHis: 0.829 ± 0.42
0.414IleIle: 0.414 ± 0.357
1.243IleLys: 1.243 ± 1.139
3.728IleLeu: 3.728 ± 1.063
0.414IleMet: 0.414 ± 0.354
1.243IleAsn: 1.243 ± 0.641
3.728IlePro: 3.728 ± 1.062
0.829IleGln: 0.829 ± 0.826
1.657IleArg: 1.657 ± 1.136
4.971IleSer: 4.971 ± 1.408
3.314IleThr: 3.314 ± 1.166
5.385IleVal: 5.385 ± 1.05
0.414IleTrp: 0.414 ± 0.354
2.071IleTyr: 2.071 ± 0.939
0.0IleXaa: 0.0 ± 0.0
Lys
2.9LysAla: 2.9 ± 1.462
1.657LysCys: 1.657 ± 0.653
1.657LysAsp: 1.657 ± 0.583
3.314LysGlu: 3.314 ± 0.936
1.657LysPhe: 1.657 ± 0.965
2.071LysGly: 2.071 ± 0.603
1.657LysHis: 1.657 ± 1.009
1.243LysIle: 1.243 ± 0.72
1.657LysLys: 1.657 ± 0.263
2.486LysLeu: 2.486 ± 0.938
0.414LysMet: 0.414 ± 0.35
1.243LysAsn: 1.243 ± 1.172
0.829LysPro: 0.829 ± 0.376
2.486LysGln: 2.486 ± 0.482
4.143LysArg: 4.143 ± 0.629
4.143LysSer: 4.143 ± 1.405
4.143LysThr: 4.143 ± 0.811
3.314LysVal: 3.314 ± 0.625
0.0LysTrp: 0.0 ± 0.0
1.243LysTyr: 1.243 ± 0.442
0.0LysXaa: 0.0 ± 0.0
Leu
4.971LeuAla: 4.971 ± 2.065
2.071LeuCys: 2.071 ± 1.163
4.971LeuAsp: 4.971 ± 1.786
4.557LeuGlu: 4.557 ± 1.543
2.486LeuPhe: 2.486 ± 0.715
4.557LeuGly: 4.557 ± 1.141
2.486LeuHis: 2.486 ± 1.016
2.9LeuIle: 2.9 ± 0.708
3.728LeuLys: 3.728 ± 1.678
4.143LeuLeu: 4.143 ± 1.002
2.071LeuMet: 2.071 ± 1.706
1.243LeuAsn: 1.243 ± 1.051
4.557LeuPro: 4.557 ± 1.537
7.871LeuGln: 7.871 ± 0.954
5.385LeuArg: 5.385 ± 1.849
3.314LeuSer: 3.314 ± 0.86
4.143LeuThr: 4.143 ± 1.257
4.971LeuVal: 4.971 ± 1.919
0.414LeuTrp: 0.414 ± 0.35
4.143LeuTyr: 4.143 ± 1.328
0.0LeuXaa: 0.0 ± 0.0
Met
2.071MetAla: 2.071 ± 0.744
0.414MetCys: 0.414 ± 0.59
1.657MetAsp: 1.657 ± 1.03
1.657MetGlu: 1.657 ± 0.784
0.414MetPhe: 0.414 ± 0.354
0.414MetGly: 0.414 ± 0.35
0.414MetHis: 0.414 ± 0.357
1.657MetIle: 1.657 ± 0.879
0.829MetLys: 0.829 ± 0.708
2.071MetLeu: 2.071 ± 1.304
0.0MetMet: 0.0 ± 0.0
0.414MetAsn: 0.414 ± 0.35
1.243MetPro: 1.243 ± 0.781
1.243MetGln: 1.243 ± 0.699
0.829MetArg: 0.829 ± 0.42
1.657MetSer: 1.657 ± 1.415
0.414MetThr: 0.414 ± 0.354
1.243MetVal: 1.243 ± 0.84
0.414MetTrp: 0.414 ± 0.35
1.243MetTyr: 1.243 ± 0.442
0.0MetXaa: 0.0 ± 0.0
Asn
3.728AsnAla: 3.728 ± 1.264
1.243AsnCys: 1.243 ± 0.781
0.414AsnAsp: 0.414 ± 0.354
0.414AsnGlu: 0.414 ± 0.354
0.829AsnPhe: 0.829 ± 0.523
2.486AsnGly: 2.486 ± 1.167
0.414AsnHis: 0.414 ± 0.354
3.728AsnIle: 3.728 ± 1.232
1.657AsnLys: 1.657 ± 0.732
1.657AsnLeu: 1.657 ± 1.081
1.243AsnMet: 1.243 ± 0.641
1.243AsnAsn: 1.243 ± 0.635
4.143AsnPro: 4.143 ± 1.403
0.414AsnGln: 0.414 ± 0.35
1.657AsnArg: 1.657 ± 0.704
1.243AsnSer: 1.243 ± 0.442
2.9AsnThr: 2.9 ± 1.545
1.657AsnVal: 1.657 ± 0.965
0.414AsnTrp: 0.414 ± 0.35
1.243AsnTyr: 1.243 ± 0.635
0.0AsnXaa: 0.0 ± 0.0
Pro
7.871ProAla: 7.871 ± 3.567
1.243ProCys: 1.243 ± 0.556
7.042ProAsp: 7.042 ± 1.08
3.314ProGlu: 3.314 ± 0.646
2.071ProPhe: 2.071 ± 0.493
3.314ProGly: 3.314 ± 1.101
0.829ProHis: 0.829 ± 0.715
4.971ProIle: 4.971 ± 2.427
2.071ProLys: 2.071 ± 0.889
6.628ProLeu: 6.628 ± 2.078
0.414ProMet: 0.414 ± 0.35
2.486ProAsn: 2.486 ± 1.271
3.314ProPro: 3.314 ± 1.003
2.071ProGln: 2.071 ± 1.029
4.557ProArg: 4.557 ± 1.754
4.971ProSer: 4.971 ± 1.364
4.971ProThr: 4.971 ± 1.22
4.143ProVal: 4.143 ± 1.317
0.414ProTrp: 0.414 ± 0.425
1.243ProTyr: 1.243 ± 0.71
0.0ProXaa: 0.0 ± 0.0
Gln
2.486GlnAla: 2.486 ± 1.006
1.243GlnCys: 1.243 ± 1.01
2.486GlnAsp: 2.486 ± 0.481
2.486GlnGlu: 2.486 ± 1.168
1.243GlnPhe: 1.243 ± 0.641
4.143GlnGly: 4.143 ± 2.031
0.414GlnHis: 0.414 ± 0.354
0.414GlnIle: 0.414 ± 0.357
1.243GlnLys: 1.243 ± 0.442
2.071GlnLeu: 2.071 ± 0.406
0.829GlnMet: 0.829 ± 0.701
0.829GlnAsn: 0.829 ± 0.44
1.657GlnPro: 1.657 ± 0.839
2.486GlnGln: 2.486 ± 0.93
2.486GlnArg: 2.486 ± 1.077
1.657GlnSer: 1.657 ± 0.653
2.9GlnThr: 2.9 ± 0.739
2.486GlnVal: 2.486 ± 0.466
1.243GlnTrp: 1.243 ± 0.694
1.657GlnTyr: 1.657 ± 0.653
0.0GlnXaa: 0.0 ± 0.0
Arg
6.214ArgAla: 6.214 ± 3.277
2.071ArgCys: 2.071 ± 0.747
4.971ArgAsp: 4.971 ± 1.904
4.143ArgGlu: 4.143 ± 1.95
2.486ArgPhe: 2.486 ± 0.611
5.385ArgGly: 5.385 ± 2.344
2.071ArgHis: 2.071 ± 0.548
1.243ArgIle: 1.243 ± 0.442
4.143ArgLys: 4.143 ± 0.996
3.314ArgLeu: 3.314 ± 1.804
1.243ArgMet: 1.243 ± 1.01
2.486ArgAsn: 2.486 ± 0.562
3.728ArgPro: 3.728 ± 0.747
2.486ArgGln: 2.486 ± 0.859
7.457ArgArg: 7.457 ± 3.258
4.971ArgSer: 4.971 ± 1.407
5.385ArgThr: 5.385 ± 2.112
3.314ArgVal: 3.314 ± 1.001
1.243ArgTrp: 1.243 ± 1.274
2.486ArgTyr: 2.486 ± 0.517
0.0ArgXaa: 0.0 ± 0.0
Ser
4.971SerAla: 4.971 ± 1.865
0.414SerCys: 0.414 ± 0.35
4.557SerAsp: 4.557 ± 1.566
5.8SerGlu: 5.8 ± 3.176
2.9SerPhe: 2.9 ± 1.318
7.042SerGly: 7.042 ± 1.535
0.829SerHis: 0.829 ± 0.746
2.9SerIle: 2.9 ± 0.823
0.829SerLys: 0.829 ± 0.85
7.042SerLeu: 7.042 ± 2.506
2.071SerMet: 2.071 ± 0.55
1.657SerAsn: 1.657 ± 1.009
3.728SerPro: 3.728 ± 1.601
3.728SerGln: 3.728 ± 0.988
4.971SerArg: 4.971 ± 1.319
6.214SerSer: 6.214 ± 1.011
6.214SerThr: 6.214 ± 1.335
2.9SerVal: 2.9 ± 0.824
0.0SerTrp: 0.0 ± 0.0
1.243SerTyr: 1.243 ± 0.694
0.0SerXaa: 0.0 ± 0.0
Thr
2.486ThrAla: 2.486 ± 0.904
1.243ThrCys: 1.243 ± 0.754
5.8ThrAsp: 5.8 ± 1.094
2.486ThrGlu: 2.486 ± 0.877
2.486ThrPhe: 2.486 ± 0.602
6.214ThrGly: 6.214 ± 0.686
0.414ThrHis: 0.414 ± 0.769
3.314ThrIle: 3.314 ± 1.015
1.657ThrLys: 1.657 ± 0.753
4.971ThrLeu: 4.971 ± 1.113
1.243ThrMet: 1.243 ± 0.442
2.9ThrAsn: 2.9 ± 0.697
6.628ThrPro: 6.628 ± 1.25
1.243ThrGln: 1.243 ± 0.641
4.143ThrArg: 4.143 ± 2.031
4.557ThrSer: 4.557 ± 1.295
4.971ThrThr: 4.971 ± 1.615
8.699ThrVal: 8.699 ± 1.4
0.0ThrTrp: 0.0 ± 0.0
2.486ThrTyr: 2.486 ± 0.482
0.0ThrXaa: 0.0 ± 0.0
Val
5.385ValAla: 5.385 ± 1.595
1.657ValCys: 1.657 ± 1.07
4.143ValAsp: 4.143 ± 0.996
4.971ValGlu: 4.971 ± 1.638
1.657ValPhe: 1.657 ± 1.03
6.214ValGly: 6.214 ± 3.691
2.071ValHis: 2.071 ± 0.942
2.486ValIle: 2.486 ± 0.466
3.314ValLys: 3.314 ± 1.034
5.8ValLeu: 5.8 ± 1.508
1.657ValMet: 1.657 ± 0.585
1.657ValAsn: 1.657 ± 0.704
5.8ValPro: 5.8 ± 1.954
2.071ValGln: 2.071 ± 1.126
7.042ValArg: 7.042 ± 1.571
5.8ValSer: 5.8 ± 1.068
2.071ValThr: 2.071 ± 1.311
4.557ValVal: 4.557 ± 2.661
1.243ValTrp: 1.243 ± 0.782
3.314ValTyr: 3.314 ± 0.984
0.0ValXaa: 0.0 ± 0.0
Trp
0.829TrpAla: 0.829 ± 0.634
0.414TrpCys: 0.414 ± 0.35
0.414TrpAsp: 0.414 ± 0.35
0.829TrpGlu: 0.829 ± 0.523
0.0TrpPhe: 0.0 ± 0.0
1.243TrpGly: 1.243 ± 0.71
0.414TrpHis: 0.414 ± 0.425
1.657TrpIle: 1.657 ± 0.572
0.829TrpLys: 0.829 ± 0.708
3.314TrpLeu: 3.314 ± 0.737
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.243TrpPro: 1.243 ± 0.473
0.414TrpGln: 0.414 ± 0.35
0.414TrpArg: 0.414 ± 0.425
1.243TrpSer: 1.243 ± 0.442
0.414TrpThr: 0.414 ± 0.354
1.243TrpVal: 1.243 ± 0.84
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.9TyrAla: 2.9 ± 1.052
0.0TyrCys: 0.0 ± 0.0
1.657TyrAsp: 1.657 ± 0.571
2.486TyrGlu: 2.486 ± 1.038
0.829TyrPhe: 0.829 ± 0.44
2.071TyrGly: 2.071 ± 0.493
0.414TyrHis: 0.414 ± 0.35
0.829TyrIle: 0.829 ± 0.376
1.243TyrLys: 1.243 ± 0.374
1.657TyrLeu: 1.657 ± 0.87
0.0TyrMet: 0.0 ± 0.0
1.243TyrAsn: 1.243 ± 0.71
2.071TyrPro: 2.071 ± 0.942
0.414TyrGln: 0.414 ± 0.354
2.9TyrArg: 2.9 ± 1.643
0.829TyrSer: 0.829 ± 0.769
3.314TyrThr: 3.314 ± 1.456
3.314TyrVal: 3.314 ± 0.891
2.071TyrTrp: 2.071 ± 0.493
2.486TyrTyr: 2.486 ± 0.713
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2415 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski