Amino acid dipepetide frequency for Cervus elaphus papillomavirus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.662AlaAla: 5.662 ± 2.209
0.871AlaCys: 0.871 ± 1.046
2.613AlaAsp: 2.613 ± 0.918
4.355AlaGlu: 4.355 ± 1.476
3.049AlaPhe: 3.049 ± 1.205
2.178AlaGly: 2.178 ± 0.702
0.436AlaHis: 0.436 ± 0.409
2.178AlaIle: 2.178 ± 0.759
4.355AlaLys: 4.355 ± 0.872
2.613AlaLeu: 2.613 ± 1.187
0.436AlaMet: 0.436 ± 0.325
0.871AlaAsn: 0.871 ± 0.649
4.791AlaPro: 4.791 ± 1.033
2.178AlaGln: 2.178 ± 0.899
6.969AlaArg: 6.969 ± 2.158
3.484AlaSer: 3.484 ± 1.721
3.484AlaThr: 3.484 ± 1.385
4.355AlaVal: 4.355 ± 1.173
0.0AlaTrp: 0.0 ± 0.0
3.484AlaTyr: 3.484 ± 1.535
0.0AlaXaa: 0.0 ± 0.0
Cys
2.613CysAla: 2.613 ± 1.305
2.178CysCys: 2.178 ± 1.048
1.307CysAsp: 1.307 ± 0.565
0.0CysGlu: 0.0 ± 0.0
0.436CysPhe: 0.436 ± 0.409
0.871CysGly: 0.871 ± 0.984
0.436CysHis: 0.436 ± 0.409
1.307CysIle: 1.307 ± 0.658
1.742CysLys: 1.742 ± 1.088
2.178CysLeu: 2.178 ± 1.545
0.0CysMet: 0.0 ± 0.0
2.178CysAsn: 2.178 ± 1.517
1.307CysPro: 1.307 ± 1.003
0.0CysGln: 0.0 ± 0.0
2.178CysArg: 2.178 ± 1.011
2.613CysSer: 2.613 ± 1.087
0.436CysThr: 0.436 ± 0.409
0.0CysVal: 0.0 ± 0.0
0.871CysTrp: 0.871 ± 0.461
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.613AspAla: 2.613 ± 0.657
2.178AspCys: 2.178 ± 0.532
3.484AspAsp: 3.484 ± 1.354
1.307AspGlu: 1.307 ± 0.565
3.484AspPhe: 3.484 ± 0.867
3.484AspGly: 3.484 ± 1.795
1.307AspHis: 1.307 ± 0.768
3.484AspIle: 3.484 ± 0.957
3.484AspLys: 3.484 ± 1.425
6.969AspLeu: 6.969 ± 1.499
0.436AspMet: 0.436 ± 0.409
2.613AspAsn: 2.613 ± 0.812
3.484AspPro: 3.484 ± 1.362
2.613AspGln: 2.613 ± 1.416
1.742AspArg: 1.742 ± 0.579
3.92AspSer: 3.92 ± 0.75
4.791AspThr: 4.791 ± 1.327
4.791AspVal: 4.791 ± 1.21
0.871AspTrp: 0.871 ± 0.649
2.178AspTyr: 2.178 ± 0.418
0.0AspXaa: 0.0 ± 0.0
Glu
3.049GluAla: 3.049 ± 1.53
1.307GluCys: 1.307 ± 0.635
8.711GluAsp: 8.711 ± 0.691
4.791GluGlu: 4.791 ± 2.073
3.92GluPhe: 3.92 ± 0.935
1.742GluGly: 1.742 ± 0.699
0.871GluHis: 0.871 ± 0.461
1.742GluIle: 1.742 ± 0.635
3.049GluLys: 3.049 ± 1.702
4.355GluLeu: 4.355 ± 1.508
2.178GluMet: 2.178 ± 0.418
3.049GluAsn: 3.049 ± 0.879
4.791GluPro: 4.791 ± 0.96
3.484GluGln: 3.484 ± 1.062
3.049GluArg: 3.049 ± 0.804
2.613GluSer: 2.613 ± 0.476
3.049GluThr: 3.049 ± 0.97
4.791GluVal: 4.791 ± 1.866
0.436GluTrp: 0.436 ± 0.409
0.871GluTyr: 0.871 ± 0.67
0.0GluXaa: 0.0 ± 0.0
Phe
2.178PheAla: 2.178 ± 0.963
0.871PheCys: 0.871 ± 1.046
3.484PheAsp: 3.484 ± 0.98
3.92PheGlu: 3.92 ± 0.948
1.307PhePhe: 1.307 ± 0.565
2.613PheGly: 2.613 ± 1.256
0.0PheHis: 0.0 ± 0.0
0.436PheIle: 0.436 ± 0.325
3.049PheLys: 3.049 ± 1.036
6.533PheLeu: 6.533 ± 1.346
0.0PheMet: 0.0 ± 0.377
1.742PheAsn: 1.742 ± 0.597
2.613PhePro: 2.613 ± 0.865
2.178PheGln: 2.178 ± 0.878
1.742PheArg: 1.742 ± 0.645
3.484PheSer: 3.484 ± 0.755
2.613PheThr: 2.613 ± 0.47
3.049PheVal: 3.049 ± 0.857
1.742PheTrp: 1.742 ± 0.742
2.178PheTyr: 2.178 ± 1.021
0.0PheXaa: 0.0 ± 0.0
Gly
2.613GlyAla: 2.613 ± 0.47
1.307GlyCys: 1.307 ± 0.635
4.791GlyAsp: 4.791 ± 1.283
4.791GlyGlu: 4.791 ± 1.306
1.742GlyPhe: 1.742 ± 0.558
7.84GlyGly: 7.84 ± 4.458
1.742GlyHis: 1.742 ± 0.681
4.355GlyIle: 4.355 ± 1.512
2.613GlyLys: 2.613 ± 0.812
3.484GlyLeu: 3.484 ± 1.418
0.0GlyMet: 0.0 ± 0.0
2.613GlyAsn: 2.613 ± 0.973
4.791GlyPro: 4.791 ± 0.865
3.049GlyGln: 3.049 ± 0.916
3.484GlyArg: 3.484 ± 0.646
7.404GlySer: 7.404 ± 1.49
3.92GlyThr: 3.92 ± 1.386
1.742GlyVal: 1.742 ± 0.932
0.436GlyTrp: 0.436 ± 0.398
2.178GlyTyr: 2.178 ± 0.635
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.436HisCys: 0.436 ± 0.325
0.436HisAsp: 0.436 ± 0.325
0.436HisGlu: 0.436 ± 0.398
0.871HisPhe: 0.871 ± 0.649
2.178HisGly: 2.178 ± 0.418
0.0HisHis: 0.0 ± 0.0
1.307HisIle: 1.307 ± 0.711
0.436HisLys: 0.436 ± 0.398
0.871HisLeu: 0.871 ± 0.984
0.436HisMet: 0.436 ± 0.354
0.871HisAsn: 0.871 ± 0.468
2.178HisPro: 2.178 ± 1.164
0.0HisGln: 0.0 ± 0.0
2.613HisArg: 2.613 ± 0.878
1.742HisSer: 1.742 ± 0.238
0.871HisThr: 0.871 ± 0.371
0.871HisVal: 0.871 ± 0.371
0.871HisTrp: 0.871 ± 0.461
0.436HisTyr: 0.436 ± 0.354
0.0HisXaa: 0.0 ± 0.0
Ile
1.307IleAla: 1.307 ± 0.762
0.871IleCys: 0.871 ± 0.67
2.613IleAsp: 2.613 ± 0.738
3.049IleGlu: 3.049 ± 1.009
1.307IlePhe: 1.307 ± 1.003
3.049IleGly: 3.049 ± 1.67
0.436IleHis: 0.436 ± 0.409
1.742IleIle: 1.742 ± 1.036
1.307IleLys: 1.307 ± 0.349
3.484IleLeu: 3.484 ± 1.879
0.0IleMet: 0.0 ± 0.0
2.178IleAsn: 2.178 ± 0.635
3.049IlePro: 3.049 ± 1.45
2.613IleGln: 2.613 ± 0.867
1.742IleArg: 1.742 ± 0.969
3.049IleSer: 3.049 ± 0.615
3.049IleThr: 3.049 ± 0.699
4.791IleVal: 4.791 ± 1.865
1.307IleTrp: 1.307 ± 0.58
3.484IleTyr: 3.484 ± 1.817
0.0IleXaa: 0.0 ± 0.0
Lys
4.791LysAla: 4.791 ± 1.845
0.0LysCys: 0.0 ± 0.0
2.613LysAsp: 2.613 ± 0.58
3.049LysGlu: 3.049 ± 1.34
3.484LysPhe: 3.484 ± 1.337
3.484LysGly: 3.484 ± 0.495
0.871LysHis: 0.871 ± 0.649
2.178LysIle: 2.178 ± 0.532
3.484LysLys: 3.484 ± 0.948
3.92LysLeu: 3.92 ± 1.248
0.436LysMet: 0.436 ± 0.398
0.871LysAsn: 0.871 ± 0.461
1.307LysPro: 1.307 ± 1.057
2.613LysGln: 2.613 ± 0.47
6.098LysArg: 6.098 ± 0.663
3.484LysSer: 3.484 ± 2.596
3.049LysThr: 3.049 ± 1.568
2.613LysVal: 2.613 ± 1.082
0.871LysTrp: 0.871 ± 0.41
1.307LysTyr: 1.307 ± 0.434
0.0LysXaa: 0.0 ± 0.0
Leu
3.92LeuAla: 3.92 ± 0.697
1.742LeuCys: 1.742 ± 1.466
5.662LeuAsp: 5.662 ± 1.236
6.098LeuGlu: 6.098 ± 2.213
5.226LeuPhe: 5.226 ± 0.855
6.533LeuGly: 6.533 ± 1.469
2.178LeuHis: 2.178 ± 1.014
5.662LeuIle: 5.662 ± 1.062
3.92LeuLys: 3.92 ± 1.638
8.275LeuLeu: 8.275 ± 1.898
1.307LeuMet: 1.307 ± 0.496
2.613LeuAsn: 2.613 ± 0.47
3.92LeuPro: 3.92 ± 0.705
4.355LeuGln: 4.355 ± 1.057
4.355LeuArg: 4.355 ± 0.923
5.226LeuSer: 5.226 ± 1.595
6.098LeuThr: 6.098 ± 1.511
4.355LeuVal: 4.355 ± 1.081
0.871LeuTrp: 0.871 ± 0.371
2.613LeuTyr: 2.613 ± 0.738
0.0LeuXaa: 0.0 ± 0.0
Met
0.436MetAla: 0.436 ± 0.325
0.436MetCys: 0.436 ± 0.409
0.871MetAsp: 0.871 ± 0.818
1.307MetGlu: 1.307 ± 0.897
0.0MetPhe: 0.0 ± 0.0
0.436MetGly: 0.436 ± 0.398
0.436MetHis: 0.436 ± 0.354
0.436MetIle: 0.436 ± 0.398
0.0MetLys: 0.0 ± 0.0
0.871MetLeu: 0.871 ± 0.461
0.0MetMet: 0.0 ± 0.0
0.871MetAsn: 0.871 ± 0.371
0.871MetPro: 0.871 ± 0.518
0.871MetGln: 0.871 ± 0.41
1.742MetArg: 1.742 ± 0.915
2.178MetSer: 2.178 ± 0.79
0.871MetThr: 0.871 ± 0.518
1.307MetVal: 1.307 ± 0.459
0.436MetTrp: 0.436 ± 0.398
0.436MetTyr: 0.436 ± 0.409
0.0MetXaa: 0.0 ± 0.0
Asn
3.92AsnAla: 3.92 ± 0.811
1.307AsnCys: 1.307 ± 0.565
0.436AsnAsp: 0.436 ± 0.325
3.049AsnGlu: 3.049 ± 0.677
1.307AsnPhe: 1.307 ± 0.711
2.178AsnGly: 2.178 ± 0.524
0.0AsnHis: 0.0 ± 0.0
2.178AsnIle: 2.178 ± 0.748
3.484AsnLys: 3.484 ± 0.889
1.307AsnLeu: 1.307 ± 0.434
1.742AsnMet: 1.742 ± 0.665
1.742AsnAsn: 1.742 ± 0.645
4.355AsnPro: 4.355 ± 1.518
1.307AsnGln: 1.307 ± 0.711
1.742AsnArg: 1.742 ± 0.238
3.484AsnSer: 3.484 ± 0.571
3.484AsnThr: 3.484 ± 1.085
3.92AsnVal: 3.92 ± 0.812
0.436AsnTrp: 0.436 ± 0.325
0.436AsnTyr: 0.436 ± 0.325
0.0AsnXaa: 0.0 ± 0.0
Pro
5.662ProAla: 5.662 ± 1.119
0.871ProCys: 0.871 ± 0.818
3.484ProAsp: 3.484 ± 1.6
4.355ProGlu: 4.355 ± 0.88
2.178ProPhe: 2.178 ± 1.275
2.178ProGly: 2.178 ± 0.759
0.436ProHis: 0.436 ± 0.354
0.871ProIle: 0.871 ± 0.708
3.484ProLys: 3.484 ± 1.147
5.662ProLeu: 5.662 ± 1.405
0.436ProMet: 0.436 ± 0.325
5.662ProAsn: 5.662 ± 1.699
7.404ProPro: 7.404 ± 1.239
0.871ProGln: 0.871 ± 0.461
3.049ProArg: 3.049 ± 0.952
5.226ProSer: 5.226 ± 1.305
3.92ProThr: 3.92 ± 0.828
5.226ProVal: 5.226 ± 1.38
0.0ProTrp: 0.0 ± 0.0
2.178ProTyr: 2.178 ± 1.701
0.0ProXaa: 0.0 ± 0.0
Gln
3.049GlnAla: 3.049 ± 0.875
0.871GlnCys: 0.871 ± 0.544
0.871GlnAsp: 0.871 ± 0.818
2.613GlnGlu: 2.613 ± 0.476
3.049GlnPhe: 3.049 ± 0.74
1.307GlnGly: 1.307 ± 0.349
1.307GlnHis: 1.307 ± 0.349
3.049GlnIle: 3.049 ± 1.19
0.871GlnLys: 0.871 ± 0.371
2.178GlnLeu: 2.178 ± 0.648
1.742GlnMet: 1.742 ± 1.036
1.742GlnAsn: 1.742 ± 0.486
3.484GlnPro: 3.484 ± 1.02
1.742GlnGln: 1.742 ± 0.486
2.178GlnArg: 2.178 ± 0.713
1.742GlnSer: 1.742 ± 1.036
3.049GlnThr: 3.049 ± 1.045
3.484GlnVal: 3.484 ± 1.376
1.742GlnTrp: 1.742 ± 0.681
1.307GlnTyr: 1.307 ± 1.227
0.0GlnXaa: 0.0 ± 0.0
Arg
2.613ArgAla: 2.613 ± 0.58
3.484ArgCys: 3.484 ± 1.283
3.484ArgAsp: 3.484 ± 1.324
3.049ArgGlu: 3.049 ± 1.002
2.613ArgPhe: 2.613 ± 1.035
5.226ArgGly: 5.226 ± 1.809
2.613ArgHis: 2.613 ± 0.613
1.307ArgIle: 1.307 ± 0.349
5.226ArgLys: 5.226 ± 0.805
9.582ArgLeu: 9.582 ± 1.82
2.178ArgMet: 2.178 ± 1.43
3.049ArgAsn: 3.049 ± 1.407
2.613ArgPro: 2.613 ± 0.936
2.178ArgGln: 2.178 ± 0.419
8.275ArgArg: 8.275 ± 2.751
7.404ArgSer: 7.404 ± 3.181
3.484ArgThr: 3.484 ± 0.867
3.484ArgVal: 3.484 ± 1.857
0.0ArgTrp: 0.0 ± 0.0
1.742ArgTyr: 1.742 ± 0.611
0.0ArgXaa: 0.0 ± 0.0
Ser
4.791SerAla: 4.791 ± 1.771
1.307SerCys: 1.307 ± 0.842
5.226SerAsp: 5.226 ± 0.528
3.484SerGlu: 3.484 ± 1.062
3.049SerPhe: 3.049 ± 1.205
6.969SerGly: 6.969 ± 1.302
1.742SerHis: 1.742 ± 1.059
2.178SerIle: 2.178 ± 1.356
1.307SerLys: 1.307 ± 0.642
8.275SerLeu: 8.275 ± 2.425
1.742SerMet: 1.742 ± 0.922
2.613SerAsn: 2.613 ± 1.135
3.92SerPro: 3.92 ± 1.694
3.92SerGln: 3.92 ± 1.198
10.453SerArg: 10.453 ± 4.916
9.146SerSer: 9.146 ± 4.402
5.226SerThr: 5.226 ± 1.634
3.049SerVal: 3.049 ± 1.482
1.742SerTrp: 1.742 ± 0.659
0.871SerTyr: 0.871 ± 0.544
0.0SerXaa: 0.0 ± 0.0
Thr
2.178ThrAla: 2.178 ± 0.834
0.871ThrCys: 0.871 ± 0.649
4.791ThrAsp: 4.791 ± 1.147
5.226ThrGlu: 5.226 ± 0.931
4.355ThrPhe: 4.355 ± 1.023
5.662ThrGly: 5.662 ± 1.263
0.871ThrHis: 0.871 ± 0.404
4.791ThrIle: 4.791 ± 1.229
0.436ThrLys: 0.436 ± 0.409
4.355ThrLeu: 4.355 ± 1.394
0.871ThrMet: 0.871 ± 0.691
1.742ThrAsn: 1.742 ± 0.645
3.92ThrPro: 3.92 ± 1.303
1.307ThrGln: 1.307 ± 0.686
6.098ThrArg: 6.098 ± 1.121
6.098ThrSer: 6.098 ± 2.466
3.049ThrThr: 3.049 ± 1.002
3.484ThrVal: 3.484 ± 0.796
1.307ThrTrp: 1.307 ± 0.865
2.178ThrTyr: 2.178 ± 0.878
0.0ThrXaa: 0.0 ± 0.0
Val
3.484ValAla: 3.484 ± 1.114
1.742ValCys: 1.742 ± 0.979
2.613ValAsp: 2.613 ± 1.065
3.92ValGlu: 3.92 ± 1.048
2.178ValPhe: 2.178 ± 0.748
2.178ValGly: 2.178 ± 0.773
1.307ValHis: 1.307 ± 1.041
3.484ValIle: 3.484 ± 1.288
4.791ValLys: 4.791 ± 1.642
5.226ValLeu: 5.226 ± 2.094
0.436ValMet: 0.436 ± 0.325
1.742ValAsn: 1.742 ± 0.597
2.613ValPro: 2.613 ± 0.867
4.791ValGln: 4.791 ± 0.953
2.613ValArg: 2.613 ± 1.527
7.404ValSer: 7.404 ± 1.348
5.226ValThr: 5.226 ± 1.748
2.178ValVal: 2.178 ± 0.808
0.436ValTrp: 0.436 ± 0.409
2.178ValTyr: 2.178 ± 1.281
0.0ValXaa: 0.0 ± 0.0
Trp
0.871TrpAla: 0.871 ± 0.518
0.0TrpCys: 0.0 ± 0.0
0.436TrpAsp: 0.436 ± 0.409
1.307TrpGlu: 1.307 ± 0.386
0.436TrpPhe: 0.436 ± 0.354
1.307TrpGly: 1.307 ± 0.711
0.436TrpHis: 0.436 ± 0.398
1.307TrpIle: 1.307 ± 0.434
1.742TrpLys: 1.742 ± 1.298
0.871TrpLeu: 0.871 ± 0.371
0.0TrpMet: 0.0 ± 0.0
0.871TrpAsn: 0.871 ± 0.818
0.0TrpPro: 0.0 ± 0.0
0.436TrpGln: 0.436 ± 0.398
2.178TrpArg: 2.178 ± 1.213
0.0TrpSer: 0.0 ± 0.0
1.742TrpThr: 1.742 ± 0.665
0.871TrpVal: 0.871 ± 0.649
0.436TrpTrp: 0.436 ± 0.325
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.613TyrAla: 2.613 ± 1.118
0.0TyrCys: 0.0 ± 0.0
1.307TyrAsp: 1.307 ± 0.459
1.742TyrGlu: 1.742 ± 0.795
2.178TyrPhe: 2.178 ± 0.418
3.484TyrGly: 3.484 ± 1.139
0.436TyrHis: 0.436 ± 0.409
0.436TyrIle: 0.436 ± 0.354
1.742TyrLys: 1.742 ± 1.059
3.92TyrLeu: 3.92 ± 1.274
0.0TyrMet: 0.0 ± 0.0
2.178TyrAsn: 2.178 ± 1.055
1.742TyrPro: 1.742 ± 0.597
0.871TyrGln: 0.871 ± 0.371
1.307TyrArg: 1.307 ± 0.842
1.307TyrSer: 1.307 ± 0.758
2.178TyrThr: 2.178 ± 0.419
2.178TyrVal: 2.178 ± 0.555
0.436TyrTrp: 0.436 ± 0.409
4.355TyrTyr: 4.355 ± 1.413
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2297 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski