Amino acid dipeptide frequency for Human parvovirus B19 (strain HV) (HPV B19)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (20 × 20), or 441 (21 × 21) if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids. If dipeptides occurred randomly and uniformly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
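The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function name `dipeptide_permille`, the use of one-letter residue codes, and the choice to count overlapping pairs within each protein and pool them across the proteome are all assumptions, since the page does not state how pairs or the normalising total are handled.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted within each sequence and
    pooled over all sequences; any non-standard residue is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides, in per mille.
expected_uniform = 1000.0 / len(AMINO_ACIDS) ** 2
```

Under this convention, a dipeptide whose value exceeds `expected_uniform` would be one of the overrepresented (red) entries, and one below it an underrepresented (blue) entry.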

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
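As a small worked example of the unit conversion, the snippet below converts the GlyGly value from the table (10.648‰) to a fraction and compares it with the uniform expectation of 1/400. The helper name `permille_to_fraction` is hypothetical and used only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3


gly_gly = permille_to_fraction(10.648)  # GlyGly value taken from the table
uniform = 1 / 400                       # expected fraction if all 400 pairs were equally likely
enrichment = gly_gly / uniform          # how many times more frequent than expected
```

Here `enrichment` comes out at roughly 4.26, that is, GlyGly occurs about four times more often than the uniform expectation.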

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.768 ± 1.324
AlaCys: 0.444 ± 0.352
AlaAsp: 1.775 ± 0.993
AlaGlu: 2.662 ± 0.583
AlaPhe: 1.775 ± 0.534
AlaGly: 4.437 ± 0.555
AlaHis: 0.444 ± 0.316
AlaIle: 4.437 ± 1.498
AlaLys: 5.324 ± 0.741
AlaLeu: 7.098 ± 1.345
AlaMet: 0.887 ± 0.68
AlaAsn: 1.775 ± 0.567
AlaPro: 2.662 ± 1.006
AlaGln: 1.331 ± 0.58
AlaArg: 0.887 ± 0.633
AlaSer: 5.768 ± 0.712
AlaThr: 4.437 ± 0.935
AlaVal: 4.88 ± 1.302
AlaTrp: 0.887 ± 0.703
AlaTyr: 2.218 ± 0.537
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.887 ± 0.703
CysCys: 0.444 ± 0.352
CysAsp: 0.0 ± 0.0
CysGlu: 0.444 ± 0.352
CysPhe: 0.0 ± 0.0
CysGly: 0.887 ± 0.654
CysHis: 1.775 ± 0.416
CysIle: 1.331 ± 1.055
CysLys: 0.887 ± 0.654
CysLeu: 0.444 ± 0.352
CysMet: 0.0 ± 0.0
CysAsn: 1.331 ± 0.841
CysPro: 0.887 ± 0.654
CysGln: 0.444 ± 0.352
CysArg: 0.0 ± 0.0
CysSer: 2.218 ± 0.749
CysThr: 2.218 ± 0.915
CysVal: 1.775 ± 1.112
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.662 ± 0.359
AspCys: 0.444 ± 0.352
AspAsp: 1.331 ± 0.536
AspGlu: 0.887 ± 0.384
AspPhe: 3.106 ± 1.02
AspGly: 1.331 ± 1.055
AspHis: 3.106 ± 0.814
AspIle: 0.887 ± 0.654
AspLys: 3.993 ± 0.676
AspLeu: 3.106 ± 1.343
AspMet: 0.444 ± 0.352
AspAsn: 1.775 ± 0.762
AspPro: 3.106 ± 0.827
AspGln: 1.775 ± 0.416
AspArg: 0.0 ± 0.0
AspSer: 4.437 ± 1.618
AspThr: 4.88 ± 0.574
AspVal: 2.662 ± 0.359
AspTrp: 0.444 ± 0.352
AspTyr: 1.331 ± 0.665
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.549 ± 0.569
GluCys: 0.444 ± 0.352
GluAsp: 2.662 ± 0.583
GluGlu: 3.993 ± 0.511
GluPhe: 2.218 ± 0.526
GluGly: 3.993 ± 0.539
GluHis: 1.775 ± 0.746
GluIle: 0.887 ± 0.373
GluLys: 3.549 ± 1.248
GluLeu: 4.437 ± 0.892
GluMet: 1.775 ± 0.537
GluAsn: 5.324 ± 0.926
GluPro: 0.887 ± 0.384
GluGln: 2.218 ± 0.574
GluArg: 1.331 ± 0.18
GluSer: 4.437 ± 0.824
GluThr: 0.887 ± 0.384
GluVal: 1.331 ± 0.665
GluTrp: 0.0 ± 0.0
GluTyr: 1.775 ± 0.746
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.331 ± 0.536
PheCys: 0.0 ± 0.0
PheAsp: 0.887 ± 0.384
PheGlu: 0.887 ± 0.703
PhePhe: 1.775 ± 0.416
PheGly: 0.0 ± 0.0
PheHis: 0.0 ± 0.0
PheIle: 0.444 ± 0.352
PheLys: 2.218 ± 0.915
PheLeu: 3.106 ± 0.516
PheMet: 1.331 ± 0.702
PheAsn: 2.662 ± 1.094
PhePro: 4.88 ± 0.991
PheGln: 4.437 ± 0.892
PheArg: 3.549 ± 1.359
PheSer: 3.993 ± 1.194
PheThr: 2.218 ± 1.332
PheVal: 1.775 ± 0.993
PheTrp: 0.0 ± 0.0
PheTyr: 3.106 ± 0.418
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.324 ± 0.868
GlyCys: 1.775 ± 0.529
GlyAsp: 3.993 ± 1.194
GlyGlu: 2.218 ± 1.332
GlyPhe: 0.887 ± 0.631
GlyGly: 10.648 ± 1.808
GlyHis: 0.887 ± 0.373
GlyIle: 6.211 ± 1.333
GlyLys: 4.88 ± 1.018
GlyLeu: 6.211 ± 1.508
GlyMet: 1.331 ± 0.841
GlyAsn: 1.775 ± 0.324
GlyPro: 6.655 ± 0.85
GlyGln: 4.88 ± 0.803
GlyArg: 2.662 ± 1.118
GlySer: 5.324 ± 0.799
GlyThr: 6.211 ± 0.869
GlyVal: 5.324 ± 0.718
GlyTrp: 1.331 ± 0.18
GlyTyr: 2.662 ± 0.359
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.549 ± 0.648
HisCys: 0.887 ± 0.654
HisAsp: 0.444 ± 0.316
HisGlu: 1.775 ± 0.746
HisPhe: 1.775 ± 0.798
HisGly: 2.218 ± 0.868
HisHis: 3.106 ± 1.224
HisIle: 0.887 ± 0.703
HisLys: 0.444 ± 0.316
HisLeu: 2.662 ± 0.676
HisMet: 0.0 ± 0.0
HisAsn: 0.887 ± 0.373
HisPro: 2.218 ± 0.537
HisGln: 0.887 ± 0.373
HisArg: 0.887 ± 0.373
HisSer: 1.775 ± 0.529
HisThr: 1.775 ± 0.416
HisVal: 1.775 ± 1.112
HisTrp: 1.331 ± 0.536
HisTyr: 2.662 ± 0.583
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.662 ± 0.677
IleCys: 0.0 ± 0.0
IleAsp: 1.775 ± 1.112
IleGlu: 2.218 ± 0.468
IlePhe: 0.887 ± 0.373
IleGly: 1.331 ± 0.18
IleHis: 0.887 ± 0.384
IleIle: 0.444 ± 0.352
IleLys: 3.106 ± 0.776
IleLeu: 1.775 ± 0.324
IleMet: 1.775 ± 0.746
IleAsn: 3.993 ± 1.703
IlePro: 2.662 ± 0.359
IleGln: 1.775 ± 0.324
IleArg: 0.887 ± 1.255
IleSer: 5.768 ± 1.363
IleThr: 2.662 ± 1.094
IleVal: 1.331 ± 1.055
IleTrp: 0.887 ± 0.373
IleTyr: 0.887 ± 0.703
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.324 ± 1.195
LysCys: 0.887 ± 0.703
LysAsp: 2.662 ± 0.688
LysGlu: 4.437 ± 1.283
LysPhe: 3.993 ± 0.765
LysGly: 2.218 ± 0.412
LysHis: 0.444 ± 0.627
LysIle: 2.662 ± 0.359
LysLys: 3.106 ± 1.442
LysLeu: 6.211 ± 1.552
LysMet: 1.331 ± 0.936
LysAsn: 3.106 ± 1.283
LysPro: 2.662 ± 1.09
LysGln: 1.775 ± 0.746
LysArg: 0.444 ± 0.627
LysSer: 4.437 ± 0.495
LysThr: 2.662 ± 0.677
LysVal: 2.662 ± 0.976
LysTrp: 1.331 ± 0.665
LysTyr: 3.106 ± 0.675
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.88 ± 1.128
LeuCys: 1.775 ± 1.112
LeuAsp: 3.993 ± 0.765
LeuGlu: 4.437 ± 1.498
LeuPhe: 2.218 ± 0.412
LeuGly: 8.429 ± 2.7
LeuHis: 3.106 ± 0.418
LeuIle: 2.662 ± 0.583
LeuLys: 6.655 ± 1.268
LeuLeu: 5.324 ± 2.342
LeuMet: 2.662 ± 0.525
LeuAsn: 3.106 ± 0.535
LeuPro: 4.88 ± 0.803
LeuGln: 3.106 ± 0.732
LeuArg: 1.331 ± 0.69
LeuSer: 6.211 ± 1.084
LeuThr: 7.986 ± 1.903
LeuVal: 7.098 ± 1.262
LeuTrp: 2.218 ± 0.468
LeuTyr: 3.549 ± 0.447
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.887 ± 0.703
MetCys: 0.444 ± 0.352
MetAsp: 0.887 ± 0.99
MetGlu: 0.444 ± 0.352
MetPhe: 0.0 ± 0.0
MetGly: 2.218 ± 0.468
MetHis: 0.887 ± 0.373
MetIle: 0.0 ± 0.0
MetLys: 0.444 ± 0.352
MetLeu: 1.775 ± 1.007
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.775 ± 1.006
MetGln: 1.775 ± 1.83
MetArg: 0.444 ± 0.352
MetSer: 1.775 ± 0.732
MetThr: 3.549 ± 0.934
MetVal: 2.218 ± 0.749
MetTrp: 0.887 ± 0.373
MetTyr: 1.331 ± 0.69
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.324 ± 1.167
AsnCys: 0.444 ± 0.627
AsnAsp: 1.331 ± 1.055
AsnGlu: 2.218 ± 0.615
AsnPhe: 1.775 ± 0.416
AsnGly: 1.331 ± 1.055
AsnHis: 0.444 ± 0.607
AsnIle: 1.331 ± 0.61
AsnLys: 1.775 ± 0.416
AsnLeu: 5.768 ± 0.519
AsnMet: 1.331 ± 0.69
AsnAsn: 2.218 ± 1.016
AsnPro: 4.88 ± 1.117
AsnGln: 0.444 ± 0.352
AsnArg: 1.775 ± 0.416
AsnSer: 3.993 ± 0.999
AsnThr: 4.88 ± 0.993
AsnVal: 2.662 ± 1.742
AsnTrp: 2.218 ± 1.458
AsnTyr: 3.106 ± 0.732
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.331 ± 0.536
ProCys: 0.444 ± 0.627
ProAsp: 3.549 ± 0.506
ProGlu: 3.993 ± 1.587
ProPhe: 0.887 ± 0.703
ProGly: 8.873 ± 0.941
ProHis: 1.775 ± 0.324
ProIle: 4.437 ± 0.495
ProLys: 1.775 ± 1.05
ProLeu: 7.098 ± 0.877
ProMet: 0.444 ± 0.635
ProAsn: 3.106 ± 1.215
ProPro: 9.317 ± 3.878
ProGln: 6.211 ± 2.358
ProArg: 3.993 ± 1.061
ProSer: 2.218 ± 1.152
ProThr: 1.775 ± 0.416
ProVal: 7.098 ± 2.165
ProTrp: 0.887 ± 0.373
ProTyr: 4.437 ± 1.618
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.331 ± 0.781
GlnCys: 0.0 ± 0.0
GlnAsp: 0.887 ± 0.373
GlnGlu: 0.444 ± 0.607
GlnPhe: 3.549 ± 0.483
GlnGly: 4.437 ± 1.618
GlnHis: 3.993 ± 0.639
GlnIle: 2.662 ± 0.583
GlnLys: 0.444 ± 0.352
GlnLeu: 5.324 ± 1.944
GlnMet: 0.444 ± 0.607
GlnAsn: 3.549 ± 1.412
GlnPro: 5.324 ± 2.006
GlnGln: 2.662 ± 0.583
GlnArg: 0.0 ± 0.0
GlnSer: 3.106 ± 0.776
GlnThr: 2.662 ± 1.102
GlnVal: 3.106 ± 0.776
GlnTrp: 0.444 ± 0.352
GlnTyr: 3.549 ± 0.934
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.331 ± 0.701
ArgCys: 0.887 ± 0.703
ArgAsp: 0.444 ± 0.352
ArgGlu: 0.887 ± 0.703
ArgPhe: 0.887 ± 0.373
ArgGly: 2.218 ± 1.332
ArgHis: 0.887 ± 0.373
ArgIle: 2.218 ± 0.615
ArgLys: 1.331 ± 0.702
ArgLeu: 3.106 ± 1.178
ArgMet: 0.887 ± 0.654
ArgAsn: 0.444 ± 0.352
ArgPro: 2.662 ± 1.203
ArgGln: 1.331 ± 0.69
ArgArg: 1.331 ± 0.18
ArgSer: 1.331 ± 0.18
ArgThr: 0.0 ± 0.0
ArgVal: 1.775 ± 0.416
ArgTrp: 0.887 ± 0.373
ArgTyr: 1.775 ± 0.741
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.88 ± 0.846
SerCys: 1.331 ± 0.18
SerAsp: 1.775 ± 0.769
SerGlu: 5.324 ± 0.626
SerPhe: 3.549 ± 0.831
SerGly: 6.211 ± 1.479
SerHis: 2.662 ± 0.923
SerIle: 1.775 ± 0.416
SerLys: 3.106 ± 0.706
SerLeu: 4.88 ± 0.884
SerMet: 3.106 ± 1.224
SerAsn: 3.106 ± 0.535
SerPro: 4.88 ± 1.575
SerGln: 4.437 ± 1.657
SerArg: 2.662 ± 1.118
SerSer: 11.979 ± 3.515
SerThr: 6.655 ± 0.858
SerVal: 6.655 ± 0.592
SerTrp: 1.331 ± 0.877
SerTyr: 1.775 ± 1.286
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.106 ± 0.827
ThrCys: 1.775 ± 0.416
ThrAsp: 5.768 ± 1.363
ThrGlu: 1.775 ± 1.407
ThrPhe: 4.437 ± 0.682
ThrGly: 9.76 ± 2.173
ThrHis: 2.218 ± 0.526
ThrIle: 2.218 ± 0.468
ThrLys: 3.106 ± 1.734
ThrLeu: 4.88 ± 0.829
ThrMet: 2.662 ± 0.677
ThrAsn: 2.218 ± 1.332
ThrPro: 4.437 ± 1.759
ThrGln: 3.106 ± 1.254
ThrArg: 1.775 ± 0.529
ThrSer: 5.768 ± 1.645
ThrThr: 6.211 ± 1.136
ThrVal: 4.88 ± 0.513
ThrTrp: 0.444 ± 0.352
ThrTyr: 2.218 ± 0.927
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.549 ± 1.341
ValCys: 2.218 ± 0.574
ValAsp: 3.106 ± 1.02
ValGlu: 2.218 ± 1.016
ValPhe: 1.331 ± 0.18
ValGly: 6.211 ± 0.789
ValHis: 1.331 ± 0.18
ValIle: 1.331 ± 0.841
ValLys: 4.88 ± 1.118
ValLeu: 6.655 ± 1.628
ValMet: 0.887 ± 0.68
ValAsn: 3.549 ± 0.603
ValPro: 6.211 ± 0.968
ValGln: 3.549 ± 0.483
ValArg: 1.775 ± 1.087
ValSer: 3.993 ± 0.463
ValThr: 6.655 ± 1.007
ValVal: 4.437 ± 2.354
ValTrp: 2.218 ± 0.749
ValTyr: 2.662 ± 0.583
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.444 ± 0.352
TrpCys: 1.331 ± 1.055
TrpAsp: 2.218 ± 0.749
TrpGlu: 1.775 ± 0.782
TrpPhe: 0.444 ± 0.352
TrpGly: 0.887 ± 0.373
TrpHis: 0.0 ± 0.0
TrpIle: 0.444 ± 0.352
TrpLys: 0.887 ± 0.703
TrpLeu: 1.331 ± 0.877
TrpMet: 0.0 ± 0.0
TrpAsn: 2.662 ± 0.359
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.887 ± 0.373
TrpSer: 1.775 ± 0.746
TrpThr: 1.331 ± 0.536
TrpVal: 0.887 ± 0.373
TrpTrp: 0.887 ± 0.384
TrpTyr: 0.444 ± 0.352
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.775 ± 0.746
TyrCys: 0.0 ± 0.0
TyrAsp: 2.218 ± 0.468
TyrGlu: 4.437 ± 0.653
TyrPhe: 3.106 ± 0.418
TyrGly: 3.549 ± 0.603
TyrHis: 2.218 ± 0.63
TyrIle: 0.444 ± 0.352
TyrLys: 3.549 ± 0.934
TyrLeu: 4.88 ± 1.69
TyrMet: 0.0 ± 0.0
TyrAsn: 2.662 ± 0.676
TyrPro: 2.662 ± 0.976
TyrGln: 1.775 ± 0.741
TyrArg: 0.0 ± 0.0
TyrSer: 1.775 ± 0.798
TyrThr: 3.106 ± 0.732
TyrVal: 4.437 ± 1.283
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.444 ± 0.635
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2255 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
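A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions rather than the procedure actually used by the database: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is assumed to be the standard deviation of the statistic across the 100 resamples, which the page does not specify.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide, e.g. "GG", over all sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples.

    Assumption: the error is the standard deviation across resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)
```

With only 6 proteins in this proteome, each resample contains many repeated proteins, which is one plausible reason some errors in the table are large relative to the values themselves.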

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski