Amino acid dipeptide frequency for Human polyomavirus 7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
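The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names, the choice to count overlapping pairs within each protein, and the mapping of every non-standard letter to X (Xaa) are assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)

# Uniform expectation for the 400 standard dipeptides, in per mille.
EXPECTED_UNIFORM = 1000.0 / 400     # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED_UNIFORM:
        return "over"
    if freq < EXPECTED_UNIFORM:
        return "under"
    return "expected"


# Toy input, not the real proteome: pairs are MA, AA, AL and AA, AK.
freqs = dipeptide_per_mille(["MAAL", "AAK"])
print(freqs["AA"], classify(freqs["AA"]))
```

With real data, the input would be the protein sequences of the proteome, and `classify` would correspond to the red and blue marking of the table cells.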

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions; for example, 12.36‰ corresponds to a fraction of 0.01236 (1.236%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.36 ± 3.693
AlaCys: 2.809 ± 0.923
AlaAsp: 3.371 ± 1.479
AlaGlu: 1.124 ± 0.736
AlaPhe: 0.0 ± 0.0
AlaGly: 3.371 ± 1.629
AlaHis: 0.562 ± 0.368
AlaIle: 5.618 ± 1.921
AlaLys: 3.371 ± 1.008
AlaLeu: 10.674 ± 2.939
AlaMet: 0.562 ± 0.554
AlaAsn: 3.933 ± 0.611
AlaPro: 3.933 ± 1.341
AlaGln: 0.562 ± 0.368
AlaArg: 2.247 ± 0.749
AlaSer: 5.056 ± 0.493
AlaThr: 6.742 ± 1.172
AlaVal: 3.933 ± 1.258
AlaTrp: 2.247 ± 0.749
AlaTyr: 2.247 ± 0.457
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.562 ± 0.368
CysCys: 1.124 ± 1.102
CysAsp: 0.562 ± 0.368
CysGlu: 0.562 ± 0.554
CysPhe: 1.124 ± 0.531
CysGly: 0.562 ± 0.554
CysHis: 0.0 ± 0.0
CysIle: 1.685 ± 1.018
CysLys: 0.562 ± 0.368
CysLeu: 1.685 ± 1.104
CysMet: 0.0 ± 0.0
CysAsn: 0.562 ± 0.368
CysPro: 0.562 ± 0.368
CysGln: 0.562 ± 0.554
CysArg: 0.562 ± 0.554
CysSer: 1.124 ± 0.736
CysThr: 2.247 ± 0.964
CysVal: 1.685 ± 0.729
CysTrp: 1.685 ± 1.018
CysTyr: 2.809 ± 0.818
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.124 ± 0.493
AspCys: 1.124 ± 0.736
AspAsp: 0.562 ± 0.368
AspGlu: 5.056 ± 0.901
AspPhe: 2.809 ± 1.84
AspGly: 1.124 ± 0.736
AspHis: 0.0 ± 0.0
AspIle: 0.562 ± 0.551
AspLys: 6.742 ± 1.596
AspLeu: 2.247 ± 1.472
AspMet: 1.685 ± 0.982
AspAsn: 1.124 ± 0.493
AspPro: 4.494 ± 0.337
AspGln: 2.809 ± 1.227
AspArg: 0.562 ± 0.368
AspSer: 1.124 ± 0.736
AspThr: 3.371 ± 2.045
AspVal: 2.809 ± 0.881
AspTrp: 2.809 ± 1.339
AspTyr: 1.124 ± 0.493
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.618 ± 1.923
GluCys: 1.124 ± 0.736
GluAsp: 2.247 ± 0.986
GluGlu: 6.742 ± 2.126
GluPhe: 2.809 ± 1.222
GluGly: 7.865 ± 1.947
GluHis: 0.0 ± 0.0
GluIle: 3.933 ± 1.1
GluLys: 1.685 ± 1.104
GluLeu: 11.236 ± 1.653
GluMet: 1.685 ± 0.646
GluAsn: 3.371 ± 0.606
GluPro: 2.809 ± 0.987
GluGln: 1.685 ± 0.671
GluArg: 1.685 ± 0.545
GluSer: 2.809 ± 1.357
GluThr: 3.371 ± 1.125
GluVal: 7.303 ± 1.533
GluTrp: 0.0 ± 0.0
GluTyr: 1.124 ± 0.531
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.809 ± 1.119
PheCys: 0.0 ± 0.0
PheAsp: 0.0 ± 0.0
PheGlu: 2.809 ± 0.987
PhePhe: 1.124 ± 0.493
PheGly: 3.371 ± 1.394
PheHis: 1.685 ± 0.971
PheIle: 1.685 ± 1.104
PheLys: 2.247 ± 1.472
PheLeu: 3.371 ± 0.948
PheMet: 0.562 ± 0.551
PheAsn: 1.685 ± 0.729
PhePro: 1.685 ± 0.671
PheGln: 3.371 ± 0.377
PheArg: 1.124 ± 1.107
PheSer: 0.0 ± 0.0
PheThr: 0.562 ± 0.368
PheVal: 2.247 ± 0.749
PheTrp: 1.124 ± 0.719
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.056 ± 1.177
GlyCys: 2.247 ± 1.472
GlyAsp: 1.124 ± 0.531
GlyGlu: 3.933 ± 2.108
GlyPhe: 1.685 ± 0.732
GlyGly: 13.483 ± 2.956
GlyHis: 0.0 ± 0.0
GlyIle: 4.494 ± 1.081
GlyLys: 2.247 ± 0.59
GlyLeu: 7.303 ± 1.428
GlyMet: 1.685 ± 0.671
GlyAsn: 6.18 ± 1.501
GlyPro: 5.056 ± 2.087
GlyGln: 1.124 ± 0.719
GlyArg: 4.494 ± 1.224
GlySer: 5.056 ± 1.159
GlyThr: 2.809 ± 0.782
GlyVal: 8.989 ± 1.32
GlyTrp: 0.562 ± 0.368
GlyTyr: 2.247 ± 0.986
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.685 ± 0.646
HisCys: 2.247 ± 0.979
HisAsp: 1.124 ± 0.739
HisGlu: 0.562 ± 0.368
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.562 ± 0.368
HisIle: 1.124 ± 0.531
HisLys: 2.247 ± 1.063
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.685 ± 0.545
HisGln: 0.562 ± 0.368
HisArg: 0.562 ± 0.368
HisSer: 0.562 ± 0.368
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 0.562 ± 0.368
HisTyr: 1.124 ± 0.531
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.933 ± 2.073
IleCys: 0.0 ± 0.0
IleAsp: 1.124 ± 0.493
IleGlu: 2.809 ± 0.596
IlePhe: 0.562 ± 0.551
IleGly: 4.494 ± 0.912
IleHis: 0.562 ± 0.551
IleIle: 2.809 ± 1.108
IleLys: 1.685 ± 0.671
IleLeu: 5.056 ± 0.67
IleMet: 1.124 ± 0.736
IleAsn: 5.618 ± 0.781
IlePro: 3.933 ± 1.333
IleGln: 2.809 ± 0.802
IleArg: 0.562 ± 0.554
IleSer: 1.124 ± 0.493
IleThr: 1.124 ± 0.736
IleVal: 1.124 ± 0.688
IleTrp: 1.685 ± 0.646
IleTyr: 0.562 ± 0.554
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.371 ± 1.123
LysCys: 1.685 ± 0.729
LysAsp: 2.809 ± 1.222
LysGlu: 6.18 ± 1.125
LysPhe: 1.124 ± 0.493
LysGly: 5.618 ± 2.024
LysHis: 3.933 ± 0.447
LysIle: 1.685 ± 0.671
LysLys: 9.551 ± 1.665
LysLeu: 3.933 ± 1.205
LysMet: 1.124 ± 0.531
LysAsn: 2.247 ± 0.986
LysPro: 1.685 ± 0.671
LysGln: 2.247 ± 0.964
LysArg: 3.933 ± 1.359
LysSer: 1.685 ± 1.104
LysThr: 1.685 ± 1.104
LysVal: 3.371 ± 1.09
LysTrp: 1.685 ± 0.646
LysTyr: 2.247 ± 1.026
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.18 ± 1.094
LeuCys: 2.809 ± 0.881
LeuAsp: 3.933 ± 1.619
LeuGlu: 7.303 ± 2.25
LeuPhe: 5.618 ± 1.197
LeuGly: 5.618 ± 1.062
LeuHis: 1.124 ± 0.736
LeuIle: 7.865 ± 0.895
LeuLys: 3.933 ± 1.205
LeuLeu: 12.921 ± 1.783
LeuMet: 3.371 ± 1.235
LeuAsn: 6.742 ± 2.598
LeuPro: 8.427 ± 2.001
LeuGln: 5.618 ± 1.324
LeuArg: 5.056 ± 1.311
LeuSer: 10.112 ± 3.342
LeuThr: 2.247 ± 0.962
LeuVal: 1.685 ± 0.692
LeuTrp: 2.247 ± 0.916
LeuTyr: 3.371 ± 0.626
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.494 ± 1.014
MetCys: 0.562 ± 0.554
MetAsp: 1.685 ± 1.018
MetGlu: 3.371 ± 1.123
MetPhe: 1.124 ± 0.493
MetGly: 1.124 ± 0.499
MetHis: 0.0 ± 0.0
MetIle: 0.562 ± 0.554
MetLys: 1.124 ± 0.736
MetLeu: 2.247 ± 0.749
MetMet: 1.124 ± 0.493
MetAsn: 0.562 ± 0.368
MetPro: 0.562 ± 0.554
MetGln: 1.685 ± 0.545
MetArg: 0.562 ± 0.368
MetSer: 1.124 ± 0.493
MetThr: 0.562 ± 0.554
MetVal: 2.247 ± 0.964
MetTrp: 1.685 ± 1.018
MetTyr: 1.124 ± 0.739
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.371 ± 1.342
AsnCys: 1.124 ± 0.736
AsnAsp: 2.247 ± 0.457
AsnGlu: 1.685 ± 0.484
AsnPhe: 2.809 ± 1.451
AsnGly: 3.371 ± 2.045
AsnHis: 0.0 ± 0.0
AsnIle: 2.809 ± 1.339
AsnLys: 3.933 ± 1.349
AsnLeu: 7.865 ± 2.069
AsnMet: 2.247 ± 0.984
AsnAsn: 1.685 ± 0.646
AsnPro: 2.809 ± 0.754
AsnGln: 2.809 ± 0.782
AsnArg: 2.809 ± 1.635
AsnSer: 1.124 ± 0.493
AsnThr: 3.933 ± 0.611
AsnVal: 1.685 ± 0.729
AsnTrp: 1.124 ± 0.531
AsnTyr: 0.562 ± 0.551
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.809 ± 1.119
ProCys: 1.124 ± 0.493
ProAsp: 4.494 ± 1.224
ProGlu: 4.494 ± 1.299
ProPhe: 0.562 ± 0.368
ProGly: 4.494 ± 1.596
ProHis: 0.0 ± 0.0
ProIle: 1.124 ± 0.736
ProLys: 2.247 ± 0.964
ProLeu: 6.18 ± 1.318
ProMet: 1.685 ± 1.018
ProAsn: 1.685 ± 0.671
ProPro: 4.494 ± 1.778
ProGln: 7.865 ± 1.488
ProArg: 1.124 ± 0.688
ProSer: 2.809 ± 0.39
ProThr: 4.494 ± 1.614
ProVal: 3.371 ± 0.598
ProTrp: 1.124 ± 0.739
ProTyr: 1.685 ± 0.982
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.494 ± 1.439
GlnCys: 0.0 ± 0.0
GlnAsp: 2.809 ± 1.84
GlnGlu: 2.247 ± 0.964
GlnPhe: 2.247 ± 0.964
GlnGly: 3.371 ± 1.501
GlnHis: 1.685 ± 0.729
GlnIle: 0.0 ± 0.0
GlnLys: 5.056 ± 0.493
GlnLeu: 3.371 ± 0.377
GlnMet: 1.124 ± 1.107
GlnAsn: 2.809 ± 1.362
GlnPro: 1.685 ± 1.661
GlnGln: 1.124 ± 0.493
GlnArg: 7.303 ± 1.919
GlnSer: 2.809 ± 1.267
GlnThr: 2.809 ± 1.025
GlnVal: 1.124 ± 1.107
GlnTrp: 1.685 ± 0.646
GlnTyr: 1.685 ± 0.732
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.371 ± 1.501
ArgCys: 0.0 ± 0.0
ArgAsp: 3.933 ± 1.197
ArgGlu: 4.494 ± 1.968
ArgPhe: 0.562 ± 0.554
ArgGly: 3.371 ± 1.501
ArgHis: 0.562 ± 0.368
ArgIle: 1.124 ± 0.688
ArgLys: 4.494 ± 1.469
ArgLeu: 5.056 ± 0.679
ArgMet: 1.124 ± 0.659
ArgAsn: 0.562 ± 0.368
ArgPro: 1.685 ± 0.646
ArgGln: 2.809 ± 0.782
ArgArg: 3.933 ± 0.973
ArgSer: 1.685 ± 1.018
ArgThr: 2.809 ± 1.362
ArgVal: 3.933 ± 1.593
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.562 ± 0.554
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.371 ± 0.877
SerCys: 0.0 ± 0.0
SerAsp: 5.056 ± 0.493
SerGlu: 2.247 ± 0.811
SerPhe: 3.371 ± 1.155
SerGly: 6.742 ± 1.591
SerHis: 1.124 ± 0.531
SerIle: 0.562 ± 0.538
SerLys: 5.056 ± 1.062
SerLeu: 2.809 ± 0.39
SerMet: 1.124 ± 0.736
SerAsn: 1.685 ± 1.127
SerPro: 1.685 ± 0.729
SerGln: 2.809 ± 0.961
SerArg: 2.247 ± 0.457
SerSer: 5.056 ± 1.074
SerThr: 4.494 ± 1.615
SerVal: 3.933 ± 1.351
SerTrp: 0.562 ± 0.551
SerTyr: 3.371 ± 0.606
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.809 ± 1.44
ThrCys: 1.124 ± 0.736
ThrAsp: 1.685 ± 0.671
ThrGlu: 5.056 ± 0.652
ThrPhe: 1.124 ± 0.493
ThrGly: 2.247 ± 1.067
ThrHis: 0.0 ± 0.0
ThrIle: 2.809 ± 0.79
ThrLys: 0.562 ± 0.554
ThrLeu: 6.18 ± 1.193
ThrMet: 2.809 ± 1.429
ThrAsn: 0.0 ± 0.0
ThrPro: 5.618 ± 0.648
ThrGln: 3.371 ± 0.598
ThrArg: 3.933 ± 0.973
ThrSer: 5.056 ± 1.784
ThrThr: 3.371 ± 0.877
ThrVal: 3.933 ± 1.863
ThrTrp: 1.124 ± 0.739
ThrTyr: 0.562 ± 0.368
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.056 ± 1.727
ValCys: 0.0 ± 0.0
ValAsp: 1.124 ± 0.493
ValGlu: 5.056 ± 1.787
ValPhe: 0.562 ± 0.368
ValGly: 3.933 ± 1.516
ValHis: 0.562 ± 0.554
ValIle: 2.247 ± 1.515
ValLys: 3.371 ± 1.704
ValLeu: 9.551 ± 1.291
ValMet: 1.685 ± 0.646
ValAsn: 4.494 ± 0.812
ValPro: 3.371 ± 1.125
ValGln: 3.933 ± 1.688
ValArg: 1.685 ± 0.729
ValSer: 5.618 ± 0.974
ValThr: 1.124 ± 1.107
ValVal: 4.494 ± 0.635
ValTrp: 1.124 ± 0.719
ValTyr: 1.124 ± 0.688
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.124 ± 0.493
TrpCys: 0.562 ± 0.551
TrpAsp: 1.685 ± 0.729
TrpGlu: 2.247 ± 0.457
TrpPhe: 0.562 ± 0.551
TrpGly: 3.371 ± 1.72
TrpHis: 1.124 ± 0.531
TrpIle: 0.0 ± 0.0
TrpLys: 0.562 ± 0.368
TrpLeu: 2.809 ± 1.339
TrpMet: 1.124 ± 0.739
TrpAsn: 1.124 ± 0.531
TrpPro: 1.124 ± 0.739
TrpGln: 0.562 ± 0.368
TrpArg: 0.0 ± 0.0
TrpSer: 0.562 ± 0.551
TrpThr: 2.247 ± 1.478
TrpVal: 2.247 ± 0.457
TrpTrp: 1.685 ± 0.729
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.124 ± 0.531
TyrCys: 0.562 ± 0.368
TyrAsp: 1.685 ± 0.646
TyrGlu: 1.124 ± 0.736
TyrPhe: 1.685 ± 0.982
TyrGly: 2.247 ± 0.986
TyrHis: 1.124 ± 0.531
TyrIle: 0.0 ± 0.0
TyrLys: 1.124 ± 0.493
TyrLeu: 2.247 ± 0.59
TyrMet: 1.124 ± 0.736
TyrAsn: 3.933 ± 1.766
TyrPro: 0.562 ± 0.554
TyrGln: 1.124 ± 0.688
TyrArg: 1.685 ± 0.732
TyrSer: 2.809 ± 0.881
TyrThr: 3.371 ± 0.598
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.247 ± 0.749
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1781 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
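A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration rather than the procedure actually used for the table: it assumes the reported error is the standard deviation of the frequency across resamples, and the function names, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics


def dipeptide_freq(sequences, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping pairs."""
    total = hits = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_freq(sample, dipeptide))
    # Assumption: the error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy proteome of three short "proteins", not real data.
proteins = ["MAAL", "AAK", "GGG"]
print(dipeptide_freq(proteins, "AA"), bootstrap_error(proteins, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within each protein together, so with only 5 proteins a single sequence can strongly affect the estimate, which would be consistent with the relatively large errors shown in the table.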

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski