Amino acid dipeptide frequency for Vicugna pacos polyomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
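The counting described above can be sketched in a few lines of Python. This is a minimal illustration and not the Proteome-pI implementation: the function names `dipeptide_per_mille` and `label`, the one-letter alphabet with `X` standing in for Xaa, the use of overlapping pairs within each protein, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Replace non-standard or unknown residues with X before counting.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Each sequence is scanned on its own, so pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def label(freq):
    """Mirror the table's colouring: above expectation is red, below is blue."""
    if freq > EXPECTED_PER_MILLE:
        return "red"
    if freq < EXPECTED_PER_MILLE:
        return "blue"
    return "neutral"


# Hypothetical toy sequences, not taken from the proteome.
toy = ["MAALLGG", "MKKAAL"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["AA"], 3), label(freqs["AA"]))
```

With real data, `proteins` would hold the protein sequences of the proteome; the comparison against 2.5‰ then reproduces the over/under-representation marking under the uniform assumption.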

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
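As a quick worked example of the unit conversion, the snippet below turns the AlaAla entry of the table (10.276‰) into a fraction and compares it with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is hypothetical and only serves the illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala = per_mille_to_fraction(10.276)  # AlaAla entry from the table
uniform = 1 / 400                        # expected fraction if all 400 dipeptides were equally likely

print(f"{ala_ala:.6f}")            # 0.010276
print(f"{ala_ala / uniform:.2f}")  # 4.11, i.e. about four times the uniform expectation
```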

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
10.276AlaAla: 10.276 ± 6.281
0.0AlaCys: 0.0 ± 0.0
0.541AlaAsp: 0.541 ± 0.468
3.786AlaGlu: 3.786 ± 1.033
1.622AlaPhe: 1.622 ± 0.87
6.49AlaGly: 6.49 ± 3.152
1.082AlaHis: 1.082 ± 0.673
3.245AlaIle: 3.245 ± 1.227
3.786AlaLys: 3.786 ± 0.887
11.357AlaLeu: 11.357 ± 3.685
1.622AlaMet: 1.622 ± 0.771
3.245AlaAsn: 3.245 ± 1.383
0.541AlaPro: 0.541 ± 0.468
4.327AlaGln: 4.327 ± 1.787
1.622AlaArg: 1.622 ± 0.506
1.082AlaSer: 1.082 ± 0.61
2.163AlaThr: 2.163 ± 1.235
5.408AlaVal: 5.408 ± 0.985
0.541AlaTrp: 0.541 ± 0.385
1.622AlaTyr: 1.622 ± 1.085
0.0AlaXaa: 0.0 ± 0.0
Cys
0.541CysAla: 0.541 ± 0.385
0.541CysCys: 0.541 ± 0.539
0.541CysAsp: 0.541 ± 0.385
0.541CysGlu: 0.541 ± 0.385
1.622CysPhe: 1.622 ± 1.085
0.541CysGly: 0.541 ± 0.385
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
3.245CysLys: 3.245 ± 1.279
1.082CysLeu: 1.082 ± 0.769
0.0CysMet: 0.0 ± 0.0
0.541CysAsn: 0.541 ± 0.385
0.541CysPro: 0.541 ± 0.539
1.622CysGln: 1.622 ± 0.866
2.163CysArg: 2.163 ± 1.221
0.541CysSer: 0.541 ± 0.385
1.622CysThr: 1.622 ± 0.866
0.0CysVal: 0.0 ± 0.0
1.082CysTrp: 1.082 ± 0.61
1.622CysTyr: 1.622 ± 1.138
0.0CysXaa: 0.0 ± 0.0
Asp
4.327AspAla: 4.327 ± 1.485
0.0AspCys: 0.0 ± 0.0
2.704AspAsp: 2.704 ± 1.923
4.327AspGlu: 4.327 ± 1.57
2.163AspPhe: 2.163 ± 0.595
3.245AspGly: 3.245 ± 1.065
0.0AspHis: 0.0 ± 0.0
4.327AspIle: 4.327 ± 1.226
2.704AspLys: 2.704 ± 1.002
3.245AspLeu: 3.245 ± 1.2
1.082AspMet: 1.082 ± 0.411
2.704AspAsn: 2.704 ± 0.315
3.245AspPro: 3.245 ± 0.333
0.541AspGln: 0.541 ± 0.385
2.704AspArg: 2.704 ± 1.448
1.082AspSer: 1.082 ± 0.769
0.0AspThr: 0.0 ± 0.0
3.245AspVal: 3.245 ± 0.543
2.704AspTrp: 2.704 ± 1.376
2.163AspTyr: 2.163 ± 0.977
0.0AspXaa: 0.0 ± 0.0
Glu
3.245GluAla: 3.245 ± 1.645
1.622GluCys: 1.622 ± 1.085
3.786GluAsp: 3.786 ± 2.086
10.276GluGlu: 10.276 ± 2.405
1.082GluPhe: 1.082 ± 0.673
4.327GluGly: 4.327 ± 1.217
1.622GluHis: 1.622 ± 0.771
5.408GluIle: 5.408 ± 0.977
4.867GluLys: 4.867 ± 3.038
5.949GluLeu: 5.949 ± 1.344
2.704GluMet: 2.704 ± 0.315
3.786GluAsn: 3.786 ± 0.529
1.622GluPro: 1.622 ± 0.699
0.541GluGln: 0.541 ± 0.385
3.245GluArg: 3.245 ± 1.011
3.786GluSer: 3.786 ± 1.473
3.786GluThr: 3.786 ± 1.18
5.408GluVal: 5.408 ± 1.791
1.082GluTrp: 1.082 ± 0.61
2.704GluTyr: 2.704 ± 1.008
0.0GluXaa: 0.0 ± 0.0
Phe
0.541PheAla: 0.541 ± 0.385
1.082PheCys: 1.082 ± 0.61
1.622PheAsp: 1.622 ± 1.154
3.245PheGlu: 3.245 ± 1.733
1.622PhePhe: 1.622 ± 1.04
3.245PheGly: 3.245 ± 1.281
1.082PheHis: 1.082 ± 0.411
0.541PheIle: 0.541 ± 0.385
0.541PheLys: 0.541 ± 0.385
2.163PheLeu: 2.163 ± 0.444
1.622PheMet: 1.622 ± 0.827
1.622PheAsn: 1.622 ± 0.792
2.163PhePro: 2.163 ± 0.595
1.082PheGln: 1.082 ± 0.769
1.082PheArg: 1.082 ± 0.937
5.949PheSer: 5.949 ± 1.994
4.867PheThr: 4.867 ± 1.36
1.622PheVal: 1.622 ± 1.154
1.082PheTrp: 1.082 ± 0.683
0.541PheTyr: 0.541 ± 0.468
0.0PheXaa: 0.0 ± 0.0
Gly
7.031GlyAla: 7.031 ± 3.739
0.541GlyCys: 0.541 ± 0.385
2.704GlyAsp: 2.704 ± 1.567
3.245GlyGlu: 3.245 ± 1.093
3.786GlyPhe: 3.786 ± 0.403
11.357GlyGly: 11.357 ± 2.5
0.541GlyHis: 0.541 ± 0.385
7.031GlyIle: 7.031 ± 3.476
2.163GlyLys: 2.163 ± 0.977
5.408GlyLeu: 5.408 ± 0.945
0.541GlyMet: 0.541 ± 0.385
6.49GlyAsn: 6.49 ± 1.566
3.786GlyPro: 3.786 ± 0.71
3.245GlyGln: 3.245 ± 1.065
2.704GlyArg: 2.704 ± 1.481
4.867GlySer: 4.867 ± 1.305
1.622GlyThr: 1.622 ± 0.771
8.112GlyVal: 8.112 ± 1.706
1.082GlyTrp: 1.082 ± 0.577
2.163GlyTyr: 2.163 ± 0.446
0.0GlyXaa: 0.0 ± 0.0
His
0.541HisAla: 0.541 ± 0.468
0.0HisCys: 0.0 ± 0.0
0.541HisAsp: 0.541 ± 0.385
0.541HisGlu: 0.541 ± 0.385
0.541HisPhe: 0.541 ± 0.468
1.622HisGly: 1.622 ± 0.506
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.541HisLys: 0.541 ± 0.385
0.541HisLeu: 0.541 ± 0.385
0.0HisMet: 0.0 ± 0.0
0.541HisAsn: 0.541 ± 0.385
1.082HisPro: 1.082 ± 0.61
0.0HisGln: 0.0 ± 0.0
2.163HisArg: 2.163 ± 0.613
1.622HisSer: 1.622 ± 0.592
1.082HisThr: 1.082 ± 0.411
1.082HisVal: 1.082 ± 0.411
0.0HisTrp: 0.0 ± 0.0
2.163HisTyr: 2.163 ± 0.595
0.0HisXaa: 0.0 ± 0.0
Ile
5.949IleAla: 5.949 ± 3.287
0.541IleCys: 0.541 ± 0.385
4.327IleAsp: 4.327 ± 1.611
2.704IleGlu: 2.704 ± 1.37
4.867IlePhe: 4.867 ± 1.649
3.786IleGly: 3.786 ± 1.299
1.622IleHis: 1.622 ± 0.866
2.163IleIle: 2.163 ± 0.673
2.163IleLys: 2.163 ± 0.875
3.786IleLeu: 3.786 ± 1.077
0.0IleMet: 0.0 ± 0.0
2.163IleAsn: 2.163 ± 0.977
4.867IlePro: 4.867 ± 1.012
2.163IleGln: 2.163 ± 0.595
0.0IleArg: 0.0 ± 0.0
3.245IleSer: 3.245 ± 1.444
2.163IleThr: 2.163 ± 0.802
4.867IleVal: 4.867 ± 0.813
1.622IleTrp: 1.622 ± 0.506
1.622IleTyr: 1.622 ± 0.866
0.0IleXaa: 0.0 ± 0.0
Lys
2.163LysAla: 2.163 ± 1.193
1.622LysCys: 1.622 ± 0.866
1.622LysAsp: 1.622 ± 0.866
3.786LysGlu: 3.786 ± 2.285
1.082LysPhe: 1.082 ± 0.769
5.408LysGly: 5.408 ± 1.921
2.163LysHis: 2.163 ± 1.538
2.163LysIle: 2.163 ± 1.193
7.572LysLys: 7.572 ± 1.246
4.867LysLeu: 4.867 ± 1.905
2.704LysMet: 2.704 ± 1.547
2.704LysAsn: 2.704 ± 1.338
2.163LysPro: 2.163 ± 1.235
1.622LysGln: 1.622 ± 0.792
6.49LysArg: 6.49 ± 1.52
1.622LysSer: 1.622 ± 1.154
4.327LysThr: 4.327 ± 1.725
3.245LysVal: 3.245 ± 1.332
1.082LysTrp: 1.082 ± 0.61
1.082LysTyr: 1.082 ± 0.411
0.0LysXaa: 0.0 ± 0.0
Leu
4.867LeuAla: 4.867 ± 2.002
2.704LeuCys: 2.704 ± 1.338
3.245LeuAsp: 3.245 ± 1.332
9.194LeuGlu: 9.194 ± 1.162
1.622LeuPhe: 1.622 ± 1.154
4.867LeuGly: 4.867 ± 1.775
1.082LeuHis: 1.082 ± 0.61
2.704LeuIle: 2.704 ± 0.978
4.327LeuLys: 4.327 ± 0.817
11.357LeuLeu: 11.357 ± 0.961
2.704LeuMet: 2.704 ± 1.002
4.327LeuAsn: 4.327 ± 0.817
6.49LeuPro: 6.49 ± 1.339
6.49LeuGln: 6.49 ± 1.063
5.949LeuArg: 5.949 ± 1.871
7.572LeuSer: 7.572 ± 2.618
4.867LeuThr: 4.867 ± 1.1
3.245LeuVal: 3.245 ± 1.407
0.0LeuTrp: 0.0 ± 0.0
5.408LeuTyr: 5.408 ± 0.769
0.0LeuXaa: 0.0 ± 0.0
Met
1.622MetAla: 1.622 ± 0.592
1.082MetCys: 1.082 ± 0.61
5.408MetAsp: 5.408 ± 1.346
0.541MetGlu: 0.541 ± 0.385
0.541MetPhe: 0.541 ± 0.468
2.163MetGly: 2.163 ± 0.534
0.0MetHis: 0.0 ± 0.0
1.082MetIle: 1.082 ± 0.411
1.622MetLys: 1.622 ± 0.866
3.786MetLeu: 3.786 ± 1.036
1.622MetMet: 1.622 ± 0.622
1.622MetAsn: 1.622 ± 1.154
0.0MetPro: 0.0 ± 0.0
1.082MetGln: 1.082 ± 0.673
1.082MetArg: 1.082 ± 0.61
1.622MetSer: 1.622 ± 0.495
1.082MetThr: 1.082 ± 0.411
0.541MetVal: 0.541 ± 0.468
0.0MetTrp: 0.0 ± 0.0
1.082MetTyr: 1.082 ± 0.769
0.0MetXaa: 0.0 ± 0.0
Asn
5.949AsnAla: 5.949 ± 1.671
2.163AsnCys: 2.163 ± 0.728
1.082AsnAsp: 1.082 ± 0.411
6.49AsnGlu: 6.49 ± 1.71
2.163AsnPhe: 2.163 ± 0.728
3.786AsnGly: 3.786 ± 1.2
0.541AsnHis: 0.541 ± 0.385
5.408AsnIle: 5.408 ± 1.336
1.622AsnLys: 1.622 ± 0.643
8.112AsnLeu: 8.112 ± 1.96
2.163AsnMet: 2.163 ± 0.595
2.704AsnAsn: 2.704 ± 0.315
1.622AsnPro: 1.622 ± 0.792
5.408AsnGln: 5.408 ± 1.967
1.082AsnArg: 1.082 ± 0.411
3.245AsnSer: 3.245 ± 2.153
3.245AsnThr: 3.245 ± 1.19
0.541AsnVal: 0.541 ± 0.385
1.082AsnTrp: 1.082 ± 0.769
0.541AsnTyr: 0.541 ± 0.385
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
0.541ProCys: 0.541 ± 0.539
5.408ProAsp: 5.408 ± 1.078
4.327ProGlu: 4.327 ± 0.961
2.163ProPhe: 2.163 ± 1.193
3.245ProGly: 3.245 ± 0.588
0.0ProHis: 0.0 ± 0.0
2.163ProIle: 2.163 ± 0.446
1.622ProLys: 1.622 ± 1.154
4.867ProLeu: 4.867 ± 1.66
2.163ProMet: 2.163 ± 0.444
2.163ProAsn: 2.163 ± 0.875
3.786ProPro: 3.786 ± 1.22
2.704ProGln: 2.704 ± 1.231
3.245ProArg: 3.245 ± 1.184
2.704ProSer: 2.704 ± 0.315
3.245ProThr: 3.245 ± 0.532
3.786ProVal: 3.786 ± 0.951
0.0ProTrp: 0.0 ± 0.0
1.622ProTyr: 1.622 ± 0.771
0.0ProXaa: 0.0 ± 0.0
Gln
4.867GlnAla: 4.867 ± 0.997
1.082GlnCys: 1.082 ± 0.61
0.541GlnAsp: 0.541 ± 0.468
1.622GlnGlu: 1.622 ± 0.771
1.082GlnPhe: 1.082 ± 0.411
3.245GlnGly: 3.245 ± 1.184
0.541GlnHis: 0.541 ± 0.385
3.786GlnIle: 3.786 ± 1.036
2.704GlnLys: 2.704 ± 1.002
2.163GlnLeu: 2.163 ± 0.835
0.0GlnMet: 0.0 ± 0.0
2.163GlnAsn: 2.163 ± 1.221
2.704GlnPro: 2.704 ± 1.231
2.704GlnGln: 2.704 ± 0.865
4.867GlnArg: 4.867 ± 1.79
0.541GlnSer: 0.541 ± 0.385
4.327GlnThr: 4.327 ± 1.615
4.327GlnVal: 4.327 ± 2.088
1.622GlnTrp: 1.622 ± 0.506
1.622GlnTyr: 1.622 ± 0.643
0.0GlnXaa: 0.0 ± 0.0
Arg
3.245ArgAla: 3.245 ± 0.784
0.0ArgCys: 0.0 ± 0.0
3.245ArgAsp: 3.245 ± 1.011
1.082ArgGlu: 1.082 ± 0.61
2.704ArgPhe: 2.704 ± 0.762
1.082ArgGly: 1.082 ± 0.464
1.082ArgHis: 1.082 ± 0.61
5.408ArgIle: 5.408 ± 2.253
2.704ArgLys: 2.704 ± 1.37
4.327ArgLeu: 4.327 ± 0.894
0.541ArgMet: 0.541 ± 0.385
4.327ArgAsn: 4.327 ± 1.17
1.622ArgPro: 1.622 ± 0.592
1.622ArgGln: 1.622 ± 0.792
5.949ArgArg: 5.949 ± 1.344
2.704ArgSer: 2.704 ± 1.126
2.163ArgThr: 2.163 ± 1.221
4.327ArgVal: 4.327 ± 0.428
0.0ArgTrp: 0.0 ± 0.0
1.082ArgTyr: 1.082 ± 0.769
0.0ArgXaa: 0.0 ± 0.0
Ser
1.622SerAla: 1.622 ± 0.699
1.622SerCys: 1.622 ± 1.085
5.408SerAsp: 5.408 ± 1.808
4.867SerGlu: 4.867 ± 1.452
2.163SerPhe: 2.163 ± 0.595
4.867SerGly: 4.867 ± 1.037
0.0SerHis: 0.0 ± 0.0
3.786SerIle: 3.786 ± 0.61
4.327SerLys: 4.327 ± 1.76
2.704SerLeu: 2.704 ± 0.785
3.245SerMet: 3.245 ± 0.602
4.327SerAsn: 4.327 ± 1.402
0.541SerPro: 0.541 ± 0.468
3.786SerGln: 3.786 ± 1.262
2.704SerArg: 2.704 ± 0.315
3.786SerSer: 3.786 ± 2.015
3.786SerThr: 3.786 ± 1.822
1.622SerVal: 1.622 ± 0.792
1.082SerTrp: 1.082 ± 0.61
2.163SerTyr: 2.163 ± 1.346
0.0SerXaa: 0.0 ± 0.0
Thr
2.704ThrAla: 2.704 ± 1.126
0.541ThrCys: 0.541 ± 0.539
1.622ThrAsp: 1.622 ± 0.506
4.327ThrGlu: 4.327 ± 1.25
2.163ThrPhe: 2.163 ± 0.875
4.327ThrGly: 4.327 ± 2.174
1.082ThrHis: 1.082 ± 0.411
1.622ThrIle: 1.622 ± 1.031
3.245ThrLys: 3.245 ± 1.185
7.031ThrLeu: 7.031 ± 1.757
1.622ThrMet: 1.622 ± 0.699
1.622ThrAsn: 1.622 ± 0.792
5.949ThrPro: 5.949 ± 0.503
2.163ThrGln: 2.163 ± 1.235
0.541ThrArg: 0.541 ± 0.385
5.949ThrSer: 5.949 ± 3.031
3.245ThrThr: 3.245 ± 1.232
3.786ThrVal: 3.786 ± 1.399
1.082ThrTrp: 1.082 ± 0.656
0.541ThrTyr: 0.541 ± 0.468
0.0ThrXaa: 0.0 ± 0.0
Val
2.163ValAla: 2.163 ± 0.977
0.0ValCys: 0.0 ± 0.0
2.163ValAsp: 2.163 ± 0.821
3.786ValGlu: 3.786 ± 1.7
2.163ValPhe: 2.163 ± 0.977
8.653ValGly: 8.653 ± 3.128
1.622ValHis: 1.622 ± 0.771
1.622ValIle: 1.622 ± 0.592
2.704ValLys: 2.704 ± 1.107
5.949ValLeu: 5.949 ± 2.28
0.0ValMet: 0.0 ± 0.0
5.408ValAsn: 5.408 ± 2.144
4.867ValPro: 4.867 ± 1.158
4.867ValGln: 4.867 ± 1.439
0.541ValArg: 0.541 ± 0.468
4.327ValSer: 4.327 ± 1.925
5.408ValThr: 5.408 ± 1.693
6.49ValVal: 6.49 ± 1.103
0.541ValTrp: 0.541 ± 0.468
2.163ValTyr: 2.163 ± 1.029
0.0ValXaa: 0.0 ± 0.0
Trp
0.541TrpAla: 0.541 ± 0.468
0.541TrpCys: 0.541 ± 0.385
0.0TrpAsp: 0.0 ± 0.0
1.622TrpGlu: 1.622 ± 0.643
1.082TrpPhe: 1.082 ± 0.577
1.082TrpGly: 1.082 ± 0.61
0.0TrpHis: 0.0 ± 0.0
1.082TrpIle: 1.082 ± 0.673
1.622TrpLys: 1.622 ± 1.154
1.622TrpLeu: 1.622 ± 0.592
1.082TrpMet: 1.082 ± 0.673
2.163TrpAsn: 2.163 ± 1.221
0.541TrpPro: 0.541 ± 0.487
0.541TrpGln: 0.541 ± 0.539
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
1.082TrpThr: 1.082 ± 0.673
1.622TrpVal: 1.622 ± 0.799
1.082TrpTrp: 1.082 ± 0.61
0.541TrpTyr: 0.541 ± 0.385
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.163TyrAla: 2.163 ± 0.595
2.163TyrCys: 2.163 ± 1.193
0.0TyrAsp: 0.0 ± 0.0
0.541TyrGlu: 0.541 ± 0.385
1.082TyrPhe: 1.082 ± 0.937
1.082TyrGly: 1.082 ± 0.683
0.541TyrHis: 0.541 ± 0.468
1.082TyrIle: 1.082 ± 0.411
5.408TyrLys: 5.408 ± 2.377
2.704TyrLeu: 2.704 ± 0.865
1.622TyrMet: 1.622 ± 1.154
4.327TyrAsn: 4.327 ± 1.611
1.622TyrPro: 1.622 ± 0.792
0.0TyrGln: 0.0 ± 0.0
1.082TyrArg: 1.082 ± 0.411
2.704TyrSer: 2.704 ± 0.785
1.082TyrThr: 1.082 ± 0.411
2.163TyrVal: 2.163 ± 0.875
1.082TyrTrp: 1.082 ± 0.769
1.082TyrTyr: 1.082 ± 0.673
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1850 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
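A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the procedure used by Proteome-pI: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each replicate, and the sample standard deviation of the replicates is taken as the error. The function names, the choice of standard deviation as the error measure, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (in per mille) over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the number of proteins fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome of 5 short sequences, not the real data.
toy = ["MAALLGG", "MKKAAL", "MGGAVL", "MLLKR", "MAAAG"]
print(pair_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Because resampling happens at the protein level, a proteome with only a few proteins (here 5) gives replicates that differ strongly from one another, which is consistent with the comparatively large errors in the table.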

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski