Amino acid dipeptide frequency for Canis familiaris polyomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
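As a rough illustration of what such a calculation can look like, the following Python sketch counts overlapping dipeptides within each protein and reports them in per mille. The function names, the one-letter input format, the mapping of every non-standard letter to X (Xaa), and the choice not to count dipeptides across protein boundaries are all assumptions made for this example; the exact counting and normalization convention used by the database is not stated on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein sequence."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown residue is treated as Xaa.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille of all counted dipeptides."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
example = ["MKKA", "AKK"]
freqs = dipeptide_per_mille(example)
```

For the toy input there are five dipeptides (MK, KK, KA, AK, KK), so KK accounts for two fifths of them.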

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰ in the units used below). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
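The arithmetic behind the expected value can be sketched in a few lines of Python. The helper names and the simple greater-than / less-than rule are assumptions for illustration; the page does not say which threshold is actually used for the red and blue marking.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected frequency of each dipeptide, in per mille, if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"   # would correspond to red in the table
    if observed_per_mille < expected_per_mille:
        return "under"  # would correspond to blue in the table
    return "expected"


expected_20 = uniform_expectation_per_mille(20)  # 400 dipeptides
expected_21 = uniform_expectation_per_mille(21)  # 441 dipeptides, with Xaa

# Two values taken from the table below.
glu_glu = classify(10.293, expected_20)
ala_cys = classify(0.542, expected_20)
```

With 20 amino acids the expectation is 2.5‰, so GluGlu (10.293‰) falls on the overrepresented side and AlaCys (0.542‰) on the underrepresented side.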

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
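A minimal sketch of the unit conversion, with hypothetical helper names, in case the values are to be compared with fractions or percentages from other sources:

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value_per_mille / 10.0


# AlaAla from the table below: 3.792 per mille.
ala_ala_fraction = per_mille_to_fraction(3.792)
ala_ala_percent = per_mille_to_percent(3.792)
```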

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.792 ± 2.107
AlaCys: 0.542 ± 0.825
AlaAsp: 1.625 ± 0.99
AlaGlu: 3.792 ± 1.891
AlaPhe: 1.625 ± 1.152
AlaGly: 5.417 ± 2.146
AlaHis: 0.542 ± 0.52
AlaIle: 3.792 ± 1.891
AlaLys: 4.334 ± 1.298
AlaLeu: 5.417 ± 2.794
AlaMet: 1.625 ± 0.923
AlaAsn: 2.709 ± 0.789
AlaPro: 0.542 ± 0.555
AlaGln: 2.709 ± 0.789
AlaArg: 4.334 ± 1.167
AlaSer: 4.334 ± 2.438
AlaThr: 2.167 ± 1.335
AlaVal: 4.875 ± 0.923
AlaTrp: 1.625 ± 0.799
AlaTyr: 3.792 ± 1.289
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.083 ± 0.883
CysCys: 0.542 ± 0.75
CysAsp: 3.25 ± 1.723
CysGlu: 1.083 ± 0.712
CysPhe: 2.709 ± 1.489
CysGly: 2.167 ± 0.714
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 3.792 ± 1.424
CysLeu: 2.709 ± 1.489
CysMet: 0.542 ± 0.356
CysAsn: 1.083 ± 0.712
CysPro: 1.083 ± 0.76
CysGln: 0.542 ± 0.356
CysArg: 0.0 ± 0.0
CysSer: 1.083 ± 0.73
CysThr: 1.083 ± 0.712
CysVal: 1.625 ± 0.72
CysTrp: 0.0 ± 0.0
CysTyr: 2.167 ± 0.714
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.167 ± 1.063
AspCys: 3.25 ± 2.509
AspAsp: 2.167 ± 1.033
AspGlu: 3.25 ± 1.119
AspPhe: 1.083 ± 0.712
AspGly: 2.709 ± 0.789
AspHis: 1.083 ± 0.712
AspIle: 1.625 ± 0.721
AspLys: 2.709 ± 1.489
AspLeu: 3.792 ± 2.053
AspMet: 1.625 ± 0.72
AspAsn: 4.334 ± 1.442
AspPro: 1.083 ± 1.04
AspGln: 1.083 ± 0.712
AspArg: 0.542 ± 0.52
AspSer: 4.334 ± 1.312
AspThr: 0.542 ± 0.356
AspVal: 2.167 ± 1.069
AspTrp: 0.542 ± 0.825
AspTyr: 1.083 ± 0.712
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.042 ± 3.409
GluCys: 1.625 ± 1.067
GluAsp: 4.875 ± 2.236
GluGlu: 10.293 ± 2.898
GluPhe: 5.959 ± 2.228
GluGly: 2.709 ± 1.719
GluHis: 0.542 ± 0.356
GluIle: 4.875 ± 0.895
GluLys: 4.334 ± 2.229
GluLeu: 5.959 ± 1.934
GluMet: 1.083 ± 0.712
GluAsn: 3.25 ± 0.704
GluPro: 0.542 ± 0.52
GluGln: 6.501 ± 2.441
GluArg: 3.25 ± 1.304
GluSer: 1.083 ± 0.712
GluThr: 5.959 ± 1.567
GluVal: 4.875 ± 1.041
GluTrp: 0.542 ± 0.356
GluTyr: 0.542 ± 0.356
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.625 ± 1.067
PheCys: 2.709 ± 1.269
PheAsp: 2.167 ± 0.999
PheGlu: 2.709 ± 1.779
PhePhe: 0.542 ± 0.52
PheGly: 2.709 ± 0.594
PheHis: 1.083 ± 0.532
PheIle: 1.083 ± 0.76
PheLys: 1.625 ± 1.067
PheLeu: 2.167 ± 0.714
PheMet: 2.709 ± 1.321
PheAsn: 3.792 ± 0.788
PhePro: 2.167 ± 0.726
PheGln: 0.542 ± 0.75
PheArg: 2.167 ± 0.924
PheSer: 1.083 ± 1.04
PheThr: 1.083 ± 0.76
PheVal: 2.167 ± 0.96
PheTrp: 0.0 ± 0.0
PheTyr: 1.625 ± 0.721
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.792 ± 1.76
GlyCys: 0.0 ± 0.0
GlyAsp: 2.709 ± 1.572
GlyGlu: 4.875 ± 1.102
GlyPhe: 1.083 ± 0.76
GlyGly: 6.501 ± 0.657
GlyHis: 0.0 ± 0.0
GlyIle: 3.25 ± 1.394
GlyLys: 3.792 ± 1.757
GlyLeu: 9.209 ± 2.086
GlyMet: 0.0 ± 0.0
GlyAsn: 2.709 ± 0.886
GlyPro: 3.792 ± 1.727
GlyGln: 8.667 ± 3.825
GlyArg: 1.083 ± 0.73
GlySer: 4.875 ± 1.85
GlyThr: 3.792 ± 1.982
GlyVal: 8.667 ± 1.867
GlyTrp: 0.542 ± 0.825
GlyTyr: 1.625 ± 1.074
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.083 ± 0.532
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.083 ± 0.73
HisPhe: 0.542 ± 0.52
HisGly: 1.083 ± 0.73
HisHis: 0.0 ± 0.0
HisIle: 1.083 ± 0.76
HisLys: 0.542 ± 0.356
HisLeu: 1.083 ± 0.532
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.083 ± 0.73
HisGln: 0.542 ± 0.52
HisArg: 2.167 ± 0.714
HisSer: 1.625 ± 0.72
HisThr: 0.0 ± 0.0
HisVal: 0.542 ± 0.356
HisTrp: 0.0 ± 0.0
HisTyr: 1.625 ± 0.652
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.875 ± 2.306
IleCys: 1.625 ± 0.74
IleAsp: 1.625 ± 1.133
IleGlu: 4.334 ± 0.879
IlePhe: 1.083 ± 0.712
IleGly: 2.167 ± 0.963
IleHis: 2.167 ± 0.924
IleIle: 1.625 ± 0.545
IleLys: 3.25 ± 0.704
IleLeu: 4.334 ± 1.34
IleMet: 0.542 ± 0.356
IleAsn: 4.875 ± 2.065
IlePro: 1.083 ± 0.509
IleGln: 2.167 ± 1.073
IleArg: 2.167 ± 0.924
IleSer: 3.792 ± 1.51
IleThr: 3.792 ± 1.662
IleVal: 2.709 ± 0.855
IleTrp: 1.083 ± 0.76
IleTyr: 0.542 ± 0.356
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.709 ± 0.594
LysCys: 1.625 ± 1.067
LysAsp: 2.709 ± 1.269
LysGlu: 5.417 ± 2.054
LysPhe: 0.0 ± 0.0
LysGly: 5.417 ± 2.287
LysHis: 1.083 ± 0.712
LysIle: 2.167 ± 0.999
LysLys: 8.667 ± 3.036
LysLeu: 5.417 ± 1.87
LysMet: 2.167 ± 0.999
LysAsn: 7.042 ± 1.669
LysPro: 2.167 ± 0.924
LysGln: 2.709 ± 0.826
LysArg: 5.417 ± 0.653
LysSer: 3.25 ± 0.704
LysThr: 2.709 ± 0.939
LysVal: 3.792 ± 1.071
LysTrp: 0.0 ± 0.0
LysTyr: 1.625 ± 1.067
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.959 ± 2.854
LeuCys: 2.167 ± 1.454
LeuAsp: 5.417 ± 1.738
LeuGlu: 3.792 ± 1.053
LeuPhe: 5.417 ± 1.587
LeuGly: 5.417 ± 1.21
LeuHis: 2.167 ± 1.454
LeuIle: 6.501 ± 0.561
LeuLys: 3.792 ± 1.39
LeuLeu: 7.584 ± 1.573
LeuMet: 4.334 ± 2.369
LeuAsn: 2.709 ± 1.501
LeuPro: 8.667 ± 1.431
LeuGln: 5.959 ± 0.882
LeuArg: 3.25 ± 1.275
LeuSer: 4.875 ± 0.59
LeuThr: 4.875 ± 2.006
LeuVal: 5.959 ± 1.441
LeuTrp: 1.083 ± 0.883
LeuTyr: 1.625 ± 0.799
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.625 ± 0.652
MetCys: 2.709 ± 1.489
MetAsp: 2.167 ± 0.999
MetGlu: 0.542 ± 0.356
MetPhe: 1.083 ± 0.709
MetGly: 2.709 ± 1.178
MetHis: 0.542 ± 0.52
MetIle: 0.542 ± 0.75
MetLys: 1.625 ± 0.799
MetLeu: 1.625 ± 1.067
MetMet: 1.083 ± 0.712
MetAsn: 1.625 ± 0.799
MetPro: 0.542 ± 0.52
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 2.167 ± 0.714
MetThr: 1.083 ± 0.532
MetVal: 1.083 ± 0.532
MetTrp: 1.083 ± 0.883
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.334 ± 0.949
AsnCys: 2.167 ± 1.423
AsnAsp: 0.0 ± 0.0
AsnGlu: 5.959 ± 1.537
AsnPhe: 2.709 ± 0.869
AsnGly: 2.167 ± 0.504
AsnHis: 1.083 ± 0.532
AsnIle: 3.792 ± 1.533
AsnLys: 3.792 ± 2.001
AsnLeu: 5.417 ± 1.738
AsnMet: 0.542 ± 0.555
AsnAsn: 1.625 ± 0.99
AsnPro: 3.25 ± 1.895
AsnGln: 2.167 ± 1.489
AsnArg: 3.25 ± 1.304
AsnSer: 1.083 ± 0.532
AsnThr: 7.042 ± 1.454
AsnVal: 4.334 ± 0.403
AsnTrp: 2.167 ± 1.519
AsnTyr: 1.083 ± 0.76
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.167 ± 0.504
ProCys: 0.542 ± 0.52
ProAsp: 2.167 ± 1.022
ProGlu: 5.417 ± 1.116
ProPhe: 1.083 ± 0.712
ProGly: 5.417 ± 1.663
ProHis: 0.0 ± 0.0
ProIle: 2.167 ± 0.504
ProLys: 3.25 ± 0.694
ProLeu: 4.875 ± 1.96
ProMet: 0.542 ± 0.52
ProAsn: 2.167 ± 0.504
ProPro: 1.625 ± 0.74
ProGln: 2.709 ± 0.789
ProArg: 2.167 ± 1.022
ProSer: 1.625 ± 0.721
ProThr: 0.0 ± 0.0
ProVal: 3.25 ± 1.557
ProTrp: 0.0 ± 0.0
ProTyr: 0.542 ± 0.52
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.709 ± 0.491
GlnCys: 1.083 ± 0.73
GlnAsp: 1.625 ± 0.799
GlnGlu: 7.042 ± 2.356
GlnPhe: 1.083 ± 0.532
GlnGly: 2.709 ± 0.841
GlnHis: 0.542 ± 0.356
GlnIle: 4.334 ± 1.007
GlnLys: 1.083 ± 0.73
GlnLeu: 5.959 ± 1.682
GlnMet: 1.083 ± 1.04
GlnAsn: 2.167 ± 0.622
GlnPro: 1.083 ± 0.532
GlnGln: 9.209 ± 3.077
GlnArg: 2.709 ± 1.742
GlnSer: 2.709 ± 1.387
GlnThr: 1.625 ± 0.545
GlnVal: 5.417 ± 1.64
GlnTrp: 3.792 ± 1.043
GlnTyr: 2.709 ± 0.851
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.167 ± 1.519
ArgCys: 1.625 ± 0.652
ArgAsp: 1.083 ± 0.532
ArgGlu: 3.25 ± 1.119
ArgPhe: 2.167 ± 1.033
ArgGly: 1.625 ± 0.721
ArgHis: 0.0 ± 0.0
ArgIle: 4.334 ± 1.025
ArgLys: 5.417 ± 1.867
ArgLeu: 1.083 ± 1.65
ArgMet: 2.167 ± 0.714
ArgAsn: 1.083 ± 0.76
ArgPro: 1.625 ± 0.721
ArgGln: 1.625 ± 0.652
ArgArg: 4.334 ± 2.227
ArgSer: 1.625 ± 1.132
ArgThr: 2.709 ± 1.936
ArgVal: 5.417 ± 1.64
ArgTrp: 0.542 ± 0.356
ArgTyr: 2.709 ± 1.239
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.334 ± 2.604
SerCys: 1.625 ± 1.192
SerAsp: 0.542 ± 0.52
SerGlu: 2.167 ± 0.935
SerPhe: 2.709 ± 0.963
SerGly: 4.875 ± 1.273
SerHis: 0.542 ± 0.825
SerIle: 1.625 ± 1.067
SerLys: 3.25 ± 0.694
SerLeu: 7.042 ± 1.83
SerMet: 0.0 ± 0.514
SerAsn: 4.875 ± 1.128
SerPro: 1.083 ± 0.76
SerGln: 3.25 ± 1.174
SerArg: 1.083 ± 0.712
SerSer: 7.584 ± 2.896
SerThr: 4.334 ± 1.415
SerVal: 3.792 ± 1.835
SerTrp: 0.542 ± 0.356
SerTyr: 1.625 ± 0.721
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.25 ± 1.715
ThrCys: 1.083 ± 0.73
ThrAsp: 1.083 ± 0.712
ThrGlu: 4.875 ± 2.153
ThrPhe: 0.542 ± 0.356
ThrGly: 4.334 ± 2.932
ThrHis: 1.083 ± 0.76
ThrIle: 3.25 ± 1.573
ThrLys: 2.167 ± 0.504
ThrLeu: 4.875 ± 1.702
ThrMet: 1.083 ± 0.712
ThrAsn: 4.875 ± 2.133
ThrPro: 4.875 ± 1.757
ThrGln: 2.167 ± 1.033
ThrArg: 2.709 ± 1.424
ThrSer: 2.167 ± 1.356
ThrThr: 7.042 ± 1.415
ThrVal: 3.792 ± 1.174
ThrTrp: 1.083 ± 1.65
ThrTyr: 1.625 ± 0.72
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.709 ± 1.481
ValCys: 1.083 ± 0.712
ValAsp: 4.334 ± 1.007
ValGlu: 3.792 ± 1.192
ValPhe: 0.542 ± 0.52
ValGly: 7.042 ± 1.385
ValHis: 0.0 ± 0.0
ValIle: 3.25 ± 0.713
ValLys: 3.792 ± 0.812
ValLeu: 7.042 ± 1.32
ValMet: 0.542 ± 0.52
ValAsn: 3.792 ± 0.987
ValPro: 4.334 ± 0.804
ValGln: 4.334 ± 1.602
ValArg: 4.334 ± 1.167
ValSer: 6.501 ± 2.042
ValThr: 3.792 ± 1.466
ValVal: 2.167 ± 1.073
ValTrp: 4.875 ± 2.091
ValTyr: 2.709 ± 0.491
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.625 ± 1.132
TrpCys: 0.0 ± 0.0
TrpAsp: 0.542 ± 0.356
TrpGlu: 1.083 ± 0.532
TrpPhe: 1.083 ± 0.73
TrpGly: 1.625 ± 1.133
TrpHis: 1.083 ± 0.883
TrpIle: 0.542 ± 0.825
TrpLys: 1.083 ± 0.712
TrpLeu: 3.25 ± 1.304
TrpMet: 1.083 ± 0.664
TrpAsn: 0.542 ± 0.825
TrpPro: 0.0 ± 0.0
TrpGln: 2.167 ± 0.924
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 2.709 ± 0.716
TrpVal: 2.167 ± 1.519
TrpTrp: 0.542 ± 0.356
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.542 ± 0.356
TyrCys: 0.542 ± 0.825
TyrAsp: 1.083 ± 0.76
TyrGlu: 1.083 ± 0.76
TyrPhe: 2.709 ± 0.789
TyrGly: 2.167 ± 0.726
TyrHis: 0.542 ± 0.356
TyrIle: 0.0 ± 0.0
TyrLys: 3.792 ± 1.047
TyrLeu: 2.709 ± 0.869
TyrMet: 0.542 ± 0.356
TyrAsn: 2.709 ± 0.851
TyrPro: 1.083 ± 1.04
TyrGln: 1.625 ± 0.721
TyrArg: 1.625 ± 0.721
TyrSer: 1.625 ± 1.074
TyrThr: 1.625 ± 0.74
TyrVal: 2.167 ± 0.714
TyrTrp: 1.083 ± 0.712
TyrTyr: 1.625 ± 1.071
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1847 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
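One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread across 100 replicates is taken as the error. This is an assumption about the procedure, not a description of the database's actual code; the function names, the fixed random seed, and the use of the sample standard deviation as the error measure are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins):
    """Dipeptide frequencies (per mille) from overlapping pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(per_mille(sample).get(pair, 0.0))
    return statistics.stdev(values)


# Toy proteome (hypothetical sequences in one-letter code).
toy = ["MKKA", "AKKL", "GGGG"]
kk_error = bootstrap_error(toy, "KK")
```

With only a handful of proteins, as in this proteome, each resample can differ considerably from the original set, which would be consistent with the relatively large errors in the table.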

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski