Amino acid dipepetide frequency for Canine oral papillomavirus (strain Y62) (COPV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.396AlaAla: 3.396 ± 1.07
2.122AlaCys: 2.122 ± 0.81
5.518AlaAsp: 5.518 ± 1.574
4.244AlaGlu: 4.244 ± 1.191
3.396AlaPhe: 3.396 ± 0.713
4.244AlaGly: 4.244 ± 1.006
0.424AlaHis: 0.424 ± 0.397
1.698AlaIle: 1.698 ± 0.654
1.273AlaLys: 1.273 ± 0.964
4.669AlaLeu: 4.669 ± 0.998
0.424AlaMet: 0.424 ± 0.372
1.698AlaAsn: 1.698 ± 0.732
2.122AlaPro: 2.122 ± 1.006
1.273AlaGln: 1.273 ± 0.36
2.971AlaArg: 2.971 ± 0.621
4.244AlaSer: 4.244 ± 1.424
2.971AlaThr: 2.971 ± 1.184
3.396AlaVal: 3.396 ± 1.57
0.424AlaTrp: 0.424 ± 0.372
2.122AlaTyr: 2.122 ± 0.827
0.0AlaXaa: 0.0 ± 0.0
Cys
1.698CysAla: 1.698 ± 1.239
0.424CysCys: 0.424 ± 0.485
0.424CysAsp: 0.424 ± 0.397
0.849CysGlu: 0.849 ± 0.629
0.849CysPhe: 0.849 ± 0.596
1.698CysGly: 1.698 ± 1.442
0.0CysHis: 0.0 ± 0.0
0.424CysIle: 0.424 ± 0.397
3.396CysLys: 3.396 ± 0.983
2.547CysLeu: 2.547 ± 1.413
0.849CysMet: 0.849 ± 0.527
0.0CysAsn: 0.0 ± 0.0
1.698CysPro: 1.698 ± 0.608
0.424CysGln: 0.424 ± 0.485
1.698CysArg: 1.698 ± 0.976
2.971CysSer: 2.971 ± 1.401
1.273CysThr: 1.273 ± 0.661
0.424CysVal: 0.424 ± 0.644
0.424CysTrp: 0.424 ± 0.314
0.424CysTyr: 0.424 ± 0.644
0.0CysXaa: 0.0 ± 0.0
Asp
1.698AspAla: 1.698 ± 0.814
1.273AspCys: 1.273 ± 0.943
2.971AspAsp: 2.971 ± 1.485
5.942AspGlu: 5.942 ± 1.389
4.669AspPhe: 4.669 ± 0.932
3.396AspGly: 3.396 ± 1.391
1.698AspHis: 1.698 ± 0.651
3.396AspIle: 3.396 ± 1.24
3.396AspLys: 3.396 ± 1.128
6.791AspLeu: 6.791 ± 1.767
0.424AspMet: 0.424 ± 0.372
2.547AspAsn: 2.547 ± 0.802
5.518AspPro: 5.518 ± 1.422
2.122AspGln: 2.122 ± 0.786
2.547AspArg: 2.547 ± 0.868
4.669AspSer: 4.669 ± 0.945
2.122AspThr: 2.122 ± 0.812
2.971AspVal: 2.971 ± 1.594
1.698AspTrp: 1.698 ± 0.742
0.849AspTyr: 0.849 ± 0.432
0.0AspXaa: 0.0 ± 0.0
Glu
4.244GluAla: 4.244 ± 1.367
0.424GluCys: 0.424 ± 0.314
7.64GluAsp: 7.64 ± 2.227
8.913GluGlu: 8.913 ± 4.436
1.273GluPhe: 1.273 ± 0.663
2.971GluGly: 2.971 ± 0.722
0.849GluHis: 0.849 ± 0.432
2.971GluIle: 2.971 ± 0.833
2.971GluLys: 2.971 ± 0.952
5.093GluLeu: 5.093 ± 1.66
0.849GluMet: 0.849 ± 0.629
3.396GluAsn: 3.396 ± 0.918
3.82GluPro: 3.82 ± 1.891
4.669GluGln: 4.669 ± 1.319
4.244GluArg: 4.244 ± 1.142
3.396GluSer: 3.396 ± 2.019
3.82GluThr: 3.82 ± 1.561
2.971GluVal: 2.971 ± 0.928
0.0GluTrp: 0.0 ± 0.0
1.273GluTyr: 1.273 ± 0.668
0.0GluXaa: 0.0 ± 0.0
Phe
1.698PheAla: 1.698 ± 0.732
2.547PheCys: 2.547 ± 1.747
2.971PheAsp: 2.971 ± 0.572
2.547PheGlu: 2.547 ± 1.683
3.396PhePhe: 3.396 ± 0.818
2.122PheGly: 2.122 ± 1.095
0.849PheHis: 0.849 ± 0.629
1.273PheIle: 1.273 ± 0.687
3.82PheLys: 3.82 ± 1.595
5.093PheLeu: 5.093 ± 1.718
1.273PheMet: 1.273 ± 0.557
0.424PheAsn: 0.424 ± 0.372
2.971PhePro: 2.971 ± 0.85
1.698PheGln: 1.698 ± 0.656
0.849PheArg: 0.849 ± 0.469
2.971PheSer: 2.971 ± 1.189
2.122PheThr: 2.122 ± 0.846
1.698PheVal: 1.698 ± 0.52
2.547PheTrp: 2.547 ± 0.76
1.698PheTyr: 1.698 ± 1.072
0.0PheXaa: 0.0 ± 0.0
Gly
2.971GlyAla: 2.971 ± 0.993
1.273GlyCys: 1.273 ± 1.021
3.82GlyAsp: 3.82 ± 1.012
3.82GlyGlu: 3.82 ± 1.171
2.971GlyPhe: 2.971 ± 0.903
6.791GlyGly: 6.791 ± 1.999
2.971GlyHis: 2.971 ± 1.191
2.971GlyIle: 2.971 ± 0.645
5.093GlyLys: 5.093 ± 1.392
5.942GlyLeu: 5.942 ± 0.799
0.849GlyMet: 0.849 ± 0.705
2.547GlyAsn: 2.547 ± 0.645
4.244GlyPro: 4.244 ± 1.498
3.396GlyGln: 3.396 ± 1.87
5.942GlyArg: 5.942 ± 2.251
4.669GlySer: 4.669 ± 1.582
5.518GlyThr: 5.518 ± 1.217
3.396GlyVal: 3.396 ± 1.543
0.424GlyTrp: 0.424 ± 0.314
0.424GlyTyr: 0.424 ± 0.346
0.0GlyXaa: 0.0 ± 0.0
His
0.849HisAla: 0.849 ± 0.432
0.424HisCys: 0.424 ± 0.314
0.424HisAsp: 0.424 ± 0.397
0.0HisGlu: 0.0 ± 0.0
1.698HisPhe: 1.698 ± 0.814
2.971HisGly: 2.971 ± 1.026
0.424HisHis: 0.424 ± 0.314
1.698HisIle: 1.698 ± 0.604
1.273HisLys: 1.273 ± 0.785
2.122HisLeu: 2.122 ± 0.545
0.0HisMet: 0.0 ± 0.0
0.424HisAsn: 0.424 ± 0.568
1.698HisPro: 1.698 ± 0.864
1.273HisGln: 1.273 ± 0.45
1.698HisArg: 1.698 ± 0.758
1.698HisSer: 1.698 ± 0.744
0.849HisThr: 0.849 ± 0.744
2.547HisVal: 2.547 ± 0.72
0.849HisTrp: 0.849 ± 0.469
0.424HisTyr: 0.424 ± 0.372
0.0HisXaa: 0.0 ± 0.0
Ile
2.122IleAla: 2.122 ± 0.78
0.0IleCys: 0.0 ± 0.0
3.82IleAsp: 3.82 ± 0.788
4.244IleGlu: 4.244 ± 0.682
0.849IlePhe: 0.849 ± 0.432
3.82IleGly: 3.82 ± 1.656
0.849IleHis: 0.849 ± 0.432
2.122IleIle: 2.122 ± 1.138
1.273IleLys: 1.273 ± 1.039
3.396IleLeu: 3.396 ± 1.547
0.849IleMet: 0.849 ± 0.469
1.273IleAsn: 1.273 ± 0.908
4.244IlePro: 4.244 ± 2.166
1.273IleGln: 1.273 ± 0.36
1.273IleArg: 1.273 ± 0.713
4.244IleSer: 4.244 ± 1.699
3.396IleThr: 3.396 ± 1.615
2.547IleVal: 2.547 ± 1.048
0.424IleTrp: 0.424 ± 0.485
1.273IleTyr: 1.273 ± 0.451
0.0IleXaa: 0.0 ± 0.0
Lys
3.396LysAla: 3.396 ± 0.897
1.698LysCys: 1.698 ± 0.545
1.698LysAsp: 1.698 ± 0.732
2.547LysGlu: 2.547 ± 0.906
3.396LysPhe: 3.396 ± 1.241
3.82LysGly: 3.82 ± 1.53
2.971LysHis: 2.971 ± 1.252
0.849LysIle: 0.849 ± 0.693
3.396LysLys: 3.396 ± 1.451
3.82LysLeu: 3.82 ± 1.309
2.122LysMet: 2.122 ± 0.901
1.698LysAsn: 1.698 ± 0.828
1.698LysPro: 1.698 ± 0.864
2.122LysGln: 2.122 ± 0.605
4.669LysArg: 4.669 ± 1.346
3.396LysSer: 3.396 ± 0.89
3.396LysThr: 3.396 ± 1.2
3.396LysVal: 3.396 ± 1.217
0.849LysTrp: 0.849 ± 0.64
2.971LysTyr: 2.971 ± 0.963
0.0LysXaa: 0.0 ± 0.0
Leu
6.791LeuAla: 6.791 ± 1.063
3.82LeuCys: 3.82 ± 1.889
6.791LeuAsp: 6.791 ± 1.529
2.547LeuGlu: 2.547 ± 0.969
5.942LeuPhe: 5.942 ± 2.159
5.518LeuGly: 5.518 ± 1.877
2.547LeuHis: 2.547 ± 0.75
3.396LeuIle: 3.396 ± 1.391
3.82LeuLys: 3.82 ± 0.817
10.187LeuLeu: 10.187 ± 2.647
0.424LeuMet: 0.424 ± 0.353
2.122LeuAsn: 2.122 ± 0.396
5.093LeuPro: 5.093 ± 0.749
5.942LeuGln: 5.942 ± 1.377
5.518LeuArg: 5.518 ± 1.359
9.338LeuSer: 9.338 ± 3.717
6.367LeuThr: 6.367 ± 1.03
4.669LeuVal: 4.669 ± 0.587
0.424LeuTrp: 0.424 ± 0.397
3.82LeuTyr: 3.82 ± 1.619
0.0LeuXaa: 0.0 ± 0.0
Met
1.698MetAla: 1.698 ± 0.608
0.424MetCys: 0.424 ± 0.314
1.273MetAsp: 1.273 ± 0.747
1.273MetGlu: 1.273 ± 0.596
1.273MetPhe: 1.273 ± 0.542
1.273MetGly: 1.273 ± 0.572
0.424MetHis: 0.424 ± 0.397
0.849MetIle: 0.849 ± 0.7
0.424MetLys: 0.424 ± 0.314
0.849MetLeu: 0.849 ± 0.447
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
0.424MetGln: 0.424 ± 0.397
0.849MetArg: 0.849 ± 0.64
1.698MetSer: 1.698 ± 0.656
0.424MetThr: 0.424 ± 0.372
0.849MetVal: 0.849 ± 0.366
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.122AsnAla: 2.122 ± 1.26
0.424AsnCys: 0.424 ± 0.314
2.122AsnAsp: 2.122 ± 0.545
1.698AsnGlu: 1.698 ± 0.301
0.424AsnPhe: 0.424 ± 0.372
1.273AsnGly: 1.273 ± 0.572
0.0AsnHis: 0.0 ± 0.0
4.244AsnIle: 4.244 ± 1.921
2.122AsnLys: 2.122 ± 0.758
0.849AsnLeu: 0.849 ± 0.744
0.424AsnMet: 0.424 ± 0.372
2.547AsnAsn: 2.547 ± 1.323
2.122AsnPro: 2.122 ± 0.656
1.698AsnGln: 1.698 ± 0.629
0.849AsnArg: 0.849 ± 0.432
3.396AsnSer: 3.396 ± 0.582
2.971AsnThr: 2.971 ± 0.837
4.244AsnVal: 4.244 ± 1.105
0.0AsnTrp: 0.0 ± 0.0
1.698AsnTyr: 1.698 ± 1.319
0.0AsnXaa: 0.0 ± 0.0
Pro
5.942ProAla: 5.942 ± 2.656
0.424ProCys: 0.424 ± 0.314
2.122ProAsp: 2.122 ± 0.907
4.669ProGlu: 4.669 ± 1.786
1.698ProPhe: 1.698 ± 0.864
3.396ProGly: 3.396 ± 0.98
0.424ProHis: 0.424 ± 0.346
2.971ProIle: 2.971 ± 1.359
5.518ProLys: 5.518 ± 1.203
5.942ProLeu: 5.942 ± 1.469
0.424ProMet: 0.424 ± 0.432
2.547ProAsn: 2.547 ± 1.303
10.611ProPro: 10.611 ± 3.868
4.669ProGln: 4.669 ± 1.213
3.396ProArg: 3.396 ± 0.983
6.367ProSer: 6.367 ± 1.35
3.396ProThr: 3.396 ± 0.975
4.244ProVal: 4.244 ± 2.102
0.424ProTrp: 0.424 ± 0.397
0.849ProTyr: 0.849 ± 0.744
0.0ProXaa: 0.0 ± 0.0
Gln
1.273GlnAla: 1.273 ± 0.788
1.698GlnCys: 1.698 ± 1.459
2.122GlnAsp: 2.122 ± 0.511
2.122GlnGlu: 2.122 ± 0.953
0.424GlnPhe: 0.424 ± 0.568
2.122GlnGly: 2.122 ± 0.638
0.424GlnHis: 0.424 ± 0.372
0.424GlnIle: 0.424 ± 0.397
0.424GlnLys: 0.424 ± 0.644
6.791GlnLeu: 6.791 ± 1.984
0.849GlnMet: 0.849 ± 0.366
1.698GlnAsn: 1.698 ± 0.577
5.093GlnPro: 5.093 ± 1.123
4.669GlnGln: 4.669 ± 1.064
2.122GlnArg: 2.122 ± 0.418
3.82GlnSer: 3.82 ± 1.688
2.547GlnThr: 2.547 ± 0.989
3.82GlnVal: 3.82 ± 1.074
0.849GlnTrp: 0.849 ± 0.629
1.273GlnTyr: 1.273 ± 0.573
0.0GlnXaa: 0.0 ± 0.0
Arg
1.273ArgAla: 1.273 ± 0.687
1.698ArgCys: 1.698 ± 0.656
2.122ArgAsp: 2.122 ± 0.796
2.971ArgGlu: 2.971 ± 0.828
2.971ArgPhe: 2.971 ± 0.949
7.64ArgGly: 7.64 ± 2.29
2.971ArgHis: 2.971 ± 0.671
0.849ArgIle: 0.849 ± 0.718
4.669ArgLys: 4.669 ± 1.049
5.518ArgLeu: 5.518 ± 1.561
0.424ArgMet: 0.424 ± 0.397
1.698ArgAsn: 1.698 ± 0.742
5.093ArgPro: 5.093 ± 1.128
0.849ArgGln: 0.849 ± 0.407
5.518ArgArg: 5.518 ± 2.076
5.942ArgSer: 5.942 ± 1.063
2.122ArgThr: 2.122 ± 0.505
3.82ArgVal: 3.82 ± 0.72
0.424ArgTrp: 0.424 ± 0.372
3.396ArgTyr: 3.396 ± 0.969
0.0ArgXaa: 0.0 ± 0.0
Ser
4.244SerAla: 4.244 ± 1.585
0.0SerCys: 0.0 ± 0.0
4.244SerAsp: 4.244 ± 1.493
7.216SerGlu: 7.216 ± 0.916
3.396SerPhe: 3.396 ± 1.659
7.64SerGly: 7.64 ± 2.145
2.122SerHis: 2.122 ± 0.396
3.396SerIle: 3.396 ± 1.157
2.547SerLys: 2.547 ± 1.341
11.46SerLeu: 11.46 ± 2.634
1.698SerMet: 1.698 ± 0.976
2.971SerAsn: 2.971 ± 1.422
4.244SerPro: 4.244 ± 0.996
2.122SerGln: 2.122 ± 0.829
6.367SerArg: 6.367 ± 1.196
8.489SerSer: 8.489 ± 2.292
3.396SerThr: 3.396 ± 1.448
5.093SerVal: 5.093 ± 1.266
1.273SerTrp: 1.273 ± 0.663
2.122SerTyr: 2.122 ± 0.511
0.0SerXaa: 0.0 ± 0.0
Thr
2.547ThrAla: 2.547 ± 0.94
1.273ThrCys: 1.273 ± 0.667
4.244ThrAsp: 4.244 ± 1.36
3.396ThrGlu: 3.396 ± 1.452
2.971ThrPhe: 2.971 ± 0.787
3.396ThrGly: 3.396 ± 1.129
0.849ThrHis: 0.849 ± 0.629
2.971ThrIle: 2.971 ± 0.962
2.122ThrLys: 2.122 ± 0.891
4.669ThrLeu: 4.669 ± 1.299
0.849ThrMet: 0.849 ± 0.366
3.396ThrAsn: 3.396 ± 1.834
4.244ThrPro: 4.244 ± 1.178
2.122ThrGln: 2.122 ± 0.656
4.669ThrArg: 4.669 ± 1.157
4.669ThrSer: 4.669 ± 1.813
4.669ThrThr: 4.669 ± 1.12
1.273ThrVal: 1.273 ± 0.728
0.849ThrTrp: 0.849 ± 0.793
1.698ThrTyr: 1.698 ± 1.017
0.0ThrXaa: 0.0 ± 0.0
Val
1.698ValAla: 1.698 ± 0.667
1.698ValCys: 1.698 ± 0.976
4.669ValAsp: 4.669 ± 1.051
2.971ValGlu: 2.971 ± 0.923
2.547ValPhe: 2.547 ± 0.82
2.971ValGly: 2.971 ± 1.594
0.849ValHis: 0.849 ± 0.432
3.396ValIle: 3.396 ± 1.925
2.971ValLys: 2.971 ± 1.366
4.244ValLeu: 4.244 ± 1.574
0.424ValMet: 0.424 ± 0.303
2.547ValAsn: 2.547 ± 0.645
4.244ValPro: 4.244 ± 1.842
2.971ValGln: 2.971 ± 0.876
4.244ValArg: 4.244 ± 1.299
5.518ValSer: 5.518 ± 1.688
4.244ValThr: 4.244 ± 1.296
2.547ValVal: 2.547 ± 1.132
0.849ValTrp: 0.849 ± 0.744
1.273ValTyr: 1.273 ± 0.713
0.0ValXaa: 0.0 ± 0.0
Trp
0.849TrpAla: 0.849 ± 0.629
0.424TrpCys: 0.424 ± 0.372
0.424TrpAsp: 0.424 ± 0.568
1.273TrpGlu: 1.273 ± 0.668
0.0TrpPhe: 0.0 ± 0.0
1.698TrpGly: 1.698 ± 0.766
0.424TrpHis: 0.424 ± 0.372
1.273TrpIle: 1.273 ± 0.572
1.273TrpLys: 1.273 ± 0.663
2.122TrpLeu: 2.122 ± 0.418
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.424TrpArg: 0.424 ± 0.485
1.273TrpSer: 1.273 ± 0.785
0.849TrpThr: 0.849 ± 0.793
0.849TrpVal: 0.849 ± 0.629
0.0TrpTrp: 0.0 ± 0.0
0.424TrpTyr: 0.424 ± 0.314
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.122TyrAla: 2.122 ± 0.812
0.424TyrCys: 0.424 ± 0.485
1.273TyrAsp: 1.273 ± 0.45
2.547TyrGlu: 2.547 ± 1.198
0.849TyrPhe: 0.849 ± 0.366
1.273TyrGly: 1.273 ± 0.451
1.273TyrHis: 1.273 ± 0.36
2.122TyrIle: 2.122 ± 0.511
2.122TyrLys: 2.122 ± 0.908
2.971TyrLeu: 2.971 ± 0.758
0.424TyrMet: 0.424 ± 0.397
1.698TyrAsn: 1.698 ± 0.604
1.273TyrPro: 1.273 ± 0.621
0.849TyrGln: 0.849 ± 0.461
2.122TyrArg: 2.122 ± 1.159
1.273TyrSer: 1.273 ± 0.45
0.424TyrThr: 0.424 ± 0.397
2.122TyrVal: 2.122 ± 0.812
0.849TyrTrp: 0.849 ± 0.366
2.971TyrTyr: 2.971 ± 1.299
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2357 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski