Amino acid dipepetide frequency for Canis familiaris papillomavirus 8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.465AlaAla: 7.465 ± 1.834
0.0AlaCys: 0.0 ± 0.0
5.599AlaAsp: 5.599 ± 1.144
2.986AlaGlu: 2.986 ± 1.148
4.106AlaPhe: 4.106 ± 0.986
4.106AlaGly: 4.106 ± 1.727
1.12AlaHis: 1.12 ± 0.684
2.24AlaIle: 2.24 ± 0.889
2.986AlaLys: 2.986 ± 0.982
6.346AlaLeu: 6.346 ± 2.269
1.12AlaMet: 1.12 ± 0.587
1.866AlaAsn: 1.866 ± 0.72
4.479AlaPro: 4.479 ± 1.098
3.359AlaGln: 3.359 ± 0.87
5.599AlaArg: 5.599 ± 1.354
2.613AlaSer: 2.613 ± 1.003
5.599AlaThr: 5.599 ± 1.155
2.24AlaVal: 2.24 ± 0.722
0.373AlaTrp: 0.373 ± 0.319
1.12AlaTyr: 1.12 ± 0.636
0.0AlaXaa: 0.0 ± 0.0
Cys
1.12CysAla: 1.12 ± 0.762
1.493CysCys: 1.493 ± 1.007
1.493CysAsp: 1.493 ± 0.748
1.12CysGlu: 1.12 ± 0.784
0.373CysPhe: 0.373 ± 0.309
2.24CysGly: 2.24 ± 0.866
0.747CysHis: 0.747 ± 0.404
2.24CysIle: 2.24 ± 0.972
0.747CysLys: 0.747 ± 0.394
0.747CysLeu: 0.747 ± 0.707
0.747CysMet: 0.747 ± 0.513
0.373CysAsn: 0.373 ± 0.319
2.613CysPro: 2.613 ± 1.285
0.0CysGln: 0.0 ± 0.0
1.493CysArg: 1.493 ± 0.899
1.493CysSer: 1.493 ± 0.598
1.12CysThr: 1.12 ± 0.652
0.373CysVal: 0.373 ± 0.444
0.747CysTrp: 0.747 ± 0.404
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.972AspAla: 5.972 ± 0.901
1.493AspCys: 1.493 ± 0.538
3.359AspAsp: 3.359 ± 1.24
3.733AspGlu: 3.733 ± 1.183
1.866AspPhe: 1.866 ± 0.409
5.599AspGly: 5.599 ± 0.97
0.747AspHis: 0.747 ± 0.412
4.853AspIle: 4.853 ± 1.522
0.747AspLys: 0.747 ± 0.366
5.972AspLeu: 5.972 ± 1.188
1.12AspMet: 1.12 ± 0.606
2.613AspAsn: 2.613 ± 0.735
5.226AspPro: 5.226 ± 0.915
1.866AspGln: 1.866 ± 0.908
1.493AspArg: 1.493 ± 0.608
5.599AspSer: 5.599 ± 0.889
2.986AspThr: 2.986 ± 0.968
3.359AspVal: 3.359 ± 1.79
0.747AspTrp: 0.747 ± 0.638
2.613AspTyr: 2.613 ± 1.275
0.0AspXaa: 0.0 ± 0.0
Glu
4.106GluAla: 4.106 ± 1.251
0.747GluCys: 0.747 ± 0.513
4.479GluAsp: 4.479 ± 1.152
7.839GluGlu: 7.839 ± 4.142
2.986GluPhe: 2.986 ± 0.821
5.972GluGly: 5.972 ± 1.939
1.866GluHis: 1.866 ± 0.962
0.747GluIle: 0.747 ± 0.373
0.747GluLys: 0.747 ± 0.542
4.479GluLeu: 4.479 ± 1.471
0.373GluMet: 0.373 ± 0.309
1.866GluAsn: 1.866 ± 0.445
3.733GluPro: 3.733 ± 1.234
2.24GluGln: 2.24 ± 1.042
4.479GluArg: 4.479 ± 1.674
4.853GluSer: 4.853 ± 1.38
5.226GluThr: 5.226 ± 0.788
4.106GluVal: 4.106 ± 2.226
0.373GluTrp: 0.373 ± 0.319
0.747GluTyr: 0.747 ± 0.404
0.0GluXaa: 0.0 ± 0.0
Phe
2.613PheAla: 2.613 ± 1.061
0.747PheCys: 0.747 ± 0.513
4.106PheAsp: 4.106 ± 1.542
2.24PheGlu: 2.24 ± 0.633
2.24PhePhe: 2.24 ± 0.602
4.853PheGly: 4.853 ± 1.799
0.747PheHis: 0.747 ± 0.534
1.493PheIle: 1.493 ± 0.745
2.613PheLys: 2.613 ± 2.232
3.359PheLeu: 3.359 ± 1.064
0.0PheMet: 0.0 ± 0.0
1.866PheAsn: 1.866 ± 0.986
2.24PhePro: 2.24 ± 0.689
1.12PheGln: 1.12 ± 0.635
2.24PheArg: 2.24 ± 0.722
1.12PheSer: 1.12 ± 0.534
2.986PheThr: 2.986 ± 1.434
1.12PheVal: 1.12 ± 0.587
1.493PheTrp: 1.493 ± 0.403
0.747PheTyr: 0.747 ± 0.669
0.0PheXaa: 0.0 ± 0.0
Gly
3.733GlyAla: 3.733 ± 0.963
1.493GlyCys: 1.493 ± 0.539
5.599GlyAsp: 5.599 ± 1.672
6.719GlyGlu: 6.719 ± 1.252
2.986GlyPhe: 2.986 ± 1.161
11.571GlyGly: 11.571 ± 2.106
2.24GlyHis: 2.24 ± 0.598
2.986GlyIle: 2.986 ± 0.509
1.493GlyLys: 1.493 ± 0.729
3.359GlyLeu: 3.359 ± 0.854
0.747GlyMet: 0.747 ± 0.541
1.866GlyAsn: 1.866 ± 1.223
7.092GlyPro: 7.092 ± 2.402
1.493GlyGln: 1.493 ± 0.608
8.212GlyArg: 8.212 ± 1.921
5.226GlySer: 5.226 ± 1.097
4.479GlyThr: 4.479 ± 1.482
5.599GlyVal: 5.599 ± 1.469
0.747GlyTrp: 0.747 ± 0.373
1.493GlyTyr: 1.493 ± 0.687
0.0GlyXaa: 0.0 ± 0.0
His
0.747HisAla: 0.747 ± 0.404
1.12HisCys: 1.12 ± 1.124
0.747HisAsp: 0.747 ± 0.366
1.493HisGlu: 1.493 ± 0.701
1.12HisPhe: 1.12 ± 0.317
1.866HisGly: 1.866 ± 0.932
0.373HisHis: 0.373 ± 0.319
0.373HisIle: 0.373 ± 0.444
0.747HisLys: 0.747 ± 0.41
1.493HisLeu: 1.493 ± 0.601
0.373HisMet: 0.373 ± 0.319
1.12HisAsn: 1.12 ± 0.366
2.24HisPro: 2.24 ± 1.068
0.747HisGln: 0.747 ± 0.41
2.24HisArg: 2.24 ± 0.969
1.12HisSer: 1.12 ± 0.652
1.493HisThr: 1.493 ± 0.62
1.493HisVal: 1.493 ± 0.499
1.12HisTrp: 1.12 ± 0.481
1.12HisTyr: 1.12 ± 0.927
0.0HisXaa: 0.0 ± 0.0
Ile
1.866IleAla: 1.866 ± 0.852
1.493IleCys: 1.493 ± 0.894
1.12IleAsp: 1.12 ± 0.653
2.986IleGlu: 2.986 ± 0.767
1.12IlePhe: 1.12 ± 0.587
1.866IleGly: 1.866 ± 1.179
1.866IleHis: 1.866 ± 0.412
1.866IleIle: 1.866 ± 0.502
1.493IleLys: 1.493 ± 0.763
3.359IleLeu: 3.359 ± 1.105
1.12IleMet: 1.12 ± 0.495
1.12IleAsn: 1.12 ± 0.556
2.986IlePro: 2.986 ± 1.159
1.12IleGln: 1.12 ± 0.704
0.747IleArg: 0.747 ± 0.703
4.106IleSer: 4.106 ± 1.199
2.986IleThr: 2.986 ± 1.745
2.986IleVal: 2.986 ± 0.85
0.373IleTrp: 0.373 ± 0.309
0.373IleTyr: 0.373 ± 0.309
0.0IleXaa: 0.0 ± 0.0
Lys
1.493LysAla: 1.493 ± 0.499
1.866LysCys: 1.866 ± 0.872
1.493LysAsp: 1.493 ± 0.662
2.24LysGlu: 2.24 ± 0.911
2.613LysPhe: 2.613 ± 1.058
2.613LysGly: 2.613 ± 0.981
1.493LysHis: 1.493 ± 0.624
1.12LysIle: 1.12 ± 0.648
1.493LysLys: 1.493 ± 0.808
2.613LysLeu: 2.613 ± 0.949
0.747LysMet: 0.747 ± 0.608
0.373LysAsn: 0.373 ± 0.338
1.493LysPro: 1.493 ± 0.608
1.866LysGln: 1.866 ± 0.607
4.106LysArg: 4.106 ± 0.897
3.359LysSer: 3.359 ± 1.598
2.986LysThr: 2.986 ± 1.066
2.986LysVal: 2.986 ± 1.01
0.0LysTrp: 0.0 ± 0.0
1.12LysTyr: 1.12 ± 0.636
0.0LysXaa: 0.0 ± 0.0
Leu
5.226LeuAla: 5.226 ± 1.376
2.613LeuCys: 2.613 ± 1.006
4.106LeuAsp: 4.106 ± 1.197
2.613LeuGlu: 2.613 ± 1.254
3.359LeuPhe: 3.359 ± 1.271
7.092LeuGly: 7.092 ± 1.101
1.866LeuHis: 1.866 ± 0.767
1.866LeuIle: 1.866 ± 0.775
3.359LeuLys: 3.359 ± 1.445
9.332LeuLeu: 9.332 ± 1.755
2.24LeuMet: 2.24 ± 1.185
3.359LeuAsn: 3.359 ± 0.905
4.479LeuPro: 4.479 ± 1.41
5.972LeuGln: 5.972 ± 1.036
4.853LeuArg: 4.853 ± 1.553
6.719LeuSer: 6.719 ± 0.859
4.853LeuThr: 4.853 ± 1.695
5.226LeuVal: 5.226 ± 1.022
1.12LeuTrp: 1.12 ± 0.398
2.613LeuTyr: 2.613 ± 0.928
0.0LeuXaa: 0.0 ± 0.0
Met
0.747MetAla: 0.747 ± 0.373
0.0MetCys: 0.0 ± 0.0
0.747MetAsp: 0.747 ± 0.394
1.493MetGlu: 1.493 ± 0.539
1.493MetPhe: 1.493 ± 0.745
0.747MetGly: 0.747 ± 0.373
0.373MetHis: 0.373 ± 0.309
0.747MetIle: 0.747 ± 0.619
0.747MetLys: 0.747 ± 0.638
1.493MetLeu: 1.493 ± 0.729
0.0MetMet: 0.0 ± 0.0
0.373MetAsn: 0.373 ± 0.338
0.373MetPro: 0.373 ± 0.319
1.493MetGln: 1.493 ± 0.62
1.12MetArg: 1.12 ± 0.422
1.866MetSer: 1.866 ± 0.707
0.373MetThr: 0.373 ± 0.338
1.493MetVal: 1.493 ± 0.888
0.373MetTrp: 0.373 ± 0.444
0.373MetTyr: 0.373 ± 0.319
0.0MetXaa: 0.0 ± 0.0
Asn
2.613AsnAla: 2.613 ± 0.924
0.373AsnCys: 0.373 ± 0.319
1.12AsnAsp: 1.12 ± 0.441
1.866AsnGlu: 1.866 ± 0.445
0.373AsnPhe: 0.373 ± 0.319
0.747AsnGly: 0.747 ± 0.373
1.493AsnHis: 1.493 ± 0.961
1.493AsnIle: 1.493 ± 0.722
1.12AsnLys: 1.12 ± 1.013
1.493AsnLeu: 1.493 ± 0.608
0.373AsnMet: 0.373 ± 0.338
1.493AsnAsn: 1.493 ± 0.403
3.733AsnPro: 3.733 ± 1.797
1.12AsnGln: 1.12 ± 0.819
2.24AsnArg: 2.24 ± 1.169
2.986AsnSer: 2.986 ± 1.196
4.479AsnThr: 4.479 ± 1.585
1.866AsnVal: 1.866 ± 0.445
0.0AsnTrp: 0.0 ± 0.0
1.12AsnTyr: 1.12 ± 0.635
0.0AsnXaa: 0.0 ± 0.0
Pro
7.092ProAla: 7.092 ± 2.614
2.24ProCys: 2.24 ± 1.264
3.733ProAsp: 3.733 ± 1.478
5.972ProGlu: 5.972 ± 0.778
2.24ProPhe: 2.24 ± 1.311
2.986ProGly: 2.986 ± 0.659
1.493ProHis: 1.493 ± 1.036
2.613ProIle: 2.613 ± 0.815
4.106ProLys: 4.106 ± 1.037
7.465ProLeu: 7.465 ± 1.429
0.0ProMet: 0.0 ± 0.0
2.986ProAsn: 2.986 ± 1.543
13.811ProPro: 13.811 ± 3.395
2.986ProGln: 2.986 ± 0.998
5.972ProArg: 5.972 ± 2.121
7.092ProSer: 7.092 ± 2.278
4.479ProThr: 4.479 ± 1.216
7.092ProVal: 7.092 ± 2.471
0.373ProTrp: 0.373 ± 0.309
1.12ProTyr: 1.12 ± 0.826
0.0ProXaa: 0.0 ± 0.0
Gln
1.12GlnAla: 1.12 ± 0.721
0.0GlnCys: 0.0 ± 0.0
1.493GlnAsp: 1.493 ± 0.551
2.613GlnGlu: 2.613 ± 0.968
2.24GlnPhe: 2.24 ± 0.681
4.106GlnGly: 4.106 ± 0.975
1.493GlnHis: 1.493 ± 0.604
0.747GlnIle: 0.747 ± 0.366
1.493GlnLys: 1.493 ± 0.598
4.106GlnLeu: 4.106 ± 1.061
0.747GlnMet: 0.747 ± 0.404
0.747GlnAsn: 0.747 ± 0.675
2.986GlnPro: 2.986 ± 0.572
2.986GlnGln: 2.986 ± 1.326
4.106GlnArg: 4.106 ± 1.735
4.106GlnSer: 4.106 ± 1.667
2.986GlnThr: 2.986 ± 0.914
2.986GlnVal: 2.986 ± 0.767
1.493GlnTrp: 1.493 ± 0.408
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
5.972ArgAla: 5.972 ± 1.237
1.866ArgCys: 1.866 ± 1.359
3.359ArgAsp: 3.359 ± 1.109
2.613ArgGlu: 2.613 ± 0.804
3.359ArgPhe: 3.359 ± 0.996
6.719ArgGly: 6.719 ± 1.506
1.12ArgHis: 1.12 ± 0.677
2.24ArgIle: 2.24 ± 0.983
4.106ArgLys: 4.106 ± 1.017
8.959ArgLeu: 8.959 ± 1.826
0.747ArgMet: 0.747 ± 0.638
1.12ArgAsn: 1.12 ± 0.6
6.719ArgPro: 6.719 ± 2.939
2.613ArgGln: 2.613 ± 1.422
9.332ArgArg: 9.332 ± 3.314
5.599ArgSer: 5.599 ± 1.863
2.613ArgThr: 2.613 ± 1.652
3.359ArgVal: 3.359 ± 0.487
1.866ArgTrp: 1.866 ± 0.798
1.866ArgTyr: 1.866 ± 0.833
0.0ArgXaa: 0.0 ± 0.0
Ser
5.226SerAla: 5.226 ± 2.384
0.747SerCys: 0.747 ± 0.607
7.465SerAsp: 7.465 ± 1.007
1.493SerGlu: 1.493 ± 0.905
2.613SerPhe: 2.613 ± 1.223
5.226SerGly: 5.226 ± 1.725
1.866SerHis: 1.866 ± 0.436
2.986SerIle: 2.986 ± 1.066
2.24SerLys: 2.24 ± 1.304
7.465SerLeu: 7.465 ± 1.032
3.359SerMet: 3.359 ± 1.367
2.613SerAsn: 2.613 ± 0.74
6.346SerPro: 6.346 ± 2.626
5.972SerGln: 5.972 ± 1.134
4.479SerArg: 4.479 ± 1.594
5.972SerSer: 5.972 ± 1.193
4.106SerThr: 4.106 ± 1.329
4.106SerVal: 4.106 ± 0.831
1.12SerTrp: 1.12 ± 0.624
0.373SerTyr: 0.373 ± 0.338
0.0SerXaa: 0.0 ± 0.0
Thr
3.359ThrAla: 3.359 ± 0.382
1.866ThrCys: 1.866 ± 0.406
3.733ThrAsp: 3.733 ± 0.652
4.853ThrGlu: 4.853 ± 0.718
1.493ThrPhe: 1.493 ± 0.388
4.853ThrGly: 4.853 ± 0.782
1.12ThrHis: 1.12 ± 0.36
2.24ThrIle: 2.24 ± 0.492
1.493ThrLys: 1.493 ± 1.35
3.733ThrLeu: 3.733 ± 0.668
0.747ThrMet: 0.747 ± 0.404
2.613ThrAsn: 2.613 ± 0.757
7.092ThrPro: 7.092 ± 1.784
2.24ThrGln: 2.24 ± 0.491
5.226ThrArg: 5.226 ± 1.255
6.346ThrSer: 6.346 ± 1.681
3.359ThrThr: 3.359 ± 1.626
3.733ThrVal: 3.733 ± 1.497
1.493ThrTrp: 1.493 ± 0.715
1.12ThrTyr: 1.12 ± 1.013
0.0ThrXaa: 0.0 ± 0.0
Val
2.986ValAla: 2.986 ± 0.87
0.373ValCys: 0.373 ± 0.338
4.853ValAsp: 4.853 ± 2.391
4.479ValGlu: 4.479 ± 0.914
1.493ValPhe: 1.493 ± 0.79
3.733ValGly: 3.733 ± 0.858
0.747ValHis: 0.747 ± 0.366
1.493ValIle: 1.493 ± 0.499
3.733ValLys: 3.733 ± 1.206
4.479ValLeu: 4.479 ± 1.21
0.747ValMet: 0.747 ± 0.41
2.24ValAsn: 2.24 ± 0.828
6.719ValPro: 6.719 ± 1.775
1.866ValGln: 1.866 ± 0.695
4.479ValArg: 4.479 ± 1.825
4.106ValSer: 4.106 ± 1.801
2.986ValThr: 2.986 ± 1.034
4.853ValVal: 4.853 ± 1.802
0.747ValTrp: 0.747 ± 0.404
2.986ValTyr: 2.986 ± 0.749
0.0ValXaa: 0.0 ± 0.0
Trp
0.747TrpAla: 0.747 ± 0.41
0.0TrpCys: 0.0 ± 0.0
1.493TrpAsp: 1.493 ± 0.403
1.493TrpGlu: 1.493 ± 0.573
1.12TrpPhe: 1.12 ± 0.422
0.747TrpGly: 0.747 ± 0.394
0.0TrpHis: 0.0 ± 0.0
0.747TrpIle: 0.747 ± 0.638
1.12TrpLys: 1.12 ± 0.54
1.493TrpLeu: 1.493 ± 0.598
0.373TrpMet: 0.373 ± 0.319
0.747TrpAsn: 0.747 ± 0.404
0.0TrpPro: 0.0 ± 0.0
0.747TrpGln: 0.747 ± 0.618
2.613TrpArg: 2.613 ± 1.05
0.373TrpSer: 0.373 ± 0.334
1.12TrpThr: 1.12 ± 0.635
0.373TrpVal: 0.373 ± 0.319
0.0TrpTrp: 0.0 ± 0.0
0.373TrpTyr: 0.373 ± 0.309
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.12TyrAla: 1.12 ± 0.398
0.747TyrCys: 0.747 ± 0.619
1.866TyrAsp: 1.866 ± 0.445
1.12TyrGlu: 1.12 ± 0.36
0.747TyrPhe: 0.747 ± 0.373
1.493TyrGly: 1.493 ± 0.403
0.373TyrHis: 0.373 ± 0.334
1.866TyrIle: 1.866 ± 0.981
1.866TyrLys: 1.866 ± 0.607
1.12TyrLeu: 1.12 ± 0.957
0.373TyrMet: 0.373 ± 0.319
0.747TyrAsn: 0.747 ± 0.41
1.493TyrPro: 1.493 ± 0.462
0.747TyrGln: 0.747 ± 0.37
1.12TyrArg: 1.12 ± 0.627
1.12TyrSer: 1.12 ± 0.594
1.493TyrThr: 1.493 ± 0.223
0.747TyrVal: 0.747 ± 0.404
1.12TyrTrp: 1.12 ± 0.398
0.747TyrTyr: 0.747 ± 0.618
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2680 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski