Amino acid dipepetide frequency for Procyon lotor papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.144AlaAla: 5.144 ± 2.501
0.857AlaCys: 0.857 ± 0.519
4.286AlaAsp: 4.286 ± 1.206
5.144AlaGlu: 5.144 ± 1.569
3.429AlaPhe: 3.429 ± 1.124
4.286AlaGly: 4.286 ± 0.896
0.429AlaHis: 0.429 ± 0.594
3.429AlaIle: 3.429 ± 1.173
3.0AlaLys: 3.0 ± 1.2
4.286AlaLeu: 4.286 ± 1.503
0.429AlaMet: 0.429 ± 0.343
2.143AlaAsn: 2.143 ± 1.328
5.144AlaPro: 5.144 ± 3.256
1.286AlaGln: 1.286 ± 0.682
5.144AlaArg: 5.144 ± 0.804
2.143AlaSer: 2.143 ± 1.226
3.0AlaThr: 3.0 ± 0.768
3.858AlaVal: 3.858 ± 1.21
0.429AlaTrp: 0.429 ± 0.343
1.715AlaTyr: 1.715 ± 0.989
0.0AlaXaa: 0.0 ± 0.0
Cys
1.715CysAla: 1.715 ± 1.03
0.857CysCys: 0.857 ± 0.519
0.857CysAsp: 0.857 ± 0.905
0.857CysGlu: 0.857 ± 0.691
1.715CysPhe: 1.715 ± 0.86
0.857CysGly: 0.857 ± 0.608
0.0CysHis: 0.0 ± 0.0
0.429CysIle: 0.429 ± 0.346
3.0CysLys: 3.0 ± 1.002
0.857CysLeu: 0.857 ± 0.43
0.429CysMet: 0.429 ± 0.346
0.0CysAsn: 0.0 ± 0.0
1.715CysPro: 1.715 ± 0.629
1.286CysGln: 1.286 ± 0.94
1.286CysArg: 1.286 ± 0.94
1.715CysSer: 1.715 ± 1.4
1.286CysThr: 1.286 ± 0.738
3.0CysVal: 3.0 ± 1.831
0.429CysTrp: 0.429 ± 0.346
0.857CysTyr: 0.857 ± 0.628
0.0CysXaa: 0.0 ± 0.0
Asp
2.572AspAla: 2.572 ± 0.928
1.715AspCys: 1.715 ± 1.01
5.144AspAsp: 5.144 ± 0.986
4.715AspGlu: 4.715 ± 1.98
2.572AspPhe: 2.572 ± 0.976
3.858AspGly: 3.858 ± 0.774
0.857AspHis: 0.857 ± 0.406
5.144AspIle: 5.144 ± 1.857
3.858AspLys: 3.858 ± 1.364
8.144AspLeu: 8.144 ± 2.31
0.857AspMet: 0.857 ± 0.628
3.429AspAsn: 3.429 ± 1.575
5.572AspPro: 5.572 ± 1.434
0.857AspGln: 0.857 ± 0.428
1.715AspArg: 1.715 ± 0.677
3.0AspSer: 3.0 ± 1.172
2.143AspThr: 2.143 ± 1.247
3.429AspVal: 3.429 ± 0.857
1.286AspTrp: 1.286 ± 0.687
2.572AspTyr: 2.572 ± 0.725
0.0AspXaa: 0.0 ± 0.0
Glu
5.572GluAla: 5.572 ± 1.789
1.715GluCys: 1.715 ± 1.004
6.429GluAsp: 6.429 ± 0.667
8.573GluGlu: 8.573 ± 3.597
1.286GluPhe: 1.286 ± 0.672
3.0GluGly: 3.0 ± 0.66
1.715GluHis: 1.715 ± 0.849
2.572GluIle: 2.572 ± 0.807
3.0GluLys: 3.0 ± 1.394
6.429GluLeu: 6.429 ± 1.63
0.0GluMet: 0.0 ± 0.466
2.572GluAsn: 2.572 ± 0.725
3.0GluPro: 3.0 ± 1.364
4.286GluGln: 4.286 ± 1.634
4.286GluArg: 4.286 ± 1.27
6.001GluSer: 6.001 ± 2.042
3.0GluThr: 3.0 ± 0.835
5.144GluVal: 5.144 ± 2.127
0.857GluTrp: 0.857 ± 0.691
1.286GluTyr: 1.286 ± 0.403
0.0GluXaa: 0.0 ± 0.0
Phe
2.143PheAla: 2.143 ± 0.748
1.286PheCys: 1.286 ± 0.738
4.286PheAsp: 4.286 ± 0.69
1.286PheGlu: 1.286 ± 1.022
3.0PhePhe: 3.0 ± 1.237
2.143PheGly: 2.143 ± 0.649
1.286PheHis: 1.286 ± 0.579
2.572PheIle: 2.572 ± 1.134
3.429PheLys: 3.429 ± 1.687
4.286PheLeu: 4.286 ± 0.834
0.857PheMet: 0.857 ± 0.413
0.857PheAsn: 0.857 ± 0.685
2.143PhePro: 2.143 ± 0.806
1.286PheGln: 1.286 ± 0.788
2.572PheArg: 2.572 ± 0.549
2.143PheSer: 2.143 ± 0.916
1.715PheThr: 1.715 ± 0.623
3.429PheVal: 3.429 ± 1.08
1.286PheTrp: 1.286 ± 0.687
1.715PheTyr: 1.715 ± 1.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.143GlyAla: 2.143 ± 0.747
1.286GlyCys: 1.286 ± 1.04
4.715GlyAsp: 4.715 ± 1.033
8.573GlyGlu: 8.573 ± 2.364
0.429GlyPhe: 0.429 ± 0.346
3.858GlyGly: 3.858 ± 1.549
2.572GlyHis: 2.572 ± 0.876
1.715GlyIle: 1.715 ± 0.623
2.572GlyLys: 2.572 ± 0.837
4.715GlyLeu: 4.715 ± 0.938
1.286GlyMet: 1.286 ± 0.662
2.572GlyAsn: 2.572 ± 0.863
3.858GlyPro: 3.858 ± 0.931
3.0GlyGln: 3.0 ± 0.865
5.144GlyArg: 5.144 ± 1.529
6.429GlySer: 6.429 ± 2.069
2.572GlyThr: 2.572 ± 0.697
5.572GlyVal: 5.572 ± 1.462
0.429GlyTrp: 0.429 ± 0.594
1.286GlyTyr: 1.286 ± 0.756
0.0GlyXaa: 0.0 ± 0.0
His
0.429HisAla: 0.429 ± 0.343
0.857HisCys: 0.857 ± 0.418
0.429HisAsp: 0.429 ± 0.346
1.715HisGlu: 1.715 ± 0.259
0.857HisPhe: 0.857 ± 0.516
0.429HisGly: 0.429 ± 0.635
0.429HisHis: 0.429 ± 0.346
0.857HisIle: 0.857 ± 0.513
1.286HisLys: 1.286 ± 0.738
2.143HisLeu: 2.143 ± 0.767
0.857HisMet: 0.857 ± 0.723
0.0HisAsn: 0.0 ± 0.0
1.715HisPro: 1.715 ± 0.743
0.857HisGln: 0.857 ± 0.691
1.286HisArg: 1.286 ± 0.774
0.0HisSer: 0.0 ± 0.0
0.857HisThr: 0.857 ± 0.685
2.143HisVal: 2.143 ± 0.423
0.857HisTrp: 0.857 ± 0.428
0.429HisTyr: 0.429 ± 0.432
0.0HisXaa: 0.0 ± 0.0
Ile
2.143IleAla: 2.143 ± 0.923
0.429IleCys: 0.429 ± 0.346
4.286IleAsp: 4.286 ± 1.548
3.429IleGlu: 3.429 ± 0.942
1.715IlePhe: 1.715 ± 0.629
4.715IleGly: 4.715 ± 1.785
0.857IleHis: 0.857 ± 0.418
2.572IleIle: 2.572 ± 1.366
0.857IleLys: 0.857 ± 0.418
2.572IleLeu: 2.572 ± 0.501
0.429IleMet: 0.429 ± 0.343
0.857IleAsn: 0.857 ± 0.42
3.0IlePro: 3.0 ± 1.612
0.857IleGln: 0.857 ± 0.42
1.286IleArg: 1.286 ± 0.654
4.715IleSer: 4.715 ± 1.671
2.143IleThr: 2.143 ± 0.588
3.858IleVal: 3.858 ± 1.412
0.429IleTrp: 0.429 ± 0.346
2.143IleTyr: 2.143 ± 0.497
0.0IleXaa: 0.0 ± 0.0
Lys
3.858LysAla: 3.858 ± 1.335
1.286LysCys: 1.286 ± 0.345
3.0LysAsp: 3.0 ± 1.15
1.715LysGlu: 1.715 ± 0.543
1.715LysPhe: 1.715 ± 1.004
3.858LysGly: 3.858 ± 1.807
1.715LysHis: 1.715 ± 1.138
1.286LysIle: 1.286 ± 0.436
3.0LysLys: 3.0 ± 1.24
2.572LysLeu: 2.572 ± 0.924
1.286LysMet: 1.286 ± 0.952
2.572LysAsn: 2.572 ± 0.868
1.715LysPro: 1.715 ± 0.536
1.286LysGln: 1.286 ± 0.687
5.144LysArg: 5.144 ± 0.57
2.572LysSer: 2.572 ± 2.073
2.143LysThr: 2.143 ± 0.634
2.572LysVal: 2.572 ± 0.994
0.857LysTrp: 0.857 ± 0.654
2.572LysTyr: 2.572 ± 0.47
0.0LysXaa: 0.0 ± 0.0
Leu
4.715LeuAla: 4.715 ± 0.966
1.715LeuCys: 1.715 ± 0.746
3.858LeuAsp: 3.858 ± 0.716
4.715LeuGlu: 4.715 ± 0.91
6.858LeuPhe: 6.858 ± 1.573
8.573LeuGly: 8.573 ± 1.244
1.715LeuHis: 1.715 ± 0.967
3.858LeuIle: 3.858 ± 2.228
4.715LeuLys: 4.715 ± 1.639
7.715LeuLeu: 7.715 ± 2.694
2.143LeuMet: 2.143 ± 1.129
3.858LeuAsn: 3.858 ± 1.088
4.715LeuPro: 4.715 ± 1.651
5.572LeuGln: 5.572 ± 1.546
4.286LeuArg: 4.286 ± 1.759
7.715LeuSer: 7.715 ± 2.305
3.0LeuThr: 3.0 ± 1.084
3.429LeuVal: 3.429 ± 0.518
1.286LeuTrp: 1.286 ± 0.579
4.715LeuTyr: 4.715 ± 1.033
0.0LeuXaa: 0.0 ± 0.0
Met
2.143MetAla: 2.143 ± 0.914
0.429MetCys: 0.429 ± 0.346
0.429MetAsp: 0.429 ± 0.343
0.429MetGlu: 0.429 ± 0.432
0.429MetPhe: 0.429 ± 0.343
0.857MetGly: 0.857 ± 0.418
0.0MetHis: 0.0 ± 0.0
1.286MetIle: 1.286 ± 0.937
0.0MetLys: 0.0 ± 0.0
0.429MetLeu: 0.429 ± 0.432
0.0MetMet: 0.0 ± 0.0
0.857MetAsn: 0.857 ± 0.628
0.429MetPro: 0.429 ± 0.635
0.0MetGln: 0.0 ± 0.0
2.572MetArg: 2.572 ± 1.085
1.715MetSer: 1.715 ± 0.947
1.715MetThr: 1.715 ± 0.259
0.857MetVal: 0.857 ± 0.691
0.0MetTrp: 0.0 ± 0.0
0.429MetTyr: 0.429 ± 0.432
0.0MetXaa: 0.0 ± 0.0
Asn
2.143AsnAla: 2.143 ± 1.046
0.857AsnCys: 0.857 ± 0.691
2.143AsnAsp: 2.143 ± 1.084
1.715AsnGlu: 1.715 ± 0.613
1.286AsnPhe: 1.286 ± 0.654
1.286AsnGly: 1.286 ± 0.403
0.429AsnHis: 0.429 ± 0.343
0.429AsnIle: 0.429 ± 0.343
0.857AsnLys: 0.857 ± 0.418
3.858AsnLeu: 3.858 ± 1.522
0.0AsnMet: 0.0 ± 0.0
2.143AsnAsn: 2.143 ± 0.929
4.286AsnPro: 4.286 ± 1.496
2.143AsnGln: 2.143 ± 1.084
3.0AsnArg: 3.0 ± 1.582
2.572AsnSer: 2.572 ± 0.691
3.858AsnThr: 3.858 ± 1.551
2.572AsnVal: 2.572 ± 0.876
0.0AsnTrp: 0.0 ± 0.0
0.429AsnTyr: 0.429 ± 0.343
0.0AsnXaa: 0.0 ± 0.0
Pro
6.001ProAla: 6.001 ± 1.99
1.286ProCys: 1.286 ± 0.94
3.429ProAsp: 3.429 ± 1.427
6.429ProGlu: 6.429 ± 1.161
2.572ProPhe: 2.572 ± 0.841
3.429ProGly: 3.429 ± 0.725
0.857ProHis: 0.857 ± 0.516
3.429ProIle: 3.429 ± 1.047
2.143ProLys: 2.143 ± 0.929
6.429ProLeu: 6.429 ± 2.047
0.429ProMet: 0.429 ± 0.432
1.715ProAsn: 1.715 ± 0.677
8.573ProPro: 8.573 ± 2.502
2.572ProGln: 2.572 ± 1.014
6.001ProArg: 6.001 ± 2.069
5.144ProSer: 5.144 ± 1.558
4.286ProThr: 4.286 ± 0.999
3.858ProVal: 3.858 ± 1.766
0.429ProTrp: 0.429 ± 0.594
1.286ProTyr: 1.286 ± 1.028
0.0ProXaa: 0.0 ± 0.0
Gln
3.0GlnAla: 3.0 ± 1.115
0.429GlnCys: 0.429 ± 0.432
1.715GlnAsp: 1.715 ± 0.786
1.286GlnGlu: 1.286 ± 0.79
0.429GlnPhe: 0.429 ± 0.346
2.572GlnGly: 2.572 ± 1.206
0.857GlnHis: 0.857 ± 0.608
0.429GlnIle: 0.429 ± 0.346
2.572GlnLys: 2.572 ± 1.361
6.001GlnLeu: 6.001 ± 2.047
0.857GlnMet: 0.857 ± 0.59
2.572GlnAsn: 2.572 ± 1.287
2.143GlnPro: 2.143 ± 0.697
1.715GlnGln: 1.715 ± 0.259
2.143GlnArg: 2.143 ± 0.634
3.0GlnSer: 3.0 ± 1.032
3.0GlnThr: 3.0 ± 1.143
2.572GlnVal: 2.572 ± 1.104
0.429GlnTrp: 0.429 ± 0.346
1.715GlnTyr: 1.715 ± 0.259
0.0GlnXaa: 0.0 ± 0.0
Arg
3.858ArgAla: 3.858 ± 0.981
3.0ArgCys: 3.0 ± 2.152
2.572ArgAsp: 2.572 ± 0.95
5.572ArgGlu: 5.572 ± 1.197
3.858ArgPhe: 3.858 ± 0.679
4.286ArgGly: 4.286 ± 0.938
1.286ArgHis: 1.286 ± 0.682
1.715ArgIle: 1.715 ± 0.995
4.286ArgLys: 4.286 ± 0.918
5.572ArgLeu: 5.572 ± 0.709
0.429ArgMet: 0.429 ± 0.432
1.715ArgAsn: 1.715 ± 0.761
8.573ArgPro: 8.573 ± 3.111
1.715ArgGln: 1.715 ± 1.199
8.144ArgArg: 8.144 ± 2.872
6.858ArgSer: 6.858 ± 1.312
2.572ArgThr: 2.572 ± 0.529
5.572ArgVal: 5.572 ± 1.552
1.286ArgTrp: 1.286 ± 0.774
1.286ArgTyr: 1.286 ± 0.345
0.0ArgXaa: 0.0 ± 0.0
Ser
3.858SerAla: 3.858 ± 1.805
1.715SerCys: 1.715 ± 0.774
4.286SerAsp: 4.286 ± 1.516
5.144SerGlu: 5.144 ± 1.172
5.572SerPhe: 5.572 ± 1.775
4.715SerGly: 4.715 ± 1.781
0.857SerHis: 0.857 ± 0.513
3.0SerIle: 3.0 ± 1.128
2.572SerLys: 2.572 ± 0.837
10.716SerLeu: 10.716 ± 1.212
1.286SerMet: 1.286 ± 0.861
4.286SerAsn: 4.286 ± 1.036
3.858SerPro: 3.858 ± 1.467
4.286SerGln: 4.286 ± 1.559
5.572SerArg: 5.572 ± 2.419
5.144SerSer: 5.144 ± 1.076
8.144SerThr: 8.144 ± 2.665
2.572SerVal: 2.572 ± 1.242
0.857SerTrp: 0.857 ± 0.43
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.0ThrAla: 3.0 ± 0.77
0.857ThrCys: 0.857 ± 0.406
3.429ThrAsp: 3.429 ± 0.857
1.715ThrGlu: 1.715 ± 0.59
1.715ThrPhe: 1.715 ± 0.8
4.286ThrGly: 4.286 ± 1.301
0.857ThrHis: 0.857 ± 0.43
1.286ThrIle: 1.286 ± 0.686
2.143ThrLys: 2.143 ± 0.832
4.286ThrLeu: 4.286 ± 1.025
0.857ThrMet: 0.857 ± 0.42
0.857ThrAsn: 0.857 ± 0.519
3.858ThrPro: 3.858 ± 1.331
1.715ThrGln: 1.715 ± 0.661
7.287ThrArg: 7.287 ± 1.21
8.144ThrSer: 8.144 ± 2.001
3.858ThrThr: 3.858 ± 1.132
4.715ThrVal: 4.715 ± 0.856
0.429ThrTrp: 0.429 ± 0.432
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
2.143ValAla: 2.143 ± 0.878
1.715ValCys: 1.715 ± 1.22
6.001ValAsp: 6.001 ± 1.1
5.572ValGlu: 5.572 ± 1.403
2.143ValPhe: 2.143 ± 0.588
3.0ValGly: 3.0 ± 1.093
0.857ValHis: 0.857 ± 0.681
4.286ValIle: 4.286 ± 1.27
1.715ValLys: 1.715 ± 0.837
5.572ValLeu: 5.572 ± 1.078
0.857ValMet: 0.857 ± 0.691
2.143ValAsn: 2.143 ± 1.118
4.715ValPro: 4.715 ± 1.266
2.572ValGln: 2.572 ± 1.104
5.144ValArg: 5.144 ± 1.86
7.287ValSer: 7.287 ± 0.799
3.429ValThr: 3.429 ± 1.304
3.858ValVal: 3.858 ± 1.256
1.286ValTrp: 1.286 ± 0.643
1.715ValTyr: 1.715 ± 0.609
0.0ValXaa: 0.0 ± 0.0
Trp
1.715TrpAla: 1.715 ± 0.813
0.0TrpCys: 0.0 ± 0.0
0.429TrpAsp: 0.429 ± 0.346
0.857TrpGlu: 0.857 ± 0.628
0.0TrpPhe: 0.0 ± 0.0
1.715TrpGly: 1.715 ± 0.677
0.429TrpHis: 0.429 ± 0.343
1.286TrpIle: 1.286 ± 0.687
0.857TrpLys: 0.857 ± 0.519
1.286TrpLeu: 1.286 ± 0.639
0.0TrpMet: 0.0 ± 0.0
0.429TrpAsn: 0.429 ± 0.432
0.0TrpPro: 0.0 ± 0.0
0.857TrpGln: 0.857 ± 0.428
0.429TrpArg: 0.429 ± 0.346
0.857TrpSer: 0.857 ± 0.43
1.286TrpThr: 1.286 ± 0.79
0.857TrpVal: 0.857 ± 0.691
0.429TrpTrp: 0.429 ± 0.594
0.429TrpTyr: 0.429 ± 0.346
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.286TyrAla: 1.286 ± 0.345
0.857TyrCys: 0.857 ± 0.789
1.715TyrAsp: 1.715 ± 0.629
1.715TyrGlu: 1.715 ± 0.776
2.143TyrPhe: 2.143 ± 0.423
3.0TyrGly: 3.0 ± 0.569
0.429TyrHis: 0.429 ± 0.343
1.715TyrIle: 1.715 ± 0.623
0.857TyrLys: 0.857 ± 0.691
1.286TyrLeu: 1.286 ± 0.345
1.286TyrMet: 1.286 ± 0.639
0.429TyrAsn: 0.429 ± 0.341
1.286TyrPro: 1.286 ± 0.683
1.286TyrGln: 1.286 ± 0.579
1.715TyrArg: 1.715 ± 1.37
1.286TyrSer: 1.286 ± 0.756
1.286TyrThr: 1.286 ± 0.886
2.143TyrVal: 2.143 ± 0.832
0.857TyrTrp: 0.857 ± 0.418
1.715TyrTyr: 1.715 ± 1.199
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2334 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski