Amino acid dipeptide frequency for Ovis aries papillomavirus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
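As an illustration of the calculation described above, the following minimal Python sketch counts overlapping dipeptides and reports them in per mille. It is not the code used by Proteome-pI: the function name `dipeptide_permille`, the choice to count pairs only within each protein, and the mapping of every non-standard letter to `X` are assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)

# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_UNIFORM_PERMILLE = 1000.0 / 400   # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    so no dipeptide spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Report all 441 combinations, including those never observed.
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


freqs = dipeptide_permille(["AAC", "AC"])   # toy input, not real data
over = [dp for dp, f in freqs.items() if f > EXPECTED_UNIFORM_PERMILLE]
```

With the toy input, three dipeptides are counted (AA, AC, AC), so `over` contains only `AA` and `AC`; on a real proteome the same comparison against 2.5‰ separates overrepresented from underrepresented dipeptides.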

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.93 ± 1.994 | 1.271 ± 0.662 | 4.659 ± 1.169 | 4.235 ± 0.865 | 2.965 ± 0.943 | 5.083 ± 1.253 | 1.271 ± 0.704 | 1.271 ± 0.465 | 2.965 ± 0.903 | 2.118 ± 0.958 | 1.271 ± 0.888 | 1.694 ± 0.556 | 3.812 ± 1.229 | 2.965 ± 1.051 | 1.271 ± 1.043 | 1.271 ± 0.383 | 3.812 ± 1.304 | 4.659 ± 1.111 | 1.694 ± 1.34 | 1.271 ± 0.616 | 0.0 ± 0.0 |
| Cys | 2.118 ± 0.939 | 1.694 ± 1.474 | 2.118 ± 1.023 | 0.847 ± 0.695 | 0.424 ± 0.345 | 1.271 ± 1.453 | 0.0 ± 0.0 | 2.118 ± 1.806 | 2.118 ± 0.508 | 2.118 ± 1.085 | 0.424 ± 0.345 | 1.271 ± 1.262 | 2.541 ± 0.818 | 1.694 ± 0.85 | 0.424 ± 0.385 | 0.424 ± 0.345 | 0.424 ± 0.345 | 1.271 ± 1.244 | 0.847 ± 0.426 | 0.424 ± 0.348 | 0.0 ± 0.0 |
| Asp | 2.118 ± 0.446 | 2.118 ± 1.085 | 2.541 ± 1.162 | 2.965 ± 0.995 | 2.541 ± 0.87 | 3.388 ± 1.016 | 0.424 ± 0.385 | 3.812 ± 0.576 | 2.118 ± 0.628 | 7.2 ± 1.464 | 0.0 ± 0.0 | 2.965 ± 0.888 | 3.812 ± 2.019 | 1.694 ± 0.556 | 2.118 ± 1.578 | 7.2 ± 1.27 | 5.506 ± 1.432 | 3.388 ± 1.767 | 0.424 ± 0.348 | 1.694 ± 0.608 | 0.0 ± 0.0 |
| Glu | 5.93 ± 2.3 | 1.271 ± 0.926 | 3.388 ± 1.051 | 8.471 ± 2.532 | 0.847 ± 0.771 | 3.812 ± 1.723 | 1.694 ± 0.672 | 2.965 ± 0.753 | 3.388 ± 0.865 | 5.506 ± 1.791 | 2.118 ± 1.099 | 3.388 ± 0.445 | 3.812 ± 1.321 | 3.388 ± 1.257 | 2.541 ± 1.394 | 3.812 ± 0.908 | 6.353 ± 1.615 | 2.965 ± 0.539 | 0.424 ± 0.345 | 1.271 ± 0.624 | 0.0 ± 0.0 |
| Phe | 1.271 ± 0.361 | 1.271 ± 0.922 | 1.694 ± 0.705 | 1.694 ± 1.201 | 1.271 ± 0.624 | 3.388 ± 1.665 | 0.424 ± 0.385 | 2.965 ± 1.121 | 2.118 ± 1.297 | 3.388 ± 1.054 | 0.424 ± 0.345 | 1.271 ± 0.709 | 1.694 ± 0.58 | 1.694 ± 0.529 | 2.118 ± 1.011 | 2.965 ± 0.691 | 2.118 ± 0.94 | 2.118 ± 0.446 | 1.694 ± 0.834 | 1.271 ± 0.793 | 0.0 ± 0.0 |
| Gly | 2.118 ± 0.88 | 1.271 ± 0.624 | 5.083 ± 2.11 | 5.93 ± 0.877 | 1.694 ± 0.97 | 5.93 ± 4.368 | 2.965 ± 0.789 | 2.965 ± 0.469 | 2.541 ± 0.839 | 5.083 ± 1.596 | 1.694 ± 0.635 | 2.118 ± 0.726 | 4.235 ± 2.032 | 2.965 ± 1.137 | 4.235 ± 0.804 | 6.353 ± 1.266 | 5.083 ± 1.767 | 2.965 ± 0.841 | 0.847 ± 0.462 | 0.424 ± 0.337 | 0.0 ± 0.0 |
| His | 0.424 ± 0.337 | 0.424 ± 0.345 | 0.424 ± 0.337 | 2.118 ± 0.488 | 1.271 ± 0.624 | 2.965 ± 1.051 | 0.424 ± 0.538 | 0.424 ± 0.345 | 0.424 ± 0.348 | 2.118 ± 0.508 | 1.271 ± 0.465 | 0.424 ± 0.385 | 2.118 ± 0.421 | 1.694 ± 0.58 | 0.424 ± 0.345 | 0.424 ± 0.337 | 0.424 ± 0.638 | 1.694 ± 0.286 | 1.694 ± 0.574 | 1.271 ± 0.659 | 0.0 ± 0.0 |
| Ile | 1.271 ± 0.854 | 1.271 ± 0.878 | 2.118 ± 0.842 | 2.965 ± 1.146 | 1.271 ± 0.704 | 2.541 ± 0.85 | 0.847 ± 0.417 | 1.271 ± 0.793 | 0.847 ± 0.462 | 3.812 ± 0.835 | 0.424 ± 0.328 | 0.847 ± 0.591 | 2.541 ± 1.479 | 0.847 ± 0.771 | 1.271 ± 1.244 | 5.083 ± 1.168 | 2.541 ± 0.793 | 5.083 ± 1.24 | 0.0 ± 0.0 | 2.118 ± 0.964 | 0.0 ± 0.0 |
| Lys | 4.235 ± 1.479 | 2.118 ± 1.089 | 2.541 ± 1.025 | 1.694 ± 0.898 | 1.271 ± 0.662 | 2.118 ± 1.0 | 2.965 ± 0.767 | 1.271 ± 0.639 | 7.2 ± 2.552 | 5.083 ± 1.667 | 0.424 ± 0.375 | 0.847 ± 0.67 | 2.541 ± 0.7 | 2.118 ± 0.884 | 4.659 ± 0.597 | 3.388 ± 1.351 | 2.118 ± 1.236 | 3.388 ± 1.084 | 0.424 ± 0.345 | 1.694 ± 0.286 | 0.0 ± 0.0 |
| Leu | 2.541 ± 0.908 | 2.541 ± 0.762 | 6.353 ± 1.418 | 3.388 ± 1.109 | 3.812 ± 0.722 | 7.2 ± 2.225 | 1.694 ± 0.829 | 2.965 ± 0.776 | 5.506 ± 1.418 | 7.2 ± 1.596 | 1.694 ± 0.611 | 3.388 ± 0.929 | 3.388 ± 1.062 | 4.235 ± 1.38 | 6.353 ± 1.508 | 8.895 ± 1.07 | 4.659 ± 0.694 | 3.812 ± 0.823 | 2.541 ± 1.166 | 3.388 ± 1.765 | 0.0 ± 0.0 |
| Met | 0.847 ± 0.472 | 0.0 ± 0.0 | 0.847 ± 0.771 | 1.694 ± 1.001 | 0.847 ± 0.472 | 0.424 ± 0.385 | 0.424 ± 0.348 | 0.0 ± 0.0 | 0.424 ± 0.345 | 0.424 ± 0.345 | 0.424 ± 0.385 | 0.424 ± 0.385 | 1.271 ± 0.622 | 1.271 ± 0.361 | 1.694 ± 1.062 | 1.271 ± 0.361 | 1.271 ± 0.415 | 2.118 ± 0.884 | 0.0 ± 0.0 | 0.847 ± 0.67 | 0.0 ± 0.0 |
| Asn | 2.118 ± 0.431 | 0.847 ± 0.362 | 1.694 ± 0.632 | 2.965 ± 0.873 | 1.694 ± 0.791 | 2.118 ± 1.462 | 0.424 ± 0.345 | 2.541 ± 1.18 | 3.812 ± 1.6 | 3.388 ± 1.689 | 0.847 ± 0.771 | 2.965 ± 1.702 | 1.694 ± 0.625 | 1.271 ± 0.361 | 2.541 ± 0.549 | 3.388 ± 0.695 | 0.847 ± 0.426 | 2.965 ± 2.134 | 1.694 ± 0.58 | 1.694 ± 0.635 | 0.0 ± 0.0 |
| Pro | 5.083 ± 2.241 | 0.847 ± 0.889 | 5.506 ± 2.471 | 5.506 ± 0.943 | 2.965 ± 0.469 | 1.271 ± 0.73 | 1.271 ± 0.571 | 1.694 ± 0.672 | 5.083 ± 2.05 | 5.93 ± 1.402 | 0.847 ± 0.631 | 2.965 ± 1.027 | 9.742 ± 2.083 | 2.541 ± 0.7 | 2.541 ± 0.886 | 2.965 ± 0.753 | 3.388 ± 1.778 | 4.659 ± 1.754 | 1.271 ± 0.465 | 1.694 ± 1.154 | 0.0 ± 0.0 |
| Gln | 1.694 ± 0.719 | 0.847 ± 0.691 | 0.424 ± 0.385 | 5.506 ± 1.347 | 0.847 ± 0.426 | 3.388 ± 0.835 | 1.694 ± 0.885 | 2.965 ± 0.472 | 0.847 ± 0.362 | 1.694 ± 0.772 | 0.847 ± 0.771 | 0.847 ± 0.462 | 5.083 ± 1.467 | 3.388 ± 1.45 | 3.812 ± 1.963 | 4.659 ± 0.741 | 1.694 ± 0.799 | 4.659 ± 0.809 | 0.424 ± 0.345 | 2.118 ± 0.804 | 0.0 ± 0.0 |
| Arg | 2.965 ± 1.638 | 1.694 ± 2.06 | 4.235 ± 1.376 | 2.118 ± 1.099 | 2.118 ± 0.783 | 2.965 ± 1.06 | 1.271 ± 0.709 | 1.271 ± 0.648 | 3.812 ± 1.091 | 5.93 ± 1.266 | 0.847 ± 0.417 | 2.965 ± 1.016 | 4.659 ± 1.733 | 4.235 ± 1.805 | 8.047 ± 1.134 | 4.659 ± 1.163 | 0.847 ± 0.67 | 2.541 ± 1.009 | 0.0 ± 0.0 | 2.541 ± 0.733 | 0.0 ± 0.0 |
| Ser | 5.083 ± 0.84 | 0.0 ± 0.0 | 7.2 ± 2.126 | 3.388 ± 0.858 | 2.965 ± 1.362 | 6.353 ± 1.794 | 1.271 ± 1.036 | 1.694 ± 1.027 | 3.812 ± 0.748 | 9.742 ± 1.892 | 1.271 ± 0.371 | 2.965 ± 0.89 | 5.083 ± 1.646 | 3.388 ± 1.365 | 2.541 ± 1.162 | 8.047 ± 2.624 | 7.2 ± 1.673 | 4.235 ± 0.879 | 1.271 ± 0.697 | 1.694 ± 0.672 | 0.0 ± 0.0 |
| Thr | 4.659 ± 1.411 | 2.541 ± 1.138 | 1.271 ± 1.165 | 3.388 ± 0.793 | 3.812 ± 1.152 | 3.388 ± 0.916 | 1.694 ± 1.027 | 2.541 ± 1.147 | 0.847 ± 0.678 | 4.235 ± 2.699 | 0.424 ± 0.345 | 2.965 ± 0.601 | 3.812 ± 1.341 | 2.965 ± 0.849 | 3.812 ± 1.882 | 4.235 ± 1.52 | 2.541 ± 0.645 | 10.165 ± 2.457 | 0.0 ± 0.0 | 2.541 ± 0.913 | 0.0 ± 0.0 |
| Val | 3.388 ± 1.084 | 2.118 ± 1.085 | 4.659 ± 1.348 | 4.235 ± 0.879 | 1.694 ± 1.084 | 5.083 ± 1.347 | 0.847 ± 0.715 | 2.541 ± 0.886 | 1.694 ± 0.71 | 6.777 ± 1.017 | 0.0 ± 0.0 | 4.659 ± 1.959 | 5.506 ± 1.665 | 3.812 ± 1.056 | 3.388 ± 0.801 | 6.777 ± 1.764 | 8.047 ± 1.814 | 1.694 ± 0.725 | 1.694 ± 1.068 | 0.424 ± 0.345 | 0.0 ± 0.0 |
| Trp | 0.424 ± 0.345 | 0.0 ± 0.0 | 0.847 ± 0.691 | 1.271 ± 0.648 | 0.847 ± 0.691 | 2.118 ± 1.056 | 0.424 ± 0.348 | 0.847 ± 0.691 | 2.118 ± 0.884 | 0.847 ± 0.362 | 0.0 ± 0.0 | 1.271 ± 0.709 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.118 ± 1.618 | 1.694 ± 0.635 | 0.847 ± 0.404 | 2.541 ± 0.943 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 1.694 ± 0.574 | 0.424 ± 0.533 | 0.424 ± 0.385 | 2.965 ± 0.472 | 1.694 ± 0.641 | 1.694 ± 0.286 | 0.424 ± 0.337 | 0.847 ± 0.404 | 0.424 ± 0.385 | 2.965 ± 0.932 | 0.847 ± 0.691 | 1.271 ± 0.616 | 0.424 ± 0.385 | 1.271 ± 0.694 | 3.812 ± 0.392 | 1.694 ± 0.923 | 2.118 ± 1.119 | 2.118 ± 0.628 | 1.271 ± 0.465 | 2.541 ± 1.074 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 7 proteins (2362 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
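Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the function names, the fixed random seed, and the use of the sample standard deviation as the reported error are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Frequency (per mille) of each observed dipeptide, counted within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Bootstrap error of one dipeptide frequency, resampling whole proteins.

    Assumptions: the error is the sample standard deviation across
    resamples, and the seed is fixed only to make the sketch reproducible.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    return statistics.stdev(estimates)


toy_proteome = ["AAAA", "CCCC", "ACAC"]   # toy input, not real data
error_aa = bootstrap_error(toy_proteome, "AA")
```

Because resampling is done per protein rather than per residue, a dipeptide concentrated in a single protein gets a large error, which is one plausible reading of the wide intervals seen for a proteome of only 7 proteins.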

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski