Amino acid dipeptide frequency for Macaca fascicularis papillomavirus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins randomly and with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
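The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the use of overlapping pairs, the rule that pairs never span two proteins, and the normalization by the total pair count are all assumptions, and the two input sequences are toy data.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the column order used by the table; "X" maps to Xaa.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard residues counts as Xaa.
        codes = [ONE_TO_THREE.get(res, "Xaa") for res in seq.upper()]
        # Assumption: overlapping pairs, counted within each protein only.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

freqs = dipeptide_per_mille(["MALLV", "LLK"])  # toy sequences, not real data
over = sorted(d for d, f in freqs.items() if f > EXPECTED)
```

Here `over` lists the dipeptides that would be marked as overrepresented under the uniform expectation; with only two short toy sequences, every observed pair ends up in it.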

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.038 ± 1.374
AlaCys: 1.679 ± 1.176
AlaAsp: 2.939 ± 1.23
AlaGlu: 3.778 ± 1.299
AlaPhe: 2.519 ± 1.387
AlaGly: 5.038 ± 1.477
AlaHis: 2.099 ± 0.638
AlaIle: 2.939 ± 0.736
AlaLys: 2.939 ± 1.095
AlaLeu: 5.458 ± 1.923
AlaMet: 1.679 ± 0.576
AlaAsn: 1.679 ± 1.008
AlaPro: 5.458 ± 2.176
AlaGln: 2.519 ± 0.876
AlaArg: 3.778 ± 1.101
AlaSer: 3.359 ± 1.534
AlaThr: 6.297 ± 1.417
AlaVal: 2.939 ± 1.076
AlaTrp: 0.84 ± 0.474
AlaTyr: 2.519 ± 0.42
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.099 ± 0.784
CysCys: 1.679 ± 1.069
CysAsp: 0.84 ± 0.436
CysGlu: 1.679 ± 0.846
CysPhe: 1.679 ± 1.144
CysGly: 1.259 ± 0.631
CysHis: 0.42 ± 0.566
CysIle: 0.84 ± 0.66
CysLys: 2.939 ± 1.143
CysLeu: 3.778 ± 2.054
CysMet: 0.42 ± 0.37
CysAsn: 0.42 ± 0.33
CysPro: 2.519 ± 0.717
CysGln: 1.259 ± 0.735
CysArg: 0.42 ± 0.33
CysSer: 1.259 ± 0.689
CysThr: 2.519 ± 0.789
CysVal: 2.519 ± 0.866
CysTrp: 1.259 ± 0.517
CysTyr: 0.84 ± 0.675
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.519 ± 1.037
AspCys: 2.939 ± 1.157
AspAsp: 3.359 ± 1.627
AspGlu: 3.778 ± 1.239
AspPhe: 2.519 ± 0.549
AspGly: 2.519 ± 1.172
AspHis: 1.259 ± 0.634
AspIle: 4.198 ± 1.642
AspLys: 2.519 ± 1.307
AspLeu: 5.458 ± 1.565
AspMet: 0.42 ± 0.37
AspAsn: 0.84 ± 0.504
AspPro: 4.618 ± 1.286
AspGln: 2.099 ± 0.389
AspArg: 1.679 ± 0.773
AspSer: 3.359 ± 1.013
AspThr: 5.877 ± 1.031
AspVal: 6.297 ± 2.307
AspTrp: 1.259 ± 0.593
AspTyr: 1.679 ± 1.008
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.618 ± 0.989
GluCys: 0.84 ± 0.504
GluAsp: 7.557 ± 2.247
GluGlu: 2.519 ± 1.197
GluPhe: 1.259 ± 0.646
GluGly: 3.778 ± 1.759
GluHis: 1.259 ± 0.751
GluIle: 1.259 ± 1.255
GluLys: 2.099 ± 0.684
GluLeu: 5.458 ± 1.764
GluMet: 1.259 ± 0.902
GluAsn: 2.099 ± 0.742
GluPro: 4.198 ± 1.239
GluGln: 3.778 ± 1.3
GluArg: 2.519 ± 1.19
GluSer: 2.519 ± 0.681
GluThr: 2.519 ± 0.938
GluVal: 3.359 ± 0.783
GluTrp: 0.84 ± 0.436
GluTyr: 1.679 ± 1.287
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.259 ± 0.715
PheCys: 0.42 ± 0.33
PheAsp: 1.679 ± 1.144
PheGlu: 0.84 ± 0.66
PhePhe: 1.679 ± 0.622
PheGly: 2.099 ± 0.92
PheHis: 0.84 ± 0.905
PheIle: 2.099 ± 0.73
PheLys: 3.778 ± 1.672
PheLeu: 5.877 ± 2.406
PheMet: 0.84 ± 0.707
PheAsn: 2.099 ± 0.479
PhePro: 0.84 ± 0.368
PheGln: 1.259 ± 0.392
PheArg: 1.259 ± 0.774
PheSer: 1.259 ± 0.369
PheThr: 1.679 ± 1.052
PheVal: 1.679 ± 1.008
PheTrp: 1.259 ± 0.678
PheTyr: 2.939 ± 1.123
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.939 ± 0.96
GlyCys: 0.42 ± 0.37
GlyAsp: 5.877 ± 0.871
GlyGlu: 4.198 ± 1.233
GlyPhe: 0.84 ± 0.438
GlyGly: 5.038 ± 2.284
GlyHis: 2.099 ± 1.255
GlyIle: 2.939 ± 1.095
GlyLys: 3.778 ± 1.257
GlyLeu: 4.618 ± 1.305
GlyMet: 0.84 ± 0.368
GlyAsn: 3.359 ± 1.115
GlyPro: 3.778 ± 1.436
GlyGln: 2.099 ± 0.667
GlyArg: 3.778 ± 1.231
GlySer: 5.877 ± 0.865
GlyThr: 5.458 ± 1.781
GlyVal: 5.038 ± 0.986
GlyTrp: 0.84 ± 0.436
GlyTyr: 1.259 ± 0.369
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.519 ± 0.567
HisCys: 0.42 ± 0.515
HisAsp: 1.679 ± 0.682
HisGlu: 0.84 ± 0.556
HisPhe: 0.84 ± 0.438
HisGly: 1.259 ± 0.439
HisHis: 0.84 ± 0.584
HisIle: 1.679 ± 1.297
HisLys: 0.84 ± 0.436
HisLeu: 1.679 ± 0.655
HisMet: 0.42 ± 0.33
HisAsn: 1.679 ± 0.9
HisPro: 2.099 ± 1.086
HisGln: 1.259 ± 0.761
HisArg: 0.84 ± 0.707
HisSer: 3.359 ± 0.636
HisThr: 1.259 ± 0.721
HisVal: 4.198 ± 0.998
HisTrp: 0.84 ± 0.504
HisTyr: 0.84 ± 0.66
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.519 ± 0.877
IleCys: 1.259 ± 0.776
IleAsp: 2.099 ± 1.006
IleGlu: 3.359 ± 0.985
IlePhe: 0.84 ± 0.368
IleGly: 2.519 ± 0.862
IleHis: 1.679 ± 0.59
IleIle: 1.679 ± 0.557
IleLys: 0.42 ± 0.37
IleLeu: 2.939 ± 1.098
IleMet: 0.0 ± 0.0
IleAsn: 0.42 ± 0.425
IlePro: 3.359 ± 1.354
IleGln: 1.679 ± 0.736
IleArg: 1.259 ± 0.732
IleSer: 1.679 ± 0.989
IleThr: 2.519 ± 0.793
IleVal: 5.038 ± 1.432
IleTrp: 0.0 ± 0.0
IleTyr: 1.679 ± 0.748
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.359 ± 1.001
LysCys: 2.099 ± 1.234
LysAsp: 2.099 ± 1.209
LysGlu: 2.519 ± 1.298
LysPhe: 2.519 ± 1.104
LysGly: 3.359 ± 1.168
LysHis: 1.259 ± 0.646
LysIle: 0.84 ± 0.849
LysLys: 1.679 ± 0.921
LysLeu: 2.519 ± 1.146
LysMet: 0.42 ± 0.37
LysAsn: 1.679 ± 0.928
LysPro: 1.259 ± 0.843
LysGln: 2.099 ± 1.033
LysArg: 5.458 ± 1.18
LysSer: 2.519 ± 1.338
LysThr: 2.939 ± 1.035
LysVal: 4.618 ± 1.277
LysTrp: 0.42 ± 0.425
LysTyr: 2.519 ± 0.535
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.778 ± 1.33
LeuCys: 5.038 ± 2.955
LeuAsp: 4.198 ± 0.657
LeuGlu: 6.297 ± 2.365
LeuPhe: 3.359 ± 1.901
LeuGly: 4.618 ± 1.117
LeuHis: 5.038 ± 1.163
LeuIle: 2.939 ± 1.694
LeuLys: 5.038 ± 1.101
LeuLeu: 10.915 ± 5.199
LeuMet: 1.679 ± 0.931
LeuAsn: 2.519 ± 0.749
LeuPro: 2.519 ± 0.947
LeuGln: 7.137 ± 1.555
LeuArg: 5.458 ± 0.675
LeuSer: 5.038 ± 1.493
LeuThr: 4.198 ± 0.908
LeuVal: 4.618 ± 0.943
LeuTrp: 1.259 ± 0.466
LeuTyr: 3.359 ± 0.794
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.259 ± 0.776
MetCys: 0.42 ± 0.33
MetAsp: 2.519 ± 0.812
MetGlu: 0.84 ± 0.656
MetPhe: 0.84 ± 0.368
MetGly: 1.679 ± 0.734
MetHis: 0.42 ± 0.353
MetIle: 0.84 ± 0.65
MetLys: 0.42 ± 0.33
MetLeu: 1.259 ± 0.678
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.84 ± 0.66
MetArg: 1.259 ± 0.369
MetSer: 1.679 ± 0.886
MetThr: 1.679 ± 0.273
MetVal: 1.679 ± 0.74
MetTrp: 0.42 ± 0.425
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.939 ± 1.211
AsnCys: 0.84 ± 0.746
AsnAsp: 0.42 ± 0.37
AsnGlu: 1.679 ± 0.615
AsnPhe: 0.84 ± 0.74
AsnGly: 2.099 ± 0.479
AsnHis: 0.42 ± 0.425
AsnIle: 0.84 ± 0.512
AsnLys: 2.519 ± 1.812
AsnLeu: 2.099 ± 0.919
AsnMet: 0.42 ± 0.37
AsnAsn: 1.679 ± 0.695
AsnPro: 2.519 ± 0.747
AsnGln: 0.84 ± 0.504
AsnArg: 1.679 ± 0.751
AsnSer: 2.939 ± 1.48
AsnThr: 3.778 ± 1.416
AsnVal: 2.939 ± 1.871
AsnTrp: 0.84 ± 0.66
AsnTyr: 0.42 ± 0.452
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.618 ± 2.604
ProCys: 0.42 ± 0.37
ProAsp: 3.778 ± 1.379
ProGlu: 2.099 ± 0.734
ProPhe: 0.84 ± 0.66
ProGly: 3.359 ± 0.87
ProHis: 0.0 ± 0.0
ProIle: 1.679 ± 0.667
ProLys: 2.939 ± 1.207
ProLeu: 7.976 ± 1.662
ProMet: 0.84 ± 0.438
ProAsn: 2.939 ± 1.477
ProPro: 9.656 ± 1.792
ProGln: 2.519 ± 0.931
ProArg: 1.679 ± 0.67
ProSer: 6.297 ± 2.39
ProThr: 5.038 ± 1.516
ProVal: 5.877 ± 1.585
ProTrp: 0.42 ± 0.515
ProTyr: 2.099 ± 0.95
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.939 ± 1.634
GlnCys: 2.099 ± 0.936
GlnAsp: 2.939 ± 1.382
GlnGlu: 2.939 ± 0.892
GlnPhe: 2.939 ± 0.491
GlnGly: 2.519 ± 1.146
GlnHis: 1.259 ± 0.569
GlnIle: 0.42 ± 0.425
GlnLys: 1.259 ± 0.586
GlnLeu: 4.618 ± 0.987
GlnMet: 1.679 ± 0.52
GlnAsn: 1.679 ± 1.018
GlnPro: 2.939 ± 0.609
GlnGln: 3.359 ± 2.456
GlnArg: 3.359 ± 1.45
GlnSer: 0.84 ± 0.709
GlnThr: 3.359 ± 0.573
GlnVal: 2.099 ± 0.479
GlnTrp: 1.259 ± 0.732
GlnTyr: 0.84 ± 0.368
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.297 ± 1.302
ArgCys: 3.359 ± 1.265
ArgAsp: 0.84 ± 0.592
ArgGlu: 2.099 ± 1.06
ArgPhe: 3.778 ± 1.023
ArgGly: 2.099 ± 0.957
ArgHis: 2.099 ± 0.667
ArgIle: 1.259 ± 0.733
ArgLys: 3.778 ± 0.912
ArgLeu: 5.038 ± 0.722
ArgMet: 0.84 ± 0.593
ArgAsn: 0.0 ± 0.0
ArgPro: 5.038 ± 1.632
ArgGln: 2.099 ± 0.638
ArgArg: 4.618 ± 1.549
ArgSer: 3.778 ± 0.957
ArgThr: 4.618 ± 1.47
ArgVal: 2.939 ± 0.62
ArgTrp: 1.259 ± 0.555
ArgTyr: 1.259 ± 0.713
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.877 ± 1.731
SerCys: 0.42 ± 0.33
SerAsp: 4.198 ± 0.975
SerGlu: 3.359 ± 0.708
SerPhe: 0.84 ± 0.368
SerGly: 6.297 ± 1.75
SerHis: 1.679 ± 0.73
SerIle: 5.038 ± 0.744
SerLys: 2.519 ± 1.284
SerLeu: 5.458 ± 1.145
SerMet: 2.099 ± 1.011
SerAsn: 2.939 ± 1.946
SerPro: 2.519 ± 1.01
SerGln: 2.519 ± 1.219
SerArg: 5.038 ± 0.789
SerSer: 5.877 ± 1.806
SerThr: 4.618 ± 0.656
SerVal: 4.618 ± 1.189
SerTrp: 0.0 ± 0.0
SerTyr: 2.939 ± 1.169
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.778 ± 1.424
ThrCys: 2.519 ± 0.932
ThrAsp: 3.359 ± 0.847
ThrGlu: 4.198 ± 1.417
ThrPhe: 3.359 ± 0.833
ThrGly: 5.458 ± 1.367
ThrHis: 2.519 ± 1.464
ThrIle: 1.679 ± 1.117
ThrLys: 2.099 ± 1.054
ThrLeu: 5.458 ± 1.592
ThrMet: 1.679 ± 0.886
ThrAsn: 2.519 ± 0.59
ThrPro: 5.038 ± 2.043
ThrGln: 2.939 ± 0.998
ThrArg: 2.939 ± 1.102
ThrSer: 6.297 ± 2.22
ThrThr: 4.198 ± 1.566
ThrVal: 8.396 ± 2.024
ThrTrp: 0.84 ± 0.436
ThrTyr: 2.099 ± 0.647
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.618 ± 2.065
ValCys: 2.939 ± 1.313
ValAsp: 5.458 ± 0.719
ValGlu: 5.038 ± 1.401
ValPhe: 3.359 ± 0.846
ValGly: 3.778 ± 1.278
ValHis: 2.939 ± 0.991
ValIle: 1.679 ± 0.539
ValLys: 2.939 ± 1.297
ValLeu: 3.778 ± 1.197
ValMet: 1.679 ± 0.911
ValAsn: 2.519 ± 0.441
ValPro: 4.618 ± 1.427
ValGln: 3.359 ± 1.631
ValArg: 5.458 ± 0.594
ValSer: 8.396 ± 1.45
ValThr: 5.877 ± 1.871
ValVal: 5.038 ± 1.143
ValTrp: 0.84 ± 0.504
ValTyr: 3.778 ± 1.482
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.84 ± 0.368
TrpCys: 0.0 ± 0.0
TrpAsp: 0.84 ± 0.474
TrpGlu: 1.259 ± 0.635
TrpPhe: 0.42 ± 0.33
TrpGly: 2.099 ± 0.84
TrpHis: 0.0 ± 0.0
TrpIle: 0.84 ± 0.66
TrpLys: 0.42 ± 0.33
TrpLeu: 0.84 ± 0.368
TrpMet: 0.0 ± 0.0
TrpAsn: 0.42 ± 0.37
TrpPro: 0.42 ± 0.353
TrpGln: 0.0 ± 0.0
TrpArg: 2.939 ± 1.08
TrpSer: 0.84 ± 0.66
TrpThr: 1.679 ± 1.154
TrpVal: 1.259 ± 0.671
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.42 ± 0.33
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.679 ± 0.826
TyrCys: 0.84 ± 0.905
TyrAsp: 2.519 ± 0.535
TyrGlu: 2.099 ± 0.83
TyrPhe: 0.42 ± 0.33
TyrGly: 4.198 ± 0.809
TyrHis: 1.259 ± 0.606
TyrIle: 1.259 ± 0.66
TyrLys: 1.259 ± 0.392
TyrLeu: 3.778 ± 1.199
TyrMet: 0.42 ± 0.33
TyrAsn: 1.259 ± 0.586
TyrPro: 1.679 ± 1.209
TyrGln: 1.679 ± 0.614
TyrArg: 2.099 ± 1.033
TyrSer: 1.679 ± 1.313
TyrThr: 1.259 ± 0.466
TyrVal: 2.939 ± 0.953
TyrTrp: 0.84 ± 0.368
TyrTyr: 1.679 ± 1.008
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2383 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
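A protein-level bootstrap of this kind could look roughly like the sketch below. The page states only that 100 bootstrap rounds were run at the protein level, so everything else here is an assumption: the helper names `pair_frequency` and `bootstrap_error`, resampling as many proteins as the proteome contains, and reporting the population standard deviation of the resampled estimates as the error. The sequences are toy data.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per mille frequency of a pair given in one-letter codes, e.g. "LL"."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs, counted within each protein only (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the pair frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)


toy = ["MALLV", "LLK", "MKVLA"]  # toy sequences, not real data
error = bootstrap_error(toy, "LL")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is one plausible reason the error bars are large for a proteome of only 8 proteins.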

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski