Amino acid dipeptide frequency for Macaca fascicularis papillomavirus 3b

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
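As a rough illustration of the idea, the sketch below counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name `dipeptide_permille` and the toy sequences are hypothetical, one-letter codes are used instead of the three-letter codes of the table, and the normalisation (dividing by the total number of dipeptides counted within proteins) is an assumption; the exact procedure used by the database is not described on this page.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not the real proteome.
toy = ["MALLA", "LLK"]
freqs = dipeptide_permille(toy)
```

Here "LL" occurs twice among the six dipeptides of the toy set, so its frequency is one third of 1000‰.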

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.
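The comparison against the uniform expectation can be sketched as follows. The four sample values are copied from the table on this page; the helper `classify` is hypothetical, and the plain threshold comparison is an assumption, since the page does not state whether its colouring also takes the error estimate into account.

```python
ALPHABET_SIZE = 20  # standard amino acids, Xaa not counted

# Uniform expectation: 1/400 of all dipeptides, i.e. 2.5 per mille (0.25%).
expected_permille = 1000.0 / ALPHABET_SIZE ** 2


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values in per mille, taken from the table.
sample = {"LeuLeu": 10.915, "ProLeu": 8.816, "AlaTrp": 0.42, "MetMet": 0.0}
labels = {name: classify(value) for name, value in sample.items()}
```

Under this rule LeuLeu and ProLeu fall above the 2.5‰ expectation, while AlaTrp and MetMet fall below it.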

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.038 ± 1.274 | 1.679 ± 1.245 | 3.359 ± 1.318 | 3.359 ± 1.288 | 2.939 ± 1.476 | 3.778 ± 1.479 | 2.099 ± 0.79 | 2.939 ± 0.828 | 3.778 ± 0.981 | 5.458 ± 1.956 | 1.679 ± 0.576 | 1.679 ± 0.832 | 5.877 ± 1.426 | 2.519 ± 1.091 | 3.778 ± 1.147 | 4.198 ± 1.842 | 5.877 ± 1.854 | 2.939 ± 1.09 | 0.42 ± 0.39 | 2.519 ± 0.415 | 0.0 ± 0.0
Cys | 2.519 ± 1.103 | 1.259 ± 1.059 | 0.84 ± 0.472 | 1.679 ± 0.875 | 1.679 ± 1.217 | 1.259 ± 0.771 | 0.42 ± 0.585 | 0.84 ± 0.656 | 2.939 ± 1.12 | 3.778 ± 2.284 | 0.42 ± 0.337 | 0.42 ± 0.328 | 2.519 ± 0.961 | 1.259 ± 0.894 | 0.42 ± 0.328 | 1.259 ± 0.908 | 2.099 ± 0.994 | 2.519 ± 0.803 | 1.259 ± 0.588 | 0.84 ± 0.874 | 0.0 ± 0.0
Asp | 2.939 ± 0.799 | 2.939 ± 1.219 | 3.359 ± 1.724 | 3.778 ± 1.22 | 2.099 ± 0.693 | 2.939 ± 1.488 | 1.259 ± 0.569 | 4.198 ± 1.971 | 2.519 ± 1.417 | 6.297 ± 1.659 | 0.42 ± 0.337 | 1.259 ± 0.495 | 4.618 ± 1.52 | 2.099 ± 0.407 | 1.259 ± 0.797 | 3.359 ± 1.126 | 5.877 ± 1.085 | 5.458 ± 2.323 | 1.259 ± 0.632 | 1.679 ± 0.832 | 0.0 ± 0.0
Glu | 4.198 ± 0.969 | 0.84 ± 0.416 | 7.557 ± 2.079 | 2.519 ± 1.127 | 1.679 ± 0.944 | 3.778 ± 1.389 | 1.259 ± 0.693 | 1.259 ± 1.234 | 2.099 ± 0.68 | 5.458 ± 1.786 | 1.259 ± 0.856 | 2.099 ± 0.806 | 3.778 ± 0.976 | 3.359 ± 1.292 | 2.099 ± 1.084 | 2.519 ± 0.78 | 2.519 ± 0.937 | 3.778 ± 0.914 | 0.84 ± 0.472 | 1.679 ± 1.131 | 0.0 ± 0.0
Phe | 2.099 ± 1.28 | 0.42 ± 0.328 | 1.679 ± 1.047 | 0.84 ± 0.656 | 1.259 ± 0.396 | 2.099 ± 0.931 | 0.84 ± 1.05 | 2.519 ± 0.738 | 3.778 ± 1.846 | 5.877 ± 2.219 | 0.42 ± 0.39 | 2.099 ± 0.5 | 0.84 ± 0.386 | 1.259 ± 0.398 | 1.259 ± 0.66 | 1.259 ± 0.396 | 1.679 ± 1.062 | 1.259 ± 0.64 | 1.259 ± 0.638 | 2.939 ± 1.133 | 0.0 ± 0.0
Gly | 2.939 ± 1.117 | 0.84 ± 0.416 | 5.877 ± 1.081 | 3.778 ± 1.52 | 0.84 ± 0.378 | 5.877 ± 2.342 | 1.679 ± 0.934 | 3.359 ± 1.215 | 4.198 ± 1.335 | 4.618 ± 1.306 | 0.84 ± 0.386 | 3.778 ± 1.037 | 3.359 ± 1.412 | 2.939 ± 1.214 | 3.359 ± 1.503 | 5.877 ± 0.97 | 5.458 ± 1.978 | 4.618 ± 1.008 | 0.84 ± 0.472 | 2.099 ± 0.407 | 0.0 ± 0.0
His | 2.519 ± 0.585 | 0.42 ± 0.661 | 2.099 ± 1.178 | 0.42 ± 0.39 | 0.84 ± 0.378 | 1.259 ± 0.454 | 0.84 ± 0.617 | 2.099 ± 1.168 | 0.84 ± 0.472 | 2.099 ± 0.892 | 0.42 ± 0.328 | 1.679 ± 0.926 | 2.519 ± 1.635 | 1.259 ± 0.714 | 0.84 ± 0.781 | 3.359 ± 0.715 | 0.84 ± 0.515 | 3.359 ± 0.948 | 0.84 ± 0.416 | 0.84 ± 0.656 | 0.0 ± 0.0
Ile | 2.099 ± 0.777 | 2.099 ± 0.68 | 2.099 ± 0.902 | 2.939 ± 0.813 | 0.84 ± 0.386 | 2.939 ± 0.965 | 1.679 ± 0.592 | 1.679 ± 0.538 | 0.42 ± 0.337 | 2.939 ± 1.295 | 0.0 ± 0.0 | 0.42 ± 0.406 | 3.359 ± 1.417 | 1.679 ± 0.772 | 1.679 ± 1.112 | 2.939 ± 1.048 | 2.519 ± 0.756 | 5.458 ± 1.495 | 0.0 ± 0.0 | 1.679 ± 0.817 | 0.0 ± 0.0
Lys | 3.359 ± 1.097 | 2.519 ± 1.409 | 1.679 ± 1.201 | 2.939 ± 1.147 | 2.939 ± 1.198 | 3.778 ± 1.016 | 1.679 ± 0.944 | 2.099 ± 1.203 | 2.099 ± 0.769 | 2.099 ± 1.036 | 0.84 ± 0.665 | 1.259 ± 0.984 | 1.259 ± 0.849 | 2.099 ± 1.163 | 5.038 ± 1.376 | 2.939 ± 1.501 | 2.519 ± 1.383 | 3.778 ± 1.093 | 0.42 ± 0.406 | 2.519 ± 0.545 | 0.0 ± 0.0
Leu | 4.198 ± 1.482 | 4.618 ± 2.695 | 4.198 ± 0.675 | 6.297 ± 2.713 | 3.359 ± 1.875 | 5.038 ± 1.289 | 5.458 ± 1.315 | 2.939 ± 1.476 | 5.038 ± 1.35 | 10.915 ± 5.113 | 1.679 ± 1.078 | 2.099 ± 0.656 | 2.939 ± 1.37 | 6.717 ± 1.388 | 5.458 ± 0.815 | 5.458 ± 1.54 | 4.198 ± 1.203 | 4.618 ± 0.945 | 1.259 ± 0.495 | 3.359 ± 0.954 | 0.0 ± 0.0
Met | 1.259 ± 0.64 | 0.42 ± 0.328 | 2.519 ± 0.783 | 0.84 ± 0.724 | 0.84 ± 0.386 | 1.259 ± 0.644 | 0.42 ± 0.39 | 0.42 ± 0.585 | 0.42 ± 0.328 | 1.259 ± 0.638 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.84 ± 0.656 | 1.259 ± 0.396 | 2.099 ± 0.881 | 1.679 ± 0.338 | 1.679 ± 0.781 | 0.42 ± 0.406 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 2.099 ± 0.856 | 0.84 ± 0.708 | 0.42 ± 0.337 | 1.679 ± 0.6 | 0.84 ± 0.675 | 2.099 ± 0.5 | 0.42 ± 0.406 | 0.84 ± 0.556 | 2.939 ± 1.565 | 1.679 ± 1.158 | 0.42 ± 0.337 | 1.259 ± 0.64 | 2.519 ± 0.649 | 0.42 ± 0.337 | 2.099 ± 0.743 | 2.519 ± 1.127 | 4.618 ± 2.056 | 2.939 ± 1.934 | 0.84 ± 0.656 | 0.42 ± 0.525 | 0.0 ± 0.0
Pro | 4.618 ± 3.049 | 0.84 ± 0.682 | 3.778 ± 1.495 | 2.099 ± 0.813 | 0.84 ± 0.656 | 3.778 ± 0.757 | 0.0 ± 0.0 | 1.679 ± 0.702 | 3.359 ± 1.201 | 8.816 ± 1.941 | 0.84 ± 0.378 | 2.519 ± 1.043 | 8.396 ± 2.266 | 2.099 ± 0.557 | 1.679 ± 0.808 | 6.717 ± 2.888 | 4.618 ± 1.497 | 6.297 ± 1.704 | 0.42 ± 0.661 | 2.099 ± 0.845 | 0.0 ± 0.0
Gln | 3.359 ± 1.82 | 1.679 ± 1.01 | 2.939 ± 1.213 | 2.939 ± 0.91 | 2.939 ± 0.613 | 2.939 ± 1.059 | 0.84 ± 0.556 | 0.42 ± 0.406 | 1.259 ± 0.646 | 4.618 ± 0.984 | 1.679 ± 0.6 | 1.679 ± 1.051 | 2.939 ± 0.7 | 3.778 ± 2.271 | 3.359 ± 1.595 | 0.0 ± 0.0 | 2.939 ± 1.106 | 2.099 ± 0.5 | 1.259 ± 0.746 | 0.84 ± 0.386 | 0.0 ± 0.0
Arg | 6.717 ± 1.469 | 2.519 ± 1.433 | 0.84 ± 0.623 | 1.679 ± 0.794 | 3.778 ± 1.129 | 2.519 ± 0.995 | 2.099 ± 0.705 | 1.259 ± 0.856 | 3.778 ± 1.025 | 5.038 ± 0.932 | 0.42 ± 0.446 | 0.0 ± 0.0 | 5.038 ± 1.792 | 2.099 ± 0.79 | 4.618 ± 1.514 | 2.939 ± 0.723 | 4.198 ± 1.55 | 2.939 ± 0.788 | 1.259 ± 0.61 | 1.259 ± 0.754 | 0.0 ± 0.0
Ser | 5.458 ± 1.362 | 0.42 ± 0.328 | 3.359 ± 0.615 | 3.778 ± 0.897 | 0.84 ± 0.386 | 6.297 ± 2.026 | 1.679 ± 0.706 | 5.038 ± 0.943 | 2.099 ± 1.04 | 5.458 ± 1.209 | 2.099 ± 0.943 | 3.359 ± 1.771 | 3.359 ± 1.12 | 2.099 ± 1.299 | 5.038 ± 0.936 | 5.877 ± 2.032 | 5.458 ± 0.506 | 4.198 ± 1.194 | 0.0 ± 0.0 | 2.939 ± 1.07 | 0.0 ± 0.0
Thr | 3.778 ± 1.395 | 2.519 ± 0.846 | 3.359 ± 0.976 | 4.198 ± 1.51 | 2.939 ± 1.059 | 5.458 ± 1.177 | 2.519 ± 1.537 | 2.099 ± 1.202 | 1.679 ± 0.789 | 5.038 ± 1.627 | 1.679 ± 0.93 | 2.519 ± 0.66 | 5.458 ± 3.206 | 2.939 ± 0.91 | 2.519 ± 0.888 | 5.877 ± 2.3 | 3.359 ± 1.664 | 7.976 ± 2.233 | 1.259 ± 0.817 | 2.099 ± 0.677 | 0.0 ± 0.0
Val | 4.198 ± 1.849 | 2.519 ± 1.491 | 5.458 ± 0.657 | 5.877 ± 1.405 | 2.939 ± 0.977 | 3.778 ± 1.267 | 2.939 ± 0.914 | 1.679 ± 0.513 | 2.519 ± 0.844 | 3.778 ± 1.482 | 1.679 ± 0.876 | 2.519 ± 0.511 | 4.618 ± 1.768 | 2.939 ± 1.476 | 5.038 ± 0.866 | 7.137 ± 1.235 | 5.877 ± 2.331 | 5.458 ± 1.366 | 0.84 ± 0.416 | 4.198 ± 1.517 | 0.0 ± 0.0
Trp | 0.84 ± 0.386 | 0.0 ± 0.0 | 1.259 ± 0.629 | 0.84 ± 0.416 | 0.42 ± 0.328 | 2.099 ± 0.746 | 0.0 ± 0.0 | 0.84 ± 0.656 | 0.84 ± 0.472 | 0.84 ± 0.386 | 0.0 ± 0.0 | 0.42 ± 0.337 | 0.42 ± 0.39 | 0.0 ± 0.0 | 2.519 ± 0.817 | 0.84 ± 0.656 | 1.679 ± 1.157 | 1.259 ± 0.716 | 0.0 ± 0.0 | 0.42 ± 0.328 | 0.0 ± 0.0
Tyr | 1.679 ± 0.695 | 0.84 ± 1.05 | 2.939 ± 0.661 | 2.099 ± 0.76 | 0.84 ± 0.378 | 4.198 ± 0.805 | 1.259 ± 0.707 | 1.259 ± 0.647 | 1.259 ± 0.398 | 3.778 ± 1.267 | 0.42 ± 0.328 | 0.84 ± 0.62 | 1.679 ± 1.097 | 2.099 ± 0.587 | 2.099 ± 1.163 | 1.679 ± 1.266 | 1.259 ± 0.495 | 3.359 ± 1.378 | 0.84 ± 0.386 | 1.679 ± 0.832 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 8 proteins (2383 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
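One way such a protein-level bootstrap could work is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resampled set, and the spread of the 100 estimates is taken as the error. This is only an illustration under stated assumptions. The function names, the toy sequences, the fixed seed and the use of the sample standard deviation are all hypothetical; the page gives only the number of rounds and the resampling level.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resampled, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not the real proteome.
toy = ["MALLA", "LLK", "GLLLP"]
err = bootstrap_error(toy, "LL")
```

With only 8 proteins in this proteome, resampling at the protein level would be expected to give fairly large errors relative to the values, which is consistent with the magnitudes shown in the table.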

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski