Amino acid dipepetide frequency for Human papillomavirus type 96

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.648AlaAla: 2.648 ± 1.02
2.648AlaCys: 2.648 ± 1.257
4.16AlaAsp: 4.16 ± 0.853
2.648AlaGlu: 2.648 ± 1.106
3.026AlaPhe: 3.026 ± 1.085
3.404AlaGly: 3.404 ± 2.282
0.756AlaHis: 0.756 ± 0.659
0.756AlaIle: 0.756 ± 0.396
3.026AlaLys: 3.026 ± 1.301
3.782AlaLeu: 3.782 ± 1.075
0.756AlaMet: 0.756 ± 0.405
2.269AlaAsn: 2.269 ± 1.335
3.404AlaPro: 3.404 ± 1.034
2.269AlaGln: 2.269 ± 1.153
6.051AlaArg: 6.051 ± 1.901
3.782AlaSer: 3.782 ± 1.246
3.782AlaThr: 3.782 ± 1.04
2.269AlaVal: 2.269 ± 0.615
0.378AlaTrp: 0.378 ± 0.33
2.648AlaTyr: 2.648 ± 0.872
0.0AlaXaa: 0.0 ± 0.0
Cys
0.756CysAla: 0.756 ± 0.444
1.135CysCys: 1.135 ± 0.762
0.756CysAsp: 0.756 ± 0.9
0.378CysGlu: 0.378 ± 0.33
1.513CysPhe: 1.513 ± 0.434
1.891CysGly: 1.891 ± 0.777
0.0CysHis: 0.0 ± 0.0
1.513CysIle: 1.513 ± 0.662
2.648CysLys: 2.648 ± 0.821
0.756CysLeu: 0.756 ± 0.462
0.378CysMet: 0.378 ± 0.33
0.378CysAsn: 0.378 ± 0.384
1.891CysPro: 1.891 ± 0.665
0.756CysGln: 0.756 ± 0.769
2.648CysArg: 2.648 ± 0.894
1.513CysSer: 1.513 ± 1.318
0.0CysThr: 0.0 ± 0.0
1.135CysVal: 1.135 ± 0.646
1.135CysTrp: 1.135 ± 0.452
0.756CysTyr: 0.756 ± 0.431
0.0CysXaa: 0.0 ± 0.0
Asp
3.026AspAla: 3.026 ± 0.733
1.513AspCys: 1.513 ± 1.318
2.269AspAsp: 2.269 ± 1.314
2.269AspGlu: 2.269 ± 0.863
1.135AspPhe: 1.135 ± 0.651
3.782AspGly: 3.782 ± 1.354
0.378AspHis: 0.378 ± 0.336
3.782AspIle: 3.782 ± 0.83
1.891AspLys: 1.891 ± 1.107
8.699AspLeu: 8.699 ± 3.259
0.756AspMet: 0.756 ± 0.673
3.404AspAsn: 3.404 ± 1.212
3.404AspPro: 3.404 ± 0.751
4.539AspGln: 4.539 ± 1.232
1.513AspArg: 1.513 ± 0.938
3.782AspSer: 3.782 ± 0.724
4.539AspThr: 4.539 ± 1.299
4.16AspVal: 4.16 ± 1.705
1.513AspTrp: 1.513 ± 0.581
1.891AspTyr: 1.891 ± 0.531
0.0AspXaa: 0.0 ± 0.0
Glu
4.16GluAla: 4.16 ± 1.101
0.756GluCys: 0.756 ± 0.371
6.808GluAsp: 6.808 ± 1.112
6.43GluGlu: 6.43 ± 2.01
1.891GluPhe: 1.891 ± 0.578
5.295GluGly: 5.295 ± 1.807
2.269GluHis: 2.269 ± 0.397
3.782GluIle: 3.782 ± 2.002
3.026GluLys: 3.026 ± 0.922
5.673GluLeu: 5.673 ± 1.709
0.756GluMet: 0.756 ± 0.444
3.404GluAsn: 3.404 ± 0.964
6.808GluPro: 6.808 ± 2.265
3.782GluGln: 3.782 ± 1.076
1.891GluArg: 1.891 ± 0.513
2.269GluSer: 2.269 ± 0.702
3.782GluThr: 3.782 ± 1.978
6.051GluVal: 6.051 ± 0.916
0.378GluTrp: 0.378 ± 0.335
1.513GluTyr: 1.513 ± 0.976
0.0GluXaa: 0.0 ± 0.0
Phe
1.891PheAla: 1.891 ± 0.619
1.891PheCys: 1.891 ± 0.852
2.648PheAsp: 2.648 ± 1.049
4.16PheGlu: 4.16 ± 1.49
1.135PhePhe: 1.135 ± 0.354
3.026PheGly: 3.026 ± 0.56
0.0PheHis: 0.0 ± 0.0
2.269PheIle: 2.269 ± 0.851
3.026PheLys: 3.026 ± 1.084
4.16PheLeu: 4.16 ± 1.264
1.135PheMet: 1.135 ± 0.64
3.026PheAsn: 3.026 ± 0.699
1.135PhePro: 1.135 ± 0.64
2.648PheGln: 2.648 ± 0.828
1.135PheArg: 1.135 ± 0.667
1.513PheSer: 1.513 ± 0.791
0.0PheThr: 0.0 ± 0.0
1.891PheVal: 1.891 ± 0.488
1.135PheTrp: 1.135 ± 0.667
2.269PheTyr: 2.269 ± 1.046
0.0PheXaa: 0.0 ± 0.0
Gly
3.404GlyAla: 3.404 ± 1.827
2.269GlyCys: 2.269 ± 0.68
4.16GlyAsp: 4.16 ± 1.515
5.673GlyGlu: 5.673 ± 1.452
1.513GlyPhe: 1.513 ± 0.662
5.673GlyGly: 5.673 ± 1.374
2.269GlyHis: 2.269 ± 0.965
3.782GlyIle: 3.782 ± 0.854
4.16GlyLys: 4.16 ± 1.825
1.891GlyLeu: 1.891 ± 0.561
0.378GlyMet: 0.378 ± 0.336
3.026GlyAsn: 3.026 ± 1.089
3.404GlyPro: 3.404 ± 1.655
2.648GlyGln: 2.648 ± 1.397
7.186GlyArg: 7.186 ± 2.871
5.673GlySer: 5.673 ± 1.017
6.43GlyThr: 6.43 ± 1.414
3.026GlyVal: 3.026 ± 0.435
0.378GlyTrp: 0.378 ± 0.384
3.404GlyTyr: 3.404 ± 0.918
0.0GlyXaa: 0.0 ± 0.0
His
0.378HisAla: 0.378 ± 0.336
0.756HisCys: 0.756 ± 0.571
0.378HisAsp: 0.378 ± 0.475
1.135HisGlu: 1.135 ± 0.657
1.513HisPhe: 1.513 ± 0.391
1.135HisGly: 1.135 ± 0.94
0.378HisHis: 0.378 ± 0.33
0.378HisIle: 0.378 ± 0.33
1.891HisLys: 1.891 ± 0.991
0.756HisLeu: 0.756 ± 0.462
0.378HisMet: 0.378 ± 0.336
1.135HisAsn: 1.135 ± 0.354
1.513HisPro: 1.513 ± 0.768
0.0HisGln: 0.0 ± 0.0
0.378HisArg: 0.378 ± 0.475
1.513HisSer: 1.513 ± 0.656
1.135HisThr: 1.135 ± 0.511
0.378HisVal: 0.378 ± 0.336
1.513HisTrp: 1.513 ± 0.606
1.891HisTyr: 1.891 ± 0.684
0.0HisXaa: 0.0 ± 0.0
Ile
1.513IleAla: 1.513 ± 0.791
1.513IleCys: 1.513 ± 0.923
3.026IleAsp: 3.026 ± 0.56
3.026IleGlu: 3.026 ± 1.537
2.269IlePhe: 2.269 ± 0.616
2.269IleGly: 2.269 ± 0.729
1.135IleHis: 1.135 ± 0.354
2.269IleIle: 2.269 ± 1.643
1.513IleLys: 1.513 ± 0.744
3.782IleLeu: 3.782 ± 0.605
1.513IleMet: 1.513 ± 0.738
3.026IleAsn: 3.026 ± 0.574
4.16IlePro: 4.16 ± 1.319
2.269IleGln: 2.269 ± 0.904
1.513IleArg: 1.513 ± 0.718
3.782IleSer: 3.782 ± 1.285
1.135IleThr: 1.135 ± 0.668
3.404IleVal: 3.404 ± 1.009
1.135IleTrp: 1.135 ± 0.681
3.782IleTyr: 3.782 ± 0.907
0.0IleXaa: 0.0 ± 0.0
Lys
3.026LysAla: 3.026 ± 0.796
1.513LysCys: 1.513 ± 0.655
1.891LysAsp: 1.891 ± 1.107
3.404LysGlu: 3.404 ± 1.155
2.648LysPhe: 2.648 ± 1.103
3.026LysGly: 3.026 ± 1.096
1.135LysHis: 1.135 ± 0.617
2.269LysIle: 2.269 ± 1.274
2.648LysLys: 2.648 ± 1.026
5.673LysLeu: 5.673 ± 0.995
0.378LysMet: 0.378 ± 0.336
3.026LysAsn: 3.026 ± 1.42
1.513LysPro: 1.513 ± 1.479
1.891LysGln: 1.891 ± 0.83
4.917LysArg: 4.917 ± 0.753
2.269LysSer: 2.269 ± 1.027
4.16LysThr: 4.16 ± 0.958
3.404LysVal: 3.404 ± 1.079
0.756LysTrp: 0.756 ± 0.52
1.513LysTyr: 1.513 ± 0.454
0.0LysXaa: 0.0 ± 0.0
Leu
3.026LeuAla: 3.026 ± 0.823
2.648LeuCys: 2.648 ± 1.159
5.295LeuAsp: 5.295 ± 1.293
9.455LeuGlu: 9.455 ± 1.13
3.404LeuPhe: 3.404 ± 1.384
5.295LeuGly: 5.295 ± 1.383
2.648LeuHis: 2.648 ± 1.015
2.269LeuIle: 2.269 ± 1.032
4.539LeuLys: 4.539 ± 0.958
11.346LeuLeu: 11.346 ± 3.041
1.135LeuMet: 1.135 ± 0.663
1.513LeuAsn: 1.513 ± 0.819
3.782LeuPro: 3.782 ± 1.212
6.43LeuGln: 6.43 ± 1.011
6.43LeuArg: 6.43 ± 1.693
8.321LeuSer: 8.321 ± 2.357
5.295LeuThr: 5.295 ± 1.524
4.16LeuVal: 4.16 ± 1.509
0.756LeuTrp: 0.756 ± 0.405
1.891LeuTyr: 1.891 ± 0.633
0.0LeuXaa: 0.0 ± 0.0
Met
2.269MetAla: 2.269 ± 0.709
0.0MetCys: 0.0 ± 0.0
1.513MetAsp: 1.513 ± 0.805
1.513MetGlu: 1.513 ± 0.581
1.891MetPhe: 1.891 ± 1.04
0.378MetGly: 0.378 ± 0.336
0.0MetHis: 0.0 ± 0.0
0.756MetIle: 0.756 ± 0.571
0.756MetLys: 0.756 ± 0.538
2.269MetLeu: 2.269 ± 0.929
0.0MetMet: 0.0 ± 0.0
0.756MetAsn: 0.756 ± 0.405
0.0MetPro: 0.0 ± 0.0
0.378MetGln: 0.378 ± 0.384
0.756MetArg: 0.756 ± 0.371
1.891MetSer: 1.891 ± 1.04
0.0MetThr: 0.0 ± 0.0
1.135MetVal: 1.135 ± 0.617
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.648AsnAla: 2.648 ± 1.103
1.135AsnCys: 1.135 ± 0.47
1.891AsnAsp: 1.891 ± 1.021
1.891AsnGlu: 1.891 ± 1.273
2.648AsnPhe: 2.648 ± 1.019
3.782AsnGly: 3.782 ± 1.609
0.378AsnHis: 0.378 ± 0.336
3.404AsnIle: 3.404 ± 1.067
1.891AsnLys: 1.891 ± 0.677
2.648AsnLeu: 2.648 ± 0.976
0.378AsnMet: 0.378 ± 0.347
1.513AsnAsn: 1.513 ± 0.628
3.404AsnPro: 3.404 ± 1.395
1.891AsnGln: 1.891 ± 0.677
2.648AsnArg: 2.648 ± 1.182
1.891AsnSer: 1.891 ± 0.843
4.539AsnThr: 4.539 ± 2.24
1.135AsnVal: 1.135 ± 0.711
0.756AsnTrp: 0.756 ± 0.659
1.135AsnTyr: 1.135 ± 0.989
0.0AsnXaa: 0.0 ± 0.0
Pro
4.16ProAla: 4.16 ± 1.257
1.135ProCys: 1.135 ± 0.795
6.051ProAsp: 6.051 ± 1.984
5.295ProGlu: 5.295 ± 2.208
1.135ProPhe: 1.135 ± 0.667
2.648ProGly: 2.648 ± 1.295
0.378ProHis: 0.378 ± 0.475
1.891ProIle: 1.891 ± 0.558
4.16ProLys: 4.16 ± 1.46
4.917ProLeu: 4.917 ± 1.099
1.135ProMet: 1.135 ± 0.989
1.891ProAsn: 1.891 ± 0.722
6.808ProPro: 6.808 ± 1.355
1.891ProGln: 1.891 ± 1.064
2.269ProArg: 2.269 ± 1.188
5.295ProSer: 5.295 ± 1.593
5.295ProThr: 5.295 ± 2.966
4.917ProVal: 4.917 ± 2.141
0.378ProTrp: 0.378 ± 0.335
1.135ProTyr: 1.135 ± 0.582
0.0ProXaa: 0.0 ± 0.0
Gln
2.269GlnAla: 2.269 ± 0.54
1.135GlnCys: 1.135 ± 0.391
1.513GlnAsp: 1.513 ± 0.918
1.891GlnGlu: 1.891 ± 0.538
3.404GlnPhe: 3.404 ± 0.946
3.782GlnGly: 3.782 ± 0.805
0.378GlnHis: 0.378 ± 0.33
4.539GlnIle: 4.539 ± 1.135
1.513GlnLys: 1.513 ± 0.794
6.051GlnLeu: 6.051 ± 1.026
0.756GlnMet: 0.756 ± 0.519
1.891GlnAsn: 1.891 ± 0.688
1.513GlnPro: 1.513 ± 0.391
3.026GlnGln: 3.026 ± 1.35
3.026GlnArg: 3.026 ± 0.98
2.648GlnSer: 2.648 ± 0.911
1.135GlnThr: 1.135 ± 0.47
1.891GlnVal: 1.891 ± 1.098
0.756GlnTrp: 0.756 ± 0.444
1.513GlnTyr: 1.513 ± 0.663
0.0GlnXaa: 0.0 ± 0.0
Arg
6.43ArgAla: 6.43 ± 2.045
0.756ArgCys: 0.756 ± 0.462
4.16ArgAsp: 4.16 ± 1.057
3.404ArgGlu: 3.404 ± 1.299
3.026ArgPhe: 3.026 ± 0.882
7.943ArgGly: 7.943 ± 2.53
1.513ArgHis: 1.513 ± 0.628
1.135ArgIle: 1.135 ± 0.626
4.16ArgLys: 4.16 ± 0.916
6.051ArgLeu: 6.051 ± 0.845
1.891ArgMet: 1.891 ± 0.382
1.891ArgAsn: 1.891 ± 0.395
2.269ArgPro: 2.269 ± 0.477
1.513ArgGln: 1.513 ± 0.457
6.808ArgArg: 6.808 ± 2.651
8.699ArgSer: 8.699 ± 4.466
3.782ArgThr: 3.782 ± 0.741
4.16ArgVal: 4.16 ± 1.285
0.0ArgTrp: 0.0 ± 0.0
2.648ArgTyr: 2.648 ± 0.501
0.0ArgXaa: 0.0 ± 0.0
Ser
1.891SerAla: 1.891 ± 0.649
0.0SerCys: 0.0 ± 0.0
4.16SerAsp: 4.16 ± 1.549
4.539SerGlu: 4.539 ± 1.115
4.539SerPhe: 4.539 ± 1.069
6.43SerGly: 6.43 ± 1.912
0.756SerHis: 0.756 ± 0.444
3.026SerIle: 3.026 ± 0.746
2.269SerLys: 2.269 ± 0.903
7.564SerLeu: 7.564 ± 1.122
0.378SerMet: 0.378 ± 0.33
3.026SerAsn: 3.026 ± 1.81
4.917SerPro: 4.917 ± 1.1
1.135SerGln: 1.135 ± 0.626
9.455SerArg: 9.455 ± 3.289
4.917SerSer: 4.917 ± 2.588
5.295SerThr: 5.295 ± 1.787
5.295SerVal: 5.295 ± 0.862
1.135SerTrp: 1.135 ± 0.626
1.135SerTyr: 1.135 ± 0.433
0.0SerXaa: 0.0 ± 0.0
Thr
3.026ThrAla: 3.026 ± 0.662
0.756ThrCys: 0.756 ± 0.396
3.404ThrAsp: 3.404 ± 0.634
4.917ThrGlu: 4.917 ± 0.929
1.513ThrPhe: 1.513 ± 1.011
4.539ThrGly: 4.539 ± 1.65
0.756ThrHis: 0.756 ± 0.444
3.782ThrIle: 3.782 ± 1.844
1.891ThrLys: 1.891 ± 0.677
3.782ThrLeu: 3.782 ± 1.277
1.513ThrMet: 1.513 ± 0.958
1.513ThrAsn: 1.513 ± 0.566
7.564ThrPro: 7.564 ± 3.0
2.269ThrGln: 2.269 ± 1.384
4.16ThrArg: 4.16 ± 1.858
4.16ThrSer: 4.16 ± 1.313
3.404ThrThr: 3.404 ± 1.141
4.917ThrVal: 4.917 ± 1.564
0.378ThrTrp: 0.378 ± 0.335
1.891ThrTyr: 1.891 ± 0.325
0.0ThrXaa: 0.0 ± 0.0
Val
3.404ValAla: 3.404 ± 1.068
0.0ValCys: 0.0 ± 0.0
2.648ValAsp: 2.648 ± 0.459
6.051ValGlu: 6.051 ± 1.296
0.378ValPhe: 0.378 ± 0.347
3.782ValGly: 3.782 ± 1.193
1.891ValHis: 1.891 ± 1.025
3.404ValIle: 3.404 ± 1.308
2.648ValLys: 2.648 ± 0.493
3.782ValLeu: 3.782 ± 1.028
0.756ValMet: 0.756 ± 0.371
2.648ValAsn: 2.648 ± 1.221
4.16ValPro: 4.16 ± 0.799
3.404ValGln: 3.404 ± 0.833
5.295ValArg: 5.295 ± 0.85
6.051ValSer: 6.051 ± 1.156
4.16ValThr: 4.16 ± 1.188
3.404ValVal: 3.404 ± 1.148
0.378ValTrp: 0.378 ± 0.336
0.756ValTyr: 0.756 ± 0.673
0.0ValXaa: 0.0 ± 0.0
Trp
0.756TrpAla: 0.756 ± 0.405
0.0TrpCys: 0.0 ± 0.0
0.756TrpAsp: 0.756 ± 0.673
0.756TrpGlu: 0.756 ± 0.538
0.378TrpPhe: 0.378 ± 0.33
0.378TrpGly: 0.378 ± 0.336
0.0TrpHis: 0.0 ± 0.0
0.756TrpIle: 0.756 ± 0.659
1.513TrpLys: 1.513 ± 0.887
1.891TrpLeu: 1.891 ± 0.767
0.756TrpMet: 0.756 ± 0.52
0.756TrpAsn: 0.756 ± 0.371
0.0TrpPro: 0.0 ± 0.0
1.513TrpGln: 1.513 ± 0.757
0.0TrpArg: 0.0 ± 0.0
1.135TrpSer: 1.135 ± 0.626
0.378TrpThr: 0.378 ± 0.335
1.135TrpVal: 1.135 ± 0.617
0.0TrpTrp: 0.0 ± 0.0
0.756TrpTyr: 0.756 ± 0.659
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.782TyrAla: 3.782 ± 0.861
0.378TyrCys: 0.378 ± 0.384
0.756TyrAsp: 0.756 ± 0.659
1.891TyrGlu: 1.891 ± 0.728
1.135TyrPhe: 1.135 ± 0.391
1.891TyrGly: 1.891 ± 0.441
1.513TyrHis: 1.513 ± 0.722
2.648TyrIle: 2.648 ± 0.64
1.891TyrLys: 1.891 ± 1.014
4.16TyrLeu: 4.16 ± 0.996
0.756TyrMet: 0.756 ± 0.371
1.513TyrAsn: 1.513 ± 0.566
1.135TyrPro: 1.135 ± 0.354
0.378TyrGln: 0.378 ± 0.384
4.16TyrArg: 4.16 ± 0.514
0.756TyrSer: 0.756 ± 0.384
1.891TyrThr: 1.891 ± 0.814
1.135TyrVal: 1.135 ± 0.52
0.756TyrTrp: 0.756 ± 0.52
2.269TyrTyr: 2.269 ± 0.866
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2645 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski