Amino acid dipepetide frequency for Human papillomavirus type 156

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.994AlaAla: 3.994 ± 1.59
0.799AlaCys: 0.799 ± 0.626
3.994AlaAsp: 3.994 ± 0.781
3.994AlaGlu: 3.994 ± 1.218
1.198AlaPhe: 1.198 ± 0.36
2.796AlaGly: 2.796 ± 0.776
1.597AlaHis: 1.597 ± 0.662
3.195AlaIle: 3.195 ± 0.726
4.792AlaLys: 4.792 ± 1.49
3.994AlaLeu: 3.994 ± 0.841
0.0AlaMet: 0.0 ± 0.0
2.396AlaAsn: 2.396 ± 0.998
1.997AlaPro: 1.997 ± 0.89
1.997AlaGln: 1.997 ± 0.647
1.198AlaArg: 1.198 ± 0.446
1.997AlaSer: 1.997 ± 0.649
2.396AlaThr: 2.396 ± 1.114
2.796AlaVal: 2.796 ± 0.65
0.0AlaTrp: 0.0 ± 0.0
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.399CysAla: 0.399 ± 0.313
1.597CysCys: 1.597 ± 1.634
0.399CysAsp: 0.399 ± 0.43
1.198CysGlu: 1.198 ± 0.41
0.799CysPhe: 0.799 ± 0.404
0.399CysGly: 0.399 ± 0.595
0.0CysHis: 0.0 ± 0.0
1.597CysIle: 1.597 ± 0.644
2.796CysLys: 2.796 ± 1.249
2.396CysLeu: 2.396 ± 1.485
0.799CysMet: 0.799 ± 0.351
1.597CysAsn: 1.597 ± 0.916
1.597CysPro: 1.597 ± 0.886
0.799CysGln: 0.799 ± 0.511
1.997CysArg: 1.997 ± 1.342
2.396CysSer: 2.396 ± 1.327
1.997CysThr: 1.997 ± 0.598
0.799CysVal: 0.799 ± 0.822
0.799CysTrp: 0.799 ± 0.455
1.198CysTyr: 1.198 ± 0.935
0.0CysXaa: 0.0 ± 0.0
Asp
3.195AspAla: 3.195 ± 1.578
1.597AspCys: 1.597 ± 0.569
3.594AspAsp: 3.594 ± 1.18
5.192AspGlu: 5.192 ± 0.932
3.994AspPhe: 3.994 ± 1.566
2.396AspGly: 2.396 ± 1.212
1.597AspHis: 1.597 ± 0.91
3.994AspIle: 3.994 ± 1.343
1.198AspLys: 1.198 ± 0.41
4.792AspLeu: 4.792 ± 0.868
1.198AspMet: 1.198 ± 0.745
3.195AspAsn: 3.195 ± 0.757
6.39AspPro: 6.39 ± 2.641
1.198AspGln: 1.198 ± 0.492
1.997AspArg: 1.997 ± 0.735
7.588AspSer: 7.588 ± 1.538
3.594AspThr: 3.594 ± 0.899
7.588AspVal: 7.588 ± 1.85
0.799AspTrp: 0.799 ± 0.626
1.597AspTyr: 1.597 ± 0.481
0.0AspXaa: 0.0 ± 0.0
Glu
3.594GluAla: 3.594 ± 1.77
0.799GluCys: 0.799 ± 0.626
3.994GluAsp: 3.994 ± 0.821
8.387GluGlu: 8.387 ± 2.778
2.796GluPhe: 2.796 ± 0.791
1.997GluGly: 1.997 ± 1.352
0.799GluHis: 0.799 ± 0.671
4.792GluIle: 4.792 ± 1.723
3.594GluLys: 3.594 ± 1.443
6.789GluLeu: 6.789 ± 2.299
0.799GluMet: 0.799 ± 0.346
4.393GluAsn: 4.393 ± 1.132
1.198GluPro: 1.198 ± 0.6
2.396GluGln: 2.396 ± 1.39
2.396GluArg: 2.396 ± 1.09
3.994GluSer: 3.994 ± 1.125
3.195GluThr: 3.195 ± 0.597
2.796GluVal: 2.796 ± 0.636
1.597GluTrp: 1.597 ± 0.736
1.997GluTyr: 1.997 ± 1.047
0.0GluXaa: 0.0 ± 0.0
Phe
2.796PheAla: 2.796 ± 0.672
1.597PheCys: 1.597 ± 0.644
2.796PheAsp: 2.796 ± 1.071
4.792PheGlu: 4.792 ± 1.36
2.796PhePhe: 2.796 ± 1.486
3.994PheGly: 3.994 ± 0.782
1.997PheHis: 1.997 ± 0.998
2.396PheIle: 2.396 ± 0.899
3.594PheLys: 3.594 ± 1.803
6.789PheLeu: 6.789 ± 1.564
0.399PheMet: 0.399 ± 0.313
2.396PheAsn: 2.396 ± 0.938
0.799PhePro: 0.799 ± 0.455
1.997PheGln: 1.997 ± 0.647
1.997PheArg: 1.997 ± 0.81
2.396PheSer: 2.396 ± 0.528
0.399PheThr: 0.399 ± 0.313
3.195PheVal: 3.195 ± 0.813
0.799PheTrp: 0.799 ± 0.404
2.396PheTyr: 2.396 ± 1.158
0.0PheXaa: 0.0 ± 0.0
Gly
0.799GlyAla: 0.799 ± 0.404
1.997GlyCys: 1.997 ± 0.815
5.192GlyAsp: 5.192 ± 1.136
2.396GlyGlu: 2.396 ± 0.892
1.597GlyPhe: 1.597 ± 0.299
2.396GlyGly: 2.396 ± 0.892
1.597GlyHis: 1.597 ± 0.741
3.594GlyIle: 3.594 ± 0.59
3.594GlyLys: 3.594 ± 0.777
3.195GlyLeu: 3.195 ± 0.954
0.0GlyMet: 0.0 ± 0.0
3.594GlyAsn: 3.594 ± 0.596
1.198GlyPro: 1.198 ± 0.668
0.799GlyGln: 0.799 ± 0.573
5.192GlyArg: 5.192 ± 1.869
6.39GlySer: 6.39 ± 2.41
4.393GlyThr: 4.393 ± 1.238
3.195GlyVal: 3.195 ± 0.956
0.0GlyTrp: 0.0 ± 0.0
0.799GlyTyr: 0.799 ± 0.404
0.0GlyXaa: 0.0 ± 0.0
His
0.799HisAla: 0.799 ± 0.62
0.799HisCys: 0.799 ± 0.61
1.198HisAsp: 1.198 ± 0.563
0.399HisGlu: 0.399 ± 0.313
1.597HisPhe: 1.597 ± 0.905
0.799HisGly: 0.799 ± 0.693
1.198HisHis: 1.198 ± 1.09
1.597HisIle: 1.597 ± 0.561
0.799HisLys: 0.799 ± 0.346
1.198HisLeu: 1.198 ± 0.797
0.0HisMet: 0.0 ± 0.0
0.799HisAsn: 0.799 ± 0.455
1.997HisPro: 1.997 ± 1.044
1.198HisGln: 1.198 ± 0.925
1.198HisArg: 1.198 ± 0.8
0.399HisSer: 0.399 ± 0.595
0.799HisThr: 0.799 ± 0.436
0.799HisVal: 0.799 ± 0.534
0.399HisTrp: 0.399 ± 0.474
1.198HisTyr: 1.198 ± 0.41
0.0HisXaa: 0.0 ± 0.0
Ile
1.997IleAla: 1.997 ± 0.733
0.799IleCys: 0.799 ± 0.511
3.994IleAsp: 3.994 ± 1.187
3.594IleGlu: 3.594 ± 0.741
3.594IlePhe: 3.594 ± 1.068
3.195IleGly: 3.195 ± 1.261
0.399IleHis: 0.399 ± 0.331
3.195IleIle: 3.195 ± 1.377
2.796IleLys: 2.796 ± 0.582
3.594IleLeu: 3.594 ± 2.187
1.198IleMet: 1.198 ± 0.738
2.796IleAsn: 2.796 ± 1.241
4.792IlePro: 4.792 ± 1.498
3.195IleGln: 3.195 ± 1.097
2.396IleArg: 2.396 ± 0.899
4.393IleSer: 4.393 ± 1.264
2.396IleThr: 2.396 ± 1.158
3.594IleVal: 3.594 ± 0.531
0.799IleTrp: 0.799 ± 0.511
1.997IleTyr: 1.997 ± 0.691
0.0IleXaa: 0.0 ± 0.0
Lys
1.198LysAla: 1.198 ± 0.41
1.997LysCys: 1.997 ± 0.419
3.195LysAsp: 3.195 ± 1.398
3.994LysGlu: 3.994 ± 1.394
2.396LysPhe: 2.396 ± 0.704
3.594LysGly: 3.594 ± 1.362
1.198LysHis: 1.198 ± 0.577
0.799LysIle: 0.799 ± 0.436
3.994LysLys: 3.994 ± 1.086
5.99LysLeu: 5.99 ± 2.608
0.799LysMet: 0.799 ± 0.514
2.796LysAsn: 2.796 ± 1.532
1.997LysPro: 1.997 ± 1.292
2.396LysGln: 2.396 ± 1.078
5.591LysArg: 5.591 ± 1.408
4.393LysSer: 4.393 ± 0.801
1.198LysThr: 1.198 ± 0.428
5.192LysVal: 5.192 ± 0.952
0.399LysTrp: 0.399 ± 0.43
3.195LysTyr: 3.195 ± 1.208
0.0LysXaa: 0.0 ± 0.0
Leu
3.994LeuAla: 3.994 ± 0.767
2.396LeuCys: 2.396 ± 1.01
5.99LeuAsp: 5.99 ± 1.58
5.591LeuGlu: 5.591 ± 1.487
3.195LeuPhe: 3.195 ± 1.323
4.792LeuGly: 4.792 ± 1.783
1.997LeuHis: 1.997 ± 1.051
3.994LeuIle: 3.994 ± 1.52
7.188LeuLys: 7.188 ± 1.466
9.984LeuLeu: 9.984 ± 3.112
2.396LeuMet: 2.396 ± 0.691
2.796LeuAsn: 2.796 ± 0.924
5.591LeuPro: 5.591 ± 1.357
5.591LeuGln: 5.591 ± 1.51
2.796LeuArg: 2.796 ± 1.263
7.188LeuSer: 7.188 ± 1.611
6.789LeuThr: 6.789 ± 2.17
4.393LeuVal: 4.393 ± 1.53
1.198LeuTrp: 1.198 ± 0.991
4.393LeuTyr: 4.393 ± 0.825
0.0LeuXaa: 0.0 ± 0.0
Met
0.399MetAla: 0.399 ± 0.474
0.399MetCys: 0.399 ± 0.33
0.799MetAsp: 0.799 ± 0.514
0.799MetGlu: 0.799 ± 0.626
1.597MetPhe: 1.597 ± 0.606
0.799MetGly: 0.799 ± 0.346
0.399MetHis: 0.399 ± 0.43
1.597MetIle: 1.597 ± 0.898
0.399MetLys: 0.399 ± 0.437
1.597MetLeu: 1.597 ± 1.252
0.399MetMet: 0.399 ± 0.313
1.597MetAsn: 1.597 ± 0.299
0.399MetPro: 0.399 ± 0.437
0.799MetGln: 0.799 ± 0.514
0.799MetArg: 0.799 ± 0.626
1.198MetSer: 1.198 ± 0.36
1.597MetThr: 1.597 ± 0.924
1.198MetVal: 1.198 ± 0.55
0.0MetTrp: 0.0 ± 0.0
0.799MetTyr: 0.799 ± 0.573
0.0MetXaa: 0.0 ± 0.0
Asn
2.796AsnAla: 2.796 ± 1.062
1.997AsnCys: 1.997 ± 0.976
1.997AsnAsp: 1.997 ± 0.324
3.594AsnGlu: 3.594 ± 1.153
1.997AsnPhe: 1.997 ± 0.462
2.796AsnGly: 2.796 ± 0.816
0.399AsnHis: 0.399 ± 0.474
1.597AsnIle: 1.597 ± 0.963
3.195AsnLys: 3.195 ± 0.725
2.796AsnLeu: 2.796 ± 1.177
2.396AsnMet: 2.396 ± 0.874
1.997AsnAsn: 1.997 ± 1.054
5.192AsnPro: 5.192 ± 1.301
2.396AsnGln: 2.396 ± 1.144
2.396AsnArg: 2.396 ± 0.739
3.195AsnSer: 3.195 ± 1.395
5.591AsnThr: 5.591 ± 0.909
2.396AsnVal: 2.396 ± 0.84
0.799AsnTrp: 0.799 ± 0.86
1.198AsnTyr: 1.198 ± 0.643
0.0AsnXaa: 0.0 ± 0.0
Pro
3.195ProAla: 3.195 ± 1.083
0.399ProCys: 0.399 ± 0.33
6.789ProAsp: 6.789 ± 1.871
3.195ProGlu: 3.195 ± 0.976
1.997ProPhe: 1.997 ± 0.889
1.597ProGly: 1.597 ± 0.885
0.0ProHis: 0.0 ± 0.0
3.195ProIle: 3.195 ± 1.877
2.396ProLys: 2.396 ± 0.719
5.99ProLeu: 5.99 ± 1.676
0.399ProMet: 0.399 ± 0.313
3.195ProAsn: 3.195 ± 0.8
5.99ProPro: 5.99 ± 2.08
3.994ProGln: 3.994 ± 1.018
3.195ProArg: 3.195 ± 0.847
3.994ProSer: 3.994 ± 2.101
4.393ProThr: 4.393 ± 1.817
2.396ProVal: 2.396 ± 0.813
0.399ProTrp: 0.399 ± 0.33
3.195ProTyr: 3.195 ± 1.083
0.0ProXaa: 0.0 ± 0.0
Gln
1.597GlnAla: 1.597 ± 0.299
1.997GlnCys: 1.997 ± 0.985
3.594GlnAsp: 3.594 ± 1.259
2.796GlnGlu: 2.796 ± 1.233
2.796GlnPhe: 2.796 ± 0.674
2.796GlnGly: 2.796 ± 1.026
0.0GlnHis: 0.0 ± 0.0
2.396GlnIle: 2.396 ± 0.704
1.997GlnLys: 1.997 ± 1.027
3.195GlnLeu: 3.195 ± 1.082
2.396GlnMet: 2.396 ± 1.168
0.399GlnAsn: 0.399 ± 0.43
2.396GlnPro: 2.396 ± 0.831
3.594GlnGln: 3.594 ± 0.908
1.997GlnArg: 1.997 ± 1.699
1.997GlnSer: 1.997 ± 0.766
2.396GlnThr: 2.396 ± 0.826
2.396GlnVal: 2.396 ± 0.411
0.799GlnTrp: 0.799 ± 0.626
1.198GlnTyr: 1.198 ± 1.312
0.0GlnXaa: 0.0 ± 0.0
Arg
4.393ArgAla: 4.393 ± 1.239
1.597ArgCys: 1.597 ± 0.544
1.997ArgAsp: 1.997 ± 0.766
1.597ArgGlu: 1.597 ± 0.733
1.597ArgPhe: 1.597 ± 0.569
2.796ArgGly: 2.796 ± 1.388
2.396ArgHis: 2.396 ± 0.546
2.396ArgIle: 2.396 ± 0.477
3.594ArgLys: 3.594 ± 1.116
7.188ArgLeu: 7.188 ± 1.342
0.799ArgMet: 0.799 ± 0.474
3.195ArgAsn: 3.195 ± 0.911
5.591ArgPro: 5.591 ± 3.112
2.396ArgGln: 2.396 ± 0.786
5.192ArgArg: 5.192 ± 1.812
3.994ArgSer: 3.994 ± 1.177
2.396ArgThr: 2.396 ± 0.838
3.195ArgVal: 3.195 ± 1.419
0.0ArgTrp: 0.0 ± 0.0
1.597ArgTyr: 1.597 ± 0.736
0.0ArgXaa: 0.0 ± 0.0
Ser
4.393SerAla: 4.393 ± 0.836
1.597SerCys: 1.597 ± 1.221
3.994SerAsp: 3.994 ± 1.152
2.796SerGlu: 2.796 ± 1.177
4.792SerPhe: 4.792 ± 1.339
4.393SerGly: 4.393 ± 1.436
1.198SerHis: 1.198 ± 0.571
3.994SerIle: 3.994 ± 0.691
2.396SerLys: 2.396 ± 1.529
9.185SerLeu: 9.185 ± 2.098
0.799SerMet: 0.799 ± 0.455
5.192SerAsn: 5.192 ± 1.632
4.393SerPro: 4.393 ± 1.609
2.796SerGln: 2.796 ± 0.909
4.393SerArg: 4.393 ± 2.149
8.786SerSer: 8.786 ± 4.256
5.99SerThr: 5.99 ± 3.023
3.994SerVal: 3.994 ± 1.54
0.399SerTrp: 0.399 ± 0.313
2.796SerTyr: 2.796 ± 0.766
0.0SerXaa: 0.0 ± 0.0
Thr
2.396ThrAla: 2.396 ± 0.411
1.198ThrCys: 1.198 ± 0.782
2.396ThrAsp: 2.396 ± 0.721
2.396ThrGlu: 2.396 ± 0.911
3.594ThrPhe: 3.594 ± 1.279
4.393ThrGly: 4.393 ± 1.44
0.399ThrHis: 0.399 ± 0.331
3.994ThrIle: 3.994 ± 1.493
2.796ThrLys: 2.796 ± 1.741
4.393ThrLeu: 4.393 ± 1.338
0.799ThrMet: 0.799 ± 0.626
3.994ThrAsn: 3.994 ± 0.978
3.195ThrPro: 3.195 ± 1.325
1.597ThrGln: 1.597 ± 1.24
5.591ThrArg: 5.591 ± 1.46
3.994ThrSer: 3.994 ± 2.072
3.195ThrThr: 3.195 ± 1.165
7.588ThrVal: 7.588 ± 2.085
0.399ThrTrp: 0.399 ± 0.313
1.198ThrTyr: 1.198 ± 0.643
0.0ThrXaa: 0.0 ± 0.0
Val
2.396ValAla: 2.396 ± 1.285
0.799ValCys: 0.799 ± 0.62
7.588ValAsp: 7.588 ± 1.485
2.796ValGlu: 2.796 ± 0.892
4.792ValPhe: 4.792 ± 1.359
2.396ValGly: 2.396 ± 0.546
1.198ValHis: 1.198 ± 0.428
3.994ValIle: 3.994 ± 1.177
1.198ValLys: 1.198 ± 0.954
5.192ValLeu: 5.192 ± 0.871
0.799ValMet: 0.799 ± 0.661
3.594ValAsn: 3.594 ± 1.007
3.994ValPro: 3.994 ± 1.02
1.597ValGln: 1.597 ± 0.853
3.994ValArg: 3.994 ± 1.529
6.789ValSer: 6.789 ± 1.052
3.195ValThr: 3.195 ± 1.8
3.994ValVal: 3.994 ± 1.32
0.799ValTrp: 0.799 ± 0.436
1.597ValTyr: 1.597 ± 0.692
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.399TrpCys: 0.399 ± 0.313
0.799TrpAsp: 0.799 ± 0.661
0.799TrpGlu: 0.799 ± 0.661
0.399TrpPhe: 0.399 ± 0.43
0.399TrpGly: 0.399 ± 0.33
0.399TrpHis: 0.399 ± 0.43
0.799TrpIle: 0.799 ± 0.626
0.399TrpLys: 0.399 ± 0.313
0.799TrpLeu: 0.799 ± 0.404
0.399TrpMet: 0.399 ± 0.33
0.399TrpAsn: 0.399 ± 0.43
0.399TrpPro: 0.399 ± 0.33
0.399TrpGln: 0.399 ± 0.43
1.597TrpArg: 1.597 ± 0.494
0.399TrpSer: 0.399 ± 0.313
1.597TrpThr: 1.597 ± 0.88
0.799TrpVal: 0.799 ± 0.455
0.0TrpTrp: 0.0 ± 0.0
0.399TrpTyr: 0.399 ± 0.313
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.799TyrAla: 0.799 ± 0.404
0.799TyrCys: 0.799 ± 0.948
2.396TyrAsp: 2.396 ± 0.42
1.997TyrGlu: 1.997 ± 0.39
2.796TyrPhe: 2.796 ± 1.016
3.195TyrGly: 3.195 ± 0.559
0.399TyrHis: 0.399 ± 0.331
1.597TyrIle: 1.597 ± 1.028
3.594TyrLys: 3.594 ± 0.682
3.594TyrLeu: 3.594 ± 1.249
0.399TyrMet: 0.399 ± 0.313
0.399TyrAsn: 0.399 ± 0.331
0.799TyrPro: 0.799 ± 0.404
1.597TyrGln: 1.597 ± 0.561
1.997TyrArg: 1.997 ± 0.762
2.796TyrSer: 2.796 ± 0.933
1.997TyrThr: 1.997 ± 1.214
0.399TyrVal: 0.399 ± 0.313
1.198TyrTrp: 1.198 ± 0.643
1.997TyrTyr: 1.997 ± 0.868
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2505 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski