Amino acid dipepetide frequency for Human papillomavirus type 60

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.022AlaAla: 6.022 ± 1.866
0.401AlaCys: 0.401 ± 0.697
4.416AlaAsp: 4.416 ± 1.273
3.613AlaGlu: 3.613 ± 1.594
1.606AlaPhe: 1.606 ± 0.776
2.409AlaGly: 2.409 ± 0.605
0.401AlaHis: 0.401 ± 0.365
1.606AlaIle: 1.606 ± 0.297
2.81AlaLys: 2.81 ± 0.886
4.817AlaLeu: 4.817 ± 1.908
1.204AlaMet: 1.204 ± 0.354
1.606AlaAsn: 1.606 ± 0.683
2.81AlaPro: 2.81 ± 0.594
2.81AlaGln: 2.81 ± 0.591
3.613AlaArg: 3.613 ± 1.159
6.022AlaSer: 6.022 ± 1.438
4.014AlaThr: 4.014 ± 1.152
4.014AlaVal: 4.014 ± 0.802
0.0AlaTrp: 0.0 ± 0.0
2.409AlaTyr: 2.409 ± 0.96
0.0AlaXaa: 0.0 ± 0.0
Cys
0.803CysAla: 0.803 ± 0.639
1.606CysCys: 1.606 ± 0.898
2.007CysAsp: 2.007 ± 1.001
0.401CysGlu: 0.401 ± 0.319
1.204CysPhe: 1.204 ± 1.355
0.803CysGly: 0.803 ± 0.6
0.0CysHis: 0.0 ± 0.0
1.606CysIle: 1.606 ± 1.447
2.409CysLys: 2.409 ± 1.086
1.606CysLeu: 1.606 ± 1.45
1.204CysMet: 1.204 ± 0.873
0.803CysAsn: 0.803 ± 0.639
1.204CysPro: 1.204 ± 0.723
0.401CysGln: 0.401 ± 0.335
0.803CysArg: 0.803 ± 1.395
1.606CysSer: 1.606 ± 0.763
1.204CysThr: 1.204 ± 0.758
0.401CysVal: 0.401 ± 0.697
1.204CysTrp: 1.204 ± 0.449
0.401CysTyr: 0.401 ± 0.319
0.0CysXaa: 0.0 ± 0.0
Asp
4.416AspAla: 4.416 ± 0.921
2.81AspCys: 2.81 ± 1.568
2.81AspAsp: 2.81 ± 0.832
4.014AspGlu: 4.014 ± 1.035
3.613AspPhe: 3.613 ± 1.641
2.409AspGly: 2.409 ± 0.787
0.401AspHis: 0.401 ± 0.319
6.022AspIle: 6.022 ± 1.8
1.204AspLys: 1.204 ± 1.096
4.817AspLeu: 4.817 ± 0.63
0.401AspMet: 0.401 ± 0.319
3.613AspAsn: 3.613 ± 1.254
4.817AspPro: 4.817 ± 1.072
0.401AspGln: 0.401 ± 0.335
4.416AspArg: 4.416 ± 0.72
4.416AspSer: 4.416 ± 0.967
3.613AspThr: 3.613 ± 0.807
4.817AspVal: 4.817 ± 1.024
0.803AspTrp: 0.803 ± 0.4
1.606AspTyr: 1.606 ± 1.043
0.0AspXaa: 0.0 ± 0.0
Glu
3.613GluAla: 3.613 ± 1.683
0.401GluCys: 0.401 ± 0.319
4.817GluAsp: 4.817 ± 1.478
6.423GluGlu: 6.423 ± 2.232
1.204GluPhe: 1.204 ± 0.943
4.416GluGly: 4.416 ± 1.217
1.204GluHis: 1.204 ± 0.656
4.014GluIle: 4.014 ± 1.396
1.606GluLys: 1.606 ± 1.447
6.022GluLeu: 6.022 ± 1.224
1.204GluMet: 1.204 ± 0.449
4.416GluAsn: 4.416 ± 0.7
0.803GluPro: 0.803 ± 0.67
4.416GluGln: 4.416 ± 1.452
2.409GluArg: 2.409 ± 1.388
5.62GluSer: 5.62 ± 1.942
2.007GluThr: 2.007 ± 1.247
5.219GluVal: 5.219 ± 1.37
1.204GluTrp: 1.204 ± 0.591
2.409GluTyr: 2.409 ± 0.784
0.0GluXaa: 0.0 ± 0.0
Phe
3.212PheAla: 3.212 ± 1.159
0.401PheCys: 0.401 ± 0.697
2.81PheAsp: 2.81 ± 1.445
3.212PheGlu: 3.212 ± 1.329
2.81PhePhe: 2.81 ± 0.543
2.81PheGly: 2.81 ± 0.543
0.401PheHis: 0.401 ± 0.365
3.212PheIle: 3.212 ± 1.016
3.212PheLys: 3.212 ± 1.63
4.014PheLeu: 4.014 ± 1.462
1.204PheMet: 1.204 ± 0.594
2.409PheAsn: 2.409 ± 1.035
2.81PhePro: 2.81 ± 0.746
1.204PheGln: 1.204 ± 0.698
2.409PheArg: 2.409 ± 0.787
3.212PheSer: 3.212 ± 1.208
1.606PheThr: 1.606 ± 0.684
2.007PheVal: 2.007 ± 1.132
1.204PheTrp: 1.204 ± 0.625
2.81PheTyr: 2.81 ± 2.03
0.0PheXaa: 0.0 ± 0.0
Gly
2.81GlyAla: 2.81 ± 1.719
0.803GlyCys: 0.803 ± 0.453
3.613GlyAsp: 3.613 ± 1.27
4.014GlyGlu: 4.014 ± 1.007
1.606GlyPhe: 1.606 ± 0.52
2.81GlyGly: 2.81 ± 1.171
1.606GlyHis: 1.606 ± 0.726
4.014GlyIle: 4.014 ± 1.024
2.007GlyLys: 2.007 ± 0.678
5.219GlyLeu: 5.219 ± 1.392
0.0GlyMet: 0.0 ± 0.0
4.416GlyAsn: 4.416 ± 0.966
2.409GlyPro: 2.409 ± 0.648
2.409GlyGln: 2.409 ± 0.874
3.212GlyArg: 3.212 ± 1.065
4.416GlySer: 4.416 ± 1.043
5.62GlyThr: 5.62 ± 1.721
2.409GlyVal: 2.409 ± 0.605
0.0GlyTrp: 0.0 ± 0.0
2.007GlyTyr: 2.007 ± 0.807
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.401HisCys: 0.401 ± 0.319
0.401HisAsp: 0.401 ± 0.335
0.803HisGlu: 0.803 ± 0.639
1.204HisPhe: 1.204 ± 0.514
0.803HisGly: 0.803 ± 0.6
0.0HisHis: 0.0 ± 0.0
1.204HisIle: 1.204 ± 0.859
2.007HisLys: 2.007 ± 1.215
1.606HisLeu: 1.606 ± 0.297
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
2.007HisPro: 2.007 ± 0.892
0.0HisGln: 0.0 ± 0.0
1.606HisArg: 1.606 ± 0.727
0.803HisSer: 0.803 ± 0.639
0.803HisThr: 0.803 ± 0.731
0.0HisVal: 0.0 ± 0.0
0.401HisTrp: 0.401 ± 0.365
1.204HisTyr: 1.204 ± 0.635
0.0HisXaa: 0.0 ± 0.0
Ile
2.007IleAla: 2.007 ± 0.584
1.204IleCys: 1.204 ± 0.697
6.022IleAsp: 6.022 ± 1.697
3.613IleGlu: 3.613 ± 0.608
2.007IlePhe: 2.007 ± 0.993
2.81IleGly: 2.81 ± 1.207
0.803IleHis: 0.803 ± 0.837
5.219IleIle: 5.219 ± 1.951
2.81IleLys: 2.81 ± 0.58
4.416IleLeu: 4.416 ± 1.067
2.81IleMet: 2.81 ± 1.215
1.606IleAsn: 1.606 ± 0.718
4.014IlePro: 4.014 ± 2.534
2.81IleGln: 2.81 ± 0.535
0.803IleArg: 0.803 ± 0.529
6.022IleSer: 6.022 ± 1.295
4.014IleThr: 4.014 ± 1.334
2.81IleVal: 2.81 ± 0.981
0.401IleTrp: 0.401 ± 0.319
1.606IleTyr: 1.606 ± 0.683
0.0IleXaa: 0.0 ± 0.0
Lys
1.606LysAla: 1.606 ± 0.906
2.409LysCys: 2.409 ± 0.886
2.007LysAsp: 2.007 ± 0.71
2.007LysGlu: 2.007 ± 0.955
2.409LysPhe: 2.409 ± 1.176
3.212LysGly: 3.212 ± 0.638
0.803LysHis: 0.803 ± 0.495
1.606LysIle: 1.606 ± 1.174
2.409LysLys: 2.409 ± 0.901
6.825LysLeu: 6.825 ± 2.168
1.204LysMet: 1.204 ± 0.797
1.606LysAsn: 1.606 ± 1.354
2.81LysPro: 2.81 ± 1.458
3.212LysGln: 3.212 ± 1.703
6.423LysArg: 6.423 ± 1.337
4.416LysSer: 4.416 ± 2.014
2.81LysThr: 2.81 ± 1.077
2.007LysVal: 2.007 ± 0.612
0.0LysTrp: 0.0 ± 0.0
2.409LysTyr: 2.409 ± 1.023
0.0LysXaa: 0.0 ± 0.0
Leu
5.62LeuAla: 5.62 ± 1.271
0.803LeuCys: 0.803 ± 0.4
4.416LeuAsp: 4.416 ± 1.432
4.817LeuGlu: 4.817 ± 1.17
6.022LeuPhe: 6.022 ± 1.586
7.226LeuGly: 7.226 ± 2.812
0.803LeuHis: 0.803 ± 0.429
4.416LeuIle: 4.416 ± 0.821
6.825LeuLys: 6.825 ± 2.64
9.635LeuLeu: 9.635 ± 2.427
1.204LeuMet: 1.204 ± 0.671
4.416LeuAsn: 4.416 ± 0.788
6.022LeuPro: 6.022 ± 1.736
6.825LeuGln: 6.825 ± 0.843
4.014LeuArg: 4.014 ± 0.964
4.416LeuSer: 4.416 ± 2.099
8.029LeuThr: 8.029 ± 2.936
4.416LeuVal: 4.416 ± 1.439
0.803LeuTrp: 0.803 ± 0.453
4.817LeuTyr: 4.817 ± 0.969
0.0LeuXaa: 0.0 ± 0.0
Met
1.204MetAla: 1.204 ± 0.354
2.007MetCys: 2.007 ± 0.819
2.007MetAsp: 2.007 ± 1.034
1.204MetGlu: 1.204 ± 0.827
0.0MetPhe: 0.0 ± 0.0
0.401MetGly: 0.401 ± 0.365
0.401MetHis: 0.401 ± 0.419
2.007MetIle: 2.007 ± 0.572
2.007MetLys: 2.007 ± 0.613
1.204MetLeu: 1.204 ± 0.625
0.0MetMet: 0.0 ± 0.0
0.803MetAsn: 0.803 ± 0.453
0.401MetPro: 0.401 ± 0.335
2.007MetGln: 2.007 ± 0.613
0.803MetArg: 0.803 ± 0.4
1.204MetSer: 1.204 ± 0.449
0.401MetThr: 0.401 ± 0.365
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.803MetTyr: 0.803 ± 0.359
0.0MetXaa: 0.0 ± 0.0
Asn
2.007AsnAla: 2.007 ± 0.788
0.803AsnCys: 0.803 ± 0.695
1.204AsnAsp: 1.204 ± 0.391
3.613AsnGlu: 3.613 ± 1.049
1.606AsnPhe: 1.606 ± 0.579
2.409AsnGly: 2.409 ± 1.118
0.401AsnHis: 0.401 ± 0.319
2.409AsnIle: 2.409 ± 1.108
2.409AsnLys: 2.409 ± 0.679
4.817AsnLeu: 4.817 ± 1.113
0.401AsnMet: 0.401 ± 0.319
4.817AsnAsn: 4.817 ± 1.297
2.409AsnPro: 2.409 ± 1.23
3.613AsnGln: 3.613 ± 0.914
4.014AsnArg: 4.014 ± 0.788
2.81AsnSer: 2.81 ± 1.668
2.81AsnThr: 2.81 ± 0.931
3.212AsnVal: 3.212 ± 0.768
1.204AsnTrp: 1.204 ± 0.694
0.803AsnTyr: 0.803 ± 0.848
0.0AsnXaa: 0.0 ± 0.0
Pro
4.817ProAla: 4.817 ± 1.993
1.204ProCys: 1.204 ± 0.86
5.219ProAsp: 5.219 ± 1.489
2.81ProGlu: 2.81 ± 0.993
1.204ProPhe: 1.204 ± 0.591
2.007ProGly: 2.007 ± 1.394
2.007ProHis: 2.007 ± 1.23
2.409ProIle: 2.409 ± 0.909
2.409ProLys: 2.409 ± 1.005
6.423ProLeu: 6.423 ± 1.226
0.0ProMet: 0.0 ± 0.0
2.409ProAsn: 2.409 ± 0.719
6.423ProPro: 6.423 ± 1.198
2.81ProGln: 2.81 ± 1.112
3.212ProArg: 3.212 ± 1.267
2.81ProSer: 2.81 ± 1.098
6.423ProThr: 6.423 ± 1.998
2.81ProVal: 2.81 ± 1.032
0.401ProTrp: 0.401 ± 0.419
2.007ProTyr: 2.007 ± 1.15
0.0ProXaa: 0.0 ± 0.0
Gln
1.606GlnAla: 1.606 ± 1.071
1.606GlnCys: 1.606 ± 0.824
4.014GlnAsp: 4.014 ± 0.642
4.817GlnGlu: 4.817 ± 0.914
3.613GlnPhe: 3.613 ± 1.086
2.007GlnGly: 2.007 ± 0.539
0.803GlnHis: 0.803 ± 0.639
2.81GlnIle: 2.81 ± 0.829
0.803GlnLys: 0.803 ± 0.529
5.219GlnLeu: 5.219 ± 1.423
1.204GlnMet: 1.204 ± 0.627
2.007GlnAsn: 2.007 ± 0.903
4.014GlnPro: 4.014 ± 1.423
3.212GlnGln: 3.212 ± 0.996
1.606GlnArg: 1.606 ± 0.666
2.007GlnSer: 2.007 ± 0.68
2.007GlnThr: 2.007 ± 1.167
1.204GlnVal: 1.204 ± 0.694
0.401GlnTrp: 0.401 ± 0.365
2.409GlnTyr: 2.409 ± 1.087
0.0GlnXaa: 0.0 ± 0.0
Arg
3.212ArgAla: 3.212 ± 0.695
1.204ArgCys: 1.204 ± 0.828
2.007ArgAsp: 2.007 ± 0.777
3.212ArgGlu: 3.212 ± 0.999
2.81ArgPhe: 2.81 ± 1.401
2.409ArgGly: 2.409 ± 0.88
2.409ArgHis: 2.409 ± 0.902
0.803ArgIle: 0.803 ± 0.759
3.212ArgLys: 3.212 ± 0.594
6.825ArgLeu: 6.825 ± 0.725
1.204ArgMet: 1.204 ± 0.78
2.409ArgAsn: 2.409 ± 0.545
2.81ArgPro: 2.81 ± 1.104
2.007ArgGln: 2.007 ± 0.666
4.817ArgArg: 4.817 ± 1.744
5.62ArgSer: 5.62 ± 1.682
2.409ArgThr: 2.409 ± 0.591
4.014ArgVal: 4.014 ± 1.621
1.606ArgTrp: 1.606 ± 1.133
3.212ArgTyr: 3.212 ± 1.006
0.0ArgXaa: 0.0 ± 0.0
Ser
4.817SerAla: 4.817 ± 1.689
1.204SerCys: 1.204 ± 0.502
3.613SerAsp: 3.613 ± 0.953
5.219SerGlu: 5.219 ± 1.575
3.212SerPhe: 3.212 ± 0.979
6.423SerGly: 6.423 ± 1.576
1.204SerHis: 1.204 ± 0.391
4.014SerIle: 4.014 ± 1.3
3.212SerLys: 3.212 ± 1.428
7.226SerLeu: 7.226 ± 2.239
1.204SerMet: 1.204 ± 0.624
3.613SerAsn: 3.613 ± 1.246
2.81SerPro: 2.81 ± 1.157
3.613SerGln: 3.613 ± 0.815
4.817SerArg: 4.817 ± 1.107
8.029SerSer: 8.029 ± 2.948
4.014SerThr: 4.014 ± 0.709
2.81SerVal: 2.81 ± 0.942
0.401SerTrp: 0.401 ± 0.319
2.007SerTyr: 2.007 ± 0.797
0.0SerXaa: 0.0 ± 0.0
Thr
3.212ThrAla: 3.212 ± 0.964
0.803ThrCys: 0.803 ± 0.93
4.014ThrAsp: 4.014 ± 1.322
5.62ThrGlu: 5.62 ± 0.36
2.409ThrPhe: 2.409 ± 0.901
3.212ThrGly: 3.212 ± 0.809
0.0ThrHis: 0.0 ± 0.0
6.022ThrIle: 6.022 ± 2.247
3.212ThrLys: 3.212 ± 0.968
5.219ThrLeu: 5.219 ± 0.811
2.007ThrMet: 2.007 ± 0.612
3.613ThrAsn: 3.613 ± 1.126
4.416ThrPro: 4.416 ± 1.37
1.606ThrGln: 1.606 ± 0.611
3.613ThrArg: 3.613 ± 0.745
4.817ThrSer: 4.817 ± 1.917
5.219ThrThr: 5.219 ± 1.533
4.416ThrVal: 4.416 ± 0.764
0.401ThrTrp: 0.401 ± 0.365
1.606ThrTyr: 1.606 ± 1.036
0.0ThrXaa: 0.0 ± 0.0
Val
3.212ValAla: 3.212 ± 0.825
0.401ValCys: 0.401 ± 0.319
4.014ValAsp: 4.014 ± 1.404
2.007ValGlu: 2.007 ± 0.612
4.416ValPhe: 4.416 ± 1.005
2.81ValGly: 2.81 ± 0.994
1.204ValHis: 1.204 ± 0.651
2.007ValIle: 2.007 ± 0.882
2.409ValLys: 2.409 ± 0.839
3.613ValLeu: 3.613 ± 1.403
1.606ValMet: 1.606 ± 0.726
2.007ValAsn: 2.007 ± 1.011
4.817ValPro: 4.817 ± 1.655
1.606ValGln: 1.606 ± 0.561
3.212ValArg: 3.212 ± 1.349
3.613ValSer: 3.613 ± 0.746
3.212ValThr: 3.212 ± 0.857
2.81ValVal: 2.81 ± 1.119
1.204ValTrp: 1.204 ± 0.73
1.606ValTyr: 1.606 ± 0.755
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.803TrpAsp: 0.803 ± 0.453
0.803TrpGlu: 0.803 ± 0.488
0.401TrpPhe: 0.401 ± 0.365
1.606TrpGly: 1.606 ± 0.474
0.401TrpHis: 0.401 ± 0.365
0.401TrpIle: 0.401 ± 0.319
1.606TrpLys: 1.606 ± 1.043
1.606TrpLeu: 1.606 ± 0.801
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.803TrpGln: 0.803 ± 0.4
1.204TrpArg: 1.204 ± 0.73
0.0TrpSer: 0.0 ± 0.0
1.606TrpThr: 1.606 ± 0.474
0.803TrpVal: 0.803 ± 0.453
0.0TrpTrp: 0.0 ± 0.0
0.401TrpTyr: 0.401 ± 0.319
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.409TyrAla: 2.409 ± 0.984
1.204TyrCys: 1.204 ± 2.092
1.204TyrAsp: 1.204 ± 0.514
1.204TyrGlu: 1.204 ± 0.449
4.014TyrPhe: 4.014 ± 0.958
2.409TyrGly: 2.409 ± 1.201
0.0TyrHis: 0.0 ± 0.0
2.007TyrIle: 2.007 ± 0.536
3.613TyrLys: 3.613 ± 1.183
4.416TyrLeu: 4.416 ± 1.347
0.401TyrMet: 0.401 ± 0.319
1.204TyrAsn: 1.204 ± 0.391
2.007TyrPro: 2.007 ± 1.077
1.606TyrGln: 1.606 ± 0.474
1.204TyrArg: 1.204 ± 0.449
1.606TyrSer: 1.606 ± 0.906
3.613TyrThr: 3.613 ± 1.707
1.606TyrVal: 1.606 ± 0.635
0.803TyrTrp: 0.803 ± 0.453
2.007TyrTyr: 2.007 ± 1.041
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2492 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski