Amino acid dipepetide frequency for Alphapapillomavirus 9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.963AlaAla: 4.963 ± 1.444
1.654AlaCys: 1.654 ± 0.84
3.722AlaAsp: 3.722 ± 1.089
2.481AlaGlu: 2.481 ± 0.841
2.481AlaPhe: 2.481 ± 0.79
4.549AlaGly: 4.549 ± 1.038
1.241AlaHis: 1.241 ± 0.827
2.895AlaIle: 2.895 ± 0.923
4.136AlaLys: 4.136 ± 1.623
3.309AlaLeu: 3.309 ± 1.273
1.241AlaMet: 1.241 ± 0.387
2.895AlaAsn: 2.895 ± 0.989
3.309AlaPro: 3.309 ± 1.553
1.654AlaGln: 1.654 ± 0.809
1.654AlaArg: 1.654 ± 0.662
3.309AlaSer: 3.309 ± 1.052
4.136AlaThr: 4.136 ± 1.308
3.722AlaVal: 3.722 ± 1.164
0.0AlaTrp: 0.0 ± 0.0
2.068AlaTyr: 2.068 ± 0.659
0.0AlaXaa: 0.0 ± 0.0
Cys
1.241CysAla: 1.241 ± 0.427
1.654CysCys: 1.654 ± 1.027
2.068CysAsp: 2.068 ± 1.245
0.414CysGlu: 0.414 ± 0.368
1.654CysPhe: 1.654 ± 1.593
0.414CysGly: 0.414 ± 0.339
0.414CysHis: 0.414 ± 0.368
2.895CysIle: 2.895 ± 1.033
2.481CysLys: 2.481 ± 1.211
2.481CysLeu: 2.481 ± 0.973
0.827CysMet: 0.827 ± 0.528
1.241CysAsn: 1.241 ± 0.725
2.481CysPro: 2.481 ± 0.628
2.068CysGln: 2.068 ± 0.691
0.827CysArg: 0.827 ± 0.528
1.241CysSer: 1.241 ± 0.803
2.481CysThr: 2.481 ± 0.881
3.309CysVal: 3.309 ± 1.491
0.827CysTrp: 0.827 ± 0.405
0.827CysTyr: 0.827 ± 0.81
0.0CysXaa: 0.0 ± 0.0
Asp
3.722AspAla: 3.722 ± 1.019
1.241AspCys: 1.241 ± 0.633
3.722AspAsp: 3.722 ± 1.278
2.068AspGlu: 2.068 ± 0.966
2.895AspPhe: 2.895 ± 1.213
3.309AspGly: 3.309 ± 0.953
0.414AspHis: 0.414 ± 0.368
4.549AspIle: 4.549 ± 1.427
2.068AspLys: 2.068 ± 1.012
4.136AspLeu: 4.136 ± 2.032
0.414AspMet: 0.414 ± 0.395
3.309AspAsn: 3.309 ± 1.206
4.136AspPro: 4.136 ± 1.804
2.481AspGln: 2.481 ± 1.517
0.827AspArg: 0.827 ± 0.599
5.376AspSer: 5.376 ± 2.064
6.617AspThr: 6.617 ± 1.545
2.481AspVal: 2.481 ± 0.823
1.241AspTrp: 1.241 ± 0.633
2.068AspTyr: 2.068 ± 0.854
0.0AspXaa: 0.0 ± 0.0
Glu
1.654GluAla: 1.654 ± 0.595
1.241GluCys: 1.241 ± 0.599
3.722GluAsp: 3.722 ± 1.527
4.136GluGlu: 4.136 ± 1.345
0.827GluPhe: 0.827 ± 0.678
2.895GluGly: 2.895 ± 1.05
0.414GluHis: 0.414 ± 0.395
2.068GluIle: 2.068 ± 0.801
3.309GluLys: 3.309 ± 1.138
3.309GluLeu: 3.309 ± 0.957
0.414GluMet: 0.414 ± 0.368
2.481GluAsn: 2.481 ± 1.178
1.654GluPro: 1.654 ± 0.587
1.241GluGln: 1.241 ± 0.831
0.827GluArg: 0.827 ± 0.528
0.414GluSer: 0.414 ± 0.339
6.617GluThr: 6.617 ± 1.556
4.136GluVal: 4.136 ± 1.403
0.827GluTrp: 0.827 ± 0.43
2.068GluTyr: 2.068 ± 1.303
0.0GluXaa: 0.0 ± 0.0
Phe
0.414PheAla: 0.414 ± 0.478
0.827PheCys: 0.827 ± 0.859
0.827PheAsp: 0.827 ± 0.43
0.827PheGlu: 0.827 ± 0.678
1.654PhePhe: 1.654 ± 0.551
2.895PheGly: 2.895 ± 0.892
0.414PheHis: 0.414 ± 0.478
2.481PheIle: 2.481 ± 0.918
2.895PheLys: 2.895 ± 1.159
4.549PheLeu: 4.549 ± 1.752
0.827PheMet: 0.827 ± 0.43
1.654PheAsn: 1.654 ± 0.701
2.481PhePro: 2.481 ± 1.449
0.827PheGln: 0.827 ± 0.569
1.241PheArg: 1.241 ± 0.848
3.309PheSer: 3.309 ± 1.098
2.481PheThr: 2.481 ± 0.659
2.895PheVal: 2.895 ± 0.956
0.827PheTrp: 0.827 ± 0.405
2.068PheTyr: 2.068 ± 0.626
0.0PheXaa: 0.0 ± 0.0
Gly
2.481GlyAla: 2.481 ± 0.894
2.068GlyCys: 2.068 ± 0.869
4.549GlyAsp: 4.549 ± 0.964
2.481GlyGlu: 2.481 ± 1.214
2.068GlyPhe: 2.068 ± 0.904
4.549GlyGly: 4.549 ± 1.475
2.068GlyHis: 2.068 ± 1.082
3.722GlyIle: 3.722 ± 0.689
2.481GlyLys: 2.481 ± 0.659
3.722GlyLeu: 3.722 ± 1.119
1.241GlyMet: 1.241 ± 1.016
2.895GlyAsn: 2.895 ± 0.652
1.241GlyPro: 1.241 ± 0.866
2.068GlyGln: 2.068 ± 0.783
2.895GlyArg: 2.895 ± 0.843
5.376GlySer: 5.376 ± 1.277
4.963GlyThr: 4.963 ± 0.881
2.895GlyVal: 2.895 ± 1.029
0.414GlyTrp: 0.414 ± 0.339
1.654GlyTyr: 1.654 ± 0.686
0.0GlyXaa: 0.0 ± 0.0
His
2.068HisAla: 2.068 ± 0.474
0.414HisCys: 0.414 ± 0.368
0.414HisAsp: 0.414 ± 0.478
1.654HisGlu: 1.654 ± 0.793
0.827HisPhe: 0.827 ± 0.403
1.241HisGly: 1.241 ± 0.679
0.0HisHis: 0.0 ± 0.0
1.654HisIle: 1.654 ± 1.067
1.241HisLys: 1.241 ± 0.779
2.895HisLeu: 2.895 ± 1.634
0.414HisMet: 0.414 ± 0.368
2.481HisAsn: 2.481 ± 0.529
1.654HisPro: 1.654 ± 0.98
0.414HisGln: 0.414 ± 0.368
2.068HisArg: 2.068 ± 0.77
1.241HisSer: 1.241 ± 0.427
2.481HisThr: 2.481 ± 0.95
0.414HisVal: 0.414 ± 0.551
0.827HisTrp: 0.827 ± 0.489
2.481HisTyr: 2.481 ± 0.93
0.0HisXaa: 0.0 ± 0.0
Ile
2.895IleAla: 2.895 ± 0.995
3.722IleCys: 3.722 ± 1.102
2.481IleAsp: 2.481 ± 1.064
2.068IleGlu: 2.068 ± 0.915
0.827IlePhe: 0.827 ± 0.79
2.895IleGly: 2.895 ± 0.918
1.654IleHis: 1.654 ± 1.053
1.654IleIle: 1.654 ± 0.847
2.481IleLys: 2.481 ± 1.035
4.549IleLeu: 4.549 ± 0.782
0.414IleMet: 0.414 ± 0.363
3.722IleAsn: 3.722 ± 1.201
5.79IlePro: 5.79 ± 3.484
1.654IleGln: 1.654 ± 0.595
2.895IleArg: 2.895 ± 1.529
4.136IleSer: 4.136 ± 1.344
4.136IleThr: 4.136 ± 1.68
6.617IleVal: 6.617 ± 2.01
0.0IleTrp: 0.0 ± 0.0
2.068IleTyr: 2.068 ± 1.12
0.0IleXaa: 0.0 ± 0.0
Lys
2.895LysAla: 2.895 ± 1.082
2.481LysCys: 2.481 ± 1.091
1.241LysAsp: 1.241 ± 0.69
2.068LysGlu: 2.068 ± 1.105
2.895LysPhe: 2.895 ± 1.399
1.654LysGly: 1.654 ± 0.979
3.309LysHis: 3.309 ± 1.746
2.895LysIle: 2.895 ± 0.781
4.136LysLys: 4.136 ± 1.004
3.722LysLeu: 3.722 ± 0.64
0.414LysMet: 0.414 ± 0.395
2.895LysAsn: 2.895 ± 0.979
3.309LysPro: 3.309 ± 1.922
4.136LysGln: 4.136 ± 1.495
4.963LysArg: 4.963 ± 1.412
3.722LysSer: 3.722 ± 1.694
2.895LysThr: 2.895 ± 1.239
2.481LysVal: 2.481 ± 1.036
0.414LysTrp: 0.414 ± 0.368
2.895LysTyr: 2.895 ± 0.889
0.0LysXaa: 0.0 ± 0.0
Leu
4.963LeuAla: 4.963 ± 0.705
5.79LeuCys: 5.79 ± 2.595
4.136LeuAsp: 4.136 ± 1.011
3.722LeuGlu: 3.722 ± 1.65
2.895LeuPhe: 2.895 ± 1.217
4.136LeuGly: 4.136 ± 1.099
4.136LeuHis: 4.136 ± 2.009
4.549LeuIle: 4.549 ± 2.382
5.376LeuLys: 5.376 ± 1.469
10.339LeuLeu: 10.339 ± 5.202
1.241LeuMet: 1.241 ± 0.674
2.068LeuAsn: 2.068 ± 0.677
2.481LeuPro: 2.481 ± 0.949
8.685LeuGln: 8.685 ± 0.964
4.549LeuArg: 4.549 ± 0.529
4.136LeuSer: 4.136 ± 1.561
5.79LeuThr: 5.79 ± 1.785
4.549LeuVal: 4.549 ± 1.535
0.827LeuTrp: 0.827 ± 0.658
4.963LeuTyr: 4.963 ± 1.149
0.0LeuXaa: 0.0 ± 0.0
Met
0.414MetAla: 0.414 ± 0.339
0.827MetCys: 0.827 ± 0.678
1.654MetAsp: 1.654 ± 0.606
0.414MetGlu: 0.414 ± 0.368
1.241MetPhe: 1.241 ± 0.517
0.827MetGly: 0.827 ± 0.48
0.827MetHis: 0.827 ± 0.647
0.414MetIle: 0.414 ± 0.339
0.414MetLys: 0.414 ± 0.339
2.068MetLeu: 2.068 ± 1.013
0.414MetMet: 0.414 ± 0.339
0.414MetAsn: 0.414 ± 0.395
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.827MetArg: 0.827 ± 0.48
3.309MetSer: 3.309 ± 1.085
0.827MetThr: 0.827 ± 0.658
2.068MetVal: 2.068 ± 1.105
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.481AsnAla: 2.481 ± 1.267
1.241AsnCys: 1.241 ± 0.548
4.136AsnAsp: 4.136 ± 2.168
1.654AsnGlu: 1.654 ± 0.86
0.827AsnPhe: 0.827 ± 0.79
2.068AsnGly: 2.068 ± 0.812
0.827AsnHis: 0.827 ± 0.736
4.136AsnIle: 4.136 ± 1.167
4.136AsnLys: 4.136 ± 1.61
2.068AsnLeu: 2.068 ± 0.87
0.414AsnMet: 0.414 ± 0.395
2.895AsnAsn: 2.895 ± 1.079
3.309AsnPro: 3.309 ± 0.785
0.414AsnGln: 0.414 ± 0.395
1.654AsnArg: 1.654 ± 0.809
3.309AsnSer: 3.309 ± 1.023
5.376AsnThr: 5.376 ± 1.147
2.068AsnVal: 2.068 ± 0.854
0.827AsnTrp: 0.827 ± 0.43
1.241AsnTyr: 1.241 ± 0.33
0.0AsnXaa: 0.0 ± 0.0
Pro
4.549ProAla: 4.549 ± 2.35
1.241ProCys: 1.241 ± 0.427
4.963ProAsp: 4.963 ± 2.006
2.068ProGlu: 2.068 ± 1.007
1.241ProPhe: 1.241 ± 0.656
1.654ProGly: 1.654 ± 0.844
0.0ProHis: 0.0 ± 0.0
3.722ProIle: 3.722 ± 1.318
3.722ProLys: 3.722 ± 1.329
8.271ProLeu: 8.271 ± 2.05
1.654ProMet: 1.654 ± 0.696
1.654ProAsn: 1.654 ± 0.606
5.79ProPro: 5.79 ± 2.041
1.241ProGln: 1.241 ± 1.0
1.654ProArg: 1.654 ± 0.82
5.376ProSer: 5.376 ± 2.937
5.79ProThr: 5.79 ± 1.926
2.895ProVal: 2.895 ± 1.179
0.414ProTrp: 0.414 ± 0.683
2.895ProTyr: 2.895 ± 0.935
0.0ProXaa: 0.0 ± 0.0
Gln
3.722GlnAla: 3.722 ± 0.717
0.0GlnCys: 0.0 ± 0.0
2.068GlnAsp: 2.068 ± 1.044
0.827GlnGlu: 0.827 ± 0.528
2.068GlnPhe: 2.068 ± 1.238
1.654GlnGly: 1.654 ± 0.932
1.241GlnHis: 1.241 ± 0.681
1.241GlnIle: 1.241 ± 0.33
1.241GlnLys: 1.241 ± 0.831
4.549GlnLeu: 4.549 ± 1.398
1.654GlnMet: 1.654 ± 0.932
0.414GlnAsn: 0.414 ± 0.339
2.068GlnPro: 2.068 ± 0.68
2.068GlnGln: 2.068 ± 1.025
3.309GlnArg: 3.309 ± 1.22
2.068GlnSer: 2.068 ± 1.399
3.309GlnThr: 3.309 ± 0.703
2.895GlnVal: 2.895 ± 1.399
1.241GlnTrp: 1.241 ± 0.81
3.309GlnTyr: 3.309 ± 0.646
0.0GlnXaa: 0.0 ± 0.0
Arg
2.481ArgAla: 2.481 ± 0.676
1.654ArgCys: 1.654 ± 1.163
2.481ArgAsp: 2.481 ± 1.277
2.481ArgGlu: 2.481 ± 0.968
2.068ArgPhe: 2.068 ± 0.72
0.827ArgGly: 0.827 ± 0.569
3.309ArgHis: 3.309 ± 0.976
1.241ArgIle: 1.241 ± 0.481
3.309ArgLys: 3.309 ± 1.261
4.963ArgLeu: 4.963 ± 0.663
0.0ArgMet: 0.0 ± 0.0
0.414ArgAsn: 0.414 ± 0.339
4.136ArgPro: 4.136 ± 1.499
0.827ArgGln: 0.827 ± 0.43
2.895ArgArg: 2.895 ± 1.187
2.481ArgSer: 2.481 ± 0.529
4.136ArgThr: 4.136 ± 1.54
0.827ArgVal: 0.827 ± 0.405
0.827ArgTrp: 0.827 ± 0.528
1.654ArgTyr: 1.654 ± 0.637
0.0ArgXaa: 0.0 ± 0.0
Ser
3.722SerAla: 3.722 ± 0.997
0.827SerCys: 0.827 ± 0.881
2.895SerAsp: 2.895 ± 0.886
5.376SerGlu: 5.376 ± 1.455
2.481SerPhe: 2.481 ± 1.131
6.617SerGly: 6.617 ± 1.868
1.241SerHis: 1.241 ± 0.451
4.136SerIle: 4.136 ± 1.769
2.481SerLys: 2.481 ± 0.722
6.203SerLeu: 6.203 ± 0.975
2.481SerMet: 2.481 ± 0.94
4.549SerAsn: 4.549 ± 1.734
3.309SerPro: 3.309 ± 0.756
4.136SerGln: 4.136 ± 1.358
4.136SerArg: 4.136 ± 1.325
6.203SerSer: 6.203 ± 1.935
10.753SerThr: 10.753 ± 2.809
3.722SerVal: 3.722 ± 1.36
0.414SerTrp: 0.414 ± 0.339
0.414SerTyr: 0.414 ± 0.353
0.0SerXaa: 0.0 ± 0.0
Thr
5.79ThrAla: 5.79 ± 1.187
1.241ThrCys: 1.241 ± 0.678
3.722ThrAsp: 3.722 ± 1.512
2.481ThrGlu: 2.481 ± 1.031
2.895ThrPhe: 2.895 ± 0.843
6.617ThrGly: 6.617 ± 1.671
2.895ThrHis: 2.895 ± 1.409
5.79ThrIle: 5.79 ± 1.641
2.895ThrLys: 2.895 ± 1.586
9.926ThrLeu: 9.926 ± 2.827
1.241ThrMet: 1.241 ± 0.55
4.549ThrAsn: 4.549 ± 1.225
7.858ThrPro: 7.858 ± 2.717
3.722ThrGln: 3.722 ± 0.623
1.654ThrArg: 1.654 ± 0.761
8.685ThrSer: 8.685 ± 3.003
9.512ThrThr: 9.512 ± 3.344
5.79ThrVal: 5.79 ± 1.568
1.241ThrTrp: 1.241 ± 0.69
3.309ThrTyr: 3.309 ± 1.285
0.0ThrXaa: 0.0 ± 0.0
Val
2.068ValAla: 2.068 ± 0.827
1.654ValCys: 1.654 ± 1.113
5.376ValAsp: 5.376 ± 0.724
4.136ValGlu: 4.136 ± 1.296
2.068ValPhe: 2.068 ± 0.474
2.481ValGly: 2.481 ± 1.874
1.654ValHis: 1.654 ± 1.12
1.654ValIle: 1.654 ± 0.657
2.068ValLys: 2.068 ± 0.807
3.722ValLeu: 3.722 ± 1.838
0.827ValMet: 0.827 ± 0.405
1.654ValAsn: 1.654 ± 0.809
4.136ValPro: 4.136 ± 1.404
2.481ValGln: 2.481 ± 1.126
1.654ValArg: 1.654 ± 0.519
8.271ValSer: 8.271 ± 1.347
4.549ValThr: 4.549 ± 1.586
3.309ValVal: 3.309 ± 0.766
0.827ValTrp: 0.827 ± 0.489
4.136ValTyr: 4.136 ± 2.182
0.0ValXaa: 0.0 ± 0.0
Trp
1.241TrpAla: 1.241 ± 0.674
0.414TrpCys: 0.414 ± 0.339
0.0TrpAsp: 0.0 ± 0.0
0.827TrpGlu: 0.827 ± 0.489
0.414TrpPhe: 0.414 ± 0.339
1.241TrpGly: 1.241 ± 0.725
0.414TrpHis: 0.414 ± 0.368
1.241TrpIle: 1.241 ± 0.773
1.241TrpLys: 1.241 ± 0.681
1.241TrpLeu: 1.241 ± 0.725
0.0TrpMet: 0.0 ± 0.0
0.827TrpAsn: 0.827 ± 0.405
0.827TrpPro: 0.827 ± 0.668
0.414TrpGln: 0.414 ± 0.368
0.0TrpArg: 0.0 ± 0.0
0.414TrpSer: 0.414 ± 0.339
2.068TrpThr: 2.068 ± 1.314
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.414TrpTyr: 0.414 ± 0.339
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.654TyrAla: 1.654 ± 0.44
1.654TyrCys: 1.654 ± 1.18
2.895TyrAsp: 2.895 ± 0.822
2.068TyrGlu: 2.068 ± 0.801
2.068TyrPhe: 2.068 ± 0.827
3.309TyrGly: 3.309 ± 0.957
0.414TyrHis: 0.414 ± 0.395
3.722TyrIle: 3.722 ± 1.219
3.309TyrLys: 3.309 ± 0.978
3.722TyrLeu: 3.722 ± 1.074
0.827TyrMet: 0.827 ± 0.634
2.068TyrAsn: 2.068 ± 0.808
0.827TyrPro: 0.827 ± 0.763
0.414TyrGln: 0.414 ± 0.339
2.481TyrArg: 2.481 ± 0.968
3.722TyrSer: 3.722 ± 1.167
2.895TyrThr: 2.895 ± 1.197
1.654TyrVal: 1.654 ± 0.657
1.241TyrTrp: 1.241 ± 0.427
3.309TyrTyr: 3.309 ± 1.471
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2419 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski