Amino acid dipepetide frequency for Macaca fascicularis papillomavirus 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.686AlaAla: 6.686 ± 1.712
1.254AlaCys: 1.254 ± 0.711
2.507AlaAsp: 2.507 ± 0.642
2.925AlaGlu: 2.925 ± 0.476
2.507AlaPhe: 2.507 ± 0.769
5.015AlaGly: 5.015 ± 1.14
2.089AlaHis: 2.089 ± 0.827
2.089AlaIle: 2.089 ± 0.374
5.015AlaLys: 5.015 ± 1.602
5.85AlaLeu: 5.85 ± 1.3
1.672AlaMet: 1.672 ± 0.522
0.836AlaAsn: 0.836 ± 0.634
3.761AlaPro: 3.761 ± 1.286
2.507AlaGln: 2.507 ± 0.863
5.433AlaArg: 5.433 ± 2.848
3.343AlaSer: 3.343 ± 1.513
5.015AlaThr: 5.015 ± 1.856
5.015AlaVal: 5.015 ± 0.556
0.0AlaTrp: 0.0 ± 0.0
1.254AlaTyr: 1.254 ± 0.421
0.0AlaXaa: 0.0 ± 0.0
Cys
3.343CysAla: 3.343 ± 1.221
0.0CysCys: 0.0 ± 0.0
0.836CysAsp: 0.836 ± 0.397
1.254CysGlu: 1.254 ± 0.906
1.672CysPhe: 1.672 ± 1.097
0.418CysGly: 0.418 ± 0.49
0.836CysHis: 0.836 ± 0.564
1.672CysIle: 1.672 ± 0.95
2.089CysLys: 2.089 ± 0.818
4.597CysLeu: 4.597 ± 1.888
0.0CysMet: 0.0 ± 0.0
1.254CysAsn: 1.254 ± 0.421
1.672CysPro: 1.672 ± 0.486
1.672CysGln: 1.672 ± 0.813
0.836CysArg: 0.836 ± 0.639
1.254CysSer: 1.254 ± 0.613
0.418CysThr: 0.418 ± 0.32
2.507CysVal: 2.507 ± 0.87
1.254CysTrp: 1.254 ± 0.536
0.418CysTyr: 0.418 ± 0.433
0.0CysXaa: 0.0 ± 0.0
Asp
4.597AspAla: 4.597 ± 0.768
2.089AspCys: 2.089 ± 0.87
2.925AspAsp: 2.925 ± 0.929
3.761AspGlu: 3.761 ± 1.351
2.507AspPhe: 2.507 ± 0.868
2.925AspGly: 2.925 ± 1.113
2.089AspHis: 2.089 ± 0.879
3.343AspIle: 3.343 ± 1.669
2.925AspLys: 2.925 ± 1.536
5.015AspLeu: 5.015 ± 0.972
0.418AspMet: 0.418 ± 0.304
1.672AspAsn: 1.672 ± 0.822
4.179AspPro: 4.179 ± 0.812
0.836AspGln: 0.836 ± 0.429
1.672AspArg: 1.672 ± 0.934
3.761AspSer: 3.761 ± 1.197
3.761AspThr: 3.761 ± 1.398
3.343AspVal: 3.343 ± 1.261
1.672AspTrp: 1.672 ± 0.781
2.089AspTyr: 2.089 ± 0.856
0.0AspXaa: 0.0 ± 0.0
Glu
3.761GluAla: 3.761 ± 0.82
1.254GluCys: 1.254 ± 0.567
5.85GluAsp: 5.85 ± 2.584
3.343GluGlu: 3.343 ± 0.484
0.836GluPhe: 0.836 ± 0.447
2.507GluGly: 2.507 ± 0.781
1.672GluHis: 1.672 ± 0.863
2.925GluIle: 2.925 ± 1.594
2.507GluLys: 2.507 ± 1.353
5.015GluLeu: 5.015 ± 1.586
1.672GluMet: 1.672 ± 0.77
2.925GluAsn: 2.925 ± 0.883
3.761GluPro: 3.761 ± 1.206
2.925GluGln: 2.925 ± 0.958
2.089GluArg: 2.089 ± 0.901
3.343GluSer: 3.343 ± 1.005
2.507GluThr: 2.507 ± 0.728
4.179GluVal: 4.179 ± 0.51
1.254GluTrp: 1.254 ± 0.83
2.089GluTyr: 2.089 ± 0.828
0.0GluXaa: 0.0 ± 0.0
Phe
1.254PheAla: 1.254 ± 0.759
0.418PheCys: 0.418 ± 0.32
2.089PheAsp: 2.089 ± 1.011
0.836PheGlu: 0.836 ± 0.376
2.507PhePhe: 2.507 ± 0.744
2.089PheGly: 2.089 ± 0.428
0.836PheHis: 0.836 ± 0.495
2.089PheIle: 2.089 ± 1.144
3.761PheLys: 3.761 ± 1.589
7.104PheLeu: 7.104 ± 2.234
0.0PheMet: 0.0 ± 0.0
1.672PheAsn: 1.672 ± 0.859
2.507PhePro: 2.507 ± 0.764
0.836PheGln: 0.836 ± 0.429
0.418PheArg: 0.418 ± 0.304
3.761PheSer: 3.761 ± 0.64
0.836PheThr: 0.836 ± 0.534
1.672PheVal: 1.672 ± 0.593
1.254PheTrp: 1.254 ± 0.613
1.672PheTyr: 1.672 ± 0.734
0.0PheXaa: 0.0 ± 0.0
Gly
2.925GlyAla: 2.925 ± 1.03
0.836GlyCys: 0.836 ± 0.397
6.268GlyAsp: 6.268 ± 1.165
2.507GlyGlu: 2.507 ± 1.255
0.418GlyPhe: 0.418 ± 0.395
3.761GlyGly: 3.761 ± 1.53
2.925GlyHis: 2.925 ± 1.284
2.925GlyIle: 2.925 ± 0.929
1.672GlyLys: 1.672 ± 0.591
4.597GlyLeu: 4.597 ± 1.589
0.836GlyMet: 0.836 ± 0.397
4.179GlyAsn: 4.179 ± 1.691
2.507GlyPro: 2.507 ± 0.631
3.761GlyGln: 3.761 ± 0.671
3.761GlyArg: 3.761 ± 0.85
5.85GlySer: 5.85 ± 1.196
4.179GlyThr: 4.179 ± 1.105
5.015GlyVal: 5.015 ± 0.79
0.836GlyTrp: 0.836 ± 0.447
1.672GlyTyr: 1.672 ± 0.492
0.0GlyXaa: 0.0 ± 0.0
His
1.672HisAla: 1.672 ± 0.593
0.418HisCys: 0.418 ± 0.49
0.836HisAsp: 0.836 ± 0.495
1.672HisGlu: 1.672 ± 0.993
0.836HisPhe: 0.836 ± 0.447
1.254HisGly: 1.254 ± 0.702
1.254HisHis: 1.254 ± 0.682
1.254HisIle: 1.254 ± 0.364
1.254HisLys: 1.254 ± 0.415
1.672HisLeu: 1.672 ± 1.177
0.418HisMet: 0.418 ± 0.304
1.672HisAsn: 1.672 ± 0.583
3.343HisPro: 3.343 ± 1.513
1.672HisGln: 1.672 ± 1.352
1.672HisArg: 1.672 ± 0.88
2.089HisSer: 2.089 ± 1.019
0.418HisThr: 0.418 ± 0.395
2.925HisVal: 2.925 ± 0.597
1.254HisTrp: 1.254 ± 0.847
2.089HisTyr: 2.089 ± 0.693
0.0HisXaa: 0.0 ± 0.0
Ile
1.672IleAla: 1.672 ± 0.483
1.254IleCys: 1.254 ± 0.728
3.343IleAsp: 3.343 ± 1.763
2.507IleGlu: 2.507 ± 0.829
1.254IlePhe: 1.254 ± 0.815
2.925IleGly: 2.925 ± 0.969
1.254IleHis: 1.254 ± 0.698
0.0IleIle: 0.0 ± 0.0
1.254IleLys: 1.254 ± 0.654
4.179IleLeu: 4.179 ± 0.833
0.0IleMet: 0.0 ± 0.437
0.418IleAsn: 0.418 ± 0.469
2.507IlePro: 2.507 ± 1.402
1.254IleGln: 1.254 ± 0.654
1.672IleArg: 1.672 ± 1.131
2.507IleSer: 2.507 ± 1.15
3.761IleThr: 3.761 ± 1.148
6.268IleVal: 6.268 ± 1.558
0.0IleTrp: 0.0 ± 0.0
1.254IleTyr: 1.254 ± 0.583
0.0IleXaa: 0.0 ± 0.0
Lys
2.507LysAla: 2.507 ± 0.644
3.343LysCys: 3.343 ± 1.407
2.507LysAsp: 2.507 ± 1.342
3.761LysGlu: 3.761 ± 1.365
4.179LysPhe: 4.179 ± 1.184
2.925LysGly: 2.925 ± 1.259
1.254LysHis: 1.254 ± 0.514
2.089LysIle: 2.089 ± 1.008
2.089LysLys: 2.089 ± 0.981
2.925LysLeu: 2.925 ± 0.966
0.836LysMet: 0.836 ± 0.458
2.507LysAsn: 2.507 ± 0.97
2.507LysPro: 2.507 ± 1.076
1.672LysGln: 1.672 ± 0.784
5.85LysArg: 5.85 ± 1.077
2.089LysSer: 2.089 ± 1.2
2.089LysThr: 2.089 ± 1.006
3.761LysVal: 3.761 ± 1.322
0.0LysTrp: 0.0 ± 0.0
2.925LysTyr: 2.925 ± 0.607
0.0LysXaa: 0.0 ± 0.0
Leu
5.015LeuAla: 5.015 ± 1.203
4.179LeuCys: 4.179 ± 2.452
4.597LeuAsp: 4.597 ± 0.903
5.85LeuGlu: 5.85 ± 1.698
4.597LeuPhe: 4.597 ± 2.807
5.433LeuGly: 5.433 ± 0.837
3.343LeuHis: 3.343 ± 1.927
3.343LeuIle: 3.343 ± 1.226
5.85LeuLys: 5.85 ± 1.178
8.776LeuLeu: 8.776 ± 3.404
1.672LeuMet: 1.672 ± 1.168
1.672LeuAsn: 1.672 ± 0.639
5.015LeuPro: 5.015 ± 1.43
11.283LeuGln: 11.283 ± 1.564
3.343LeuArg: 3.343 ± 1.391
7.104LeuSer: 7.104 ± 1.721
4.597LeuThr: 4.597 ± 1.298
4.179LeuVal: 4.179 ± 1.081
0.836LeuTrp: 0.836 ± 0.458
5.433LeuTyr: 5.433 ± 1.281
0.0LeuXaa: 0.0 ± 0.0
Met
1.254MetAla: 1.254 ± 0.639
0.836MetCys: 0.836 ± 0.564
2.089MetAsp: 2.089 ± 0.818
0.836MetGlu: 0.836 ± 0.884
0.836MetPhe: 0.836 ± 0.608
1.254MetGly: 1.254 ± 0.488
0.418MetHis: 0.418 ± 0.395
0.418MetIle: 0.418 ± 0.49
1.254MetLys: 1.254 ± 0.415
2.089MetLeu: 2.089 ± 1.267
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.418MetPro: 0.418 ± 0.395
0.418MetGln: 0.418 ± 0.395
1.254MetArg: 1.254 ± 0.631
1.672MetSer: 1.672 ± 0.591
0.418MetThr: 0.418 ± 0.304
1.672MetVal: 1.672 ± 0.717
0.418MetTrp: 0.418 ± 0.442
0.418MetTyr: 0.418 ± 0.442
0.0MetXaa: 0.0 ± 0.0
Asn
2.925AsnAla: 2.925 ± 1.194
0.418AsnCys: 0.418 ± 0.442
0.418AsnAsp: 0.418 ± 0.32
1.254AsnGlu: 1.254 ± 0.847
1.672AsnPhe: 1.672 ± 0.907
2.925AsnGly: 2.925 ± 0.883
0.836AsnHis: 0.836 ± 0.884
1.672AsnIle: 1.672 ± 0.781
2.089AsnLys: 2.089 ± 1.519
1.254AsnLeu: 1.254 ± 0.617
0.836AsnMet: 0.836 ± 0.608
1.254AsnAsn: 1.254 ± 0.484
2.925AsnPro: 2.925 ± 0.841
0.836AsnGln: 0.836 ± 0.397
1.672AsnArg: 1.672 ± 0.795
4.179AsnSer: 4.179 ± 1.58
2.925AsnThr: 2.925 ± 0.621
1.672AsnVal: 1.672 ± 0.513
1.254AsnTrp: 1.254 ± 0.639
0.418AsnTyr: 0.418 ± 0.395
0.0AsnXaa: 0.0 ± 0.0
Pro
6.268ProAla: 6.268 ± 3.543
0.836ProCys: 0.836 ± 0.397
4.179ProAsp: 4.179 ± 1.397
2.507ProGlu: 2.507 ± 1.094
2.089ProPhe: 2.089 ± 0.884
1.254ProGly: 1.254 ± 0.702
0.418ProHis: 0.418 ± 0.442
2.925ProIle: 2.925 ± 1.01
4.597ProLys: 4.597 ± 1.128
10.029ProLeu: 10.029 ± 1.68
0.418ProMet: 0.418 ± 0.395
3.343ProAsn: 3.343 ± 2.004
7.94ProPro: 7.94 ± 2.369
0.418ProGln: 0.418 ± 0.49
3.343ProArg: 3.343 ± 1.34
5.85ProSer: 5.85 ± 1.891
3.761ProThr: 3.761 ± 1.282
3.761ProVal: 3.761 ± 1.647
0.418ProTrp: 0.418 ± 0.395
1.672ProTyr: 1.672 ± 0.989
0.0ProXaa: 0.0 ± 0.0
Gln
1.672GlnAla: 1.672 ± 0.907
0.418GlnCys: 0.418 ± 0.32
2.507GlnAsp: 2.507 ± 0.84
2.089GlnGlu: 2.089 ± 0.901
2.089GlnPhe: 2.089 ± 0.729
2.507GlnGly: 2.507 ± 0.449
1.672GlnHis: 1.672 ± 0.666
0.836GlnIle: 0.836 ± 0.534
2.089GlnLys: 2.089 ± 0.935
4.597GlnLeu: 4.597 ± 1.308
2.089GlnMet: 2.089 ± 0.637
2.089GlnAsn: 2.089 ± 0.722
5.433GlnPro: 5.433 ± 1.303
2.507GlnGln: 2.507 ± 0.996
2.925GlnArg: 2.925 ± 1.316
1.672GlnSer: 1.672 ± 1.149
2.925GlnThr: 2.925 ± 1.176
3.343GlnVal: 3.343 ± 0.678
0.418GlnTrp: 0.418 ± 0.32
1.672GlnTyr: 1.672 ± 0.95
0.0GlnXaa: 0.0 ± 0.0
Arg
5.433ArgAla: 5.433 ± 1.325
2.925ArgCys: 2.925 ± 1.264
1.254ArgAsp: 1.254 ± 0.808
2.507ArgGlu: 2.507 ± 0.696
2.925ArgPhe: 2.925 ± 0.921
1.672ArgGly: 1.672 ± 0.292
2.089ArgHis: 2.089 ± 0.627
1.672ArgIle: 1.672 ± 0.564
4.597ArgLys: 4.597 ± 1.437
6.268ArgLeu: 6.268 ± 1.339
0.418ArgMet: 0.418 ± 0.441
0.836ArgAsn: 0.836 ± 0.652
3.343ArgPro: 3.343 ± 1.466
1.254ArgGln: 1.254 ± 0.69
2.925ArgArg: 2.925 ± 1.409
2.925ArgSer: 2.925 ± 1.127
3.761ArgThr: 3.761 ± 1.478
3.343ArgVal: 3.343 ± 1.35
0.418ArgTrp: 0.418 ± 0.32
1.672ArgTyr: 1.672 ± 0.841
0.0ArgXaa: 0.0 ± 0.0
Ser
5.015SerAla: 5.015 ± 1.333
1.672SerCys: 1.672 ± 1.043
2.925SerAsp: 2.925 ± 1.04
5.85SerGlu: 5.85 ± 2.026
1.254SerPhe: 1.254 ± 0.959
5.85SerGly: 5.85 ± 1.437
1.254SerHis: 1.254 ± 0.364
2.925SerIle: 2.925 ± 1.284
2.089SerLys: 2.089 ± 0.935
6.686SerLeu: 6.686 ± 1.148
2.507SerMet: 2.507 ± 0.742
3.343SerAsn: 3.343 ± 1.697
2.925SerPro: 2.925 ± 1.278
2.507SerGln: 2.507 ± 0.905
3.761SerArg: 3.761 ± 1.224
7.104SerSer: 7.104 ± 1.754
8.776SerThr: 8.776 ± 2.212
6.686SerVal: 6.686 ± 1.141
0.0SerTrp: 0.0 ± 0.0
1.254SerTyr: 1.254 ± 0.575
0.0SerXaa: 0.0 ± 0.0
Thr
1.672ThrAla: 1.672 ± 0.984
1.672ThrCys: 1.672 ± 0.648
1.672ThrAsp: 1.672 ± 0.555
3.343ThrGlu: 3.343 ± 0.588
1.672ThrPhe: 1.672 ± 0.676
7.104ThrGly: 7.104 ± 1.206
1.254ThrHis: 1.254 ± 0.488
1.672ThrIle: 1.672 ± 0.871
0.836ThrLys: 0.836 ± 0.884
7.104ThrLeu: 7.104 ± 1.901
0.836ThrMet: 0.836 ± 0.639
1.254ThrAsn: 1.254 ± 0.421
5.433ThrPro: 5.433 ± 1.573
5.015ThrGln: 5.015 ± 1.373
2.089ThrArg: 2.089 ± 0.636
6.686ThrSer: 6.686 ± 1.747
3.343ThrThr: 3.343 ± 1.74
7.94ThrVal: 7.94 ± 2.908
0.836ThrTrp: 0.836 ± 0.447
1.254ThrTyr: 1.254 ± 0.63
0.0ThrXaa: 0.0 ± 0.0
Val
3.761ValAla: 3.761 ± 1.396
2.507ValCys: 2.507 ± 1.351
5.433ValAsp: 5.433 ± 0.861
6.686ValGlu: 6.686 ± 2.008
2.925ValPhe: 2.925 ± 0.651
2.925ValGly: 2.925 ± 1.121
2.925ValHis: 2.925 ± 1.209
2.925ValIle: 2.925 ± 1.038
2.507ValLys: 2.507 ± 1.181
5.015ValLeu: 5.015 ± 1.815
1.672ValMet: 1.672 ± 0.848
2.089ValAsn: 2.089 ± 0.898
5.015ValPro: 5.015 ± 1.667
2.925ValGln: 2.925 ± 1.559
3.761ValArg: 3.761 ± 0.707
6.686ValSer: 6.686 ± 0.708
6.268ValThr: 6.268 ± 2.577
5.85ValVal: 5.85 ± 1.418
0.836ValTrp: 0.836 ± 0.458
4.179ValTyr: 4.179 ± 1.2
0.0ValXaa: 0.0 ± 0.0
Trp
0.836TrpAla: 0.836 ± 0.397
0.418TrpCys: 0.418 ± 0.32
0.836TrpAsp: 0.836 ± 0.458
0.418TrpGlu: 0.418 ± 0.442
0.418TrpPhe: 0.418 ± 0.32
2.089TrpGly: 2.089 ± 0.82
0.418TrpHis: 0.418 ± 0.442
0.418TrpIle: 0.418 ± 0.32
1.254TrpLys: 1.254 ± 0.639
0.836TrpLeu: 0.836 ± 0.397
0.0TrpMet: 0.0 ± 0.0
0.418TrpAsn: 0.418 ± 0.304
0.0TrpPro: 0.0 ± 0.0
0.418TrpGln: 0.418 ± 0.32
1.254TrpArg: 1.254 ± 0.567
0.836TrpSer: 0.836 ± 0.633
1.672TrpThr: 1.672 ± 1.309
0.836TrpVal: 0.836 ± 0.571
0.0TrpTrp: 0.0 ± 0.0
0.418TrpTyr: 0.418 ± 0.32
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.507TyrAla: 2.507 ± 0.744
1.254TyrCys: 1.254 ± 0.906
2.507TyrAsp: 2.507 ± 0.78
2.507TyrGlu: 2.507 ± 0.942
0.418TyrPhe: 0.418 ± 0.395
4.597TyrGly: 4.597 ± 0.889
0.836TyrHis: 0.836 ± 0.516
2.089TyrIle: 2.089 ± 0.838
2.089TyrLys: 2.089 ± 0.82
2.507TyrLeu: 2.507 ± 0.865
1.254TyrMet: 1.254 ± 0.654
0.0TyrAsn: 0.0 ± 0.0
0.836TyrPro: 0.836 ± 0.589
0.836TyrGln: 0.836 ± 0.376
2.925TyrArg: 2.925 ± 0.858
1.672TyrSer: 1.672 ± 1.133
1.254TyrThr: 1.254 ± 0.625
2.925TyrVal: 2.925 ± 0.74
0.836TyrTrp: 0.836 ± 0.397
2.507TyrTyr: 2.507 ± 0.866
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2394 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski