Amino acid dipeptide frequency for Ailuropoda melanoleuca papillomavirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
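A minimal sketch of how such dipeptide frequencies could be counted is shown here. It assumes one-letter sequence strings, counts overlapping pairs within each protein only, and normalizes by the total number of pairs; the function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the exact normalization used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: pairs are counted inside each protein and never across
    protein boundaries; the database may normalize differently.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not taken from this proteome.
toy = ["MAAL", "AAK"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

In the toy input there are five overlapping pairs in total, two of which are AA, so AA accounts for two fifths of all dipeptides.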

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25%, i.e. 2.5‰ (about 2.27‰ if Xaa is included). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
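The expected value and the over/under-representation check can be sketched as follows. This is only an illustration: the function names are hypothetical, and the exact threshold the page uses for red and blue marking is not stated, so a plain comparison against the uniform expectation is assumed.

```python
def expected_uniform_per_mille(n_residue_types=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, n_residue_types=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_uniform_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(expected_uniform_per_mille(20))            # 400 possible dipeptides
print(round(expected_uniform_per_mille(21), 3))  # 441 when Xaa is included
print(classify(10.85))   # LeuLeu value from the table
print(classify(0.452))   # AlaHis value from the table
```

With 20 amino acids the expectation is 2.5‰, so LeuLeu (10.85‰) falls above it and AlaHis (0.452‰) below it.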

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
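As a small worked example of the unit conversion, the snippet below turns a per-mille value into a fraction and a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 7.685  # AlaAla, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.007685
percent = per_mille_to_percent(ala_ala)    # about 0.7685 %
print(f"{fraction:.6f} {percent:.4f}%")
```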

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.685 ± 1.897
AlaCys: 2.26 ± 1.294
AlaAsp: 3.617 ± 1.64
AlaGlu: 4.069 ± 0.94
AlaPhe: 2.26 ± 0.809
AlaGly: 4.521 ± 1.593
AlaHis: 0.452 ± 0.402
AlaIle: 1.808 ± 0.534
AlaLys: 6.329 ± 2.078
AlaLeu: 3.617 ± 1.278
AlaMet: 2.26 ± 0.403
AlaAsn: 2.26 ± 1.023
AlaPro: 3.617 ± 1.536
AlaGln: 1.808 ± 0.865
AlaArg: 2.26 ± 1.09
AlaSer: 8.137 ± 2.44
AlaThr: 5.877 ± 1.661
AlaVal: 4.973 ± 1.024
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.808 ± 0.755
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.356 ± 0.747
CysCys: 0.0 ± 0.0
CysAsp: 0.904 ± 0.439
CysGlu: 3.165 ± 1.25
CysPhe: 1.356 ± 0.682
CysGly: 1.808 ± 0.929
CysHis: 0.452 ± 0.375
CysIle: 0.0 ± 0.0
CysLys: 1.356 ± 0.659
CysLeu: 0.452 ± 0.561
CysMet: 0.904 ± 0.68
CysAsn: 0.904 ± 0.68
CysPro: 2.26 ± 0.315
CysGln: 0.904 ± 0.68
CysArg: 0.452 ± 0.402
CysSer: 2.26 ± 1.216
CysThr: 0.904 ± 0.439
CysVal: 1.808 ± 1.211
CysTrp: 2.26 ± 0.738
CysTyr: 0.904 ± 0.697
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.165 ± 1.642
AspCys: 2.26 ± 0.738
AspAsp: 4.069 ± 1.822
AspGlu: 3.165 ± 1.515
AspPhe: 2.26 ± 0.669
AspGly: 4.521 ± 1.155
AspHis: 0.0 ± 0.0
AspIle: 2.26 ± 0.84
AspLys: 3.165 ± 1.41
AspLeu: 4.069 ± 0.876
AspMet: 1.808 ± 0.866
AspAsn: 4.521 ± 1.446
AspPro: 4.521 ± 1.65
AspGln: 1.808 ± 0.642
AspArg: 1.808 ± 0.975
AspSer: 4.069 ± 0.696
AspThr: 4.521 ± 1.118
AspVal: 2.26 ± 1.152
AspTrp: 0.0 ± 0.0
AspTyr: 1.356 ± 0.703
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.973 ± 1.015
GluCys: 0.904 ± 0.68
GluAsp: 4.069 ± 0.634
GluGlu: 2.712 ± 0.984
GluPhe: 1.356 ± 0.608
GluGly: 4.069 ± 1.257
GluHis: 1.356 ± 0.386
GluIle: 0.904 ± 0.792
GluLys: 4.521 ± 1.584
GluLeu: 3.165 ± 1.067
GluMet: 2.26 ± 1.087
GluAsn: 3.165 ± 1.081
GluPro: 1.356 ± 0.771
GluGln: 5.425 ± 0.525
GluArg: 1.808 ± 0.585
GluSer: 3.617 ± 1.283
GluThr: 2.712 ± 1.157
GluVal: 4.069 ± 0.83
GluTrp: 0.452 ± 0.34
GluTyr: 2.26 ± 0.738
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.452 ± 0.375
PheCys: 0.904 ± 0.605
PheAsp: 3.165 ± 0.869
PheGlu: 0.904 ± 0.384
PhePhe: 2.26 ± 0.7
PheGly: 2.26 ± 0.714
PheHis: 0.452 ± 0.375
PheIle: 1.356 ± 0.682
PheLys: 4.973 ± 1.063
PheLeu: 3.165 ± 1.171
PheMet: 1.808 ± 0.757
PheAsn: 1.808 ± 0.949
PhePro: 2.26 ± 0.669
PheGln: 0.0 ± 0.0
PheArg: 2.26 ± 0.676
PheSer: 3.617 ± 1.64
PheThr: 1.356 ± 0.608
PheVal: 1.808 ± 1.073
PheTrp: 2.26 ± 0.489
PheTyr: 3.165 ± 0.915
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.521 ± 1.242
GlyCys: 2.26 ± 1.023
GlyAsp: 4.973 ± 1.219
GlyGlu: 4.973 ± 0.898
GlyPhe: 2.26 ± 0.669
GlyGly: 5.877 ± 2.749
GlyHis: 2.26 ± 0.696
GlyIle: 2.26 ± 0.792
GlyLys: 2.712 ± 1.042
GlyLeu: 4.069 ± 1.038
GlyMet: 0.452 ± 0.34
GlyAsn: 3.165 ± 0.931
GlyPro: 1.356 ± 0.702
GlyGln: 3.165 ± 1.081
GlyArg: 3.165 ± 0.577
GlySer: 7.233 ± 1.818
GlyThr: 3.617 ± 1.117
GlyVal: 8.137 ± 2.018
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.712 ± 0.873
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.904 ± 0.384
HisCys: 0.904 ± 0.433
HisAsp: 0.452 ± 0.396
HisGlu: 1.808 ± 0.975
HisPhe: 1.808 ± 0.648
HisGly: 1.356 ± 1.207
HisHis: 0.0 ± 0.0
HisIle: 0.452 ± 0.34
HisLys: 1.808 ± 1.149
HisLeu: 1.356 ± 0.771
HisMet: 0.0 ± 0.0
HisAsn: 0.452 ± 0.375
HisPro: 1.808 ± 0.949
HisGln: 0.904 ± 0.439
HisArg: 1.356 ± 0.805
HisSer: 0.904 ± 0.68
HisThr: 2.26 ± 0.475
HisVal: 0.904 ± 0.439
HisTrp: 0.904 ± 0.612
HisTyr: 1.356 ± 0.846
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.712 ± 0.856
IleCys: 0.452 ± 0.375
IleAsp: 2.26 ± 0.315
IleGlu: 3.617 ± 0.71
IlePhe: 1.356 ± 0.758
IleGly: 3.617 ± 2.33
IleHis: 0.452 ± 0.34
IleIle: 0.904 ± 0.792
IleLys: 1.356 ± 0.735
IleLeu: 3.165 ± 0.451
IleMet: 0.452 ± 0.375
IleAsn: 0.452 ± 0.375
IlePro: 2.712 ± 1.404
IleGln: 2.712 ± 1.363
IleArg: 2.26 ± 1.65
IleSer: 2.26 ± 0.676
IleThr: 3.165 ± 0.899
IleVal: 2.26 ± 1.457
IleTrp: 0.0 ± 0.0
IleTyr: 0.904 ± 0.433
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.425 ± 1.362
LysCys: 1.356 ± 0.673
LysAsp: 0.904 ± 0.439
LysGlu: 3.617 ± 1.244
LysPhe: 2.712 ± 1.297
LysGly: 4.069 ± 1.249
LysHis: 0.904 ± 0.68
LysIle: 1.356 ± 0.795
LysLys: 3.165 ± 1.799
LysLeu: 3.617 ± 0.945
LysMet: 1.356 ± 0.747
LysAsn: 2.712 ± 0.93
LysPro: 1.808 ± 0.929
LysGln: 3.165 ± 0.818
LysArg: 6.329 ± 0.96
LysSer: 3.617 ± 1.495
LysThr: 3.165 ± 1.275
LysVal: 4.521 ± 1.481
LysTrp: 1.356 ± 0.673
LysTyr: 4.973 ± 0.668
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.617 ± 1.062
LeuCys: 2.26 ± 1.216
LeuAsp: 3.617 ± 1.05
LeuGlu: 4.069 ± 0.859
LeuPhe: 3.617 ± 1.109
LeuGly: 5.425 ± 1.267
LeuHis: 3.165 ± 1.002
LeuIle: 2.26 ± 0.752
LeuLys: 6.329 ± 1.547
LeuLeu: 10.85 ± 3.909
LeuMet: 0.904 ± 0.609
LeuAsn: 3.165 ± 0.567
LeuPro: 3.617 ± 1.803
LeuGln: 6.781 ± 0.834
LeuArg: 0.452 ± 0.375
LeuSer: 5.425 ± 0.991
LeuThr: 4.069 ± 1.56
LeuVal: 4.069 ± 1.645
LeuTrp: 0.452 ± 0.375
LeuTyr: 3.165 ± 0.864
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.617 ± 1.065
MetCys: 0.452 ± 0.561
MetAsp: 1.808 ± 0.622
MetGlu: 0.452 ± 0.34
MetPhe: 0.904 ± 0.805
MetGly: 0.452 ± 0.34
MetHis: 0.452 ± 0.402
MetIle: 1.356 ± 0.735
MetLys: 0.452 ± 0.34
MetLeu: 1.356 ± 0.597
MetMet: 0.452 ± 0.402
MetAsn: 0.452 ± 0.34
MetPro: 0.452 ± 0.34
MetGln: 0.904 ± 0.433
MetArg: 1.356 ± 0.675
MetSer: 0.904 ± 0.384
MetThr: 0.452 ± 0.402
MetVal: 2.26 ± 0.827
MetTrp: 0.0 ± 0.0
MetTyr: 0.452 ± 0.402
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.069 ± 1.543
AsnCys: 1.356 ± 0.806
AsnAsp: 1.356 ± 0.383
AsnGlu: 3.165 ± 0.955
AsnPhe: 2.26 ± 0.738
AsnGly: 2.712 ± 1.363
AsnHis: 0.0 ± 0.0
AsnIle: 4.069 ± 0.978
AsnLys: 2.712 ± 1.407
AsnLeu: 1.808 ± 0.949
AsnMet: 0.904 ± 0.432
AsnAsn: 2.26 ± 0.995
AsnPro: 4.069 ± 1.641
AsnGln: 1.356 ± 0.703
AsnArg: 2.712 ± 0.773
AsnSer: 2.712 ± 0.564
AsnThr: 1.808 ± 0.622
AsnVal: 3.165 ± 0.988
AsnTrp: 0.904 ± 0.432
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.808 ± 1.156
ProCys: 0.0 ± 0.0
ProAsp: 4.521 ± 1.583
ProGlu: 1.808 ± 0.978
ProPhe: 2.26 ± 1.229
ProGly: 3.165 ± 0.463
ProHis: 0.904 ± 0.475
ProIle: 4.069 ± 1.63
ProLys: 3.165 ± 0.652
ProLeu: 7.685 ± 1.416
ProMet: 0.0 ± 0.0
ProAsn: 2.712 ± 0.766
ProPro: 6.781 ± 1.505
ProGln: 2.26 ± 1.071
ProArg: 2.26 ± 0.653
ProSer: 5.877 ± 1.694
ProThr: 3.165 ± 1.15
ProVal: 4.069 ± 2.415
ProTrp: 0.452 ± 0.402
ProTyr: 3.617 ± 1.847
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.356 ± 0.383
GlnCys: 0.452 ± 0.34
GlnAsp: 2.712 ± 1.542
GlnGlu: 1.808 ± 0.866
GlnPhe: 0.904 ± 0.432
GlnGly: 1.808 ± 0.622
GlnHis: 3.165 ± 1.404
GlnIle: 2.712 ± 0.595
GlnLys: 2.26 ± 0.79
GlnLeu: 6.329 ± 0.628
GlnMet: 0.0 ± 0.0
GlnAsn: 1.808 ± 1.083
GlnPro: 3.617 ± 0.831
GlnGln: 8.59 ± 5.682
GlnArg: 0.904 ± 0.805
GlnSer: 3.165 ± 0.743
GlnThr: 3.165 ± 0.96
GlnVal: 2.26 ± 0.315
GlnTrp: 1.356 ± 1.021
GlnTyr: 0.452 ± 0.375
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.521 ± 1.268
ArgCys: 1.808 ± 0.755
ArgAsp: 1.356 ± 1.207
ArgGlu: 1.356 ± 0.771
ArgPhe: 2.712 ± 1.236
ArgGly: 3.165 ± 0.703
ArgHis: 1.356 ± 0.703
ArgIle: 1.356 ± 0.789
ArgLys: 4.069 ± 0.876
ArgLeu: 4.973 ± 1.507
ArgMet: 0.452 ± 0.367
ArgAsn: 1.356 ± 0.608
ArgPro: 3.165 ± 0.743
ArgGln: 3.165 ± 0.818
ArgArg: 4.521 ± 1.342
ArgSer: 4.973 ± 1.603
ArgThr: 1.356 ± 0.702
ArgVal: 3.165 ± 0.788
ArgTrp: 0.904 ± 0.605
ArgTyr: 0.904 ± 0.433
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.233 ± 1.751
SerCys: 0.904 ± 0.432
SerAsp: 4.973 ± 1.427
SerGlu: 2.712 ± 0.766
SerPhe: 2.712 ± 1.508
SerGly: 7.233 ± 1.32
SerHis: 0.904 ± 0.384
SerIle: 2.26 ± 0.822
SerLys: 4.521 ± 0.686
SerLeu: 3.165 ± 0.605
SerMet: 0.904 ± 0.432
SerAsn: 4.069 ± 1.061
SerPro: 4.521 ± 1.645
SerGln: 1.808 ± 0.643
SerArg: 3.617 ± 1.028
SerSer: 10.398 ± 3.099
SerThr: 10.398 ± 1.725
SerVal: 7.685 ± 2.25
SerTrp: 0.904 ± 0.605
SerTyr: 0.452 ± 0.34
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.425 ± 1.246
ThrCys: 2.712 ± 0.411
ThrAsp: 4.069 ± 0.551
ThrGlu: 2.712 ± 1.35
ThrPhe: 2.26 ± 1.26
ThrGly: 3.165 ± 0.869
ThrHis: 1.356 ± 0.771
ThrIle: 2.26 ± 1.26
ThrLys: 3.165 ± 0.439
ThrLeu: 4.069 ± 1.092
ThrMet: 1.808 ± 0.865
ThrAsn: 2.712 ± 0.841
ThrPro: 4.521 ± 1.17
ThrGln: 1.356 ± 0.44
ThrArg: 4.521 ± 1.692
ThrSer: 5.425 ± 2.023
ThrThr: 3.165 ± 1.285
ThrVal: 4.973 ± 1.526
ThrTrp: 0.904 ± 0.439
ThrTyr: 2.26 ± 0.995
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.617 ± 0.866
ValCys: 2.712 ± 1.2
ValAsp: 2.712 ± 1.058
ValGlu: 6.329 ± 1.607
ValPhe: 4.521 ± 1.089
ValGly: 4.973 ± 1.369
ValHis: 3.617 ± 0.764
ValIle: 2.712 ± 0.954
ValLys: 1.356 ± 0.383
ValLeu: 4.069 ± 0.772
ValMet: 0.904 ± 0.697
ValAsn: 2.26 ± 0.876
ValPro: 6.329 ± 2.441
ValGln: 2.26 ± 0.443
ValArg: 4.069 ± 0.889
ValSer: 5.877 ± 1.966
ValThr: 4.069 ± 1.062
ValVal: 4.069 ± 0.937
ValTrp: 1.356 ± 0.656
ValTyr: 2.26 ± 0.443
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.356 ± 0.682
TrpCys: 0.0 ± 0.0
TrpAsp: 0.904 ± 0.439
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 1.808 ± 0.879
TrpHis: 0.0 ± 0.0
TrpIle: 1.356 ± 0.682
TrpLys: 0.904 ± 0.68
TrpLeu: 2.712 ± 0.867
TrpMet: 0.452 ± 0.34
TrpAsn: 1.356 ± 1.126
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.808 ± 1.66
TrpSer: 0.452 ± 0.561
TrpThr: 1.808 ± 1.61
TrpVal: 0.452 ± 0.34
TrpTrp: 0.452 ± 0.561
TrpTyr: 0.904 ± 0.439
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.26 ± 0.676
TyrCys: 0.452 ± 0.375
TyrAsp: 3.165 ± 0.451
TyrGlu: 2.712 ± 1.276
TyrPhe: 0.452 ± 0.402
TyrGly: 3.165 ± 0.567
TyrHis: 0.452 ± 0.396
TyrIle: 1.356 ± 0.386
TyrLys: 1.808 ± 0.573
TyrLeu: 3.617 ± 1.05
TyrMet: 0.452 ± 0.34
TyrAsn: 1.808 ± 0.949
TyrPro: 1.808 ± 1.223
TyrGln: 0.452 ± 0.34
TyrArg: 2.712 ± 0.538
TyrSer: 0.452 ± 0.375
TyrThr: 1.808 ± 1.073
TyrVal: 3.165 ± 0.833
TyrTrp: 1.808 ± 0.66
TyrTyr: 2.26 ± 1.398
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2213 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
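A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is an illustration under stated assumptions, not the database's actual procedure: the function names are hypothetical, the error is taken to be the sample standard deviation, and the toy sequences are invented.

```python
import random
import statistics
from collections import Counter


def pair_freq_per_mille(sequences, pair):
    """Per-mille frequency of one dipeptide over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each resample draws len(sequences) whole proteins with
    replacement; the source only states "bootstrapping (x100) at the
    protein level".
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_freq_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not the real sequences.
toy_proteome = ["MAAL", "AAK", "GAAG", "MKLV", "LLSA"]
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {pair_freq_per_mille(toy_proteome, 'AA'):.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is why a proteome with only 5 proteins can show fairly large errors.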

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski