Amino acid dipeptide frequency for Phocoena spinipinnis papillomavirus (isolate Burmeister's porpoise/Peru/PsPV1) (PsPV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide should be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
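As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice to map any non-standard letter to `X` (Xaa) are assumptions made for this example; the exact counting procedure used for the table is not described on this page.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (assumed mapping)


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Pairs are counted with a sliding window inside each sequence, so no
    pair spans two different proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X.
        seq = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy input, not taken from the PsPV proteome.
freqs = dipeptide_per_mille(["MAAGLL", "PPSTAA"])
uniform_expectation = 1000.0 / 400  # 2.5 per mille for 400 equally likely pairs
```

With real data, the input would be the list of protein sequences of the proteome, and each value could then be compared with the uniform expectation of 2.5‰.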

All values are presented in per mille (‰); to convert them to fractions, multiply by 10^-3.
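The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, and the assumption that cells are colored by a simple comparison against 2.5‰ (1/400) is ours; the page does not state the exact threshold it uses.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25% expressed in per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=UNIFORM_PER_MILLE):
    """Classify a per mille value relative to the expected frequency."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values taken from the table: ProPro = 10.95, AlaTrp = 0.391.
pro_pro_fraction = per_mille_to_fraction(10.95)
pro_pro_class = representation(10.95)
ala_trp_class = representation(0.391)
```

For example, the ProPro value of 10.95‰ corresponds to a fraction of about 0.01095, which is well above the uniform expectation, while AlaTrp at 0.391‰ falls below it.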

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.866 ± 0.826
AlaCys: 0.782 ± 0.465
AlaAsp: 3.52 ± 0.855
AlaGlu: 5.866 ± 1.003
AlaPhe: 3.52 ± 0.898
AlaGly: 2.738 ± 0.848
AlaHis: 1.564 ± 0.916
AlaIle: 2.738 ± 0.707
AlaLys: 4.693 ± 1.505
AlaLeu: 3.911 ± 1.476
AlaMet: 1.955 ± 0.748
AlaAsn: 1.955 ± 0.799
AlaPro: 4.693 ± 0.989
AlaGln: 5.084 ± 0.869
AlaArg: 3.129 ± 0.641
AlaSer: 5.475 ± 0.982
AlaThr: 2.738 ± 1.425
AlaVal: 2.346 ± 1.022
AlaTrp: 0.391 ± 0.414
AlaTyr: 3.52 ± 0.748
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.955 ± 0.676
CysCys: 0.391 ± 0.414
CysAsp: 1.173 ± 0.74
CysGlu: 0.782 ± 0.758
CysPhe: 0.391 ± 0.414
CysGly: 0.782 ± 0.417
CysHis: 0.0 ± 0.0
CysIle: 0.782 ± 0.993
CysLys: 1.955 ± 1.029
CysLeu: 4.302 ± 1.857
CysMet: 0.391 ± 0.408
CysAsn: 0.0 ± 0.0
CysPro: 2.738 ± 0.459
CysGln: 0.0 ± 0.0
CysArg: 2.346 ± 0.755
CysSer: 3.129 ± 1.122
CysThr: 1.955 ± 0.883
CysVal: 1.955 ± 0.949
CysTrp: 1.564 ± 0.593
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.911 ± 0.82
AspCys: 2.346 ± 0.925
AspAsp: 2.738 ± 1.255
AspGlu: 1.564 ± 0.854
AspPhe: 1.564 ± 0.466
AspGly: 5.084 ± 1.121
AspHis: 0.782 ± 0.418
AspIle: 3.52 ± 1.184
AspLys: 2.738 ± 0.929
AspLeu: 4.693 ± 1.015
AspMet: 1.955 ± 0.629
AspAsn: 2.346 ± 0.506
AspPro: 7.431 ± 1.691
AspGln: 0.782 ± 0.402
AspArg: 1.173 ± 0.32
AspSer: 5.475 ± 1.417
AspThr: 3.52 ± 0.666
AspVal: 3.911 ± 1.312
AspTrp: 1.173 ± 0.894
AspTyr: 1.173 ± 0.894
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.866 ± 1.581
GluCys: 0.391 ± 0.298
GluAsp: 3.911 ± 1.36
GluGlu: 5.084 ± 1.227
GluPhe: 2.346 ± 0.64
GluGly: 6.257 ± 0.918
GluHis: 1.564 ± 0.655
GluIle: 1.955 ± 0.879
GluLys: 2.738 ± 1.262
GluLeu: 3.129 ± 0.966
GluMet: 0.782 ± 0.417
GluAsn: 1.955 ± 0.436
GluPro: 2.738 ± 1.079
GluGln: 2.738 ± 0.55
GluArg: 1.564 ± 0.407
GluSer: 6.257 ± 1.588
GluThr: 3.129 ± 1.222
GluVal: 3.129 ± 0.952
GluTrp: 0.782 ± 0.417
GluTyr: 1.564 ± 0.258
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.129 ± 0.854
PheCys: 1.173 ± 0.987
PheAsp: 3.129 ± 0.685
PheGlu: 1.564 ± 1.007
PhePhe: 0.782 ± 0.596
PheGly: 2.738 ± 0.427
PheHis: 0.0 ± 0.0
PheIle: 1.955 ± 0.644
PheLys: 1.173 ± 0.514
PheLeu: 4.693 ± 1.667
PheMet: 1.173 ± 0.543
PheAsn: 0.782 ± 0.375
PhePro: 1.955 ± 0.555
PheGln: 3.129 ± 0.406
PheArg: 0.0 ± 0.0
PheSer: 2.346 ± 0.781
PheThr: 1.955 ± 0.533
PheVal: 1.955 ± 1.213
PheTrp: 0.782 ± 0.596
PheTyr: 3.129 ± 1.029
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.302 ± 1.68
GlyCys: 1.955 ± 0.302
GlyAsp: 5.866 ± 1.047
GlyGlu: 7.039 ± 1.372
GlyPhe: 2.346 ± 1.227
GlyGly: 8.213 ± 2.6
GlyHis: 1.173 ± 0.642
GlyIle: 2.346 ± 0.885
GlyLys: 2.346 ± 0.956
GlyLeu: 4.302 ± 0.647
GlyMet: 0.391 ± 0.298
GlyAsn: 3.52 ± 0.955
GlyPro: 5.084 ± 2.257
GlyGln: 2.346 ± 1.081
GlyArg: 4.693 ± 0.638
GlySer: 7.039 ± 2.335
GlyThr: 4.693 ± 0.773
GlyVal: 2.346 ± 0.835
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.955 ± 0.579
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.173 ± 0.581
HisCys: 0.0 ± 0.0
HisAsp: 0.782 ± 0.447
HisGlu: 1.173 ± 0.403
HisPhe: 2.738 ± 0.7
HisGly: 0.391 ± 0.298
HisHis: 3.911 ± 4.051
HisIle: 0.391 ± 0.298
HisLys: 0.782 ± 0.402
HisLeu: 2.346 ± 1.687
HisMet: 0.782 ± 0.595
HisAsn: 0.391 ± 0.347
HisPro: 3.52 ± 0.947
HisGln: 1.564 ± 0.844
HisArg: 1.955 ± 1.006
HisSer: 1.955 ± 0.652
HisThr: 0.782 ± 0.447
HisVal: 1.564 ± 1.192
HisTrp: 0.391 ± 0.298
HisTyr: 0.782 ± 0.402
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.346 ± 0.853
IleCys: 1.955 ± 0.989
IleAsp: 1.564 ± 0.258
IleGlu: 2.346 ± 0.658
IlePhe: 0.782 ± 0.595
IleGly: 1.955 ± 0.379
IleHis: 1.173 ± 0.927
IleIle: 1.955 ± 0.434
IleLys: 1.564 ± 0.433
IleLeu: 1.955 ± 0.626
IleMet: 0.782 ± 0.648
IleAsn: 1.173 ± 0.653
IlePro: 1.955 ± 1.135
IleGln: 1.173 ± 0.581
IleArg: 1.173 ± 0.32
IleSer: 5.866 ± 1.288
IleThr: 2.346 ± 0.472
IleVal: 1.564 ± 0.494
IleTrp: 0.0 ± 0.0
IleTyr: 3.129 ± 0.813
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.129 ± 0.991
LysCys: 2.346 ± 0.873
LysAsp: 3.129 ± 1.155
LysGlu: 0.391 ± 0.298
LysPhe: 1.955 ± 0.748
LysGly: 1.173 ± 0.818
LysHis: 1.955 ± 1.029
LysIle: 1.955 ± 1.029
LysLys: 1.173 ± 0.74
LysLeu: 2.346 ± 0.956
LysMet: 0.391 ± 0.298
LysAsn: 1.955 ± 0.867
LysPro: 2.346 ± 0.592
LysGln: 2.738 ± 1.394
LysArg: 5.084 ± 0.574
LysSer: 3.129 ± 0.954
LysThr: 2.738 ± 1.202
LysVal: 4.302 ± 1.522
LysTrp: 1.173 ± 0.32
LysTyr: 1.173 ± 0.435
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.475 ± 0.94
LeuCys: 3.129 ± 0.958
LeuAsp: 6.648 ± 0.988
LeuGlu: 4.693 ± 3.26
LeuPhe: 3.911 ± 1.564
LeuGly: 5.084 ± 0.763
LeuHis: 3.129 ± 2.281
LeuIle: 3.911 ± 1.248
LeuLys: 3.911 ± 1.81
LeuLeu: 8.995 ± 2.611
LeuMet: 1.955 ± 0.692
LeuAsn: 1.955 ± 0.302
LeuPro: 4.693 ± 1.277
LeuGln: 5.475 ± 1.025
LeuArg: 6.257 ± 1.504
LeuSer: 7.039 ± 1.392
LeuThr: 3.911 ± 1.042
LeuVal: 4.693 ± 0.638
LeuTrp: 0.782 ± 0.624
LeuTyr: 2.738 ± 1.28
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.173 ± 0.663
MetCys: 1.173 ± 0.435
MetAsp: 1.173 ± 0.504
MetGlu: 1.955 ± 1.034
MetPhe: 0.391 ± 0.298
MetGly: 0.391 ± 0.347
MetHis: 0.782 ± 0.417
MetIle: 0.0 ± 0.0
MetLys: 0.391 ± 0.298
MetLeu: 1.564 ± 0.466
MetMet: 0.0 ± 0.0
MetAsn: 0.782 ± 0.402
MetPro: 0.0 ± 0.0
MetGln: 0.782 ± 0.596
MetArg: 0.782 ± 0.483
MetSer: 1.955 ± 0.528
MetThr: 0.391 ± 0.414
MetVal: 2.346 ± 1.027
MetTrp: 0.782 ± 0.595
MetTyr: 1.173 ± 0.74
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.911 ± 0.67
AsnCys: 1.173 ± 0.688
AsnAsp: 1.564 ± 0.533
AsnGlu: 0.391 ± 0.298
AsnPhe: 0.391 ± 0.379
AsnGly: 3.52 ± 0.886
AsnHis: 0.0 ± 0.0
AsnIle: 1.955 ± 0.639
AsnLys: 1.564 ± 0.661
AsnLeu: 2.738 ± 1.162
AsnMet: 0.782 ± 0.402
AsnAsn: 1.564 ± 1.191
AsnPro: 0.391 ± 0.298
AsnGln: 0.782 ± 0.418
AsnArg: 1.564 ± 0.844
AsnSer: 3.52 ± 1.012
AsnThr: 2.346 ± 0.873
AsnVal: 3.52 ± 0.628
AsnTrp: 0.782 ± 0.417
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.475 ± 1.431
ProCys: 1.173 ± 0.513
ProAsp: 5.084 ± 1.641
ProGlu: 4.693 ± 1.107
ProPhe: 1.564 ± 0.835
ProGly: 3.911 ± 1.895
ProHis: 1.955 ± 1.059
ProIle: 1.173 ± 0.504
ProLys: 4.693 ± 1.208
ProLeu: 8.213 ± 1.693
ProMet: 1.564 ± 0.808
ProAsn: 2.738 ± 0.451
ProPro: 10.95 ± 3.889
ProGln: 2.346 ± 1.765
ProArg: 1.955 ± 0.666
ProSer: 7.431 ± 3.006
ProThr: 3.129 ± 1.159
ProVal: 3.911 ± 1.711
ProTrp: 1.564 ± 0.905
ProTyr: 1.955 ± 1.166
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.129 ± 0.406
GlnCys: 0.391 ± 0.379
GlnAsp: 3.129 ± 1.029
GlnGlu: 3.129 ± 1.132
GlnPhe: 0.391 ± 0.414
GlnGly: 1.955 ± 0.616
GlnHis: 0.391 ± 0.45
GlnIle: 0.782 ± 0.483
GlnLys: 1.173 ± 0.618
GlnLeu: 5.084 ± 1.029
GlnMet: 1.955 ± 0.867
GlnAsn: 0.782 ± 0.418
GlnPro: 1.955 ± 0.879
GlnGln: 1.955 ± 0.543
GlnArg: 3.129 ± 1.564
GlnSer: 2.346 ± 1.326
GlnThr: 1.173 ± 0.403
GlnVal: 4.693 ± 0.951
GlnTrp: 1.173 ± 0.618
GlnTyr: 0.391 ± 0.298
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.52 ± 1.101
ArgCys: 1.564 ± 0.865
ArgAsp: 0.782 ± 0.758
ArgGlu: 3.129 ± 0.566
ArgPhe: 2.346 ± 0.658
ArgGly: 5.475 ± 0.647
ArgHis: 2.738 ± 0.938
ArgIle: 0.391 ± 0.393
ArgLys: 3.911 ± 0.866
ArgLeu: 3.911 ± 0.808
ArgMet: 0.391 ± 0.259
ArgAsn: 1.955 ± 0.906
ArgPro: 5.866 ± 2.013
ArgGln: 1.564 ± 0.655
ArgArg: 7.039 ± 2.302
ArgSer: 3.911 ± 0.612
ArgThr: 2.346 ± 1.252
ArgVal: 4.302 ± 2.481
ArgTrp: 0.391 ± 0.298
ArgTyr: 1.564 ± 0.503
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.129 ± 1.064
SerCys: 2.346 ± 1.161
SerAsp: 5.084 ± 2.055
SerGlu: 5.475 ± 0.855
SerPhe: 5.084 ± 1.672
SerGly: 7.431 ± 1.501
SerHis: 2.346 ± 0.781
SerIle: 3.52 ± 1.914
SerLys: 2.738 ± 1.422
SerLeu: 10.95 ± 1.228
SerMet: 1.173 ± 0.434
SerAsn: 2.738 ± 0.7
SerPro: 5.866 ± 1.602
SerGln: 1.564 ± 0.591
SerArg: 3.129 ± 1.029
SerSer: 7.822 ± 2.341
SerThr: 10.95 ± 2.292
SerVal: 5.084 ± 0.947
SerTrp: 1.173 ± 0.867
SerTyr: 2.738 ± 0.727
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.911 ± 1.706
ThrCys: 1.564 ± 0.71
ThrAsp: 3.129 ± 0.759
ThrGlu: 3.129 ± 1.182
ThrPhe: 3.129 ± 0.819
ThrGly: 6.257 ± 1.212
ThrHis: 1.173 ± 0.509
ThrIle: 3.129 ± 1.106
ThrLys: 1.564 ± 0.685
ThrLeu: 3.911 ± 1.162
ThrMet: 0.0 ± 0.0
ThrAsn: 2.346 ± 0.811
ThrPro: 7.431 ± 1.52
ThrGln: 1.564 ± 0.729
ThrArg: 2.738 ± 0.969
ThrSer: 5.084 ± 1.005
ThrThr: 5.084 ± 1.756
ThrVal: 5.084 ± 1.379
ThrTrp: 1.173 ± 0.346
ThrTyr: 1.173 ± 0.773
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.955 ± 0.525
ValCys: 2.346 ± 1.134
ValAsp: 3.129 ± 1.052
ValGlu: 3.129 ± 1.221
ValPhe: 2.346 ± 0.615
ValGly: 5.475 ± 1.836
ValHis: 1.955 ± 1.364
ValIle: 0.782 ± 0.588
ValLys: 2.738 ± 0.692
ValLeu: 7.039 ± 1.048
ValMet: 0.0 ± 0.0
ValAsn: 2.346 ± 0.827
ValPro: 4.693 ± 0.834
ValGln: 2.738 ± 1.133
ValArg: 5.084 ± 1.752
ValSer: 7.039 ± 1.469
ValThr: 5.475 ± 1.109
ValVal: 2.738 ± 1.079
ValTrp: 0.782 ± 0.465
ValTyr: 0.391 ± 0.379
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.564 ± 0.533
TrpCys: 0.0 ± 0.0
TrpAsp: 0.782 ± 0.402
TrpGlu: 1.173 ± 0.542
TrpPhe: 0.782 ± 0.375
TrpGly: 1.173 ± 0.514
TrpHis: 0.0 ± 0.0
TrpIle: 1.564 ± 0.618
TrpLys: 1.173 ± 0.894
TrpLeu: 1.173 ± 0.513
TrpMet: 0.0 ± 0.0
TrpAsn: 0.782 ± 0.595
TrpPro: 0.0 ± 0.0
TrpGln: 0.391 ± 0.62
TrpArg: 1.173 ± 0.849
TrpSer: 1.564 ± 1.161
TrpThr: 1.564 ± 0.972
TrpVal: 0.782 ± 0.596
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.782 ± 0.417
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.564 ± 0.512
TyrCys: 0.391 ± 0.534
TyrAsp: 1.564 ± 0.618
TyrGlu: 1.173 ± 0.893
TyrPhe: 0.782 ± 0.465
TyrGly: 1.955 ± 1.012
TyrHis: 0.782 ± 0.375
TyrIle: 2.346 ± 1.584
TyrLys: 1.564 ± 0.877
TyrLeu: 3.129 ± 0.83
TyrMet: 1.173 ± 0.722
TyrAsn: 0.391 ± 0.298
TyrPro: 1.173 ± 0.658
TyrGln: 0.391 ± 0.298
TyrArg: 3.129 ± 1.064
TyrSer: 1.955 ± 0.927
TyrThr: 2.346 ± 0.859
TyrVal: 1.955 ± 0.927
TyrTrp: 1.564 ± 0.916
TyrTyr: 1.564 ± 0.806
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2558 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
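A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is a minimal sketch under stated assumptions, not the database's actual code; the function names, the fixed seed, and the use of the sample standard deviation as the error measure are all assumptions.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_per_mille(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return stdev(estimates)


# Hypothetical toy proteome in one-letter code, not the PsPV sequences.
toy_proteome = ["MAAGLLPP", "PPSTAAGL", "MSTLLAAK", "GGPPSSLL"]
aa_error = bootstrap_error(toy_proteome, "AA")
absent_error = bootstrap_error(toy_proteome, "WW")
```

A dipeptide that never occurs has a frequency of zero in every resample, so its estimated error is also zero, which is consistent with the `0.0 ± 0.0` entries in the table.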

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski