Amino acid dipepetide frequency for Phocoena phocoena papillomavirus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.825AlaAla: 4.825 ± 1.266
0.877AlaCys: 0.877 ± 0.477
3.509AlaAsp: 3.509 ± 1.648
1.316AlaGlu: 1.316 ± 0.692
2.632AlaPhe: 2.632 ± 0.945
2.632AlaGly: 2.632 ± 1.127
0.0AlaHis: 0.0 ± 0.0
2.632AlaIle: 2.632 ± 0.919
3.947AlaLys: 3.947 ± 1.519
3.947AlaLeu: 3.947 ± 0.875
0.877AlaMet: 0.877 ± 0.477
3.07AlaAsn: 3.07 ± 0.811
3.07AlaPro: 3.07 ± 0.891
3.509AlaGln: 3.509 ± 1.053
4.386AlaArg: 4.386 ± 1.162
3.947AlaSer: 3.947 ± 1.639
2.193AlaThr: 2.193 ± 0.669
4.386AlaVal: 4.386 ± 0.954
1.316AlaTrp: 1.316 ± 0.833
3.07AlaTyr: 3.07 ± 0.442
0.0AlaXaa: 0.0 ± 0.0
Cys
0.439CysAla: 0.439 ± 0.436
0.877CysCys: 0.877 ± 0.477
0.877CysAsp: 0.877 ± 0.871
1.754CysGlu: 1.754 ± 0.954
1.754CysPhe: 1.754 ± 1.072
1.316CysGly: 1.316 ± 0.74
0.0CysHis: 0.0 ± 0.0
1.754CysIle: 1.754 ± 0.926
3.509CysLys: 3.509 ± 0.488
0.439CysLeu: 0.439 ± 0.482
0.0CysMet: 0.0 ± 0.0
1.316CysAsn: 1.316 ± 0.935
3.07CysPro: 3.07 ± 0.834
0.0CysGln: 0.0 ± 0.0
0.439CysArg: 0.439 ± 0.375
1.316CysSer: 1.316 ± 0.539
3.07CysThr: 3.07 ± 1.231
1.754CysVal: 1.754 ± 0.943
1.316CysTrp: 1.316 ± 0.984
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.509AspAla: 3.509 ± 1.191
1.754AspCys: 1.754 ± 0.642
3.509AspAsp: 3.509 ± 2.033
4.386AspGlu: 4.386 ± 1.991
3.07AspPhe: 3.07 ± 1.309
4.825AspGly: 4.825 ± 1.791
1.316AspHis: 1.316 ± 0.833
3.509AspIle: 3.509 ± 1.8
3.07AspLys: 3.07 ± 0.893
5.263AspLeu: 5.263 ± 0.724
1.754AspMet: 1.754 ± 0.868
2.632AspAsn: 2.632 ± 0.958
5.263AspPro: 5.263 ± 1.723
0.877AspGln: 0.877 ± 0.771
1.316AspArg: 1.316 ± 0.74
5.702AspSer: 5.702 ± 1.702
5.263AspThr: 5.263 ± 1.474
3.07AspVal: 3.07 ± 0.486
1.316AspTrp: 1.316 ± 1.126
2.193AspTyr: 2.193 ± 1.137
0.0AspXaa: 0.0 ± 0.0
Glu
3.509GluAla: 3.509 ± 1.205
0.877GluCys: 0.877 ± 0.434
7.018GluAsp: 7.018 ± 1.596
4.825GluGlu: 4.825 ± 1.15
0.877GluPhe: 0.877 ± 0.416
3.947GluGly: 3.947 ± 1.492
0.877GluHis: 0.877 ± 0.507
1.316GluIle: 1.316 ± 0.659
2.193GluLys: 2.193 ± 1.103
3.509GluLeu: 3.509 ± 0.488
2.193GluMet: 2.193 ± 1.129
2.632GluAsn: 2.632 ± 1.391
2.193GluPro: 2.193 ± 0.936
2.632GluGln: 2.632 ± 1.164
2.193GluArg: 2.193 ± 1.046
3.07GluSer: 3.07 ± 1.32
3.07GluThr: 3.07 ± 0.528
1.754GluVal: 1.754 ± 0.656
0.439GluTrp: 0.439 ± 0.375
1.316GluTyr: 1.316 ± 0.781
0.0GluXaa: 0.0 ± 0.0
Phe
0.439PheAla: 0.439 ± 0.386
1.316PheCys: 1.316 ± 1.445
3.07PheAsp: 3.07 ± 0.746
0.877PheGlu: 0.877 ± 0.49
1.754PhePhe: 1.754 ± 0.656
3.509PheGly: 3.509 ± 0.977
0.877PheHis: 0.877 ± 0.638
0.877PheIle: 0.877 ± 0.546
2.632PheLys: 2.632 ± 0.505
3.509PheLeu: 3.509 ± 1.567
1.754PheMet: 1.754 ± 0.676
2.632PheAsn: 2.632 ± 0.958
1.754PhePro: 1.754 ± 0.839
2.632PheGln: 2.632 ± 0.978
1.316PheArg: 1.316 ± 0.863
2.193PheSer: 2.193 ± 0.71
3.07PheThr: 3.07 ± 1.027
2.193PheVal: 2.193 ± 1.032
1.316PheTrp: 1.316 ± 0.392
1.316PheTyr: 1.316 ± 0.781
0.0PheXaa: 0.0 ± 0.0
Gly
3.509GlyAla: 3.509 ± 1.097
1.316GlyCys: 1.316 ± 0.539
3.509GlyAsp: 3.509 ± 0.634
5.702GlyGlu: 5.702 ± 1.338
1.316GlyPhe: 1.316 ± 0.539
7.895GlyGly: 7.895 ± 2.976
2.193GlyHis: 2.193 ± 1.211
4.825GlyIle: 4.825 ± 0.953
2.193GlyLys: 2.193 ± 1.391
4.825GlyLeu: 4.825 ± 1.614
0.439GlyMet: 0.439 ± 0.436
4.386GlyAsn: 4.386 ± 0.963
6.579GlyPro: 6.579 ± 1.967
4.386GlyGln: 4.386 ± 1.135
2.632GlyArg: 2.632 ± 1.022
4.386GlySer: 4.386 ± 2.194
6.14GlyThr: 6.14 ± 2.122
6.14GlyVal: 6.14 ± 0.964
0.877GlyTrp: 0.877 ± 0.546
2.193GlyTyr: 2.193 ± 1.078
0.0GlyXaa: 0.0 ± 0.0
His
0.877HisAla: 0.877 ± 0.659
1.316HisCys: 1.316 ± 0.833
0.439HisAsp: 0.439 ± 0.387
0.0HisGlu: 0.0 ± 0.0
1.316HisPhe: 1.316 ± 0.392
1.316HisGly: 1.316 ± 0.539
0.0HisHis: 0.0 ± 0.0
1.316HisIle: 1.316 ± 0.943
1.754HisLys: 1.754 ± 0.857
1.316HisLeu: 1.316 ± 1.0
0.0HisMet: 0.0 ± 0.0
1.754HisAsn: 1.754 ± 0.724
2.632HisPro: 2.632 ± 0.981
0.0HisGln: 0.0 ± 0.0
0.877HisArg: 0.877 ± 0.49
0.439HisSer: 0.439 ± 0.375
2.193HisThr: 2.193 ± 1.241
0.877HisVal: 0.877 ± 0.434
0.439HisTrp: 0.439 ± 0.386
0.439HisTyr: 0.439 ± 0.482
0.0HisXaa: 0.0 ± 0.0
Ile
2.193IleAla: 2.193 ± 0.736
1.316IleCys: 1.316 ± 0.539
3.07IleAsp: 3.07 ± 0.963
3.509IleGlu: 3.509 ± 2.277
1.316IlePhe: 1.316 ± 0.659
4.825IleGly: 4.825 ± 2.132
1.316IleHis: 1.316 ± 1.161
2.193IleIle: 2.193 ± 0.813
1.316IleLys: 1.316 ± 0.44
1.754IleLeu: 1.754 ± 0.244
0.0IleMet: 0.0 ± 0.0
0.877IleAsn: 0.877 ± 0.481
3.07IlePro: 3.07 ± 2.084
1.316IleGln: 1.316 ± 1.126
3.947IleArg: 3.947 ± 1.465
4.386IleSer: 4.386 ± 1.421
3.947IleThr: 3.947 ± 0.789
2.632IleVal: 2.632 ± 0.737
0.877IleTrp: 0.877 ± 0.603
2.193IleTyr: 2.193 ± 1.234
0.0IleXaa: 0.0 ± 0.0
Lys
3.509LysAla: 3.509 ± 0.909
2.193LysCys: 2.193 ± 1.426
2.193LysAsp: 2.193 ± 0.799
2.193LysGlu: 2.193 ± 1.209
2.632LysPhe: 2.632 ± 1.831
3.509LysGly: 3.509 ± 1.359
0.877LysHis: 0.877 ± 0.546
1.754LysIle: 1.754 ± 0.844
4.825LysLys: 4.825 ± 1.391
3.07LysLeu: 3.07 ± 1.342
1.316LysMet: 1.316 ± 0.661
0.877LysAsn: 0.877 ± 0.546
1.316LysPro: 1.316 ± 0.662
3.07LysGln: 3.07 ± 0.975
6.14LysArg: 6.14 ± 1.128
3.07LysSer: 3.07 ± 1.099
3.947LysThr: 3.947 ± 1.756
5.263LysVal: 5.263 ± 1.882
0.0LysTrp: 0.0 ± 0.0
3.07LysTyr: 3.07 ± 0.795
0.0LysXaa: 0.0 ± 0.0
Leu
3.07LeuAla: 3.07 ± 1.687
2.193LeuCys: 2.193 ± 1.271
6.14LeuAsp: 6.14 ± 1.652
2.193LeuGlu: 2.193 ± 1.176
4.386LeuPhe: 4.386 ± 1.426
7.018LeuGly: 7.018 ± 1.48
3.07LeuHis: 3.07 ± 1.027
2.632LeuIle: 2.632 ± 0.737
5.263LeuLys: 5.263 ± 2.96
9.649LeuLeu: 9.649 ± 2.87
0.0LeuMet: 0.0 ± 0.348
1.316LeuAsn: 1.316 ± 0.588
4.825LeuPro: 4.825 ± 1.406
7.895LeuGln: 7.895 ± 1.123
3.07LeuArg: 3.07 ± 0.912
7.018LeuSer: 7.018 ± 1.737
7.018LeuThr: 7.018 ± 0.922
2.193LeuVal: 2.193 ± 0.359
0.877LeuTrp: 0.877 ± 0.659
2.632LeuTyr: 2.632 ± 0.831
0.0LeuXaa: 0.0 ± 0.0
Met
2.632MetAla: 2.632 ± 0.521
0.0MetCys: 0.0 ± 0.0
1.754MetAsp: 1.754 ± 0.676
0.439MetGlu: 0.439 ± 0.436
0.877MetPhe: 0.877 ± 0.771
0.877MetGly: 0.877 ± 0.751
0.0MetHis: 0.0 ± 0.0
1.754MetIle: 1.754 ± 0.868
0.877MetLys: 0.877 ± 0.477
0.439MetLeu: 0.439 ± 0.436
0.439MetMet: 0.439 ± 0.375
0.0MetAsn: 0.0 ± 0.0
0.877MetPro: 0.877 ± 0.477
1.316MetGln: 1.316 ± 0.401
0.0MetArg: 0.0 ± 0.0
1.316MetSer: 1.316 ± 0.49
0.439MetThr: 0.439 ± 0.387
1.754MetVal: 1.754 ± 0.642
0.439MetTrp: 0.439 ± 0.436
0.439MetTyr: 0.439 ± 0.436
0.0MetXaa: 0.0 ± 0.0
Asn
0.877AsnAla: 0.877 ± 0.434
0.877AsnCys: 0.877 ± 0.477
1.316AsnAsp: 1.316 ± 0.74
0.877AsnGlu: 0.877 ± 0.871
2.193AsnPhe: 2.193 ± 0.64
2.632AsnGly: 2.632 ± 1.326
0.877AsnHis: 0.877 ± 0.507
2.193AsnIle: 2.193 ± 0.682
1.754AsnLys: 1.754 ± 0.756
5.702AsnLeu: 5.702 ± 1.515
0.439AsnMet: 0.439 ± 0.387
2.193AsnAsn: 2.193 ± 0.752
3.509AsnPro: 3.509 ± 0.808
0.877AsnGln: 0.877 ± 0.507
1.316AsnArg: 1.316 ± 0.73
2.632AsnSer: 2.632 ± 0.737
2.632AsnThr: 2.632 ± 0.783
2.632AsnVal: 2.632 ± 0.934
1.316AsnTrp: 1.316 ± 0.539
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
4.825ProAla: 4.825 ± 1.321
2.632ProCys: 2.632 ± 1.608
5.263ProAsp: 5.263 ± 2.259
4.825ProGlu: 4.825 ± 0.884
0.877ProPhe: 0.877 ± 0.638
3.07ProGly: 3.07 ± 0.982
1.754ProHis: 1.754 ± 0.726
4.825ProIle: 4.825 ± 1.739
2.632ProLys: 2.632 ± 0.976
9.211ProLeu: 9.211 ± 1.283
0.0ProMet: 0.0 ± 0.0
0.877ProAsn: 0.877 ± 0.434
8.772ProPro: 8.772 ± 3.463
3.07ProGln: 3.07 ± 0.549
4.386ProArg: 4.386 ± 3.64
3.07ProSer: 3.07 ± 1.71
4.386ProThr: 4.386 ± 1.072
5.263ProVal: 5.263 ± 2.041
0.877ProTrp: 0.877 ± 0.871
2.632ProTyr: 2.632 ± 1.746
0.0ProXaa: 0.0 ± 0.0
Gln
2.632GlnAla: 2.632 ± 1.14
0.0GlnCys: 0.0 ± 0.0
1.754GlnAsp: 1.754 ± 0.656
2.193GlnGlu: 2.193 ± 0.833
1.754GlnPhe: 1.754 ± 0.53
4.825GlnGly: 4.825 ± 1.147
0.877GlnHis: 0.877 ± 0.852
1.316GlnIle: 1.316 ± 0.539
2.193GlnLys: 2.193 ± 0.506
2.632GlnLeu: 2.632 ± 0.663
1.316GlnMet: 1.316 ± 0.936
1.316GlnAsn: 1.316 ± 0.714
4.825GlnPro: 4.825 ± 1.28
2.193GlnGln: 2.193 ± 1.853
1.316GlnArg: 1.316 ± 0.588
4.386GlnSer: 4.386 ± 0.848
3.07GlnThr: 3.07 ± 1.252
2.632GlnVal: 2.632 ± 0.737
0.877GlnTrp: 0.877 ± 0.477
1.754GlnTyr: 1.754 ± 1.135
0.0GlnXaa: 0.0 ± 0.0
Arg
3.07ArgAla: 3.07 ± 0.891
2.632ArgCys: 2.632 ± 0.903
0.0ArgAsp: 0.0 ± 0.0
0.877ArgGlu: 0.877 ± 0.507
1.754ArgPhe: 1.754 ± 0.488
5.263ArgGly: 5.263 ± 1.422
2.193ArgHis: 2.193 ± 1.393
2.632ArgIle: 2.632 ± 1.138
3.947ArgLys: 3.947 ± 1.023
6.579ArgLeu: 6.579 ± 1.436
0.0ArgMet: 0.0 ± 0.0
1.316ArgAsn: 1.316 ± 0.74
4.386ArgPro: 4.386 ± 1.433
2.193ArgGln: 2.193 ± 1.084
3.947ArgArg: 3.947 ± 0.665
2.193ArgSer: 2.193 ± 1.153
3.947ArgThr: 3.947 ± 0.726
3.509ArgVal: 3.509 ± 1.036
0.877ArgTrp: 0.877 ± 0.546
1.754ArgTyr: 1.754 ± 0.829
0.0ArgXaa: 0.0 ± 0.0
Ser
4.386SerAla: 4.386 ± 1.534
0.439SerCys: 0.439 ± 0.386
5.702SerAsp: 5.702 ± 0.875
5.702SerGlu: 5.702 ± 1.981
2.632SerPhe: 2.632 ± 0.906
5.263SerGly: 5.263 ± 1.741
1.316SerHis: 1.316 ± 0.571
3.509SerIle: 3.509 ± 2.65
3.07SerLys: 3.07 ± 0.934
7.456SerLeu: 7.456 ± 0.912
1.316SerMet: 1.316 ± 0.392
3.07SerAsn: 3.07 ± 1.258
3.509SerPro: 3.509 ± 0.838
1.316SerGln: 1.316 ± 0.714
3.509SerArg: 3.509 ± 0.488
7.895SerSer: 7.895 ± 2.911
7.018SerThr: 7.018 ± 2.731
5.702SerVal: 5.702 ± 1.094
0.0SerTrp: 0.0 ± 0.0
1.754SerTyr: 1.754 ± 0.734
0.0SerXaa: 0.0 ± 0.0
Thr
4.386ThrAla: 4.386 ± 0.837
2.193ThrCys: 2.193 ± 1.153
5.263ThrAsp: 5.263 ± 1.525
3.509ThrGlu: 3.509 ± 1.466
4.386ThrPhe: 4.386 ± 1.153
6.579ThrGly: 6.579 ± 1.792
0.877ThrHis: 0.877 ± 0.774
2.193ThrIle: 2.193 ± 0.991
1.754ThrLys: 1.754 ± 1.27
3.509ThrLeu: 3.509 ± 1.455
2.193ThrMet: 2.193 ± 0.461
1.316ThrAsn: 1.316 ± 0.49
7.456ThrPro: 7.456 ± 0.852
4.386ThrGln: 4.386 ± 0.848
3.947ThrArg: 3.947 ± 1.59
7.018ThrSer: 7.018 ± 2.548
6.579ThrThr: 6.579 ± 1.709
7.456ThrVal: 7.456 ± 1.724
0.877ThrTrp: 0.877 ± 0.477
1.316ThrTyr: 1.316 ± 0.789
0.0ThrXaa: 0.0 ± 0.0
Val
4.825ValAla: 4.825 ± 1.106
1.316ValCys: 1.316 ± 0.44
6.14ValAsp: 6.14 ± 1.736
3.509ValGlu: 3.509 ± 0.449
1.754ValPhe: 1.754 ± 0.598
2.632ValGly: 2.632 ± 1.166
0.439ValHis: 0.439 ± 0.436
3.07ValIle: 3.07 ± 0.54
3.509ValLys: 3.509 ± 1.069
3.509ValLeu: 3.509 ± 0.43
1.754ValMet: 1.754 ± 0.688
2.193ValAsn: 2.193 ± 1.391
4.386ValPro: 4.386 ± 1.095
0.877ValGln: 0.877 ± 0.416
3.947ValArg: 3.947 ± 0.942
7.018ValSer: 7.018 ± 1.903
7.018ValThr: 7.018 ± 1.825
2.632ValVal: 2.632 ± 0.933
0.877ValTrp: 0.877 ± 0.571
2.632ValTyr: 2.632 ± 0.641
0.0ValXaa: 0.0 ± 0.0
Trp
1.316TrpAla: 1.316 ± 0.44
0.439TrpCys: 0.439 ± 0.375
1.316TrpAsp: 1.316 ± 0.588
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.316TrpGly: 1.316 ± 0.833
0.0TrpHis: 0.0 ± 0.0
0.439TrpIle: 0.439 ± 0.375
1.316TrpLys: 1.316 ± 0.804
2.193TrpLeu: 2.193 ± 0.758
0.0TrpMet: 0.0 ± 0.0
1.754TrpAsn: 1.754 ± 0.871
0.439TrpPro: 0.439 ± 0.482
1.316TrpGln: 1.316 ± 0.935
2.193TrpArg: 2.193 ± 1.384
0.877TrpSer: 0.877 ± 0.507
0.439TrpThr: 0.439 ± 0.436
0.439TrpVal: 0.439 ± 0.375
0.0TrpTrp: 0.0 ± 0.0
0.439TrpTyr: 0.439 ± 0.375
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.316TyrAla: 1.316 ± 0.804
0.439TyrCys: 0.439 ± 0.436
1.754TyrAsp: 1.754 ± 1.084
1.754TyrGlu: 1.754 ± 0.533
1.754TyrPhe: 1.754 ± 1.084
2.193TyrGly: 2.193 ± 0.4
0.439TyrHis: 0.439 ± 0.386
1.316TyrIle: 1.316 ± 0.789
3.07TyrLys: 3.07 ± 0.795
3.947TyrLeu: 3.947 ± 1.053
0.439TyrMet: 0.439 ± 0.436
1.316TyrAsn: 1.316 ± 0.789
1.316TyrPro: 1.316 ± 0.781
0.439TyrGln: 0.439 ± 0.386
2.193TyrArg: 2.193 ± 0.793
2.632TyrSer: 2.632 ± 1.141
1.754TyrThr: 1.754 ± 1.172
1.754TyrVal: 1.754 ± 0.656
1.316TyrTrp: 1.316 ± 0.44
0.877TyrTyr: 0.877 ± 0.49
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2281 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski